Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Rogers A, Antoshechkin I, Bieri T, Blasiar D, Bastiani C, Canaran P, Chan J, Chen WJ, Davis P, Fernandes J, Fiedler TJ, Han M, Harris TW, Kishore R, Lee R, McKay S, Müller HM, Nakamura C, Ozersky P, Petcherski A, Schindelman G, Schwarz EM, Spooner W, Tuli MA, Van Auken K, Wang D, Wang X, Williams G, Yook K, Durbin R, Stein LD, Spieth J, Sternberg PW. WormBase 2007. Nucleic Acids Res 2007;36:D612-7. [PMID: 17991679 PMCID: PMC2238927 DOI: 10.1093/nar/gkm975] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.8] [Indexed: 12/01/2022] Open

Cited by Other Article(s)

Dayi M. Diversity and evolution of transposable elements in the plant-parasitic nematodes. BMC Genomics 2024;25:511. [PMID: 38783171 PMCID: PMC11118728 DOI: 10.1186/s12864-024-10435-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/21/2023] [Accepted: 05/21/2024] [Indexed: 05/25/2024] Open

Abstract

BACKGROUND

Transposable elements (TEs) are mobile DNA sequences that propagate within genomes, occupying a significant portion of eukaryotic genomes and serving as a source of genetic variation and innovation. TEs can impact genome dynamics through their repetitive nature and mobility. Nematodes are incredibly versatile organisms, capable of thriving in a wide range of environments. The plant-parasitic nematodes are able to infect nearly all vascular plants, leading to significant crop losses and management expenses worldwide. It is worth noting that plant parasitism has evolved independently at least three times within this nematode group. Furthermore, the genome size of plant-parasitic nematodes can vary substantially, spanning from 41.5 Mbp to 235 Mbp. To investigate genome size variation and evolution in plant-parasitic nematodes, TE composition, diversity, and evolution were analysed in 26 plant-parasitic nematodes from 9 distinct genera in Clade IV.

RESULTS

Interestingly, despite certain species lacking specific types of DNA transposons or retrotransposon superfamilies, they still exhibit a diverse range of TE content. Identification of species-specific TE repertoire in nematode genomes provides a deeper understanding of genome evolution in plant-parasitic nematodes. An intriguing observation is that plant-parasitic nematodes possess extensive DNA transposon and retrotransposon insertions, including recent insertions of the LTR/Gypsy and LTR/Pao superfamilies. Among them, the Gypsy superfamilies were found to encode aspartic proteases in the plant-parasitic nematodes.

CONCLUSIONS

The study of the transposable element (TE) composition in plant-parasitic nematodes has yielded insightful discoveries. The findings revealed that certain species exhibit lineage-specific variations in their TE makeup. Discovering the species-specific TE repertoire in nematode genomes is a crucial element in understanding the evolution of genomes in plant-parasitic nematodes. It allows us to gain a deeper insight into the intricate workings of these organisms and their genetic makeup. With this knowledge, we are gaining a fundamental piece in the puzzle of understanding the evolution of these parasites. Moreover, recent transpositions have led to the acquisition of new TE superfamilies, especially Gypsy and Pao retrotransposons, further expanding the diversity of TEs in these nematodes. Significantly, the widely distributed Gypsy superfamily possesses proteases that are exclusively associated with parasitism during nematode-host interactions. These discoveries provide a deeper understanding of the TE landscape within plant-parasitic nematodes.

Sternberg PW, Van Auken K, Wang Q, Wright A, Yook K, Zarowiecki M, Arnaboldi V, Becerra A, Brown S, Cain S, Chan J, Chen WJ, Cho J, Davis P, Diamantakis S, Dyer S, Grigoriadis D, Grove CA, Harris T, Howe K, Kishore R, Lee R, Longden I, Luypaert M, Müller HM, Nuin P, Quinton-Tulloch M, Raciti D, Schedl T, Schindelman G, Stein L. WormBase 2024: status and transitioning to Alliance infrastructure. Genetics 2024;227:iyae050. [PMID: 38573366 PMCID: PMC11075546 DOI: 10.1093/genetics/iyae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2023] [Revised: 03/19/2024] [Accepted: 03/20/2024] [Indexed: 04/05/2024] Open

Affiliation(s)

Paul W Sternberg: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Kimberly Van Auken: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Qinghua Wang: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Adam Wright: Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
Karen Yook: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Magdalena Zarowiecki: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Valerio Arnaboldi: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Andrés Becerra: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Stephanie Brown: School of Infection and Immunity, University of Glasgow, Glasgow G12 8TA, UK
Scott Cain: Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
Juancarlos Chan: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Wen J Chen: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Jaehyoung Cho: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Paul Davis: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Stavros Diamantakis: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Sarah Dyer: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Dionysis Grigoriadis: School of Infection and Immunity, University of Glasgow, Glasgow G12 8TA, UK
Christian A Grove: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Todd Harris: Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
Kevin Howe: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Ranjana Kishore: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Raymond Lee: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Ian Longden: Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
Manuel Luypaert: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Hans-Michael Müller: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Paulo Nuin: Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
Mark Quinton-Tulloch: European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
Daniela Raciti: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Tim Schedl: Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
Gary Schindelman: Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
Lincoln Stein: Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada

Lashari A, Kazi TG, Afridi HI, Baig JA, Arain MB, Lashari AA. Evaluate the Work-Related Exposure of Vanadium on Scalp Hair Samples of Outdoor and Administrative Workers of Oil Drilling Field: Related Health Risks. Biol Trace Elem Res 2024:10.1007/s12011-024-04101-y. [PMID: 38376729 DOI: 10.1007/s12011-024-04101-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2023] [Accepted: 02/08/2024] [Indexed: 02/21/2024]

Dayi M. Evolution of parasitism genes in the plant parasitic nematodes. Sci Rep 2024;14:3733. [PMID: 38355886 PMCID: PMC10866927 DOI: 10.1038/s41598-024-54330-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2023] [Accepted: 02/11/2024] [Indexed: 02/16/2024] Open

Abstract

The plant-parasitic nematodes are considered one of the most destructive pests, among which the migratory and sedentary endoparasitic plant parasitic nematodes infect more than 4000 plant species and cause over $100 billion in crop losses annually worldwide. These nematodes use multiple strategies to infect their host and to establish a successful parasitism inside the host such as cell-wall degradation enzymes, inhibition of host defense proteins, and molecular mimicry. In the present study, the main parasitism-associated gene families were identified and compared between the migratory and sedentary endoparasitic nematodes. The results showed that the migratory and sedentary endoparasitic nematodes share a core conserved parasitism mechanism established throughout the evolution of parasitism. However, genes involved in pectin degradation and hydrolase activity are rapidly evolving in the migratory endoparasitic nematodes. Additionally, cell-wall degrading enzymes such as GH45 cellulases and pectate lyase and peptidase and peptidase inhibitors were expanded in the migratory endoparasitic nematodes. The molecular mimicry mechanism was another key finding that differs between the endoparasitic and sedentary parasitic nematodes. The PL22 gene family, which is believed to play a significant role in the molecular mechanisms of nematode parasitism, has been found to be present exclusively in migratory endoparasitic nematodes. Phylogenetic analysis has suggested that it was de novo born in these nematodes. This discovery sheds new light on the molecular evolution of these parasites and has significant implications for our understanding of their biology and pathogenicity. This study contributes to our understanding of core parasitism mechanisms conserved throughout the nematodes and provides unique clues on the evolution of parasitism and the direction shaped by the host.

Tanaka SE, Dayi M, Maeda Y, Tsai IJ, Tanaka R, Bligh M, Takeuchi-Kaneko Y, Fukuda K, Kanzaki N, Kikuchi T. Stage-specific transcriptome of Bursaphelenchus xylophilus reveals temporal regulation of effector genes and roles of the dauer-like stages in the lifecycle. Sci Rep 2019;9:6080. [PMID: 30988401 PMCID: PMC6465311 DOI: 10.1038/s41598-019-42570-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Received: 07/17/2018] [Accepted: 04/01/2019] [Indexed: 12/24/2022] Open

Breland A, Ha SE, Jorgensen BG, Jin B, Gardner TA, Sanders KM, Ro S. Smooth Muscle Transcriptome Browser: offering genome-wide references and expression profiles of transcripts expressed in intestinal SMC, ICC, and PDGFRα⁺ cells. Sci Rep 2019;9:387. [PMID: 30674925 PMCID: PMC6344548 DOI: 10.1038/s41598-018-36607-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 12/29/2017] [Accepted: 11/26/2018] [Indexed: 01/02/2023] Open

Dubaj Price M, Hurd DD. WormBase: A Model Organism Database. Med Ref Serv Q 2019;38:70-80. [PMID: 30942676 DOI: 10.1080/02763869.2019.1548896] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Indexed: 06/09/2023]

Frommolt P, Schumacher B. Wormpath: searching for molecular interaction networks in Caenorhabditis elegans. Source Code Biol Med 2015;10:5. [PMID: 25866556 PMCID: PMC4392734 DOI: 10.1186/s13029-015-0034-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/08/2015] [Accepted: 03/12/2015] [Indexed: 11/10/2022]

Glauser DA. The multiplicity of alternative splicing decisions in Caenorhabditis elegans is linked to specific intronic regulatory motifs and minisatellites. BMC Genomics 2014;15:364. [PMID: 24884695 PMCID: PMC4039745 DOI: 10.1186/1471-2164-15-364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/04/2013] [Accepted: 04/15/2014] [Indexed: 11/28/2022] Open

Abstract

Background

Alternative splicing diversifies the pool of messenger RNA molecules encoded by individual genes. This diversity is particularly high when multiple splicing decisions cause a combinatorial arrangement of several alternate exons. We know very little about how the multiple decisions occurring during the maturation of single transcripts are coordinated and whether specific sequence elements might be involved.

Results

Here, the Caenorhabditis elegans genome was surveyed in order to identify sequence elements that might play a specific role in the regulation of multiple splicing decisions. The introns flanking alternate exons in transcripts whose maturation involves multiple alternative splicing decisions were compared to those whose maturation involves a single decision. Fifty-eight penta-, hexa-, and hepta-meric elements, clustered in 17 groups, were significantly over-represented in genes subject to multiple alternative splicing decisions. Most of these motifs relate to known splicing regulatory elements and appear to be well conserved in the related species Caenorhabditis briggsae. The usage of specific motifs is not linked to the gene product function, but rather depends on the gene structure, since it is influenced by the distance separating the multiple splicing decision sites. Two of these motifs are part of the CeRep25B minisatellite, which is also over-represented at the vicinity of alternative splicing regions. Most of the remaining motifs are not part of repeated sequence elements, but tend to occur in specific heterologous pairs in genes subject to multiple alternative splicing decisions.

Conclusions

The existence of specific intronic sequence elements linked to multiple alternative splicing decisions is intriguing and suggests that these elements might have some specialized regulatory role during splicing.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2164-15-364) contains supplementary material, which is available to authorized users.

Wang J, Chen D, Lei Y, Chang JW, Hao BH, Xing F, Li S, Xu Q, Deng XX, Chen LL. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome. PLoS One 2014;9:e87723. [PMID: 24489955 PMCID: PMC3905029 DOI: 10.1371/journal.pone.0087723] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Received: 11/19/2013] [Accepted: 12/29/2013] [Indexed: 01/31/2023] Open

Pálfy M, Farkas IJ, Vellai T, Korcsmáros T. Uniform curation protocol of metazoan signaling pathways to predict novel signaling components. Methods Mol Biol 2013;1021:285-297. [PMID: 23715991 DOI: 10.1007/978-1-62703-450-0_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/02/2023]

Tacutu R, Shore DE, Budovsky A, de Magalhães JP, Ruvkun G, Fraifeld VE, Curran SP. Prediction of C. elegans longevity genes by human and worm longevity networks. PLoS One 2012;7:e48282. [PMID: 23144747 PMCID: PMC3483217 DOI: 10.1371/journal.pone.0048282] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Received: 06/07/2012] [Accepted: 09/21/2012] [Indexed: 11/18/2022] Open

Hutter H. Fluorescent protein methods: strategies and applications. Methods Cell Biol 2012;107:67-92. [PMID: 22226521 DOI: 10.1016/b978-0-12-394620-1.00003-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Indexed: 02/01/2023]

Bolser DM, Chibon PY, Palopoli N, Gong S, Jacob D, Del Angel VD, Swan D, Bassi S, González V, Suravajhala P, Hwang S, Romano P, Edwards R, Bishop B, Eargle J, Shtatland T, Provart NJ, Clements D, Renfro DP, Bhak D, Bhak J. MetaBase--the wiki-database of biological databases. Nucleic Acids Res 2011;40:D1250-4. [PMID: 22139927 PMCID: PMC3245051 DOI: 10.1093/nar/gkr1099] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Indexed: 01/19/2023] Open

Phenotype mining for functional genomics and gene discovery. Methods Mol Biol 2011;760:159-73. [PMID: 21779996 DOI: 10.1007/978-1-61779-176-5_10] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 01/29/2023]

Walaas SI, Hemmings HC, Greengard P, Nairn AC. Beyond the dopamine receptor: regulation and roles of serine/threonine protein phosphatases. Front Neuroanat 2011;5:50. [PMID: 21904525 PMCID: PMC3162284 DOI: 10.3389/fnana.2011.00050] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Received: 05/15/2011] [Accepted: 07/23/2011] [Indexed: 11/17/2022] Open

Desalermos A, Muhammed M, Glavis-Bloom J, Mylonakis E. Using C. elegans for antimicrobial drug discovery. Expert Opin Drug Discov 2011;6:645-652. [PMID: 21686092 DOI: 10.1517/17460441.2011.573781] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Indexed: 11/05/2022]

Wang Y, Chen J, Wei G, He H, Zhu X, Xiao T, Yuan J, Dong B, He S, Skogerbø G, Chen R. The Caenorhabditis elegans intermediate-size transcriptome shows high degree of stage-specific expression. Nucleic Acids Res 2011;39:5203-14. [PMID: 21378118 PMCID: PMC3130273 DOI: 10.1093/nar/gkr102] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Indexed: 01/14/2023] Open

Spencer WC, Zeller G, Watson JD, Henz SR, Watkins KL, McWhirter RD, Petersen S, Sreedharan VT, Widmer C, Jo J, Reinke V, Petrella L, Strome S, Von Stetina SE, Katz M, Shaham S, Rätsch G, Miller DM. A spatial and temporal map of C. elegans gene expression. Genome Res 2011;21:325-41. [PMID: 21177967 PMCID: PMC3032935 DOI: 10.1101/gr.114595.110] [Citation(s) in RCA: 211] [Impact Index Per Article: 16.2] [Received: 09/03/2010] [Accepted: 12/08/2010] [Indexed: 01/31/2023]

Affiliation(s)

W. Clay Spencer: Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
Georg Zeller: Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany; Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
Joseph D. Watson: Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
Stefan R. Henz: Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
Kathie L. Watkins: Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
Rebecca D. McWhirter: Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
Sarah Petersen: Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
Vipin T. Sreedharan: Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
Christian Widmer: Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
Jeanyoung Jo: Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520, USA
Valerie Reinke: Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520, USA
Lisa Petrella: Department of MCD Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
Susan Strome: Department of MCD Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
Stephen E. Von Stetina: Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
Menachem Katz: Laboratory of Developmental Genetics, The Rockefeller University, New York, New York 10065, USA
Shai Shaham: Laboratory of Developmental Genetics, The Rockefeller University, New York, New York 10065, USA
Gunnar Rätsch: Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
David M. Miller: Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA

Wang Z, Sherwood DR. Dissection of genetic pathways in C. elegans. Methods Cell Biol 2011;106:113-57. [PMID: 22118276 PMCID: PMC4116751 DOI: 10.1016/b978-0-12-544172-8.00005-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Indexed: 12/17/2022]

Jensen VL, Simonsen KT, Lee YH, Park D, Riddle DL. RNAi screen of DAF-16/FOXO target genes in C. elegans links pathogenesis and dauer formation. PLoS One 2010;5:e15902. [PMID: 21209831 PMCID: PMC3013133 DOI: 10.1371/journal.pone.0015902] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Received: 09/08/2010] [Accepted: 11/30/2010] [Indexed: 11/19/2022] Open

Jensen VL, Bialas NJ, Bishop-Hurley SL, Molday LL, Kida K, Nguyen PAT, Blacque OE, Molday RS, Leroux MR, Riddle DL. Localization of a guanylyl cyclase to chemosensory cilia requires the novel ciliary MYND domain protein DAF-25. PLoS Genet 2010;6:e1001199. [PMID: 21124868 PMCID: PMC2991253 DOI: 10.1371/journal.pgen.1001199] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Received: 03/09/2010] [Accepted: 10/07/2010] [Indexed: 11/19/2022] Open

Abstract

In harsh conditions, Caenorhabditis elegans arrests development to enter a non-aging, resistant diapause state called the dauer larva. Olfactory sensation modulates the TGF-β and insulin signaling pathways to control this developmental decision. Four mutant alleles of daf-25 (abnormal DAuer Formation) were isolated from screens for mutants exhibiting constitutive dauer formation and found to be defective in olfaction. The daf-25 dauer phenotype is suppressed by daf-10/IFT122 mutations (which disrupt ciliogenesis), but not by daf-6/PTCHD3 mutations (which prevent environmental exposure of sensory cilia), implying that DAF-25 functions in the cilia themselves. daf-25 encodes the C. elegans ortholog of mammalian Ankmy2, a MYND domain protein of unknown function. Disruption of DAF-25, which localizes to sensory cilia, produces no apparent cilia structure anomalies, as determined by light and electron microscopy. Hinting at its potential function, the dauer phenotype, epistatic order, and expression profile of daf-25 are similar to daf-11, which encodes a cilium-localized guanylyl cyclase. Indeed, we demonstrate that DAF-25 is required for proper DAF-11 ciliary localization. Furthermore, the functional interaction is evolutionarily conserved, as mouse Ankmy2 interacts with guanylyl cyclase GC1 from ciliary photoreceptors. The interaction may be specific because daf-25 mutants have normally-localized OSM-9/TRPV4, TAX-4/CNGA1, CHE-2/IFT80, CHE-11/IFT140, CHE-13/IFT57, BBS-8, OSM-5/IFT88, and XBX-1/D2LIC in the cilia. Intraflagellar transport (IFT) (required to build cilia) is not defective in daf-25 mutants, although the ciliary localization of DAF-25 itself is influenced in che-11 mutants, which are defective in retrograde IFT. In summary, we have discovered a novel ciliary protein that plays an important role in cGMP signaling by localizing a guanylyl cyclase to the sensory organelle.

Jan CH, Friedman RC, Ruby JG, Bartel DP. Formation, regulation and evolution of Caenorhabditis elegans 3'UTRs. Nature 2010;469:97-101. [PMID: 21085120 PMCID: PMC3057491 DOI: 10.1038/nature09616] [Citation(s) in RCA: 367] [Impact Index Per Article: 26.2] [Received: 09/20/2010] [Accepted: 10/29/2010] [Indexed: 11/24/2022]

Abstract

Posttranscriptional gene regulation frequently occurs through elements in mRNA 3′ untranslated regions (UTRs) [1,2]. Although crucial roles for 3′UTR-mediated gene regulation have been found in Caenorhabditis elegans [3,4,5], most C. elegans genes have lacked annotated 3′UTRs [6,7]. Here we describe a high-throughput method to reliably identify polyadenylated RNA termini, and we apply this method, called poly(A)-position profiling by sequencing (3P-Seq), to determine C. elegans 3′UTRs. Compared to standard methods also recently applied to C. elegans UTRs [8], 3P-Seq identified 8,581 additional UTRs while excluding thousands of shorter UTR isoforms that do not appear to be authentic. Analysis of this expanded and corrected dataset suggested that the high A/U content of C. elegans 3′UTRs facilitated genome compaction, since the elements specifying cleavage and polyadenylation, which are A/U-rich, can more readily emerge in A/U rich regions. Indeed, 30% of the protein-coding genes have mRNAs with alternative, partially overlapping end regions that generate another 10,498 cleavage and polyadenylation sites that had gone largely unnoticed and represent potential evolutionary intermediates of progressive UTR shortening. Moreover, a third of the convergently transcribed genes utilize palindromic arrangements of bidirectional elements to specify UTRs with convergent overlap, which also contributes to genome compaction by eliminating regions between genes. Although nematode 3′UTRs have median length only one-sixth that of mammalian 3′UTRs, they have twice the density of conserved microRNA sites, in part because additional types of seed-complementary sites are preferentially conserved. These findings reveal the influence of cleavage and polyadenylation on the evolution of genome architecture and provide resources for studying posttranscriptional gene regulation.

Rao W, Isaac RE, Keen JN. An analysis of the Caenorhabditis elegans lipid raft proteome using geLC-MS/MS. J Proteomics 2010;74:242-53. [PMID: 21070894 DOI: 10.1016/j.jprot.2010.11.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2010] [Revised: 10/20/2010] [Accepted: 11/02/2010] [Indexed: 11/16/2022]

Denver DR, Howe DK, Wilhelm LJ, Palmer CA, Anderson JL, Stein KC, Phillips PC, Estes S. Selective sweeps and parallel mutation in the adaptive recovery from deleterious mutation in Caenorhabditis elegans. Genome Res 2010;20:1663-71. [PMID: 21036923 DOI: 10.1101/gr.108191.110] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Sreedharan S, Stephansson O, Schiöth HB, Fredriksson R. Long evolutionary conservation and considerable tissue specificity of several atypical solute carrier transporters. Gene 2010;478:11-8. [PMID: 21044875 DOI: 10.1016/j.gene.2010.10.011] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2010] [Revised: 10/12/2010] [Accepted: 10/20/2010] [Indexed: 10/18/2022]

Davis MJ, Sehgal MSB, Ragan MA. Automatic, context-specific generation of Gene Ontology slims. BMC Bioinformatics 2010;11:498. [PMID: 20929524 PMCID: PMC3098080 DOI: 10.1186/1471-2105-11-498] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2010] [Accepted: 10/07/2010] [Indexed: 11/10/2022] Open

Sen TZ, Harper LC, Schaeffer ML, Andorf CM, Seigfried TE, Campbell DA, Lawrence CJ. Choosing a genome browser for a Model Organism Database: surveying the maize community. Database (Oxford) 2010;2010:baq007. [PMID: 20627860 PMCID: PMC2911842 DOI: 10.1093/database/baq007] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2009] [Revised: 03/08/2010] [Accepted: 03/09/2010] [Indexed: 11/12/2022]

Nematode parasite genes: what's in a name? Trends Parasitol 2010;26:334-40. [DOI: 10.1016/j.pt.2010.04.003] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Received: 11/23/2009] [Revised: 04/08/2010] [Accepted: 04/09/2010] [Indexed: 11/23/2022]

Korcsmáros T, Farkas IJ, Szalay MS, Rovó P, Fazekas D, Spiró Z, Böde C, Lenti K, Vellai T, Csermely P. Uniformly curated signaling pathways reveal tissue-specific cross-talks and support drug target discovery. Bioinformatics 2010;26:2042-50. [PMID: 20542890 DOI: 10.1093/bioinformatics/btq310] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Indexed: 01/09/2023]

Haegeman A, Elsen A, De Waele D, Gheysen G. Emerging molecular knowledge on Radopholus similis, an important nematode pest of banana. Mol Plant Pathol 2010;11:315-23. [PMID: 20447280 PMCID: PMC6640332 DOI: 10.1111/j.1364-3703.2010.00614.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Indexed: 05/05/2023]

The NetAge database: a compendium of networks for longevity, age-related diseases and associated processes. Biogerontology 2010;11:513-22. [DOI: 10.1007/s10522-010-9265-8] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Received: 01/18/2010] [Accepted: 02/09/2010] [Indexed: 12/11/2022]

Mi H, Dong Q, Muruganujan A, Gaudet P, Lewis S, Thomas PD. PANTHER version 7: improved phylogenetic trees, orthologs and collaboration with the Gene Ontology Consortium. Nucleic Acids Res 2010;38:D204-10. [PMID: 20015972 PMCID: PMC2808919 DOI: 10.1093/nar/gkp1019] [Citation(s) in RCA: 453] [Impact Index Per Article: 32.4] [Received: 09/15/2009] [Revised: 10/16/2009] [Accepted: 10/19/2009] [Indexed: 11/12/2022]

Mello LV, O'Meara H, Rigden DJ, Paterson S. Identification of novel aspartic proteases from Strongyloides ratti and characterisation of their evolutionary relationships, stage-specific expression and molecular structure. BMC Genomics 2009;10:611. [PMID: 20015380 PMCID: PMC2805697 DOI: 10.1186/1471-2164-10-611] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Received: 04/27/2009] [Accepted: 12/16/2009] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Aspartic proteases are known to play an important role in the biology of nematode parasitism. This role is best characterised in blood-feeding nematodes, where they digest haemoglobin, but they are also likely to play important roles in the biology of nematode parasites that do not feed on blood. In the present work, we investigate the evolution and expression of aspartic proteases in Strongyloides ratti, which permits a unique comparison between parasitic and free-living adult forms within its life-cycle.

RESULTS

We identified eight transcribed aspartic protease sequences and a further two genomic sequences and compared these to homologues in Caenorhabditis elegans and other nematode species. Phylogenetic analysis demonstrated a complex pattern of gene evolution, such that some S. ratti sequences had a one-to-one correspondence with orthologues of C. elegans but that lineage-specific expansions have occurred for other aspartic proteases in these two nematodes. These gene duplication events may have contributed to the adaptation of the two species to their different lifestyles. Among the set of S. ratti aspartic proteases were two closely-related isoforms that showed differential expression during different life stages: ASP-2A is highly expressed in parasitic females while ASP-2B is predominantly found in free-living adults. Molecular modelling of the ASP-2 isoforms reveals that their substrate specificities are likely to be very similar, but that ASP-2B is more electrostatically negative over its entire molecular surface than ASP-2A. This characteristic may be related to different pH values of the environments in which these two isoforms operate.

CONCLUSIONS

We have demonstrated that S. ratti provides a powerful model to explore the genetic adaptations associated with parasitic versus free-living life-styles. We have discovered gene duplication of aspartic protease genes in Strongyloides and identified a pair of paralogues differentially expressed in either the parasitic or the free-living phase of the nematode life-cycle, consistent with an adaptive role for aspartic proteases in the evolution of nematode parasitism.

Yamazaki Y, Akashi R, Banno Y, Endo T, Ezura H, Fukami-Kobayashi K, Inaba K, Isa T, Kamei K, Kasai F, Kobayashi M, Kurata N, Kusaba M, Matuzawa T, Mitani S, Nakamura T, Nakamura Y, Nakatsuji N, Naruse K, Niki H, Nitasaka E, Obata Y, Okamoto H, Okuma M, Sato K, Serikawa T, Shiroishi T, Sugawara H, Urushibara H, Yamamoto M, Yaoita Y, Yoshiki A, Kohara Y. NBRP databases: databases of biological resources in Japan. Nucleic Acids Res 2009;38:D26-32. [PMID: 19934255 PMCID: PMC2808968 DOI: 10.1093/nar/gkp996] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Indexed: 11/14/2022]

Gouw JW, Krijgsveld J, Heck AJR. Quantitative proteomics by metabolic labeling of model organisms. Mol Cell Proteomics 2009;9:11-24. [PMID: 19955089 DOI: 10.1074/mcp.r900001-mcp200] [Citation(s) in RCA: 119] [Impact Index Per Article: 7.9] [Indexed: 12/21/2022]

Hutter H, Ng MP, Chen N. GExplore: a web server for integrated queries of protein domains, gene expression and mutant phenotypes. BMC Genomics 2009;10:529. [PMID: 19917126 PMCID: PMC2779824 DOI: 10.1186/1471-2164-10-529] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Received: 05/14/2009] [Accepted: 11/16/2009] [Indexed: 01/17/2023]

Brindley PJ, Mitreva M, Ghedin E, Lustigman S. Helminth genomics: The implications for human health. PLoS Negl Trop Dis 2009;3:e538. [PMID: 19855829 PMCID: PMC2757907 DOI: 10.1371/journal.pntd.0000538] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.9] [Indexed: 12/14/2022]

Duan J, Li R, Cheng D, Fan W, Zha X, Cheng T, Wu Y, Wang J, Mita K, Xiang Z, Xia Q. SilkDB v2.0: a platform for silkworm (Bombyx mori) genome biology. Nucleic Acids Res 2009;38:D453-6. [PMID: 19793867 PMCID: PMC2808975 DOI: 10.1093/nar/gkp801] [Citation(s) in RCA: 209] [Impact Index Per Article: 13.9] [Indexed: 12/02/2022]

High resolution transcriptome maps for wild-type and nonsense-mediated decay-defective Caenorhabditis elegans. Genome Biol 2009;10:R101. [PMID: 19778439 PMCID: PMC2768976 DOI: 10.1186/gb-2009-10-9-r101] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.9] [Received: 06/05/2009] [Revised: 08/11/2009] [Accepted: 09/24/2009] [Indexed: 11/12/2022]

Abstract

The high-resolution transcriptome of wild-type and nonsense-mediated decay (NMD) defective C. elegans during development reveals insights into the NMD pathway and its role in development.

Background

While many genome sequences are complete, transcriptomes are less well characterized. We used both genome-scale tiling arrays and massively parallel sequencing to map the Caenorhabditis elegans transcriptome across development. We utilized this framework to identify transcriptome changes in animals lacking the nonsense-mediated decay (NMD) pathway.

Results

We find that while the majority of detectable transcripts map to known gene structures, >5% of transcribed regions fall outside current gene annotations. We show that >40% of these are novel exons. Using both technologies to assess isoform complexity, we estimate that >17% of genes change isoform across development. Next we examined how the transcriptome is perturbed in animals lacking NMD. NMD prevents expression of truncated proteins by degrading transcripts containing premature termination codons. We find that approximately 20% of genes produce transcripts that appear to be NMD targets. While most of these arise from splicing errors, NMD targets are enriched for transcripts containing open reading frames upstream of the predicted translational start (uORFs). We identify a relationship between the Kozak consensus surrounding the true start codon and the degree to which uORF-containing transcripts are targeted by NMD and speculate that translational efficiency may be coupled to transcript turnover via the NMD pathway for some transcripts.

Conclusions

We generated a high-resolution transcriptome map for C. elegans and used it to identify endogenous targets of NMD. We find that these transcripts arise principally through splicing errors, strengthening the prevailing view that splicing and NMD are highly interlinked processes.

FGFRL1 is a neglected putative actor of the FGF signalling pathway present in all major metazoan phyla. BMC Evol Biol 2009;9:226. [PMID: 19740411 PMCID: PMC2754479 DOI: 10.1186/1471-2148-9-226] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Received: 03/02/2009] [Accepted: 09/09/2009] [Indexed: 12/16/2022]

Two distinct roles for EGL-9 in the regulation of HIF-1-mediated gene expression in Caenorhabditis elegans. Genetics 2009;183:821-9. [PMID: 19737748 DOI: 10.1534/genetics.109.107284] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Indexed: 01/09/2023]

Vizcaíno JA, Côté R, Reisinger F, Foster JM, Mueller M, Rameseder J, Hermjakob H, Martens L. A guide to the Proteomics Identifications Database proteomics data repository. Proteomics 2009;9:4276-83. [PMID: 19662629 PMCID: PMC2970915 DOI: 10.1002/pmic.200900402] [Citation(s) in RCA: 207] [Impact Index Per Article: 13.8] [Received: 06/09/2009] [Revised: 06/24/2009] [Accepted: 06/25/2009] [Indexed: 01/02/2023]

Berriz GF, Beaver JE, Cenik C, Tasan M, Roth FP. Next generation software for functional trend analysis. Bioinformatics 2009;25:3043-4. [PMID: 19717575 DOI: 10.1093/bioinformatics/btp498] [Citation(s) in RCA: 200] [Impact Index Per Article: 13.3] [Indexed: 11/14/2022]

Gilchrist MJ, Christensen MB, Bronchain O, Brunet F, Chesneau A, Fenger U, Geach TJ, Ironfield HV, Kaya F, Kricha S, Lea R, Massé K, Néant I, Paillard E, Parain K, Perron M, Sinzelle L, Souopgui J, Thuret R, Ymlahi-Ouazzani Q, Pollet N. Database of queryable gene expression patterns for Xenopus. Dev Dyn 2009;238:1379-88. [PMID: 19347954 DOI: 10.1002/dvdy.21940] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Indexed: 11/08/2022]

Saito TL, Yoshimura J, Sasaki S, Ahsan B, Sasaki A, Kuroshu R, Morishita S. UTGB toolkit for personalized genome browsers. Bioinformatics 2009;25:1856-61. [PMID: 19497937 PMCID: PMC2712345 DOI: 10.1093/bioinformatics/btp350] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Received: 03/24/2009] [Revised: 05/27/2009] [Accepted: 05/30/2009] [Indexed: 11/12/2022]

Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc 2009;4:1184-91. [PMID: 19617889 PMCID: PMC3159387 DOI: 10.1038/nprot.2009.97] [Citation(s) in RCA: 2296] [Impact Index Per Article: 153.1] [Indexed: 12/12/2022]

Van Auken K, Jaffery J, Chan J, Müller HM, Sternberg PW. Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation. BMC Bioinformatics 2009;10:228. [PMID: 19622167 PMCID: PMC2719631 DOI: 10.1186/1471-2105-10-228] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Received: 01/29/2009] [Accepted: 07/21/2009] [Indexed: 11/28/2022]

Abstract

Background

Manual curation of experimental data from the biomedical literature is an expensive and time-consuming endeavor. Nevertheless, most biological knowledge bases still rely heavily on manual curation for data extraction and entry. Text mining software that can semi- or fully automate information retrieval from the literature would thus provide a significant boost to manual curation efforts.

Results

We employ the Textpresso category-based information retrieval and extraction system, developed by WormBase, to explore how Textpresso might improve the efficiency with which we manually curate C. elegans proteins to the Gene Ontology's Cellular Component Ontology. Using a training set of sentences that describe results of localization experiments in the published literature, we generated three new curation task-specific categories (Cellular Components, Assay Terms, and Verbs) containing words and phrases associated with reports of experimentally determined subcellular localization. We compared the results of manual curation to those of Textpresso queries that searched the full text of articles for sentences containing terms from each of the three new categories plus the name of a previously uncurated C. elegans protein, and found that Textpresso searches identified curatable papers with recall and precision rates of 79.1% and 61.8%, respectively (F-score of 69.5%), when compared to manual curation. Within those documents, Textpresso identified relevant sentences with recall and precision rates of 30.3% and 80.1% (F-score of 44.0%). From returned sentences, curators were able to make 66.2% of all possible experimentally supported GO Cellular Component annotations with 97.3% precision (F-score of 78.8%). Measuring the relative efficiencies of Textpresso-based versus manual curation, we find that Textpresso has the potential to increase curation efficiency by at least 8-fold, and perhaps as much as 15-fold, given differences in individual curatorial speed.
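As a reading aid for the figures above, the following is a minimal sketch of how an F-score relates to recall and precision. It assumes the reported F-scores are the balanced F-score (the harmonic mean of recall and precision), which the abstract does not state explicitly; the function name `f_score` and the task labels are illustrative only. Because the inputs are the rounded percentages from the abstract, the computed values may differ from the reported ones in the last digit.

```python
def f_score(recall: float, precision: float) -> float:
    """Balanced F-score: harmonic mean of recall and precision (assumed definition)."""
    if recall + precision == 0:
        return 0.0
    return 2 * recall * precision / (recall + precision)


# (recall, precision, reported F-score), all as fractions, taken from the abstract.
reported = {
    "curatable papers": (0.791, 0.618, 0.695),
    "relevant sentences": (0.303, 0.801, 0.440),
    "GO annotations": (0.662, 0.973, 0.788),
}

for task, (recall, precision, f_reported) in reported.items():
    f_computed = f_score(recall, precision)
    print(f"{task}: computed {f_computed:.3f}, reported {f_reported:.3f}")
```

Under that assumption, the computed values agree with the reported F-scores to within about 0.2 percentage points.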

Conclusion

Textpresso is an effective tool for improving the efficiency of manual, experimentally based curation. Incorporating a Textpresso-based Cellular Component curation pipeline at WormBase has allowed us to transition from strictly manual curation of this data type to a more efficient pipeline of computer-assisted validation. Continued development of curation task-specific Textpresso categories will provide an invaluable resource for genomics databases that rely heavily on manual curation.

Vermeirssen V, Joshi A, Michoel T, Bonnet E, Casneuf T, Van de Peer Y. Transcription regulatory networks in Caenorhabditis elegans inferred through reverse-engineering of gene expression profiles constitute biological hypotheses for metazoan development. Mol Biosyst 2009;5:1817-30. [PMID: 19763340 DOI: 10.1039/b908108a] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Indexed: 12/21/2022]

Fredman D, Engstrom PG, Lenhard B. Web-based tools and approaches to study long-range gene regulation in Metazoa. Brief Funct Genomic Proteomic 2009;8:231-42. [DOI: 10.1093/bfgp/elp023] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Indexed: 12/30/2022]