1
|
Dayi M. Diversity and evolution of transposable elements in the plant-parasitic nematodes. BMC Genomics 2024; 25:511. [PMID: 38783171 PMCID: PMC11118728 DOI: 10.1186/s12864-024-10435-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 05/21/2024] [Indexed: 05/25/2024] Open
Abstract
BACKGROUND Transposable elements (TEs) are mobile DNA sequences that propagate within genomes, occupying a significant portion of eukaryotic genomes and serving as a source of genetic variation and innovation. TEs can impact genome dynamics through their repetitive nature and mobility. Nematodes are incredibly versatile organisms, capable of thriving in a wide range of environments. The plant-parasitic nematodes are able to infect nearly all vascular plants, leading to significant crop losses and management expenses worldwide. It is worth noting that plant parasitism has evolved independently at least three times within this nematode group. Furthermore, the genome size of plant-parasitic nematodes can vary substantially, spanning from 41.5 Mbp to 235 Mbp. To investigate genome size variation and evolution in plant-parasitic nematodes, TE composition, diversity, and evolution were analysed in 26 plant-parasitic nematodes from 9 distinct genera in Clade IV. RESULTS Interestingly, despite certain species lacking specific types of DNA transposons or retrotransposon superfamilies, they still exhibit a diverse range of TE content. Identification of species-specific TE repertoire in nematode genomes provides a deeper understanding of genome evolution in plant-parasitic nematodes. An intriguing observation is that plant-parasitic nematodes possess extensive DNA transposons and retrotransposon insertions, including recent sightings of LTR/Gypsy and LTR/Pao superfamilies. Among them, the Gypsy superfamilies were found to encode Aspartic proteases in the plant-parasitic nematodes. CONCLUSIONS The study of the transposable element (TE) composition in plant-parasitic nematodes has yielded insightful discoveries. The findings revealed that certain species exhibit lineage-specific variations in their TE makeup. Discovering the species-specific TE repertoire in nematode genomes is a crucial element in understanding the evolution of genomes in plant-parasitic nematodes. It allows us to gain a deeper insight into the intricate workings of these organisms and their genetic makeup. With this knowledge, we are gaining a fundamental piece in the puzzle of understanding the evolution of these parasites. Moreover, recent transpositions have led to the acquisition of new TE superfamilies, especially Gypsy and Pao retrotransposons, further expanding the diversity of TEs in these nematodes. Significantly, the widely distributed Gypsy superfamily possesses proteases that are exclusively associated with parasitism during nematode-host interactions. These discoveries provide a deeper understanding of the TE landscape within plant-parasitic nematodes.
Collapse
Affiliation(s)
- Mehmet Dayi
- Forestry Vocational School, Düzce University, Konuralp Campus, 81620, Düzce, Türkiye.
- Faculty of Medicine, University of Miyazaki, Miyazaki, Japan.
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8562, Japan.
| |
Collapse
|
2
|
Sternberg PW, Van Auken K, Wang Q, Wright A, Yook K, Zarowiecki M, Arnaboldi V, Becerra A, Brown S, Cain S, Chan J, Chen WJ, Cho J, Davis P, Diamantakis S, Dyer S, Grigoriadis D, Grove CA, Harris T, Howe K, Kishore R, Lee R, Longden I, Luypaert M, Müller HM, Nuin P, Quinton-Tulloch M, Raciti D, Schedl T, Schindelman G, Stein L. WormBase 2024: status and transitioning to Alliance infrastructure. Genetics 2024; 227:iyae050. [PMID: 38573366 PMCID: PMC11075546 DOI: 10.1093/genetics/iyae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/19/2024] [Accepted: 03/20/2024] [Indexed: 04/05/2024] Open
Abstract
WormBase has been the major repository and knowledgebase of information about the genome and genetics of Caenorhabditis elegans and other nematodes of experimental interest for over 2 decades. We have 3 goals: to keep current with the fast-paced C. elegans research, to provide better integration with other resources, and to be sustainable. Here, we discuss the current state of WormBase as well as progress and plans for moving core WormBase infrastructure to the Alliance of Genome Resources (the Alliance). As an Alliance member, WormBase will continue to interact with the C. elegans community, develop new features as needed, and curate key information from the literature and large-scale projects.
Collapse
Affiliation(s)
- Paul W Sternberg
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Kimberly Van Auken
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Qinghua Wang
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Adam Wright
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Karen Yook
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Magdalena Zarowiecki
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Valerio Arnaboldi
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Andrés Becerra
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Stephanie Brown
- School of Infection and Immunity, University of Glasgow, Glasgow G12 8TA, UK
| | - Scott Cain
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Juancarlos Chan
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Wen J Chen
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Jaehyoung Cho
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paul Davis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Stavros Diamantakis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Sarah Dyer
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | | | - Christian A Grove
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Todd Harris
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Kevin Howe
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Ranjana Kishore
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Raymond Lee
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Ian Longden
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Manuel Luypaert
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Hans-Michael Müller
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Paulo Nuin
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| | - Mark Quinton-Tulloch
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | - Daniela Raciti
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Tim Schedl
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Gary Schindelman
- Division of Biology and Biological Engineering 140-18, California Institute of Technology, Pasadena, CA 91125, USA
| | - Lincoln Stein
- Informatics and Bio-computing Platform, Ontario Institute for Cancer Research, Toronto, ON M5G0A3, Canada
| |
Collapse
|
3
|
Lashari A, Kazi TG, Afridi HI, Baig JA, Arain MB, Lashari AA. Evaluate the Work-Related Exposure of Vanadium on Scalp Hair Samples of Outdoor and Administrative Workers of Oil Drilling Field: Related Health Risks. Biol Trace Elem Res 2024:10.1007/s12011-024-04101-y. [PMID: 38376729 DOI: 10.1007/s12011-024-04101-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 02/08/2024] [Indexed: 02/21/2024]
Abstract
Petrochemical facilities, including oil well drilling, are discharging resources of extensive noxious waste into the environment. The workers in different sections might be exposed to vanadium (V) through different routes (groundwater and soil), which is linked with extensive physiological disorders, hypertension, respiratory disorders, anemia, skin, and gastrointestinal disorders. This study determined the contents of V in a biological sample (scalp hair) of workers of different categories (outdoor and office workers) in an oil drilling field in Sindh, Pakistan. The environmental samples, groundwater, bottled mineral water, and soil samples were also analyzed for V. For comparative purposes, the scalp hair of age-matched male subjects residing in domestic areas of Hyderabad city, Pakistan, was also analyzed. Generally, the concentrations of V in groundwater near the oil drilling field and drilled soil illustrated significant variations. The results show that the vanadium concentration in the scalp hair of non-exposed referents (controls) and office workers (exposed referents) was 62% and 45% lower than those observed for outdoor drilling and cleaning mud workers. It was observed that high exposure to V in outdoor workers might be linked with different physiological disorders such as anemia, eye problems, and bronchial disorders.
Collapse
Affiliation(s)
- Anjum Lashari
- National Centre of Excellence in Analytical Chemistry, University of Sindh, Jamshoro, 76080, Pakistan
| | - Tasneem G Kazi
- National Centre of Excellence in Analytical Chemistry, University of Sindh, Jamshoro, 76080, Pakistan.
| | - Hassan I Afridi
- National Centre of Excellence in Analytical Chemistry, University of Sindh, Jamshoro, 76080, Pakistan
| | - Jameel A Baig
- National Centre of Excellence in Analytical Chemistry, University of Sindh, Jamshoro, 76080, Pakistan
| | - Mohammad B Arain
- Department of Chemistry University of Karachi, Karachi, Sindh, 75270, Pakistan
| | - Ayaz Ali Lashari
- National Centre of Excellence in Analytical Chemistry, University of Sindh, Jamshoro, 76080, Pakistan
| |
Collapse
|
4
|
Dayi M. Evolution of parasitism genes in the plant parasitic nematodes. Sci Rep 2024; 14:3733. [PMID: 38355886 PMCID: PMC10866927 DOI: 10.1038/s41598-024-54330-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 02/11/2024] [Indexed: 02/16/2024] Open
Abstract
The plant-parasitic nematodes are considered as one of the most destructive pests, from which the migratory and sedentary endoparasitic plant parasitic nematodes infect more than 4000 plant species and cause over $100 billion crop losses annually worldwide. These nematodes use multiple strategies to infect their host and to establish a successful parasitism inside the host such as cell-wall degradation enzymes, inhibition of host defense proteins, and molecular mimicry. In the present study, the main parasitism-associated gene families were identified and compared between the migratory and sedentary endoparasitic nematodes. The results showed that the migratory and sedentary endoparasitic nematodes share a core conserved parasitism mechanism established throughout the evolution of parasitism. However, genes involved in pectin degradation and hydrolase activity are rapidly evolving in the migratory endoparasitic nematodes. Additionally, cell-wall degrading enzymes such as GH45 cellulases and pectate lyase and peptidase and peptidase inhibitors were expanded in the migratory endoparasitic nematodes. The molecular mimicry mechanism was another key finding that differs between the endoparasitic and sedentary parasitic nematodes. The PL22 gene family, which is believed to play a significant role in the molecular mechanisms of nematode parasitism, has been found to be present exclusively in migratory endoparasitic nematodes. Phylogenetic analysis has suggested that it was de novo born in these nematodes. This discovery sheds new light on the molecular evolution of these parasites and has significant implications for our understanding of their biology and pathogenicity. This study contributes to our understanding of core parasitism mechanisms conserved throughout the nematodes and provides unique clues on the evolution of parasitism and the direction shaped by the host.
Collapse
Affiliation(s)
- Mehmet Dayi
- Forestry Vocational School, Düzce University, Konuralp Campus, 81620, Düzce, Turkey.
- Faculty of Medicine, University of Miyazaki, Miyazaki, Japan.
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8562, Japan.
| |
Collapse
|
5
|
Tanaka SE, Dayi M, Maeda Y, Tsai IJ, Tanaka R, Bligh M, Takeuchi-Kaneko Y, Fukuda K, Kanzaki N, Kikuchi T. Stage-specific transcriptome of Bursaphelenchus xylophilus reveals temporal regulation of effector genes and roles of the dauer-like stages in the lifecycle. Sci Rep 2019; 9:6080. [PMID: 30988401 PMCID: PMC6465311 DOI: 10.1038/s41598-019-42570-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Accepted: 04/01/2019] [Indexed: 12/24/2022] Open
Abstract
The pine wood nematode Bursaphelenchus xylophilus is the causal agent of pine wilt disease, one of the most devastating forest diseases in East Asian and West European countries. The lifecycle of B. xylophilus includes four propagative larval stages and gonochoristic adults which are involved in the pathogenicity, and two stages of dispersal larvae involved in the spread of the disease. To elucidate the ecological roles of each developmental stage in the pathogenic life cycle, we performed a comprehensive transcriptome analysis using RNA-seq generated from all developmental stages of B. xylophilus and compared transcriptomes between stages. We found more than 9000 genes are differentially expressed in at least one stage of the life cycle including genes involved in general nematode biology such as reproduction and moulting but also effector genes likely to be involved in parasitism. The dispersal-stage transcriptome revealed its analogy to C. elegans dauer and the distinct roles of the two larval stages from each other regarding survival and transmission. This study provides important insights and resources to understand B. xylophilus parasitic biology.
Collapse
Affiliation(s)
- Suguru E Tanaka
- Laboratory of Forest Botany, Graduate School of Agricultural and Life Sciences, the University of Tokyo, Tokyo, 113-8657, Japan
| | - Mehmet Dayi
- Division of Parasitology, Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
- Forestry Vocational School, Duzce University, 81620, Duzce, Turkey
| | - Yasunobu Maeda
- Division of Parasitology, Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Isheng J Tsai
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Ryusei Tanaka
- Division of Parasitology, Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Mark Bligh
- Division of Parasitology, Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan
| | - Yuko Takeuchi-Kaneko
- Laboratory of Terrestrial Microbial Ecology, Graduate School of Agriculture, Kyoto University, Kyoto, 606-8502, Japan
| | - Kenji Fukuda
- Laboratory of Forest Botany, Graduate School of Agricultural and Life Sciences, the University of Tokyo, Tokyo, 113-8657, Japan
| | - Natsumi Kanzaki
- Kansai Research Center, Forestry and Forest Products Research Institute, Kyoto, 612-0855, Japan
| | - Taisei Kikuchi
- Division of Parasitology, Faculty of Medicine, University of Miyazaki, Miyazaki, 889-1692, Japan.
| |
Collapse
|
6
|
Breland A, Ha SE, Jorgensen BG, Jin B, Gardner TA, Sanders KM, Ro S. Smooth Muscle Transcriptome Browser: offering genome-wide references and expression profiles of transcripts expressed in intestinal SMC, ICC, and PDGFRα + cells. Sci Rep 2019; 9:387. [PMID: 30674925 PMCID: PMC6344548 DOI: 10.1038/s41598-018-36607-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2017] [Accepted: 11/26/2018] [Indexed: 01/02/2023] Open
Abstract
Transcriptome data on the quantitative numbers of transcriptional variants expressed in primary cells offer essential clues into specific cellular functions and biological processes. We have previously collected transcriptomes from primary smooth muscle cells (SMC), interstitial cells of Cajal (ICC), and PDGFRα+ cells (fibroblast-like cells) isolated from murine jejunal and colonic smooth muscle and/or mucosal tissues as well as transcriptomes from the associated tissues (jejunal smooth muscle, colonic smooth muscle, and colonic mucosa). In this study, we have built the Smooth Muscle Transcriptome Browser (SMTB), https://med.unr.edu/physio/transcriptome , a web-based, graphical user interface that offers genetic references and expression profiles of all transcripts expressed at both the cellular (SMC, ICC, and PDGFRα+ cells) and tissue level (smooth muscle and mucosal tissue). This browser brings new insights into the cellular and biological functions of the cell types in gastrointestinal smooth muscle biology.
Collapse
Affiliation(s)
- Adrienne Breland
- Department of Physiology and Cell Biology, University of Nevada School of Medicine, Reno, Nevada, USA
| | - Se Eun Ha
- Department of Physiology and Cell Biology, University of Nevada School of Medicine, Reno, Nevada, USA
| | - Brian G Jorgensen
- Department of Physiology and Cell Biology, University of Nevada School of Medicine, Reno, Nevada, USA
| | - Byungchang Jin
- Department of Physiology and Cell Biology, University of Nevada School of Medicine, Reno, Nevada, USA
| | - Treg A Gardner
- Department of Physiology and Cell Biology, University of Nevada School of Medicine, Reno, Nevada, USA
| | - Kenton M Sanders
- Department of Physiology and Cell Biology, University of Nevada School of Medicine, Reno, Nevada, USA
| | - Seungil Ro
- Department of Physiology and Cell Biology, University of Nevada School of Medicine, Reno, Nevada, USA.
| |
Collapse
|
7
|
Abstract
WormBase is an open-access model-organism database that provides current and accurate genetic information of C. elegans and related nematodes. Users can search WormBase by several fields, including a gene or human disease. A special feature of the database is the inclusion of micropublications, peer-reviewed data that may go unpublished in traditional venues.
Collapse
Affiliation(s)
| | - Daryl D Hurd
- a St. John Fisher College , Rochester , New York , USA
| |
Collapse
|
8
|
Frommolt P, Schumacher B. Wormpath: searching for molecular interaction networks in Caenorhabditis elegans. SOURCE CODE FOR BIOLOGY AND MEDICINE 2015; 10:5. [PMID: 25866556 PMCID: PMC4392734 DOI: 10.1186/s13029-015-0034-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Accepted: 03/12/2015] [Indexed: 11/10/2022]
Abstract
BACKGROUND High-throughput transcriptional profiling using Next-Generation Sequencing (RNA-Seq) or microarray technology have become standard tools in molecular biology. Successful investigations of gene regulatory mechanisms from these data typically employ mathematical models of biological networks. RESULTS We have developed Wormpath, a software for molecular network discovery which operates on the genetic and physical interaction data of the Wormbase, a comprehensive resource of molecular data on Caenorhabditis elegans. We use Wormpath to show that the insulin/insulin-like growth factor signalling (IIS) pathway responds to UV-induced DNA damage during development. CONCLUSIONS Our software provides highly facilitated access to C. elegans interaction data and is capable of identifying essential molecular networks within a list of differentially expressed genes.
Collapse
Affiliation(s)
- Peter Frommolt
- CECAD Research Center, University of Cologne, Joseph-Stelzmann-Str. 26, 50931 Cologne, Germany ; Cologne Center for Genomics, University of Cologne, Cologne, Germany ; Systems Biology of Aging Cologne (SyBACol), University of Cologne, Cologne, Germany
| | - Björn Schumacher
- CECAD Research Center, University of Cologne, Joseph-Stelzmann-Str. 26, 50931 Cologne, Germany ; Systems Biology of Aging Cologne (SyBACol), University of Cologne, Cologne, Germany ; Institute for Genome Stability in Aging and Disease, Medical Faculty, University of Cologne, Cologne, Germany ; Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
| |
Collapse
|
9
|
Glauser DA. The multiplicity of alternative splicing decisions in Caenorhabditis elegans is linked to specific intronic regulatory motifs and minisatellites. BMC Genomics 2014; 15:364. [PMID: 24884695 PMCID: PMC4039745 DOI: 10.1186/1471-2164-15-364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Accepted: 04/15/2014] [Indexed: 11/28/2022] Open
Abstract
Background Alternative splicing diversifies the pool of messenger RNA molecules encoded by individual genes. This diversity is particularly high when multiple splicing decisions cause a combinatorial arrangement of several alternate exons. We know very little on how the multiple decisions occurring during the maturation of single transcripts are coordinated and whether specific sequence elements might be involved. Results Here, the Caenorhabditis elegans genome was surveyed in order to identify sequence elements that might play a specific role in the regulation of multiple splicing decisions. The introns flanking alternate exons in transcripts whose maturation involves multiple alternative splicing decisions were compared to those whose maturation involves a single decision. Fifty-eight penta-, hexa-, and hepta-meric elements, clustered in 17 groups, were significantly over-represented in genes subject to multiple alternative splicing decisions. Most of these motifs relate to known splicing regulatory elements and appear to be well conserved in the related species Caenorhabditis briggsae. The usage of specific motifs is not linked to the gene product function, but rather depends on the gene structure, since it is influenced by the distance separating the multiple splicing decision sites. Two of these motifs are part of the CeRep25B minisatellite, which is also over-represented at the vicinity of alternative splicing regions. Most of the remaining motifs are not part of repeated sequence elements, but tend to occur in specific heterologous pairs in genes subject to multiple alternative splicing decisions. Conclusions The existence of specific intronic sequence elements linked to multiple alternative splicing decisions is intriguing and suggests that these elements might have some specialized regulatory role during splicing. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-364) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Dominique A Glauser
- Department of Biology, University of Fribourg, Chemin du Musée 10, 1700 Fribourg, Switzerland.
| |
Collapse
|
10
|
Wang J, Chen D, Lei Y, Chang JW, Hao BH, Xing F, Li S, Xu Q, Deng XX, Chen LL. Citrus sinensis annotation project (CAP): a comprehensive database for sweet orange genome. PLoS One 2014; 9:e87723. [PMID: 24489955 PMCID: PMC3905029 DOI: 10.1371/journal.pone.0087723] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2013] [Accepted: 12/29/2013] [Indexed: 01/31/2023] Open
Abstract
Citrus is one of the most important and widely grown fruit crop with global production ranking firstly among all the fruit crops in the world. Sweet orange accounts for more than half of the Citrus production both in fresh fruit and processed juice. We have sequenced the draft genome of a double-haploid sweet orange (C. sinensis cv. Valencia), and constructed the Citrus sinensis annotation project (CAP) to store and visualize the sequenced genomic and transcriptome data. CAP provides GBrowse-based organization of sweet orange genomic data, which integrates ab initio gene prediction, EST, RNA-seq and RNA-paired end tag (RNA-PET) evidence-based gene annotation. Furthermore, we provide a user-friendly web interface to show the predicted protein-protein interactions (PPIs) and metabolic pathways in sweet orange. CAP provides comprehensive information beneficial to the researchers of sweet orange and other woody plants, which is freely available at http://citrus.hzau.edu.cn/.
Collapse
Affiliation(s)
- Jia Wang
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
- School of Science, Huazhong Agricultural University, Wuhan, P.R. China
| | - Dijun Chen
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
| | - Yang Lei
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
| | - Ji-Wei Chang
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
| | - Bao-Hai Hao
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
| | - Feng Xing
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
| | - Sen Li
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
| | - Qiang Xu
- Key Laboratory of Horticultural Plant Biology of Ministry of Education, Huazhong Agricultural University, Wuhan, P.R. China
| | - Xiu-Xin Deng
- Key Laboratory of Horticultural Plant Biology of Ministry of Education, Huazhong Agricultural University, Wuhan, P.R. China
| | - Ling-Ling Chen
- Center for Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan, P.R. China
- * E-mail:
| |
Collapse
|
11
|
Pálfy M, Farkas IJ, Vellai T, Korcsmáros T. Uniform curation protocol of metazoan signaling pathways to predict novel signaling components. Methods Mol Biol 2013; 1021:285-297. [PMID: 23715991 DOI: 10.1007/978-1-62703-450-0_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
A relatively large number of signaling databases available today have strongly contributed to our understanding of signaling pathway properties. However, pathway comparisons both within and across databases are currently severely hampered by the large variety of data sources and the different levels of detail of their information content (on proteins and interactions). In this chapter, we present a protocol for a uniform curation method of signaling pathways, which intends to overcome this insufficiency. This uniformly curated database called SignaLink ( http://signalink.org ) allows us to systematically transfer pathway annotations between different species, based on orthology, and thereby to predict novel signaling pathway components. Thus, this method enables the compilation of a comprehensive signaling map of a given species and identification of new potential drug targets in humans. We strongly believe that the strict curation protocol we have established to compile a signaling pathway database can also be applied for the compilation of other (e.g., metabolic) databases. Similarly, the detailed guide to the orthology-based prediction of novel signaling components across species may also be utilized for predicting components of other biological processes.
Collapse
Affiliation(s)
- Máté Pálfy
- Department of Genetics, Eötvös Loránd University, Budapest, Hungary
| | | | | | | |
Collapse
|
12
|
Tacutu R, Shore DE, Budovsky A, de Magalhães JP, Ruvkun G, Fraifeld VE, Curran SP. Prediction of C. elegans longevity genes by human and worm longevity networks. PLoS One 2012; 7:e48282. [PMID: 23144747 PMCID: PMC3483217 DOI: 10.1371/journal.pone.0048282] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2012] [Accepted: 09/21/2012] [Indexed: 11/18/2022] Open
Abstract
Intricate and interconnected pathways modulate longevity, but screens to identify the components of these pathways have not been saturating. Because biological processes are often executed by protein complexes and fine-tuned by regulatory factors, the first-order protein-protein interactors of known longevity genes are likely to participate in the regulation of longevity. Data-rich maps of protein interactions have been established for many cardinal organisms such as yeast, worms, and humans. We propose that these interaction maps could be mined for the identification of new putative regulators of longevity. For this purpose, we have constructed longevity networks in both humans and worms. We reasoned that the essential first-order interactors of known longevity-associated genes in these networks are more likely to have longevity phenotypes than randomly chosen genes. We have used C. elegans to determine whether post-developmental inactivation of these essential genes modulates lifespan. Our results suggest that the worm and human longevity networks are functionally relevant and possess a high predictive power for identifying new longevity regulators.
Collapse
Affiliation(s)
- Robi Tacutu
- The Shraga Segal Department of Microbiology, Immunology and Genetics, Center for Multidisciplinary Research on Aging, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - David E. Shore
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Arie Budovsky
- The Shraga Segal Department of Microbiology, Immunology and Genetics, Center for Multidisciplinary Research on Aging, Ben-Gurion University of the Negev, Beer-Sheva, Israel
| | - João Pedro de Magalhães
- Integrative Genomics of Ageing Group, Institute of Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Gary Ruvkun
- Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Vadim E. Fraifeld
- The Shraga Segal Department of Microbiology, Immunology and Genetics, Center for Multidisciplinary Research on Aging, Ben-Gurion University of the Negev, Beer-Sheva, Israel
- * E-mail: (VEF); (SPC)
| | - Sean P. Curran
- Division of Biogerontology, Davis School of Gerontology, University of Southern California, Los Angeles, California, United States of America
- Department of Molecular and Computational Biology, Dornsife College of Letters, Arts, and Sciences, University of Southern California, Los Angeles, California, United States of America
- Department of Biochemistry and Molecular Biology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
- * E-mail: (VEF); (SPC)
| |
Collapse
|
13
|
Abstract
Fluorescent proteins such as the "green fluorescent protein" (GFP) are popular tools in Caenorhabditis elegans, because as genetically encoded markers they are easy to introduce. Furthermore, they can be used in a living animal without the need for extensive sample preparation, because C. elegans is transparent and small enough so that entire animals can be imaged directly. Consequently, fluorescent proteins have emerged as the method of choice to study gene expression in C. elegans and reporter constructs for thousands of genes are currently available. When fused to a protein of interest, fluorescent proteins allow the imaging of its subcellular localization in vivo, offering a powerful alternative to antibody staining techniques. Fluorescent proteins can be employed to label cellular and subcellular structures and as indicators for cell physiological parameters like calcium concentration. Genetic screens relying on fluorescent proteins to visualize anatomical structures and recent progress in automation techniques have tremendously expanded their potential uses. This chapter presents tools and techniques related to the use of fluorescent proteins, discusses their advantages and shortcomings, and provides practical considerations for various applications.
Collapse
Affiliation(s)
- Harald Hutter
- Department of Biological Sciences, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|
14
|
Bolser DM, Chibon PY, Palopoli N, Gong S, Jacob D, Del Angel VD, Swan D, Bassi S, González V, Suravajhala P, Hwang S, Romano P, Edwards R, Bishop B, Eargle J, Shtatland T, Provart NJ, Clements D, Renfro DP, Bhak D, Bhak J. MetaBase--the wiki-database of biological databases. Nucleic Acids Res 2011; 40:D1250-4. [PMID: 22139927 PMCID: PMC3245051 DOI: 10.1093/nar/gkr1099] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Biology is generating more data than ever. As a result, there is an ever increasing number of publicly available databases that analyse, integrate and summarize the available data, providing an invaluable resource for the biological community. As this trend continues, there is a pressing need to organize, catalogue and rate these resources, so that the information they contain can be most effectively exploited. MetaBase (MB) (http://MetaDatabase.Org) is a community-curated database containing more than 2000 commonly used biological databases. Each entry is structured using templates and can carry various user comments and annotations. Entries can be searched, listed, browsed or queried. The database was created using the same MediaWiki technology that powers Wikipedia, allowing users to contribute on many different levels. The initial release of MB was derived from the content of the 2007 Nucleic Acids Research (NAR) Database Issue. Since then, approximately 100 databases have been manually collected from the literature, and users have added information for over 240 databases. MB is synchronized annually with the static Molecular Biology Database Collection provided by NAR. To date, there have been 19 significant contributors to the project; each one is listed as an author here to highlight the community aspect of the project.
Collapse
Affiliation(s)
- Dan M Bolser
- Personal Genomics Institute, Genome Research Foundation, Suwon 443-270, South Korea.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Abstract
In gene prediction, studying phenotypes is highly valuable for reducing the number of locus candidates in association studies and to aid disease gene candidate prioritization. This is due to the intrinsic nature of phenotypes to visibly reflect genetic activity, making them potentially one of the most useful data types for functional studies. However, systematic use of these data has begun only recently. 'Comparative phenomics' is the analysis of genotype-phenotype associations across species and experimental methods. This is an emerging research field of utmost importance for gene discovery and gene function annotation. In this chapter, we review the use of phenotype data in the biomedical field. We will give an overview of phenotype resources, focusing on PhenomicDB--a cross-species genotype-phenotype database--which is the largest available collection of phenotype descriptions across species and experimental methods. We report on its latest extension by which genotype-phenotype relationships can be viewed as graphical representations of similar phenotypes clustered together ('phenoclusters'), supplemented with information from protein-protein interactions and Gene Ontology terms. We show that such 'phenoclusters' represent a novel approach to group genes functionally and to predict novel gene functions with high precision. We explain how these data and methods can be used to supplement the results of gene discovery approaches. The aim of this chapter is to assist researchers interested in understanding how phenotype data can be used effectively in the gene discovery field.
Collapse
|
16
|
Walaas SI, Hemmings HC, Greengard P, Nairn AC. Beyond the dopamine receptor: regulation and roles of serine/threonine protein phosphatases. Front Neuroanat 2011; 5:50. [PMID: 21904525 PMCID: PMC3162284 DOI: 10.3389/fnana.2011.00050] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2011] [Accepted: 07/23/2011] [Indexed: 11/17/2022] Open
Abstract
Dopamine plays an important modulatory role in the central nervous system, helping to control critical aspects of motor function and reward learning. Alteration in normal dopaminergic neurotransmission underlies multiple neurological diseases including schizophrenia, Huntington’s disease, and Parkinson’s disease. Modulation of dopamine-regulated signaling pathways is also important in the addictive actions of most drugs of abuse. Our studies over the last 30 years have focused on the molecular actions of dopamine acting on medium spiny neurons, the predominant neurons of the neostriatum. Striatum-enriched phosphoproteins, particularly dopamine and adenosine 3′:5′-monophosphate-regulated phosphoprotein of 32 kDa (DARPP-32), regulator of calmodulin signaling (RCS), and ARPP-16, mediate pleiotropic actions of dopamine. Notably, each of these proteins, either directly or indirectly, regulates the activity of one of the three major subclasses of serine/threonine protein phosphatases, PP1, PP2B, and PP2A, respectively. For example, phosphorylation of DARPP-32 at Thr34 by protein kinase A results in potent inhibition of PP1, leading to potentiation of dopaminergic signaling at multiple steps from the dopamine receptor to the nucleus. The discovery of DARPP-32 and its emergence as a critical molecular integrator of striatal signaling will be discussed, as will more recent studies that highlight novel roles for RCS and ARPP-16 in dopamine-regulated striatal signaling pathways.
Collapse
Affiliation(s)
- Sven Ivar Walaas
- Department of Biochemistry, Institute of Basic Medical Sciences, University of Oslo Oslo, Norway
| | | | | | | |
Collapse
|
17
|
Desalermos A, Muhammed M, Glavis-Bloom J, Mylonakis E. Using C. elegans for antimicrobial drug discovery. Expert Opin Drug Discov 2011; 6:645-652. [PMID: 21686092 DOI: 10.1517/17460441.2011.573781] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
INTRODUCTION: The number of microorganism strains with resistance to known antimicrobials is increasing. Therefore, there is a high demand for new, non-toxic and efficient antimicrobial agents. Research with the microscopic nematode Caenorhabditis elegans can address this high demand for the discovery of new antimicrobial compounds. In particular, C. elegans can be used as a model host for in vivo drug discovery through high-throughput screens of chemical libraries. AREAS COVERED: This review introduces the use of substitute model hosts and especially C. elegans in the study of microbial pathogenesis. The authors also highlight recently published literature on the role of C. elegans in drug discovery and outline its use as a promising host with unique advantages in the discovery of new antimicrobial drugs. EXPERT OPINION: C. elegans can be used, as a model host, to research many diseases, including fungal infections and Alzheimer's disease. In addition, high-throughput techniques, for screening chemical libraries, can also be facilitated. Nevertheless, C. elegans and mammals have significant differences that both limit the use of the nematode in research and the degree by which results can be interpreted. That being said, the use of C. elegans in drug discovery still holds promise and the field continues to grow, with attempts to improve the methodology already underway.
Collapse
Affiliation(s)
- Athanasios Desalermos
- Division of Infectious Diseases, Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02114, USA
| | | | | | | |
Collapse
|
18
|
Wang Y, Chen J, Wei G, He H, Zhu X, Xiao T, Yuan J, Dong B, He S, Skogerbø G, Chen R. The Caenorhabditis elegans intermediate-size transcriptome shows high degree of stage-specific expression. Nucleic Acids Res 2011; 39:5203-14. [PMID: 21378118 PMCID: PMC3130273 DOI: 10.1093/nar/gkr102] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Earlier studies have revealed a substantial amount of transcriptional activity occurring outside annotated protein-coding genes of the Caenorhabditis elegans genome. One important fraction of this transcriptional activity relates to intermediate-size (70–500 nt) transcripts (is-ncRNAs) of mostly unknown function. Profiling the expression of this segment of the transcriptome on a tiling array through the C. elegans life cycle identified 5866 hitherto unannotated transcripts. The novel loci were distributed across intronic and intergenic space, with some enrichment toward protein-coding gene termini. The majority of the putative is-ncRNAs showed either stage-specific expression, or distinct developmental variation in their expression levels. More than 200 loci showed male-specific expression, and conserved loci were significantly enriched on the X chromosome, both observations strongly suggesting involvement of is-ncRNAs in sex-specific functions. Half of the novel loci were conserved in other nematodes, and numerous loci showed significant conservational correlations to nearby coding genes. Assuming functional roles for most of the novel loci, the data imply a nematode is-ncRNA tool kit of considerable size and variety.
Collapse
Affiliation(s)
- Yunfei Wang
- Bioinformatics Laboratory and National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Spencer WC, Zeller G, Watson JD, Henz SR, Watkins KL, McWhirter RD, Petersen S, Sreedharan VT, Widmer C, Jo J, Reinke V, Petrella L, Strome S, Von Stetina SE, Katz M, Shaham S, Rätsch G, Miller DM. A spatial and temporal map of C. elegans gene expression. Genome Res 2011; 21:325-41. [PMID: 21177967 PMCID: PMC3032935 DOI: 10.1101/gr.114595.110] [Citation(s) in RCA: 211] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2010] [Accepted: 12/08/2010] [Indexed: 01/31/2023]
Abstract
The C. elegans genome has been completely sequenced, and the developmental anatomy of this model organism is described at single-cell resolution. Here we utilize strategies that exploit this precisely defined architecture to link gene expression to cell type. We obtained RNAs from specific cells and from each developmental stage using tissue-specific promoters to mark cells for isolation by FACS or for mRNA extraction by the mRNA-tagging method. We then generated gene expression profiles of more than 30 different cells and developmental stages using tiling arrays. Machine-learning-based analysis detected transcripts corresponding to established gene models and revealed novel transcriptionally active regions (TARs) in noncoding domains that comprise at least 10% of the total C. elegans genome. Our results show that about 75% of transcripts with detectable expression are differentially expressed among developmental stages and across cell types. Examination of known tissue- and cell-specific transcripts validates these data sets and suggests that newly identified TARs may exercise cell-specific functions. Additionally, we used self-organizing maps to define groups of coregulated transcripts and applied regulatory element analysis to identify known transcription factor- and miRNA-binding sites, as well as novel motifs that likely function to control subsets of these genes. By using cell-specific, whole-genome profiling strategies, we have detected a large number of novel transcripts and produced high-resolution gene expression maps that provide a basis for establishing the roles of individual genes in cellular differentiation.
Collapse
Affiliation(s)
- W. Clay Spencer
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Georg Zeller
- Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Joseph D. Watson
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Stefan R. Henz
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Kathie L. Watkins
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Rebecca D. McWhirter
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Sarah Petersen
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Vipin T. Sreedharan
- Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
| | - Christian Widmer
- Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
| | - Jeanyoung Jo
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520, USA
| | - Valerie Reinke
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520, USA
| | - Lisa Petrella
- Department of MCD Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Susan Strome
- Department of MCD Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
| | - Stephen E. Von Stetina
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
| | - Menachem Katz
- Laboratory of Developmental Genetics, The Rockefeller University, New York, New York 10065, USA
| | - Shai Shaham
- Laboratory of Developmental Genetics, The Rockefeller University, New York, New York 10065, USA
| | - Gunnar Rätsch
- Friedrich Miescher Laboratory of the Max Planck Society, 72076 Tübingen, Germany
| | - David M. Miller
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, Tennessee 37232, USA
| |
Collapse
|
20
|
Abstract
With unique genetic and cell biological strengths, C. elegans has emerged as a powerful model system for studying many biological processes. These processes are typically regulated by complex genetic networks consisting of genes. Identifying those genes and organizing them into genetic pathways are two major steps toward understanding the mechanisms that regulate biological events. Forward genetic screens with various designs are a traditional approach for identifying candidate genes. The completion of the genome sequencing in C. elegans and the advent of high-throughput experimental techniques have led to the development of two additional powerful approaches: functional genomics and systems biology. Genes that are discovered by these approaches can be ordered into interacting pathways through a variety of strategies, involving genetics, cell biology, biochemistry, and functional genomics, to gain a more complete understanding of how gene regulatory networks control a particular biological process. The aim of this review is to provide an overview of the approaches available to identify and construct the genetic pathways using C. elegans.
Collapse
Affiliation(s)
- Zheng Wang
- Dept. of Biology, Duke University, Durham NC
| | | |
Collapse
|
21
|
Jensen VL, Simonsen KT, Lee YH, Park D, Riddle DL. RNAi screen of DAF-16/FOXO target genes in C. elegans links pathogenesis and dauer formation. PLoS One 2010; 5:e15902. [PMID: 21209831 PMCID: PMC3013133 DOI: 10.1371/journal.pone.0015902] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2010] [Accepted: 11/30/2010] [Indexed: 11/19/2022] Open
Abstract
The DAF-16/FOXO transcription factor is the major downstream output of the insulin/IGF1R signaling pathway controlling C. elegans dauer larva development and aging. To identify novel downstream genes affecting dauer formation, we used RNAi to screen candidate genes previously identified to be regulated by DAF-16. We used a sensitized genetic background [eri-1(mg366); sdf-9(m708)], which enhances both RNAi efficiency and constitutive dauer formation (Daf-c). Among 513 RNAi clones screened, 21 displayed a synthetic Daf-c (SynDaf) phenotype with sdf-9. One of these genes, srh-100, was previously identified to be SynDaf, but twenty have not previously been associated with dauer formation. Two of the latter genes, lys-1 and cpr-1, are known to participate in innate immunity and six more are predicted to do so, suggesting that the immune response may contribute to the dauer decision. Indeed, we show that two of these genes, lys-1 and clc-1, are required for normal resistance to Staphylococcus aureus. clc-1 is predicted to function in epithelial cohesion. Dauer formation exhibited by daf-8(m85), sdf-9(m708), and the wild-type N2 (at 27°C) were all enhanced by exposure to pathogenic bacteria, while not enhanced in a daf-22(m130) background. We conclude that knockdown of the genes required for proper pathogen resistance increases pathogenic infection, leading to increased dauer formation in our screen. We propose that dauer larva formation is a behavioral response to pathogens mediated by increased dauer pheromone production.
Collapse
Affiliation(s)
- Victor L. Jensen
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Karina T. Simonsen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Yu-Hui Lee
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Donha Park
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
| | - Donald L. Riddle
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
- * E-mail:
| |
Collapse
|
22
|
Jensen VL, Bialas NJ, Bishop-Hurley SL, Molday LL, Kida K, Nguyen PAT, Blacque OE, Molday RS, Leroux MR, Riddle DL. Localization of a guanylyl cyclase to chemosensory cilia requires the novel ciliary MYND domain protein DAF-25. PLoS Genet 2010; 6:e1001199. [PMID: 21124868 PMCID: PMC2991253 DOI: 10.1371/journal.pgen.1001199] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2010] [Accepted: 10/07/2010] [Indexed: 11/19/2022] Open
Abstract
In harsh conditions, Caenorhabditis elegans arrests development to enter a non-aging, resistant diapause state called the dauer larva. Olfactory sensation modulates the TGF-β and insulin signaling pathways to control this developmental decision. Four mutant alleles of daf-25 (abnormal DAuer Formation) were isolated from screens for mutants exhibiting constitutive dauer formation and found to be defective in olfaction. The daf-25 dauer phenotype is suppressed by daf-10/IFT122 mutations (which disrupt ciliogenesis), but not by daf-6/PTCHD3 mutations (which prevent environmental exposure of sensory cilia), implying that DAF-25 functions in the cilia themselves. daf-25 encodes the C. elegans ortholog of mammalian Ankmy2, a MYND domain protein of unknown function. Disruption of DAF-25, which localizes to sensory cilia, produces no apparent cilia structure anomalies, as determined by light and electron microscopy. Hinting at its potential function, the dauer phenotype, epistatic order, and expression profile of daf-25 are similar to daf-11, which encodes a cilium-localized guanylyl cyclase. Indeed, we demonstrate that DAF-25 is required for proper DAF-11 ciliary localization. Furthermore, the functional interaction is evolutionarily conserved, as mouse Ankmy2 interacts with guanylyl cyclase GC1 from ciliary photoreceptors. The interaction may be specific because daf-25 mutants have normally-localized OSM-9/TRPV4, TAX-4/CNGA1, CHE-2/IFT80, CHE-11/IFT140, CHE-13/IFT57, BBS-8, OSM-5/IFT88, and XBX-1/D2LIC in the cilia. Intraflagellar transport (IFT) (required to build cilia) is not defective in daf-25 mutants, although the ciliary localization of DAF-25 itself is influenced in che-11 mutants, which are defective in retrograde IFT. In summary, we have discovered a novel ciliary protein that plays an important role in cGMP signaling by localizing a guanylyl cyclase to the sensory organelle.
Collapse
Affiliation(s)
- Victor L. Jensen
- Medical Genetics, University of British Columbia, Vancouver, Canada
| | - Nathan J. Bialas
- Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, Canada
| | - Sharon L. Bishop-Hurley
- Division of Biological Sciences, University of Missouri, Columbia, Missouri, United States of America
- CSIRO-Livestock Industries, Queensland Biosciences Precinct, Brisbane, Australia
| | - Laurie L. Molday
- Centre for Macular Research, University of British Columbia, Vancouver, Canada
| | - Katarzyna Kida
- School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, Belfield, Dublin, Ireland
| | | | - Oliver E. Blacque
- School of Biomolecular and Biomedical Science, UCD Conway Institute, University College Dublin, Belfield, Dublin, Ireland
| | - Robert S. Molday
- Centre for Macular Research, University of British Columbia, Vancouver, Canada
| | - Michel R. Leroux
- Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, Canada
| | - Donald L. Riddle
- Medical Genetics, University of British Columbia, Vancouver, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
- * E-mail:
| |
Collapse
|
23
|
Jan CH, Friedman RC, Ruby JG, Bartel DP. Formation, regulation and evolution of Caenorhabditis elegans 3'UTRs. Nature 2010; 469:97-101. [PMID: 21085120 PMCID: PMC3057491 DOI: 10.1038/nature09616] [Citation(s) in RCA: 367] [Impact Index Per Article: 26.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2010] [Accepted: 10/29/2010] [Indexed: 11/24/2022]
Abstract
Posttranscriptional gene regulation frequently occurs through elements in mRNA 3′ untranslated regions (UTRs)1,2. Although crucial roles for 3′UTR-mediated gene regulation have been found in Caenorhabditis elegans3,4,5, most C. elegans genes have lacked annotated 3′UTRs6,7. Here we describe a high-throughput method to reliably identify polyadenylated RNA termini, and we apply this method, called poly(A)-position profiling by sequencing (3P-Seq), to determine C. elegans 3′UTRs. Compared to standard methods also recently applied to C. elegans UTRs8, 3P-Seq identified 8,581 additional UTRs while excluding thousands of shorter UTR isoforms that do not appear to be authentic. Analysis of this expanded and corrected dataset suggested that the high A/U content of C. elegans 3′UTRs facilitated genome compaction, since the elements specifying cleavage and polyadenylation, which are A/U-rich, can more readily emerge in A/U rich regions. Indeed, 30% of the protein-coding genes have mRNAs with alternative, partially overlapping end regions that generate another 10,498 cleavage and polyadenylation sites that had gone largely unnoticed and represent potential evolutionary intermediates of progressive UTR shortening. Moreover, a third of the convergently transcribed genes utilize palindromic arrangements of bidirectional elements to specify UTRs with convergent overlap, which also contributes to genome compaction by eliminating regions between genes. Although nematode 3′UTRs have median length only one-sixth that of mammalian 3′UTRs, they have twice the density of conserved microRNA sites, in part because additional types of seed-complementary sites are preferentially conserved. These findings reveal the influence of cleavage and polyadenylation on the evolution of genome architecture and provide resources for studying posttranscriptional gene regulation.
Collapse
Affiliation(s)
- Calvin H Jan
- Whitehead Institute for Biomedical Research, Cambridge, Massachusetts 02142, USA
| | | | | | | |
Collapse
|
24
|
Rao W, Isaac RE, Keen JN. An analysis of the Caenorhabditis elegans lipid raft proteome using geLC-MS/MS. J Proteomics 2010; 74:242-53. [PMID: 21070894 DOI: 10.1016/j.jprot.2010.11.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2010] [Revised: 10/20/2010] [Accepted: 11/02/2010] [Indexed: 11/16/2022]
Abstract
Lipid rafts are microdomains of the phospholipid bilayer, proposed to form semi-stable "islands" that act as a platform for several important cellular processes; major classes of raft-resident proteins include signalling proteins and glycosylphosphatidylinositol (GPI)-anchored proteins. Proteomic studies into lipid rafts have been mainly carried out in mammalian cell lines and single cell organisms. The nematode Caenorhabditis elegans, the model organism with a well-defined developmental profile, is ideally suited for the study of this subcellular locale in a complex developmental context. A study of the lipid raft proteome of C. elegans is presented here. A total of 44 proteins were identified from the lipid raft fraction using geLC-MS/MS, of which 40 have been determined to be likely raft proteins after analysis of predicted functions. Prediction of GPI-anchoring of the proteins found 21 to be potentially modified in this way, two of which were experimentally confirmed to be GPI-anchored. This work is the first reported study of the lipid raft proteome in C. elegans. The results show that raft proteins, including numerous GPI-anchored proteins, may have a variety of potentially important roles within the nematode, and will hopefully lead to C. elegans becoming a useful model for the study of lipid rafts.
Collapse
Affiliation(s)
- Wei Rao
- Institute of Integrative and Comparative Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, United Kingdom
| | | | | |
Collapse
|
25
|
Denver DR, Howe DK, Wilhelm LJ, Palmer CA, Anderson JL, Stein KC, Phillips PC, Estes S. Selective sweeps and parallel mutation in the adaptive recovery from deleterious mutation in Caenorhabditis elegans. Genome Res 2010; 20:1663-71. [PMID: 21036923 DOI: 10.1101/gr.108191.110] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Deleterious mutation poses a serious threat to human health and the persistence of small populations. Although adaptive recovery from deleterious mutation has been well-characterized in prokaryotes, the evolutionary mechanisms by which multicellular eukaryotes recover from deleterious mutation remain unknown. We applied high-throughput DNA sequencing to characterize genomic divergence patterns associated with the adaptive recovery from deleterious mutation using a Caenorhabditis elegans recovery-line system. The C. elegans recovery lines were initiated from a low-fitness mutation-accumulation (MA) line progenitor and allowed to independently evolve in large populations (N ∼ 1000) for 60 generations. All lines rapidly regained levels of fitness similar to the wild-type (N2) MA line progenitor. Although there was a near-zero probability of a single mutation fixing due to genetic drift during the recovery experiment, we observed 28 fixed mutations. Cross-generational analysis showed that all mutations went from undetectable population-level frequencies to a fixed state in 10-20 generations. Many recovery-line mutations fixed at identical timepoints, suggesting that the mutations, if not beneficial, hitchhiked to fixation during selective sweep events observed in the recovery lines. No MA line mutation reversions were detected. Parallel mutation fixation was observed for two sites in two independent recovery lines. Analysis using a C. elegans interactome map revealed many predicted interactions between genes with recovery line-specific mutations and genes with previously accumulated MA line mutations. Our study suggests that recovery-line mutations identified in both coding and noncoding genomic regions might have beneficial effects associated with compensatory epistatic interactions.
Collapse
Affiliation(s)
- Dee R Denver
- Department of Zoology, Oregon State University, Corvallis, Oregon 97331, USA.
| | | | | | | | | | | | | | | |
Collapse
|
26
|
Sreedharan S, Stephansson O, Schiöth HB, Fredriksson R. Long evolutionary conservation and considerable tissue specificity of several atypical solute carrier transporters. Gene 2010; 478:11-8. [PMID: 21044875 DOI: 10.1016/j.gene.2010.10.011] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2010] [Revised: 10/12/2010] [Accepted: 10/20/2010] [Indexed: 10/18/2022]
Abstract
The superfamily of Solute Carriers (SLCs) has around 384 members in the human genome grouped into at least 48 families. While many of these transporters have been well characterized with established important biological functions, there are few recently identified genes that are not studied regarding tissue distribution or evolutionary origin. Here we study 14 of these recently discovered SLC genes (HIAT1, HIATL1, MFSD1, MFSD5, MFSD6, MFSD9, MFSD10, SLC7A14, SLC7A15, SLC10A6, SLC15A5, SLC16A12, SLC30A10 and SLC21A21) with the purpose to give much better picture over the sequence relationship and tissue expression of the diverse SLC gene family. We used a range of bioinformatic methods to classify each of these genes into the different SLC gene families. We found that 9 of the 14 atypical SLCs are distant members of the Major Facilitator Superfamily (MFS) clan while the others belong to the APC clan, the DMT clan, the CPA_AT clan and the IT clan. We found most of the genes to be highly evolutionary conserved, likely to be present in most bilateral species, except for SLC21A21 that we found only present in mammals. Several of these transporter genes have highly specific tissue expression profile while it is notable that most are expressed in the CNS with the exception of SLC21A21 and SLC15A5. This work provides fundamental information on 14 transporters that previously have not received much attention enabling a more comprehensive view over the SLC superfamily.
Collapse
Affiliation(s)
- Smitha Sreedharan
- Department of Neuroscience, Functional Pharmacology, Uppsala University, BMC, Uppsala SE 75124, Sweden
| | | | | | | |
Collapse
|
27
|
Davis MJ, Sehgal MSB, Ragan MA. Automatic, context-specific generation of Gene Ontology slims. BMC Bioinformatics 2010; 11:498. [PMID: 20929524 PMCID: PMC3098080 DOI: 10.1186/1471-2105-11-498] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2010] [Accepted: 10/07/2010] [Indexed: 11/10/2022] Open
Abstract
Background The use of ontologies to control vocabulary and structure annotation has added value to genome-scale data, and contributed to the capture and re-use of knowledge across research domains. Gene Ontology (GO) is widely used to capture detailed expert knowledge in genomic-scale datasets and as a consequence has grown to contain many terms, making it unwieldy for many applications. To increase its ease of manipulation and efficiency of use, subsets called GO slims are often created by collapsing terms upward into more general, high-level terms relevant to a particular context. Creation of a GO slim currently requires manipulation and editing of GO by an expert (or community) familiar with both the ontology and the biological context. Decisions about which terms to include are necessarily subjective, and the creation process itself and subsequent curation are time-consuming and largely manual. Results Here we present an objective framework for generating customised ontology slims for specific annotated datasets, exploiting information latent in the structure of the ontology graph and in the annotation data. This framework combines ontology engineering approaches, and a data-driven algorithm that draws on graph and information theory. We illustrate this method by application to GO, generating GO slims at different information thresholds, characterising their depth of semantics and demonstrating the resulting gains in statistical power. Conclusions Our GO slim creation pipeline is available for use in conjunction with any GO-annotated dataset, and creates dataset-specific, objectively defined slims. This method is fast and scalable for application to other biomedical ontologies.
Collapse
Affiliation(s)
- Melissa J Davis
- The University of Queensland, Brisbane, QLD 4072, Australia.
| | | | | |
Collapse
|
28
|
Sen TZ, Harper LC, Schaeffer ML, Andorf CM, Seigfried TE, Campbell DA, Lawrence CJ. Choosing a genome browser for a Model Organism Database: surveying the maize community. Database (Oxford) 2010; 2010:baq007. [PMID: 20627860 PMCID: PMC2911842 DOI: 10.1093/database/baq007] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2009] [Revised: 03/08/2010] [Accepted: 03/09/2010] [Indexed: 11/12/2022]
Abstract
As the B73 maize genome sequencing project neared completion, MaizeGDB began to integrate a graphical genome browser with its existing web interface and database. To ensure that maize researchers would optimally benefit from the potential addition of a genome browser to the existing MaizeGDB resource, personnel at MaizeGDB surveyed researchers' needs. Collected data indicate that existing genome browsers for maize were inadequate and suggest implementation of a browser with quick interface and intuitive tools would meet most researchers' needs. Here, we document the survey's outcomes, review functionalities of available genome browser software platforms and offer our rationale for choosing the GBrowse software suite for MaizeGDB. Because the genome as represented within the MaizeGDB Genome Browser is tied to detailed phenotypic data, molecular marker information, available stocks, etc., the MaizeGDB Genome Browser represents a novel mechanism by which the researchers can leverage maize sequence information toward crop improvement directly. Database URL: http://gbrowse.maizegdb.org/
Collapse
Affiliation(s)
- Taner Z Sen
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA 50011, USA.
| | | | | | | | | | | | | |
Collapse
|
29
|
Nematode parasite genes: what's in a name? Trends Parasitol 2010; 26:334-40. [DOI: 10.1016/j.pt.2010.04.003] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2009] [Revised: 04/08/2010] [Accepted: 04/09/2010] [Indexed: 11/23/2022]
|
30
|
Korcsmáros T, Farkas IJ, Szalay MS, Rovó P, Fazekas D, Spiró Z, Böde C, Lenti K, Vellai T, Csermely P. Uniformly curated signaling pathways reveal tissue-specific cross-talks and support drug target discovery. ACTA ACUST UNITED AC 2010; 26:2042-50. [PMID: 20542890 DOI: 10.1093/bioinformatics/btq310] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
MOTIVATION Signaling pathways control a large variety of cellular processes. However, currently, even within the same database signaling pathways are often curated at different levels of detail. This makes comparative and cross-talk analyses difficult. RESULTS We present SignaLink, a database containing eight major signaling pathways from Caenorhabditis elegans, Drosophila melanogaster and humans. Based on 170 review and approximately 800 research articles, we have compiled pathways with semi-automatic searches and uniform, well-documented curation rules. We found that in humans any two of the eight pathways can cross-talk. We quantified the possible tissue- and cancer-specific activity of cross-talks and found pathway-specific expression profiles. In addition, we identified 327 proteins relevant for drug target discovery. CONCLUSIONS We provide a novel resource for comparative and cross-talk analyses of signaling pathways. The identified multi-pathway and tissue-specific cross-talks contribute to the understanding of the signaling complexity in health and disease, and underscore its importance in network-based drug target selection. AVAILABILITY http://SignaLink.org.
Collapse
|
31
|
Haegeman A, Elsen A, De Waele D, Gheysen G. Emerging molecular knowledge on Radopholus similis, an important nematode pest of banana. MOLECULAR PLANT PATHOLOGY 2010; 11:315-23. [PMID: 20447280 PMCID: PMC6640332 DOI: 10.1111/j.1364-3703.2010.00614.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
TAXONOMY Superkingdom Eukaryota; Kingdom Metazoa; Phylum Nematoda; Class Chromadorea; Order Rhabditida; Suborder Tylenchina; Infraorder Tylenchomorpha; Superfamily Tylenchoidea; Family Pratylenchidae; Subfamily Radopholinae; Genus Radopholus. PHYSICAL PROPERTIES Microscopic unsegmented worm; migratory endoparasite of plants. Strong sexual dimorphism; reproduction both by amphimixis and self-fertilization. HOSTS Over 250 different plant species, including citrus, black pepper and banana (main host plant). SYMPTOMS Purple to black lesions and extensive cavities in plant roots, leading to reduced uptake of water and nutrients. In banana, this may result in poor vegetative growth, reduced bunch weight and toppling of plants. DISEASE CONTROL Nematicides, alternative cropping systems, nematode-free planting material, some resistant cultivars. AGRONOMIC IMPORTANCE Major problem in banana plantations in tropical regions worldwide.
Collapse
Affiliation(s)
- Annelies Haegeman
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure links 653, 9000 Ghent, Belgium
| | | | | | | |
Collapse
|
32
|
The NetAge database: a compendium of networks for longevity, age-related diseases and associated processes. Biogerontology 2010; 11:513-22. [DOI: 10.1007/s10522-010-9265-8] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2010] [Accepted: 02/09/2010] [Indexed: 12/11/2022]
|
33
|
Mi H, Dong Q, Muruganujan A, Gaudet P, Lewis S, Thomas PD. PANTHER version 7: improved phylogenetic trees, orthologs and collaboration with the Gene Ontology Consortium. Nucleic Acids Res 2010; 38:D204-10. [PMID: 20015972 PMCID: PMC2808919 DOI: 10.1093/nar/gkp1019] [Citation(s) in RCA: 453] [Impact Index Per Article: 32.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2009] [Revised: 10/16/2009] [Accepted: 10/19/2009] [Indexed: 11/12/2022] Open
Abstract
Protein Analysis THrough Evolutionary Relationships (PANTHER) is a comprehensive software system for inferring the functions of genes based on their evolutionary relationships. Phylogenetic trees of gene families form the basis for PANTHER and these trees are annotated with ontology terms describing the evolution of gene function from ancestral to modern day genes. One of the main applications of PANTHER is in accurate prediction of the functions of uncharacterized genes, based on their evolutionary relationships to genes with functions known from experiment. The PANTHER website, freely available at http://www.pantherdb.org, also includes software tools for analyzing genomic data relative to known and inferred gene functions. Since 2007, there have been several new developments to PANTHER: (i) improved phylogenetic trees, explicitly representing speciation and gene duplication events, (ii) identification of gene orthologs, including least diverged orthologs (best one-to-one pairs), (iii) coverage of more genomes (48 genomes, up to 87% of genes in each genome; see http://www.pantherdb.org/panther/summaryStats.jsp), (iv) improved support for alternative database identifiers for genes, proteins and microarray probes and (v) adoption of the SBGN standard for display of biological pathways. In addition, PANTHER trees are being annotated with gene function as part of the Gene Ontology Reference Genome project, resulting in an increasing number of curated functional annotations.
Collapse
Affiliation(s)
- Huaiyu Mi
- Evolutionary Systems Biology Group, SRI International, dictyBase, Northwestern University and Berkeley Bioinformatics and Open-source Projects (BBOP), Lawrence Berkeley National Laboratory, USA
| | - Qing Dong
- Evolutionary Systems Biology Group, SRI International, dictyBase, Northwestern University and Berkeley Bioinformatics and Open-source Projects (BBOP), Lawrence Berkeley National Laboratory, USA
| | - Anushya Muruganujan
- Evolutionary Systems Biology Group, SRI International, dictyBase, Northwestern University and Berkeley Bioinformatics and Open-source Projects (BBOP), Lawrence Berkeley National Laboratory, USA
| | - Pascale Gaudet
- Evolutionary Systems Biology Group, SRI International, dictyBase, Northwestern University and Berkeley Bioinformatics and Open-source Projects (BBOP), Lawrence Berkeley National Laboratory, USA
| | - Suzanna Lewis
- Evolutionary Systems Biology Group, SRI International, dictyBase, Northwestern University and Berkeley Bioinformatics and Open-source Projects (BBOP), Lawrence Berkeley National Laboratory, USA
| | - Paul D. Thomas
- Evolutionary Systems Biology Group, SRI International, dictyBase, Northwestern University and Berkeley Bioinformatics and Open-source Projects (BBOP), Lawrence Berkeley National Laboratory, USA
| |
Collapse
|
34
|
Mello LV, O'Meara H, Rigden DJ, Paterson S. Identification of novel aspartic proteases from Strongyloides ratti and characterisation of their evolutionary relationships, stage-specific expression and molecular structure. BMC Genomics 2009; 10:611. [PMID: 20015380 PMCID: PMC2805697 DOI: 10.1186/1471-2164-10-611] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2009] [Accepted: 12/16/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Aspartic proteases are known to play an important role in the biology of nematode parasitism. This role is best characterised in blood-feeding nematodes, where they digest haemoglobin, but they are also likely to play important roles in the biology of nematode parasites that do not feed on blood. In the present work, we investigate the evolution and expression of aspartic proteases in Strongyloides ratti, which permits a unique comparison between parasitic and free-living adult forms within its life-cycle. RESULTS We identified eight transcribed aspartic protease sequences and a further two genomic sequences and compared these to homologues in Caenorhabditis elegans and other nematode species. Phylogenetic analysis demonstrated a complex pattern of gene evolution, such that some S. ratti sequences had a one-to-one correspondence with orthologues of C. elegans but that lineage-specific expansions have occurred for other aspartic proteases in these two nematodes. These gene duplication events may have contributed to the adaptation of the two species to their different lifestyles. Among the set of S. ratti aspartic proteases were two closely-related isoforms that showed differential expression during different life stages: ASP-2A is highly expressed in parasitic females while ASP-2B is predominantly found in free-living adults. Molecular modelling of the ASP-2 isoforms reveals that their substrate specificities are likely to be very similar, but that ASP-2B is more electrostatically negative over its entire molecular surface than ASP-2A. This characteristic may be related to different pH values of the environments in which these two isoforms operate. CONCLUSIONS We have demonstrated that S. ratti provides a powerful model to explore the genetic adaptations associated with parasitic versus free-living life-styles. We have discovered gene duplication of aspartic protease genes in Strongyloides and identified a pair of paralogues differentially expressed in either the parasitic or the free-living phase of the nematode life-cycle, consistent with an adaptive role for aspartic proteases in the evolution of nematode parasitism.
Collapse
Affiliation(s)
- Luciane V Mello
- School of Biological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | - Helen O'Meara
- School of Biological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
- Department of Pharmacology and Therapeutics, University of Liverpool, Ashton Street, Liverpool, L69 3GE, UK
| | - Daniel J Rigden
- School of Biological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | - Steve Paterson
- School of Biological Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| |
Collapse
|
35
|
Yamazaki Y, Akashi R, Banno Y, Endo T, Ezura H, Fukami-Kobayashi K, Inaba K, Isa T, Kamei K, Kasai F, Kobayashi M, Kurata N, Kusaba M, Matuzawa T, Mitani S, Nakamura T, Nakamura Y, Nakatsuji N, Naruse K, Niki H, Nitasaka E, Obata Y, Okamoto H, Okuma M, Sato K, Serikawa T, Shiroishi T, Sugawara H, Urushibara H, Yamamoto M, Yaoita Y, Yoshiki A, Kohara Y. NBRP databases: databases of biological resources in Japan. Nucleic Acids Res 2009; 38:D26-32. [PMID: 19934255 PMCID: PMC2808968 DOI: 10.1093/nar/gkp996] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The National BioResource Project (NBRP) is a Japanese project that aims to establish a system for collecting, preserving and providing bioresources for use as experimental materials for life science research. It is promoted by 27 core resource facilities, each concerned with a particular group of organisms, and by one information center. The NBRP database is a product of this project. Thirty databases and an integrated database-retrieval system (BioResource World: BRW) have been created and made available through the NBRP home page (http://www.nbrp.jp). The 30 independent databases have individual features which directly reflect the data maintained by each resource facility. The BRW is designed for users who need to search across several resources without moving from one database to another. BRW provides access to a collection of 4.5-million records on bioresources including wild species, inbred lines, mutants, genetically engineered lines, DNA clones and so on. BRW supports summary browsing, keyword searching, and searching by DNA sequences or gene ontology. The results of searches provide links to online requests for distribution of research materials. A circulation system allows users to submit details of papers published on research conducted using NBRP resources.
Collapse
Affiliation(s)
- Yukiko Yamazaki
- National Institute of Genetics, University of Miyazaki, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Gouw JW, Krijgsveld J, Heck AJR. Quantitative proteomics by metabolic labeling of model organisms. Mol Cell Proteomics 2009; 9:11-24. [PMID: 19955089 DOI: 10.1074/mcp.r900001-mcp200] [Citation(s) in RCA: 119] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
In the biological sciences, model organisms have been used for many decades and have enabled the gathering of a large proportion of our present day knowledge of basic biological processes and their derailments in disease. Although in many of these studies using model organisms, the focus has primarily been on genetics and genomics approaches, it is important that methods become available to extend this to the relevant protein level. Mass spectrometry-based proteomics is increasingly becoming the standard to comprehensively analyze proteomes. An important transition has been made recently by moving from charting static proteomes to monitoring their dynamics by simultaneously quantifying multiple proteins obtained from differently treated samples. Especially the labeling with stable isotopes has proved an effective means to accurately determine differential expression levels of proteins. Among these, metabolic incorporation of stable isotopes in vivo in whole organisms is one of the favored strategies. In this perspective, we will focus on methodologies to stable isotope label a variety of model organisms in vivo, ranging from relatively simple organisms such as bacteria and yeast to Caenorhabditis elegans, Drosophila, and Arabidopsis up to mammals such as rats and mice. We also summarize how this has opened up ways to investigate biological processes at the protein level in health and disease, revealing conservation and variation across the evolutionary tree of life.
Collapse
Affiliation(s)
- Joost W Gouw
- Biomolecular Mass Spectrometry and Proteomics Group, Bijvoet Center for Biomolecular Research, and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Netherlands Proteomics Centre, 3584CH Utrecht, The Netherlands
| | | | | |
Collapse
|
37
|
Hutter H, Ng MP, Chen N. GExplore: a web server for integrated queries of protein domains, gene expression and mutant phenotypes. BMC Genomics 2009; 10:529. [PMID: 19917126 PMCID: PMC2779824 DOI: 10.1186/1471-2164-10-529] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2009] [Accepted: 11/16/2009] [Indexed: 01/17/2023] Open
Abstract
Background The majority of the genes even in well-studied multi-cellular model organisms have not been functionally characterized yet. Mining the numerous genome wide data sets related to protein function to retrieve potential candidate genes for a particular biological process remains a challenge. Description GExplore has been developed to provide a user-friendly database interface for data mining at the gene expression/protein function level to help in hypothesis development and experiment design. It supports combinatorial searches for proteins with certain domains, tissue- or developmental stage-specific expression patterns, and mutant phenotypes. GExplore operates on a stand-alone database and has fast response times, which is essential for exploratory searches. The interface is not only user-friendly, but also modular so that it accommodates additional data sets in the future. Conclusion GExplore is an online database for quick mining of data related to gene and protein function, providing a multi-gene display of data sets related to the domain composition of proteins as well as expression and phenotype data. GExplore is publicly available at: http://genome.sfu.ca/gexplore/
Collapse
Affiliation(s)
- Harald Hutter
- Department of Biological Sciences, Simon Fraser University, Burnaby, Canada.
| | | | | |
Collapse
|
38
|
Abstract
More than two billion people (one-third of humanity) are infected with parasitic roundworms or flatworms, collectively known as helminth parasites. These infections cause diseases that are responsible for enormous levels of morbidity and mortality, delays in the physical development of children, loss of productivity among the workforce, and maintenance of poverty. Genomes of the major helminth species that affect humans, and many others of agricultural and veterinary significance, are now the subject of intensive genome sequencing and annotation. Draft genome sequences of the filarial worm Brugia malayi and two of the human schistosomes, Schistosoma japonicum and S. mansoni, are now available, among others. These genome data will provide the basis for a comprehensive understanding of the molecular mechanisms involved in helminth nutrition and metabolism, host-dependent development and maturation, immune evasion, and evolution. They are likely also to predict new potential vaccine candidates and drug targets. In this review, we present an overview of these efforts and emphasize the potential impact and importance of these new findings.
Collapse
Affiliation(s)
- Paul J Brindley
- Department of Microbiology, Immunology, and Tropical Medicine, George Washington University Medical Center, Washington, D. C., USA.
| | | | | | | |
Collapse
|
39
|
Duan J, Li R, Cheng D, Fan W, Zha X, Cheng T, Wu Y, Wang J, Mita K, Xiang Z, Xia Q. SilkDB v2.0: a platform for silkworm (Bombyx mori ) genome biology. Nucleic Acids Res 2009; 38:D453-6. [PMID: 19793867 PMCID: PMC2808975 DOI: 10.1093/nar/gkp801] [Citation(s) in RCA: 209] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The SilkDB is an open-access database for genome biology of the silkworm (Bombyx mori). Since the draft sequence was completed and the SilkDB was first released 5 years ago, we have collaborated with other groups to make much remarkable progress on silkworm genome research, such as the completion of a new high-quality assembly of the silkworm genome sequence as well as the construction of a genome-wide microarray to survey gene expression profiles. To accommodate these new genomic data and house more comprehensive genomic information, we have reconstructed SilkDB database with new web interfaces. In the new version (v2.0) of SilkDB, we updated the genomic data, including genome assembly, gene annotation, chromosomal mapping, orthologous relationship and experiment data, such as microarray expression data, Expressed Sequence Tags (ESTs) and corresponding references. Several new tools, including SilkMap, Silkworm Chromosome Browser (SCB) and BmArray, are developed to access silkworm genomic data conveniently. SilkDB is publicly available at the new URL of http://www.silkdb.org.
Collapse
Affiliation(s)
- Jun Duan
- The Key Sericultural Laboratory of Agricultural Ministry, Southwest University, Chongqing 400716, China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
High resolution transcriptome maps for wild-type and nonsense-mediated decay-defective Caenorhabditis elegans. Genome Biol 2009; 10:R101. [PMID: 19778439 PMCID: PMC2768976 DOI: 10.1186/gb-2009-10-9-r101] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2009] [Revised: 08/11/2009] [Accepted: 09/24/2009] [Indexed: 11/12/2022] Open
Abstract
The high-resolution transcriptome of wild-type and nonsense-mediated decay (NMD) defective C. elegans during development reveals insights into the NMD pathway and it’s role in development. Background While many genome sequences are complete, transcriptomes are less well characterized. We used both genome-scale tiling arrays and massively parallel sequencing to map the Caenorhabditis elegans transcriptome across development. We utilized this framework to identify transcriptome changes in animals lacking the nonsense-mediated decay (NMD) pathway. Results We find that while the majority of detectable transcripts map to known gene structures, >5% of transcribed regions fall outside current gene annotations. We show that >40% of these are novel exons. Using both technologies to assess isoform complexity, we estimate that >17% of genes change isoform across development. Next we examined how the transcriptome is perturbed in animals lacking NMD. NMD prevents expression of truncated proteins by degrading transcripts containing premature termination codons. We find that approximately 20% of genes produce transcripts that appear to be NMD targets. While most of these arise from splicing errors, NMD targets are enriched for transcripts containing open reading frames upstream of the predicted translational start (uORFs). We identify a relationship between the Kozak consensus surrounding the true start codon and the degree to which uORF-containing transcripts are targeted by NMD and speculate that translational efficiency may be coupled to transcript turnover via the NMD pathway for some transcripts. Conclusions We generated a high-resolution transcriptome map for C. elegans and used it to identify endogenous targets of NMD. We find that these transcripts arise principally through splicing errors, strengthening the prevailing view that splicing and NMD are highly interlinked processes.
Collapse
|
41
|
FGFRL1 is a neglected putative actor of the FGF signalling pathway present in all major metazoan phyla. BMC Evol Biol 2009; 9:226. [PMID: 19740411 PMCID: PMC2754479 DOI: 10.1186/1471-2148-9-226] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2009] [Accepted: 09/09/2009] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Fibroblast Growth Factors (FGF) and their receptors are well known for having major implications in cell signalling controlling embryonic development. Recently, a gene coding for a protein closely related to FGFRs (Fibroblast Growth Factor Receptors) called FGFR5 or FGFR-like 1 (FGFRL1), has been described in vertebrates. An orthologous gene was also found in the cephalochordate amphioxus, but no orthologous genes were found by the authors in other non-vertebrate species, even if a FGFRL1 gene was identified in the sea urchin genome, as well as a closely related gene, named nou-darake, in the planarian Dugesia japonica. These intriguing data of a deuterostome-specific gene that might be implicated in FGF signalling prompted us to search for putative FGFRL1 orthologues in the completely sequenced genomes of metazoans. RESULTS We found FGFRL1 genes in the cnidarian Nematostella vectensis as well as in many bilaterian species. Our analysis also shows that FGFRL1 orthologous genes are linked in the genome with other members of the FGF signalling pathway from cnidarians to bilaterians (distance < 10 Mb). To better understand the implication of FGFRL1 genes in chordate embryonic development, we have analyzed expression patterns of the amphioxus and the mouse genes by whole mount in situ hybridization. We show that some homologous expression territories can be defined, and we propose that FGFRL1 and FGF8/17/18 were already co-expressed in the pharyngeal endoderm in the ancestor of chordates. CONCLUSION Our work sheds light on the existence of a putative FGF signalling pathway actor present in the ancestor of probably all metazoans, the function of which has received little attention until now.
Collapse
|
42
|
Two distinct roles for EGL-9 in the regulation of HIF-1-mediated gene expression in Caenorhabditis elegans. Genetics 2009; 183:821-9. [PMID: 19737748 DOI: 10.1534/genetics.109.107284] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Oxygen is critically important to metazoan life, and the EGL-9/PHD enzymes are key regulators of hypoxia (low oxygen) response. When oxygen levels are high, the EGL-9/PHD proteins hydroxylate hypoxia-inducible factor (HIF) transcription factors. Once hydroxylated, HIFalpha subunits bind to von Hippel-Lindau (VHL) E3 ligases and are degraded. Prior genetic analyses in Caenorhabditis elegans had shown that EGL-9 also acted through a vhl-1-independent pathway to inhibit HIF-1 transcriptional activity. Here, we characterize this novel EGL-9 function. We employ an array of complementary methods to inhibit EGL-9 hydroxylase activity in vivo. These include hypoxia, hydroxylase inhibitors, mutation of the proline in HIF-1 that is normally modified by EGL-9, and mutation of the EGL-9 catalytic core. Remarkably, we find that each of these treatments or mutations eliminates oxygen-dependent degradation of HIF-1 protein, but none of them abolishes EGL-9-mediated repression of HIF-1 transcriptional activity. Further, analyses of new egl-9 alleles reveal that the evolutionarily conserved EGL-9 MYND zinc finger domain does not have a major role in HIF-1 regulation. We conclude that C. elegans EGL-9 is a bifunctional protein. In addition to its well-established role as the oxygen sensor that regulates HIF-1 protein levels, EGL-9 inhibits HIF-1 transcriptional activity via a pathway that has little or no requirement for hydroxylase activity or for the EGL-9 MYND domain.
Collapse
|
43
|
Vizcaíno JA, Côté R, Reisinger F, M. Foster J, Mueller M, Rameseder J, Hermjakob H, Martens L. A guide to the Proteomics Identifications Database proteomics data repository. Proteomics 2009; 9:4276-83. [PMID: 19662629 PMCID: PMC2970915 DOI: 10.1002/pmic.200900402] [Citation(s) in RCA: 207] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2009] [Revised: 06/24/2009] [Accepted: 06/25/2009] [Indexed: 01/02/2023]
Abstract
The Proteomics Identifications Database (PRIDE, www.ebi.ac.uk/pride) is one of the main repositories of MS derived proteomics data. Here, we point out the main functionalities of PRIDE both as a submission repository and as a source for proteomics data. We describe the main features for data retrieval and visualization available through the PRIDE web and BioMart interfaces. We also highlight the mechanism by which tailored queries in the BioMart can join PRIDE to other resources such as Reactome, Ensembl or UniProt to execute extremely powerful across-domain queries. We then present the latest improvements in the PRIDE submission process, using the new easy-to-use, platform-independent graphical user interface submission tool PRIDE Converter. Finally, we speak about future plans and the role of PRIDE in the ProteomExchange consortium.
Collapse
Affiliation(s)
- Juan Antonio Vizcaíno
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Richard Côté
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Florian Reisinger
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Joseph M. Foster
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Michael Mueller
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Jonathan Rameseder
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
- Computational and Systems Biology Initiative, Massachusetts Institute of TechnologyCambridge, MA, USA
| | - Henning Hermjakob
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| | - Lennart Martens
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome CampusHinxton, Cambridge, UK
| |
Collapse
|
44
|
Berriz GF, Beaver JE, Cenik C, Tasan M, Roth FP. Next generation software for functional trend analysis. Bioinformatics 2009; 25:3043-4. [PMID: 19717575 DOI: 10.1093/bioinformatics/btp498] [Citation(s) in RCA: 200] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
UNLABELLED FuncAssociate is a web application that discovers properties enriched in lists of genes or proteins that emerge from large-scale experimentation. Here we describe an updated application with a new interface and several new features. For example, enrichment analysis can now be performed within multiple gene- and protein-naming systems. This feature avoids potentially serious translation artifacts to which other enrichment analysis strategies are subject. AVAILABILITY The FuncAssociate web application is freely available to all users at http://llama.med.harvard.edu/funcassociate.
Collapse
Affiliation(s)
- Gabriel F Berriz
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 250 Longwood Avenue and Center for Cancer Systems Biology, Dana Farber Cancer Institute, 44 Binney Street, Boston, MA 02115, USA
| | | | | | | | | |
Collapse
|
45
|
Gilchrist MJ, Christensen MB, Bronchain O, Brunet F, Chesneau A, Fenger U, Geach TJ, Ironfield HV, Kaya F, Kricha S, Lea R, Massé K, Néant I, Paillard E, Parain K, Perron M, Sinzelle L, Souopgui J, Thuret R, Ymlahi-Ouazzani Q, Pollet N. Database of queryable gene expression patterns for Xenopus. Dev Dyn 2009; 238:1379-88. [PMID: 19347954 DOI: 10.1002/dvdy.21940] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
The precise localization of gene expression within the developing embryo, and how it changes over time, is one of the most important sources of information for elucidating gene function. As a searchable resource, this information has up until now been largely inaccessible to the Xenopus community. Here, we present a new database of Xenopus gene expression patterns, queryable by specific location or region in the embryo. Pattern matching can be driven either from an existing in situ image, or from a user-defined pattern based on development stage schematic diagrams. The data are derived from the work of a group of 21 Xenopus researchers over a period of 4 days. We used a novel, rapid manual annotation tool, XenMARK, which exploits the ability of the human brain to make the necessary distortions in transferring data from the in situ images to the standard schematic geometry. Developmental Dynamics 238:1379-1388, 2009. (c) 2009 Wiley-Liss, Inc.
Collapse
|
46
|
Saito TL, Yoshimura J, Sasaki S, Ahsan B, Sasaki A, Kuroshu R, Morishita S. UTGB toolkit for personalized genome browsers. Bioinformatics 2009; 25:1856-61. [PMID: 19497937 PMCID: PMC2712345 DOI: 10.1093/bioinformatics/btp350] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2009] [Revised: 05/27/2009] [Accepted: 05/30/2009] [Indexed: 11/12/2022] Open
Abstract
UNLABELLED The advent of high-throughput DNA sequencers has increased the pace of collecting enormous amounts of genomic information, yielding billions of nucleotides on a weekly basis. This advance represents an improvement of two orders of magnitude over traditional Sanger sequencers in terms of the number of nucleotides per unit time, allowing even small groups of researchers to obtain huge volumes of genomic data over fairly short period. Consequently, a pressing need exists for the development of personalized genome browsers for analyzing these immense amounts of locally stored data. The UTGB (University of Tokyo Genome Browser) Toolkit is designed to meet three major requirements for personalization of genome browsers: easy installation of the system with minimum efforts, browsing locally stored data and rapid interactive design of web interfaces tailored to individual needs. The UTGB Toolkit is licensed under an open source license. AVAILABILITY The software is freely available at http://utgenome.org/.
Collapse
Affiliation(s)
- Taro L Saito
- Department of Computational Biology, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa City, Chiba 277-0882, Japan
| | | | | | | | | | | | | |
Collapse
|
47
|
Durinck S, Spellman PT, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat Protoc 2009; 4:1184-91. [PMID: 19617889 PMCID: PMC3159387 DOI: 10.1038/nprot.2009.97] [Citation(s) in RCA: 2296] [Impact Index Per Article: 153.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Genomic experiments produce multiple views of biological systems, among them are DNA sequence and copy number variation, and mRNA and protein abundance. Understanding these systems needs integrated bioinformatic analysis. Public databases such as Ensembl provide relationships and mappings between the relevant sets of probe and target molecules. However, the relationships can be biologically complex and the content of the databases is dynamic. We demonstrate how to use the computational environment R to integrate and jointly analyze experimental datasets, employing BioMart web services to provide the molecule mappings. We also discuss typical problems that are encountered in making gene-to-transcript-to-protein mappings. The approach provides a flexible, programmable and reproducible basis for state-of-the-art bioinformatic data integration.
Collapse
Affiliation(s)
- Steffen Durinck
- Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Paul T. Spellman
- Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Ewan Birney
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Wolfgang Huber
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
48
|
Van Auken K, Jaffery J, Chan J, Müller HM, Sternberg PW. Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation. BMC Bioinformatics 2009; 10:228. [PMID: 19622167 PMCID: PMC2719631 DOI: 10.1186/1471-2105-10-228] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2009] [Accepted: 07/21/2009] [Indexed: 11/28/2022] Open
Abstract
Background Manual curation of experimental data from the biomedical literature is an expensive and time-consuming endeavor. Nevertheless, most biological knowledge bases still rely heavily on manual curation for data extraction and entry. Text mining software that can semi- or fully automate information retrieval from the literature would thus provide a significant boost to manual curation efforts. Results We employ the Textpresso category-based information retrieval and extraction system , developed by WormBase to explore how Textpresso might improve the efficiency with which we manually curate C. elegans proteins to the Gene Ontology's Cellular Component Ontology. Using a training set of sentences that describe results of localization experiments in the published literature, we generated three new curation task-specific categories (Cellular Components, Assay Terms, and Verbs) containing words and phrases associated with reports of experimentally determined subcellular localization. We compared the results of manual curation to that of Textpresso queries that searched the full text of articles for sentences containing terms from each of the three new categories plus the name of a previously uncurated C. elegans protein, and found that Textpresso searches identified curatable papers with recall and precision rates of 79.1% and 61.8%, respectively (F-score of 69.5%), when compared to manual curation. Within those documents, Textpresso identified relevant sentences with recall and precision rates of 30.3% and 80.1% (F-score of 44.0%). From returned sentences, curators were able to make 66.2% of all possible experimentally supported GO Cellular Component annotations with 97.3% precision (F-score of 78.8%). Measuring the relative efficiencies of Textpresso-based versus manual curation we find that Textpresso has the potential to increase curation efficiency by at least 8-fold, and perhaps as much as 15-fold, given differences in individual curatorial speed. Conclusion Textpresso is an effective tool for improving the efficiency of manual, experimentally based curation. Incorporating a Textpresso-based Cellular Component curation pipeline at WormBase has allowed us to transition from strictly manual curation of this data type to a more efficient pipeline of computer-assisted validation. Continued development of curation task-specific Textpresso categories will provide an invaluable resource for genomics databases that rely heavily on manual curation.
Collapse
Affiliation(s)
- Kimberly Van Auken
- Division of Biology, California Institute of Technology, Pasadena, CA 91125, USA.
| | | | | | | | | |
Collapse
|
49
|
Vermeirssen V, Joshi A, Michoel T, Bonnet E, Casneuf T, Van de Peer Y. Transcription regulatory networks in Caenorhabditis elegans inferred through reverse-engineering of gene expression profiles constitute biological hypotheses for metazoan development. MOLECULAR BIOSYSTEMS 2009; 5:1817-30. [PMID: 19763340 DOI: 10.1039/b908108a] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Differential gene expression governs the development, function and pathology of multicellular organisms. Transcription regulatory networks study differential gene expression at a systems level by mapping the interactions between regulatory proteins and target genes. While microarray transcription profiles are the most abundant data for gene expression, it remains challenging to correctly infer the underlying transcription regulatory networks. The reverse-engineering algorithm LeMoNe (learning module networks) uses gene expression profiles to extract ensemble transcription regulatory networks of coexpression modules and their prioritized regulators. Here we apply LeMoNe to a compendium of microarray studies of the worm Caenorhabditis elegans. We obtain 248 modules with a regulation program for 5020 genes and 426 regulators and a total of 24 012 predicted transcription regulatory interactions. Through GO enrichment analysis, comparison with the gene-gene association network WormNet and integration of other biological data, we show that LeMoNe identifies functionally coherent coexpression modules and prioritizes regulators that relate to similar biological processes as the module genes. Furthermore, we can predict new functional relationships for uncharacterized genes and regulators. Based on modules involved in molting, meiosis and oogenesis, ciliated sensory neurons and mitochondrial metabolism, we illustrate the value of LeMoNe as a biological hypothesis generator for differential gene expression in greater detail. In conclusion, through reverse-engineering of C. elegans expression data, we obtained transcription regulatory networks that can provide further insight into metazoan development.
Collapse
|
50
|
Fredman D, Engstrom PG, Lenhard B. Web-based tools and approaches to study long-range gene regulation in Metazoa. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:231-42. [DOI: 10.1093/bfgp/elp023] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
|