51
|
Menon R, Gasser RB, Mitreva M, Ranganathan S. An analysis of the transcriptome of Teladorsagia circumcincta: its biological and biotechnological implications. BMC Genomics 2012; 13 Suppl 7:S10. [PMID: 23282110 PMCID: PMC3521389 DOI: 10.1186/1471-2164-13-s7-s10] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Teladorsagia circumcincta (order Strongylida) is an economically important parasitic nematode of small ruminants (including sheep and goats) in temperate climatic regions of the world. Improved insights into the molecular biology of this parasite could underpin alternative methods required to control this and related parasites, in order to circumvent major problems associated with anthelmintic resistance. The aims of the present study were to define the transcriptome of the adult stage of T. circumcincta and to infer the main pathways linked to molecules known to be expressed in this nematode. Since sheep develop acquired immunity against T. circumcincta, there is some potential for the development of a vaccine against this parasite. Hence, we infer excretory/secretory molecules for T. circumcincta as possible immunogens and vaccine candidates. RESULTS A total of 407,357 ESTs were assembled yielding 39,852 putative gene sequences. Conceptual translation predicted 24,013 proteins, which were then subjected to detailed annotation which included pathway mapping of predicted proteins (including 112 excreted/secreted [ES] and 226 transmembrane peptides), domain analysis and GO annotation was carried out using InterProScan along with BLAST2GO. Further analysis was carried out for secretory signal peptides using SignalP and non-classical sec pathway using SecretomeP tools. For ES proteins, key pathways, including Fc epsilon RI, T cell receptor, and chemokine signalling as well as leukocyte transendothelial migration were inferred to be linked to immune responses, along with other pathways related to neurodegenerative diseases and infectious diseases, which warrant detailed future studies. KAAS could identify new and updated pathways like phagosome and protein processing in endoplasmic reticulum. Domain analysis for the assembled dataset revealed families of serine, cysteine and proteinase inhibitors which might represent targets for parasite intervention. InterProScan could identify GO terms pertaining to the extracellular region. Some of the important domain families identified included the SCP-like extracellular proteins which belong to the pathogenesis-related proteins (PRPs) superfamily along with C-type lectin, saposin-like proteins. The 'extracellular region' that corresponds to allergen V5/Tpx-1 related, considered important in parasite-host interactions, was also identified. Six cysteine motif (SXC1) proteins, transthyretin proteins, C-type lectins, activation-associated secreted proteins (ASPs), which could represent potential candidates for developing novel anthelmintics or vaccines were few other important findings. Of these, SXC1, protein kinase domain-containing protein, trypsin family protein, trypsin-like protease family member (TRY-1), putative major allergen and putative lipid binding protein were identified which have not been reported in the published T. circumcincta proteomics analysis. Detailed analysis of 6,058 raw EST sequences from dbEST revealed 315 putatively secreted proteins. Amongst them, C-type single domain activation associated secreted protein ASP3 precursor, activation-associated secreted proteins (ASP-like protein), cathepsin B-like cysteine protease, cathepsin L cysteine protease, cysteine protease, TransThyretin-Related and Venom-Allergen-like proteins were the key findings. CONCLUSIONS We have annotated a large dataset ESTs of T. circumcincta and undertaken detailed comparative bioinformatics analyses. The results provide a comprehensive insight into the molecular biology of this parasite and disease manifestation which provides potential focal point for future research. We identified a number of pathways responsible for immune response. This type of large-scale computational scanning could be coupled with proteomic and metabolomic studies of this parasite leading to novel therapeutic intervention and disease control strategies. We have also successfully affirmed the use of bioinformatics tools, for the study of ESTs, which could now serve as a benchmark for the development of new computational EST analysis pipelines.
Collapse
Affiliation(s)
- Ranjeeta Menon
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, New South Wales 2109, Australia
| | | | | | | |
Collapse
|
52
|
Baptista MS, Duarte CB, Maciel P. Role of the ubiquitin-proteasome system in nervous system function and disease: using C. elegans as a dissecting tool. Cell Mol Life Sci 2012; 69:2691-715. [PMID: 22382927 PMCID: PMC11115168 DOI: 10.1007/s00018-012-0946-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2011] [Revised: 02/13/2012] [Accepted: 02/15/2012] [Indexed: 01/12/2023]
Abstract
In addition to its central roles in protein quality control, regulation of cell cycle, intracellular signaling, DNA damage response and transcription regulation, the ubiquitin-proteasome system (UPS) plays specific roles in the nervous system, where it contributes to precise connectivity through development, and later assures functionality by regulating a wide spectrum of neuron-specific cellular processes. Aberrations in this system have been implicated in the etiology of neurodevelopmental and neurodegenerative diseases. In this review, we provide an updated view on the UPS and highlight recent findings concerning its role in normal and diseased nervous systems. We discuss the advantages of the model organism Caenorhabditis elegans as a tool to unravel the major unsolved questions concerning this biochemical pathway and its involvement in nervous system function and dysfunction, and expose the new possibilities, using state-of-the-art techniques, to assess UPS function using this model system.
Collapse
Affiliation(s)
- Márcio S Baptista
- Life and Health Sciences Research Institute (ICVS), School of Health Sciences, University of Minho, Campus de Gualtar, 4710-057 Braga, Portugal.
| | | | | |
Collapse
|
53
|
Shaye DD, Greenwald I. OrthoList: a compendium of C. elegans genes with human orthologs. PLoS One 2011; 6:e20085. [PMID: 21647448 PMCID: PMC3102077 DOI: 10.1371/journal.pone.0020085] [Citation(s) in RCA: 313] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2011] [Accepted: 04/18/2011] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND C. elegans is an important model for genetic studies relevant to human biology and disease. We sought to assess the orthology between C. elegans and human genes to understand better the relationship between their genomes and to generate a compelling list of candidates to streamline RNAi-based screens in this model. RESULTS We performed a meta-analysis of results from four orthology prediction programs and generated a compendium, "OrthoList", containing 7,663 C. elegans protein-coding genes. Various assessments indicate that OrthoList has extensive coverage with low false-positive and false-negative rates. Part of this evaluation examined the conservation of components of the receptor tyrosine kinase, Notch, Wnt, TGF-ß and insulin signaling pathways, and led us to update compendia of conserved C. elegans kinases, nuclear hormone receptors, F-box proteins, and transcription factors. Comparison with two published genome-wide RNAi screens indicated that virtually all of the conserved hits would have been obtained had just the OrthoList set (∼38% of the genome) been targeted. We compiled Ortholist by InterPro domains and Gene Ontology annotation, making it easy to identify C. elegans orthologs of human disease genes for potential functional analysis. CONCLUSIONS We anticipate that OrthoList will be of considerable utility to C. elegans researchers for streamlining RNAi screens, by focusing on genes with apparent human orthologs, thus reducing screening effort by ∼60%. Moreover, we find that OrthoList provides a useful basis for annotating orthology and reveals more C. elegans orthologs of human genes in various functional groups, such as transcription factors, than previously described.
Collapse
Affiliation(s)
- Daniel D. Shaye
- Howard Hughes Medical Institute, Columbia University, College of Physicians and Surgeons, New York, New York, United States of America
| | - Iva Greenwald
- Howard Hughes Medical Institute, Columbia University, College of Physicians and Surgeons, New York, New York, United States of America
- Department of Biochemistry and Molecular Biophysics, Columbia University, College of Physicians and Surgeons, New York, New York, United States of America
- Department of Genetics and Development, Columbia University, College of Physicians and Surgeons, New York, New York, United States of America
| |
Collapse
|
54
|
Pandey UB, Nichols CD. Human disease models in Drosophila melanogaster and the role of the fly in therapeutic drug discovery. Pharmacol Rev 2011; 63:411-36. [PMID: 21415126 DOI: 10.1124/pr.110.003293] [Citation(s) in RCA: 675] [Impact Index Per Article: 51.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
The common fruit fly, Drosophila melanogaster, is a well studied and highly tractable genetic model organism for understanding molecular mechanisms of human diseases. Many basic biological, physiological, and neurological properties are conserved between mammals and D. melanogaster, and nearly 75% of human disease-causing genes are believed to have a functional homolog in the fly. In the discovery process for therapeutics, traditional approaches employ high-throughput screening for small molecules that is based primarily on in vitro cell culture, enzymatic assays, or receptor binding assays. The majority of positive hits identified through these types of in vitro screens, unfortunately, are found to be ineffective and/or toxic in subsequent validation experiments in whole-animal models. New tools and platforms are needed in the discovery arena to overcome these limitations. The incorporation of D. melanogaster into the therapeutic discovery process holds tremendous promise for an enhanced rate of discovery of higher quality leads. D. melanogaster models of human diseases provide several unique features such as powerful genetics, highly conserved disease pathways, and very low comparative costs. The fly can effectively be used for low- to high-throughput drug screens as well as in target discovery. Here, we review the basic biology of the fly and discuss models of human diseases and opportunities for therapeutic discovery for central nervous system disorders, inflammatory disorders, cardiovascular disease, cancer, and diabetes. We also provide information and resources for those interested in pursuing fly models of human disease, as well as those interested in using D. melanogaster in the drug discovery process.
Collapse
Affiliation(s)
- Udai Bhan Pandey
- Department of Pharmacology and Experimental Therapeutics, Louisiana State University Health Sciences Center, 1901 Perdido St., New Orleans, LA 70112, USA
| | | |
Collapse
|
55
|
Abbas S, Wink M. Epigallocatechin gallate inhibits beta amyloid oligomerization in Caenorhabditis elegans and affects the daf-2/insulin-like signaling pathway. PHYTOMEDICINE : INTERNATIONAL JOURNAL OF PHYTOTHERAPY AND PHYTOPHARMACOLOGY 2010; 17:902-9. [PMID: 20382008 DOI: 10.1016/j.phymed.2010.03.008] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2009] [Revised: 02/04/2010] [Accepted: 03/04/2010] [Indexed: 05/19/2023]
Abstract
Epidemiological studies have repeatedly demonstrated that green tea protects against oxidative stress involved in many diseases. Health benefits of green tea are attributed to its principal active constituent, epigallocatechin gallate (EGCG). EGCG was shown to increase the stress resistance and lifespan of Caenorhabditis elegans. The mechanism of this action has been investigated in this study. The expression of hsp-16.1 and hsp-16.2 in EGCG-treated worms (N2), as quantified by real-time PCR, was significantly lower under oxidative stress induced by juglone than in controls without EGCG. In the strain TJ356 (DAF-16::GFP) EGCG treatment induced translocation of DAF-16 from the cytoplasm into the nucleus, suggesting that EGCG may affect the daf-2/insulin-like signaling pathway. EGCG decreased the formation of lipofuscin, an aging related pigment. Also, EGCG reduced beta amyloid (Abeta) deposits and inhibited Abeta oligomerization in transgenic C. elegans (CL2006). Thus, the use of green tea and EGCG is apparently rational alternatives for protecting against ROS-mediated and age-related diseases.
Collapse
Affiliation(s)
- S Abbas
- Institute of Pharmacy and Molecular Biotechnology, Department of Biology, Heidelberg University, Heidelberg, Germany
| | | |
Collapse
|
56
|
Delannoy-Normand A, Cortet J, Cabaret J, Neveu C. A suite of genes expressed during transition to parasitic lifestyle in the trichostrongylid nematode Haemonchus contortus encode potentially secreted proteins conserved in Teladorsagia circumcincta. Vet Parasitol 2010; 174:106-14. [PMID: 20843605 DOI: 10.1016/j.vetpar.2010.07.017] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2010] [Revised: 06/23/2010] [Accepted: 07/27/2010] [Indexed: 10/19/2022]
Abstract
The control of gastro-intestinal nematodes remains largely based on anthelminthic treatments, however spreading of anthelmintic resistance has reduced their efficacy. The genes involved in the transition to parasitic lifestyle could constitute targets of interest to develop alternative control strategies. In the trichostrongylid nematode Haemonchus contortus, we have used a SSH (Suppressive Subtractive Hybridization) based approach to generate two distinct subtracted cDNA libraries specifically enriched in cDNA expressed during the early parasitic fourth stage larvae L4 (5 days post-infection). A total of 200 clones were subjected to dot-blot experiments and 46 clones were selected for further characterization. The 46 corresponding expressed sequence tags (EST) were found to cluster into nine contigs. The corresponding full-length cDNA was obtained for all candidates. The genes encoding potentially secreted proteins were investigated in more detail. RT-PCR experiments confirmed their specific expression or over expression from the early L4 larvae to the adult stages and search for homologs in the trichostrongylid species T. circumcincta was performed in order to investigate whether they may be novel cross-specific targets.
Collapse
Affiliation(s)
- Alexia Delannoy-Normand
- French National Institute for Agricultural Research (INRA), UR1282 Infectiologie Animale et Santé Publique, F-37380 Nouzilly, France
| | | | | | | |
Collapse
|
57
|
Wang X, Li W, Zhao D, Liu B, Shi Y, Chen B, Yang H, Guo P, Geng X, Shang Z, Peden E, Kage-Nakadai E, Mitani S, Xue D. Caenorhabditis elegans transthyretin-like protein TTR-52 mediates recognition of apoptotic cells by the CED-1 phagocyte receptor. Nat Cell Biol 2010; 12:655-64. [PMID: 20526330 PMCID: PMC2896453 DOI: 10.1038/ncb2068] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2010] [Accepted: 04/27/2010] [Indexed: 11/09/2022]
Abstract
During apoptosis, dying cells are swiftly removed by phagocytes. It is not fully understood how apoptotic cells are recognized by phagocytes. Here we report the identification and characterization of the Caenorhabditis elegans ttr-52 gene, which encodes a transthyretin-like protein and is required for efficient cell corpse engulfment. The TTR-52 protein is expressed in, and secreted from, C. elegans endoderm and clusters around apoptotic cells. Genetic analysis indicates that TTR-52 acts in the cell corpse engulfment pathway mediated by CED-1, CED-6 and CED-7 and affects clustering of the phagocyte receptor CED-1 around apoptotic cells. TTR-52 recognizes surface-exposed phosphatidylserine (PtdSer) in vivo and binds to both PtdSer and the extracellular domain of CED-1 in vitro. TTR-52 is therefore the first bridging molecule identified in C. elegans that mediates recognition of apoptotic cells by crosslinking the PtdSer 'eat me' signal with the phagocyte receptor CED-1.
Collapse
Affiliation(s)
- Xiaochen Wang
- Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, CO 80309, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
58
|
Hao YJ, Montiel R, Abubucker S, Mitreva M, Simões N. Transcripts analysis of the entomopathogenic nematode Steinernema carpocapsae induced in vitro with insect haemolymph. Mol Biochem Parasitol 2009; 169:79-86. [PMID: 19836423 DOI: 10.1016/j.molbiopara.2009.10.002] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2009] [Revised: 08/31/2009] [Accepted: 10/07/2009] [Indexed: 01/24/2023]
Abstract
Steinernema carpocapsae is an insect parasitic nematode widely used in pest control programs. The efficacy of this nematode in controlling insects has been found to be related to the pathogenicity of the infective stage. In order to study the parasitic mechanisms exhibited by this parasite, a cDNA library of the induced S. carpocapsae parasitic phase was generated. A total of 2500 clones were sequenced and 2180 high-quality ESTs were obtained from this library. Cluster analysis generated a total of 1592 unique sequences including 1393 singletons. About 63% of the unique sequences had significant hits (e</=1e-05) to the non-redundant protein database. The remaining sequences most likely represent putative novel protein coding genes. Comparative analysis identified 377 homologs in C. elegans, 431 in C. briggsae and 75 in other nematodes. Classification of the predicted proteins revealed involvement in diverse cellular, metabolic and extracellular functions. One hundred and nineteen clusters were predicted to encode putative secreted proteins such as proteases, proteases inhibitors, lectins, saposin-like proteins, acetylcholinesterase, anti-oxidants, and heat-shock proteins, which can possibly have host interactions. This dataset provides a basis for genomic studies towards a better understanding of the events that occur in the parasitic process of this entomopathogenic nematode, including invasion of the insect haemocoelium, adaptations to insect innate immunity and stress responses, and production of virulence factors. The identification of key genes in the parasitic process provides useful tools for the improvement of S. carpocapsae as a biological agent.
Collapse
Affiliation(s)
- You-Jin Hao
- CIRN and Department of Biology, University of Azores, 9501-801 Ponta Delgada, Azores, Portugal.
| | | | | | | | | |
Collapse
|
59
|
Lundberg E, Olofsson A, Westermark GT, Sauer-Eriksson AE. Stability and fibril formation properties of human and fish transthyretin, and of the Escherichia coli transthyretin-related protein. FEBS J 2009; 276:1999-2011. [PMID: 19250316 DOI: 10.1111/j.1742-4658.2009.06936.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Human transthyretin (hTTR) is one of several proteins known to cause amyloid disease. Conformational changes in its native structure result in aggregation of the protein, leading to insoluble amyloid fibrils. The transthyretin (TTR)-related proteins comprise a protein family of 5-hydroxyisourate hydrolases with structural similarity to TTR. In this study, we tested the amyloidogenic properties, if any, of sea bream TTR (sbTTR) and Escherichia coli transthyretin-related protein (ecTRP), which share 52% and 30% sequence identity, respectively, with hTTR. We obtained filamentous structures from all three proteins under various conditions, but, interestingly, different structures displayed different tinctorial properties. hTTR and sbTTR formed thin, curved fibrils at low pH (pH 2-3) that bound thioflavin-T (thioflavin-T-positive) but did not stain with Congo Red (CR) (CR-negative). Aggregates formed at the slightly higher pH of 4.0-5.5 had different morphology, displaying predominantly amorphous structures. CR-positive material of hTTR was found in this material, in agreement with previous results. ecTRP remained soluble at pH 2-12 at ambient temperatures. By raising of the temperature, fibril formation could be induced at neutral pH in all three proteins. Most of these temperature-induced fibrils were thicker and straighter than the in vitro fibrils seen at low pH. In other words, the temperature-induced fibrils were more similar to fibrils seen in vivo. The melting temperature of ecTRP was 66.7 degrees C. This is approximately 30 degrees C lower than the melting temperatures of sbTTR and hTTR. Information from the crystal structures was used to identify possible explanations for the reduced thermostability of ecTRP.
Collapse
Affiliation(s)
- Erik Lundberg
- Department of Chemistry, Umeå University, Umeå, Sweden
| | | | | | | |
Collapse
|
60
|
Silverman GA, Luke CJ, Bhatia SR, Long OS, Vetica AC, Perlmutter DH, Pak SC. Modeling molecular and cellular aspects of human disease using the nematode Caenorhabditis elegans. Pediatr Res 2009; 65:10-8. [PMID: 18852689 PMCID: PMC2731241 DOI: 10.1203/pdr.0b013e31819009b0] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
As an experimental system, Caenorhabditis elegans offers a unique opportunity to interrogate in vivo the genetic and molecular functions of human disease-related genes. For example, C. elegans has provided crucial insights into fundamental biologic processes, such as cell death and cell fate determinations, as well as pathologic processes such as neurodegeneration and microbial susceptibility. The C. elegans model has several distinct advantages, including a completely sequenced genome that shares extensive homology with that of mammals, ease of cultivation and storage, a relatively short lifespan and techniques for generating null and transgenic animals. However, the ability to conduct unbiased forward and reverse genetic screens in C. elegans remains one of the most powerful experimental paradigms for discovering the biochemical pathways underlying human disease phenotypes. The identification of these pathways leads to a better understanding of the molecular interactions that perturb cellular physiology, and forms the foundation for designing mechanism-based therapies. To this end, the ability to process large numbers of isogenic animals through automated work stations suggests that C. elegans, manifesting different aspects of human disease phenotypes, will become the platform of choice for in vivo drug discovery and target validation using high-throughput/content screening technologies.
Collapse
Affiliation(s)
- Gary A Silverman
- Department of Pediatrics, Children's Hospital of Pittsburgh and Magee-Womens Hospital Research Institute, Department of Cell Biology and Physiology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania 15213, USA.
| | | | | | | | | | | | | |
Collapse
|
61
|
Improved insights into the transcriptomes of the human hookworm Necator americanus--fundamental and biotechnological implications. Biotechnol Adv 2008; 27:122-32. [PMID: 18977428 DOI: 10.1016/j.biotechadv.2008.10.002] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2008] [Revised: 10/04/2008] [Accepted: 10/06/2008] [Indexed: 12/25/2022]
Abstract
Hookworms of humans are blood-feeding parasitic nematodes of major socio-economic significance in a wide range of countries. They cause a neglected tropical disease (NTD) called "hookworm disease" (=necatoriasis and/or ancylostomiasis). Necator americanus is the most widely distributed hookworm of humans and is a leading cause of iron deficiency anaemia, which can cause physical and mental retardation and deaths in children as well as adverse maternal-foetal outcomes. Currently, there is a significant focus on the development of new approaches for the prevention and control of hookworms in humans. Technological advances are underpinning the discovery of drug and vaccine targets through insights into the molecular biology and genomics of these parasites and their relationship with the human host. In spite of the widespread socio-economic impacts of human necatoriasis, molecular datasets for N. americanus are scant, limiting progress in molecular research. The present article explores all currently available EST datasets for adult and larval stages of N. americanus using a semi-automated bioinformatic pipeline. In the current repertoire of molecules now available, some have been or are being considered as candidate vaccines against N. americanus. Among others, the most abundant sets of molecules relate to the pathogenesis-related protein (PRP) superfamily, comprising various members, such as the Ancylostoma-secreted or activation-associated proteins (ASPs) and the kunitz-type proteins, both of which are inferred to play key roles in the interplay between N. americanus and the human host. Understanding the molecular biology of these and other novel molecules discovered could have important implications for finding new ways of disrupting the pathways that they are involved in, and should facilitate the identification of new drug and vaccine targets. Also, the bioinformatic prediction of the essentiality of genes and gene products as well as molecular network connectivity of nematode-specific genes, together with sequencing by 454 technology, are likely to assist in the genomic discovery efforts in the very near future, to also underpin fundamental, molecular research of hookworms.
Collapse
|
62
|
Nagaraj SH, Gasser RB, Ranganathan S. Needles in the EST haystack: large-scale identification and analysis of excretory-secretory (ES) proteins in parasitic nematodes using expressed sequence tags (ESTs). PLoS Negl Trop Dis 2008; 2:e301. [PMID: 18820748 PMCID: PMC2553489 DOI: 10.1371/journal.pntd.0000301] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2008] [Accepted: 08/27/2008] [Indexed: 11/28/2022] Open
Abstract
Background Parasitic nematodes of humans, other animals and plants continue to impose a significant public health and economic burden worldwide, due to the diseases they cause. Promising antiparasitic drug and vaccine candidates have been discovered from excreted or secreted (ES) proteins released from the parasite and exposed to the immune system of the host. Mining the entire expressed sequence tag (EST) data available from parasitic nematodes represents an approach to discover such ES targets. Methods and Findings In this study, we predicted, using EST2Secretome, a novel, high-throughput, computational workflow system, 4,710 ES proteins from 452,134 ESTs derived from 39 different species of nematodes, parasitic in animals (including humans) or plants. In total, 2,632, 786, and 1,292 ES proteins were predicted for animal-, human-, and plant-parasitic nematodes. Subsequently, we systematically analysed ES proteins using computational methods. Of these 4,710 proteins, 2,490 (52.8%) had orthologues in Caenorhabditis elegans, whereas 621 (13.8%) appeared to be novel, currently having no significant match to any molecule available in public databases. Of the C. elegans homologues, 267 had strong “loss-of-function” phenotypes by RNA interference (RNAi) in this nematode. We could functionally classify 1,948 (41.3%) sequences using the Gene Ontology (GO) terms, establish pathway associations for 573 (12.2%) sequences using Kyoto Encyclopaedia of Genes and Genomes (KEGG), and identify protein interaction partners for 1,774 (37.6%) molecules. We also mapped 758 (16.1%) proteins to protein domains including the nematode-specific protein family “transthyretin-like” and “chromadorea ALT,” considered as vaccine candidates against filariasis in humans. Conclusions We report the large-scale analysis of ES proteins inferred from EST data for a range of parasitic nematodes. This set of ES proteins provides an inventory of known and novel members of ES proteins as a foundation for studies focused on understanding the biology of parasitic nematodes and their interactions with their hosts, as well as for the development of novel drugs or vaccines for parasite intervention and control. Excretory-secretory (ES) proteins are an important class of proteins in many organisms, spanning from bacteria to human beings, and are potential drug targets for several diseases. In this study, we first developed a software platform, EST2Secretome, comprised of carefully selected computational tools to identify and analyse ES proteins from expressed sequence tags (ESTs). By employing EST2Secretome, we analysed 4,710 ES proteins derived from 0.5 million ESTs for 39 economically important and disease-causing parasites from the phylum Nematoda. Several known and novel ES proteins that were either parasite- or nematode-specific were discovered, focussing on those that are either absent from or very divergent from similar molecules in their animal or plant hosts. In addition, we found many nematode-specific protein families of domains “transthyretin-like” and “chromadorea ALT,” considered vaccine candidates for filariasis in humans. We report numerous C. elegans homologues with loss-of-function RNAi phenotypes essential for parasite survival and therefore potential targets for parasite intervention. Overall, by developing freely available software to analyse large-scale EST data, we enabled researchers working on parasites for neglected tropical diseases to select specific genes and/or proteins to carry out directed functional assays for demystifying the molecular complexities of host–parasite interactions in a cell.
Collapse
Affiliation(s)
- Shivashankar H Nagaraj
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, New South Wales, Australia
| | | | | |
Collapse
|
63
|
Amyloidogenic properties of transthyretin-like protein (TLP) fromEscherichia coli. FEBS Lett 2008; 582:2893-8. [DOI: 10.1016/j.febslet.2008.07.025] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2008] [Revised: 07/08/2008] [Accepted: 07/15/2008] [Indexed: 11/19/2022]
|
64
|
Wasmuth J, Schmid R, Hedley A, Blaxter M. On the extent and origins of genic novelty in the phylum Nematoda. PLoS Negl Trop Dis 2008; 2:e258. [PMID: 18596977 PMCID: PMC2432500 DOI: 10.1371/journal.pntd.0000258] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2008] [Accepted: 06/09/2008] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND The phylum Nematoda is biologically diverse, including parasites of plants and animals as well as free-living taxa. Underpinning this diversity will be commensurate diversity in expressed genes, including gene sets associated specifically with evolution of parasitism. METHODS AND FINDINGS Here we have analyzed the extensive expressed sequence tag data (available for 37 nematode species, most of which are parasites) and define over 120,000 distinct putative genes from which we have derived robust protein translations. Combined with the complete proteomes of Caenorhabditis elegans and Caenorhabditis briggsae, these proteins have been grouped into 65,000 protein families that in turn contain 40,000 distinct protein domains. We have mapped the occurrence of domains and families across the Nematoda and compared the nematode data to that available for other phyla. Gene loss is common, and in particular we identify nearly 5,000 genes that may have been lost from the lineage leading to the model nematode C. elegans. We find a preponderance of novelty, including 56,000 nematode-restricted protein families and 26,000 nematode-restricted domains. Mapping of the latest time-of-origin of these new families and domains across the nematode phylogeny revealed ongoing evolution of novelty. A number of genes from parasitic species had signatures of horizontal transfer from their host organisms, and parasitic species had a greater proportion of novel, secreted proteins than did free-living ones. CONCLUSIONS These classes of genes may underpin parasitic phenotypes, and thus may be targets for development of effective control measures.
Collapse
Affiliation(s)
- James Wasmuth
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
- Program for Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - Ralf Schmid
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
- Department of Biochemistry, University of Leicester, Leicester, United Kingdom
| | - Ann Hedley
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
| | - Mark Blaxter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
- * E-mail:
| |
Collapse
|
65
|
Analysis of the transthyretin-like (TTL) gene family in Ostertagia ostertagi--comparison with other strongylid nematodes and Caenorhabditis elegans. Int J Parasitol 2008; 38:1545-56. [PMID: 18571174 DOI: 10.1016/j.ijpara.2008.04.004] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2008] [Revised: 04/07/2008] [Accepted: 04/10/2008] [Indexed: 11/21/2022]
Abstract
The transthyretin-like (ttl) gene family is one of the largest conserved nematode-specific gene families, coding for a group of proteins with significant sequence similarity to transthyretins (TTR) and transthyretin-related proteins (TRP). In the present study, we investigated the ttl family in Ostertagia ostertagi (a nematode of the abomasum of cattle). Mining of expressed sequence tag (EST) databases revealed the presence of at least 18 ttl genes in O. ostertagi (Oo-ttl), most of which are constitutively transcribed from the free-living, third larval stage onwards. The full-length cDNA of one of these genes (Oo-ttl-1) was amplified and cloned for recombinant expression. Western blot analysis using a specific antiserum showed that the native protein Oo-TTL-1 was highly present in the excretory-secretory (ES) products of adults of O. ostertagi. The protein was immunolocalized to the pseudocoelomic fluid of adult worms. A phylogenetic-bioinformatic analysis of all amino acid sequence data for TTL proteins from a range of strongylid nematodes showed that they could be divided into at least five different classes. This classification was based on conserved amino acids in the first TTL signature domain and the number and location of cysteine residues. The biological role(s) of the TTLs in nematode biology is still unclear. A theoretical three-dimensional model of Oo-TTL-1 indicated that it had a similar structure to TTRs (i.e., containing β-sheets, arranged in a β-sandwich). In contrast to TTRs, competitive binding studies using recombinant Oo-TTL-1 indicated that the protein was devoid of any hydrophobic ligand- or thyroid hormone-binding properties. Finally, combinatorial analysis by double-stranded RNA interference of five ttl genes in the free-living nematode Caenorhabditis elegans did not reveal any visible phenotypes. More information on the transcription profile and tissue distribution of TTLs in nematodes is needed to provide new insights into the biological role of this gene family.
Collapse
|
66
|
Milenkovic VM, Langmann T, Schreiber R, Kunzelmann K, Weber BHF. Molecular evolution and functional divergence of the bestrophin protein family. BMC Evol Biol 2008; 8:72. [PMID: 18307799 PMCID: PMC2292144 DOI: 10.1186/1471-2148-8-72] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2007] [Accepted: 02/28/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mutations in human bestrophin 1 are associated with at least three autosomal-dominant macular dystrophies including Best disease, adult onset vitelliform macular dystrophy and autosomal dominant vitreo-retinochoroidopathy. The protein is integral to the membrane and is likely involved in Ca2+-dependent transport of chloride ions across cellular membranes. Bestrophin 1 together with its three homologues forms a phylogenetically highly conserved family of proteins. RESULTS A bioinformatics study was performed to investigate the phylogenetic relationship among the bestrophin family members and to statistically evaluate sequence conservation and functional divergence. Phylogenetic tree assembly with all available eukaryotic bestrophin sequences suggests gene duplication events in the lineage leading to the vertebrates. A common N-terminal topology which includes four highly conserved transmembrane domains is shared by the members of the four paralogous groups of vertebrate bestrophins and has been constrained by purifying selection. Pairwise comparison shows that altered functional constraints have occurred at specific amino acid positions after phylogenetic diversification of the paralogues. Most notably, significant functional divergence was found between bestrophin 4 and the other family members, as well as between bestrophin 2 and bestrophin 3. Site-specific profiles were established by posterior probability analysis revealing significantly divergent clusters mainly in two hydrophilic loops and a region immediately adjacent to the last predicted transmembrane domain. Strikingly, codons 279 and 347 of human bestrophin 4 reveal high divergence when compared to the paralogous positions strongly indicating the functional importance of these residues for the bestrophin 4 protein. None of the functionally divergent amino acids were found to reside within obvious sequences patterns or motifs. CONCLUSION Our study highlights the molecular evolution of the bestrophin family of transmembrane proteins and indicates amino acid residues likely relevant for distinct functional properties of the paralogues. These findings may provide a starting point for further experimental verifications.
Collapse
|
67
|
Jacob J, Vanholme B, Haegeman A, Gheysen G. Four transthyretin-like genes of the migratory plant-parasitic nematode Radopholus similis: members of an extensive nematode-specific family. Gene 2007; 402:9-19. [PMID: 17765408 DOI: 10.1016/j.gene.2007.07.015] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2007] [Revised: 06/25/2007] [Accepted: 07/03/2007] [Indexed: 11/16/2022]
Abstract
Screening 1154 ESTs from the plant-parasitic nematode Radopholus similis resulted in seven tags coding for proteins holding a transthyretin-like domain (PF01060). The seven ESTs corresponded to four different genes which were cloned from a cDNA library (accession numbers AM691117, AM691118, AM691119, AM691120). Transthyretin-like genes belong to a large family, different from the transthyretin and the transthyretin-related genes with whom they share some sequence similarity at the protein level. This similarity has caused an inconsistent use of different names and abbreviations in the past. To avoid further confusion, we introduce a standardized nomenclature for this gene family, and chose to name this barely characterized gene family ttl (as for transthyretin-like). Further examination of the identified genes, named Rs-ttl-1 to -4, showed that they are expressed in both juveniles and adults, but not in young embryos. Whole mount in situ hybridization revealed a distinct spatial expression pattern for two of the genes: Rs-ttl-1 is expressed in the tissues surrounding the vulva, whereas Rs-ttl-2 is expressed in the ventral nerve cord. The deduced protein sequences contain a putative signal peptide for secretion, pointing to an extracellular function of the mature proteins. Database screens showed that the ttl family is restricted to nematodes. Moreover, a HMMER search revealed that ESTs derived from ttl genes are more abundant in parasitic nematode libraries, with a bias towards the parasitic stages. Despite their abundance in nematodes, including the extensively studied model organism Caenorhabditis elegans, the function of TTL proteins remains obscure. Our data suggest a role in the nervous system. Even without insight into their biological function, the nematode-specific nature of this gene family makes it a promising target for nematicides or RNAi mediated control strategies against parasitic nematodes.
Collapse
Affiliation(s)
- Joachim Jacob
- Department of Molecular Biotechnology, Faculty of Bioscience Engineering, Ghent University, Coupure links 653, B-9000 Ghent, Belgium
| | | | | | | |
Collapse
|
68
|
Kaletta T, Hengartner MO. Finding function in novel targets: C. elegans as a model organism. Nat Rev Drug Discov 2006; 5:387-98. [PMID: 16672925 DOI: 10.1038/nrd2031] [Citation(s) in RCA: 689] [Impact Index Per Article: 38.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Despite its apparent simplicity, the nematode worm Caenorhabditis elegans has developed into an important model for biomedical research, particularly in the functional characterization of novel drug targets that have been identified using genomics technologies. The cellular complexity and the conservation of disease pathways between C. elegans and higher organisms, together with the simplicity and cost-effectiveness of cultivation, make for an effective in vivo model that is amenable to whole-organism high-throughput compound screens and large-scale target validation. This review describes how C. elegans models can be used to advance our understanding of the molecular mechanisms of drug action and disease pathogenesis.
Collapse
|
69
|
Olsen A, Vantipalli MC, Lithgow GJ. Using Caenorhabditis elegans as a model for aging and age-related diseases. Ann N Y Acad Sci 2006; 1067:120-8. [PMID: 16803977 DOI: 10.1196/annals.1354.015] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
During the last three decades the soil nematode C. elegans has become a prominent model organism for studying aging. Initially research in the C. elegans aging field was focused on the genetics of aging and single gene mutations that dramatically increased the life span of the worm. Undoubtedly, the existence of such mutations is one of the main reasons for the popularity of the worm as model system for studying aging. However, today many different approaches are being used in the C. elegans aging field in addition to genetic manipulations that influence life span. For example, environmental manipulations such as caloric restriction and hormetic treatments, evolutionary studies, population studies, models of age-related diseases, and drug screening for compounds that extend life span are now being investigated using this nematode. This review will focus on the most recent developments in C. elegans aging research with the aim of illustrating the diversity of the field.
Collapse
|
70
|
Tamas I, Hodges E, Dessi P, Johnsen R, Vaz Gomes A. A combined approach exploring gene function based on worm-human orthology. BMC Genomics 2005; 6:65. [PMID: 15877817 PMCID: PMC1112593 DOI: 10.1186/1471-2164-6-65] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2004] [Accepted: 05/06/2005] [Indexed: 01/13/2023] Open
Abstract
Background Many aspects of the nematode Caenorhabditis elegans biology are conserved between invertebrates and vertebrates establishing this particular organism as an excellent genetic model. Because of its small size, large populations and self-fertilization of the hermaphrodite, functional predictions carried out by genetic modifications as well as RNAi screens, can be rapidly tested. Results In order to explore the function of a set of C. elegans genes of unknown function, as well as their potential functional roles in the human genome, we performed a phylogenetic analysis to select the most probable worm orthologs. A total of 13 C. elegans genes were subjected to down- regulation via RNAi and characterization of expression profiles using GFP strains. Previously unknown distinct expression patterns were observed for four of the analyzed genes, as well as four visible RNAi phenotypes. In addition, subcellular protein over-expression profiles of the human orthologs for seven out of the thirteen genes using human cells were also analyzed. Conclusion By combining a whole-organism approach using C. elegans with complementary experimental work done on human cell lines, this analysis extends currently available information on the selected set of genes.
Collapse
Affiliation(s)
- Ivica Tamas
- Department of Molecular Biology and Functional Genomics, Stockholm University, Sweden
| | - Emily Hodges
- Center for Genomics and Bioinformatics, Karolinska Institute, Stockholm, Sweden
| | | | - Robert Johnsen
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, Canada
| | - Ana Vaz Gomes
- Center for Genomics and Bioinformatics, Karolinska Institute, Stockholm, Sweden
| |
Collapse
|
71
|
Krämer F, Stöhr H, Weber BHF. Cloning and characterization of the murine Vmd2 RFP-TM gene family. Cytogenet Genome Res 2004; 105:107-14. [PMID: 15218265 DOI: 10.1159/000078016] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2003] [Accepted: 12/12/2003] [Indexed: 11/19/2022] Open
Abstract
Mutations in the human vitelliform macular dystrophy type 2 (VMD2) gene are known to cause autosomal dominant Best macular dystrophy (BMD), a degenerative disorder of the central retina. VMD2, together with VMD2L1, VMD2L2 and VMD2L3, belong to a closely related gene family characterized by several transmembrane (TM) spanning helical domains and an invariant arginine, phenylalanine and proline (RFP) tripeptide motif, thus termed VMD2 RFP-TM. The four genes are thought to encode a novel family of anion channels. We now report the cloning and characterization of the murine orthologs by combining biocomputational analyses and molecular genetic approaches. While the murine Vmd2, Vmd2l1 and Vmd2l3 genes are functional, murine Vmd2l2p was found to be a non-transcribed pseudogene. Expression profiling of the murine Vmd2 RFP-TM family members revealed tissue-restricted expression with predominant transcription of Vmd2 in testis, of Vmd2l1 in colon and of Vmd2l3 in heart. Differential splicing was observed for Vmd2l3 in a number of tissues (e.g. in brain, retina/RPE, kidney) although the functional importance of the splice variants remains to be determined.
Collapse
Affiliation(s)
- F Krämer
- Institut für Humangenetik, Julius-Maximilians-Universität Würzburg, Würzburg, Germany
| | | | | |
Collapse
|
72
|
Hazkani-Covo E, Levanon EY, Rotman G, Graur D, Novik A. Evolution of multicellularity in Metazoa: comparative analysis of the subcellular localization of proteins in Saccharomyces, Drosophila and Caenorhabditis. Cell Biol Int 2004; 28:171-8. [PMID: 14984742 DOI: 10.1016/j.cellbi.2003.11.016] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2003] [Revised: 10/21/2003] [Accepted: 11/20/2003] [Indexed: 10/26/2022]
Abstract
A comparison of the subcellular assignments of proteins between the unicellular Saccharomyces cerevisiae and the multicellular Drosophila melanogaster and Caenorhabditis elegans was performed using a computational tool for the prediction of subcellular localization. Nine subcellular compartments were studied: (1) extracellular domain, (2) cell membrane, (3) cytoplasm, (4) endoplasmic reticulum, (5) Golgi apparatus, (6) lysosome, (7) peroxisome, (8) mitochondria, and (9) nucleus. The transition to multicellularity was found to be characterized by an increase in the total number of proteins encoded by the genome. Interestingly, this increase is distributed unevenly among the subcellular compartments. That is, a disproportionate increase in the number of proteins in the extracellular domain, the cell membrane, and the cytoplasm is observed in multicellular organisms, while no such increase is seen in other subcellular compartments. A possible explanation involves signal transduction. In terms of protein numbers, signal transduction pathways may be roughly described as a pyramid with an expansive base in the extracellular domain (the numerous extracellular signal proteins), progressively narrowing at the cell membrane and cytoplasmic levels, and ending in a narrow tip consisting of only a handful of transcription modulators in the nucleus. Our observations suggest that extracellular signaling interactions among metazoan cells account for the uneven increase in the numbers of proteins among subcellular compartments during the transition to multicellularity.
Collapse
Affiliation(s)
- Einat Hazkani-Covo
- Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel
| | | | | | | | | |
Collapse
|
73
|
Götz J, Streffer JR, David D, Schild A, Hoerndli F, Pennanen L, Kurosinski P, Chen F. Transgenic animal models of Alzheimer's disease and related disorders: histopathology, behavior and therapy. Mol Psychiatry 2004; 9:664-83. [PMID: 15052274 DOI: 10.1038/sj.mp.4001508] [Citation(s) in RCA: 199] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Alzheimer's disease (AD) is a devastating neurodegenerative disease that affects more than 15 million people worldwide. Within the next generation, these numbers will more than double. To assist in the elucidation of pathogenic mechanisms of AD and related disorders, such as frontotemporal dementia (FTDP-17), genetically modified mice, flies, fish and worms were developed, which reproduce aspects of the human histopathology, such as beta-amyloid-containing plaques and tau-containing neurofibrillary tangles (NFT). In mice, the tau pathology caused selective behavioral impairment, depending on the distribution of the tau aggregates in the brain. Beta-amyloid induced an increase in the numbers of NFT, whereas the opposite was not observed in mice. In beta-amyloid-producing transgenic mice, memory impairment was associated with increased levels of beta-amyloid. Active and passive beta-amyloid-directed immunization caused the removal of beta-amyloid plaques and restored memory functions. These findings have since been translated to human therapy. This review aims to discuss the suitability and limitations of the various animal models and their contribution to an understanding of the pathophysiology of AD and related disorders.
Collapse
Affiliation(s)
- J Götz
- Division of Psychiatry Research, University of Zürich, August Forel Str. 1, CH-8008 Zürich, Switzerland.
| | | | | | | | | | | | | | | |
Collapse
|
74
|
Thompson LM, Marsh JL. Invertebrate models of neurologic disease: insights into pathogenesis and therapy. Curr Neurol Neurosci Rep 2003; 3:442-8. [PMID: 12914688 DOI: 10.1007/s11910-003-0028-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The search for cures of human diseases can be very slow, expensive, and serendipitous. Roughly five decades of basic research in a handful of model systems has revealed that most animals are quite similar to one another especially at the cellular and molecular levels. The commonalities allow one to use animal models to investigate human disease mechanisms. Here, we review contributions demonstrating the use of invertebrate models to investigate human neurodegenerative diseases. We conclude that the integration of fly and worm models into programs seeking to identify therapeutic strategies for neurodegenerative disease can significantly speed progress toward finding cures for these devastating diseases.
Collapse
Affiliation(s)
- Leslie Michels Thompson
- Department of Psychiatry and Human Behavior, University of California, Irvine, Gillespie 2121, Irvine, CA 92697-4260, USA.
| | | |
Collapse
|
75
|
Driscoll M, Gerstbrein B. Dying for a cause: invertebrate genetics takes on human neurodegeneration. Nat Rev Genet 2003; 4:181-94. [PMID: 12610523 DOI: 10.1038/nrg1018] [Citation(s) in RCA: 107] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
If invertebrate neurons are injured by hostile environments or aberrant proteins they die much like human neurons, indicating that the powerful advantages of invertebrate molecular genetics might be successfully used for testing specific hypotheses about human neurological diseases, for drug discovery and for non-biased screens for suppressors and enhancers of neurodegeneration. Recent molecular dissection of the genetic requirements for hypoxia, excitotoxicity and death in models of Alzheimer disease, polyglutamine-expansion disorders, Parkinson disease and more, is providing mechanistic insights into neurotoxicity and suggesting new therapeutic interventions. An emerging theme is that neuronal crises of distinct origins might converge to disrupt common cellular functions, such as protein folding and turnover.
Collapse
Affiliation(s)
- Monica Driscoll
- Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, A232 Nelson Biological Laboratories, 604 Allison Road, Piscataway, New Jersey 08855, USA.
| | | |
Collapse
|
76
|
McCarter JP, Mitreva MD, Martin J, Dante M, Wylie T, Rao U, Pape D, Bowers Y, Theising B, Murphy CV, Kloek AP, Chiapelli BJ, Clifton SW, Bird DM, Waterston RH. Analysis and functional classification of transcripts from the nematode Meloidogyne incognita. Genome Biol 2003; 4:R26. [PMID: 12702207 PMCID: PMC154577 DOI: 10.1186/gb-2003-4-4-r26] [Citation(s) in RCA: 115] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2002] [Revised: 02/17/2003] [Accepted: 02/28/2003] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND Plant parasitic nematodes are major pathogens of most crops. Molecular characterization of these species as well as the development of new techniques for control can benefit from genomic approaches. As an entrée to characterizing plant parasitic nematode genomes, we analyzed 5,700 expressed sequence tags (ESTs) from second-stage larvae (L2) of the root-knot nematode Meloidogyne incognita. RESULTS From these, 1,625 EST clusters were formed and classified by function using the Gene Ontology (GO) hierarchy and the Kyoto KEGG database. L2 larvae, which represent the infective stage of the life cycle before plant invasion, express a diverse array of ligand-binding proteins and abundant cytoskeletal proteins. L2 are structurally similar to Caenorhabditis elegans dauer larva and the presence of transcripts encoding glyoxylate pathway enzymes in the M. incognita clusters suggests that root-knot nematode larvae metabolize lipid stores while in search of a host. Homology to other species was observed in 79% of translated cluster sequences, with the C. elegans genome providing more information than any other source. In addition to identifying putative nematode-specific and Tylenchida-specific genes, sequencing revealed previously uncharacterized horizontal gene transfer candidates in Meloidogyne with high identity to rhizobacterial genes including homologs of nodL acetyltransferase and novel cellulases. CONCLUSIONS With sequencing from plant parasitic nematodes accelerating, the approaches to transcript characterization described here can be applied to more extensive datasets and also provide a foundation for more complex genome analyses.
Collapse
Affiliation(s)
- James P McCarter
- Genome Sequencing Center, Department of Genetics, Box 8501, Washington University School of Medicine, St, Louis, MO 63108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
77
|
Kutsenko AS, Gizatullin RZ, Al-Amin AN, Wang F, Kvasha SM, Podowski RM, Matushkin YG, Gyanchandani A, Muravenko OV, Levitsky VG, Kolchanov NA, Protopopov AI, Kashuba VI, Kisselev LL, Wasserman W, Wahlestedt C, Zabarovsky ER. NotI flanking sequences: a tool for gene discovery and verification of the human genome. Nucleic Acids Res 2002; 30:3163-70. [PMID: 12136098 PMCID: PMC135748 DOI: 10.1093/nar/gkf428] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A set of 22 551 unique human NotI flanking sequences (16.2 Mb) was generated. More than 40% of the set had regions with significant similarity to known proteins and expressed sequences. The data demonstrate that regions flanking NotI sites are less likely to form nucleosomes efficiently and resemble promoter regions. The draft human genome sequence contained 55.7% of the NotI flanking sequences, Celera's database contained matches to 57.2% of the clones and all public databases (including non-human and previously sequenced NotI flanks) matched 89.2% of the NotI flanking sequences (identity > or =90% over at least 50 bp, data from December 2001). The data suggest that the shotgun sequencing approach used to generate the draft human genome sequence resulted in a bias against cloning and sequencing of NotI flanks. A rough estimation (based primarily on chromosomes 21 and 22) is that the human genome contains 15 000-20 000 NotI sites, of which 6000-9000 are unmethylated in any particular cell. The results of the study suggest that the existing tools for computational determination of CpG islands fail to identify a significant fraction of functional CpG islands, and unmethylated DNA stretches with a high frequency of CpG dinucleotides can be found even in regions with low CG content.
Collapse
MESH Headings
- Cell Line, Transformed
- Chromosomes, Human, Pair 21/genetics
- Chromosomes, Human, Pair 22/genetics
- CpG Islands/genetics
- DNA/chemistry
- DNA/genetics
- DNA/metabolism
- Databases, Nucleic Acid
- Deoxyribonucleases, Type II Site-Specific/metabolism
- Genes/genetics
- Genome, Human
- Humans
- Molecular Sequence Data
- Repetitive Sequences, Nucleic Acid/genetics
- Sequence Analysis, DNA/methods
Collapse
Affiliation(s)
- Alexey S Kutsenko
- Center for Genomics and Bioinformatics, Karolinska Institute, 171 77 Stockholm, Sweden
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
78
|
Harrison PM, Gerstein M. Studying genomes through the aeons: protein families, pseudogenes and proteome evolution. J Mol Biol 2002; 318:1155-74. [PMID: 12083509 DOI: 10.1016/s0022-2836(02)00109-2] [Citation(s) in RCA: 120] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Protein families can be used to understand many aspects of genomes, both their "live" and their "dead" parts (i.e. genes and pseudogenes). Surveys of genomes have revealed that, in every organism, there are always a few large families and many small ones, with the overall distribution following a power-law. This commonality is equally true for both genes and pseudogenes, and exists despite the fact that the specific families that are enlarged differ greatly between organisms. Furthermore, because of family structure there is great redundancy in proteomes, a fact linked to the large number of dispensable genes for each organism and the small size of the minimal, indispensable sub-proteome. Pseudogenes in prokaryotes represent families that are in the process of being dispensed with. In particular, the genome sequences of certain pathogenic bacteria (Mycobacterium leprae, Yersinia pestis and Rickettsia prowazekii) show how an organism can undergo reductive evolution on a large scale (i.e. the dying out of families) as a result of niche change. There appears to be less pressure to delete pseudogenes in eukaryotes. These can be divided into two varieties, duplicated and processed, where the latter involves reverse transcription from an mRNA intermediate. We discuss these collectively in yeast, worm, fly, and human. The fly has few pseudogenes apparently because of its high rate of genomic DNA deletion. In the other three organisms, the distribution of pseudogenes on the chromosome and amongst different families is highly non-uniform. Pseudogenes tend not to occur in the middle of chromosome arms, and tend to be associated with lineage-specific (as opposed to highly conserved) families that have environmental-response functions. This may be because, rather than being dead, they may form a reservoir of diverse "extra parts" that can be resurrected to help an organism adapt to its surroundings. In yeast, there may be a novel mechanism involving the [PSI+] prion that potentially enables this resurrection. In worm, the pseudogenes tend to arise out of families (e.g. chemoreceptors) that are greatly expanded in it compared to the fly. The human genome stands out in having many processed pseudogenes. These have a character very different from those of the duplicated variety, to a large extent just representing random insertions. Thus, their occurrence tends to be roughly in proportion to the amount of mRNA for a particular protein and to reflect the extent of the intergenic sequences. Further information about pseudogenes is available at http://genecensus.org/pseudogene
Collapse
Affiliation(s)
- Paul M Harrison
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8114, USA
| | | |
Collapse
|
79
|
|
80
|
Stöhr H, Marquardt A, Nanda I, Schmid M, Weber BHF. Three novel human VMD2-like genes are members of the evolutionary highly conserved RFP-TM family. Eur J Hum Genet 2002; 10:281-4. [PMID: 12032738 DOI: 10.1038/sj.ejhg.5200796] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2001] [Revised: 02/11/2002] [Accepted: 02/14/2002] [Indexed: 11/08/2022] Open
Abstract
The RFP-TM protein family was first described in Caenorhabditis elegans as hypothetical transmembrane proteins containing a conserved 350-400 amino acid domain including the invariant peptide motif RFP. The VMD2 gene underlying Best disease was shown to represent the first human member of the RFP-TM protein family. More than 97% of the disease-causing mutations are located in the N-terminal RFP-TM domain implying important functional properties. Here, we have identified three novel VMD2-related human genes (VMD2L1, VMD2L2 and VMD2L3) demonstrating a high degree of conservation in their respective RFP-TM domains. Each of the VMD2-like proteins has a unique C-terminus that lack similarity to other proteins or motifs. By FISH analysis, VMD2L1 was localised to chromosome 19p13.2-p13.12, VMD2L2 to 1p32.3-p33 and VMD2L3 to 12q14.2-q15. RT-PCR analyses revealed tissue-restricted expression of the three genes with both VMD2L1 and VMD2L2 abundantly transcribed in colon. VMD2L1 is present in the retinal pigment epithelium while VMD2L3 shows predominant expression in skeletal muscle.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Bestrophins
- Chloride Channels
- Chromosome Mapping
- Chromosomes, Human, Pair 1/genetics
- Chromosomes, Human, Pair 12/genetics
- Chromosomes, Human, Pair 19/genetics
- Colon/metabolism
- Conserved Sequence
- Evolution, Molecular
- Eye Proteins/chemistry
- Eye Proteins/genetics
- Gene Expression
- Humans
- In Situ Hybridization, Fluorescence
- Macular Degeneration/genetics
- Molecular Sequence Data
- Muscle Proteins/chemistry
- Muscle Proteins/genetics
- Muscle, Skeletal/metabolism
- Pigment Epithelium of Eye/metabolism
- Protein Structure, Tertiary
- Proteins/chemistry
- Proteins/genetics
- Reverse Transcriptase Polymerase Chain Reaction
- Sequence Alignment
- Sequence Analysis, DNA
Collapse
Affiliation(s)
- Heidi Stöhr
- Institut für Humangenetik, Universität Würzburg, Germany
| | | | | | | | | |
Collapse
|
81
|
Kawabata T, Fukuchi S, Homma K, Ota M, Araki J, Ito T, Ichiyoshi N, Nishikawa K. GTOP: a database of protein structures predicted from genome sequences. Nucleic Acids Res 2002; 30:294-8. [PMID: 11752318 PMCID: PMC99104 DOI: 10.1093/nar/30.1.294] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Large-scale genome projects generate an unprecedented number of protein sequences, most of them are experimentally uncharacterized. Predicting the 3D structures of sequences provides important clues as to their functions. We constructed the Genomes TO Protein structures and functions (GTOP) database, containing protein fold predictions of a huge number of sequences. Predictions are mainly carried out with the homology search program PSI-BLAST, currently the most popular among high-sensitivity profile search methods. GTOP also includes the results of other analyses, e.g. homology and motif search, detection of transmembrane helices and repetitive sequences. We have completed analyzing the sequences of 41 organisms, with the number of proteins exceeding 120 000 in total. GTOP uses a graphical viewer to present the analytical results of each ORF in one page in a 'color-bar' format. The assigned 3D structures are presented by Chime plug-in or RasMol. The binding sites of ligands are also included, providing functional information. The GTOP server is available at http://spock.genes.nig.ac.jp/~genome/gtop.html.
Collapse
Affiliation(s)
- Takeshi Kawabata
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, 1-111 Yata, Mishima, Shizuoka 411-8540, Japan
| | | | | | | | | | | | | | | |
Collapse
|
82
|
Hanák P, Jezek P. Mitochondrial uncoupling proteins and phylogenesis--UCP4 as the ancestral uncoupling protein. FEBS Lett 2001; 495:137-41. [PMID: 11334880 DOI: 10.1016/s0014-5793(01)02338-9] [Citation(s) in RCA: 79] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We searched for the previously defined uncoupling protein (UCP) signatures [Jezek, P. and Urbánková, E. (2000) IUBMB Life 49, 63-70] in genomes of Drosophila melanogaster, Caenorhabditis elegans, Dictyostelium discoideum, and Arabidopsis thaliana. We identified four UCPs in Drosophila and one in Caenorhabditis or Dictyostelium as close relatives of human UCP4 (BMCP), but distant from UCP1, UCP2, UCP3, and two plant UCPs of Arabidopsis. But the third Arabidopsis UCP is the closest UCP4 relative. This suggests that UCP4 represents the ancestral UCP from which other mammalian and plant UCPs diverged. Speculations on UCP4 participation in apoptosis are thus supported by its early phylogenetic occurrence.
Collapse
Affiliation(s)
- P Hanák
- Department of Membrane Transport Biophysics, No.375, Institute of Physiology, Academy of Sciences of the Czech Republic, Vídenská 1083, CZ 14220, Prague, Czech Republic
| | | |
Collapse
|
83
|
Remm M, Sonnhammer E. Classification of transmembrane protein families in the Caenorhabditis elegans genome and identification of human orthologs. Genome Res 2000; 10:1679-89. [PMID: 11076853 PMCID: PMC310950 DOI: 10.1101/gr.gr-1491r] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The complete genome sequence of the nematode Caenorhabditis elegans provides an excellent basis for studying the distribution and evolution of protein families in higher eukaryotes. Three fundamental questions are as follows: How many paralog clusters exist in one species, how many of these are shared with other species, and how many proteins can be assigned a functional counterpart in other species? We have addressed these questions in a detailed study of predicted membrane proteins in C. elegans and their mammalian homologs. All worm proteins predicted to contain at least two transmembrane segments were clustered on the basis of sequence similarity. This resulted in 189 groups with two or more sequences, containing, in total, 2647 worm proteins. Hidden Markov models (HMMs) were created for each family, and were used to retrieve mammalian homologs from the SWISSPROT, TREMBL, and VTS databases. About one-half of these clusters had mammalian homologs. Putative worm-mammalian orthologs were extracted by use of nine different phylogenetic methods and BLAST. Eight clusters initially thought to be worm-specific were assigned mammalian homologs after searching EST and genomic sequences. A compilation of 174 orthology assignments made with high confidence is presented.
Collapse
Affiliation(s)
- M Remm
- Center for Genomics Research, Karolinska Institute, Stockholm, 17177 Sweden
| | | |
Collapse
|
84
|
Zabarovsky ER, Gizatullin R, Podowski RM, Zabarovska VV, Xie L, Muravenko OV, Kozyrev S, Petrenko L, Skobeleva N, Li J, Protopopov A, Kashuba V, Ernberg I, Winberg G, Wahlestedt C. NotI clones in the analysis of the human genome. Nucleic Acids Res 2000; 28:1635-9. [PMID: 10710430 PMCID: PMC102789 DOI: 10.1093/nar/28.7.1635] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Not I linking clones contain sequences flanking Not I recognition sites and were previously shown to be tightly associated with CpG islands and genes. To directly assess the value of Not I clones in genome research, high density grids with 50 000 Not I linking clones originating from six representative Not I linking libraries were constructed. Altogether, these libraries contained nearly 100 times the total number of Not I sites in the human genome. A total of 3437 sequences flanking Not I sites were generated. Analysis of 3265 unique sequences demonstrated that 51% of the clones displayed significant protein similarity to SWISSPROT and TREMBL database proteins based on MSPcrunch filtering with stringent parameters. Of the 3265 sequences, 1868 (57.2%) were new sequences, not present in the EMBL and EST databases (similarity < or =90%). Among these new sequences, 795 (24.3%) showed similarity to known proteins and 712 (21.8%) displayed an identity of >75% at the nucleotide level to sequences from EMBL or EST databases. The remaining 361 (11.1%) sequences were completely new, i.e. <75% identical. The work also showed tight, specific association of Not I sites with the first exon and suggest that the so-called 3' ESTs can actually be generated from 5'-ends of genes that contain Not I sites in their first exon.
Collapse
Affiliation(s)
- E R Zabarovsky
- Center for Genomics Research, Karolinska Institute, Stockholm, Sweden.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
85
|
Hutter H, Vogel BE, Plenefisch JD, Norris CR, Proenca RB, Spieth J, Guo C, Mastwal S, Zhu X, Scheel J, Hedgecock EM. Conservation and novelty in the evolution of cell adhesion and extracellular matrix genes. Science 2000; 287:989-94. [PMID: 10669422 DOI: 10.1126/science.287.5455.989] [Citation(s) in RCA: 204] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
New proteins and modules have been invented throughout evolution. Gene "birth dates" in Caenorhabditis elegans range from the origins of cellular life through adaptation to a soil habitat. Possibly half are "metazoan" genes, having arisen sometime between the yeast-metazoan and nematode-chordate separations. These include basement membrane and cell adhesion molecules implicated in tissue organization. By contrast, epithelial surfaces facing the environment have specialized components invented within the nematode lineage. Moreover, interstitial matrices were likely elaborated within the vertebrate lineage. A strategy for concerted evolution of new gene families, as well as conservation of adaptive genes, may underlie the differences between heterochromatin and euchromatin.
Collapse
Affiliation(s)
- H Hutter
- Max-Planck-Institute for Medical Research, Jahnstrasse 29, 69120 Heidelberg, Germany.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
86
|
Robertson HM. The large srh family of chemoreceptor genes in Caenorhabditis nematodes reveals processes of genome evolution involving large duplications and deletions and intron gains and losses. Genome Res 2000; 10:192-203. [PMID: 10673277 DOI: 10.1101/gr.10.2.192] [Citation(s) in RCA: 107] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The srh family of chemoreceptors in the nematode Caenorhabditis elegans is very large, containing 214 genes and 90 pseudogenes. It is related to the str, stl, and srd families of seven-transmembrane or serpentine receptors. Like these three families, most srh genes are concentrated on chromosome V, and mapping of their chromosomal locations on a phylogenetic tree reveals 27 different movements of genes to other chromosomes. Mapping of intron gains and losses onto the phylogenetic tree reveals that the last common ancestral gene of the family had five introns, which are inferred to have been lost 70 times independently during evolution of the family. In addition, seven intron gains are revealed, three of which are fairly recent. Comparisons with 20 family members in the C. briggsae genome confirms these patterns, including two intron losses in C. briggsae since the species split. There are 14 clear C. elegans orthologs for these 20 genes, whose average amino acid divergence of 68% allows estimation of 85 gene duplications in the C. elegans lineage since the species split. The absence of six orthologs in C. elegans also indicates that gene loss occurs; consideration of all deletions and terminal truncations of srh pseudogenes reveals that large deletions are common. Together these observations provide insight into the evolutionary dynamics of this compact animal genome.
Collapse
Affiliation(s)
- H M Robertson
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801 USA.
| |
Collapse
|
87
|
Liu LX, Spoerke JM, Mulligan EL, Chen J, Reardon B, Westlund B, Sun L, Abel K, Armstrong B, Hardiman G, King J, McCague L, Basson M, Clover R, Johnson CD. High-throughput isolation of Caenorhabditis elegans deletion mutants. Genome Res 1999; 9:859-67. [PMID: 10508845 PMCID: PMC310813 DOI: 10.1101/gr.9.9.859] [Citation(s) in RCA: 143] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/1999] [Accepted: 07/15/1999] [Indexed: 11/25/2022]
Abstract
The nematode Caenorhabditis elegans is the first animal whose genome is completely sequenced, providing a rich source of gene information relevant to metazoan biology and human disease. This abundant sequence information permits a broad-based gene inactivation approach in C. elegans, in which chemically mutagenized nematode populations are screened by PCR for deletion mutations in a specific targeted gene. By handling mutagenized worm growth, genomic DNA templates, PCR screens, and mutant recovery all in 96-well microtiter plates, we have scaled up this approach to isolate deletion mutations in >100 genes to date. Four chemical mutagens, including ethyl methane sulfonate, ethlynitrosourea, diepoxyoctane, and ultraviolet-activated trimethylpsoralen, induced detectable deletions at comparable frequencies. The deletions averaged approximately 1400 bp in size when using a approximately 3 kb screening window. The vast majority of detected deletions removed portions of one or more exons, likely resulting in loss of gene function. This approach requires only the knowledge of a target gene sequence and a suitable mutagen, and thus provides a scalable systematic approach to gene inactivation for any organism that can be handled in high density arrays.
Collapse
Affiliation(s)
- L X Liu
- Axys Pharmaceuticals, NemaPharm Group, South San Francisco, California 94080, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
88
|
|
89
|
Marra M, Hillier L, Kucaba T, Allen M, Barstead R, Beck C, Blistain A, Bonaldo M, Bowers Y, Bowles L, Cardenas M, Chamberlain A, Chappell J, Clifton S, Favello A, Geisel S, Gibbons M, Harvey N, Hill F, Jackson Y, Kohn S, Lennon G, Mardis E, Martin J, Mila L, McCann R, Morales R, Pape D, Person B, Prange C, Ritter E, Soares M, Schurk R, Shin T, Steptoe M, Swaller T, Theising B, Underwood K, Wylie T, Yount T, Wilson R, Waterston R. An encyclopedia of mouse genes. Nat Genet 1999; 21:191-4. [PMID: 9988271 DOI: 10.1038/5976] [Citation(s) in RCA: 91] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The laboratory mouse is the premier model system for studies of mammalian development due to the powerful classical genetic analysis possible (see also the Jackson Laboratory web site, http://www.jax.org/) and the ever-expanding collection of molecular tools. To enhance the utility of the mouse system, we initiated a program to generate a large database of expressed sequence tags (ESTs) that can provide rapid access to genes. Of particular significance was the possibility that cDNA libraries could be prepared from very early stages of development, a situation unrealized in human EST projects. We report here the development of a comprehensive database of ESTs for the mouse. The project, initiated in March 1996, has focused on 5' end sequences from directionally cloned, oligo-dT primed cDNA libraries. As of 23 October 1998, 352,040 sequences had been generated, annotated and deposited in dbEST, where they comprised 93% of the total ESTs available for mouse. EST data are versatile and have been applied to gene identification, comparative sequence analysis, comparative gene mapping and candidate disease gene identification, genome sequence annotation, microarray development and the development of gene-based map resources.
Collapse
Affiliation(s)
- M Marra
- Washington University Genome Sequencing Center, St. Louis, Missouri 63108, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
90
|
Abstract
Neurotransmitter receptors, neurotransmitter synthesis and release pathways, and heterotrimeric GTP-binding protein (G protein)-coupled second messenger pathways are highly conserved between Caenorhabditis elegans and mammals, but gap junctions and chemosensory receptors have independent origins in vertebrates and nematodes. Most ion channels are similar to vertebrate channels but there are no predicted voltage-activated sodium channels. The C. elegans genome encodes at least 80 potassium channels, 90 neurotransmitter-gated ion channels, 50 peptide receptors, and up to 1000 orphan receptors that may be chemoreceptors. For many gene families, C. elegans has both conventional members and divergent outliers with weak homology to known genes; these outliers may provide insights into previously unknown functions of conserved protein families.
Collapse
Affiliation(s)
- C I Bargmann
- Howard Hughes Medical Institute, Programs in Developmental Biology, Neuroscience, and Genetics, Department of Anatomy, University of California, San Francisco, CA 94143-0452, USA
| |
Collapse
|
91
|
Abstract
Caenorhabditis elegans is a rhabditid nematode. What relevance does this have for the interpretation of the complete genome sequence, and how will it affect the exploitation of the sequence for scientific and social ends? Nematodes are only distantly related to humans and other animal groups; will this limit the universality of the C. elegans story? Many nematodes are parasites; can knowledge of the C. elegans sequence aid in the prevention and treatment of disease?
Collapse
Affiliation(s)
- M Blaxter
- Institute of Cell, Animal, and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, UK
| |
Collapse
|
92
|
Abstract
The 97-megabase genomic sequence of the nematode Caenorhabditis elegans reveals over 19,000 genes. More than 40 percent of the predicted protein products find significant matches in other organisms. There is a variety of repeated sequences, both local and dispersed. The distinctive distribution of some repeats and highly conserved genes provides evidence for a regional organization of the chromosomes.
Collapse
|
93
|
Abstract
Eight microbial genomes are compared in terms of protein structure. Specifically, yeast, H. influenzae, M. genitalium, M. jannaschii, Synechocystis, M. pneumoniae, H. pylori, and E. coli are compared in terms of patterns of fold usage-whether a given fold occurs in a particular organism. Of the approximately 340 soluble protein folds currently in the structure databank (PDB), 240 occur in at least one of the eight genomes, and 30 are shared amongst all eight. The shared folds are depleted in allhelical structure and enriched in mixed helix-sheet structure compared to the folds in the PDB. The top-10 most common of the shared 30 are enriched in superfolds, uniting many non-homologous sequence families, and are especially similar in overall architecture-eight having helices packed onto a central sheet. They are also very different from the common folds in the PBD, highlighting databank biases. Folds can be ranked in terms of expression as well as genome duplication. In yeast the top-10 most highly expressed folds are considerably different from the most highly duplicated folds. A tree can be constructed grouping genomes in terms of their shared folds. This has a remarkably similar topology to more conventional classifications, based on very different measures of relatedness. Finally, folds of membrane proteins can be analyzed through transmembrane-helix (TM) prediction. All the genomes appear to have similar usage patterns for these folds, with the occurrence of a particular fold falling off rapidly with increasing numbers of TM-elements, according to a "Zipf-like" law. This implies there are no marked preferences for proteins with particular numbers of TM-helices (e.g. 7-TM) in microbial genomes.
Collapse
Affiliation(s)
- M Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA.
| |
Collapse
|
94
|
Gerstein M, Hegyi H. Comparing genomes in terms of protein structure: surveys of a finite parts list. FEMS Microbiol Rev 1998; 22:277-304. [PMID: 10357579 DOI: 10.1111/j.1574-6976.1998.tb00371.x] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
We give an overview of the emerging field of structural genomics, describing how genomes can be compared in terms of protein structure. As the number of genes in a genome and the total number of protein folds are both quite limited, these comparisons take the form of surveys of a finite parts list, similar in respects to demographic censuses. Fold surveys have many similarities with other whole-genome characterizations, e.g., analyses of motifs or pathways. However, structure has a number of aspects that make it particularly suitable for comparing genomes, namely the way it allows for the precise definition of a basic protein module and the fact that it has a better defined relationship to sequence similarity than does protein function. An essential requirement for a structure survey is a library of folds, which groups the known structures into 'fold families.' This library can be built up automatically using a structure comparison program, and we described how important objective statistical measures are for assessing similarities within the library and between the library and genome sequences. After building the library, one can use it to count the number of folds in genomes, expressing the results in the form of Venn diagrams and 'top-10' statistics for shared and common folds. Depending on the counting methodology employed, these statistics can reflect different aspects of the genome, such as the amount of internal duplication or gene expression. Previous analyses have shown that the common folds shared between very different microorganisms, i.e., in different kingdoms, have a remarkably similar structure, being comprised of repeated strand-helix-strand super-secondary structure units. A major difficulty with this sort of 'fold-counting' is that only a small subset of the structures in a complete genome are currently known and this subset is prone to sampling bias. One way of overcoming biases is through structure prediction, which can be applied uniformly and comprehensively to a whole genome. Various investigators have, in fact, already applied many of the existing techniques for predicting secondary structure and transmembrane (TM) helices to the recently sequenced genomes. The results have been consistent: microbial genomes have similar fractions of strands and helices even though they have significantly different amino acid composition. The fraction of membrane proteins with a given number of TM helices falls off rapidly with more TM elements, approximately according to a Zipf law. This latter finding indicates that there is no preference for the highly studied 7-TM proteins in microbial genomes. Continuously updated tables and further information pertinent to this review are available over the web at http://bioinfo.mbb.yale.edu/genome.
Collapse
Affiliation(s)
- M Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
| | | |
Collapse
|
95
|
Robertson HM. Two large families of chemoreceptor genes in the nematodes Caenorhabditis elegans and Caenorhabditis briggsae reveal extensive gene duplication, diversification, movement, and intron loss. Genome Res 1998; 8:449-63. [PMID: 9582190 DOI: 10.1101/gr.8.5.449] [Citation(s) in RCA: 119] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The str family of genes encoding seven-transmembrane G-protein-coupled or serpentine receptors related to the ODR-10 diacetyl chemoreceptor is very large, with at least 197 members in the Caenorhabditis elegans genome. The closely related stl family has 43 genes, and both families are distantly related to the srd family with 55 genes. Analysis of the structures of these genes indicates that a third of them are clearly or likely pseudogenes. Preliminary surveys of other candidate chemoreceptor families indicates that as many as 800 genes and pseudogenes or 6% of the genome might encode 550 functional chemoreceptors constituting 4% of the C. elegans protein complement. Phylogenetic analyses of the str and stl families, and comparisons with a few orthologs in Caenorhabditis briggsae, reveal ongoing processes of gene duplication, diversification, and movement. The reconstructed ancestral gene structures for these two families have eight introns each, four of which are homologous. Mapping of intron distributions on the phylogenetic tree reveals that each intron has been lost many times independently. Most of these introns were lost individually, which might best be explained by precise in-frame deletions involving nonhomologous recombination between short direct repeats at their termini. [Alignment of the putatively functional proteins in the str and stl families is available from Pfam (http://genome. wustl.edu/Pfam); alignments of all translations are available at http://cshl.org/gr; alignments of the genes are available from the author at hughrobe@uiuc.edu]
Collapse
Affiliation(s)
- H M Robertson
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.
| |
Collapse
|