Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Buillard V, Cerutti L, Copley R, Courcelle E, Das U, Daugherty L, Dibley M, Finn R, Fleischmann W, Gough J, Haft D, Hulo N, Hunter S, Kahn D, Kanapin A, Kejariwal A, Labarga A, Langendijk-Genevaux PS, Lonsdale D, Lopez R, Letunic I, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Nikolskaya AN, Orchard S, Orengo C, Petryszak R, Selengut JD, Sigrist CJA, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. New developments in the InterPro database. Nucleic Acids Res 2007;35:D224-8. [PMID: 17202162 PMCID: PMC1899100 DOI: 10.1093/nar/gkl841] [Citation(s) in RCA: 349] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Revised: 10/06/2006] [Accepted: 10/06/2006] [Indexed: 11/14/2022] Open

For:	Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Buillard V, Cerutti L, Copley R, Courcelle E, Das U, Daugherty L, Dibley M, Finn R, Fleischmann W, Gough J, Haft D, Hulo N, Hunter S, Kahn D, Kanapin A, Kejariwal A, Labarga A, Langendijk-Genevaux PS, Lonsdale D, Lopez R, Letunic I, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Nikolskaya AN, Orchard S, Orengo C, Petryszak R, Selengut JD, Sigrist CJA, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. New developments in the InterPro database. Nucleic Acids Res 2007;35:D224-8. [PMID: 17202162 PMCID: PMC1899100 DOI: 10.1093/nar/gkl841] [Citation(s) in RCA: 349] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Revised: 10/06/2006] [Accepted: 10/06/2006] [Indexed: 11/14/2022] Open

Number

Cited by Other Article(s)

201

Poetsch A, Wolters D. Bacterial membrane proteomics. Proteomics 2009;8:4100-22. [PMID: 18780352 DOI: 10.1002/pmic.200800273] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

202

Wurm Y, Uva P, Ricci F, Wang J, Jemielity S, Iseli C, Falquet L, Keller L. Fourmidable: a database for ant genomics. BMC Genomics 2009;10:5. [PMID: 19126223 PMCID: PMC2639375 DOI: 10.1186/1471-2164-10-5] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2008] [Accepted: 01/06/2009] [Indexed: 11/10/2022] Open

203

Beeby M, Bobik TA, Yeates TO. Exploiting genomic patterns to discover new supramolecular protein assemblies. Protein Sci 2009;18:69-79. [PMID: 19177352 PMCID: PMC2708037 DOI: 10.1002/pro.1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2008] [Revised: 09/19/2008] [Accepted: 09/22/2008] [Indexed: 01/29/2023]

204

Barrell D, Dimmer E, Huntley RP, Binns D, O'Donovan C, Apweiler R. The GOA database in 2009--an integrated Gene Ontology Annotation resource. Nucleic Acids Res 2009;37:D396-403. [PMID: 18957448 PMCID: PMC2686469 DOI: 10.1093/nar/gkn803] [Citation(s) in RCA: 448] [Impact Index Per Article: 29.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2008] [Revised: 10/09/2008] [Accepted: 10/10/2008] [Indexed: 11/25/2022] Open

205

Tweedie S, Ashburner M, Falls K, Leyland P, McQuilton P, Marygold S, Millburn G, Osumi-Sutherland D, Schroeder A, Seal R, Zhang H. FlyBase: enhancing Drosophila Gene Ontology annotations. Nucleic Acids Res 2009;37:D555-9. [PMID: 18948289 PMCID: PMC2686450 DOI: 10.1093/nar/gkn788] [Citation(s) in RCA: 604] [Impact Index Per Article: 40.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2008] [Revised: 10/08/2008] [Accepted: 10/09/2008] [Indexed: 11/13/2022] Open

206

Bhasi A, Philip P, Manikandan V, Senapathy P. ExDom: an integrated database for comparative analysis of the exon-intron structures of protein domains in eukaryotes. Nucleic Acids Res 2009;37:D703-11. [PMID: 18984624 PMCID: PMC2686582 DOI: 10.1093/nar/gkn746] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2008] [Revised: 10/02/2008] [Accepted: 10/03/2008] [Indexed: 11/27/2022] Open

207

Ordóñez GR, Puente XS, Quesada V, López-Otín C. Proteolytic systems: constructing degradomes. Methods Mol Biol 2009;539:33-47. [PMID: 19377972 DOI: 10.1007/978-1-60327-003-8_2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

208

Protein Sequence Databases. Bioinformatics 2009. [DOI: 10.1007/978-0-387-92738-1_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open

209

Wichadakul D, McDermott J, Samudrala R. Prediction and integration of regulatory and protein-protein interactions. Methods Mol Biol 2009;541:101-43. [PMID: 19381527 DOI: 10.1007/978-1-59745-243-4_6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

210

McDowall MD, Scott MS, Barton GJ. PIPs: human protein-protein interaction prediction database. Nucleic Acids Res 2009;37:D651-6. [PMID: 18988626 PMCID: PMC2686497 DOI: 10.1093/nar/gkn870] [Citation(s) in RCA: 203] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2008] [Revised: 09/25/2008] [Accepted: 10/18/2008] [Indexed: 12/14/2022] Open

211

Fey P, Gaudet P, Curk T, Zupan B, Just EM, Basu S, Merchant SN, Bushmanova YA, Shaulsky G, Kibbe WA, Chisholm RL. dictyBase--a Dictyostelium bioinformatics resource update. Nucleic Acids Res 2009;37:D515-9. [PMID: 18974179 PMCID: PMC2686522 DOI: 10.1093/nar/gkn844] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2008] [Revised: 10/14/2008] [Accepted: 10/15/2008] [Indexed: 12/14/2022] Open

Affiliation(s)

Petra Fey dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Pascale Gaudet dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Tomaz Curk dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Blaz Zupan dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Eric M. Just dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Siddhartha Basu dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Sohel N. Merchant dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Yulia A. Bushmanova dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Gad Shaulsky dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Warren A. Kibbe dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
Rex L. Chisholm dictyBase, Northwestern University Biomedical Informatics Center and Center for Genetic Medicine, Chicago, IL 60611, USA, Faculty of Computer and Information Science, University of Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA

Collapse

212

Ogata Y, Sakurai N, Aoki K, Suzuki H, Okazaki K, Saito K, Shibata D. KAGIANA: an excel-based tool for retrieving summary information on Arabidopsis genes. PLANT & CELL PHYSIOLOGY 2009;50:173-7. [PMID: 19043069 PMCID: PMC2638708 DOI: 10.1093/pcp/pcn179] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2008] [Accepted: 11/17/2008] [Indexed: 05/21/2023]

213

Tilford CA, Siemers NO. Gene set enrichment analysis. Methods Mol Biol 2009;563:99-121. [PMID: 19597782 DOI: 10.1007/978-1-60761-175-2_6] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]

214

Letunic I, Doerks T, Bork P. SMART 6: recent updates and new developments. Nucleic Acids Res 2009. [PMID: 18978020 DOI: 10.1093/nar] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/16/2023] Open

215

Park H, Huxley-Jones J, Boot-Handford RP, Bishop PN, Attwood TK, Bella J. LRRCE: a leucine-rich repeat cysteine capping motif unique to the chordate lineage. BMC Genomics 2008;9:599. [PMID: 19077264 PMCID: PMC2637281 DOI: 10.1186/1471-2164-9-599] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2008] [Accepted: 12/12/2008] [Indexed: 01/27/2023] Open

Abstract

Background

The small leucine-rich repeat proteins and proteoglycans (SLRPs) form an important family of regulatory molecules that participate in many essential functions. They typically control the correct assembly of collagen fibrils, regulate mineral deposition in bone, and modulate the activity of potent cellular growth factors through many signalling cascades. SLRPs belong to the group of extracellular leucine-rich repeat proteins that are flanked at both ends by disulphide-bonded caps that protect the hydrophobic core of the terminal repeats. A capping motif specific to SLRPs has been recently described in the crystal structures of the core proteins of decorin and biglycan. This motif, designated as LRRCE, differs in both sequence and structure from other, more widespread leucine-rich capping motifs. To investigate if the LRRCE motif is a common structural feature found in other leucine-rich repeat proteins, we have defined characteristic sequence patterns and used them in genome-wide searches.

Results

The LRRCE motif is a structural element exclusive to the main group of SLRPs. It appears to have evolved during early chordate evolution and is not found in protein sequences from non-chordate genomes. Our search has expanded the family of SLRPs to include new predicted protein sequences, mainly in fishes but with intriguing putative orthologs in mammals. The chromosomal locations of the newly predicted SLRP genes would support the large-scale genome or gene duplications that are thought to have occurred during vertebrate evolution. From this expanded list we describe a new class of SLRP sequences that could be representative of an ancestral SLRP gene.

Conclusion

Given its exclusivity the LRRCE motif is a useful annotation tool for the identification and classification of new SLRP sequences in genome databases. The expanded list of members of the SLRP family offers interesting insights into early vertebrate evolution and suggests an early chordate evolutionary origin for the LRRCE capping motif.

Collapse

216

GeneDistiller--distilling candidate genes from linkage intervals. PLoS One 2008;3:e3874. [PMID: 19057649 PMCID: PMC2587712 DOI: 10.1371/journal.pone.0003874] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2008] [Accepted: 11/10/2008] [Indexed: 11/19/2022] Open

217

Jung K, Park J, Choi J, Park B, Kim S, Ahn K, Choi J, Choi D, Kang S, Lee YH. SNUGB: a versatile genome browser supporting comparative and functional fungal genomics. BMC Genomics 2008;9:586. [PMID: 19055845 PMCID: PMC2649115 DOI: 10.1186/1471-2164-9-586] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2008] [Accepted: 12/04/2008] [Indexed: 12/24/2022] Open

218

Comparative genomic analysis of carbon and nitrogen assimilation mechanisms in three indigenous bioleaching bacteria: predictions and validations. BMC Genomics 2008;9:581. [PMID: 19055775 PMCID: PMC2607301 DOI: 10.1186/1471-2164-9-581] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2008] [Accepted: 12/03/2008] [Indexed: 11/10/2022] Open

219

Coates BS, Sumerford DV, Hellmich RL, Lewis LC. Mining an Ostrinia nubilalis midgut expressed sequence tag (EST) library for candidate genes and single nucleotide polymorphisms (SNPs). INSECT MOLECULAR BIOLOGY 2008;17:607-620. [PMID: 19133073 DOI: 10.1111/j.1365-2583.2008.00833.x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

220

Mabey Gilsenan JE, Atherton G, Bartholomew J, Giles PF, Attwood TK, Denning DW, Bowyer P. Aspergillus genomes and the Aspergillus cloud. Nucleic Acids Res 2008;37:D509-14. [PMID: 19039001 PMCID: PMC2686514 DOI: 10.1093/nar/gkn876] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

221

Droc G, Périn C, Fromentin S, Larmande P. OryGenesDB 2008 update: database interoperability for functional genomics of rice. Nucleic Acids Res 2008;37:D992-5. [PMID: 19036791 PMCID: PMC2686528 DOI: 10.1093/nar/gkn821] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

222

Wilson D, Pethica R, Zhou Y, Talbot C, Vogel C, Madera M, Chothia C, Gough J. SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny. Nucleic Acids Res 2008;37:D380-6. [PMID: 19036790 PMCID: PMC2686452 DOI: 10.1093/nar/gkn762] [Citation(s) in RCA: 330] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open

223

Osorio H, Martínez V, Nieto PA, Holmes DS, Quatrini R. Microbial iron management mechanisms in extremely acidic environments: comparative genomics evidence for diversity and versatility. BMC Microbiol 2008;8:203. [PMID: 19025650 PMCID: PMC2631029 DOI: 10.1186/1471-2180-8-203] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2008] [Accepted: 11/24/2008] [Indexed: 01/17/2023] Open

Abstract

BACKGROUND

Iron is an essential nutrient but can be toxic at high intracellular concentrations and organisms have evolved tightly regulated mechanisms for iron uptake and homeostasis. Information on iron management mechanisms is available for organisms living at circumneutral pH. However, very little is known about how acidophilic bacteria, especially those used for industrial copper bioleaching, cope with environmental iron loads that can be 1018 times the concentration found in pH neutral environments. This study was motivated by the need to fill this lacuna in knowledge. An understanding of how microorganisms thrive in acidic ecosystems with high iron loads requires a comprehensive investigation of the strategies to acquire iron and to coordinate this acquisition with utilization, storage and oxidation of iron through metal responsive regulation. In silico prediction of iron management genes and Fur regulation was carried out for three Acidithiobacilli: Acidithiobacillus ferrooxidans (iron and sulfur oxidizer) A. thiooxidans and A. caldus (sulfur oxidizers) that can live between pH 1 and pH 5 and for three strict iron oxidizers of the Leptospirillum genus that live at pH 1 or below.

RESULTS

Acidithiobacilli have predicted FeoB-like Fe(II) and Nramp-like Fe(II)-Mn(II) transporters. They also have 14 different TonB dependent ferri-siderophore transporters of diverse siderophore affinity, although they do not produce classical siderophores. Instead they have predicted novel mechanisms for dicitrate synthesis and possibly also for phosphate-chelation mediated iron uptake. It is hypothesized that the unexpectedly large number and diversity of Fe(III)-uptake systems confers versatility to this group of acidophiles, especially in higher pH environments (pH 4-5) where soluble iron may not be abundant. In contrast, Leptospirilla have only a FtrI-Fet3P-like permease and three TonB dependent ferri-dicitrate siderophore systems. This paucity of iron uptake systems could reflect their obligatory occupation of extremely low pH environments where high concentrations of soluble iron may always be available and were oxidized sulfur species might not compromise iron speciation dynamics. Presence of bacterioferritin in the Acidithiobacilli, polyphosphate accumulation functions and variants of FieF-like diffusion facilitators in both Acidithiobacilli and Leptospirilla, indicate that they may remove or store iron under conditions of variable availability. In addition, the Fe(II)-oxidizing capacity of both A. ferrooxidans and Leptospirilla could itself be a way to evade iron stress imposed by readily available Fe(II) ions at low pH. Fur regulatory sites have been predicted for a number of gene clusters including iron related and non-iron related functions in both the Acidithiobacilli and Leptospirilla, laying the foundation for the future discovery of iron regulated and iron-phosphate coordinated regulatory control circuits.

CONCLUSION

In silico analyses of the genomes of acidophilic bacteria are beginning to tease apart the mechanisms that mediate iron uptake and homeostasis in low pH environments. Initial models pinpoint significant differences in abundance and diversity of iron management mechanisms between Leptospirilla and Acidithiobacilli, and begin to reveal how these two groups respond to iron cycling and iron fluctuations in naturally acidic environments and in industrial operations. Niche partitions and ecological successions between acidophilic microorganisms may be partially explained by these observed differences. Models derived from these analyses pave the way for improved hypothesis testing and well directed experimental investigation. In addition, aspects of these models should challenge investigators to evaluate alternative iron management strategies in non-acidophilic model organisms.

Collapse

224

Holzmann J, Frank P, Löffler E, Bennett KL, Gerner C, Rossmanith W. RNase P without RNA: identification and functional reconstitution of the human mitochondrial tRNA processing enzyme. Cell 2008;135:462-74. [PMID: 18984158 DOI: 10.1016/j.cell.2008.09.013] [Citation(s) in RCA: 432] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2008] [Revised: 07/17/2008] [Accepted: 09/02/2008] [Indexed: 11/26/2022]

225

The UniProtKB/Swiss-Prot knowledgebase and its Plant Proteome Annotation Program. J Proteomics 2008;72:567-73. [PMID: 19084081 DOI: 10.1016/j.jprot.2008.11.010] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2008] [Revised: 11/04/2008] [Accepted: 11/10/2008] [Indexed: 11/21/2022]

226

Vizcaíno JA, Mueller M, Hermjakob H, Martens L. Charting online OMICS resources: A navigational chart for clinical researchers. Proteomics Clin Appl 2008;3:18-29. [PMID: 21136933 DOI: 10.1002/prca.200800082] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2008] [Indexed: 12/22/2022]

227

Grasso LC, Maindonald J, Rudd S, Hayward DC, Saint R, Miller DJ, Ball EE. Microarray analysis identifies candidate genes for key roles in coral development. BMC Genomics 2008;9:540. [PMID: 19014561 PMCID: PMC2629781 DOI: 10.1186/1471-2164-9-540] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2008] [Accepted: 11/14/2008] [Indexed: 01/25/2023] Open

228

Taji T, Sakurai T, Mochida K, Ishiwata A, Kurotani A, Totoki Y, Toyoda A, Sakaki Y, Seki M, Ono H, Sakata Y, Tanaka S, Shinozaki K. Large-scale collection and annotation of full-length enriched cDNAs from a model halophyte, Thellungiella halophila. BMC PLANT BIOLOGY 2008;8:115. [PMID: 19014467 PMCID: PMC2621223 DOI: 10.1186/1471-2229-8-115] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2008] [Accepted: 11/12/2008] [Indexed: 05/15/2023]

Abstract

BACKGROUND

Thellungiella halophila (also known as Thellungiella salsuginea) is a model halophyte with a small plant size, short life cycle, and small genome. It easily undergoes genetic transformation by the floral dipping method used with its close relative, Arabidopsis thaliana. Thellungiella genes exhibit high sequence identity (approximately 90% at the cDNA level) with Arabidopsis genes. Furthermore, Thellungiella not only shows tolerance to extreme salinity stress, but also to chilling, freezing, and ozone stress, supporting the use of Thellungiella as a good genomic resource in studies of abiotic stress tolerance.

RESULTS

We constructed a full-length enriched Thellungiella (Shan Dong ecotype) cDNA library from various tissues and whole plants subjected to environmental stresses, including high salinity, chilling, freezing, and abscisic acid treatment. We randomly selected about 20,000 clones and sequenced them from both ends to obtain a total of 35 171 sequences. CAP3 software was used to assemble the sequences and cluster them into 9569 nonredundant cDNA groups. We named these cDNAs "RTFL" (RIKEN Thellungiella Full-Length) cDNAs. Information on functional domains and Gene Ontology (GO) terms for the RTFL cDNAs were obtained using InterPro. The 8289 genes assigned to InterPro IDs were classified according to the GO terms using Plant GO Slim. Categorical comparison between the whole Arabidopsis genome and Thellungiella genes showing low identity to Arabidopsis genes revealed that the population of Thellungiella transport genes is approximately 1.5 times the size of the corresponding Arabidopsis genes. This suggests that these genes regulate a unique ion transportation system in Thellungiella.

CONCLUSION

As the number of Thellungiella halophila (Thellungiella salsuginea) expressed sequence tags (ESTs) was 9388 in July 2008, the number of ESTs has increased to approximately four times the original value as a result of this effort. Our sequences will thus contribute to correct future annotation of the Thellungiella genome sequence. The full-length enriched cDNA clones will enable the construction of overexpressing mutant plants by introduction of the cDNAs driven by a constitutive promoter, the complementation of Thellungiella mutants, and the determination of promoter regions in the Thellungiella genome.

Collapse

229

Shionyu M, Yamaguchi A, Shinoda K, Takahashi KI, Go M. AS-ALPS: a database for analyzing the effects of alternative splicing on protein structure, interaction and network in human and mouse. Nucleic Acids Res 2008;37:D305-9. [PMID: 19015123 PMCID: PMC2686549 DOI: 10.1093/nar/gkn869] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

230

Tremblay PL, Hallenbeck PC. Of blood, brains and bacteria, the Amt/Rh transporter family: emerging role of Amt as a unique microbial sensor. Mol Microbiol 2008;71:12-22. [PMID: 19007411 DOI: 10.1111/j.1365-2958.2008.06514.x] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

231

Gagné JP, Isabelle M, Lo KS, Bourassa S, Hendzel MJ, Dawson VL, Dawson TM, Poirier GG. Proteome-wide identification of poly(ADP-ribose) binding proteins and poly(ADP-ribose)-associated protein complexes. Nucleic Acids Res 2008;36:6959-76. [PMID: 18981049 PMCID: PMC2602769 DOI: 10.1093/nar/gkn771] [Citation(s) in RCA: 314] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open

232

Uribe P, Fuentes D, Valdés J, Shmaryahu A, Zúñiga A, Holmes D, Valenzuela PDT. Preparation and analysis of an expressed sequence tag library from the toxic dinoflagellate Alexandrium catenella. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2008;10:692-700. [PMID: 18478293 DOI: 10.1007/s10126-008-9107-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2007] [Revised: 04/10/2008] [Accepted: 04/10/2008] [Indexed: 05/26/2023]

233

Yang X, Kalluri UC, Jawdy S, Gunter LE, Yin T, Tschaplinski TJ, Weston DJ, Ranjan P, Tuskan GA. The F-box gene family is expanded in herbaceous annual plants relative to woody perennial plants. PLANT PHYSIOLOGY 2008;148:1189-200. [PMID: 18775973 PMCID: PMC2577272 DOI: 10.1104/pp.108.121921] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2008] [Accepted: 08/24/2008] [Indexed: 05/20/2023]

Abstract

F-box proteins are generally responsible for substrate recognition in the Skp1-Cullin-F-box complexes that are involved in protein degradation via the ubiquitin-26S proteasome pathway. In plants, F-box genes influence a variety of biological processes, such as leaf senescence, branching, self-incompatibility, and responses to biotic and abiotic stresses. The number of F-box genes in Populus (Populus trichocarpa; approximately 320) is less than half that found in Arabidopsis (Arabidopsis thaliana; approximately 660) or Oryza (Oryza sativa; approximately 680), even though the total number of genes in Populus is equivalent to that in Oryza and 1.5 times that in Arabidopsis. We performed comparative genomics analysis between the woody perennial plant Populus and the herbaceous annual plants Arabidopsis and Oryza in order to explicate the functional implications of this large gene family. Our analyses reveal interspecific differences in genomic distribution, orthologous relationship, intron evolution, protein domain structure, and gene expression. The set of F-box genes shared by these species appear to be involved in core biological processes essential for plant growth and development; lineage-specific differences primarily occurred because of an expansion of the F-box genes via tandem duplications in Arabidopsis and Oryza. The number of F-box genes in the newly sequenced woody species Vitis (Vitis vinifera; 156) and Carica (Carica papaya; 139) is similar to that in Populus, supporting the hypothesis that the F-box gene family is expanded in herbaceous annual plants relative to woody perennial plants. This study provides insights into the relationship between the structure and composition of the F-box gene family in herbaceous and woody species and their associated developmental and physiological features.

Collapse

234

Letunic I, Doerks T, Bork P. SMART 6: recent updates and new developments. Nucleic Acids Res 2008;37:D229-32. [PMID: 18978020 PMCID: PMC2686533 DOI: 10.1093/nar/gkn808] [Citation(s) in RCA: 738] [Impact Index Per Article: 46.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

235

Price MN, Dehal PS, Arkin AP. FastBLAST: homology relationships for millions of proteins. PLoS One 2008;3:e3589. [PMID: 18974889 PMCID: PMC2571987 DOI: 10.1371/journal.pone.0003589] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2008] [Accepted: 10/10/2008] [Indexed: 11/18/2022] Open

236

Ding G, Lorenz P, Kreutzer M, Li Y, Thiesen HJ. SysZNF: the C2H2 zinc finger gene database. Nucleic Acids Res 2008;37:D267-73. [PMID: 18974185 PMCID: PMC2686507 DOI: 10.1093/nar/gkn782] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

237

Stage- and gender-specific proteomic analysis of Brugia malayi excretory-secretory products. PLoS Negl Trop Dis 2008;2:e326. [PMID: 18958170 PMCID: PMC2569413 DOI: 10.1371/journal.pntd.0000326] [Citation(s) in RCA: 121] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2008] [Accepted: 10/01/2008] [Indexed: 11/19/2022] Open

238

Vinogradov AE. Modularity of cellular networks shows general center-periphery polarization. ACTA ACUST UNITED AC 2008;24:2814-7. [PMID: 18953046 DOI: 10.1093/bioinformatics/btn555] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

239

Klimke W, Agarwala R, Badretdin A, Chetvernin S, Ciufo S, Fedorov B, Kiryutin B, O'Neill K, Resch W, Resenchuk S, Schafer S, Tolstoy I, Tatusova T. The National Center for Biotechnology Information's Protein Clusters Database. Nucleic Acids Res 2008;37:D216-23. [PMID: 18940865 PMCID: PMC2686591 DOI: 10.1093/nar/gkn734] [Citation(s) in RCA: 219] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open

240

Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, Finn RD, Gough J, Haft D, Hulo N, Kahn D, Kelly E, Laugraud A, Letunic I, Lonsdale D, Lopez R, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Mulder N, Natale D, Orengo C, Quinn AF, Selengut JD, Sigrist CJA, Thimma M, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. InterPro: the integrative protein signature database. Nucleic Acids Res 2008;37:D211-5. [PMID: 18940856 PMCID: PMC2686546 DOI: 10.1093/nar/gkn785] [Citation(s) in RCA: 1438] [Impact Index Per Article: 89.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

241

Global gene expression profiles for life stages of the deadly amphibian pathogen Batrachochytrium dendrobatidis. Proc Natl Acad Sci U S A 2008;105:17034-9. [PMID: 18852473 DOI: 10.1073/pnas.0804173105] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

242

Carver T, Berriman M, Tivey A, Patel C, Böhme U, Barrell BG, Parkhill J, Rajandream MA. Artemis and ACT: viewing, annotating and comparing sequences stored in a relational database. ACTA ACUST UNITED AC 2008;24:2672-6. [PMID: 18845581 PMCID: PMC2606163 DOI: 10.1093/bioinformatics/btn529] [Citation(s) in RCA: 480] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

243

Zerlotini A, Heiges M, Wang H, Moraes RLV, Dominitini AJ, Ruiz JC, Kissinger JC, Oliveira G. SchistoDB: a Schistosoma mansoni genome resource. Nucleic Acids Res 2008;37:D579-82. [PMID: 18842636 PMCID: PMC2686589 DOI: 10.1093/nar/gkn681] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

244

Floris M, Orsini M, Thanaraj TA. Splice-mediated Variants of Proteins (SpliVaP) - data and characterization of changes in signatures among protein isoforms due to alternative splicing. BMC Genomics 2008;9:453. [PMID: 18831736 PMCID: PMC2573899 DOI: 10.1186/1471-2164-9-453] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2008] [Accepted: 10/02/2008] [Indexed: 12/22/2022] Open

Abstract

BACKGROUND

It is often the case that mammalian genes are alternatively spliced; the resulting alternate transcripts often encode protein isoforms that differ in amino acid sequences. Changes among the protein isoforms can alter the cellular properties of proteins. The effect can range from a subtle modulation to a complete loss of function.

RESULTS

(i) We examined human splice-mediated protein isoforms (as extracted from a manually curated data set, and from a computationally predicted data set) for differences in the annotation for protein signatures (Pfam domains and PRINTS fingerprints) and we characterized the differences & their effects on protein functionalities. An important question addressed relates to the extent of protein isoforms that may lack any known function in the cell. (ii) We present a database that reports differences in protein signatures among human splice-mediated protein isoform sequences.

CONCLUSION

(i) Characterization: The work points to distinct sets of alternatively spliced genes with varying degrees of annotation for the splice-mediated protein isoforms. Protein molecular functions seen to be often affected are those that relate to: binding, catalytic, transcription regulation, structural molecule, transporter, motor, and antioxidant; and the processes that are often affected are nucleic acid binding, signal transduction, and protein-protein interactions. Signatures are often included/excluded and truncated in length among protein isoforms; truncation is seen as the predominant type of change. Analysis points to the following novel aspects: (a) Analysis using data from the manually curated Vega indicates that one in 8.9 genes can lead to a protein isoform of no "known" function; and one in 18 expressed protein isoforms can be such an "orphan" isoform; the corresponding numbers as seen with computationally predicted ASD data set are: one in 4.9 genes and one in 9.8 isoforms. (b) When swapping of signatures occurs, it is often between those of same functional classifications. (c) Pfam domains can occur in varying lengths, and PRINTS fingerprints can occur with varying number of constituent motifs among isoforms - since such a variation is seen in large number of genes, it could be a general mechanism to modulate protein function. (ii)

DATA

The reported resource (at http://www.bioinformatica.crs4.org/tools/dbs/splivap/) provides the community ability to access data on splice-mediated protein isoforms (with value-added annotation such as association with diseases) through changes in protein signatures.

Collapse

245

Nagaraj SH, Gasser RB, Ranganathan S. Needles in the EST haystack: large-scale identification and analysis of excretory-secretory (ES) proteins in parasitic nematodes using expressed sequence tags (ESTs). PLoS Negl Trop Dis 2008;2:e301. [PMID: 18820748 PMCID: PMC2553489 DOI: 10.1371/journal.pntd.0000301] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2008] [Accepted: 08/27/2008] [Indexed: 11/28/2022] Open

Abstract

Background

Parasitic nematodes of humans, other animals and plants continue to impose a significant public health and economic burden worldwide, due to the diseases they cause. Promising antiparasitic drug and vaccine candidates have been discovered from excreted or secreted (ES) proteins released from the parasite and exposed to the immune system of the host. Mining the entire expressed sequence tag (EST) data available from parasitic nematodes represents an approach to discover such ES targets.

Methods and Findings

In this study, we predicted, using EST2Secretome, a novel, high-throughput, computational workflow system, 4,710 ES proteins from 452,134 ESTs derived from 39 different species of nematodes, parasitic in animals (including humans) or plants. In total, 2,632, 786, and 1,292 ES proteins were predicted for animal-, human-, and plant-parasitic nematodes. Subsequently, we systematically analysed ES proteins using computational methods. Of these 4,710 proteins, 2,490 (52.8%) had orthologues in Caenorhabditis elegans, whereas 621 (13.8%) appeared to be novel, currently having no significant match to any molecule available in public databases. Of the C. elegans homologues, 267 had strong “loss-of-function” phenotypes by RNA interference (RNAi) in this nematode. We could functionally classify 1,948 (41.3%) sequences using the Gene Ontology (GO) terms, establish pathway associations for 573 (12.2%) sequences using Kyoto Encyclopaedia of Genes and Genomes (KEGG), and identify protein interaction partners for 1,774 (37.6%) molecules. We also mapped 758 (16.1%) proteins to protein domains including the nematode-specific protein family “transthyretin-like” and “chromadorea ALT,” considered as vaccine candidates against filariasis in humans.

Conclusions

We report the large-scale analysis of ES proteins inferred from EST data for a range of parasitic nematodes. This set of ES proteins provides an inventory of known and novel members of ES proteins as a foundation for studies focused on understanding the biology of parasitic nematodes and their interactions with their hosts, as well as for the development of novel drugs or vaccines for parasite intervention and control.

Excretory-secretory (ES) proteins are an important class of proteins in many organisms, spanning from bacteria to human beings, and are potential drug targets for several diseases. In this study, we first developed a software platform, EST2Secretome, comprised of carefully selected computational tools to identify and analyse ES proteins from expressed sequence tags (ESTs). By employing EST2Secretome, we analysed 4,710 ES proteins derived from 0.5 million ESTs for 39 economically important and disease-causing parasites from the phylum Nematoda. Several known and novel ES proteins that were either parasite- or nematode-specific were discovered, focussing on those that are either absent from or very divergent from similar molecules in their animal or plant hosts. In addition, we found many nematode-specific protein families of domains “transthyretin-like” and “chromadorea ALT,” considered vaccine candidates for filariasis in humans. We report numerous C. elegans homologues with loss-of-function RNAi phenotypes essential for parasite survival and therefore potential targets for parasite intervention. Overall, by developing freely available software to analyse large-scale EST data, we enabled researchers working on parasites for neglected tropical diseases to select specific genes and/or proteins to carry out directed functional assays for demystifying the molecular complexities of host–parasite interactions in a cell.

Collapse

246

Davis AP, Murphy CG, Saraceni-Richards CA, Rosenstein MC, Wiegers TC, Mattingly CJ. Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical-gene-disease networks. Nucleic Acids Res 2008;37:D786-92. [PMID: 18782832 PMCID: PMC2686584 DOI: 10.1093/nar/gkn580] [Citation(s) in RCA: 215] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

247

Hydrogenomics of the extremely thermophilic bacterium Caldicellulosiruptor saccharolyticus. Appl Environ Microbiol 2008;74:6720-9. [PMID: 18776029 DOI: 10.1128/aem.00968-08] [Citation(s) in RCA: 125] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

248

Cao PJ, Bartley LE, Jung KH, Ronald PC. Construction of a rice glycosyltransferase phylogenomic database and identification of rice-diverged glycosyltransferases. MOLECULAR PLANT 2008;1:858-77. [PMID: 19825588 DOI: 10.1093/mp/ssn052] [Citation(s) in RCA: 67] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]

Abstract

Glycosyltransferases (GTs; EC 2.4.x.y) constitute a large group of enzymes that form glycosidic bonds through transfer of sugars from activated donor molecules to acceptor molecules. GTs are critical to the biosynthesis of plant cell walls, among other diverse functions. Based on the Carbohydrate-Active enZymes (CAZy) database and sequence similarity searches, we have identified 609 potential GT genes (loci) corresponding to 769 transcripts (gene models) in rice (Oryza sativa), the reference monocotyledonous species. Using domain composition and sequence similarity, these rice GTs were classified into 40 CAZy families plus an additional unknown class. We found that two Pfam domains of unknown function, PF04577 and PF04646, are associated with GT families GT61 and GT31, respectively. To facilitate functional analysis of this important and large gene family, we created a phylogenomic Rice GT Database (http://ricephylogenomics.ucdavis.edu/cellwalls/gt/). Through the database, several classes of functional genomic data, including mutant lines and gene expression data, can be displayed for each rice GT in the context of a phylogenetic tree, allowing for comparative analysis both within and between GT families. Comprehensive digital expression analysis of public gene expression data revealed that most ( approximately 80%) rice GTs are expressed. Based on analysis with Inparanoid, we identified 282 'rice-diverged' GTs that lack orthologs in sequenced dicots (Arabidopsis thaliana, Populus tricocarpa, Medicago truncatula, and Ricinus communis). Combining these analyses, we identified 33 rice-diverged GT genes (45 gene models) that are highly expressed in above-ground, vegetative tissues. From the literature and this analysis, 21 of these loci are excellent targets for functional examination toward understanding and manipulating grass cell wall qualities. Study of the remainder may reveal aspects of hormone and protein metabolism that are critical for rice biology. This list of 33 genes and the Rice GT Database will facilitate the study of GTs and cell wall synthesis in rice and other plants.

Collapse

249

Chatr-aryamontri A, Kerrien S, Khadake J, Orchard S, Ceol A, Licata L, Castagnoli L, Costa S, Derow C, Huntley R, Aranda B, Leroy C, Thorneycroft D, Apweiler R, Cesareni G, Hermjakob H. MINT and IntAct contribute to the Second BioCreative challenge: serving the text-mining community with high quality molecular interaction data. Genome Biol 2008;9 Suppl 2:S5. [PMID: 18834496 PMCID: PMC2559989 DOI: 10.1186/gb-2008-9-s2-s5] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open

Abstract

Background

In the absence of consolidated pipelines to archive biological data electronically, information dispersed in the literature must be captured by manual annotation. Unfortunately, manual annotation is time consuming and the coverage of published interaction data is therefore far from complete. The use of text-mining tools to identify relevant publications and to assist in the initial information extraction could help to improve the efficiency of the curation process and, as a consequence, the database coverage of data available in the literature. The 2006 BioCreative competition was aimed at evaluating text-mining procedures in comparison with manual annotation of protein-protein interactions.

Results

To aid the BioCreative protein-protein interaction task, IntAct and MINT (Molecular INTeraction) provided both the training and the test datasets. Data from both databases are comparable because they were curated according to the same standards. During the manual curation process, the major cause of data loss in mining the articles for information was ambiguity in the mapping of the gene names to stable UniProtKB database identifiers. It was also observed that most of the information about interactions was contained only within the full-text of the publication; hence, text mining of protein-protein interaction data will require the analysis of the full-text of the articles and cannot be restricted to the abstract.

Conclusion

The development of text-mining tools to extract protein-protein interaction information may increase the literature coverage achieved by manual curation. To support the text-mining community, databases will highlight those sentences within the articles that describe the interactions. These will supply data-miners with a high quality dataset for algorithm development. Furthermore, the dictionary of terms created by the BioCreative competitors could enrich the synonym list of the PSI-MI (Proteomics Standards Initiative-Molecular Interactions) controlled vocabulary, which is used by both databases to annotate their data content.

Collapse

250

Mulvenna J, Hamilton B, Nagaraj SH, Smyth D, Loukas A, Gorman JJ. Proteomics analysis of the excretory/secretory component of the blood-feeding stage of the hookworm, Ancylostoma caninum. Mol Cell Proteomics 2008;8:109-21. [PMID: 18753127 DOI: 10.1074/mcp.m800206-mcp200] [Citation(s) in RCA: 146] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Abstract

Hookworms are blood-feeding intestinal parasites of mammalian hosts and are one of the major human ailments affecting approximately 600 million people worldwide. These parasites form an intimate association with the host and are able to avoid vigorous immune responses in many ways including skewing of the response phenotype to promote parasite survival and longevity. The primary interface between the parasite and the host is the excretory/secretory component, a complex mixture of proteins, carbohydrates, and lipids secreted from the surface or oral openings of the parasite. The composition of this complex mixture is for the most part unknown but is likely to contain proteins important for the parasitic lifestyle and hence suitable as drug or vaccine targets. Using a strategy combining the traditional technology of one-dimensional SDS-PAGE and the newer fractionation technology of OFFGEL electrophoresis we identified 105 proteins from the excretory/secretory products of the blood-feeding stage of the dog hookworm, Ancylostoma caninum. Highly represented among the identified proteins were lectins, including three C-type lectins and three beta-galactoside-specific S-type galectins, as well as a number of proteases belonging to the three major classes found in nematodes, aspartic, cysteine, and metalloproteases. Interestingly 28% of the identified proteins were homologous to activation-associated secreted proteins, a family of cysteine-rich secreted proteins belonging to the sterol carrier protein/Tpx-1/Ag5/PR-1/Sc-7 (TAPS) superfamily. Thirty-four of these proteins were identified suggesting an important role in host-parasite interactions. Other protein families identified included hyaluronidases, lysozyme-like proteins, and transthyretin-like proteins. This work identified a suite of proteins important for the parasitic lifestyle and provides new insight into the biology of hookworm infection.

Collapse