Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Buillard V, Cerutti L, Copley R, Courcelle E, Das U, Daugherty L, Dibley M, Finn R, Fleischmann W, Gough J, Haft D, Hulo N, Hunter S, Kahn D, Kanapin A, Kejariwal A, Labarga A, Langendijk-Genevaux PS, Lonsdale D, Lopez R, Letunic I, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Nikolskaya AN, Orchard S, Orengo C, Petryszak R, Selengut JD, Sigrist CJA, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. New developments in the InterPro database. Nucleic Acids Res 2007;35:D224-8. [PMID: 17202162 PMCID: PMC1899100 DOI: 10.1093/nar/gkl841] [Citation(s) in RCA: 349] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Revised: 10/06/2006] [Accepted: 10/06/2006] [Indexed: 11/14/2022] Open

For:	Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Buillard V, Cerutti L, Copley R, Courcelle E, Das U, Daugherty L, Dibley M, Finn R, Fleischmann W, Gough J, Haft D, Hulo N, Hunter S, Kahn D, Kanapin A, Kejariwal A, Labarga A, Langendijk-Genevaux PS, Lonsdale D, Lopez R, Letunic I, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Nikolskaya AN, Orchard S, Orengo C, Petryszak R, Selengut JD, Sigrist CJA, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C. New developments in the InterPro database. Nucleic Acids Res 2007;35:D224-8. [PMID: 17202162 PMCID: PMC1899100 DOI: 10.1093/nar/gkl841] [Citation(s) in RCA: 349] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2006] [Revised: 10/06/2006] [Accepted: 10/06/2006] [Indexed: 11/14/2022] Open

Number

Cited by Other Article(s)

Janda JO, Popal A, Bauer J, Busch M, Klocke M, Spitzer W, Keller J, Merkl R. H2rs: deducing evolutionary and functionally important residue positions by means of an entropy and similarity based analysis of multiple sequence alignments. BMC Bioinformatics 2014;15:118. [PMID: 24766829 PMCID: PMC4021312 DOI: 10.1186/1471-2105-15-118] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2014] [Accepted: 04/17/2014] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The identification of functionally important residue positions is an important task of computational biology. Methods of correlation analysis allow for the identification of pairs of residue positions, whose occupancy is mutually dependent due to constraints imposed by protein structure or function. A common measure assessing these dependencies is the mutual information, which is based on Shannon's information theory that utilizes probabilities only. Consequently, such approaches do not consider the similarity of residue pairs, which may degrade the algorithm's performance. One typical algorithm is H2r, which characterizes each individual residue position k by the conn(k)-value, which is the number of significantly correlated pairs it belongs to.

RESULTS

To improve specificity of H2r, we developed a revised algorithm, named H2rs, which is based on the von Neumann entropy (vNE). To compute the corresponding mutual information, a matrix A is required, which assesses the similarity of residue pairs. We determined A by deducing substitution frequencies from contacting residue pairs observed in the homologs of 35 809 proteins, whose structure is known. In analogy to H2r, the enhanced algorithm computes a normalized conn(k)-value. Within the framework of H2rs, only statistically significant vNE values were considered. To decide on significance, the algorithm calculates a p-value by performing a randomization test for each individual pair of residue positions. The analysis of a large in silico testbed demonstrated that specificity and precision were higher for H2rs than for H2r and two other methods of correlation analysis. The gain in prediction quality is further confirmed by a detailed assessment of five well-studied enzymes. The outcome of H2rs and of a method that predicts contacting residue positions (PSICOV) overlapped only marginally. H2rs can be downloaded from http://www-bioinf.uni-regensburg.de.

CONCLUSIONS

Considering substitution frequencies for residue pairs by means of the von Neumann entropy and a p-value improved the success rate in identifying important residue positions. The integration of proven statistical concepts and normalization allows for an easier comparison of results obtained with different proteins. Comparing the outcome of the local method H2rs and of the global method PSICOV indicates that such methods supplement each other and have different scopes of application.

Collapse

Computational prediction of protein function based on weighted mapping of domains and GO terms. BIOMED RESEARCH INTERNATIONAL 2014;2014:641469. [PMID: 24868539 PMCID: PMC4017789 DOI: 10.1155/2014/641469] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2013] [Accepted: 03/12/2014] [Indexed: 11/17/2022]

Mesiti M, Re M, Valentini G. Think globally and solve locally: secondary memory-based network learning for automated multi-species function prediction. Gigascience 2014;3:5. [PMID: 24843788 PMCID: PMC4006453 DOI: 10.1186/2047-217x-3-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Accepted: 04/01/2014] [Indexed: 01/08/2023] Open

Abstract

Background

Network-based learning algorithms for automated function prediction (AFP) are negatively affected by the limited coverage of experimental data and limited a priori known functional annotations. As a consequence their application to model organisms is often restricted to well characterized biological processes and pathways, and their effectiveness with poorly annotated species is relatively limited. A possible solution to this problem might consist in the construction of big networks including multiple species, but this in turn poses challenging computational problems, due to the scalability limitations of existing algorithms and the main memory requirements induced by the construction of big networks. Distributed computation or the usage of big computers could in principle respond to these issues, but raises further algorithmic problems and require resources not satisfiable with simple off-the-shelf computers.

Results

We propose a novel framework for scalable network-based learning of multi-species protein functions based on both a local implementation of existing algorithms and the adoption of innovative technologies: we solve “locally” the AFP problem, by designing “vertex-centric” implementations of network-based algorithms, but we do not give up thinking “globally” by exploiting the overall topology of the network. This is made possible by the adoption of secondary memory-based technologies that allow the efficient use of the large memory available on disks, thus overcoming the main memory limitations of modern off-the-shelf computers. This approach has been applied to the analysis of a large multi-species network including more than 300 species of bacteria and to a network with more than 200,000 proteins belonging to 13 Eukaryotic species. To our knowledge this is the first work where secondary-memory based network analysis has been applied to multi-species function prediction using biological networks with hundreds of thousands of proteins.

Conclusions

The combination of these algorithmic and technological approaches makes feasible the analysis of large multi-species networks using ordinary computers with limited speed and primary memory, and in perspective could enable the analysis of huge networks (e.g. the whole proteomes available in SwissProt), using well-equipped stand-alone machines.

Collapse

Muñoz-Mérida A, Viguera E, Claros MG, Trelles O, Pérez-Pulido AJ. Sma3s: a three-step modular annotator for large sequence datasets. DNA Res 2014;21:341-53. [PMID: 24501397 PMCID: PMC4131829 DOI: 10.1093/dnares/dsu001] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open

Yi F, Xie S, Liu Y, Qi X, Yu J. Genome-wide characterization of microRNA in foxtail millet (Setaria italica). BMC PLANT BIOLOGY 2013;13:212. [PMID: 24330712 PMCID: PMC3878754 DOI: 10.1186/1471-2229-13-212] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Accepted: 11/27/2013] [Indexed: 05/23/2023]

Hirakawa H, Shirasawa K, Kosugi S, Tashiro K, Nakayama S, Yamada M, Kohara M, Watanabe A, Kishida Y, Fujishiro T, Tsuruoka H, Minami C, Sasamoto S, Kato M, Nanri K, Komaki A, Yanagi T, Guoxin Q, Maeda F, Ishikawa M, Kuhara S, Sato S, Tabata S, Isobe SN. Dissection of the octoploid strawberry genome by deep sequencing of the genomes of Fragaria species. DNA Res 2013;21:169-81. [PMID: 24282021 PMCID: PMC3989489 DOI: 10.1093/dnares/dst049] [Citation(s) in RCA: 130] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

An innovative portal for rare genetic diseases research: the semantic Diseasecard. J Biomed Inform 2013;46:1108-15. [PMID: 23973272 DOI: 10.1016/j.jbi.2013.08.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Revised: 06/26/2013] [Accepted: 08/13/2013] [Indexed: 12/17/2022]

Abstract

Advances in "omics" hardware and software technologies are bringing rare diseases research back from the sidelines. Whereas in the past these disorders were seldom considered relevant, in the era of whole genome sequencing the direct connections between rare phenotypes and a reduced set of genes are of vital relevance. This increased interest in rare genetic diseases research is pushing forward investment and effort towards the creation of software in the field, and leveraging the wealth of available life sciences data. Alas, most of these tools target one or more rare diseases, are focused solely on a single type of user, or are limited to the most relevant scientific breakthroughs for a specific niche. Furthermore, despite some high quality efforts, the ever-growing number of resources, databases, services and applications is still a burden to this area. Hence, there is a clear interest in new strategies to deliver a holistic perspective over the entire rare genetic diseases research domain. This is Diseasecard's reasoning, to build a true lightweight knowledge base covering rare genetic diseases. Developed with the latest semantic web technologies, this portal delivers unified access to a comprehensive network for researchers, clinicians, patients and bioinformatics developers. With in-context access covering over 20 distinct heterogeneous resources, Diseasecard's workspace provides access to the most relevant scientific knowledge regarding a given disorder, whether through direct common identifiers or through full-text search over all connected resources. In addition to its user-oriented features, Diseasecard's semantic knowledge base is also available for direct querying, enabling everyone to include rare genetic diseases knowledge in new or existing information systems. Diseasecard is publicly available at http://bioinformatics.ua.pt/diseasecard/.

Collapse

Fujinami S, Takarada H, Kasai H, Sekine M, Omata S, Harada T, Fukai R, Hosoyama A, Horikawa H, Kato Y, Nakazawa H, Fujita N. Complete genome sequence of Ilumatobacter coccineum YM16-304(T.). Stand Genomic Sci 2013;8:430-40. [PMID: 24501628 PMCID: PMC3910706 DOI: 10.4056/sigs.4007734] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Genotypic and phenotypic versatility of Aspergillus flavus during maize exploitation. PLoS One 2013;8:e68735. [PMID: 23894339 PMCID: PMC3716879 DOI: 10.1371/journal.pone.0068735] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2013] [Accepted: 05/31/2013] [Indexed: 11/19/2022] Open

Shiraishi A, Niijima S, Brown JB, Nakatsui M, Okuno Y. Chemical genomics approach for GPCR-ligand interaction prediction and extraction of ligand binding determinants. J Chem Inf Model 2013;53:1253-62. [PMID: 23721295 DOI: 10.1021/ci300515z] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]

Peng FY, Weselake RJ. Genome-wide identification and analysis of the B3 superfamily of transcription factors in Brassicaceae and major crop plants. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2013;126:1305-19. [PMID: 23377560 DOI: 10.1007/s00122-013-2054-4] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2012] [Accepted: 01/09/2013] [Indexed: 05/04/2023]

Zhang Y, Li Q, Huang W, Zhang J, Han Z, Wei H, Cui J, Wang Y, Yan W. Increased expression of apoptosis-related protein 3 is highly associated with tumorigenesis and progression of cervical squamous cell carcinoma. Hum Pathol 2013;44:388-93. [DOI: 10.1016/j.humpath.2012.05.028] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/20/2012] [Revised: 05/23/2012] [Accepted: 05/25/2012] [Indexed: 02/03/2023]

Katano Y, Fujinami S, Kawakoshi A, Nakazawa H, Oji S, Iino T, Oguchi A, Ankai A, Fukui S, Terui Y, Kamata S, Harada T, Tanikawa S, Suzuki KI, Fujita N. Complete genome sequence of Oscillibacter valericigenes Sjm18-20(T) (=NBRC 101213(T)). Stand Genomic Sci 2013;6:406-14. [PMID: 23408234 PMCID: PMC3558957 DOI: 10.4056/sigs.2826118] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Koo HJ, McDowell ET, Ma X, Greer KA, Kapteyn J, Xie Z, Descour A, Kim H, Yu Y, Kudrna D, Wing RA, Soderlund CA, Gang DR. Ginger and turmeric expressed sequence tags identify signature genes for rhizome identity and development and the biosynthesis of curcuminoids, gingerols and terpenoids. BMC PLANT BIOLOGY 2013;13:27. [PMID: 23410187 PMCID: PMC3608961 DOI: 10.1186/1471-2229-13-27] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2012] [Accepted: 02/11/2013] [Indexed: 05/23/2023]

Abstract

BACKGROUND

Ginger (Zingiber officinale) and turmeric (Curcuma longa) accumulate important pharmacologically active metabolites at high levels in their rhizomes. Despite their importance, relatively little is known regarding gene expression in the rhizomes of ginger and turmeric.

RESULTS

In order to identify rhizome-enriched genes and genes encoding specialized metabolism enzymes and pathway regulators, we evaluated an assembled collection of expressed sequence tags (ESTs) from eight different ginger and turmeric tissues. Comparisons to publicly available sorghum rhizome ESTs revealed a total of 777 gene transcripts expressed in ginger/turmeric and sorghum rhizomes but apparently absent from other tissues. The list of rhizome-specific transcripts was enriched for genes associated with regulation of tissue growth, development, and transcription. In particular, transcripts for ethylene response factors and AUX/IAA proteins appeared to accumulate in patterns mirroring results from previous studies regarding rhizome growth responses to exogenous applications of auxin and ethylene. Thus, these genes may play important roles in defining rhizome growth and development. Additional associations were made for ginger and turmeric rhizome-enriched MADS box transcription factors, their putative rhizome-enriched homologs in sorghum, and rhizomatous QTLs in rice. Additionally, analysis of both primary and specialized metabolism genes indicates that ginger and turmeric rhizomes are primarily devoted to the utilization of leaf supplied sucrose for the production and/or storage of specialized metabolites associated with the phenylpropanoid pathway and putative type III polyketide synthase gene products. This finding reinforces earlier hypotheses predicting roles of this enzyme class in the production of curcuminoids and gingerols.

CONCLUSION

A significant set of genes were found to be exclusively or preferentially expressed in the rhizome of ginger and turmeric. Specific transcription factors and other regulatory genes were found that were common to the two species and that are excellent candidates for involvement in rhizome growth, differentiation and development. Large classes of enzymes involved in specialized metabolism were also found to have apparent tissue-specific expression, suggesting that gene expression itself may play an important role in regulating metabolite production in these plants.

Collapse

Affiliation(s)

Hyun Jo Koo School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Present address: Salk Institute for Biological Studies, PO Box 85800, San Diego, CA, 92186, USA
Eric T McDowell School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA
Xiaoqiang Ma School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Present address: XenoBiotic Laboratories, Inc., Morgan Ln 107, Plainsboro, NJ, 08536, USA
Kevin A Greer Arizona Genomics Computational Laboratory and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Present address: Department of Surgery, College of Medicine, The University of Arizona, Tucson, AZ, 85724, USA
Jeremy Kapteyn School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA
Zhengzhi Xie School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Department of Pharmaceutical Sciences, The University of Arizona, Tucson, AZ, 85721, USA Present address: Division of Cardiovascular Medicine, University of Louisville, Louisville, KY, 40202, USA
Anne Descour Arizona Genomics Computational Laboratory and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA
HyeRan Kim School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Arizona Genomics Institute, The University of Arizona, Tucson, AZ, 85721, USA Present address: Plant Genome Research Center, KRIBB, Daejeon, 305-803, South Korea
Yeisoo Yu School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Arizona Genomics Institute, The University of Arizona, Tucson, AZ, 85721, USA
David Kudrna School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Arizona Genomics Institute, The University of Arizona, Tucson, AZ, 85721, USA
Rod A Wing School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Arizona Genomics Institute, The University of Arizona, Tucson, AZ, 85721, USA
Carol A Soderlund Arizona Genomics Computational Laboratory and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA
David R Gang School of Plant Sciences and BIO5 Institute, The University of Arizona, Tucson, AZ, 85721, USA Institute of Biological Chemistry, Washington State University, Pullman, WA, 99164, USA Institute of Biological Chemistry, Washington State University, P.O. Box 646340, Pullman, WA, 99164-6340, USA

Collapse

Blandin G, Marchand S, Charton K, Danièle N, Gicquel E, Boucheteil JB, Bentaib A, Barrault L, Stockholm D, Bartoli M, Richard I. A human skeletal muscle interactome centered on proteins involved in muscular dystrophies: LGMD interactome. Skelet Muscle 2013;3:3. [PMID: 23414517 PMCID: PMC3610214 DOI: 10.1186/2044-5040-3-3] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Accepted: 02/07/2013] [Indexed: 02/01/2023] Open

Intricate interplay between astrocytes and motor neurons in ALS. Proc Natl Acad Sci U S A 2013;110:E756-65. [PMID: 23388633 DOI: 10.1073/pnas.1222361110] [Citation(s) in RCA: 104] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Renier S, Chambon C, Viala D, Chagnot C, Hébraud M, Desvaux M. Exoproteomic analysis of the SecA2-dependent secretion in Listeria monocytogenes EGD-e. J Proteomics 2013;80:183-95. [PMID: 23291529 DOI: 10.1016/j.jprot.2012.11.027] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2012] [Revised: 11/12/2012] [Accepted: 11/29/2012] [Indexed: 12/21/2022]

Menon R, Gasser RB, Mitreva M, Ranganathan S. An analysis of the transcriptome of Teladorsagia circumcincta: its biological and biotechnological implications. BMC Genomics 2012;13 Suppl 7:S10. [PMID: 23282110 PMCID: PMC3521389 DOI: 10.1186/1471-2164-13-s7-s10] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open

Abstract

BACKGROUND

Teladorsagia circumcincta (order Strongylida) is an economically important parasitic nematode of small ruminants (including sheep and goats) in temperate climatic regions of the world. Improved insights into the molecular biology of this parasite could underpin alternative methods required to control this and related parasites, in order to circumvent major problems associated with anthelmintic resistance. The aims of the present study were to define the transcriptome of the adult stage of T. circumcincta and to infer the main pathways linked to molecules known to be expressed in this nematode. Since sheep develop acquired immunity against T. circumcincta, there is some potential for the development of a vaccine against this parasite. Hence, we infer excretory/secretory molecules for T. circumcincta as possible immunogens and vaccine candidates.

RESULTS

A total of 407,357 ESTs were assembled yielding 39,852 putative gene sequences. Conceptual translation predicted 24,013 proteins, which were then subjected to detailed annotation which included pathway mapping of predicted proteins (including 112 excreted/secreted [ES] and 226 transmembrane peptides), domain analysis and GO annotation was carried out using InterProScan along with BLAST2GO. Further analysis was carried out for secretory signal peptides using SignalP and non-classical sec pathway using SecretomeP tools. For ES proteins, key pathways, including Fc epsilon RI, T cell receptor, and chemokine signalling as well as leukocyte transendothelial migration were inferred to be linked to immune responses, along with other pathways related to neurodegenerative diseases and infectious diseases, which warrant detailed future studies. KAAS could identify new and updated pathways like phagosome and protein processing in endoplasmic reticulum. Domain analysis for the assembled dataset revealed families of serine, cysteine and proteinase inhibitors which might represent targets for parasite intervention. InterProScan could identify GO terms pertaining to the extracellular region. Some of the important domain families identified included the SCP-like extracellular proteins which belong to the pathogenesis-related proteins (PRPs) superfamily along with C-type lectin, saposin-like proteins. The 'extracellular region' that corresponds to allergen V5/Tpx-1 related, considered important in parasite-host interactions, was also identified. Six cysteine motif (SXC1) proteins, transthyretin proteins, C-type lectins, activation-associated secreted proteins (ASPs), which could represent potential candidates for developing novel anthelmintics or vaccines were few other important findings. Of these, SXC1, protein kinase domain-containing protein, trypsin family protein, trypsin-like protease family member (TRY-1), putative major allergen and putative lipid binding protein were identified which have not been reported in the published T. circumcincta proteomics analysis. Detailed analysis of 6,058 raw EST sequences from dbEST revealed 315 putatively secreted proteins. Amongst them, C-type single domain activation associated secreted protein ASP3 precursor, activation-associated secreted proteins (ASP-like protein), cathepsin B-like cysteine protease, cathepsin L cysteine protease, cysteine protease, TransThyretin-Related and Venom-Allergen-like proteins were the key findings.

CONCLUSIONS

We have annotated a large dataset ESTs of T. circumcincta and undertaken detailed comparative bioinformatics analyses. The results provide a comprehensive insight into the molecular biology of this parasite and disease manifestation which provides potential focal point for future research. We identified a number of pathways responsible for immune response. This type of large-scale computational scanning could be coupled with proteomic and metabolomic studies of this parasite leading to novel therapeutic intervention and disease control strategies. We have also successfully affirmed the use of bioinformatics tools, for the study of ESTs, which could now serve as a benchmark for the development of new computational EST analysis pipelines.

Collapse

Wang XR, Moreno YA, Wu HR, Ma C, Li YF, Zhang JA, Yang C, Sun S, Ma WJ, Geary TG. Proteomic profiles of soluble proteins from the esophageal gland in female Meloidogyne incognita. Int J Parasitol 2012;42:1177-83. [PMID: 23142006 DOI: 10.1016/j.ijpara.2012.10.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2012] [Revised: 10/09/2012] [Accepted: 10/10/2012] [Indexed: 12/17/2022]

A Novel Type III Endosome Transmembrane Protein, TEMP. Cells 2012;1:1029-44. [PMID: 24710541 PMCID: PMC3901140 DOI: 10.3390/cells1041029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2012] [Revised: 10/26/2012] [Accepted: 10/30/2012] [Indexed: 12/18/2022] Open

Galetto CD, Izaguirre MF, Bessone V, Casco VH. Isolation and nucleotide sequence analysis of the of Rhinella arenarum β-catenin: an mRNA and protein expression study during the larval stages of the digestive tract development. Gene 2012;511:256-64. [PMID: 23000021 DOI: 10.1016/j.gene.2012.09.030] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Revised: 05/16/2012] [Accepted: 09/05/2012] [Indexed: 12/18/2022]

Abstract

β-catenin interacts with several proteins mediating key biological processes, such as cadherin-mediated cell-cell adhesion as well as signal transduction. This work was done to establish the molecular basis and regulation of the formation pattern of cadherin/β-catenin-mediated adherens junctions, using an animal model of unknown gene sequence, the toad Rhinella arenarum. A Rhinella arenarum β-catenin homolog was isolated from larval tissue, their sequence compared and analyzed with those of eight other vertebrates using bioinformatics tools. The mRNA and protein expression levels of β-catenin were determined during the development of Rhinella arenarum digestive tract both by Reverse Transcriptase-Polymerase Chain Reaction (RT-PCR) and immunohistochemistry-morphometry respectively. Using Xenopus laevis frog specific primers, a fragment 539 bp of Rhinella arenarum toad β-catenin cDNA was obtained and sequenced. The resulting putative sequence of 177 amino acids showed high similarity at the amino acid level (97%) when compared to other six vertebrates (Xenopus laevis, Xenopus tropicalis, Mus musculus, Rattus norvegicus, Bos taurus and Homo sapiens), with sequences and structural domains characteristic of catenins. Subsequently, using primers specifically designed for Rhinella arenarum nucleotide sequence, β-catenin-mRNA increasing levels were found during the Rhinella arenarum metamorphosis. Finally, increasing β-catenin protein expression during development has confirmed the specificity the detection of Rhinella arenarum β-catenin. Summarizing, we have isolated and sequenced a β-catenin-homologue sequence from the Rhinella arenarum toad, which is highly conserved between species, and following we have detected β-catenin mRNA and protein levels during their digestive tract development.

Collapse

Messih MA, Chitale M, Bajic VB, Kihara D, Gao X. Protein domain recurrence and order can enhance prediction of protein functions. Bioinformatics 2012;28:i444-i450. [PMID: 22962465 PMCID: PMC3436825 DOI: 10.1093/bioinformatics/bts398] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

Ontological Analysis and Pathway Modelling in Drug Discovery. Pharmaceut Med 2012. [DOI: 10.1007/bf03256689] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Kawakoshi A, Nakazawa H, Fukada J, Sasagawa M, Katano Y, Nakamura S, Hosoyama A, Sasaki H, Ichikawa N, Hanada S, Kamagata Y, Nakamura K, Yamazaki S, Fujita N. Deciphering the genome of polyphosphate accumulating actinobacterium Microlunatus phosphovorus. DNA Res 2012;19:383-94. [PMID: 22923697 PMCID: PMC3473371 DOI: 10.1093/dnares/dss020] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Renier S, Micheau P, Talon R, Hébraud M, Desvaux M. Subcellular localization of extracytoplasmic proteins in monoderm bacteria: rational secretomics-based strategy for genomic and proteomic analyses. PLoS One 2012;7:e42982. [PMID: 22912771 PMCID: PMC3415414 DOI: 10.1371/journal.pone.0042982] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2012] [Accepted: 07/13/2012] [Indexed: 11/20/2022] Open

Characterization of microRNAs expression during maize seed development. BMC Genomics 2012;13:360. [PMID: 22853295 PMCID: PMC3468377 DOI: 10.1186/1471-2164-13-360] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2012] [Accepted: 07/09/2012] [Indexed: 12/21/2022] Open

Geary J, Satti M, Moreno Y, Madrill N, Whitten D, Headley SA, Agnew D, Geary T, Mackenzie C. First analysis of the secretome of the canine heartworm, Dirofilaria immitis. Parasit Vectors 2012;5:140. [PMID: 22781075 PMCID: PMC3439246 DOI: 10.1186/1756-3305-5-140] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 06/13/2012] [Indexed: 12/18/2022] Open

The yak genome and adaptation to life at high altitude. Nat Genet 2012;44:946-9. [DOI: 10.1038/ng.2343] [Citation(s) in RCA: 540] [Impact Index Per Article: 45.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Accepted: 06/06/2012] [Indexed: 01/17/2023]

He QF, Li D, Xu QY, Zheng S. Predicted essential proteins of Plasmodium falciparum for potential drug targets. ASIAN PAC J TROP MED 2012;5:352-4. [PMID: 22546649 DOI: 10.1016/s1995-7645(12)60057-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2012] [Revised: 03/15/2012] [Accepted: 04/15/2012] [Indexed: 11/16/2022] Open

A systematic comparison of genome-scale clustering algorithms. BMC Bioinformatics 2012;13 Suppl 10:S7. [PMID: 22759431 PMCID: PMC3382433 DOI: 10.1186/1471-2105-13-s10-s7] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Abstract

Background

A wealth of clustering algorithms has been applied to gene co-expression experiments. These algorithms cover a broad range of approaches, from conventional techniques such as k-means and hierarchical clustering, to graphical approaches such as k-clique communities, weighted gene co-expression networks (WGCNA) and paraclique. Comparison of these methods to evaluate their relative effectiveness provides guidance to algorithm selection, development and implementation. Most prior work on comparative clustering evaluation has focused on parametric methods. Graph theoretical methods are recent additions to the tool set for the global analysis and decomposition of microarray co-expression matrices that have not generally been included in earlier methodological comparisons. In the present study, a variety of parametric and graph theoretical clustering algorithms are compared using well-characterized transcriptomic data at a genome scale from Saccharomyces cerevisiae.

Methods

For each clustering method under study, a variety of parameters were tested. Jaccard similarity was used to measure each cluster's agreement with every GO and KEGG annotation set, and the highest Jaccard score was assigned to the cluster. Clusters were grouped into small, medium, and large bins, and the Jaccard score of the top five scoring clusters in each bin were averaged and reported as the best average top 5 (BAT5) score for the particular method.

Results

Clusters produced by each method were evaluated based upon the positive match to known pathways. This produces a readily interpretable ranking of the relative effectiveness of clustering on the genes. Methods were also tested to determine whether they were able to identify clusters consistent with those identified by other clustering methods.

Conclusions

Validation of clusters against known gene classifications demonstrate that for this data, graph-based techniques outperform conventional clustering approaches, suggesting that further development and application of combinatorial strategies is warranted.

Collapse

Reimand J, Hui S, Jain S, Law B, Bader GD. Domain-mediated protein interaction prediction: From genome to network. FEBS Lett 2012;586:2751-63. [PMID: 22561014 DOI: 10.1016/j.febslet.2012.04.027] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2012] [Accepted: 04/17/2012] [Indexed: 11/19/2022]

Wang MC, Chen FC, Chen YZ, Huang YT, Chuang TJ. LDGIdb: a database of gene interactions inferred from long-range strong linkage disequilibrium between pairs of SNPs. BMC Res Notes 2012;5:212. [PMID: 22551073 PMCID: PMC3441865 DOI: 10.1186/1756-0500-5-212] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2011] [Accepted: 04/26/2012] [Indexed: 12/22/2022] Open

Abstract

Background

Complex human diseases may be associated with many gene interactions. Gene interactions take several different forms and it is difficult to identify all of the interactions that are potentially associated with human diseases. One approach that may fill this knowledge gap is to infer previously unknown gene interactions via identification of non-physical linkages between different mutations (or single nucleotide polymorphisms, SNPs) to avoid hitchhiking effect or lack of recombination. Strong non-physical SNP linkages are considered to be an indication of biological (gene) interactions. These interactions can be physical protein interactions, regulatory interactions, functional compensation/antagonization or many other forms of interactions. Previous studies have shown that mutations in different genes can be linked to the same disorders. Therefore, non-physical SNP linkages, coupled with knowledge of SNP-disease associations may shed more light on the role of gene interactions in human disorders. A user-friendly web resource that integrates information about non-physical SNP linkages, gene annotations, SNP information, and SNP-disease associations may thus be a good reference for biomedical research.

Findings

Here we extracted the SNPs located within the promoter or exonic regions of protein-coding genes from the HapMap database to construct a database named the Linkage-Disequilibrium-based Gene Interaction database (LDGIdb). The database stores 646,203 potential human gene interactions, which are potential interactions inferred from SNP pairs that are subject to long-range strong linkage disequilibrium (LD), or non-physical linkages. To minimize the possibility of hitchhiking, SNP pairs inferred to be non-physically linked were required to be located in different chromosomes or in different LD blocks of the same chromosomes. According to the genomic locations of the involved SNPs (i.e., promoter, untranslated region (UTR) and coding region (CDS)), the SNP linkages inferred were categorized into promoter-promoter, promoter-UTR, promoter-CDS, CDS-CDS, CDS-UTR and UTR-UTR linkages. For the CDS-related linkages, the coding SNPs were further classified into nonsynonymous and synonymous variations, which represent potential gene interactions at the protein and RNA level, respectively. The LDGIdb also incorporates human disease-association databases such as Genome-Wide Association Studies (GWAS) and Online Mendelian Inheritance in Man (OMIM), so that the user can search for potential disease-associated SNP linkages. The inferred SNP linkages are also classified in the context of population stratification to provide a resource for investigating potential population-specific gene interactions.

Conclusion

The LDGIdb is a user-friendly resource that integrates non-physical SNP linkages and SNP-disease associations for studies of gene interactions in human diseases. With the help of the LDGIdb, it is plausible to infer population-specific SNP linkages for more focused studies, an avenue that is potentially important for pharmacogenetics. Moreover, by referring to disease-association information such as the GWAS data, the LDGIdb may help identify previously uncharacterized disease-associated gene interactions and potentially lead to new discoveries in studies of human diseases.

Keywords

Gene interaction, SNP, Linkage disequilibrium, Systems biology, Bioinformatics

Collapse

Aguileta G, Lengelle J, Chiapello H, Giraud T, Viaud M, Fournier E, Rodolphe F, Marthey S, Ducasse A, Gendrault A, Poulain J, Wincker P, Gout L. Genes under positive selection in a model plant pathogenic fungus, Botrytis. INFECTION GENETICS AND EVOLUTION 2012;12:987-96. [PMID: 22406010 DOI: 10.1016/j.meegid.2012.02.012] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2011] [Revised: 02/15/2012] [Accepted: 02/23/2012] [Indexed: 11/29/2022]

Chowdhary R, Tan SL, Pavesi G, Jin J, Dong D, Mathur SK, Burkart A, Narang V, Glurich I, Raby BA, Weiss ST, Wong L, Liu JS, Bajic VB. A database of annotated promoters of genes associated with common respiratory and related diseases. Am J Respir Cell Mol Biol 2012;47:112-9. [PMID: 22383585 DOI: 10.1165/rcmb.2011-0419oc] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open

Proteomic analysis of Plasmodium in the mosquito: progress and pitfalls. Parasitology 2012;139:1131-45. [PMID: 22336136 PMCID: PMC3417538 DOI: 10.1017/s0031182012000133] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Rao RU, Huang Y, Abubucker S, Heinz M, Crosby SD, Mitreva M, Weil GJ. Effects of doxycycline on gene expression in Wolbachia and Brugia malayi adult female worms in vivo. J Biomed Sci 2012;19:21. [PMID: 22321609 PMCID: PMC3352068 DOI: 10.1186/1423-0127-19-21] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2011] [Accepted: 02/09/2012] [Indexed: 12/28/2022] Open

Abstract

Background

Most filarial nematodes contain Wolbachia symbionts. The purpose of this study was to examine the effects of doxycycline on gene expression in Wolbachia and adult female Brugia malayi.

Methods

Brugia malayi infected gerbils were treated with doxycycline for 6-weeks. This treatment largely cleared Wolbachia and arrested worm reproduction. RNA recovered from treated and control female worms was labeled by random priming and hybridized to the Version 2- filarial microarray to obtain expression profiles.

Results and discussion

Results showed significant changes in expression for 200 Wolbachia (29% of Wolbachia genes with expression signals in untreated worms) and 546 B. malayi array elements after treatment. These elements correspond to known genes and also to novel genes with unknown biological functions. Most differentially expressed Wolbachia genes were down-regulated after treatment (98.5%). In contrast, doxycycline had a mixed effect on B. malayi gene expression with many more genes being significantly up-regulated after treatment (85% of differentially expressed genes). Genes and processes involved in reproduction (gender-regulated genes, collagen, amino acid metabolism, ribosomal processes, and cytoskeleton) were down-regulated after doxycycline while up-regulated genes and pathways suggest adaptations for survival in response to stress (energy metabolism, electron transport, anti-oxidants, nutrient transport, bacterial signaling pathways, and immune evasion).

Conclusions

Doxycycline reduced Wolbachia and significantly decreased bacterial gene expression. Wolbachia ribosomes are believed to be the primary biological target for doxycycline in filarial worms. B. malayi genes essential for reproduction, growth and development were also down-regulated; these changes are consistent with doxycycline effects on embryo development and reproduction. On the other hand, many B. malayi genes involved in energy production, electron-transport, metabolism, anti-oxidants, and others with unknown functions had increased expression signals after doxycycline treatment. These results suggest that female worms are able to compensate in part for the loss of Wolbachia so that they can survive, albeit without reproductive capacity. This study of doxycycline induced changes in gene expression has provided new clues regarding the symbiotic relationship between Wolbachia and B. malayi.

Collapse

Shahbaba B, Shachaf CM, Yu Z. A pathway analysis method for genome-wide association studies. Stat Med 2012;31:988-1000. [PMID: 22302470 DOI: 10.1002/sim.4477] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2011] [Revised: 10/20/2011] [Accepted: 11/02/2011] [Indexed: 12/20/2022]

Childs KL, Konganti K, Buell CR. The Biofuel Feedstock Genomics Resource: a web-based portal and database to enable functional genomics of plant biofuel feedstock species. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2012;2012:bar061. [PMID: 22250003 PMCID: PMC3259624 DOI: 10.1093/database/bar061] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Ellis JT, Sims RC, Miller CD. Monitoring microbial diversity of bioreactors using metagenomic approaches. Subcell Biochem 2012;64:73-94. [PMID: 23080246 DOI: 10.1007/978-94-007-5055-5_4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]

Generation and Analysis of Large-Scale Data-Driven Mycobacterium tuberculosis Functional Networks for Drug Target Identification. Adv Bioinformatics 2011;2011:801478. [PMID: 22190924 PMCID: PMC3235424 DOI: 10.1155/2011/801478] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Accepted: 08/28/2011] [Indexed: 11/18/2022] Open

Hamilton JP, Neeno-Eckwall EC, Adhikari BN, Perna NT, Tisserat N, Leach JE, Lévesque CA, Buell CR. The Comprehensive Phytopathogen Genomics Resource: a web-based resource for data-mining plant pathogen genomes. Database (Oxford) 2011;2011:bar053. [PMID: 22120664 PMCID: PMC3225079 DOI: 10.1093/database/bar053] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Affiliation(s)

John P. Hamilton Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada
Eric C. Neeno-Eckwall Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada
Bishwo N. Adhikari Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada
Nicole T. Perna Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada
Ned Tisserat Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada
Jan E. Leach Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada
C. André Lévesque Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada
C. Robin Buell Department of Plant Biology, 178 Wilson Lane, Michigan State University, East Lansing, MI, 48824, USA, Department of Genetics, 4434 Genetics-Biotech Center BLDG, 425 Henry Mall, University of Wisconsin, Madison, WI, 53706, USA, Department of Bioagricultural Sciences and Pest Management, Plant Science C129, Colorado State University, Fort Collins, CO, 80523–1177, USA, Agriculture and Agri-Food Canada, 960 Carling Ave., ON, K1A 0C6 and Department of Biology, Carleton University, ON, K1S 5B6, Ottawa, Canada

Collapse

Forslund K, Pekkari I, Sonnhammer ELL. Domain architecture conservation in orthologs. BMC Bioinformatics 2011;12:326. [PMID: 21819573 PMCID: PMC3215765 DOI: 10.1186/1471-2105-12-326] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2011] [Accepted: 08/05/2011] [Indexed: 11/16/2022] Open

Abstract

Background

As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence.

To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs.

Results

The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation.

The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion/deletion of new domains, while domain shuffling and segment duplication/deletion were very infrequent.

Conclusions

On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. This supports the notion that orthologs are functionally more similar than other types of homologs at the same evolutionary distance.

Collapse

Silkov A, Yoon Y, Lee H, Gokhale N, Adu-Gyamfi E, Stahelin RV, Cho W, Murray D. Genome-wide structural analysis reveals novel membrane binding properties of AP180 N-terminal homology (ANTH) domains. J Biol Chem 2011;286:34155-63. [PMID: 21828048 DOI: 10.1074/jbc.m111.265611] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

De Martino A, Bartual A, Willis A, Meichenin A, Villazán B, Maheswari U, Bowler C. Physiological and Molecular Evidence that Environmental Changes Elicit Morphological Interconversion in the Model Diatom Phaeodactylum tricornutum. Protist 2011;162:462-81. [DOI: 10.1016/j.protis.2011.02.002] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Accepted: 01/17/2011] [Indexed: 11/30/2022]

Airoldi EM, Heller KA, Silva R. Small sets of interacting proteins suggest functional linkage mechanisms via Bayesian analogical reasoning. Bioinformatics 2011;27:i374-82. [PMID: 21685095 PMCID: PMC3117334 DOI: 10.1093/bioinformatics/btr236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Abstract

MOTIVATION

Proteins and protein complexes coordinate their activity to execute cellular functions. In a number of experimental settings, including synthetic genetic arrays, genetic perturbations and RNAi screens, scientists identify a small set of protein interactions of interest. A working hypothesis is often that these interactions are the observable phenotypes of some functional process, which is not directly observable. Confirmatory analysis requires finding other pairs of proteins whose interaction may be additional phenotypical evidence about the same functional process. Extant methods for finding additional protein interactions rely heavily on the information in the newly identified set of interactions. For instance, these methods leverage the attributes of the individual proteins directly, in a supervised setting, in order to find relevant protein pairs. A small set of protein interactions provides a small sample to train parameters of prediction methods, thus leading to low confidence.

RESULTS

We develop RBSets, a computational approach to ranking protein interactions rooted in analogical reasoning; that is, the ability to learn and generalize relations between objects. Our approach is tailored to situations where the training set of protein interactions is small, and leverages the attributes of the individual proteins indirectly, in a Bayesian ranking setting that is perhaps closest to propensity scoring in mathematical psychology. We find that RBSets leads to good performance in identifying additional interactions starting from a small evidence set of interacting proteins, for which an underlying biological logic in terms of functional processes and signaling pathways can be established with some confidence. Our approach is scalable and can be applied to large databases with minimal computational overhead. Our results suggest that analogical reasoning within a Bayesian ranking problem is a promising new approach for real-time biological discovery.

AVAILABILITY

Java code is available at: www.gatsby.ucl.ac.uk/~rbas.

CONTACT

airoldi@fas.harvard.edu; kheller@mit.edu; ricardo@stats.ucl.ac.uk.

Collapse

NELL-1 binds to APR3 affecting human osteoblast proliferation and differentiation. FEBS Lett 2011;585:2410-8. [PMID: 21723284 DOI: 10.1016/j.febslet.2011.06.024] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2010] [Revised: 06/11/2011] [Accepted: 06/17/2011] [Indexed: 11/23/2022]

Wang Y, Wu W, Negre NN, White KP, Li C, Shah PK. Determinants of antigenicity and specificity in immune response for protein sequences. BMC Bioinformatics 2011;12:251. [PMID: 21693021 PMCID: PMC3133554 DOI: 10.1186/1471-2105-12-251] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2010] [Accepted: 06/21/2011] [Indexed: 11/22/2022] Open

Abstract

Background

Target specific antibodies are pivotal for the design of vaccines, immunodiagnostic tests, studies on proteomics for cancer biomarker discovery, identification of protein-DNA and other interactions, and small and large biochemical assays. Therefore, it is important to understand the properties of protein sequences that are important for antigenicity and to identify small peptide epitopes and large regions in the linear sequence of the proteins whose utilization result in specific antibodies.

Results

Our analysis using protein properties suggested that sequence composition combined with evolutionary information and predicted secondary structure, as well as solvent accessibility is sufficient to predict successful peptide epitopes. The antigenicity and the specificity in immune response were also found to depend on the epitope length. We trained the B-Cell Epitope Oracle (BEOracle), a support vector machine (SVM) classifier, for the identification of continuous B-Cell epitopes with these protein properties as learning features. The BEOracle achieved an F1-measure of 81.37% on a large validation set. The BEOracle classifier outperformed the classical methods based on propensity and sophisticated methods like BCPred and Bepipred for B-Cell epitope prediction. The BEOracle classifier also identified peptides for the ChIP-grade antibodies from the modENCODE/ENCODE projects with 96.88% accuracy. High BEOracle score for peptides showed some correlation with the antibody intensity on Immunofluorescence studies done on fly embryos. Finally, a second SVM classifier, the B-Cell Region Oracle (BROracle) was trained with the BEOracle scores as features to predict the performance of antibodies generated with large protein regions with high accuracy. The BROracle classifier achieved accuracies of 75.26-63.88% on a validation set with immunofluorescence, immunohistochemistry, protein arrays and western blot results from Protein Atlas database.

Conclusions

Together our results suggest that antigenicity is a local property of the protein sequences and that protein sequence properties of composition, secondary structure, solvent accessibility and evolutionary conservation are the determinants of antigenicity and specificity in immune response. Moreover, specificity in immune response could also be accurately predicted for large protein regions without the knowledge of the protein tertiary structure or the presence of discontinuous epitopes. The dataset prepared in this work and the classifier models are available for download at https://sites.google.com/site/oracleclassifiers/.

Collapse

Woo NS, Gordon MJ, Graham SR, Rossel JB, Badger MR, Pogson BJ. A mutation in the purine biosynthetic enzyme ATASE2 impacts high light signalling and acclimation responses in green and chlorotic sectors of Arabidopsis leaves. FUNCTIONAL PLANT BIOLOGY : FPB 2011;38:401-419. [PMID: 32480896 DOI: 10.1071/fp10218] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2010] [Accepted: 03/22/2011] [Indexed: 05/14/2023]

Panphut W, Senapin S, Sriurairatana S, Withyachumnarnkul B, Flegel TW. A novel integrase-containing element may interact with Laem-Singh virus (LSNV) to cause slow growth in giant tiger shrimp. BMC Vet Res 2011;7:18. [PMID: 21569542 PMCID: PMC3117699 DOI: 10.1186/1746-6148-7-18] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2010] [Accepted: 05/14/2011] [Indexed: 11/24/2022] Open

Abstract

Background

From 2001-2003 monodon slow growth syndrome (MSGS) caused severe economic losses for Thai shrimp farmers who cultivated the native, giant tiger shrimp, and this led them to adopt exotic stocks of the domesticated whiteleg shrimp as the species of cultivation choice, despite the higher value of giant tiger shrimp. In 2008, newly discovered Laem-Singh virus (LSNV) was proposed as a necessary but insufficient cause of MSGS, and this stimulated the search for the additional component cause(s) of MSGS in the hope that discovery would lead to preventative measures that could revive cultivation of the higher value native shrimp species.

Results

Using a universal shotgun cloning protocol, a novel RNA, integrase-containing element (ICE) was found in giant tiger shrimp from MSGS ponds (GenBank accession number FJ498866). In situ hybridization probes and RT-PCR tests revealed that ICE and Laem-Singh virus (LSNV) occurred together in lymphoid organs (LO) of shrimp from MSGS ponds but not in shrimp from normal ponds. Tissue homogenates of shrimp from MSGS ponds yielded a fraction that gave positive RT-PCR reactions for both ICE and LSNV and showed viral-like particles by transmission electron microscopy (TEM). Bioassays of this fraction with juvenile giant tiger shrimp resulted in retarded growth with gross signs of MSGS, and in situ hybridization assays revealed ICE and LSNV together in LO, eyes and gills. Viral-like particles similar to those seen in tissue extracts from natural infections were also seen by TEM.

Conclusions

ICE and LSNV were found together only in shrimp from MSGS ponds and only in shrimp showing gross signs of MSGS after injection with a preparation containing ICE and LSNV. ICE was never found in the absence of LSNV although LSNV was sometimes found in normal shrimp in the absence of ICE. The results suggest that ICE and LSNV may act together as component causes of MSGS, but this cannot be proven conclusively without single and combined bioassays using purified preparations of both ICE and LSNV. Despite this ambiguity, it is recommended in the interim that ICE be added to the agents such as LSNV already listed for exclusion from domesticated stocks of the black tiger shrimp.

Collapse

100

Cohen-Gihon I, Sharan R, Nussinov R. Processes of fungal proteome evolution and gain of function: gene duplication and domain rearrangement. Phys Biol 2011;8:035009. [PMID: 21572172 DOI: 10.1088/1478-3975/8/3/035009] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]