Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dimmer EC, Huntley RP, Alam-Faruque Y, Sawford T, O'Donovan C, Martin MJ, Bely B, Browne P, Mun Chan W, Eberhardt R, Gardner M, Laiho K, Legge D, Magrane M, Pichler K, Poggioli D, Sehra H, Auchincloss A, Axelsen K, Blatter MC, Boutet E, Braconi-Quintaje S, Breuza L, Bridge A, Coudert E, Estreicher A, Famiglietti L, Ferro-Rojas S, Feuermann M, Gos A, Gruaz-Gumowski N, Hinz U, Hulo C, James J, Jimenez S, Jungo F, Keller G, Lemercier P, Lieberherr D, Masson P, Moinat M, Pedruzzi I, Poux S, Rivoire C, Roechert B, Schneider M, Stutz A, Sundaram S, Tognolli M, Bougueleret L, Argoud-Puy G, Cusin I, Duek-Roggli P, Xenarios I, Apweiler R. The UniProt-GO Annotation database in 2011. Nucleic Acids Res 2011;40:D565-70. [PMID: 22123736 PMCID: PMC3245010 DOI: 10.1093/nar/gkr1048] [Citation(s) in RCA: 310] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

For:	Dimmer EC, Huntley RP, Alam-Faruque Y, Sawford T, O'Donovan C, Martin MJ, Bely B, Browne P, Mun Chan W, Eberhardt R, Gardner M, Laiho K, Legge D, Magrane M, Pichler K, Poggioli D, Sehra H, Auchincloss A, Axelsen K, Blatter MC, Boutet E, Braconi-Quintaje S, Breuza L, Bridge A, Coudert E, Estreicher A, Famiglietti L, Ferro-Rojas S, Feuermann M, Gos A, Gruaz-Gumowski N, Hinz U, Hulo C, James J, Jimenez S, Jungo F, Keller G, Lemercier P, Lieberherr D, Masson P, Moinat M, Pedruzzi I, Poux S, Rivoire C, Roechert B, Schneider M, Stutz A, Sundaram S, Tognolli M, Bougueleret L, Argoud-Puy G, Cusin I, Duek-Roggli P, Xenarios I, Apweiler R. The UniProt-GO Annotation database in 2011. Nucleic Acids Res 2011;40:D565-70. [PMID: 22123736 PMCID: PMC3245010 DOI: 10.1093/nar/gkr1048] [Citation(s) in RCA: 310] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

Number

Cited by Other Article(s)

201

Murakami Y, Matsumoto Y, Tsuru S, Ying BW, Yomo T. Global coordination in adaptation to gene rewiring. Nucleic Acids Res 2015;43:1304-16. [PMID: 25564530 PMCID: PMC4333410 DOI: 10.1093/nar/gku1366] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

202

The Little Known Universe of Short Proteins in Insects: A Machine Learning Approach. SHORT VIEWS ON INSECT GENOMICS AND PROTEOMICS 2015. [DOI: 10.1007/978-3-319-24235-4_8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

203

Zhao Y, Liu T, Luo J, Zhang Q, Xu S, Han C, Xu J, Chen M, Chen Y, Kong L. Integration of a Decrescent Transcriptome and Metabolomics Dataset of Peucedanum praeruptorum to Investigate the CYP450 and MDR Genes Involved in Coumarins Biosynthesis and Transport. FRONTIERS IN PLANT SCIENCE 2015;6:996. [PMID: 26697023 PMCID: PMC4674560 DOI: 10.3389/fpls.2015.00996] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Accepted: 10/30/2015] [Indexed: 05/09/2023]

Abstract

Peucedanum praeruptorum Dunn is well-known traditional Chinese medicine. However, little is known in the biosynthesis and the transport mechanisms of its coumarin compounds at the molecular level. Although transcriptomic sequence is playing an increasingly significant role in gene discovery, it is not sufficient in predicting the specific function of target gene. Furthermore, there is also a huge database to be analyzed. In this study, RNA sequencing assisted transcriptome dataset and high-performance liquid chromatography (HPLC) coupled with electrospray-ionization quadrupole time-of-flight mass spectrometry (Q-TOF MS)-based metabolomics dataset of P. praeruptorum were firstly constructed for gene discovery and compound identification. Subsequently, methyl jasmonate (MeJA)-induced gene expression analysis and metabolomics analysis were conducted to narrow-down the dataset for selecting the candidate genes and the potential marker metabolites. Finally, the genes involved in coumarins biosynthesis and transport were predicted with parallel analysis of transcript and metabolic profiles. As a result, a total of 40,952 unigenes and 19 coumarin compounds were obtained. Based on the results of gene expression and metabolomics analysis, 7 cytochrome-P450 and 8 multidrug resistance transporter unigenes were selected as candidate genes and 8 marker compounds were selected as biomarkers, respectively. The parallel analysis of gene expression and metabolites accumulation indicated that the gene labeled as 23,746, 228, and 30,922 were related to the formation of the coumarin core compounds whereas 36,276 and 9533 participated in the prenylation, hydroxylation, cyclization or structural modification. Similarly, 1462, 20,815, and 15,318 participated in the transport of coumarin core compounds while 124,029 and 324,293 participated in the transport of the modified compounds. This finding suggested that integration of a decrescent transcriptome and metabolomics dataset could largely narrow down the number of gene to be investigated and significantly improve the efficiency of functional gene predication. In addition, the large amount of transcriptomic data produced from P. praeruptorum and the genes discovered in this study would provide useful information in investigating the biosynthesis and transport mechanism of coumarins.

Collapse

204

Gene essentiality analysis based on DEG 10, an updated database of essential genes. Methods Mol Biol 2015;1279:219-33. [PMID: 25636622 DOI: 10.1007/978-1-4939-2398-4_14] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

205

Targeting the proteome of cellular fractions: focus on secreted proteins. Methods Mol Biol 2015;1243:29-41. [PMID: 25384738 DOI: 10.1007/978-1-4939-1872-0_2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

206

Fang X, Chen W, Zhao Y, Ruan S, Zhang H, Yan C, Jin L, Cao L, Zhu J, Ma H, Cheng Z. Global analysis of lysine acetylation in strawberry leaves. FRONTIERS IN PLANT SCIENCE 2015;6:739. [PMID: 26442052 PMCID: PMC4569977 DOI: 10.3389/fpls.2015.00739] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Accepted: 08/31/2015] [Indexed: 05/08/2023]

207

Biomedical ontologies—A review. Biocybern Biomed Eng 2015. [DOI: 10.1016/j.bbe.2014.06.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

208

Diament A, Pinter RY, Tuller T. Three-dimensional eukaryotic genomic organization is strongly correlated with codon usage expression and function. Nat Commun 2014;5:5876. [PMID: 25510862 DOI: 10.1038/ncomms6876] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2014] [Accepted: 11/17/2014] [Indexed: 01/08/2023] Open

209

Transcriptome sequencing reveals the virulence and environmental genetic programs of Vibrio vulnificus exposed to host and estuarine conditions. PLoS One 2014;9:e114376. [PMID: 25489854 PMCID: PMC4260858 DOI: 10.1371/journal.pone.0114376] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Accepted: 11/09/2014] [Indexed: 12/31/2022] Open

Abstract

Vibrio vulnificus is a natural inhabitant of estuarine waters worldwide and is of medical relevance due to its ability to cause grievous wound infections and/or fatal septicemia. Genetic polymorphisms within the virulence-correlated gene (vcg) serve as a primary feature to distinguish clinical (C-) genotypes from environmental (E-) genotypes. C-genotypes demonstrate superior survival in human serum relative to E-genotypes, and genome comparisons have allowed for the identification of several putative virulence factors that could potentially aid C-genotypes in disease progression. We used RNA sequencing to analyze the transcriptome of C-genotypes exposed to human serum relative to seawater, which revealed two divergent genetic programs under these two conditions. In human serum, cells displayed a distinct "virulence profile" in which a number of putative virulence factors were upregulated, including genes involved in intracellular signaling, substrate binding and transport, toxin and exoenzyme production, and the heat shock response. Conversely, the "environmental profile" exhibited by cells in seawater revealed upregulation of transcription factors such as rpoS, rpoN, and iscR, as well as genes involved in intracellular signaling, chemotaxis, adherence, and biofilm formation. This dichotomous genetic switch appears to be largely governed by cyclic-di-GMP signaling, and remarkably resembles the dual life-style of V. cholerae as it transitions from host to environment. Furthermore, we found a "general stress response" module, known as the stressosome, to be upregulated in seawater. This signaling system has been well characterized in Gram-positive bacteria, however its role in V. vulnificus is not clear. We examined temporal gene expression patterns of the stressosome and found it to be upregulated in natural estuarine waters indicating that this system plays a role in sensing and responding to the environment. This study advances our understanding of gene regulation in V. vulnificus, and brings to the forefront a number of previously overlooked genetic networks.

Collapse

210

Goldstone RJ, Popat R, Schuberth HJ, Sandra O, Sheldon IM, Smith DGE. Genomic characterisation of an endometrial pathogenic Escherichia coli strain reveals the acquisition of genetic elements associated with extra-intestinal pathogenicity. BMC Genomics 2014;15:1075. [PMID: 25481482 PMCID: PMC4298941 DOI: 10.1186/1471-2164-15-1075] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Accepted: 11/24/2014] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Strains of Escherichia coli cause a wide variety of intestinal and extra-intestinal diseases in both humans and animals, and are also often found in healthy individuals or the environment. Broadly, a strong phylogenetic relationship exists that distinguishes most E. coli causing intestinal disease from those that cause extra-intestinal disease, however, isolates within a recently described subclass of Extra-Intestinal Pathogenic E. coli (ExPEC), termed endometrial pathogenic E. coli, tend to be phylogenetically distant from the vast majority of characterised ExPECs, and more closely related to human intestinal pathogens. In this work, we investigate the genetic basis for ExPEC infection in the prototypic endometrial pathogenic E. coli strain MS499.

RESULTS

By investigating the genome of MS499 in comparison with a range of other E. coli sequences, we have discovered that this bacterium has acquired substantial lengths of DNA which encode factors more usually associated with ExPECs and less frequently found in the phylogroup relatives of MS499. Many of these acquired factors, including several iron acquisition systems and a virulence plasmid similar to that found in several ExPECs such as APEC O1 and the neonatal meningitis E. coli S88, play characterised roles in a variety of typical ExPEC infections and appear to have been acquired recently by the evolutionary lineage leading to MS499.

CONCLUSIONS

Taking advantage of the phylogenetic relationship we describe between MS499 and several other closely related E. coli isolates from across the globe, we propose a step-wise evolution of a novel clade of sequence type 453 ExPECs within phylogroup B1, involving the recruitment of ExPEC virulence factors into the genome of an ancestrally non-extraintestinal E. coli, which has repurposed this lineage with the capacity to cause extraintestinal disease. These data reveal the genetic components which may be involved in this phenotype switching, and argue that horizontal gene exchange may be a key factor in the emergence of novel lineages of ExPECs.

Collapse

211

Penarete-Vargas DM, Boisson A, Urbach S, Chantelauze H, Peyrottes S, Fraisse L, Vial HJ. A chemical proteomics approach for the search of pharmacological targets of the antimalarial clinical candidate albitiazolium in Plasmodium falciparum using photocrosslinking and click chemistry. PLoS One 2014;9:e113918. [PMID: 25470252 PMCID: PMC4254740 DOI: 10.1371/journal.pone.0113918] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2014] [Accepted: 10/31/2014] [Indexed: 11/18/2022] Open

212

Gene Ontology Consortium: going forward. Nucleic Acids Res 2014;43:D1049-56. [PMID: 25428369 PMCID: PMC4383973 DOI: 10.1093/nar/gku1179] [Citation(s) in RCA: 2174] [Impact Index Per Article: 217.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

213

Bennett L, Kittas A, Liu S, Papageorgiou LG, Tsoka S. Community structure detection for overlapping modules through mathematical programming in protein interaction networks. PLoS One 2014;9:e112821. [PMID: 25412367 PMCID: PMC4239042 DOI: 10.1371/journal.pone.0112821] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2014] [Accepted: 10/15/2014] [Indexed: 12/05/2022] Open

214

Altenhoff AM, Škunca N, Glover N, Train CM, Sueki A, Piližota I, Gori K, Tomiczek B, Müller S, Redestig H, Gonnet GH, Dessimoz C. The OMA orthology database in 2015: function predictions, better plant support, synteny view and other improvements. Nucleic Acids Res 2014;43:D240-9. [PMID: 25399418 PMCID: PMC4383958 DOI: 10.1093/nar/gku1158] [Citation(s) in RCA: 177] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

215

Day-Williams AG, Sun C, Jelcic I, McLaughlin H, Harris T, Martin R, Carulli JP. Whole Genome Sequencing Reveals a Chromosome 9p Deletion Causing DOCK8 Deficiency in an Adult Diagnosed with Hyper IgE Syndrome Who Developed Progressive Multifocal Leukoencephalopathy. J Clin Immunol 2014;35:92-6. [PMID: 25388448 PMCID: PMC4306731 DOI: 10.1007/s10875-014-0114-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2014] [Accepted: 10/23/2014] [Indexed: 11/27/2022]

216

Jung WY, Lee SS, Kim CW, Kim HS, Min SR, Moon JS, Kwon SY, Jeon JH, Cho HS. RNA-seq analysis and de novo transcriptome assembly of Jerusalem artichoke (Helianthus tuberosus Linne). PLoS One 2014;9:e111982. [PMID: 25375764 PMCID: PMC4222968 DOI: 10.1371/journal.pone.0111982] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2014] [Accepted: 10/09/2014] [Indexed: 11/18/2022] Open

217

Huntley RP, Sawford T, Mutowo-Meullenet P, Shypitsyna A, Bonilla C, Martin MJ, O'Donovan C. The GOA database: gene Ontology annotation updates for 2015. Nucleic Acids Res 2014;43:D1057-63. [PMID: 25378336 PMCID: PMC4383930 DOI: 10.1093/nar/gku1113] [Citation(s) in RCA: 378] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open

218

Lee T, Yang S, Kim E, Ko Y, Hwang S, Shin J, Shim JE, Shim H, Kim H, Kim C, Lee I. AraNet v2: an improved database of co-functional gene networks for the study of Arabidopsis thaliana and 27 other nonmodel plant species. Nucleic Acids Res 2014;43:D996-1002. [PMID: 25355510 PMCID: PMC4383895 DOI: 10.1093/nar/gku1053] [Citation(s) in RCA: 104] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

219

Vorwerk S, Krieger V, Deiwick J, Hensel M, Hansmeier N. Proteomes of host cell membranes modified by intracellular activities of Salmonella enterica. Mol Cell Proteomics 2014;14:81-92. [PMID: 25348832 DOI: 10.1074/mcp.m114.041145] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open

220

UniProt: a hub for protein information. Nucleic Acids Res 2014;43:D204-12. [PMID: 25348405 PMCID: PMC4384041 DOI: 10.1093/nar/gku989] [Citation(s) in RCA: 3483] [Impact Index Per Article: 348.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open

221

Torto-Alalibo T, Purwantini E, Lomax J, Setubal JC, Mukhopadhyay B, Tyler BM. Genetic resources for advanced biofuel production described with the Gene Ontology. Front Microbiol 2014;5:528. [PMID: 25346727 PMCID: PMC4193338 DOI: 10.3389/fmicb.2014.00528] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Accepted: 09/22/2014] [Indexed: 12/12/2022] Open

222

Patwardhan A, Ashton A, Brandt R, Butcher S, Carzaniga R, Chiu W, Collinson L, Doux P, Duke E, Ellisman MH, Franken E, Grünewald K, Heriche JK, Koster A, Kühlbrandt W, Lagerstedt I, Larabell C, Lawson CL, Saibil HR, Sanz-García E, Subramaniam S, Verkade P, Swedlow JR, Kleywegt GJ. A 3D cellular context for the macromolecular world. Nat Struct Mol Biol 2014;21:841-5. [PMID: 25289590 PMCID: PMC4346196 DOI: 10.1038/nsmb.2897] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Affiliation(s)

Ardan Patwardhan Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Alun Ashton Diamond Light Source, Didcot, UK
Robert Brandt FEI Visualization Sciences Group, Mérignac, France
Sarah Butcher Institute of Biotechnology, University of Helsinki, Helsinki, Finland
Raffaella Carzaniga Electron Microscopy Unit, Cancer Research UK London Research Institute, London, UK
Wah Chiu National Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas
Lucy Collinson Electron Microscopy Unit, Cancer Research UK London Research Institute, London, UK
Pascal Doux FEI Visualization Sciences Group, Mérignac, France
Elizabeth Duke Diamond Light Source, Didcot, UK
Mark H Ellisman Center for Research in Biological Systems, National Center for Microscopy and Imaging Research (NCMIR), University of California, San Diego, San Diego, California, USA
Erik Franken FEI Electron Optics B.V., Eindhoven, the Netherlands
Kay Grünewald Division of Structural Biology, Wellcome Trust Centre for Human Genetics, Oxford, UK
Jean-Karim Heriche Cell Biology and Biophysics Unit, European Molecular Biology Laboratory, Heidelberg, Germany
Abraham Koster Department of Molecular Cell Biology, Leiden University Medical Center, Leiden, the Netherlands
Werner Kühlbrandt Department of Structural Biology, Max Planck Institute for Biophysics, Frankfurt, Germany
Ingvar Lagerstedt Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Carolyn Larabell Department of Anatomy, University of California, San Francisco, San Francisco, California, USA
Catherine L Lawson Research Collaboratory for Structural Bioinformatics, Rutgers University, Piscataway, New Jersey, USA
Helen R Saibil Institute of Structural and Molecular Biology, Department of Crystallography, Birkbeck College, London, UK
Eduardo Sanz-García Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Sriram Subramaniam Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, USA
Paul Verkade Wolfson Bioimaging Facility, School of Biochemistry, University of Bristol, Bristol, UK
Jason R Swedlow Centre for Gene Regulation and Expression, University of Dundee, Dundee, UK
Gerard J Kleywegt Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK

Collapse

223

van Dam JCJ, Schaap PJ, Martins dos Santos VAP, Suárez-Diez M. Integration of heterogeneous molecular networks to unravel gene-regulation in Mycobacterium tuberculosis. BMC SYSTEMS BIOLOGY 2014;8:111. [PMID: 25279447 PMCID: PMC4181829 DOI: 10.1186/s12918-014-0111-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Accepted: 09/05/2014] [Indexed: 12/23/2022]

Abstract

BACKGROUND

Different methods have been developed to infer regulatory networks from heterogeneous omics datasets and to construct co-expression networks. Each algorithm produces different networks and efforts have been devoted to automatically integrate them into consensus sets. However each separate set has an intrinsic value that is diluted and partly lost when building a consensus network. Here we present a methodology to generate co-expression networks and, instead of a consensus network, we propose an integration framework where the different networks are kept and analysed with additional tools to efficiently combine the information extracted from each network.

RESULTS

We developed a workflow to efficiently analyse information generated by different inference and prediction methods. Our methodology relies on providing the user the means to simultaneously visualise and analyse the coexisting networks generated by different algorithms, heterogeneous datasets, and a suite of analysis tools. As a show case, we have analysed the gene co-expression networks of Mycobacterium tuberculosis generated using over 600 expression experiments. Regarding DNA damage repair, we identified SigC as a key control element, 12 new targets for LexA, an updated LexA binding motif, and a potential mismatch repair system. We expanded the DevR regulon with 27 genes while identifying 9 targets wrongly assigned to this regulon. We discovered 10 new genes linked to zinc uptake and a new regulatory mechanism for ZuR. The use of co-expression networks to perform system level analysis allows the development of custom made methodologies. As show cases we implemented a pipeline to integrate ChIP-seq data and another method to uncover multiple regulatory layers.

CONCLUSIONS

Our workflow is based on representing the multiple types of information as network representations and presenting these networks in a synchronous framework that allows their simultaneous visualization while keeping specific associations from the different networks. By simultaneously exploring these networks and metadata, we gained insights into regulatory mechanisms in M. tuberculosis that could not be obtained through the separate analysis of each data type.

Collapse

224

Peterson EJR, Reiss DJ, Turkarslan S, Minch KJ, Rustad T, Plaisier CL, Longabaugh WJR, Sherman DR, Baliga NS. A high-resolution network model for global gene regulation in Mycobacterium tuberculosis. Nucleic Acids Res 2014;42:11291-303. [PMID: 25232098 PMCID: PMC4191388 DOI: 10.1093/nar/gku777] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open

225

Orfanoudaki G, Economou A. Proteome-wide subcellular topologies of E. coli polypeptides database (STEPdb). Mol Cell Proteomics 2014;13:3674-87. [PMID: 25210196 DOI: 10.1074/mcp.o114.041137] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open

Abstract

Cell compartmentalization serves both the isolation and the specialization of cell functions. After synthesis in the cytoplasm, over a third of all proteins are targeted to other subcellular compartments. Knowing how proteins are distributed within the cell and how they interact is a prerequisite for understanding it as a whole. Surface and secreted proteins are important pathogenicity determinants. Here we present the STEP database (STEPdb) that contains a comprehensive characterization of subcellular localization and topology of the complete proteome of Escherichia coli. Two widely used E. coli proteomes (K-12 and BL21) are presented organized into thirteen subcellular classes. STEPdb exploits the wealth of genetic, proteomic, biochemical, and functional information on protein localization, secretion, and targeting in E. coli, one of the best understood model organisms. Subcellular annotations were derived from a combination of bioinformatics prediction, proteomic, biochemical, functional, topological data and extensive literature re-examination that were refined through manual curation. Strong experimental support for the location of 1553 out of 4303 proteins was based on 426 articles and some experimental indications for another 526. Annotations were provided for another 320 proteins based on firm bioinformatic predictions. STEPdb is the first database that contains an extensive set of peripheral IM proteins (PIM proteins) and includes their graphical visualization into complexes, cellular functions, and interactions. It also summarizes all currently known protein export machineries of E. coli K-12 and pairs them, where available, with the secretory proteins that use them. It catalogs the Sec- and TAT-utilizing secretomes and summarizes their topological features such as signal peptides and transmembrane regions, transmembrane topologies and orientations. It also catalogs physicochemical and structural features that influence topology such as abundance, solubility, disorder, heat resistance, and structural domain families. Finally, STEPdb incorporates prediction tools for topology (TMHMM, SignalP, and Phobius) and disorder (IUPred) and implements the BLAST2STEP that performs protein homology searches against the STEPdb.

Collapse

226

Engelke R, Riede J, Hegermann J, Wuerch A, Eimer S, Dengjel J, Mittler G. The Quantitative Nuclear Matrix Proteome as a Biochemical Snapshot of Nuclear Organization. J Proteome Res 2014;13:3940-56. [DOI: 10.1021/pr500218f] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

227

Hu Q, Wang Z, Zhang Z. FSim: a novel functional similarity search algorithm and tool for discovering functionally related gene products. BIOMED RESEARCH INTERNATIONAL 2014;2014:509149. [PMID: 25184141 PMCID: PMC4145548 DOI: 10.1155/2014/509149] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/13/2014] [Revised: 06/24/2014] [Accepted: 07/22/2014] [Indexed: 01/21/2023]

228

Peng C, Gao F. Protein localization analysis of essential genes in prokaryotes. Sci Rep 2014;4:6001. [PMID: 25105358 PMCID: PMC4126397 DOI: 10.1038/srep06001] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 07/22/2014] [Indexed: 01/27/2023] Open

229

Gilbert TM, McDaniel SL, Byrum SD, Cades JA, Dancy BCR, Wade H, Tackett AJ, Strahl BD, Taverna SD. A PWWP domain-containing protein targets the NuA3 acetyltransferase complex via histone H3 lysine 36 trimethylation to coordinate transcriptional elongation at coding regions. Mol Cell Proteomics 2014;13:2883-95. [PMID: 25104842 DOI: 10.1074/mcp.m114.038224] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open

Abstract

Post-translational modifications of histones, such as acetylation and methylation, are differentially positioned in chromatin with respect to gene organization. For example, although histone H3 is often trimethylated on lysine 4 (H3K4me3) and acetylated on lysine 14 (H3K14ac) at active promoter regions, histone H3 lysine 36 trimethylation (H3K36me3) occurs throughout the open reading frames of transcriptionally active genes. The conserved yeast histone acetyltransferase complex, NuA3, specifically binds H3K4me3 through a plant homeodomain (PHD) finger in the Yng1 subunit, and subsequently catalyzes the acetylation of H3K14 through the histone acetyltransferase domain of Sas3, leading to transcription initiation at a subset of genes. We previously found that Ylr455w (Pdp3), an uncharacterized proline-tryptophan-tryptophan-proline (PWWP) domain-containing protein, copurifies with stable members of NuA3. Here, we employ mass-spectrometric analysis of affinity purified Pdp3, biophysical binding assays, and genetic analyses to classify NuA3 into two functionally distinct forms: NuA3a and NuA3b. Although NuA3a uses the PHD finger of Yng1 to interact with H3K4me3 at the 5'-end of open reading frames, NuA3b contains the unique member, Pdp3, which regulates an interaction between NuA3b and H3K36me3 at the transcribed regions of genes through its PWWP domain. We find that deletion of PDP3 decreases NuA3-directed transcription and results in growth defects when combined with transcription elongation mutants, suggesting NuA3b acts as a positive elongation factor. Finally, we determine that NuA3a, but not NuA3b, is synthetically lethal in combination with a deletion of the histone acetyltransferase GCN5, indicating NuA3b has a specialized role at coding regions that is independent of Gcn5 activity. Collectively, these studies define a new form of the NuA3 complex that associates with H3K36me3 to effect transcriptional elongation. MS data are available via ProteomeXchange with identifier PXD001156.

Collapse

230

Mazandu GK, Mulder NJ. The use of semantic similarity measures for optimally integrating heterogeneous Gene Ontology data from large scale annotation pipelines. Front Genet 2014;5:264. [PMID: 25147557 PMCID: PMC4123725 DOI: 10.3389/fgene.2014.00264] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 07/18/2014] [Indexed: 11/14/2022] Open

Abstract

With the advancement of new high throughput sequencing technologies, there has been an increase in the number of genome sequencing projects worldwide, which has yielded complete genome sequences of human, animals and plants. Subsequently, several labs have focused on genome annotation, consisting of assigning functions to gene products, mostly using Gene Ontology (GO) terms. As a consequence, there is an increased heterogeneity in annotations across genomes due to different approaches used by different pipelines to infer these annotations and also due to the nature of the GO structure itself. This makes a curator's task difficult, even if they adhere to the established guidelines for assessing these protein annotations. Here we develop a genome-scale approach for integrating GO annotations from different pipelines using semantic similarity measures. We used this approach to identify inconsistencies and similarities in functional annotations between orthologs of human and Drosophila melanogaster, to assess the quality of GO annotations derived from InterPro2GO mappings compared to manually annotated GO annotations for the Drosophila melanogaster proteome from a FlyBase dataset and human, and to filter GO annotation data for these proteomes. Results obtained indicate that an efficient integration of GO annotations eliminates redundancy up to 27.08 and 22.32% in the Drosophila melanogaster and human GO annotation datasets, respectively. Furthermore, we identified lack of and missing annotations for some orthologs, and annotation mismatches between InterPro2GO and manual pipelines in these two proteomes, thus requiring further curation. This simplifies and facilitates tasks of curators in assessing protein annotations, reduces redundancy and eliminates inconsistencies in large annotation datasets for ease of comparative functional genomics.

Collapse

231

In vivo mRNA profiling of uropathogenic Escherichia coli from diverse phylogroups reveals common and group-specific gene expression profiles. mBio 2014;5:e01075-14. [PMID: 25096872 PMCID: PMC4128348 DOI: 10.1128/mbio.01075-14] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Abstract

mRNA profiling of pathogens during the course of human infections gives detailed information on the expression levels of relevant genes that drive pathogenicity and adaptation and at the same time allows for the delineation of phylogenetic relatedness of pathogens that cause specific diseases. In this study, we used mRNA sequencing to acquire information on the expression of Escherichia coli pathogenicity genes during urinary tract infections (UTI) in humans and to assign the UTI-associated E. coli isolates to different phylogenetic groups. Whereas the in vivo gene expression profiles of the majority of genes were conserved among 21 E. coli strains in the urine of elderly patients suffering from an acute UTI, the specific gene expression profiles of the flexible genomes was diverse and reflected phylogenetic relationships. Furthermore, genes transcribed in vivo relative to laboratory media included well-described virulence factors, small regulatory RNAs, as well as genes not previously linked to bacterial virulence. Knowledge on relevant transcriptional responses that drive pathogenicity and adaptation of isolates to the human host might lead to the introduction of a virulence typing strategy into clinical microbiology, potentially facilitating management and prevention of the disease.

Urinary tract infections (UTI) are very common; at least half of all women experience UTI, most of which are caused by pathogenic Escherichia coli strains. In this study, we applied massive parallel cDNA sequencing (RNA-seq) to provide unbiased, deep, and accurate insight into the nature and the dimension of the uropathogenic E. coli gene expression profile during an acute UTI within the human host. This work was undertaken to identify key players in physiological adaptation processes and, hence, potential targets for new infection prevention and therapy interventions specifically aimed at sabotaging bacterial adaptation to the human host.

Collapse

232

Chibucos MC, Mungall CJ, Balakrishnan R, Christie KR, Huntley RP, White O, Blake JA, Lewis SE, Giglio M. Standardized description of scientific evidence using the Evidence Ontology (ECO). DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014;2014:bau075. [PMID: 25052702 PMCID: PMC4105709 DOI: 10.1093/database/bau075] [Citation(s) in RCA: 82] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Affiliation(s)

Marcus C Chibucos Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USAInstitute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Christopher J Mungall Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Rama Balakrishnan Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Karen R Christie Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Rachael P Huntley Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Owen White Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USAInstitute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Judith A Blake Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Suzanna E Lewis Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA
Michelle Giglio Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USAInstitute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA, Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA, Saccharomyces Genome Database, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Computational Biology and Bioinformatics, The Jackson Laboratory, Bar Harbor, ME 04609, USA, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD UK, Department of Epidemiology, University of Maryland School of Medicine, Baltimore, MD 21201, USA and Department of Medicine, University of Maryland School of Medicine, Baltimore, MD 21201, USA

Collapse

233

Panek J, El Alaoui H, Mone A, Urbach S, Demettre E, Texier C, Brun C, Zanzoni A, Peyretaillade E, Parisot N, Lerat E, Peyret P, Delbac F, Biron DG. Hijacking of host cellular functions by an intracellular parasite, the microsporidian Anncaliia algerae. PLoS One 2014;9:e100791. [PMID: 24967735 PMCID: PMC4072689 DOI: 10.1371/journal.pone.0100791] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 05/29/2014] [Indexed: 11/18/2022] Open

Affiliation(s)

Johan Panek Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France CNRS, UMR 6023, LMGE, Aubière, France
Hicham El Alaoui Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France CNRS, UMR 6023, LMGE, Aubière, France * E-mail: (HEA); (DGB)
Anne Mone Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France CNRS, UMR 6023, LMGE, Aubière, France
Serge Urbach Functional Proteomics Platform. UMR CNRS 5203, Montpellier, France
Edith Demettre Functional Proteomics Platform. UMS CNRS 3426, Montpellier, France
Catherine Texier Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France CNRS, UMR 6023, LMGE, Aubière, France
Christine Brun INSERM, UMR1090 TAGC, Marseille, Marseille, France Aix-Marseille Université, UMR1090 TAGC, Marseille, France CNRS, Marseille, France
Andreas Zanzoni INSERM, UMR1090 TAGC, Marseille, Marseille, France Aix-Marseille Université, UMR1090 TAGC, Marseille, France
Eric Peyretaillade Clermont Université, Université d'Auvergne, I.U.T., UFR Pharmacie, Clermont-Ferrand, France Clermont Université, Université d'Auvergne, EA 4678, Conception, Ingénierie et Développement de l'Aliment et du Médicament, Clermont-Ferrand, France
Nicolas Parisot Clermont Université, Université d'Auvergne, I.U.T., UFR Pharmacie, Clermont-Ferrand, France Clermont Université, Université d'Auvergne, EA 4678, Conception, Ingénierie et Développement de l'Aliment et du Médicament, Clermont-Ferrand, France
Emmanuelle Lerat Université de Lyon, Université Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
Pierre Peyret Clermont Université, Université d'Auvergne, I.U.T., UFR Pharmacie, Clermont-Ferrand, France Clermont Université, Université d'Auvergne, EA 4678, Conception, Ingénierie et Développement de l'Aliment et du Médicament, Clermont-Ferrand, France
Frederic Delbac Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France CNRS, UMR 6023, LMGE, Aubière, France
David G. Biron Clermont Université, Université Blaise Pascal, Laboratoire Microorganismes: Génome et Environnement, Clermont-Ferrand, France CNRS, UMR 6023, LMGE, Aubière, France * E-mail: (HEA); (DGB)

Collapse

234

Bragina EY, Tiys ES, Freidin MB, Koneva LA, Demenkov PS, Ivanisenko VA, Kolchanov NA, Puzyrev VP. Insights into pathophysiology of dystropy through the analysis of gene networks: an example of bronchial asthma and tuberculosis. Immunogenetics 2014;66:457-65. [PMID: 24954693 DOI: 10.1007/s00251-014-0786-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2013] [Accepted: 06/12/2014] [Indexed: 01/18/2023]

235

Dikicioglu D, Wood V, Rutherford KM, McDowall MD, Oliver SG. Improving functional annotation for industrial microbes: a case study with Pichia pastoris. Trends Biotechnol 2014;32:396-9. [PMID: 24929579 PMCID: PMC4111905 DOI: 10.1016/j.tibtech.2014.05.003] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Revised: 05/10/2014] [Accepted: 05/13/2014] [Indexed: 11/29/2022]

236

Arighi CN, Wu CH, Cohen KB, Hirschman L, Krallinger M, Valencia A, Lu Z, Wilbur JW, Wiegers TC. BioCreative-IV virtual issue. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014;2014:bau039. [PMID: 24852177 PMCID: PMC4030502 DOI: 10.1093/database/bau039] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

237

Goldberg T, Hecht M, Hamp T, Karl T, Yachdav G, Ahmed N, Altermann U, Angerer P, Ansorge S, Balasz K, Bernhofer M, Betz A, Cizmadija L, Do KT, Gerke J, Greil R, Joerdens V, Hastreiter M, Hembach K, Herzog M, Kalemanov M, Kluge M, Meier A, Nasir H, Neumaier U, Prade V, Reeb J, Sorokoumov A, Troshani I, Vorberg S, Waldraff S, Zierer J, Nielsen H, Rost B. LocTree3 prediction of localization. Nucleic Acids Res 2014;42:W350-5. [PMID: 24848019 PMCID: PMC4086075 DOI: 10.1093/nar/gku396] [Citation(s) in RCA: 196] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open

Affiliation(s)

Tatyana Goldberg Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany TUM Graduate School, Center of Doctoral Studies in Informatics and its Applications (CeDoSIA), 85748 Garching, Germany
Maximilian Hecht Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Tobias Hamp Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Timothy Karl Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Guy Yachdav Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany Biosof LLC, New York, NY 10001, USA
Nadeem Ahmed Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Uwe Altermann Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Philipp Angerer Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Sonja Ansorge Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Kinga Balasz Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Michael Bernhofer Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Alexander Betz Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Laura Cizmadija Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Kieu Trinh Do Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Julia Gerke Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Robert Greil Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Vadim Joerdens Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Maximilian Hastreiter Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Katharina Hembach Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Max Herzog Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Maria Kalemanov Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Michael Kluge Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Alice Meier Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Hassan Nasir Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Ulrich Neumaier Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Verena Prade Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Jonas Reeb Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Aleksandr Sorokoumov Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Ilira Troshani Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Susann Vorberg Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Sonja Waldraff Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Jonas Zierer Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany
Henrik Nielsen Center for Biological Sequence Analysis, Department of Systems Biology, DTU, 2800 Lyngby, Denmark
Burkhard Rost Department of Informatics, Bioinformatics-I12, TUM, 85748 Garching, Germany Biosof LLC, New York, NY 10001, USA Institute for Advanced Study (TUM-IAS), 85748 Garching, Germany New York Consortium on Membrane Protein Structure (NYCOMPS) & Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA Institute for Food and Plant Sciences WZW - Weihenstephan, 85350 Freising, Germany

Collapse

238

Huntley RP, Harris MA, Alam-Faruque Y, Blake JA, Carbon S, Dietze H, Dimmer EC, Foulger RE, Hill DP, Khodiyar VK, Lock A, Lomax J, Lovering RC, Mutowo-Meullenet P, Sawford T, Van Auken K, Wood V, Mungall CJ. A method for increasing expressivity of Gene Ontology annotations using a compositional approach. BMC Bioinformatics 2014;15:155. [PMID: 24885854 PMCID: PMC4039540 DOI: 10.1186/1471-2105-15-155] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2014] [Accepted: 05/15/2014] [Indexed: 11/22/2022] Open

239

Mashiyama ST, Malabanan MM, Akiva E, Bhosle R, Branch MC, Hillerich B, Jagessar K, Kim J, Patskovsky Y, Seidel RD, Stead M, Toro R, Vetting MW, Almo SC, Armstrong RN, Babbitt PC. Large-scale determination of sequence, structure, and function relationships in cytosolic glutathione transferases across the biosphere. PLoS Biol 2014;12:e1001843. [PMID: 24756107 PMCID: PMC3995644 DOI: 10.1371/journal.pbio.1001843] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Accepted: 03/14/2014] [Indexed: 12/11/2022] Open

Abstract

Global networks of the cytosolic glutathione S-transferases illuminate sequence-structure-function relationships across more than 13,000 members of this superfamily, including experimental confirmation of enzymatic activity for 82 members and new crystal structures for 27.

The cytosolic glutathione transferase (cytGST) superfamily comprises more than 13,000 nonredundant sequences found throughout the biosphere. Their key roles in metabolism and defense against oxidative damage have led to thousands of studies over several decades. Despite this attention, little is known about the physiological reactions they catalyze and most of the substrates used to assay cytGSTs are synthetic compounds. A deeper understanding of relationships across the superfamily could provide new clues about their functions. To establish a foundation for expanded classification of cytGSTs, we generated similarity-based subgroupings for the entire superfamily. Using the resulting sequence similarity networks, we chose targets that broadly covered unknown functions and report here experimental results confirming GST-like activity for 82 of them, along with 37 new 3D structures determined for 27 targets. These new data, along with experimentally known GST reactions and structures reported in the literature, were painted onto the networks to generate a global view of their sequence-structure-function relationships. The results show how proteins of both known and unknown function relate to each other across the entire superfamily and reveal that the great majority of cytGSTs have not been experimentally characterized or annotated by canonical class. A mapping of taxonomic classes across the superfamily indicates that many taxa are represented in each subgroup and highlights challenges for classification of superfamily sequences into functionally relevant classes. Experimental determination of disulfide bond reductase activity in many diverse subgroups illustrate a theme common for many reaction types. Finally, sequence comparison between an enzyme that catalyzes a reductive dechlorination reaction relevant to bioremediation efforts with some of its closest homologs reveals differences among them likely to be associated with evolution of this unusual reaction. Interactive versions of the networks, associated with functional and other types of information, can be downloaded from the Structure-Function Linkage Database (SFLD; http://sfld.rbvi.ucsf.edu).

Cytosolic glutathione transferases (cytGSTs) are a large and diverse superfamily of enzymes that have important roles in metabolism and defense against oxidative damage. They have been studied for several decades but because of the synthetic nature of the chemicals used to test these proteins to determine if they have cytGST activity, little is known about the physiological reactions and roles of cytGSTs. In this large, collaborative study, we constructed networks where more than 13,000 cytGST sequences were grouped by sequence similarity and then used these networks to prioritize new targets for experimental characterization in relatively unexplored regions of the superfamily. We report here experimental results confirming GST-like activity for 82 of them, along with 37 new three-dimensional molecular structures determined for 27 targets. These new data, along with experimental data previously reported in the literature, were painted onto the networks to generate a global view of their sequence-structure-function relationships. The results show how proteins of both known and unknown function relate to each other across the entire superfamily and illuminate the complex ways in which their variations in sequence and structure affect our ability to predict unknown functional properties.

Collapse

Affiliation(s)

Susan T. Mashiyama Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California, United States of America
M. Merced Malabanan Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
Eyal Akiva Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California, United States of America
Rahul Bhosle Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Megan C. Branch Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
Brandan Hillerich Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Kevin Jagessar Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
Jungwook Kim Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Yury Patskovsky Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Ronald D. Seidel Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Mark Stead Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Rafael Toro Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Matthew W. Vetting Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America
Steven C. Almo Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, United States of America * E-mail: (SCA); (RNA); (PCB)
Richard N. Armstrong Departments of Biochemistry and Chemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America * E-mail: (SCA); (RNA); (PCB)
Patricia C. Babbitt Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California, United States of America Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, California, United States of America California Institute for Quantitative Biosciences, University of California, San Francisco, San Francisco, California, United States of America * E-mail: (SCA); (RNA); (PCB)

Collapse

240

Murri M, Insenser M, Luque M, Tinahones FJ, Escobar-Morreale HF. Proteomic analysis of adipose tissue: informing diabetes research. Expert Rev Proteomics 2014;11:491-502. [PMID: 24684164 DOI: 10.1586/14789450.2014.903158] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

241

Identification of microRNAs in the coral Stylophora pistillata. PLoS One 2014;9:e91101. [PMID: 24658574 PMCID: PMC3962355 DOI: 10.1371/journal.pone.0091101] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2013] [Accepted: 02/06/2014] [Indexed: 12/22/2022] Open

Abstract

Coral reefs are major contributors to marine biodiversity. However, they are in rapid decline due to global environmental changes such as rising sea surface temperatures, ocean acidification, and pollution. Genomic and transcriptomic analyses have broadened our understanding of coral biology, but a study of the microRNA (miRNA) repertoire of corals is missing. miRNAs constitute a class of small non-coding RNAs of ∼22 nt in size that play crucial roles in development, metabolism, and stress response in plants and animals alike. In this study, we examined the coral Stylophora pistillata for the presence of miRNAs and the corresponding core protein machinery required for their processing and function. Based on small RNA sequencing, we present evidence for 31 bona fide microRNAs, 5 of which (miR-100, miR-2022, miR-2023, miR-2030, and miR-2036) are conserved in other metazoans. Homologues of Argonaute, Piwi, Dicer, Drosha, Pasha, and HEN1 were identified in the transcriptome of S. pistillata based on strong sequence conservation with known RNAi proteins, with additional support derived from phylogenetic trees. Examination of putative miRNA gene targets indicates potential roles in development, metabolism, immunity, and biomineralisation for several of the microRNAs. Here, we present first evidence of a functional RNAi machinery and five conserved miRNAs in S. pistillata, implying that miRNAs play a role in organismal biology of scleractinian corals. Analysis of predicted miRNA target genes in S. pistillata suggests potential roles of miRNAs in symbiosis and coral calcification. Given the importance of miRNAs in regulating gene expression in other metazoans, further expression analyses of small non-coding RNAs in transcriptional studies of corals should be informative about miRNA-affected processes and pathways.

Collapse

242

Huntley RP, Sawford T, Martin MJ, O'Donovan C. Understanding how and why the Gene Ontology and its annotations evolve: the GO within UniProt. Gigascience 2014;3:4. [PMID: 24641996 PMCID: PMC3995153 DOI: 10.1186/2047-217x-3-4] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Accepted: 03/10/2014] [Indexed: 11/01/2022] Open

243

Croset S, Overington JP, Rebholz-Schuhmann D. The functional therapeutic chemical classification system. Bioinformatics 2014;30:876-83. [PMID: 24177719 PMCID: PMC3957075 DOI: 10.1093/bioinformatics/btt628] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Revised: 10/15/2013] [Accepted: 10/27/2013] [Indexed: 01/27/2023] Open

244

Häuser R, Ceol A, Rajagopala SV, Mosca R, Siszler G, Wermke N, Sikorski P, Schwarz F, Schick M, Wuchty S, Aloy P, Uetz P. A second-generation protein-protein interaction network of Helicobacter pylori. Mol Cell Proteomics 2014;13:1318-29. [PMID: 24627523 DOI: 10.1074/mcp.o113.033571] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

245

Poux S, Magrane M, Arighi CN, Bridge A, O'Donovan C, Laiho K. Expert curation in UniProtKB: a case study on dealing with conflicting and erroneous data. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014;2014:bau016. [PMID: 24622611 PMCID: PMC3950660 DOI: 10.1093/database/bau016] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

246

Barbier M, Damron FH, Bielecki P, Suárez-Diez M, Puchałka J, Albertí S, dos Santos VM, Goldberg JB. From the environment to the host: re-wiring of the transcriptome of Pseudomonas aeruginosa from 22°C to 37°C. PLoS One 2014;9:e89941. [PMID: 24587139 PMCID: PMC3933690 DOI: 10.1371/journal.pone.0089941] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2013] [Accepted: 01/25/2014] [Indexed: 11/18/2022] Open

Abstract

Pseudomonas aeruginosa is a highly versatile opportunistic pathogen capable of colonizing multiple ecological niches. This bacterium is responsible for a wide range of both acute and chronic infections in a variety of hosts. The success of this microorganism relies on its ability to adapt to environmental changes and re-program its regulatory and metabolic networks. The study of P. aeruginosa adaptation to temperature is crucial to understanding the pathogenesis upon infection of its mammalian host. We examined the effects of growth temperature on the transcriptome of the P. aeruginosa PAO1. Microarray analysis of PAO1 grown in Lysogeny broth at mid-exponential phase at 22°C and 37°C revealed that temperature changes are responsible for the differential transcriptional regulation of 6.4% of the genome. Major alterations were observed in bacterial metabolism, replication, and nutrient acquisition. Quorum-sensing and exoproteins secreted by type I, II, and III secretion systems, involved in the adaptation of P. aeruginosa to the mammalian host during infection, were up-regulated at 37°C compared to 22°C. Genes encoding arginine degradation enzymes were highly up-regulated at 22°C, together with the genes involved in the synthesis of pyoverdine. However, genes involved in pyochelin biosynthesis were up-regulated at 37°C. We observed that the changes in expression of P. aeruginosa siderophores correlated to an overall increase in Fe²⁺ extracellular concentration at 37°C and a peak in Fe³⁺ extracellular concentration at 22°C. This suggests a distinct change in iron acquisition strategies when the bacterium switches from the external environment to the host. Our work identifies global changes in bacterial metabolism and nutrient acquisition induced by growth at different temperatures. Overall, this study identifies factors that are regulated in genome-wide adaptation processes and discusses how this life-threatening pathogen responds to temperature.

Collapse

247

Mangiola S, Young ND, Sternberg PW, Strube C, Korhonen PK, Mitreva M, Scheerlinck JP, Hofmann A, Jex AR, Gasser RB. Analysis of the transcriptome of adult Dictyocaulus filaria and comparison with Dictyocaulus viviparus, with a focus on molecules involved in host-parasite interactions. Int J Parasitol 2014;44:251-61. [PMID: 24487001 DOI: 10.1016/j.ijpara.2013.12.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 12/11/2013] [Accepted: 12/18/2013] [Indexed: 01/09/2023]

248

Lee TY, Chang CW, Lu CT, Cheng TH, Chang TH. Identification and characterization of lysine-methylated sites on histones and non-histone proteins. Comput Biol Chem 2014;50:11-8. [PMID: 24560580 DOI: 10.1016/j.compbiolchem.2014.01.009] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/23/2013] [Indexed: 01/17/2023]

Abstract

Protein methylation is a kind of post-translational modification (PTM), and typically takes place on lysine and arginine amino acid residues. Protein methylation is involved in many important biological processes, and most recent studies focused on lysine methylation of histones due to its critical roles in regulating transcriptional repression and activation. Histones possess highly conserved sequences and are homologous in most species. However, there is much less sequence conservation among non-histone proteins. Therefore, mechanisms for identifying lysine-methylated sites may greatly differ between histones and non-histone proteins. Nevertheless, this point of view was not considered in previous studies. Here we constructed two support vector machine (SVM) models by using lysine-methylated data from histones and non-histone proteins for predictions of lysine-methylated sites. Numerous features, such as the amino acid composition (AAC) and accessible surface area (ASA), were used in the SVM models, and the predictive performance was evaluated using five-fold cross-validations. For histones, the predictive sensitivity was 85.62% and specificity was 80.32%. For non-histone proteins, the predictive sensitivity was 69.1% and specificity was 88.72%. Results showed that our model significantly improved the predictive accuracy of histones compared to previous approaches. In addition, features of the flanking region of lysine-methylated sites on histones and non-histone proteins were also characterized and are discussed. A gene ontology functional analysis of lysine-methylated proteins and correlations of lysine-methylated sites with other PTMs in histones were also analyzed in detail. Finally, a web server, MethyK, was constructed to identify lysine-methylated sites. MethK now is available at http://csb.cse.yzu.edu.tw/MethK/.

Collapse

249

Predicting human protein subcellular locations by the ensemble of multiple predictors via protein-protein interaction network with edge clustering coefficients. PLoS One 2014;9:e86879. [PMID: 24466278 PMCID: PMC3900678 DOI: 10.1371/journal.pone.0086879] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Accepted: 12/18/2013] [Indexed: 12/14/2022] Open

250

Moore CB, Wallace JR, Wolfe DJ, Frase AT, Pendergrass SA, Weiss KM, Ritchie MD. Low frequency variants, collapsed based on biological knowledge, uncover complexity of population stratification in 1000 genomes project data. PLoS Genet 2013;9:e1003959. [PMID: 24385916 PMCID: PMC3873241 DOI: 10.1371/journal.pgen.1003959] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 10/01/2013] [Indexed: 12/13/2022] Open

Abstract

Analyses investigating low frequency variants have the potential for explaining additional genetic heritability of many complex human traits. However, the natural frequencies of rare variation between human populations strongly confound genetic analyses. We have applied a novel collapsing method to identify biological features with low frequency variant burden differences in thirteen populations sequenced by the 1000 Genomes Project. Our flexible collapsing tool utilizes expert biological knowledge from multiple publicly available database sources to direct feature selection. Variants were collapsed according to genetically driven features, such as evolutionary conserved regions, regulatory regions genes, and pathways. We have conducted an extensive comparison of low frequency variant burden differences (MAF<0.03) between populations from 1000 Genomes Project Phase I data. We found that on average 26.87% of gene bins, 35.47% of intergenic bins, 42.85% of pathway bins, 14.86% of ORegAnno regulatory bins, and 5.97% of evolutionary conserved regions show statistically significant differences in low frequency variant burden across populations from the 1000 Genomes Project. The proportion of bins with significant differences in low frequency burden depends on the ancestral similarity of the two populations compared and types of features tested. Even closely related populations had notable differences in low frequency burden, but fewer differences than populations from different continents. Furthermore, conserved or functionally relevant regions had fewer significant differences in low frequency burden than regions under less evolutionary constraint. This degree of low frequency variant differentiation across diverse populations and feature elements highlights the critical importance of considering population stratification in the new era of DNA sequencing and low frequency variant genomic analyses.

Low frequency variants are likely to play an important role in uncovering complex trait heritability; however, they are often continent or population specific. This specificity complicates genetic analyses investigating low frequency variants for two reasons: low frequency variant signals in an association test are often difficult to generalize beyond a single population or continental group, and there is an increase in false positive results in association analyses due to underlying population stratification. In order to reveal the magnitude of low frequency population stratification, we performed pairwise population comparisons using the 1000 Genomes Project Phase I data to investigate differences in low frequency variant burden across multiple biological features. We found that low frequency variant confounding is much more prevalent than one might expect, even within continental groups. The proportion of significant differences in low frequency variant burden was also dependent on the region of interest; for example, annotated regulatory regions showed fewer low frequency burden differences between populations than intergenic regions. Knowledge of population structure and the genomic landscape in a region of interest are important factors in determining the extent of confounding due to population stratification in a low frequency genomic analysis.

Collapse

Affiliation(s)

Carrie B. Moore Center for Human Genetic Research, Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
John R. Wallace Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Daniel J. Wolfe Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Alex T. Frase Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Sarah A. Pendergrass Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
Kenneth M. Weiss Department of Anthropology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Marylyn D. Ritchie Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America * E-mail:

Collapse