Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Plewniak F, Bianchetti L, Brelivet Y, Carles A, Chalmel F, Lecompte O, Mochel T, Moulinier L, Muller A, Muller J, Prigent V, Ripp R, Thierry JC, Thompson JD, Wicker N, Poch O. PipeAlign: A new toolkit for protein family analysis. Nucleic Acids Res 2003;31:3829-32. [PMID: 12824430 PMCID: PMC168925 DOI: 10.1093/nar/gkg518] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

For:	Plewniak F, Bianchetti L, Brelivet Y, Carles A, Chalmel F, Lecompte O, Mochel T, Moulinier L, Muller A, Muller J, Prigent V, Ripp R, Thierry JC, Thompson JD, Wicker N, Poch O. PipeAlign: A new toolkit for protein family analysis. Nucleic Acids Res 2003;31:3829-32. [PMID: 12824430 PMCID: PMC168925 DOI: 10.1093/nar/gkg518] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Number

Cited by Other Article(s)

Chitale M, Hawkins T, Park C, Kihara D. ESG: extended similarity group method for automated protein function prediction. ACTA ACUST UNITED AC 2009;25:1739-45. [PMID: 19435743 DOI: 10.1093/bioinformatics/btp309] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]

Schmitt E, Galimand M, Panvert M, Courvalin P, Mechulam Y. Structural bases for 16 S rRNA methylation catalyzed by ArmA and RmtB methyltransferases. J Mol Biol 2009;388:570-82. [PMID: 19303884 DOI: 10.1016/j.jmb.2009.03.034] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2009] [Revised: 03/04/2009] [Accepted: 03/13/2009] [Indexed: 10/21/2022]

Delfosse V, Girard E, Birck C, Delmarcelle M, Delarue M, Poch O, Schultz P, Mayer C. Structure of the archaeal pab87 peptidase reveals a novel self-compartmentalizing protease family. PLoS One 2009;4:e4712. [PMID: 19266066 PMCID: PMC2651629 DOI: 10.1371/journal.pone.0004712] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2009] [Accepted: 01/28/2009] [Indexed: 11/18/2022] Open

Aniba MR, Siguenza S, Friedrich A, Plewniak F, Poch O, Marchler-Bauer A, Thompson JD. Knowledge-based expert systems and a proof-of-concept case study for multiple sequence alignment construction and analysis. Brief Bioinform 2009;10:11-23. [PMID: 18971242 PMCID: PMC2638625 DOI: 10.1093/bib/bbn045] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2008] [Revised: 10/02/2008] [Indexed: 11/15/2022] Open

Benelli D, Marzi S, Mancone C, Alonzi T, la Teana A, Londei P. Function and ribosomal localization of aIF6, a translational regulator shared by archaea and eukarya. Nucleic Acids Res 2008;37:256-67. [PMID: 19036786 PMCID: PMC2615626 DOI: 10.1093/nar/gkn959] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Pompidor G, Maillard AP, Girard E, Gambarelli S, Kahn R, Covès J. X-ray structure of the metal-sensor CnrX in both the apo- and copper-bound forms. FEBS Lett 2008;582:3954-8. [PMID: 18992246 DOI: 10.1016/j.febslet.2008.10.042] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2008] [Revised: 10/17/2008] [Accepted: 10/24/2008] [Indexed: 10/21/2022]

Gallien S, Perrodou E, Carapito C, Deshayes C, Reyrat JM, Van Dorsselaer A, Poch O, Schaeffer C, Lecompte O. Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol. Genome Res 2008;19:128-35. [PMID: 18955433 DOI: 10.1101/gr.081901.108] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Lecompte O, Poch O, Laporte J. PtdIns5P regulation through evolution: roles in membrane trafficking? Trends Biochem Sci 2008;33:453-60. [PMID: 18774718 DOI: 10.1016/j.tibs.2008.07.002] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2008] [Revised: 07/01/2008] [Accepted: 07/02/2008] [Indexed: 01/27/2023]

Campagnoli MF, Ramenghi U, Armiraglio M, Quarello P, Garelli E, Carando A, Avondo F, Pavesi E, Fribourg S, Gleizes PE, Loreni F, Dianzani I. RPS19 mutations in patients with Diamond-Blackfan anemia. Hum Mutat 2008;29:911-20. [PMID: 18412286 DOI: 10.1002/humu.20752] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Mathieu-Daudé F, Lafay B, Touzet O, Lelièvre J, Parrado F, Bosseno MF, Rojas AM, Fatha S, Ouaissi A, Brenière SF. Exploring the FL-160-CRP gene family through sequence variability of the complement regulatory protein (CRP) expressed by the trypomastigote stage of Trypanosoma cruzi. INFECTION GENETICS AND EVOLUTION 2008;8:258-66. [DOI: 10.1016/j.meegid.2007.12.010] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2007] [Revised: 12/14/2007] [Accepted: 12/17/2007] [Indexed: 11/25/2022]

Kamesh N, Aradhyam GK, Manoj N. The repertoire of G protein-coupled receptors in the sea squirt Ciona intestinalis. BMC Evol Biol 2008;8:129. [PMID: 18452600 PMCID: PMC2396169 DOI: 10.1186/1471-2148-8-129] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2007] [Accepted: 05/01/2008] [Indexed: 12/19/2022] Open

Abstract

BACKGROUND

G protein-coupled receptors (GPCRs) constitute a large family of integral transmembrane receptor proteins that play a central role in signal transduction in eukaryotes. The genome of the protochordate Ciona intestinalis has a compact size with an ancestral complement of many diversified gene families of vertebrates and is a good model system for studying protochordate to vertebrate diversification. An analysis of the Ciona repertoire of GPCRs from a comparative genomic perspective provides insight into the evolutionary origins of the GPCR signalling system in vertebrates.

RESULTS

We have identified 169 gene products in the Ciona genome that code for putative GPCRs. Phylogenetic analyses reveal that Ciona GPCRs have homologous representatives from the five major GRAFS (Glutamate, Rhodopsin, Adhesion, Frizzled and Secretin) families concomitant with other vertebrate GPCR repertoires. Nearly 39% of Ciona GPCRs have unambiguous orthologs of vertebrate GPCR families, as defined for the human, mouse, puffer fish and chicken genomes. The Rhodopsin family accounts for ~68% of the Ciona GPCR repertoire wherein the LGR-like subfamily exhibits a lineage specific gene expansion of a group of receptors that possess a novel domain organisation hitherto unobserved in metazoan genomes.

CONCLUSION

Comparison of GPCRs in Ciona to that in human reveals a high level of orthology of a protochordate repertoire with that of vertebrate GPCRs. Our studies suggest that the ascidians contain the basic ancestral complement of vertebrate GPCR genes. This is evident at the subfamily level comparisons since Ciona GPCR sequences are significantly analogous to vertebrate GPCR subfamilies even while exhibiting Ciona specific genes. Our analysis provides a framework to perform future experimental and comparative studies to understand the roles of the ancestral chordate versions of GPCRs that predated the divergence of the urochordates and the vertebrates.

Collapse

Perrodou E, Chica C, Poch O, Gibson TJ, Thompson JD. A new protein linear motif benchmark for multiple sequence alignment software. BMC Bioinformatics 2008;9:213. [PMID: 18439277 PMCID: PMC2374782 DOI: 10.1186/1471-2105-9-213] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2007] [Accepted: 04/25/2008] [Indexed: 12/14/2022] Open

Abstract

BACKGROUND

Linear motifs (LMs) are abundant short regulatory sites used for modulating the functions of many eukaryotic proteins. They play important roles in post-translational modification, cell compartment targeting, docking sites for regulatory complex assembly and protein processing and cleavage. Methods for LM detection are now being developed that are strongly dependent on scores for motif conservation in homologous proteins. However, most LMs are found in natively disordered polypeptide segments that evolve rapidly, unhindered by structural constraints on the sequence. These regions of modular proteins are difficult to align using classical multiple sequence alignment programs that are specifically optimised to align the globular domains. As a consequence, poor motif alignment quality is hindering efforts to detect new LMs.

RESULTS

We have developed a new benchmark, as part of the BAliBASE suite, designed to assess the ability of standard multiple alignment methods to detect and align LMs. The reference alignments are organised into different test sets representing real alignment problems and contain examples of experimentally verified functional motifs, extracted from the Eukaryotic Linear Motif (ELM) database. The benchmark has been used to evaluate and compare a number of multiple alignment programs. With distantly related proteins, the worst alignment program correctly aligns 48% of LMs compared to 73% for the best program. However, the performance of all the programs is adversely affected by the introduction of other sequences containing false positive motifs. The ranking of the alignment programs based on LM alignment quality is similar to that observed when considering full-length protein alignments, however little correlation was observed between LM and overall alignment quality for individual alignment test cases.

CONCLUSION

We have shown that none of the programs currently available is capable of reliably aligning LMs in distantly related sequences and we have highlighted a number of specific problems. The results of the tests suggest possible ways to improve program accuracy for difficult, divergent sequences.

Collapse

Mutagenesis in the alpha3alpha4 GyrA helix and in the Toprim domain of GyrB refines the contribution of Mycobacterium tuberculosis DNA gyrase to intrinsic resistance to quinolones. Antimicrob Agents Chemother 2008;52:2909-14. [PMID: 18426901 DOI: 10.1128/aac.01380-07] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Lagier-Tourenne C, Tazir M, López LC, Quinzii CM, Assoum M, Drouot N, Busso C, Makri S, Ali-Pacha L, Benhassine T, Anheim M, Lynch DR, Thibault C, Plewniak F, Bianchetti L, Tranchant C, Poch O, DiMauro S, Mandel JL, Barros MH, Hirano M, Koenig M. ADCK3, an ancestral kinase, is mutated in a form of recessive ataxia associated with coenzyme Q10 deficiency. Am J Hum Genet 2008;82:661-72. [PMID: 18319074 DOI: 10.1016/j.ajhg.2007.12.024] [Citation(s) in RCA: 223] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2007] [Revised: 12/15/2007] [Accepted: 12/28/2007] [Indexed: 01/17/2023] Open

Fuellen G. Homology and phylogeny and their automated inference. Naturwissenschaften 2008;95:469-81. [PMID: 18288471 DOI: 10.1007/s00114-008-0348-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2007] [Revised: 12/20/2007] [Accepted: 01/12/2008] [Indexed: 11/25/2022]

Kuntz S, Kieffer E, Bianchetti L, Lamoureux N, Fuhrmann G, Viville S. Tex19, a mammalian-specific protein with a restricted expression in pluripotent stem cells and germ line. Stem Cells 2007;26:734-44. [PMID: 18096721 DOI: 10.1634/stemcells.2007-0772] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Marzi S, Myasnikov AG, Serganov A, Ehresmann C, Romby P, Yusupov M, Klaholz BP. Structured mRNAs regulate translation initiation by binding to the platform of the ribosome. Cell 2007;130:1019-31. [PMID: 17889647 DOI: 10.1016/j.cell.2007.07.008] [Citation(s) in RCA: 105] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2007] [Revised: 05/18/2007] [Accepted: 07/06/2007] [Indexed: 01/04/2023]

Chalmel F, Léveillard T, Jaillard C, Lardenois A, Berdugo N, Morel E, Koehl P, Lambrou G, Holmgren A, Sahel JA, Poch O. Rod-derived Cone Viability Factor-2 is a novel bifunctional-thioredoxin-like protein with therapeutic potential. BMC Mol Biol 2007;8:74. [PMID: 17764561 PMCID: PMC2064930 DOI: 10.1186/1471-2199-8-74] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2007] [Accepted: 08/31/2007] [Indexed: 11/10/2022] Open

Gregory LA, Aguissa-Touré AH, Pinaud N, Legrand P, Gleizes PE, Fribourg S. Molecular basis of Diamond-Blackfan anemia: structure and function analysis of RPS19. Nucleic Acids Res 2007;35:5913-21. [PMID: 17726054 PMCID: PMC2034476 DOI: 10.1093/nar/gkm626] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Affiliation(s)

Lynn A. Gregory INSERM U869, Institut Européen de Chimie et Biologie, 2 rue Robert Escarpit Pessac, F-33607, Université Victor Segalen, Bordeaux 2, F-33076, Laboratoire de Biologie Moléculaire des eucaryotes (UMR5099) and Institut d’Exploration Fonctionnelle des Génomes (IFR109), CNRS and Université Paul Sabatier, 118 route de Narbonne F-31062 Toulouse and Synchrotron SOLEIL L’Orme des Merisiers, Saint Aubin- BP48, 91192 Gif sur Yvette Cedex, France
Almass-Houd Aguissa-Touré INSERM U869, Institut Européen de Chimie et Biologie, 2 rue Robert Escarpit Pessac, F-33607, Université Victor Segalen, Bordeaux 2, F-33076, Laboratoire de Biologie Moléculaire des eucaryotes (UMR5099) and Institut d’Exploration Fonctionnelle des Génomes (IFR109), CNRS and Université Paul Sabatier, 118 route de Narbonne F-31062 Toulouse and Synchrotron SOLEIL L’Orme des Merisiers, Saint Aubin- BP48, 91192 Gif sur Yvette Cedex, France
Noël Pinaud INSERM U869, Institut Européen de Chimie et Biologie, 2 rue Robert Escarpit Pessac, F-33607, Université Victor Segalen, Bordeaux 2, F-33076, Laboratoire de Biologie Moléculaire des eucaryotes (UMR5099) and Institut d’Exploration Fonctionnelle des Génomes (IFR109), CNRS and Université Paul Sabatier, 118 route de Narbonne F-31062 Toulouse and Synchrotron SOLEIL L’Orme des Merisiers, Saint Aubin- BP48, 91192 Gif sur Yvette Cedex, France
Pierre Legrand INSERM U869, Institut Européen de Chimie et Biologie, 2 rue Robert Escarpit Pessac, F-33607, Université Victor Segalen, Bordeaux 2, F-33076, Laboratoire de Biologie Moléculaire des eucaryotes (UMR5099) and Institut d’Exploration Fonctionnelle des Génomes (IFR109), CNRS and Université Paul Sabatier, 118 route de Narbonne F-31062 Toulouse and Synchrotron SOLEIL L’Orme des Merisiers, Saint Aubin- BP48, 91192 Gif sur Yvette Cedex, France
Pierre-Emmanuel Gleizes INSERM U869, Institut Européen de Chimie et Biologie, 2 rue Robert Escarpit Pessac, F-33607, Université Victor Segalen, Bordeaux 2, F-33076, Laboratoire de Biologie Moléculaire des eucaryotes (UMR5099) and Institut d’Exploration Fonctionnelle des Génomes (IFR109), CNRS and Université Paul Sabatier, 118 route de Narbonne F-31062 Toulouse and Synchrotron SOLEIL L’Orme des Merisiers, Saint Aubin- BP48, 91192 Gif sur Yvette Cedex, France *To whom correspondence should be addressed. 00 33 5 40 00 30 6300 33 5 40 00 30 68 Correspondence may also be addressed to Pierre-Emmanuel Gleizes. Tel/Fax: 00 33 5 61 33 59 26/58 86,
Sébastien Fribourg INSERM U869, Institut Européen de Chimie et Biologie, 2 rue Robert Escarpit Pessac, F-33607, Université Victor Segalen, Bordeaux 2, F-33076, Laboratoire de Biologie Moléculaire des eucaryotes (UMR5099) and Institut d’Exploration Fonctionnelle des Génomes (IFR109), CNRS and Université Paul Sabatier, 118 route de Narbonne F-31062 Toulouse and Synchrotron SOLEIL L’Orme des Merisiers, Saint Aubin- BP48, 91192 Gif sur Yvette Cedex, France *To whom correspondence should be addressed. 00 33 5 40 00 30 6300 33 5 40 00 30 68 Correspondence may also be addressed to Pierre-Emmanuel Gleizes. Tel/Fax: 00 33 5 61 33 59 26/58 86,

Collapse

Brown DP, Krishnamurthy N, Sjölander K. Automated protein subfamily identification and classification. PLoS Comput Biol 2007;3:e160. [PMID: 17708678 PMCID: PMC1950344 DOI: 10.1371/journal.pcbi.0030160] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2006] [Accepted: 06/25/2007] [Indexed: 11/22/2022] Open

Abstract

Function prediction by homology is widely used to provide preliminary functional annotations for genes for which experimental evidence of function is unavailable or limited. This approach has been shown to be prone to systematic error, including percolation of annotation errors through sequence databases. Phylogenomic analysis avoids these errors in function prediction but has been difficult to automate for high-throughput application. To address this limitation, we present a computationally efficient pipeline for phylogenomic classification of proteins. This pipeline uses the SCI-PHY (Subfamily Classification in Phylogenomics) algorithm for automatic subfamily identification, followed by subfamily hidden Markov model (HMM) construction. A simple and computationally efficient scoring scheme using family and subfamily HMMs enables classification of novel sequences to protein families and subfamilies. Sequences representing entirely novel subfamilies are differentiated from those that can be classified to subfamilies in the input training set using logistic regression. Subfamily HMM parameters are estimated using an information-sharing protocol, enabling subfamilies containing even a single sequence to benefit from conservation patterns defining the family as a whole or in related subfamilies. SCI-PHY subfamilies correspond closely to functional subtypes defined by experts and to conserved clades found by phylogenetic analysis. Extensive comparisons of subfamily and family HMM performances show that subfamily HMMs dramatically improve the separation between homologous and non-homologous proteins in sequence database searches. Subfamily HMMs also provide extremely high specificity of classification and can be used to predict entirely novel subtypes. The SCI-PHY Web server at http://phylogenomics.berkeley.edu/SCI-PHY/ allows users to upload a multiple sequence alignment for subfamily identification and subfamily HMM construction. Biologists wishing to provide their own subfamily definitions can do so. Source code is available on the Web page. The Berkeley Phylogenomics Group PhyloFacts resource contains pre-calculated subfamily predictions and subfamily HMMs for more than 40,000 protein families and domains at http://phylogenomics.berkeley.edu/phylofacts/.

Collapse

Legrand P, Pinaud N, Minvielle-Sébastia L, Fribourg S. The structure of the CstF-77 homodimer provides insights into CstF assembly. Nucleic Acids Res 2007;35:4515-22. [PMID: 17584787 PMCID: PMC1935011 DOI: 10.1093/nar/gkm458] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open

Vuilleumier R, Boeuf G, Fuentes M, Gehring WJ, Falcón J. Cloning and early expression pattern of two melatonin biosynthesis enzymes in the turbot (Scophthalmus maximus). Eur J Neurosci 2007;25:3047-57. [PMID: 17561818 DOI: 10.1111/j.1460-9568.2007.05578.x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Ranjith-Kumar CT, Miller W, Sun J, Xiong J, Santos J, Yarbrough I, Lamb RJ, Mills J, Duffy KE, Hoose S, Cunningham M, Holzenburg A, Mbow ML, Sarisky RT, Kao CC. Effects of single nucleotide polymorphisms on Toll-like receptor 3 activity and expression in cultured cells. J Biol Chem 2007;282:17696-705. [PMID: 17434873 DOI: 10.1074/jbc.m700209200] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Blast sampling for structural and functional analyses. BMC Bioinformatics 2007;8:62. [PMID: 17319945 PMCID: PMC1819393 DOI: 10.1186/1471-2105-8-62] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2006] [Accepted: 02/23/2007] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The post-genomic era is characterised by a torrent of biological information flooding the public databases. As a direct consequence, similarity searches starting with a single query sequence frequently lead to the identification of hundreds, or even thousands of potential homologues. The huge volume of data renders the subsequent structural, functional and evolutionary analyses very difficult. It is therefore essential to develop new strategies for efficient sampling of this large sequence space, in order to reduce the number of sequences to be processed. At the same time, it is important to retain the most pertinent sequences for structural and functional studies.

RESULTS

An exhaustive analysis on a large scale test set (284 protein families) was performed to compare the efficiency of four different sampling methods aimed at selecting the most pertinent sequences. These four methods sample the proteins detected by BlastP searches and can be divided into two categories: two customisable methods where the user defines either the maximal number or the percentage of sequences to be selected; two automatic methods in which the number of sequences selected is determined by the program. We focused our analysis on the potential information content of the sampled sets of sequences using multiple alignment of complete sequences as the main validation tool. The study considered two criteria: the total number of sequences in BlastP and their associated E-values. The subsequent analyses investigated the influence of the sampling methods on the E-value distributions, the sequence coverage, the final multiple alignment quality and the active site characterisation at various residue conservation thresholds as a function of these criteria.

CONCLUSION

The comparative analysis of the four sampling methods allows us to propose a suitable sampling strategy that significantly reduces the number of homologous sequences required for alignment, while at the same time maintaining the relevant information concerning the active site residues.

Collapse

Garnier N, Friedrich A, Bolze R, Bettler E, Moulinier L, Geourjon C, Thompson JD, Deléage G, Poch O. MAGOS: multiple alignment and modelling server. Bioinformatics 2006;22:2164-5. [PMID: 16820425 DOI: 10.1093/bioinformatics/btl349] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Thompson JD, Muller A, Waterhouse A, Procter J, Barton GJ, Plewniak F, Poch O. MACSIMS: multiple alignment of complete sequences information management system. BMC Bioinformatics 2006;7:318. [PMID: 16792820 PMCID: PMC1539025 DOI: 10.1186/1471-2105-7-318] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2006] [Accepted: 06/23/2006] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

In the post-genomic era, systems-level studies are being performed that seek to explain complex biological systems by integrating diverse resources from fields such as genomics, proteomics or transcriptomics. New information management systems are now needed for the collection, validation and analysis of the vast amount of heterogeneous data available. Multiple alignments of complete sequences provide an ideal environment for the integration of this information in the context of the protein family.

RESULTS

MACSIMS is a multiple alignment-based information management program that combines the advantages of both knowledge-based and ab initio sequence analysis methods. Structural and functional information is retrieved automatically from the public databases. In the multiple alignment, homologous regions are identified and the retrieved data is evaluated and propagated from known to unknown sequences with these reliable regions. In a large-scale evaluation, the specificity of the propagated sequence features is estimated to be >99%, i.e. very few false positive predictions are made. MACSIMS is then used to characterise mutations in a test set of 100 proteins that are known to be involved in human genetic diseases. The number of sequence features associated with these proteins was increased by 60%, compared to the features available in the public databases. An XML format output file allows automatic parsing of the MACSIM results, while a graphical display using the JalView program allows manual analysis.

CONCLUSION

MACSIMS is a new information management system that incorporates detailed analyses of protein families at the structural, functional and evolutionary levels. MACSIMS thus provides a unique environment that facilitates knowledge extraction and the presentation of the most pertinent information to the biologist. A web server and the source code are available at http://bips.u-strasbg.fr/MACSIMS/.

Collapse

Busso D, Poussin-Courmontagne P, Rosé D, Ripp R, Litt A, Thierry JC, Moras D. Structural genomics of eukaryotic targets at a laboratory scale. ACTA ACUST UNITED AC 2006;6:81-8. [PMID: 16211503 DOI: 10.1007/s10969-005-1909-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2004] [Accepted: 01/16/2005] [Indexed: 11/29/2022]

Müller SA, Pozidis C, Stone R, Meesters C, Chami M, Engel A, Economou A, Stahlberg H. Double hexameric ring assembly of the type III protein translocase ATPase HrcN. Mol Microbiol 2006;61:119-25. [PMID: 16824099 DOI: 10.1111/j.1365-2958.2006.05219.x] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Lam CS, Rastegar S, Strähle U. Distribution of cannabinoid receptor 1 in the CNS of zebrafish. Neuroscience 2005;138:83-95. [PMID: 16368195 DOI: 10.1016/j.neuroscience.2005.10.069] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2005] [Revised: 10/21/2005] [Accepted: 10/25/2005] [Indexed: 12/11/2022]

Abstract

The cannabinoid receptor 1 (Cb1) mediates the psychoactive effect of marijuana. In mammals, there is abundant evidence advocating the importance of cannabinoid signaling; activation of Cb1 exerts diverse functions, chiefly by its ability to modulate neurotransmission. Thus, much attention has been devoted to understand its role in health and disease and to evaluate its therapeutic potential. Here, we have cloned zebrafish cb1 and investigated its expression in developing and adult zebrafish brain. Sequence analysis showed that there is a high degree of conservation, especially in residues demonstrated to be critical for function in mammals. In situ hybridization revealed that zebrafish cb1 appears first in the preoptic area at 24 hours post-fertilization. Subsequently, transcripts are detected in the dorsal telencephalon, hypothalamus, pretectum and torus longitudinalis. A similar pattern of expression is recapitulated in the adult brain. While cb1 is intensively stained in the medial zone of the dorsal telencephalon, expression elsewhere is weak by comparison. In particular, localization of cb1 in the telencephalic periventricular matrix is suggestive of the involvement of Cb1 in neurogenesis, bearing strong resemblance in terms of expression and function to the proliferative mammalian hippocampal formation. In addition, a gradient-like expression of cb1 is detected in the torus longitudinalis, a teleost specific neural tissue. In relation to dopaminergic neurons in the diencephalic posterior tuberculum (considered to be the teleostean homologue of the mammalian midbrain dopaminergic system), both cb1 and tyrosine hydroxylase-expressing cells occupy non-overlapping domains. However there is evidence that they are co-localized in the caudal zone of the hypothalamus, implying a direct modulation of dopamine release in this particular region. Collectively, our data indicate the propensity of zebrafish cb1 to participate in multiple neurological processes.

Collapse

Bianchetti L, Thompson JD, Lecompte O, Plewniak F, Poch O. vALId: validation of protein sequence quality based on multiple alignment data. J Bioinform Comput Biol 2005;3:929-47. [PMID: 16078368 DOI: 10.1142/s0219720005001326] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2004] [Revised: 02/02/2005] [Accepted: 02/06/2005] [Indexed: 11/18/2022]

Myasnikov AG, Marzi S, Simonetti A, Giuliodori AM, Gualerzi CO, Yusupova G, Yusupov M, Klaholz BP. Conformational transition of initiation factor 2 from the GTP- to GDP-bound state visualized on the ribosome. Nat Struct Mol Biol 2005;12:1145-9. [PMID: 16284619 DOI: 10.1038/nsmb1012] [Citation(s) in RCA: 121] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2005] [Accepted: 10/03/2005] [Indexed: 11/08/2022]

Uhring M, Bey G, Lecompte O, Cavarelli J, Moras D, Poch O. Cloning, purification and crystallization of a Walker-type Pyrococcus abyssi ATPase family member. Acta Crystallogr Sect F Struct Biol Cryst Commun 2005;61:925-7. [PMID: 16511197 PMCID: PMC1991322 DOI: 10.1107/s174430910502868x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2005] [Accepted: 09/12/2005] [Indexed: 11/11/2022]

Tao AL, He SH. Cloning, expression, and characterization of pollen allergens from Humulus scandens (Lour) Merr and Ambrosia artemisiifolia L. Acta Pharmacol Sin 2005;26:1225-32. [PMID: 16174439 DOI: 10.1111/j.1745-7254.2005.00194.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Abstract

AIM

To clone the pollen allergen genes in Humulus scandens (Lour) Merr (LvCao in Chinese) and short ragweed (Ambrosia artemisiifolia L) for recombinant allergen production and immunotherapy.

METHODS

The allergen genes were selectively amplified in the weed pollen cDNA pool by using a special PCR profile, with the primers designed by a modeling procedure. Following truncated gene cloning and confirmation of the pollen source, unknown 3'cDNA ends were identified by using the 3'-RACE method. The gene function conferred by the full-length coding region was evaluated by a homologue search in the GenBank database. Recombinant proteins expressed in Escherichia coli pET-44 RosettaBlue cells were subsequently characterized by N-terminal end sequencing, IgE binding, and cross-reactivity.

RESULTS

Three full-length cDNAs were obtained in each weed. Multiple alignment analysis revealed that the deduced amino acid sequences were 83% identical to each other and 56%-90% identical to panallergen profilins from other species. Five recombinant proteins were abundantly expressed in non-fusion forms and were confirmed by using the N-terminal end sequence identity. Sera from patients who were allergic to A artemisiifolia reacted not only with rAmb a 8(D03) derived from A artemisiifolia, but also with recombinant protein rHum s 1(LCM9) derived from H scandens, which confirmed the allergenicity and cross-reactivity of the recombinant proteins from the 2 sources. Comparison of the degenerate primers used for truncated gene cloning with the full-length cDNA demonstrated that alternative nucleotide degeneracy occurred.

CONCLUSION

This study demonstrates a useful method for cloning homologous allergen genes across different species, particularly for little-studied species. The recombinant allergens obtained might be useful for the immunotherapeutic treatment of H scandens and/or A artemisiifolia pollen allergies.

Collapse

Schmitt E, Panvert M, Blanquet S, Mechulam Y. Structural Basis for tRNA-Dependent Amidotransferase Function. Structure 2005;13:1421-33. [PMID: 16216574 DOI: 10.1016/j.str.2005.06.016] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2005] [Revised: 06/20/2005] [Accepted: 06/30/2005] [Indexed: 11/26/2022]

Muller J, Oma Y, Vallar L, Friederich E, Poch O, Winsor B. Sequence and comparative genomic analysis of actin-related proteins. Mol Biol Cell 2005;16:5736-48. [PMID: 16195354 PMCID: PMC1289417 DOI: 10.1091/mbc.e05-06-0508] [Citation(s) in RCA: 90] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open

Vingadassalom D, Kolb A, Mayer C, Rybkine T, Collatz E, Podglajen I. An unusual primary sigma factor in the Bacteroidetes phylum. Mol Microbiol 2005;56:888-902. [PMID: 15853878 DOI: 10.1111/j.1365-2958.2005.04590.x] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Waagmeester A, Thompson J, Reyrat JM. Identifying sigma factors in Mycobacterium smegmatis by comparative genomic analysis. Trends Microbiol 2005;13:505-9. [PMID: 16140533 DOI: 10.1016/j.tim.2005.08.009] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2005] [Revised: 08/05/2005] [Accepted: 08/23/2005] [Indexed: 11/28/2022]

Thompson JD, Holbrook SR, Katoh K, Koehl P, Moras D, Westhof E, Poch O. MAO: a Multiple Alignment Ontology for nucleic acid and protein sequences. Nucleic Acids Res 2005;33:4164-71. [PMID: 16043635 PMCID: PMC1180671 DOI: 10.1093/nar/gki735] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Chalmel F, Lardenois A, Thompson JD, Muller J, Sahel JA, Léveillard T, Poch O. GOAnno: GO annotation based on multiple alignment. Bioinformatics 2005;21:2095-6. [PMID: 15647299 DOI: 10.1093/bioinformatics/bti252] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Biarrotte-Sorin S, Maillard AP, Delettré J, Sougakoff W, Arthur M, Mayer C. Crystal structures of Weissella viridescens FemX and its complex with UDP-MurNAc-pentapeptide: insights into FemABX family substrates recognition. Structure 2004;12:257-67. [PMID: 14962386 DOI: 10.1016/j.str.2004.01.006] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2003] [Revised: 10/28/2003] [Accepted: 10/28/2003] [Indexed: 11/16/2022]

Degot S, Le Hir H, Alpy F, Kedinger V, Stoll I, Wendling C, Seraphin B, Rio MC, Tomasetto C. Association of the breast cancer protein MLN51 with the exon junction complex via its speckle localizer and RNA binding module. J Biol Chem 2004;279:33702-15. [PMID: 15166247 DOI: 10.1074/jbc.m402754200] [Citation(s) in RCA: 87] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open

Klaholz BP, Myasnikov AG, Van Heel M. Visualization of release factor 3 on the ribosome during termination of protein synthesis. Nature 2004;427:862-5. [PMID: 14985767 DOI: 10.1038/nature02332] [Citation(s) in RCA: 113] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2003] [Accepted: 01/08/2004] [Indexed: 11/09/2022]

Thompson JD, Prigent V, Poch O. LEON: multiple aLignment Evaluation Of Neighbours. Nucleic Acids Res 2004;32:1298-307. [PMID: 14982955 PMCID: PMC390283 DOI: 10.1093/nar/gkh294] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2003] [Revised: 01/16/2004] [Accepted: 01/29/2004] [Indexed: 11/13/2022] Open

Duval D, Duval G, Kedinger C, Poch O, Boeuf H. The 'PINIT' motif, of a newly identified conserved domain of the PIAS protein family, is essential for nuclear retention of PIAS3L. FEBS Lett 2003;554:111-8. [PMID: 14596924 DOI: 10.1016/s0014-5793(03)01116-5] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]