1
|
Cázares-Raga FE, González-Lázaro M, Montero-Solís C, González-Cerón L, Zamudio F, Martínez-Barnetche J, Torres-Monzón JA, Ovilla-Muñoz M, Aguilar-Fuentes J, Rodríguez MH, de la Cruz Hernández-Hernández F. GP35 ANOAL, an abundant acidic glycoprotein of female Anopheles albimanus saliva. INSECT MOLECULAR BIOLOGY 2007; 16:187-98. [PMID: 17298558 DOI: 10.1111/j.1365-2583.2006.00712.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Salivary glands of female mosquitoes produce proteins, not completely described yet, that participate in carbohydrate and blood feeding. Here, we report an acidic glycoprotein of 35 kDa (GP35 ANOAL) secreted in the saliva of the malaria vector mosquito Anopheles albimanus. GP35 ANOAL is produced exclusively in the distal lateral lobes of adult female salivary glands, it has a pI of 4.45 and is negatively stained by regular silver stain. An 888 bp cDNA clone encoding a predicted product of 240 amino acids has a signal peptide, potential post-translational modification sites, and a disintegrin signature RGD. The GP35 ANOAL sequence depicts high similarities with the 30 kDa saliva allergen of Aedes aegypti, 30 kDa allergen-like hypothetical proteins, and GE-rich proteins present in several Anopheles species, as well as in Ae. albopictus and Culex pipiens quinquefasciatus. The function of this protein family is still unknown.
Collapse
Affiliation(s)
- F E Cázares-Raga
- Centro de Investigación Sobre Enfermedades Infecciosas, Instituto Nacional de Salud Pública, Cuernavaca, Morelos, Mexico
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Irlinger F, Bimet F, Delettre J, Lefèvre M, Grimont PAD. Arthrobacter bergerei sp. nov. and Arthrobacter arilaitensis sp. nov., novel coryneform species isolated from the surfaces of cheeses. Int J Syst Evol Microbiol 2005; 55:457-462. [PMID: 15653918 DOI: 10.1099/ijs.0.63125-0] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Fourteen isolates of two different bacterial species isolated from the surface of smear-ripened cheeses were found to exhibit many characteristics of the genus Arthrobacter. The isolates were aerobic, Gram-positive, catalase-positive, non-spore-forming and non-motile. The cell-wall peptidoglycan contained lysine, alanine and glutamic acid. rrs sequence analysis indicated that the new isolates Re117T and Ca106T are closely related to the Arthrobacter nicotianae group and showed highest sequence similarity (>98 %) to Arthrobacter nicotianae and Arthrobacter protophormiae. However, DNA-DNA hybridization studies indicated that the strains represented two novel genomic species within the genus Arthrobacter and did not belong to A. nicotianae or A. protophormiae (<43 % DNA-DNA relatedness). On the basis of the phylogenetic and phenotypic distinctiveness of the new isolates, these bacteria should be classified as two novel Arthrobacter species, for which the names Arthrobacter bergerei sp. nov. and Arthrobacter arilaitensis sp. nov. are proposed. Type strains have been deposited in culture collections as Arthrobacter bergerei Ca106T (=CIP 108036T=DSM 16367T) and Arthrobacter arilaitensis Re117T (=CIP 108037T=DSM 16368T).
Collapse
Affiliation(s)
- Françoise Irlinger
- Unité Mixte de Recherche Génie et Microbiologie des Procédés Alimentaires (UMR GMPA), Institut National de la Recherche Agronomique, 78850 Thiverval-Grignon, France
| | - François Bimet
- Collection de l'Institut Pasteur, Institut Pasteur, 75724 Paris Cedex 15, France
| | - Jérome Delettre
- Unité Mixte de Recherche Génie et Microbiologie des Procédés Alimentaires (UMR GMPA), Institut National de la Recherche Agronomique, 78850 Thiverval-Grignon, France
| | - Martine Lefèvre
- Unité de Biodiversité des bactéries pathogènes émergentes, INSERM U389, Institut Pasteur, 75724 Paris Cedex 15, France
| | - Patrick A D Grimont
- Unité de Biodiversité des bactéries pathogènes émergentes, INSERM U389, Institut Pasteur, 75724 Paris Cedex 15, France
| |
Collapse
|
3
|
Oliva ML, Souza-Pinto JC, Batista IF, Araujo MS, Silveira VF, Auerswald EA, Mentele R, Eckerskorn C, Sampaio MU, Sampaio CA. Leucaena leucocephala serine proteinase inhibitor: primary structure and action on blood coagulation, kinin release and rat paw edema. BIOCHIMICA ET BIOPHYSICA ACTA 2000; 1477:64-74. [PMID: 10708849 DOI: 10.1016/s0167-4838(99)00285-x] [Citation(s) in RCA: 94] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
A serine proteinase inhibitor isolated from Leucaena leucocephala seeds (LlTI) was purified to homogeneity by acetone fractionation, ion exchange chromatography, gel filtration and reverse phase chromatography (HPLC). SDS-PAGE indicated a protein with M(r) 20000 and two polypeptide chains (alpha-chain, M(r) 15000, and beta-chain, M(r) 5000), the sequence being determined by automatic Edman degradation and by mass spectroscopy. LlTI is a 174 amino acid residue protein which shows high homology to plant Kunitz inhibitors, especially those double chain proteins purified from the Mimosoideae subfamily. LlTI inhibits plasmin (K(i) 3.2 x 10(-10) M), human plasma kallikrein (K(i) 6.3 x 10(-9) M), trypsin (K(i) 2.5 x 10(-8) M) and chymotrypsin (K(i) 1.4 x 10(-8) M). Factor XIIa activity is inhibited but K(i) was not determined, and factor Xa, tissue kallikrein and thrombin are not inhibited by LlTI. The action of LlTI on enzymes that participate in the blood clotting extrinsic pathway is confirmed by the prolongation of activated partial thromboplastin time, used as clotting time assay. The inhibition of the fibrinolytic activity of plasmin was confirmed on the hydrolysis of fibrin plates. LlTI inhibits kinin release from high molecular weight kininogen by human plasma kallikrein in vitro and, administered intravenously, causes a decrease in paw edema induced by carrageenin or heat in male Wistar rats. In addition, lower concentrations of bradykinin were found in limb perfusion fluids of LlTI-treated rats.
Collapse
Affiliation(s)
- M L Oliva
- Departamento de Bioquímica, Universidade Federal de São Paulo, Rua Três de Maio 100, 04044-020, São Paulo, Brazil.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Koivula TT, Hantula J. Diversity within bacterial isolates hybridizing with Comamonas probe ppT. J Basic Microbiol 1997; 37:129-37. [PMID: 9151425 DOI: 10.1002/jobm.3620370207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Diversity within bacteria isolated from activated sludge and the Baltic sea and recognized by the rRNA targeted oligonucleotide probe ppT (designed by BRAUN-HOWLAND et al., 1993 to recognize C. testosteroni) was studied. The partial nucleotide sequences of 16S genes of the isolates revealed that the activated sludge and Baltic sea isolates each formed a separate group distinct from C. testosteroni. The analysis of phage and bacteriocin sensitivity as well as whole cell protein patterns revealed the same grouping, but in these characters also intragroup variation was observed. As a conclusion, neither of the studied groups were C. testosteroni, but they probably belong to two previously unknown species common in activated sludge or the Baltic sea.
Collapse
Affiliation(s)
- T T Koivula
- Department of Biosciences, University of Helsinki, Finland
| | | |
Collapse
|
5
|
Martin G, Keller W. Mutational analysis of mammalian poly(A) polymerase identifies a region for primer binding and catalytic domain, homologous to the family X polymerases, and to other nucleotidyltransferases. EMBO J 1996; 15:2593-603. [PMID: 8665867 PMCID: PMC450192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
We have tested deletion and substitution mutants of bovine poly(A) polymerase, and have identified a small region that overlaps with a nuclear localization signal and binds to the RNA primer. Systematic mutagenesis of carboxylic amino acids led to the identification of three aspartates that are essential for catalysis. Sequence and secondary structure comparisons of regions surrounding these aspartates with sequences of other polymerases revealed a significant homology to the palm structure of DNA polymerase beta, terminal deoxynucleotidyltransferase and DNA polymerase IV of Saccharomyces cerevisiae, all members of the family X of polymerases. This homology extends as far as cca: tRNA nucleotidyltransferase and streptomycin adenylyltransferase, an antibiotic resistance factor.
Collapse
Affiliation(s)
- G Martin
- Department of Cell Biology, University of Basel, Basel, Switzerland
| | | |
Collapse
|
6
|
Duchrow M, Schlüter C, Wohlenberg C, Flad HD, Gerdes J. Molecular characterization of the gene locus of the human cell proliferation-associated nuclear protein defined by monoclonal antibody Ki-67. Cell Prolif 1996. [DOI: 10.1111/j.1365-2184.1996.tb00090.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
|
7
|
Hantula J, Koivula TT, Luo C, Bamford DH. Bacterial diversity at surface water in three locations within the Baltic sea as revealed by culture-dependent molecular techniques. J Basic Microbiol 1996; 36:163-76. [PMID: 8676282 DOI: 10.1002/jobm.3620360303] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Diversity of culturable bacteria inhabiting the Baltic sea surface waters was studied in three separate locations. Based on electrophoretically separated whole cell proteins the number of operational taxonomic units (OTU) within each sampling location was high. Most of the OTUs were unique to single locations. Within each sampling location 8-22% of isolates belonged to a single OTU. Rarefaction analysis revealed that the bacterial community was more divergent at a polluted location than at clean areas. Also the most common OTUs were different in clean locations compared to the polluted site suggesting that both diversity and species composition of the bacterial community is greatly affected by pollution. The partial 16S rRNA gene sequences of the isolates of the most common OTUs are unique. Intragroup variation and an OTU-specific bacteriocin system was observed among the isolates of the second common OTU. The bacteriocin activity was linked to restriction fragment length polymorphism grouping, although additional variation correlating to geographic origin of isolates was observed.
Collapse
Affiliation(s)
- J Hantula
- Finnish Forest Research Institute, Vantaa, Finland
| | | | | | | |
Collapse
|
8
|
Sexton R, Gill PR, Callanan MJ, O'Sullivan DJ, Dowling DN, O'Gara F. Iron-responsive gene expression in Pseudomonas fluorescens M114: cloning and characterization of a transcription-activating factor, PbrA. Mol Microbiol 1995; 15:297-306. [PMID: 7746151 DOI: 10.1111/j.1365-2958.1995.tb02244.x] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
In response to iron limitation. Pseudomonas fluorescens M114 induces a number of genes including an iron-scavenging siderophore termed pseudobactin M114, its cognate receptor, PbuA, and a casein protease. A Tn5lacZ-induced mutant (M114FA1) was isolated that exhibits a pleiotropic phenotype and lacks the ability to express these iron-regulated genes. A cosmid clone was identified which complements this mutation. This clone is capable of activating a number of iron-regulated promoter fusion constructs from P. fluorescens M114 and Pseudomonas putida WCS358 and can also promote expression of these fusions in Escherichia coli. A series of insertion mutants was constructed by homologous recombination which were unable to transcribe the promoter fusions. DNA sequence analysis of the complementing region identified one open reading frame (ORF) termed pbrA (pseudobactin regulation activation) and the deduced amino acid sequence shows domains with significant homology to a number of ECF (extracytoplasmic function) transcriptional regulators of the sigma 70 sigma factor family, including fecl required for expression of the ferric dicitrate outer-membrane receptor protein of E. coli. Sequences upstream of the pbrA gene suggest that transcription of pbrA may also be iron regulated.
Collapse
Affiliation(s)
- R Sexton
- Department of Microbiology, University College Cork, Ireland
| | | | | | | | | | | |
Collapse
|
9
|
Abstract
The SRPDB (signal recognition particle database) provides aligned SRP RNA and protein sequences, annotated and phylogenetically ordered. This release includes 82 SRP RNAs (including 22 bacterial and 9 archaeal homologs) and a total of 20 protein sequences representing SRP9, SRP14, SRP19, SRP54, SRP68, and SRP72. The offerings also include representative RNA secondary structure diagrams.
Collapse
Affiliation(s)
- C Zwieb
- Department of Molecular Biology, University of Texas Health Science Center, Tyler 75710
| | | |
Collapse
|
10
|
Mewes HW, Doelz R, George DG. Sequence databases: an indispensible source for biotechnological research. J Biotechnol 1994; 35:239-56. [PMID: 7765061 DOI: 10.1016/0168-1656(94)90039-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The current status of the nucleic acid and protein sequence databases and their importance for the development of a potent biotechnological infrastructure is described. The basic technological background for the data collection and data distribution services are elucidated.
Collapse
Affiliation(s)
- H W Mewes
- Martinsried Institute for Protein Sequences (MIPS), Max-Planck-Institut für Biochemie, Germany
| | | | | |
Collapse
|
11
|
Pavesi A, Conterio F, Bolchi A, Dieci G, Ottonello S. Identification of new eukaryotic tRNA genes in genomic DNA databases by a multistep weight matrix analysis of transcriptional control regions. Nucleic Acids Res 1994; 22:1247-56. [PMID: 8165140 PMCID: PMC523650 DOI: 10.1093/nar/22.7.1247] [Citation(s) in RCA: 76] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
A linear method for the search of eukaryotic nuclear tRNA genes in DNA databases is described. Based on a modified version of the general weight matrix procedure, our algorithm relies on the recognition of two intragenic control regions known as A and B boxes, a transcription termination signal, and on the evaluation of the spacing between these elements. The scanning of the eukaryotic nuclear DNA database using this search algorithm correctly identified 933 of the 940 known tRNA genes (0.74% of false negatives). Thirty new potential tRNA genes were identified, and the transcriptional activity of two of them was directly verified by in vitro transcription. The total false positive rate of the algorithm was 0.014%. Structurally unusual tRNA genes, like those coding for selenocysteine tRNAs, could also be recognized using a set of rules concerning their specific properties, and one human gene coding for such tRNA was identified. Some of the newly identified tRNA genes were found in rather uncommon genomic positions: 2 in centromeric regions and 3 within introns. Furthermore, the presence of extragenically located B boxes in tRNA genes from various organisms could be detected through a specific subroutine of the standard search program.
Collapse
Affiliation(s)
- A Pavesi
- Department of Evolutionary Biology, University of Parma, Italy
| | | | | | | | | |
Collapse
|
12
|
Higgins DG, Labeit S, Gautel M, Gibson TJ. The evolution of titin and related giant muscle proteins. J Mol Evol 1994; 38:395-404. [PMID: 8007007 DOI: 10.1007/bf00163156] [Citation(s) in RCA: 28] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Titin and twitchin are giant proteins expressed in muscle. They are mainly composed of domains belonging to the fibronectin class III and immunoglobulin c2 families, repeated many times. In addition, both proteins have a protein kinase domain near the C-terminus. This paper explores the evolution of these and related muscle proteins in an attempt to determine the order of events that gave rise to the different repeat patterns and the order of appearance of the proteins. Despite their great similarity at the level of sequence organization, titin and twitchin diverged from each other at least as early as the divergence between vertebrates and nematodes. Most of the repeating units in titin and twitchin were estimated to derive from three original domains. Chicken smooth-muscle myosin light-chain kinase (smMLCK) also has a kinase domain, several immunoglobulin domains, and a fibronectin domain. From a comparison of the kinase domains, titin is predicted to have appeared first during the evolution of the family, followed by twitchin and with the vertebrate MLCKs last to appear. The so-called C-protein from chicken is also a member of this family but has no kinase domain. Its origin remains unclear but it most probably pre-dates the titin/twitchin duplication.
Collapse
Affiliation(s)
- D G Higgins
- European Molecular Biology Laboratory, Heidelberg, Germany
| | | | | | | |
Collapse
|
13
|
Hughes GJ, Frutiger S, Paquet N, Pasquali C, Sanchez JC, Tissot JD, Bairoch A, Appel RD, Hochstrasser DF. Human liver protein map: update 1993. Electrophoresis 1993; 14:1216-22. [PMID: 8313870 DOI: 10.1002/elps.11501401181] [Citation(s) in RCA: 63] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
This publication updates the reference human liver protein map. By microsequencing, 27 spots or 34 polypeptide chains were identified. The most abundant polypeptides detected on the silver stained liver map were key elements in major hepatic biochemical pathways. The new polypeptides and previously known proteins are listed in a table and/or labeled on the protein map, thus providing the 1993 reference human liver SWISS-2DPAGE database. SWISS-2DPAGE and the SWISS-PROT protein sequence databases are closely linked together through the use of common accession numbers.
Collapse
Affiliation(s)
- G J Hughes
- Medical Biochemistry Department, Geneva University
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Golaz O, Hughes GJ, Frutiger S, Paquet N, Bairoch A, Pasquali C, Sanchez JC, Tissot JD, Appel RD, Walzer C. Plasma and red blood cell protein maps: update 1993. Electrophoresis 1993; 14:1223-31. [PMID: 8313871 DOI: 10.1002/elps.11501401183] [Citation(s) in RCA: 109] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
This publication updates the reference plasma and red blood cell protein maps obtained with immobilized pH gradients. Seventeen polypeptide spots or chains were partially characterized by direct N-terminal sequencing or by sequencing of peptides obtained from enzymatic digestion. Additional new polypeptides and previously known proteins are listed in a table and/or labeled on the protein maps, thus providing the 1993 update of the human plasma and red blood cell two-dimensional gel SWISS-2DPAGE database. SWISS-2DPAGE and the SWISS-PROT protein sequence databases are closely linked together through the use of common accession numbers.
Collapse
Affiliation(s)
- O Golaz
- Medicine Department, Geneva University
| | | | | | | | | | | | | | | | | | | |
Collapse
|
15
|
Gibson TJ, Rice PM, Thompson JD, Heringa J. KH domains within the FMR1 sequence suggest that fragile X syndrome stems from a defect in RNA metabolism. Trends Biochem Sci 1993; 18:331-3. [PMID: 7694397 DOI: 10.1016/0968-0004(93)90068-x] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Affiliation(s)
- T J Gibson
- European Molecular Biology Laboratory, Heidelberg, Germany
| | | | | | | |
Collapse
|
16
|
Médigue C, Viari A, Hénaut A, Danchin A. Colibri: a functional data base for the Escherichia coli genome. Microbiol Rev 1993; 57:623-54. [PMID: 8246843 PMCID: PMC372929 DOI: 10.1128/mr.57.3.623-654.1993] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Several data libraries have been created to organize all the data obtained worldwide about the Escherichia coli genome. Because the known data now amount to more than 40% of the whole genome sequence, it has become necessary to organize the data in such a way that appropriate procedures can associate knowledge produced by experiments about each gene to its position on the chromosome and its relation to other relevant genes, for example. In addition, global properties of genes, affected by the introduction of new entries, should be present as appropriate description fields. A data base, implemented on Macintosh by using the data base management system 4th Dimension, is described. It is constructed around a core constituted by known contigs of E. coli sequences and links data collected in general libraries (unmodified) to data associated with evolving knowledge (with modifiable fields). Biologically significant results obtained through the coupling of appropriate procedures (learning or statistical data analysis) are presented. The data base is available through a 4th Dimension runtime and through FTP on Internet. It has been regularly updated and will be systematically linked to other E. coli data bases (M. Kroger, R. Wahl, G. Schachtel, and P. Rice, Nucleic Acids Res. 20(Suppl.):2119-2144, 1992; K. E. Rudd, W. Miller, C. Werner, J. Ostell, C. Tolstoshev, and S. G. Satterfield, Nucleic Acids Res. 19:637-647, 1991) in the near future.
Collapse
|
17
|
Affiliation(s)
- C M Rice
- European Molecular Biology Laboratory, Heidelberg, Germany
| | | | | | | | | |
Collapse
|
18
|
Affiliation(s)
- A Bairoch
- Department of Medical Biochemistry, University of Geneva, Switzerland
| |
Collapse
|
19
|
Barker WC, George DG, Mewes HW, Pfeiffer F, Tsugita A. The PIR-International databases. Nucleic Acids Res 1993; 21:3089-92. [PMID: 8332528 PMCID: PMC309735 DOI: 10.1093/nar/21.13.3089] [Citation(s) in RCA: 56] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
PIR-International is an association of macromolecular sequence data collection centers dedicated to fostering international cooperation as an essential element in the development of scientific databases. PIR-International is most noted for the Protein Sequence Database. This database originated in the early 1960's with the pioneering work of the late Margaret Dayhoff as a research tool for the study of protein evolution and intersequence relationships; it is maintained as a scientific resource, organized by biological concepts, using sequence homology as a guiding principle. PIR-International also maintains a number of other genomic, protein sequence, and sequence-related databases. The databases of PIR-International are made widely available. This paper briefly describes the architecture of the Protein Sequence Database, a number of other PIR-International databases, and mechanisms for providing access to and for distribution of these databases.
Collapse
Affiliation(s)
- W C Barker
- Protein Information Resource, National Biomedical Research Foundation, Georgetown University Medical Center
| | | | | | | | | |
Collapse
|
20
|
Abstract
The SRPDB (signal recognition particle database) provides annotated SRP RNA sequences from Eucaryotes and Archaea, phylogenetically ordered and aligned with their bacterial equivalents. We also make available representative RNA secondary structure diagrams, where each base pair is proven by comparative sequence analysis. New to this release are 17 SRP RNA sequences (a total of 64 sequences) and alignments of proteins SRP19 and SRP54 with their RNA binding sites.
Collapse
Affiliation(s)
- N Larsen
- Department of Microbiology, University of Illinois, Urbana 61801
| | | |
Collapse
|
21
|
Pongor S, Skerl V, Cserzö M, Hátsági Z, Simon G, Bevilacqua V. The SBASE protein domain library, release 2.0: a collection of annotated protein sequence segments. Nucleic Acids Res 1993; 21:3111-5. [PMID: 8332532 PMCID: PMC309739 DOI: 10.1093/nar/21.13.3111] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
SBASE 2.0 is the second release of SBASE, a collection of annotated protein domain sequences. SBASE entries represent various structural, functional, ligand-binding and topogenic segments of proteins [Pongor, S. et al. (1993) Prot. Eng., in press]. This release contains 34,518 entries provided with standardized names and it is cross-referenced to the major protein and nucleic acid databanks as well as to the PROSITE catalog of protein sequence patterns [Bairoch, A. (1992) Nucl. Acids Res., 20 suppl, 2013-2018]. SBASE can be used for establishing domain homologies using different database-search tools such as FASTA [Lipman and Pearson (1985) Science, 227, 1436-1441], FASTDB [Brutlag et al. (1990) Comp. Appl. Biosci., 6, 237-245] or BLAST3 [Altschul and Lipman (1990) Proc. Natl. Acad. Sci. USA, 87, 5509-5513] which is especially useful in the case of loosely defined domain types for which efficient consensus patterns can not be established. SBASE 2.0 and a set of search and retrieval tools are freely available on request to the authors or by anonymous 'ftp' file transfer from mean value of ftp.icgeb.trieste.it.
Collapse
Affiliation(s)
- S Pongor
- International Centre for Genetic Engineering and Biotechnology, Area Science Park, Trieste, Italy
| | | | | | | | | | | |
Collapse
|
22
|
Larsen N, Olsen GJ, Maidak BL, McCaughey MJ, Overbeek R, Macke TJ, Marsh TL, Woese CR. The ribosomal database project. Nucleic Acids Res 1993; 21:3021-3. [PMID: 8332524 PMCID: PMC309730 DOI: 10.1093/nar/21.13.3021] [Citation(s) in RCA: 405] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The Ribosomal Database Project (RDP) is a curated database that offers ribosome data along with related programs and services. The offerings include phylogenetically ordered alignments of ribosomal RNA (rRNA) sequences, derived phylogenetic trees, rRNA secondary structure diagrams and various software packages for handling, analyzing and displaying alignments and trees. The data are available via ftp and electronic mail. Certain analytic services are also provided by the electronic mail server.
Collapse
Affiliation(s)
- N Larsen
- Department of Microbiology, University of Illinois, Urbana 61801
| | | | | | | | | | | | | | | |
Collapse
|
23
|
Bairoch A, Boeckmann B. The SWISS-PROT protein sequence data bank, recent developments. Nucleic Acids Res 1993; 21:3093-6. [PMID: 8332529 PMCID: PMC309736 DOI: 10.1093/nar/21.13.3093] [Citation(s) in RCA: 123] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Affiliation(s)
- A Bairoch
- Department of Medical Biochemistry, University of Geneva, Switzerland
| | | |
Collapse
|
24
|
Brown CM, Dalphin ME, Stockwell PA, Tate WP. The translational termination signal database. Nucleic Acids Res 1993; 21:3119-23. [PMID: 8332534 PMCID: PMC309741 DOI: 10.1093/nar/21.13.3119] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The Translational Termination Database (TransTerm) consists of the immediate context sequences around the natural termination codons from 45 organisms, and summary tables. The influence of termination codon context on their effectivness as stop signals has been widely documented. The SPECIES--TRI.DAT table shows trinucleotide stop codon usage in each organism and for comparison the occurrence of these sequences in the noncoding region. The SPECIES--TETRA.DAT table contains is a similar table of tetranucleotide stop signal usage. The database is available from EMBL.
Collapse
Affiliation(s)
- C M Brown
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | | | | | | |
Collapse
|
25
|
Fickett JW, Guigó R. Estimation of protein coding density in a corpus of DNA sequence data. Nucleic Acids Res 1993; 21:2837-44. [PMID: 8332493 PMCID: PMC309664 DOI: 10.1093/nar/21.12.2837] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
A number of experimental methods have been reported for estimating the number of genes in a genome, or the closely related coding density of a genome, defined as the fraction of base pairs in codons. Recently, DNA sequence data representative of the genome as a whole have become available for several organisms, making the problem of estimating coding density amenable to sequence analytic methods. Estimates of coding density for a single genome vary widely, so that methods with characterized error bounds have become increasingly desirable. We present a method to estimate the protein coding density in a corpus of DNA sequence data, in which a 'coding statistic' is calculated for a large number of windows of the sequence under study, and the distribution of the statistic is decomposed into two normal distributions, assumed to be the distributions of the coding statistic in the coding and noncoding fractions of the sequence windows. The accuracy of the method is evaluated using known data and application is made to the yeast chromosome III sequence and to C. elegans cosmid sequences. It can also be applied to fragmentary data, for example a collection of short sequences determined in the course of STS mapping.
Collapse
Affiliation(s)
- J W Fickett
- Center for Human Genome Studies, Los Alamos National Laboratory, NM 87545
| | | |
Collapse
|
26
|
Raschke E. Comprehensive restriction enzyme lists to update any DNA sequence computer program. GENETIC ANALYSIS, TECHNIQUES AND APPLICATIONS 1993; 10:49-60. [PMID: 8217304 DOI: 10.1016/1050-3862(93)90045-k] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Restriction enzyme lists are presented for the practical working geneticist to update any DNA computer program. These lists combine formerly scattered information and contain all presently known restriction enzymes with a unique recognition sequence, a cut site, or methylation (in)sensitivity. The lists are in the shortest possible form to also be functional with small DNA computer programs, and will produce clear restriction maps without any redundancy or loss of information. The lists discern between commercial and noncommercial enzymes, and prototype enzymes and different isoschizomers are cross-referenced. Differences in general methylation sensitivities and (in)sensitivities against Dam and Dcm methylases of Escherichia coli are indicated. Commercial methylases and intron-encoded endonucleases are included. An address list is presented to contact commercial suppliers. The lists are constantly updated and available in electronic form as pure US ASCII files, and in formats for the DNA computer programs DNA-Strider for Apple Macintosh, and DNAsis for IBM personal computers or compatibles via e-mail from the internet address: NETSERV@EMBL-HEIDELBERG.DE by sending only the message HELP RELIBRARY.
Collapse
Affiliation(s)
- E Raschke
- Botanical Institute, University of Bonn, Germany
| |
Collapse
|
27
|
|
28
|
Butler B. An overview of computer software developed to search biological sequence databases. ANTISENSE RESEARCH AND DEVELOPMENT 1993; 3:243-52. [PMID: 8286925 DOI: 10.1089/ard.1993.3.243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
The scientific community has established a number of databases to receive and maintain the abundant biological sequence information being generated by research investigators worldwide. For researchers in an increasing number of biological disciplines, the information stored in these databases has become an invaluable tool in their daily research endeavors. This article reviews the organization of the largest nucleic acid and protein sequence databases and discusses some of the commonly available computer software that has been developed for searching them.
Collapse
Affiliation(s)
- B Butler
- Genetics Computer Group, Inc., Madison, Wisconsin 53711
| |
Collapse
|
29
|
Abstract
A number of methods for recognizing protein coding genes in DNA sequence have been published over the last 13 years, and new, more comprehensive algorithms, drawing on the repertoire of existing techniques, continue to be developed. To optimize continued development, it is valuable to systematically review and evaluate published techniques. At the core of most gene recognition algorithms is one or more coding measures--functions which produce, given any sample window of sequence, a number or vector intended to measure the degree to which a sample sequence resembles a window of 'typical' exonic DNA. In this paper we review and synthesize the underlying coding measures from published algorithms. A standardized benchmark is described, and each of the measures is evaluated according to this benchmark. Our main conclusion is that a very simple and obvious measure--counting oligomers--is more effective than any of the more sophisticated measures. Different measures contain different information. However there is a great deal of redundancy in the current suite of measures. We show that in future development of gene recognition algorithms, attention can probably be limited to six of the twenty or so measures proposed to date.
Collapse
Affiliation(s)
- J W Fickett
- Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, NM 87545
| | | |
Collapse
|