1
|
|
2
|
Rafalski JA, Wiewiórowski M, Söll D. Organization of ribosomal DNA in yellow lupine (Lupinus luteus) and sequence of the 5.8 S RNA gene. FEBS Lett 2001. [DOI: 10.1016/0014-5793(83)80388-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
3
|
Davis CA, Wyatt GR. Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor. Nucleic Acids Res 1989; 17:5579-86. [PMID: 2762148 PMCID: PMC318180 DOI: 10.1093/nar/17.14.5579] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes.
Collapse
Affiliation(s)
- C A Davis
- Department of Biology, Queen's University, Kingston, Ontario, Canada
| | | |
Collapse
|
4
|
Szankasi P, Heyer WD, Schuchert P, Kohli J. DNA sequence analysis of the ade6 gene of Schizosaccharomyces pombe. Wild-type and mutant alleles including the recombination host spot allele ade6-M26. J Mol Biol 1988; 204:917-25. [PMID: 3221399 DOI: 10.1016/0022-2836(88)90051-4] [Citation(s) in RCA: 131] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The gene ade6 is located on chromosome III of the fission yeast Schizosaccharomyces pombe. It codes for the enzyme phosphoribosylaminoimidazole carboxylase involved in purine biosynthesis. A DNA fragment of 3043 nucleotides has been sequenced. It complements ade6 mutations when present on plasmids. An uninterrupted open reading frame of 552 amino acid residues was identified. A method for the cloning of chromosomal mutations by repair of gapped replication vectors in vivo has been developed. Twelve ade6 mutant alleles have been isolated. The sequence alterations of four mutant alleles have been determined. Among them are the ade6-M26 recombination hot spot mutation and the nearby ade6-M375 control mutation. Both are G to T base substitutions, converting adjacent glycine codons to TGA termination codons. They are suppressed by defined tRNA nonsense suppressors of the UGA type. The ade6-M26 mutation leads to a tenfold increase of the occurrence of conversion tetrads in comparison with other ade6 mutations. Possible explanations for the M26-induced increase of recombination frequency are discussed in relation to specific features of the nucleotide sequence identified in the region of the M26 mutation.
Collapse
Affiliation(s)
- P Szankasi
- Institute of General Microbiology, University of Bern, Switzerland
| | | | | | | |
Collapse
|
5
|
Stalder J, Wirthmüller U, Beck J, Gruber A, Meyerhof W, Knöchel W, Weber R. Primary structure and evolutionary relationship between the adult alpha-globin genes and their 5'-flanking regions of Xenopus laevis and Xenopus tropicalis. J Mol Evol 1988; 28:64-71. [PMID: 3148743 DOI: 10.1007/bf02143498] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
To investigate the evolution of globin genes in the genus Xenopus, we have determined the primary structure of the related adult alpha I- and alpha II-globin genes of X. laevis and of the adult alpha-globin gene of X. tropicalis, including their 5'-flanking regions. All three genes are comprised of three exons and two introns at homologous positions. The exons are highly conserved and code for 141 amino acids. By contrast, the corresponding introns vary in length and show considerable divergence. Comparison of 900 bp of the 5'-flanking region revealed that the X. tropicalis gene contains a conserved proximal 310-bp promoter sequence, comprised of the canonical TATA and CCAAT motifs at homologous positions, and five conserved elements in the same order and at similar positions as previously shown for the corresponding genes of X. laevis. We therefore conclude that these conserved upstream elements may represent regulatory sequences for cell-specific regulation of the adult Xenopus globin genes.
Collapse
Affiliation(s)
- J Stalder
- Zoologisches Institut, Abteilung für Zell- und Entwicklungsbiologie, Bern, Switzerland
| | | | | | | | | | | | | |
Collapse
|
6
|
Abstract
We discuss the statistical significance of local similarities found between DNA sequences, and illustrate the procedure with reference to the Queen and Korn algorithm. If the longest similarity found for two sequences has length L, this length is said to be significant at the 5% level if there is a probability of no more than 0.05 of finding a length of L or greater between a pair of sequences consisting of randomly chosen bases with the same overall base frequencies. The distribution of longest lengths is related to that of lengths from any particular pair of starting positions on the two sequences. For our implementation of the Queen and Korn algorithm, this latter distribution is constructed by combining the five different blocks of bases that may be added to extend a similarity. A table is given to assess the significance of longest similarities in sequences of length up to 1000 bases. Quite long similarities are expected to occur by chance alone. The critical values we calculate for assessing significance are preferable to expected numbers of similarities used by some commercial computer packages.
Collapse
Affiliation(s)
- L D Brooks
- Department of Statistics, North Carolina State University, Raleigh 27695-8203
| | | | | |
Collapse
|
7
|
Locke J, White BN, Wyatt GR. Cloning and 5' end nucleotide sequences of two juvenile hormone-inducible vitellogenin genes of the African migratory locust. DNA (MARY ANN LIEBERT, INC.) 1987; 6:331-42. [PMID: 2820677 DOI: 10.1089/dna.1987.6.331] [Citation(s) in RCA: 52] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Screening of lambda libraries prepared with locust (Locusta migratoria) genomic DNA yielded overlapping clones containing two complete vitellogenin genes, designated A and B, along with extensive 3'- and 5'-flanking sequences. Genes A and B are about 12 kb and 10.5 kb long, respectively, and both hybridize on Northern blots with 6300-nucleotide RNAs previously identified as vitellogenin mRNAs. Each gene contains a large intron near the 5' end and must also contain other introns. Repetitive "Lm1 elements," similar to the human Alu repetitive DNA elements, and other repeated sequences occur within the large intron and in the 3'- and 5'-flanking regions of each gene. Southern blots showed no cross-hybridization between the major portions of mRNA-coding sequences of genes A and B. DNA sequences have been determined for the 5' terminal exons and flanking regions of both genes. Both genes contain short (56-58 nucleotide) 5' leader exons, which, unlike the rest of the coding region, are highly conserved in sequence (87% identical) and code for amino acids that resemble the amino-terminal sequences of vertebrate vitellogenins. The upstream 5'-flanking sequences of the two genes contain several short stretches of similarity that may be involved in juvenile hormone regulated expression.
Collapse
Affiliation(s)
- J Locke
- Department of Biology, Queen's University, Kingston, Ontario, Canada
| | | | | |
Collapse
|
8
|
Molecular cloning and structural analysis of murine thymidine kinase genomic and cDNA sequences. Mol Cell Biol 1986. [PMID: 3018504 DOI: 10.1128/mcb.5.11.3149] [Citation(s) in RCA: 67] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Two functional cytosolic thymidine kinase (tk) cDNA clones were isolated from a mouse L-cell library. An RNA blot analysis indicated that one of these clones contains a nearly full-length tk sequence and that LTK- cells contain little or no TK message. The nucleotide sequences of both clones were determined, and the functional mouse tk cDNA contains 1,156 base pairs. An analysis of the sequence implied that there is an untranslated 32-nucleotide region at the 5' end of the mRNA, followed by an open reading frame of 699 nucleotides. The 3' untranslated region is 422 nucleotides long. Thus, the gene codes for a protein containing 233 amino acids, with a molecular weight of 25,873. A comparison of the coding sequences of the mouse tk cDNA with the human and chicken tk genes revealed about 86 and 70% homology, respectively. We also isolated the tk gene from a mouse C57BL/10J cosmid library. The structural organization was determined by restriction mapping, Southern blotting, and heteroduplex analysis of the cloned sequences, in combination with a mouse tk cDNA. The tk gene spans approximately 11 kilobases and contains at least five introns. Southern blot analysis revealed that this gene is deleted in mouse LTK- cells, consistent with the inability of these cells to synthesize TK message. This analysis also showed that tk-related sequences are present in the genomes of several mouse strains, as well as in LTK- cells. These segments may represent pseudogenes.
Collapse
|
9
|
Stalder J, Meyerhof W, Wirthmüller U, Gruber A, Wyler T, Knöchel W, Weber R. Conserved sequences and cell-specific DNase I hypersensitive sites upstream from the co-ordinately expressed alpha I- and alpha II-globin genes of Xenopus laevis. J Mol Biol 1986; 188:119-28. [PMID: 3014154 DOI: 10.1016/0022-2836(86)90298-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The globin gene family of Xenopus laevis comprises pairs of closely related genes that are arranged in two clusters, each pair of genes being co-ordinately and stage-specifically expressed. To get information on putative regulatory elements, we compared the DNA sequences and the chromatin conformation 5' to the co-ordinately expressed adult alpha-globin genes. Sequence analysis revealed a relatively conserved region from the cap site up to position -289, and further upstream seven distinct boxes of homology, separated by more diverged sequences or deletions/insertions. The homology boxes comprise 22 to 194 base-pairs showing 78 to 95% homology. Analysis of chromatin conformation showed that DNase I preferentially cuts the upstream region of both genes at similar positions, 5' to the T-A-T-A and the C-C-A-A-T boxes, only in chromatin of adult erythroblasts and erythrocytes, where adult globin genes are expressed, but not in chromatin of adult liver cells or larval erythrocytes, where these genes are silent. This suggests that cell- and stage-specific activation of these genes coincides with specific changes in chromatin conformation within the proximal upstream region. No difference was found in the nucleotide sequence within the DNase I hypersensitive region proximal to the adult alpha 1-globin gene in DNA from embryonic cells, in which this gene is inactive, and adult erythrocytes, expressing this gene.
Collapse
|
10
|
Lin PF, Lieberman HB, Yeh DB, Xu T, Zhao SY, Ruddle FH. Molecular cloning and structural analysis of murine thymidine kinase genomic and cDNA sequences. Mol Cell Biol 1985; 5:3149-56. [PMID: 3018504 PMCID: PMC369130 DOI: 10.1128/mcb.5.11.3149-3156.1985] [Citation(s) in RCA: 28] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Two functional cytosolic thymidine kinase (tk) cDNA clones were isolated from a mouse L-cell library. An RNA blot analysis indicated that one of these clones contains a nearly full-length tk sequence and that LTK- cells contain little or no TK message. The nucleotide sequences of both clones were determined, and the functional mouse tk cDNA contains 1,156 base pairs. An analysis of the sequence implied that there is an untranslated 32-nucleotide region at the 5' end of the mRNA, followed by an open reading frame of 699 nucleotides. The 3' untranslated region is 422 nucleotides long. Thus, the gene codes for a protein containing 233 amino acids, with a molecular weight of 25,873. A comparison of the coding sequences of the mouse tk cDNA with the human and chicken tk genes revealed about 86 and 70% homology, respectively. We also isolated the tk gene from a mouse C57BL/10J cosmid library. The structural organization was determined by restriction mapping, Southern blotting, and heteroduplex analysis of the cloned sequences, in combination with a mouse tk cDNA. The tk gene spans approximately 11 kilobases and contains at least five introns. Southern blot analysis revealed that this gene is deleted in mouse LTK- cells, consistent with the inability of these cells to synthesize TK message. This analysis also showed that tk-related sequences are present in the genomes of several mouse strains, as well as in LTK- cells. These segments may represent pseudogenes.
Collapse
|
11
|
Davis CA, Riddell DC, Higgins MJ, Holden JJ, White BN. A gene family in Drosophila melanogaster coding for trypsin-like enzymes. Nucleic Acids Res 1985; 13:6605-19. [PMID: 2414727 PMCID: PMC321980 DOI: 10.1093/nar/13.18.6605] [Citation(s) in RCA: 80] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
We have isolated a clustered gene family in D. melanogaster that codes for trypsin-like enzymes. The gene family has been localized to 47D-F by in situ hybridization to polytene chromosomes. The four genes in the family are transcribed in alternating orientations, and code for 1000 nt mRNAs. Transcripts are present at all stages of the life cycle. In situ hybridization to mRNA in tissue sections of third instar larvae showed that transcripts were restricted to the mid-gut. One gene was sequenced. The translated amino acid sequence of the proposed active enzyme is 42% homologous to bovine trypsin. Regions of functional importance are more strongly conserved. These include the active site residues asp102, his57, ser195, and the residue asp189 which is reputed to bind the basic residue at the substrate cleavage site. The activation peptide is not homologous to that of most vertebrate trypsins, suggesting a modified activation mechanism. The sequence further strengthens the hypothesis that the chymotrypsin cleavage specificity developed separately in the vertebrates and invertebrates.
Collapse
|
12
|
Krupp G, Thurianx P, Willis I, Gamulin V, Söll D. First identification of an amber nonsense mutation in Schizosaccharomyces pombe: major differences in the efficiency of homologous versus heterologous yeast suppressor tRNA genes. MOLECULAR & GENERAL GENETICS : MGG 1985; 201:82-7. [PMID: 3903436 DOI: 10.1007/bf00397990] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
In Saccharomyces cerevisiae ochre and opal, as well as amber mutations are known, whereas in the fission yeast Schizosaccharomyces pombe no amber alleles have been described. We have characterized trp1-566, an amber allele in the trp1 locus of S. pombe. The identification of trp1-566 as an amber allele is based on the following results: (a) The nonsense allele can be converted to an ochre allele by nitrosoguanidine mutagenesis. (b) trp1-566 is suppressed by a bona fide S. pombe amber suppressor tRNA, supSI. The supSI gene was obtained by primer-directed in vitro mutagenesis of a tRNASer from S. pombe. Unexpectedly, an S. cerevisiae amber suppressor tRNASer, supR21, transformed into S. pombe, failed to suppress trp1-566. Northern analysis of S. pombe transformants, containing supRL1 or S. cerevisiae tRNALeu or tRNATyr genes reveals that these genes are not transcribed in the fission yeast. As an additional tool for the analysis of nonsense mutations in S. pombe, we obtained by nitrosoguanidine mutagenesis two unlinked amber suppressor alleles, sup13 and sup14, which act on trp1-566.
Collapse
|
13
|
Sumner-Smith M, Rafalski JA, Sugiyama T, Stoll M, Söll D. Conservation and variability of wheat alpha/beta-gliadin genes. Nucleic Acids Res 1985; 13:3905-916. [PMID: 3839304 PMCID: PMC341285 DOI: 10.1093/nar/13.11.3905] [Citation(s) in RCA: 74] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
We have sequenced two genomic clones for wheat alpha/beta-gliadin storage protein genes. Comparison with a known sequence reveals close homology between the three and confirms the previously suspected evolutionary relatedness of members of this gliadin family. The coding region can be divided into six domains. Two unusual structures were found within this region: (i) The P-boxes which are composed of 12 codons, six of which are for proline, that are tandemly repeated four or five times; and (ii) Two polyglutamine stretches which consist of 18-22 tandemly repeated glutamine codons in one case, and 7-28 in the second. Analysis of the P-box structures revealed that certain mutations were probably present in the hypothetical ancestral alpha/beta-gliadin gene prior to gene multiplication. None of the genes have introns. All of the genes appear to contain typical eukaryotic promoters and also possess the double polyadenylation signal of plants.
Collapse
|
14
|
The phi 80 and P22 attachment sites. Primary structure and interaction with Escherichia coli integration host factor. J Biol Chem 1985. [DOI: 10.1016/s0021-9258(18)89288-2] [Citation(s) in RCA: 126] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
|
15
|
Transcription factor binding is limited by the 5'-flanking regions of a Drosophila tRNAHis gene and a tRNAHis pseudogene. Mol Cell Biol 1985. [PMID: 6570190 DOI: 10.1128/mcb.4.12.2714] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We determined the sequence of a Drosophila tRNA gene cluster containing a tRNAHis gene and a tRNAHis pseudogene in close proximity on the same DNA strand. The pseudogene contains eight consecutive base pairs different from the region of the bona fide gene which codes for the 3' portion of the anticodon stem of tRNAHis. The tRNAHis gene is transcribed efficiently in Drosophila Kc cell extract, whereas the pseudogene is not. The pseudogene is also a much poorer competitor than the real gene in a stable transcription complex formation assay, even though the sequence alteration in the pseudogene does not affect the sequence or spacing of the putative internal transcription control regions. Recombinant clones were constructed in which the 5'-flanking regions are exchanged. The transcription efficiencies and competitive abilities of the recombinant clones resemble those of the genes from which the 5' flank was derived; for example, the tRNAHis pseudogene with the 5'-flanking sequence of the tRNAHis gene is now efficiently transcribed. Deletion analysis of the pseudogene 5' flank failed to uncover an inhibitory element. Deletion analysis of the real gene showed very high dependence on the presence of the wild-type 5'-flanking sequence for factor binding to the internal control regions and stable complex formation. The 5'-flanking sequence of a Drosophila tRNAArg gene active in the Drosophila Kc cell extract does not restore transcriptional activity or stable complex formation. The tRNAHis gene and pseudogene behave atypically in HeLa cell extract. Both genes compete for HeLa transcription factors, but neither of them is efficiently transcribed. Removal of the 5'-flanking sequences of each gene and replacement with various sequences, including the tRNAArg gene 5' flank, does not allow increased transcription in HeLa cell extract.
Collapse
|
16
|
Stark MJ, Gourse RL, Jemiolo DK, Dahlberg AE. A mutation in an Escherichia coli ribosomal RNA operon that blocks the production of precursor 23 S ribosomal RNA by RNase III in vivo and in vitro. J Mol Biol 1985; 182:205-16. [PMID: 2582139 DOI: 10.1016/0022-2836(85)90339-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
We have isolated on a multicopy plasmid a mutant rrnB ribosomal RNA operon containing a 130 base-pair deletion immediately preceding the 23 S rRNA gene. The deletion shortens by just three base-pairs the 26 base-pair complementarity of the sequences that flank the 23 S rRNA gene, and which normally form an RNase III cleavage site in the rrnB primary transcript. Both in vivo and in vitro, cleavage at the altered RNase III site was almost completely abolished by the mutation. Our results therefore indicate that even a small perturbation of the double-stranded region normally recognized by RNase III strongly inhibits the action of the enzyme.
Collapse
|
17
|
Cooley L, Schaack J, Burke DJ, Thomas B, Söll D. Transcription factor binding is limited by the 5'-flanking regions of a Drosophila tRNAHis gene and a tRNAHis pseudogene. Mol Cell Biol 1984; 4:2714-22. [PMID: 6570190 PMCID: PMC369281 DOI: 10.1128/mcb.4.12.2714-2722.1984] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
We determined the sequence of a Drosophila tRNA gene cluster containing a tRNAHis gene and a tRNAHis pseudogene in close proximity on the same DNA strand. The pseudogene contains eight consecutive base pairs different from the region of the bona fide gene which codes for the 3' portion of the anticodon stem of tRNAHis. The tRNAHis gene is transcribed efficiently in Drosophila Kc cell extract, whereas the pseudogene is not. The pseudogene is also a much poorer competitor than the real gene in a stable transcription complex formation assay, even though the sequence alteration in the pseudogene does not affect the sequence or spacing of the putative internal transcription control regions. Recombinant clones were constructed in which the 5'-flanking regions are exchanged. The transcription efficiencies and competitive abilities of the recombinant clones resemble those of the genes from which the 5' flank was derived; for example, the tRNAHis pseudogene with the 5'-flanking sequence of the tRNAHis gene is now efficiently transcribed. Deletion analysis of the pseudogene 5' flank failed to uncover an inhibitory element. Deletion analysis of the real gene showed very high dependence on the presence of the wild-type 5'-flanking sequence for factor binding to the internal control regions and stable complex formation. The 5'-flanking sequence of a Drosophila tRNAArg gene active in the Drosophila Kc cell extract does not restore transcriptional activity or stable complex formation. The tRNAHis gene and pseudogene behave atypically in HeLa cell extract. Both genes compete for HeLa transcription factors, but neither of them is efficiently transcribed. Removal of the 5'-flanking sequences of each gene and replacement with various sequences, including the tRNAArg gene 5' flank, does not allow increased transcription in HeLa cell extract.
Collapse
|
18
|
McClelland A, Kühn LC, Ruddle FH. The human transferrin receptor gene: genomic organization, and the complete primary structure of the receptor deduced from a cDNA sequence. Cell 1984; 39:267-74. [PMID: 6094009 DOI: 10.1016/0092-8674(84)90004-7] [Citation(s) in RCA: 276] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Heteroduplex analysis shows that the transferrin receptor gene contains at least 19 distinct coding sequences distributed over 31 kb of genomic DNA. The nucleotide sequence of these coding regions has been determined from a cDNA clone. The sequence contains a single complete open reading frame of 2280 bases which specifies a 760 residue polypeptide with a molecular weight of 85K daltons. The deduced amino acid sequence of the receptor shows that it does not contain an N-terminal hydrophobic signal peptide. We have found a single region of sufficient length and hydrophobicity to span the membrane, located 61 amino acids from the N-terminus. This leads to the prediction that the receptor is oriented in the membrane with a cytoplasmic N-terminus and an extracellular C-terminus. The receptor has no significant homology with transferrin, or with any receptor for which a sequence is available.
Collapse
|
19
|
|
20
|
Bishop MJ. Software club: Software for molecular biology. III. General analysis programs classified according to the computer on which they run. Bioessays 1984. [DOI: 10.1002/bies.950010309] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
21
|
Clark CG, Tague BW, Ware VC, Gerbi SA. Xenopus laevis 28S ribosomal RNA: a secondary structure model and its evolutionary and functional implications. Nucleic Acids Res 1984; 12:6197-220. [PMID: 6147812 PMCID: PMC320067 DOI: 10.1093/nar/12.15.6197] [Citation(s) in RCA: 146] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Based upon the three experimentally derived models of E. coli 23S rRNA (1-3) and the partial model for yeast 26S rRNA (4), which was deduced by homology to E. coli, we derived a secondary structure model for Xenopus laevis 28S rRNA. This is the first complete model presented for eukaryotic 28S rRNA. Compensatory base changes support the general validity of our model and offer help to resolve which of the three E. coli models is correct in regions where they are different from one another. Eukaryotic rDNA is longer than prokaryotic rDNA by virtue of introns, expansion segments and transcribed spacers, all of which are discussed relative to our secondary structure model. Comments are made on the evolutionary origins of these three categories and the processing fates of their transcripts. Functionally important sites on our 28S rRNA secondary structure model are suggested by analogy for ribosomal protein binding, the GTPase center, the peptidyl transferase center, and for rRNA interaction with tRNA and 5S RNA. We discuss how RNA-RNA interactions may play a vital role in translocation.
Collapse
|
22
|
Tague BW, Gerbi SA. Processing of the large rRNA precursor: two proposed categories of RNA-RNA interactions in eukaryotes. J Mol Evol 1984; 20:362-7. [PMID: 6210374 DOI: 10.1007/bf02104742] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
The 5.8S RNA gene of eukaryotes is separated from the 26-28S rRNA gene by the internal transcribed spacer 2 (ITS 2). A compilation of known ITS 2 sequences is presented here. Four characteristic features of the ITS 2 primary structure are shared by all vertebrates. In contrast, lower eukaryotes lack most of these features, suggesting that the excision of the ITS 2 transcript during processing may differ between vertebrates and lower eukaryotes. Since the transcripts of rRNA ITS 2 and mRNA introns share some similarity, analogies have been made between the mechanisms of their removal during RNA maturation. A model is proposed for hydrogen-bonding of U3 snRNA with the 5' end of the vertebrate ITS 2 transcript. This U3 snRNA-ITS 2 RNA interaction does not appear to be used in ITS 2 processing in lower eukaryotes. Instead, in lower eukaryotes a region within the ITS 2 itself has the potential to hydrogen-bond to the 5' end of the ITS 2 transcript.
Collapse
|
23
|
Lycett GW, Croy RR, Shirsat AH, Boulter D. The complete nucleotide sequence of a legumin gene from pea (Pisum sativum L.). Nucleic Acids Res 1984; 12:4493-506. [PMID: 6330672 PMCID: PMC318853 DOI: 10.1093/nar/12.11.4493] [Citation(s) in RCA: 102] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
One of several genes coding for the major pea storage protein, legumin, has been completely sequenced. The sequence covers the whole of the transcribed region, plus 5' and 3' untranscribed sequences. The predicted protein sequence starts with a signal peptide and is followed by the legumin alpha polypeptide sequence of 36. 44kd and the beta polypeptide sequence of 20. 19kd . Compared to other legume storage proteins, the alpha and beta polypeptide sequences encoded by this legumin gene, which contain 3 met and 5 cys residues, are relatively rich in the sulphur amino acids. The coding sequence is interrupted by three introns which show boundary sequences typical of higher plant genes. The 5' end of the gene sequence contains a 'TATA box', a ' CAAT box' and a sequence showing some homology to an ' AGGA box'. An extra sequence, identical to the normal polyadenylation signal of the legumin message is seen in the 3' untranscribed region. The structure of the gene and the possible significance of secondary structures in the nascent RNA transcript in affecting the choice of polyadenylation site is discussed.
Collapse
|
24
|
Rafalski JA, Scheets K, Metzler M, Peterson DM, Hedgcoth C, Söll DG. Developmentally regulated plant genes: the nucleotide sequence of a wheat gliadin genomic clone. EMBO J 1984; 3:1409-15. [PMID: 6204862 PMCID: PMC557531 DOI: 10.1002/j.1460-2075.1984.tb01985.x] [Citation(s) in RCA: 66] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Gliadins, the major wheat seed storage proteins, are encoded by a multigene family. Northern blot analysis shows that gliadin genes are transcribed in endosperm tissue into two classes of poly(A)+ mRNA, 1400 bases (class I) and 1600 bases (class II) in length. Using poly(A)+ RNA from developing wheat endosperm we constructed a cDNA library from which a number of clones coding for alpha/beta and gamma gliadins were identified by hybrid-selected mRNA translation and DNA sequencing. These cDNA clones were used as probes for the isolation of genomic gliadin clones from a wheat genomic library. One such genomic clone was characterized in detail and its DNA sequence determined. It contains a gene for a 33-kd alpha/beta gliadin protein (a 20 amino acid signal peptide and a 266 amino acid mature protein) which is very rich in glutamine (33.8%) and proline (15.4%). The gene sequence does not contain introns. A typical eukaryotic promoter sequence is present at -104 (relative to the translation initiation codon) and there are two normal polyadenylation signals 77 and 134 bases downstream from the translation termination codon. The coding sequence contains some internal sequence repetition, and is highly homologous to several alpha/beta gliadin cDNA clones. Homology to a gamma-gliadin cDNA clone is low, and there is no homology with known glutenin or zein cDNA sequences.
Collapse
|
25
|
Brendel V, Trifonov EN. A computer algorithm for testing potential prokaryotic terminators. Nucleic Acids Res 1984; 12:4411-27. [PMID: 6374619 PMCID: PMC318841 DOI: 10.1093/nar/12.10.4411] [Citation(s) in RCA: 259] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
The nucleotide sequences of 30 factor-independent terminators of transcription with RNA polymerase from E. coli have been compiled and analyzed. The standard features - a stretch of thymine residues and a preceding dyad symmetry - are shared by most sequences, but there are striking exceptions which indicate that these features alone are not sufficient to describe these sites. In two thirds of the sequences the 3'-half of the dyad symmetry contains the pentanucleotide CGGG (G/C) or a close derivative; about one third have TCTG or a close derivative just downstream of the termination point. The TCTG -box might be implied in termination of stringently controlled operons of E. coli. An algorithm to locate terminators in templates of known nucleotide sequence has been constructed on the basis of correlation to the distribution of dinucleotides along the aligned signal sequences. The algorithm has been tested on natural sequences of a total length of about 11,500 N. It finds all known independent terminators and only a few other sites, including some of the rho-dependent and putative terminators.
Collapse
|
26
|
Mount DW, Conrad B. Microcomputer programs for back translation of protein to DNA sequences and analysis of ambiguous DNA sequences. Nucleic Acids Res 1984; 12:819-23. [PMID: 6546440 PMCID: PMC321096 DOI: 10.1093/nar/12.1part2.819] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Three computer programs are described which may be used to translate a DNA sequence into a protein sequence, back translate the protein sequence into an ambiguous DNA sequence, and then do pattern searching in the ambiguous sequence. The programs are written in the C programming language, have been compiled to run on a microcomputer under the CP/M 80 operating system, and may be copied in binary format through a modem. They are also to become available for the IBM/PC.
Collapse
|
27
|
Lycett GW, Delauney AJ, Zhao W, Gatehouse JA, Croy RR, Boulter D. Two cDNA clones coding for the legumin protein of Pisum sativum L. contain sequence repeats. PLANT MOLECULAR BIOLOGY 1984; 3:91-96. [PMID: 24310304 DOI: 10.1007/bf00040033] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/1983] [Revised: 01/02/1984] [Accepted: 01/06/1984] [Indexed: 06/02/2023]
Abstract
The sequence of two cDNA clones coding for the whole of the β-subunit and most of the α-subunit of legumin are presented together with a considerable amount of protein sequence data to confirm the predicted amino acid sequence. A unique feature shown by these cDNAs is the presence of three 56 base pair tandem repeats in the region encoding the C terminal of the α polypeptide. The tandem repeats are also exhibited in the predicted polypeptide sequence as three 18 amino acid repeats which contain extremely high proportions of polar, mainly acidic, residues. The new sequences are compared to the previously published sequence of some shorter legumin cDNAs (Nature 295: 76-79). In the region where the sequences overlap, the previous cDNAs differ from the new ones by only a few base substitutions but most of the repeated region is not present though the sequences on either side are. The possibility that the absence of the repeats may reflect the difference between two types of legumin gene, rather than an artefact of the cloning of the cDNAs, is discussed.
Collapse
Affiliation(s)
- G W Lycett
- Department of Botany, University of Durham, DH1 3LE, Durham, England
| | | | | | | | | | | |
Collapse
|
28
|
The extent of a eukaryotic tRNA gene. 5‘- and 3‘-flanking sequence dependence for transcription and stable complex formation. J Biol Chem 1984. [DOI: 10.1016/s0021-9258(17)43429-6] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|
29
|
Ware VC, Tague BW, Clark CG, Gourse RL, Brand RC, Gerbi SA. Sequence analysis of 28S ribosomal DNA from the amphibian Xenopus laevis. Nucleic Acids Res 1983; 11:7795-817. [PMID: 6359063 PMCID: PMC326536 DOI: 10.1093/nar/11.22.7795] [Citation(s) in RCA: 143] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
We have determined the complete nucleotide sequence of Xenopus laevis 28S rDNA (4110 bp). In order to locate evolutionarily conserved regions within rDNA, we compared the Xenopus 28S sequence to homologous rDNA sequences from yeast, Physarum, and E. coli. Numerous regions of sequence homology are dispersed throughout the entire length of rDNA from all four organisms. These conserved regions have a higher A + T base composition than the remainder of the rDNA. The Xenopus 28S rDNA has nine major areas of sequence inserted when compared to E. coli 23S rDNA. The total base composition of these inserts in Xenopus is 83% G + C, and is generally responsible for the high (66%) G + C content of Xenopus 28S rDNA as a whole. Although the length of the inserted sequences varies, the inserts are found in the same relative positions in yeast 26S, Physarum 26S, and Xenopus 28S rDNAs. In one insert there are 25 bases completely conserved between the various eukaryotes, suggesting that this area is important for eukaryotic ribosomes. The other inserts differ in sequence between species and may or may not play a functional role.
Collapse
|
30
|
Makaroff CA, Zalkin H, Switzer RL, Vollmer SJ. Cloning of the Bacillus subtilis glutamine phosphoribosylpyrophosphate amidotransferase gene in Escherichia coli. Nucleotide sequence determination and properties of the plasmid-encoded enzyme. J Biol Chem 1983. [DOI: 10.1016/s0021-9258(17)44497-8] [Citation(s) in RCA: 73] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
31
|
Lycett GW, Delauney AJ, Gatehouse JA, Gilroy J, Croy RR, Boulter D. The vicilin gene family of pea (Pisum sativum L.): a complete cDNA coding sequence for preprovicilin. Nucleic Acids Res 1983; 11:2367-80. [PMID: 6687941 PMCID: PMC325890 DOI: 10.1093/nar/11.8.2367] [Citation(s) in RCA: 76] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
A cDNA plasmid bank has been constructed using mRNA from developing pea seeds and three cDNAs coding for vicilin polypeptides have been selected. These cDNAs have been sequenced and between them cover the whole of the coding sequence plus part of the 5' and 3' untranslated regions. Comparison with amino acid sequence data from the protein indicates that vicilin is synthesised as preprovicilin with subsequent removal of a signal peptide and a C-terminal peptide as well as post translational endo-proteolytic cleavage. The cDNAs represent two different classes of vicilin genes whilst amino acid data show that there are at least three major classes of vicilin polypeptide. The vicilin sequences show extensive homology with conglycinin and phaseolin except in the regions of the internal proteolytic cleavages. The evolutionary significance of this relationship is discussed.
Collapse
|
32
|
Wolin SL, Steitz JA. Genes for two small cytoplasmic Ro RNAs are adjacent and appear to be single-copy in the human genome. Cell 1983; 32:735-44. [PMID: 6187471 DOI: 10.1016/0092-8674(83)90059-4] [Citation(s) in RCA: 136] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Anti-Ro autoantibodies precipitate several small cytoplasmic ribonucleoproteins from mammalian cells. The RNA components of these particles, designated hY1-hY5 in human cells and mY1 and mY2 in mouse cells, are about 100 nucleotides long. We have analyzed a genomic clone that appears to contain true RNA-coding regions for two of the human Ro RNAs, hY1 and hY3. These RNAs exhibit many sequence and secondary structure homologies, both with each other and with the recently sequenced hY5 RNA. The hY2 RNA is a slightly truncated form of hY1; several shorter versions of hY3 are also detected in cell extracts and immunoprecipitates. The human hY1 and hY3 genes cross-hybridize with the mouse Ro RNAs, mY1 and mY2, respectively; we show that the mouse Ro RNAs are exclusively contained in Ro particles. The genes for hY1 and hY3 are transcribed in vitro by RNA polymerase III. In contrast with all other mammalian class III genes described, they appear to be present as single copies in the human genome.
Collapse
|
33
|
|
34
|
Abstract
Striking similarities have been pointed out between the structures of the human language and the genetic code. The primary genetic code utilizes the principle of linear representation much like e.g. the Indo-European languages do. There are numerous indications that more complex secondary and tertiary structural elements in DNA direct highly specific interactions with proteins. Thus, more complex genetic codes might exist which might be superimposed on DNA sequences coding for polypeptides or might be extended to "non-coding" DNA sequences. Structural features of highly complex languages, like Chinese or Egyptian hieroglyphics using conceptual expression patterns have been compared to the more complex ways of encoding. It is proposed that the application of linguistic principles may be helpful in the computer analyses of known DNA sequences. There is considerable evidence for the innate specification at least for the basic structural elements of human languages. This innate specification may be the cause for language university. Based on the striking structural similarities between language and genetic code, the question is raised to what extent and in what way DNA sequences might be related to the innate specification of human languages.
Collapse
|
35
|
Yamao F, Inokuchi H, Cheung A, Ozeki H, Söll D. Escherichia coli glutaminyl-tRNA synthetase. I. Isolation and DNA sequence of the glnS gene. J Biol Chem 1982. [DOI: 10.1016/s0021-9258(18)33810-9] [Citation(s) in RCA: 60] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
36
|
Hoben P, Royal N, Cheung A, Yamao F, Biemann K, Söll D. Escherichia coli glutaminyl-tRNA synthetase. II. Characterization of the glnS gene product. J Biol Chem 1982. [DOI: 10.1016/s0021-9258(18)33811-0] [Citation(s) in RCA: 65] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
|
37
|
Schaak J, Mao J, Söll D. The 5.8S RNA gene sequence and the ribosomal repeat of Schizosaccharomyces pombe. Nucleic Acids Res 1982; 10:2851-64. [PMID: 6285312 PMCID: PMC320660 DOI: 10.1093/nar/10.9.2851] [Citation(s) in RCA: 56] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
We have characterized the rRNA gene repeat in Schizosaccharomyces pombe. This repeat, which does not contain the 5S RNA gene, is found in a 10.4 kb HindIII DNA fragment. We have determined the nucleotide sequences of the S. pombe 5.8S RNA gene and intergenic spacers from two different 10.4 kb DNA fragments. Analysis of isolated total cellular 5.8S RNA revealed the presence of eight species of 5.8S RNA, differing in the number of nucleotides at the 5'-end. The eight 4.8S RNA species vary in length from 158 to 165 nucleotides. Apart from the heterogeneity observed at the 5'-end, the sequence of the eight 5.8S RNA species appears to be identical and is the same sequence as coded for by the 5.8S genes. The gene sequence shows great homology to the 5.8S RNA genes or S. cerevisiae and N. crassa. Most of the base differences are confined to the highly variable stem though to be involved in co-axial helix stacking with the 25S RNA, where base pairing is nearly identical despite the sequence differences. Secondary structure models are examined in light of 5.8S RNA oligonucleotide conservation across species from yeasts to higher eukaryotes.
Collapse
|
38
|
|
39
|
Mao J, Appel B, Schaack J, Sharp S, Yamada H, Söll D. The 5S RNA genes of Schizosaccharomyces pombe. Nucleic Acids Res 1982; 10:487-500. [PMID: 6278416 PMCID: PMC326152 DOI: 10.1093/nar/10.2.487] [Citation(s) in RCA: 76] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
The genomic arrangement and sequences of S. pombe 5S RNA genes are reported here. The 5S gene sequences appear to be dispersed within the genome, and are found independently of other rRNA genes. The sequences of two 5S genes examined show identical coding regions of 119 base pairs but have widely varying flanking sequences. A tRNAAsp gene is found in the 3' flanking region of one of the 5S genes. The tRNAAsp gene is faithfully transcribed in an X. laevis in vitro system, while the 5S genes are not transcribed in this system. The phylogenetic position of S. pombe is examined through comparison of 5S RNA sequences.
Collapse
|
40
|
Abstract
We present interactive computer programs for the analysis of nucleic acid sequences. In order to handle these programs, minimum computer experience is sufficient. The nucleotide sequence of the human gamma globin gene complex is used as an example to illustrate the data analysis.
Collapse
|
41
|
Abstract
Extensive modifications have been incorporated into many of the computer programs written by Staden (1-4) to make them easier to cope with DNA and amino acid sequence data. These programs can be easily used by persons with minimal knowledge of computers.
Collapse
|
42
|
Brutlag DL, Clayton J, Friedland P, Kedes LH. SEQ: a nucleotide sequence analysis and recombination system. Nucleic Acids Res 1982; 10:279-94. [PMID: 7063402 PMCID: PMC326133 DOI: 10.1093/nar/10.1.279] [Citation(s) in RCA: 167] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
SEQ is an interactive, self-documenting computer program that contains procedures for the analysis of nucleotide sequences and the manipulation of such sequences to allow the simulation and prediction of the results of recombinant DNA experiments.
Collapse
|
43
|
Queen C, Wegman MN, Korn LJ. Improvements to a program for DNA analysis: a procedure to find homologies among many sequences. Nucleic Acids Res 1982; 10:449-56. [PMID: 6174938 PMCID: PMC326145 DOI: 10.1093/nar/10.1.449] [Citation(s) in RCA: 34] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
We have devised an algorithm for finding partial homologies among a set of nucleotide sequences. The algorithm and other improvements have been incorporated into a commonly used computer program for the analysis of sequence data.
Collapse
|
44
|
Patarca R, Dorta B, Ramirez JL. Creation of a data base for sequences of ribosomal nucleic acids and detection of conserved restriction endonucleases sites through computerized processing. Nucleic Acids Res 1982; 10:175-82. [PMID: 6278402 PMCID: PMC326124 DOI: 10.1093/nar/10.1.175] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
As part of a project pertaining the organization of ribosomal genes in Kinetoplastidae, we have created a data base for published sequences of ribosomal nucleic acids, with information in Spanish. As a first step in their processing, we have written a computer program which introduces the new feature of determining the length of the fragments produced after single or multiple digestion with any of the known restriction enzymes. With this information we have detected conserved SAU 3A sites: (i) at the 5' end of the 5.8S rRNA and at the 3' end of the small subunit rRNA, both included in similar larger sequences; (ii) in the 5.8S rRNA of vertebrates (a second one), which is not present in lower eukaryotes, showing a clear evolutive divergence; and, (iii) at the 5' terminal of the small subunit rRNA, included in a larger conserved sequence. The possible biological importance of these sequences is discussed.
Collapse
|
45
|
Dumas JP, Ninio J. Efficient algorithms for folding and comparing nucleic acid sequences. Nucleic Acids Res 1982; 10:197-206. [PMID: 6174935 PMCID: PMC326126 DOI: 10.1093/nar/10.1.197] [Citation(s) in RCA: 109] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Fast algorithms for analysing sequence data are presented. An algorithm for strict homologies finds all common subsequences of length greater than or equal to 6 in two given sequences. With it, nucleic acid pieces five thousand nucleotides long can be compared in five seconds on CDC 6600. Secondary structure algorithms generate the N most stable secondary structures of an RNA molecule, taking into account all loop contributions, and the formation of all possible base-pairs in stems, including odd pairs (G.G., C.U., etc.). They allow a typical 100-nucleotide sequence to be analysed in 10 seconds. The homology and secondary structure programs are respectively illustrated with a comparison of two phage genomes, and a discussion of Drosophila melanogaster 55 RNA folding.
Collapse
|
46
|
Sege RD, Saxberg BE. A statistical test for comparing several nucleotide sequences. Nucleic Acids Res 1982; 10:375-89. [PMID: 7063406 PMCID: PMC326140 DOI: 10.1093/nar/10.1.375] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
A statistical test useful for the simultaneous comparison of nucleotide sequences is presented. A likelihood ratio test is derived, examples demonstrated, and tables of critical values are provided.
Collapse
|
47
|
Abstract
An interactive system for computer analysis of nucleic acid and protein sequences has been developed for the Los Alamos DNA Sequence Database. It provides a convenient way to search or verify various sequence features, e.g., restriction enzyme sites, protein coding frames, and properties of coded proteins. Further, the comprehensive analysis package on a large-scale database can be used for comparative studies on sequence and structural homologies in order to find unnoted information stored in nucleic acid sequences.
Collapse
|
48
|
Patents and literature. Appl Biochem Biotechnol 1981; 6:339-51. [DOI: 10.1007/bf02798284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|