1
|
Milosavljević A, Savković S, Crkvenjakov R, Salbego D, Serrato H, Kreuzer H, Gemmell A, Batus S, Grujić D, Carnahan S, Paunesku T, Tepavcević J. DNA sequence recognition by hybridization to short oligomers: experimental verification of the method on the E. coli genome. Genomics 1996; 37:77-86. [PMID: 8921373 DOI: 10.1006/geno.1996.0523] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
A newly developed method for sequence recognition by hybridization to short oligomers is verified for the first time in genome-scale experiments. The experiments involved hybridization of 15,328 randomly selected 2-kb genomic clones of Escherichia coli with 997 short oligomer probes to detect complementary oligomers within the clones. Lists of oligomers detected within individual clones were compiled into a database. The database was then searched using known E. coli sequences as queries. The goal was to recognize the clones that are identical or similar to the query sequences. A total of 76 putative recognitions were tested in two separate but complementary recognition experiments. The results indicate high specificity of recognition. Current and prospective applications of this novel method are discussed.
Collapse
Affiliation(s)
- A Milosavljević
- Center for Mechanistic Biology and Biotechnology, Argonne National Laboratory, Illinois 60439-4833, USA.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Milosavljevic A, Zeremski M, Strezoska Z, Grujic D, Dyanov H, Batus S, Salbego D, Paunesku T, Soares MB, Crkvenjakov R. Discovering distinct genes represented in 29,570 clones from infant brain cDNA libraries by applying sequencing by hybridization methodology. Genome Res 1996; 6:132-41. [PMID: 8919692 DOI: 10.1101/gr.6.2.132] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
To discover all distinct human genes and to determine their patterns of expression across different cell types, developmental stages, and physiological conditions, a procedure is needed for fast, mutual comparison of hundreds of thousands (and perhaps millions) of clones from cDNA libraries, as well as their comparison against data bases of sequenced DNA. In a pilot study, 29,570 clones in duplicate from both original and normalized, directional, infant brain cDNA libraries were hybridized with 107-215 heptamer oligonucleotide probes to obtain oligonucleotide sequence signatures (OSSs). The OSSs were compared and clustered based on mutual similarity into 16,741 clusters, each corresponding to a distinct cDNA. A number of distinct cDNAs were successfully recognized by matching their 107-probe OSSs against GenBank entries, indicating the possibility of sequence recognition with only a few hundred randomly chosen oligomers.
Collapse
Affiliation(s)
- A Milosavljevic
- Center for Mechanistic Biology and Biotechnology, Argonne National Laboratory, Illinois 60439, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Abstract
DNA sequencing by hybridization (SBH) Format 1 technique is based on experiments in which thousands of short oligomers are consecutively hybridized with dense arrays of clones. In this paper we present the description of a method for obtaining hybridization signatures for individual clones that guarantees reproducibility despite a wide range of variations in experimental circumstances, a sensitive method for signature comparison at prespecified significance levels, and a clustering algorithm that correctly identifies clusters of significantly similar signatures. The methods and the algorithm have been verified experimentally on a control set of 422 signatures that originate from 9 distinct clones of known sequence. Experiments indicate that only 30 to 50 oligomer probes suffice for correct clustering. This information about the identity of clones can be used to guide both genomic and cDNA sequencing by SBH or by standard gel-based methods.
Collapse
Affiliation(s)
- A Milosavljević
- Center for Mechanistic Biology and Biotechnology, Argonne National Laboratory, Illinois 60439-4833, USA
| | | | | | | | | | | |
Collapse
|
4
|
Grujic D, Strezoska Z, Crkvenjakov R. High throughput PCR procedure for up to 6-kb lengths of DNA. Biotechniques 1994; 17:291-2, 294. [PMID: 7980930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Affiliation(s)
- D Grujic
- Argonne National Laboratory, IL 60439-4833
| | | | | |
Collapse
|
5
|
Paunesku T, Gemmell MA, Crkvenjakov R, Woloschak GE. A presumed B6 strain-specific p53 polymorphism is confined to a B6 cell line and is likely to represent a facilitating mutation. Mamm Genome 1994; 5:106-7. [PMID: 8180468 DOI: 10.1007/bf00292336] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- T Paunesku
- Center for Mechanistic Biology and Biotechnology, Argonne National Laboratory, Illinois 60439-4833
| | | | | | | |
Collapse
|
6
|
Drmanac R, Drmanac S, Labat I, Crkvenjakov R, Vicentic A, Gemmell A. Sequencing by hybridization: towards an automated sequencing of one million M13 clones arrayed on membranes. Electrophoresis 1992; 13:566-73. [PMID: 1451694 DOI: 10.1002/elps.11501301115] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
An immediately applicable variant of the sequencing by hybridization (SBH) method is under development with the capacity to determine up to 100 million base pairs per year. The proposed method comprises six steps: (i) arraying genomic or cDNA M13 clones in 864-well plates (wells of 2 mm); (ii) preparation of DNA samples for spotting by growth of the M13 clones or by polymerase chain reaction (PCR) of the inserts using standard 96-well plates, or plates having as many as 864 correspondingly smaller wells; (iii) robotic spotting of 13,824 samples on an 8 x 12 cm nylon membrane, or correspondingly more, on up to 6 times larger filters, by offset printing with a 96 or 864 0.4 mm pin device; (iv) hybridization of dotted samples with 200-2000 32P-labeled probes comprising 16-256 10-mers having a common 8-mer, 7-mer, or 6-mer in the middle (20 probes per day, each hybridized with 250,000 dots); (v) scoring hybridization signals of 5 million sample-probe pairs per day using storage phosphor plates; and (vi) computing clone order and partial-to-complete DNA sequences using various heuristic algorithms. Genome sequencing based on a combination of this method and gel sequencing techniques may be significantly more economical than gel methods alone.
Collapse
Affiliation(s)
- R Drmanac
- Biological and Medical Research Division, Argonne National Laboratory, IL 60439-4833
| | | | | | | | | | | |
Collapse
|
7
|
Strezoska Z, Paunesku T, Radosavljević D, Labat I, Drmanac R, Crkvenjakov R. DNA sequencing by hybridization: 100 bases read by a non-gel-based method. Proc Natl Acad Sci U S A 1991; 88:10089-93. [PMID: 1946427 PMCID: PMC52873 DOI: 10.1073/pnas.88.22.10089] [Citation(s) in RCA: 62] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Determination of the sequences of human and other complex genomes requires much faster and less expensive sequencing processes than the methods in use today. Sequencing by hybridization is potentially such a process. In this paper we present hybridization data sufficient to accurately read a known sequence of 100 base pairs. In independent reactions, octamer and nonamer oligonucleotides derived from the sequence hybridized more strongly to this DNA than to controls. The 93 consecutive overlapping probes were derived from a 100-base-pair segment of test DNA and additional probes were generated by incorporation of a noncomplementary base at one of the ends of 12 of the basic probes. These 12 additional probes also had a full-match target in one of the control DNAs. The test and one of five control DNAs spotted on nylon filters were hybridized with 83 octamers and 22 nonamers under low-temperature conditions. A stronger signal in DNA containing a full-match target compared to DNA with only mismatched targets was obtained with all 105 probes. In 3 cases (2.9%), the difference of signals was not significant (less than 2-fold) due to inefficient hybridization and the consequently higher influence of background. The hybridization pattern obtained enabled us to resequence the 100 base pairs by applying an algorithm that tolerates an error rate much higher than was observed in the experiment. With this result, the technological components of large-scale DNA sequencing using the sequencing by hybridization method are in place.
Collapse
Affiliation(s)
- Z Strezoska
- Institute of Molecular Genetics and Genetic Engineering, Belgrade, Yugoslavia
| | | | | | | | | | | |
Collapse
|
8
|
Drmanac R, Labat I, Crkvenjakov R. An algorithm for the DNA sequence generation from k-tuple word contents of the minimal number of random fragments. J Biomol Struct Dyn 1991; 8:1085-102. [PMID: 1878166 DOI: 10.1080/07391102.1991.10507867] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
An algorithm is described for generation of the long sequence written in a four letter alphabet from the constituent k-tuple words in the minimal number of separate, randomly defined fragments of the starting sequence. It is primarily intended for use in sequencing by hybridization (SBH) process- a potential method for sequencing human genome DNA (Drmanac et al., Genomics 4, pp. 114-128, 1989). The algorithm is based on the formerly defined rules and informative entities of the linear sequence. The algorithm requires neither knowledge on the number of appearances of a given k-tuple in sequence fragments, nor the information on which k-tuple words are on the ends of a fragment. It operates with the mixed content of k-tuples of the various lengths. The concept of the algorithm enables operations with the k-tuple sets containing false positive and false negative k-tuples. The content of the false k-tuples primarily affects the completeness of the generated sequence, and its correctness in the specific cases only. The algorithm can be used for the optimization of SBH parameters in the simulation experiments, as well as for the sequence generation in the real SBH experiments on the genomic DNA.
Collapse
Affiliation(s)
- R Drmanac
- Institute for Molecular Genetics and Genetic Engineering, Belgrade, Yugoslavia
| | | | | |
Collapse
|
9
|
Abstract
We have reported in rat three adult beta-gene haplotypes containing either five or three genes. Detailed sequence analysis reveals that the leftmost gene is the major gene and that at the opposite end downstream lies the minor gene. All of the genes lying between them are minor-major hybrids indicating their origin by unequal crossing-over. In two haplotypes beta-globin genes were found with an L1(1) element inserted directly into IVS2. The described results allow the formulation of a pathway of mutational events leading from the ancient two-beta-gene rodent ancestor through a three-gene haplotype to five-gene haplotypes, one of which is postulated to have arisen in common laboratory strains since their capture in the wild.
Collapse
Affiliation(s)
- T Paunesku
- Institute for Molecular Genetics and Genetic Engineering, Belgrade, Yugoslavia
| | | | | | | | | |
Collapse
|
10
|
Abstract
Although there are many new applications for hybridizing short, synthetic oligonucleotide probes to DNA, such applications have not included determining unknown sequences of DNA. The lack of clear discrimination in hybridization of oligo probes shorter than 11 nucleotides and the lack of a theoretical understanding of factors influencing hybridization of short oligos have hampered the development of their use. We have found conditions for reliable hybridization of oligonucleotides as short as seven nucleotides to cloned DNA or to oligonucleotides attached to filters. Low-temperature hybridization and washing conditions, in contrast to the high stringency conditions currently used in hybridization experiments, have the potential for allowing the simple use of all oligos of six nucleotides or longer in meaningful hybridizations. We also present the hybridization discrimination theory that provides the conceptual framework for understanding these results.
Collapse
Affiliation(s)
- R Drmanac
- Institute for Molecular Genetics and Genetic Engineering, Belgrade, Yugoslavia
| | | | | | | | | |
Collapse
|
11
|
Abstract
The genomic organization of three haplotypes of beta-globin genes was determined to resolve the question of the number of those genes in rat. Haplotype a, found in inbred strain DA, has three genes or pseudogenes, while haplotypes b, found in AO, Y5 and Wistar strains, and c, found in Wistar strain, have five genes or pseudogenes each. In haplotypes b and c, the first gene is of beta major type and the remaining four are of beta minor type. Partial sequencing of six out of 13 genes shows that duplications of beta minor genes are causing polymorphism in a number of genes. Also, in haplotype b two beta minor genes have a 6.5-kb intron 2, while in haplotype c only one beta minor gene contains such a large intron 2. The three structurally different haplotypes described are not interconvertible by single recombination events. The results indicate that the rat has the highest number of adult beta-globin genes found in mammals so far.
Collapse
Affiliation(s)
- M Stevanović
- Genetic Engineering Center, Belgrade, Yugoslavia
| | | | | | | | | |
Collapse
|
12
|
|
13
|
|
14
|
Abstract
A mismatch-free hybridization of oligonucleotides containing from 11 to 20 monomers to unknown DNA represents, in essence, a sequencing of a complementary target. Realizing this, we have used probability calculations and, in part, computer simulations to estimate the types and numbers of oligonucleotides that would have to be synthesized in order to sequence a megabase plus segment of DNA. We estimate that 95,000 specific mixes of 11-mers, mainly of the 5'(A,T,C,G)(A,T,C,G)N8(A,T,C,G)3' type, hybridized consecutively to dot blots of cloned genomic DNA fragments would provide primary data for the sequence assembly. An optimal mixture of representative libraries in M13 vector, having inserts of (i) 7 kb, (ii) 0.5 kb genomic fragments randomly ligated in up to 10-kb inserts, and (iii) tandem "jumping" fragments 100 kb apart in the genome, will be needed. To sequence each million base pairs of DNA, one would need hybridization data from about 2100 separate hybridization sample dots. Inevitable gaps and uncertainties in alignment of sequenced fragments arising from nonrandom or repetitive sequence organization of complex genomes and difficulties in cloning "poisonous" sequences in Escherichia coli, inherent to large sequencing by any method, have been considered and minimized by choice of libraries and number of subclones used for hybridization. Because it is based on simpler biochemical procedures, our method is inherently easier to automate than existing sequencing methods. The sequence can be derived from simple primary data only by extensive computing. Phased experimental tests and computer simulations increasing in complexity are needed before accurate estimates can be made in terms of cost and speed of sequencing by the new approach. Nevertheless, sequencing by hybridization should show advantages over existing methods because of the inherent redundancy and parallelism in its data gathering.
Collapse
Affiliation(s)
- R Drmanac
- Genetic Engineering Center, Belgrade, Yugoslavia
| | | | | | | |
Collapse
|
15
|
Nizetić D, Stevanović M, Soldatović B, Savić I, Crkvenjakov R. Limited polymorphism of both classes of MHC genes in four different species of the Balkan mole rat. Immunogenetics 1988; 28:91-8. [PMID: 2899548 DOI: 10.1007/bf00346156] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
We analyzed the restriction fragment length polymorphism of class I and class II MHC genes in DNA from 20 individuals belonging to the four different species of the complex of species of Balkan mole rats Spalax leucodon captured at four different localities in Yugoslavia. All populations were tested with four restriction enzymes and one conserved mouse probe for each of the two classes of MHC genes. The probes employed detect either limited polymorphism of class I genes or lack of polymorphic bands containing class II genes. Of the two other subterranean rodents that have been studied, four karyotype forms of the Israeli mole rat show polymorphism in both classes of MHC genes similar to the one found in all other mammals (Nizetić et al. 1985), and the Syrian hamster shows limited polymorphism of class I genes and high polymorphism of class II genes (McGuire et al. 1985). Balkan mole rats belong to a new group in this respect, different from all mammals studied so far, since they apparently show limited polymorphism of both classes of MHC genes.
Collapse
Affiliation(s)
- D Nizetić
- Genetic Engineering Center, Belgrade, Yugoslavia
| | | | | | | | | |
Collapse
|
16
|
Drmanac R, Petrović N, Glisin V, Crkvenjakov R. A calculation of fragment lengths obtainable from human DNA with 78 restriction enzymes: an aid for cloning and mapping. Nucleic Acids Res 1986; 14:4691-2. [PMID: 3012475 PMCID: PMC311475 DOI: 10.1093/nar/14.11.4691] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
|
17
|
Abstract
The rat globin gene system is suitable for studying a coordinated regulation of seven genes from two gene families. A rat reticulocyte cDNA globin library has been constructed and two clones analyzed in detail. pBRrg 5 contains alpha while pBRrg X contains beta type sequence. These cloned cDNAs will be useful probes of structure and function of rat globin genes. The deduced amino acid sequences extend the information about the variability of rat globin chains.
Collapse
|
18
|
Crkvenjakov R, Maksimović V, Glisin V. A pool of nonpolysomal globin mRNAs in globin deficient reticulocytes of the anemic Belgrade rat. Biochem Biophys Res Commun 1982; 105:1524-31. [PMID: 6896643 DOI: 10.1016/0006-291x(82)90961-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
19
|
Rave N, Crkvenjakov R, Boedtker H. Identification of procollagen mRNAs transferred to diazobenzyloxymethyl paper from formaldehyde agarose gels. Nucleic Acids Res 1979; 6:3559-67. [PMID: 573889 PMCID: PMC327956 DOI: 10.1093/nar/6.11.3559] [Citation(s) in RCA: 497] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Poly A containing RNA isolated from embryonic chick calvaria was transferred from 6% formaldehyde 0.75% agarose gels to diazobenzyloxymethyl paper and the paper then hybridized to either nick translated pro alpha 1 collagen cDNA clones, pCg1 or pCg54, or to the nick translated pro alpha 2 collagen cDNA clone, pCg45. From the mobilities of the bands hybridizing most strongly to each, pro alpha 2 collagen mRNA was shown to be slightly larger than pro alpha 1 mRNA; they are 5100 and 4900 nucleotides long respectively. pCg54 also hybridized weakly to two bands of lower mobility, corresponding to RNAs 6.4 and 5.6 kb long. Neither pCg54 nor pCg45 hybridized to type II procollagen mRNA in poly A containing RNA isolated from embryonic chick sterna.
Collapse
|
20
|
Lehrach H, Frischauf AM, Hanahan D, Wozney J, Fuller F, Crkvenjakov R, Boedtker H, Doty P. Construction and characterization of a 2.5-kilobase procollagen clone. Proc Natl Acad Sci U S A 1978; 75:5417-21. [PMID: 364479 PMCID: PMC392975 DOI: 10.1073/pnas.75.11.5417] [Citation(s) in RCA: 93] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Recombinant bacterial plasmids have been constructed by inserting double-stranded chicken procollagen cDNA sequences linked to chemically synthesized decanucleotides containing HindIII sites into the HindIII site of pBR322. After transformation of Escherichia coli chi1776, colonies were selected by ampicillin resistance and recombinants containing procollagen sequences were identified by colony hybridization to 32P-labeled procollagen cDNA. The inserts from three recombinant plasmids, pCg10, pCg13, and pCg45, were 1200, 2200, and 2550 base pairs long respectively. Their sequence homology has been established by restriction mapping and crosshybridization of nick-translated plasmids to Southern blots of Hpa II fragments of the inserts, pCg45 has been positively identified as containing the pro alpha2 collagen sequence by partial determination of the DNA sequence of its ends: it has a short thymine-rich sequence at one end and a sequence coding for residues 478--499 in the chicken alpha2 chain at the other end.
Collapse
|
21
|
Crkvenjakov R, Cusić S, Ivanović I, Glisin V. Rat b/b anemia: translation of normal and anemic globin mRNA in wheat-germ cell-free system. Eur J Biochem 1976; 71:85-91. [PMID: 1009956 DOI: 10.1111/j.1432-1033.1976.tb11092.x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Globin mRNAs were isolated from circulating reticulocytes both from rats carrying a homozygous, recessive mutation causing a severe thalassemia-like syndrome and from normal rats. After first identifying the rat globin chains as alpha or beta chains, the translational products primed by both polysomal and nonpolysomal mRNAs in wheat germ 30000 x g supernatant were analyzed: the ratio of alpha to beta globin mRNAs found in polysomes isolated from mutant rats is identical to the ratio of their products synthesized in vivo while the ratio of these mRNAs is quite different in the nonpolysomal fraction, the latter being enriched in alpha globin mRNA. No difference is found in the ratio of alpha and beta globin mRNAs in the polysomal and nonpolysomal RNA isolated from normal rats, both being identical to the ratio of their products synthesized in vivo. One third of the total amount of mRNA found in mutant cells is not in polysomes as compared to only 6 percent for the mRNA from normal lysates. These results suggest that a translational control mechanism is involved although the decreased globin synthesis in b/b anemia can not be fully accounted for by its operation.
Collapse
|
22
|
|
23
|
Crkvenjakov R, Bajković N, Glisin V. The effect of 5-azacytidine on development, nucleic acid and protein metabolism in sea urchin embryos. Biochem Biophys Res Commun 1970; 39:655-60. [PMID: 5490214 DOI: 10.1016/0006-291x(70)90255-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|