1
|
George JT, Acree C, Park JU, Kong M, Wiegand T, Pignot YL, Kellogg EH, Greene EC, Sternberg SH. Mechanism of target site selection by type V-K CRISPR-associated transposases. Science 2023; 382:eadj8543. [PMID: 37972161 PMCID: PMC10771339 DOI: 10.1126/science.adj8543] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 09/23/2023] [Indexed: 11/19/2023]
Abstract
CRISPR-associated transposases (CASTs) repurpose nuclease-deficient CRISPR effectors to catalyze RNA-guided transposition of large genetic payloads. Type V-K CASTs offer potential technology advantages but lack accuracy, and the molecular basis for this drawback has remained elusive. Here, we reveal that type V-K CASTs maintain an RNA-independent, "untargeted" transposition pathway alongside RNA-dependent integration, driven by the local availability of TnsC filaments. Using cryo-electron microscopy, single-molecule experiments, and high-throughput sequencing, we found that a minimal, CRISPR-less transpososome preferentially directs untargeted integration at AT-rich sites, with additional local specificity imparted by TnsB. By exploiting this knowledge, we suppressed untargeted transposition and increased type V-K CAST specificity up to 98.1% in cells without compromising on-target integration efficiency. These findings will inform further engineering of CAST systems for accurate, kilobase-scale genome engineering applications.
Collapse
Affiliation(s)
- Jerrin Thomas George
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Christopher Acree
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Jung-Un Park
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Muwen Kong
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Tanner Wiegand
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Yanis Luca Pignot
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Elizabeth H. Kellogg
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Eric C. Greene
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Samuel H. Sternberg
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| |
Collapse
|
2
|
George JT, Acree C, Park JU, Kong M, Wiegand T, Pignot YL, Kellogg EH, Greene EC, Sternberg SH. Mechanism of target site selection by type V-K CRISPR-associated transposases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.14.548620. [PMID: 37503092 PMCID: PMC10370016 DOI: 10.1101/2023.07.14.548620] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Unlike canonical CRISPR-Cas systems that rely on RNA-guided nucleases for target cleavage, CRISPR-associated transposases (CASTs) repurpose nuclease-deficient CRISPR effectors to facilitate RNA-guided transposition of large genetic payloads. Type V-K CASTs offer several potential upsides for genome engineering, due to their compact size, easy programmability, and unidirectional integration. However, these systems are substantially less accurate than type I-F CASTs, and the molecular basis for this difference has remained elusive. Here we reveal that type V-K CASTs undergo two distinct mobilization pathways with remarkably different specificities: RNA-dependent and RNA-independent transposition. Whereas RNA-dependent transposition relies on Cas12k for accurate target selection, RNA-independent integration events are untargeted and primarily driven by the local availability of TnsC filaments. The cryo-EM structure of the untargeted complex reveals a TnsB-TnsC-TniQ transpososome that encompasses two turns of a TnsC filament and otherwise resembles major architectural aspects of the Cas12k-containing transpososome. Using single-molecule experiments and genome-wide meta-analyses, we found that AT-rich sites are preferred substrates for untargeted transposition and that the TnsB transposase also imparts local specificity, which collectively determine the precise insertion site. Knowledge of these motifs allowed us to direct untargeted transposition events to specific hotspot regions of a plasmid. Finally, by exploiting TnsB's preference for on-target integration and modulating the availability of TnsC, we suppressed RNA-independent transposition events and increased type V-K CAST specificity up to 98.1%, without compromising the efficiency of on-target integration. Collectively, our results reveal the importance of dissecting target site selection mechanisms and highlight new opportunities to leverage CAST systems for accurate, kilobase-scale genome engineering applications.
Collapse
Affiliation(s)
- Jerrin Thomas George
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Christopher Acree
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Present address: Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN 37212, USA
| | - Jung-Un Park
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Future address: Department of Structural Biology. St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Muwen Kong
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Tanner Wiegand
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Yanis Luca Pignot
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
- Present address: Department of Biochemistry, Ludwig-Maximilians-University Munich, Munich, Germany
| | - Elizabeth H. Kellogg
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
- Future address: Department of Structural Biology. St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Eric C. Greene
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | - Samuel H. Sternberg
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| |
Collapse
|
3
|
Zebrowska J, Jezewska-Frackowiak J, Wieczerzak E, Kasprzykowski F, Zylicz-Stachula A, Skowron PM. Novel parameter describing restriction endonucleases: Secondary-Cognate-Specificity and chemical stimulation of TsoI leading to substrate specificity change. Appl Microbiol Biotechnol 2019; 103:3439-3451. [PMID: 30879089 PMCID: PMC6449304 DOI: 10.1007/s00253-019-09731-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Revised: 02/22/2019] [Accepted: 02/27/2019] [Indexed: 11/30/2022]
Abstract
Over 470 prototype Type II restriction endonucleases (REases) are currently known. Most recognise specific DNA sequences 4–8 bp long, with very few exceptions cleaving DNA more frequently. TsoI is a thermostable Type IIC enzyme that recognises the DNA sequence TARCCA (R = A or G) and cleaves downstream at N11/N9. The enzyme exhibits extensive top-strand nicking of the supercoiled single-site DNA substrate. The second DNA strand of such substrate is specifically cleaved only in the presence of duplex oligonucleotides containing a cognate site. We have previously shown that some Type IIC/IIG/IIS enzymes from the Thermus-family exhibit ‘affinity star’ activity, which can be induced by the S-adenosyl-L-methionine (SAM) cofactor analogue—sinefungin (SIN). Here, we define a novel type of inherently built-in ‘star’ activity, exemplified by TsoI. The TsoI ‘star’ activity cannot be described under the definition of the classic ‘star’ activity as it is independent of the reaction conditions used and cannot be separated from the cognate specificity. Therefore, we define this phenomenon as Secondary-Cognate-Specificity (SCS). The TsoI SCS comprises several degenerated variants of the cognate site. Although the efficiency of TsoI SCS cleavage is lower in comparison to the cognate TsoI recognition sequence, it can be stimulated by S-adenosyl-L-cysteine (SAC). We present a new route for the chemical synthesis of SAC. The TsoI/SAC REase may serve as a novel tool for DNA manipulation.
Collapse
Affiliation(s)
- Joanna Zebrowska
- Department of Molecular Biotechnology, Faculty of Chemistry, University of Gdansk, 63 Wita Stwosza Street, 80-308, Gdansk, Poland
| | - Joanna Jezewska-Frackowiak
- Department of Molecular Biotechnology, Faculty of Chemistry, University of Gdansk, 63 Wita Stwosza Street, 80-308, Gdansk, Poland
| | - Ewa Wieczerzak
- Department of Biomedical Chemistry, Faculty of Chemistry, University of Gdansk, 63 Wita Stwosza Street, 80-308, Gdansk, Poland
| | - Franciszek Kasprzykowski
- Department of Biomedical Chemistry, Faculty of Chemistry, University of Gdansk, 63 Wita Stwosza Street, 80-308, Gdansk, Poland
| | - Agnieszka Zylicz-Stachula
- Department of Molecular Biotechnology, Faculty of Chemistry, University of Gdansk, 63 Wita Stwosza Street, 80-308, Gdansk, Poland.
| | - Piotr M Skowron
- Department of Molecular Biotechnology, Faculty of Chemistry, University of Gdansk, 63 Wita Stwosza Street, 80-308, Gdansk, Poland.
| |
Collapse
|
4
|
Abstract
My laboratory investigations have been driven by an abiding interest in understanding the consequences of genetic rearrangement in evolution and disease, and in using viruses to elucidate fundamental mechanisms in biology. Starting with bacteriophages and moving to the retroviruses, my use of the tools of genetics, molecular biology, biochemistry, and biophysics has spanned more than half a century-from the time when DNA structure was just discovered to the present day of big data and epigenetics. Both riding and contributing to the successive waves of technology, my laboratory has elucidated fundamental mechanisms in DNA replication, repair, and recombination. We have made substantial contributions in the area of retroviral oncogenesis, delineated mechanisms that control retroviral gene expression, and elucidated critical details of the structure and function of the retroviral enzymes-reverse transcriptase, protease, and integrase-and have had the satisfaction of knowing that the fundamental knowledge gained from these studies contributed important groundwork for the eventual development of antiviral drugs to treat AIDS. While pursuing laboratory research as a principal investigator, I have also been a science administrator-moving from laboratory head to department chair and, finally, to institute director. In addition, I have undertaken a number of community service, science-related "extracurricular" activities during this time. Filling all of these roles, while being a wife and mother, has required family love and support, creative management, and, above all, personal flexibility-with not too much long-term planning. I hope that this description of my journey, with various roles, obstacles, and successes, will be both interesting and informative, especially to young female scientists.
Collapse
Affiliation(s)
- Anna Marie Ann Skalka
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, Pennsylvania 19111;
| |
Collapse
|
5
|
Zuanetti DA, Milan LA. Second-order autoregressive Hidden Markov Model. BRAZ J PROBAB STAT 2017. [DOI: 10.1214/16-bjps328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
6
|
Forsdyke DR. Homostability. Evol Bioinform Online 2016. [DOI: 10.1007/978-3-319-28755-3_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
|
7
|
Abstract
Among species within a phylogenetic group, genomic GC% values can cover a wide range that is particularly evident at third codon positions. However, among genes within a genome, genic GC% values can also cover a wide range that is, again, particularly evident at third codon positions. Individual genes and genomes each have a "homostabilizing propensity" to adopt a relatively uniform GC%. Each gene (a "microisochore") occupies a discrete GC% niche of relatively uniform base composition amongst its fellow genes, which can collectively span a wide GC% range. Homostabilization serves to recombinationally isolate both genome sectors (facilitating gene duplication and differentiation) and genomes (facilitating genome duplication and differentiation; e.g., speciation). Although they may sometimes be in conflict, the individualities of genomes, and of genes within those genomes, are separately sustained by a common mechanism, uniformity of GC%. The protection against inadvertent recombination afforded by GC% differentiation is, in the general case, a prerequisite for phenotypic differentiation.
Collapse
Affiliation(s)
- D. R. FORSDYKE
- Department of Biochemistry, Queen's University, Kingston, Ontario K7L3N6, Canada
| |
Collapse
|
8
|
Marttinen P, Baldwin A, Hanage WP, Dowson C, Mahenthiralingam E, Corander J. Bayesian modeling of recombination events in bacterial populations. BMC Bioinformatics 2008; 9:421. [PMID: 18840286 PMCID: PMC2579306 DOI: 10.1186/1471-2105-9-421] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2008] [Accepted: 10/07/2008] [Indexed: 11/10/2022] Open
Abstract
Background We consider the discovery of recombinant segments jointly with their origins within multilocus DNA sequences from bacteria representing heterogeneous populations of fairly closely related species. The currently available methods for recombination detection capable of probabilistic characterization of uncertainty have a limited applicability in practice as the number of strains in a data set increases. Results We introduce a Bayesian spatial structural model representing the continuum of origins over sites within the observed sequences, including a probabilistic characterization of uncertainty related to the origin of any particular site. To enable a statistically accurate and practically feasible approach to the analysis of large-scale data sets representing a single genus, we have developed a novel software tool (BRAT, Bayesian Recombination Tracker) implementing the model and the corresponding learning algorithm, which is capable of identifying the posterior optimal structure and to estimate the marginal posterior probabilities of putative origins over the sites. Conclusion A multitude of challenging simulation scenarios and an analysis of real data from seven housekeeping genes of 120 strains of genus Burkholderia are used to illustrate the possibilities offered by our approach. The software is freely available for download at URL .
Collapse
Affiliation(s)
- Pekka Marttinen
- Department of Mathematics and statistics, University of Helsinki, FIN-00014, Finland.
| | | | | | | | | | | |
Collapse
|
9
|
Falkow S, Haapala DK, Silver RP. Relationships Between Extrachromosomal Elements. CIBA FOUNDATION SYMPOSIUM - BACTERIAL EPISOMES AND PLASMIDS 2008. [DOI: 10.1002/9780470715345.ch9] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
10
|
Chargaff’s GC rule. Evol Bioinform Online 2006. [DOI: 10.1007/978-0-387-33419-6_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
11
|
Lee SJ, Mortimer JR, Forsdyke DR. Genomic conflict settled in favour of the species rather than the gene at extreme GC percentage values. ACTA ACUST UNITED AC 2005; 3:219-28. [PMID: 15702952 DOI: 10.2165/00822942-200403040-00003] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Wada and colleagues have shown that, whether prokaryotic or eukaryotic, each gene has a "homostabilising propensity" to adopt a relatively uniform GC percentage (GC%). Accordingly, each gene can be viewed as a "microisochore" occupying a discrete GC% niche of relatively uniform base composition amongst its fellow genes. Although first, second and third codon positions usually differ in GC%, each position tends to maintain a uniform, gene-specific GC% value. Thus, within a genome, genic GC% values can cover a wide range. This is most evident at third codon positions, which are least constrained by amino acid encoding needs. In 1991, Wada and colleagues further noted that, within a phylogenetic group, genomic GC% values can also cover a wide range. This is again most evident at third codon positions. Thus, the dispersion of GC% values among genes within a genome matches the dispersion of GC% values among genomes within a phylogenetic group. Wada described the context-independence of plots of different codon position GC% values against total GC% as a "universal" characteristic. Several studies relate this to recombination. We have confirmed that third codon positions usually relate more to the genes that contain them than to the species. However, in genomes with extreme GC% values (low or high), third codon positions tend to maintain a constant GC%, thus relating more to the species than to the genes that contain them. Genes in an extreme-GC% genome collectively span a smaller GC% range, and mainly rely on first and second codon positions for differentiation as "microisochores". Our results are consistent with the view that differences in GC% serve to recombinationally isolate both genome sectors (facilitating gene duplication) and genomes (facilitating genome duplication, e.g. speciation). In intermediate-GC% genomes, conflict between the needs of the species and the needs of individual genes within that species is minimal. However, in extreme-GC% genomes there is a conflict, which is settled in favour of the species (i.e. group selection) rather than in favour of the gene (genic selection).
Collapse
Affiliation(s)
- Shang-Jung Lee
- Genetics Graduate Program, University of British Columbia, Vancouver, British Columbia, Canada
| | | | | |
Collapse
|
12
|
Na MH, Son YS, Cha JH. Recent Advances In Burn-in. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2005. [DOI: 10.5351/ckss.2005.12.1.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
13
|
Yang TY, Kim J. Binary Segmentation Procedure for Detecting Change Points in a DNA Sequence. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2005. [DOI: 10.5351/ckss.2005.12.1.139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
14
|
Müller HG, Wai N. Change Trees and Mutagrams for the Visualization of Local Changes in Sequence Data. J Comput Graph Stat 2004. [DOI: 10.1198/106186004x2660] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
15
|
Forsdyke DR. Chromosomal speciation: a reply. J Theor Biol 2004; 230:189-96. [PMID: 15302550 DOI: 10.1016/j.jtbi.2004.04.020] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2004] [Revised: 04/09/2004] [Accepted: 04/26/2004] [Indexed: 11/23/2022]
Abstract
The "genic" and the "non-genic" (chromosomal) hypotheses for the predominant mechanism by which species diverge into two have long been in contention. In 1998 Coyne and Orr attacked certain formulations of the chromosomal hypothesis on the grounds that they required macromutations (structural changes in chromosomes). In 1999 I replied that numerous independent micromutations (single DNA base changes) should suffice (GC% hypothesis). Kliman et al., with the support of Coyne and Charlesworth, have presented various counterarguments, to which the present paper responds with evidence that GC% differences are primary to genic differences and would operate by changing the structure of stem-loops extruded from duplex DNAs. Chromosomes attempting to align by means of complementary loop-loop interactions would fail if GC% differences exceeded a critical threshold. This would disrupt meiosis (hybrid sterility) and the parents of organisms with failed meiosis would be reproductively isolated from each other. If they could find new mates with which they were GC-compatible, then new species could emerge. The model leads to predictions consistent with several lines of evidence. The GC% version of the chromosomal hypothesis has a sound basis and deserves at least as much attention as its genic rival.
Collapse
Affiliation(s)
- Donald R Forsdyke
- Department of Biochemistry, Queen's University, Kingston, Ont., Canada K7L3N6.
| |
Collapse
|
16
|
Abstract
Many deoxyribonucleic acid (DNA) sequences display compositional heterogeneity in the form of segments of similar structure. This article describes a Bayesian method that identifies such segments by using a Markov chain governed by a hidden Markov model. Markov chain Monte Carlo (MCMC) techniques are employed to compute all posterior quantities of interest and, in particular, allow inferences to be made regarding the number of segment types and the order of Markov dependence in the DNA sequence. The method is applied to the segmentation of the bacteriophage lambda genome, a common benchmark sequence used for the comparison of statistical segmentation algorithms.
Collapse
Affiliation(s)
- Richard J Boys
- School of Mathematics and Statistics, Newcastle University, Newcastle upon Tyne, UK.
| | | |
Collapse
|
17
|
Kropinski AM. Sequence of the genome of the temperate, serotype-converting, Pseudomonas aeruginosa bacteriophage D3. J Bacteriol 2000; 182:6066-74. [PMID: 11029426 PMCID: PMC94740 DOI: 10.1128/jb.182.21.6066-6074.2000] [Citation(s) in RCA: 93] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Temperate bacteriophage D3, a member of the virus family Siphoviridae, is responsible for serotype conversion in its host, Pseudomonas aeruginosa. The complete sequence of the double-stranded DNA genome has been determined. The 56,426 bp contains 90 putative open reading frames (ORFs) and four genes specifying tRNAs. The latter are specific for methionine (AUG), glycine (GGA), asparagine (AAC), and threonine (ACA). The tRNAs may function in the translation of certain highly expressed proteins from this relatively AT-rich genome. D3 proteins which exhibited a high degree of sequence similarity to previously characterized phage proteins included the portal, major head, tail, and tail tape measure proteins, endolysin, integrase, helicase, and NinG. The layout of genes was reminiscent of lambdoid phages, with the exception of the placement of the endolysin gene, which parenthetically also lacked a cognate holin. The greatest sequence similarity was found in the morphogenesis genes to coliphages HK022 and HK97. Among the ORFs was discovered the gene encoding the fucosamine O-acetylase, which is in part responsible for the serotype conversion events.
Collapse
MESH Headings
- Acetylesterase/genetics
- Amino Acid Sequence
- Base Sequence
- Cloning, Molecular
- Genome, Viral
- Molecular Sequence Data
- Open Reading Frames
- Protein Biosynthesis
- Pseudomonas Phages/genetics
- Pseudomonas aeruginosa/virology
- RNA, Bacterial/genetics
- RNA, Transfer, Asp/genetics
- RNA, Transfer, Gly/genetics
- RNA, Transfer, Met/genetics
- RNA, Transfer, Thr/genetics
- Sequence Homology, Nucleic Acid
- Siphoviridae/genetics
- Viral Proteins/genetics
Collapse
Affiliation(s)
- A M Kropinski
- Department of Microbiology and Immunology, Faculty of Health Sciences, Queen's University, Kingston, Ontario K7L 3N6, Canada.
| |
Collapse
|
18
|
Affiliation(s)
- A Campbell
- Department of Biological Sciences, Stanford University, California 94305, USA.
| | | |
Collapse
|
19
|
Affiliation(s)
- F W Stahl
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon 97403-1229, USA
| |
Collapse
|
20
|
|
21
|
|
22
|
Abstract
The composition of naturally occurring DNA sequences is often strikingly heterogeneous. In this paper, the DNA sequence is viewed as a stochastic process with local compositional properties determined by the states of a hidden Markov chain. The model used is a discrete-state, discrete-outcome version of a general model for non-stationary time series proposed by Kitagawa (1987). A smoothing algorithm is described which can be used to reconstruct the hidden process and produce graphic displays of the compositional structure of a sequence. The problem of parameter estimation is approached using likelihood methods and an EM algorithm for approximating the maximum likelihood estimate is derived. The methods are applied to sequences from yeast mitochondrial DNA, human and mouse mitochondrial DNAs, a human X chromosomal fragment and the complete genome of bacteriophage lambda.
Collapse
|
23
|
Vardapetjan RR, Letnansky K, Vardevanjan PO, Panosjan GH. The evaluation of an average nucleotide composition from melting curves. Biophys Chem 1983; 18:11-3. [PMID: 17005119 DOI: 10.1016/0301-4622(83)80022-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/1982] [Revised: 02/07/1983] [Accepted: 03/09/1983] [Indexed: 11/23/2022]
Abstract
A new formula has been derived for the calculation of the average G + C content - X of DNAs from different origins using thermal melting data. As compared to existing formulas the new method gives highly accurate results, although being much easier to use than similar equations.
Collapse
Affiliation(s)
- R R Vardapetjan
- Institute for Cancer Research, University of Vienna, Vienna, Austria
| | | | | | | |
Collapse
|
24
|
Yamagishi H, Ikemura T. Two-dimensional display of lambda and Escherichia coli restriction fragments separated by Hg-Cs2SO4 centrifugation and gel electrophoresis. Gene 1981; 14:251-62. [PMID: 6271626 DOI: 10.1016/0378-1119(81)90158-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
EcoRI restriction fragments derived from the DNA of bacteriophage lambda and Escherichia coli were fractionated by density gradient centrifugation of their mercury complexes in Cs2SO4 and subsequent electrophoresis on a horizontal agarose-gel slab. In this two-dimensional display, lambda fragments were resolved into six components and E coli fragments into more than 108 components. Bacterial chromosome regions contiguous to lambda prophage integrated at different sites were amplified by induction, and the EcoRI fragments were subjected to the two-dimensional analysis. As expected, the sets of amplified fragments were clearly different among the various lysogens. The approximate genome region affected by induction was estimated as one-tenth of the whole chromosome.
Collapse
|
25
|
Gotoh O, Tagashira Y. Locations of frequently opening regions on natural DNAs and their relation to functional loci. Biopolymers 1981; 20:1043-58. [PMID: 7225530 DOI: 10.1002/bip.1981.360200514] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
26
|
|
27
|
Meyer J, Iida S, Arber W. Does the insertion element IS1 transpose preferentially into A+T-rich DNA segments? MOLECULAR & GENERAL GENETICS : MGG 1980; 178:471-473. [PMID: 6248730 DOI: 10.1007/bf00270502] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
IS1-mediated insertion and deletion formation occur preferentially into A+T-rich regions of DNA of bacteriophate P1 and of the r-determinant of the R plasmid NR1. The significance of this correlation is discussed in view of other published data.
Collapse
|
28
|
Wada A, Yabuki S, Husimi Y. Fine structure in the thermal denaturation of DNA: high temperature-resolution spectrophotometric studies. CRC CRITICAL REVIEWS IN BIOCHEMISTRY 1980; 9:87-144. [PMID: 6777116 DOI: 10.3109/10409238009105432] [Citation(s) in RCA: 94] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Fine structures which appear in the optical melting profile of DNA are examined from both the experimental and theoretical aspects. After a brief historical survey of the DNA melting experiments during the pre-fine-structure era in Section II, the high temperature-resolution experimental techniques which are essential to the investigation of fine structure are described in Section III. Then, the current status of the high-resolution study is reviewed first by a phenomenological description of the melting profile (Section IV) and then of the refolding profile (Section V), where a general idea about the cooperatively melting region and several factors affecting it is given. Sections VI and VII are devoted to the review of current theoretical works. Several well-established theoretical frameworks which correlate the base sequence with the melting phenomena are examined in terms of their rigorousness and usefulness. The molecular thermodynamic parameters concerning the DNA melting which have been evaluated by several research groups are compared and discussed. Finally, in Section VIII, current ideas on the correlation between the fine structure and genetic functions and genetic maps are reviewed. Some future problems relating to the fine structure are also discussed.
Collapse
|
29
|
|
30
|
Valentin-Hansen P, Hammer-Jespersen K, Buxton RS. Evidence for the existence of three promoters for the deo operon of Escherichia coli K12 in vitro. J Mol Biol 1979; 133:1-17. [PMID: 231107 DOI: 10.1016/0022-2836(79)90248-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
31
|
Gotoh O, Wada A, Yabuki S. Salt-concentration dependence of melting profiles of lambda phage DNAs: evidence for long-range interactions and pronounced end effects. Biopolymers 1979; 18:805-24. [PMID: 435606 DOI: 10.1002/bip.1979.360180406] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
32
|
Ott GS, Bastia D, Bauer W. A spectroscopic and electron microscopic examination of the highly condensed DNA structures formed by denaturation in Mg(ClO4)2. BIOCHIMICA ET BIOPHYSICA ACTA 1978; 518:216-32. [PMID: 207326 DOI: 10.1016/0005-2787(78)90179-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
1. Thermal denaturation in 1.5 M Mg(ClO4)2 of the DNA from bacteriophage lambda results in four well-separated subtransitions, as monitored by the accompanying increase in absorbance. The midpoint of the hyperchromic spectrum is significantly lowered compared to either 1.5 M MgCl2 or 3.0 M NaClO4. 2. The first two subtransitions are associated with the melting of the A . T-richest regions of the lambda DNA, as revealed by electron micrographs following fixation with formaldehyde. 3. Commencing with the third subtransition, an unusual DNA structure is observed in electron micrographs. In this structure the A . T-rich half of the molecule appears completely condensed, whereas the G . C-rich half remains native. 4. During the fourth subtransition DNA molecules condense completely and eventually aggregate to form extremely high molecular weight particles containing centers of electron density. Tendrils of DNA, primarily duplex, radiate outward from these centers. 5. The aggregation may be reversed by the removal of magnesium. The intramolecular condensation may be at least partly reversed by increasing the Mg(ClO4)2 concentrations to saturating levels.
Collapse
|
33
|
|
34
|
Akiyama C, Gotoh O, Wada A. Spectral analysis on the melting fine structure of is lambda DNA and T2 DNA. Biopolymers 1977; 16:427-35. [PMID: 831870 DOI: 10.1002/bip.1977.360160215] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
35
|
Reuben RC, Skalka A. Identification of the site of interruption in relaxed circles producing during bacteriophage lambda DNA circle replication. J Virol 1977; 21:673-82. [PMID: 833943 PMCID: PMC353870 DOI: 10.1128/jvi.21.2.673-682.1977] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The DNA that accumulates in the lambda infection restricted to the early (circular) stage of replication consists of approximately two-thirds covalently closed circles and one-third relaxed circles bearing a single interruption in either strand of the duplex. The latter molecules are presumed to be a unique class in that the interruption is not repairable by DNA polymerase and ligase. Preferential radioisotopic labeling of the region immediately adjacent to the interruption, followed by hybridization to sheared fragments of the lambda chromosome with varying guanine plus cytosine content, suggested that the nick resides at the position of the mature molecular ends of the lambda chromosome. Digestion of the labeled molecules with restriction enzymes and reconstruction experiments in which Hershey circles were generated by annealing of interrupted strands isolated from the relaxed circles support this interpretation. The results indicate that the relaxed circles consist of a population containing one interruption in either of the two strands of the duplex jointly representing the two "nicks" contained in Hershey circles (in which the cohesive ends are annealed). These molecules could result from the inability of the maturation function to make the required staggered endonucleolytic cuts when the DNA substrate is a monomeric circle rather than a multimeric linear molecule. Alternatively, this interruption could be the result of an endonucleolytic cutting event critical to DNA replication.
Collapse
|
36
|
|
37
|
Hewish DR, Smith KK, Spencer JH. Nucleotide clusters in deoxyribonucleic acids. XIV. Pyrimidine oligonucleotides of the left and right halves and lambdadv region of bacteriophage lambda DNA. Virology 1976; 74:363-76. [PMID: 982832 DOI: 10.1016/0042-6822(76)90342-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
38
|
Lyubchenko YL, Frank-Kamenetskii MD, Vologodskii AV, Lazurkin YS, Gause GG. Fine structure of DNA melting curves. Biopolymers 1976; 15:1019-36. [PMID: 1268312 DOI: 10.1002/bip.1976.360150602] [Citation(s) in RCA: 69] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
39
|
Slavik NS, Hershberger CL. Internal structural organization of chloroplast DNA from Euglena gracilis Z. J Mol Biol 1976; 103:563-81. [PMID: 820861 DOI: 10.1016/0022-2836(76)90217-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
40
|
Schmidt F, Besemer J, Starlinger P. The isolation of IS1 and IS2 DNA. MOLECULAR & GENERAL GENETICS : MGG 1976; 145:145-54. [PMID: 934052 DOI: 10.1007/bf00269586] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
DNA of the IS-elements IS1 and IS2 was prepared by digestion of appropriate heteroduplex molecules with endonuclease S1, followed by sucrose gradient centrifugation or gel electrophoresis. The material obtained is homogeneous with regard to size. The length of IS1 DNA is 820 +/- 65 nucleotides, the length of IS2 DNA is 1,350 +/- 70 nucleotides. IS1 DNA is not cleaved by the restriction endonucleases Eco R1, Hind II or Hind III. IS2 DNA is cleaved once by each of the two latter enzymes. The buoyand density determined by equilibrium centrifugation of Hg-complexes in Cs2so4 corresponds to a GC content of approximately 50%. Labelling with polynucleotide kinase indicates that both IS DNA's have a guanosyl residue at both of their 5'-termini.
Collapse
|
41
|
Gotoh O, Husimi Y, Yabuki S, Wada A. Hyperfine structure in melting profile of bacteriophage lambda DNA. Biopolymers 1976; 15:655-70. [PMID: 1252601 DOI: 10.1002/bip.1976.360150406] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
42
|
Paleček E. Premelting Changes in DNA Conformation. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 1976. [DOI: 10.1016/s0079-6603(08)60588-7] [Citation(s) in RCA: 54] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
|
43
|
Yabuki S, Gotoh O, Wada A. Fine structures in denaturation curves of bacteriophage lambda DNA. Their relation to the intramolecular heterogeneity in base compositon. BIOCHIMICA ET BIOPHYSICA ACTA 1975; 395:258-73. [PMID: 1148236 DOI: 10.1016/0005-2787(75)90196-3] [Citation(s) in RCA: 33] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Precise recording of polyphasic optical melting curves was carried out for three kinds of bacteriophage lambda DNA differing in length (lambdac1857s7, lambdacIb2 and lambdacIb2b5). Each of denaturation steps in melting profiles was characterized by two parameters, the melting temperature and the relative size. Any difference in fine structures in melting profiles was not recognized between the intact lambdacI857s7DNA and the DNA fragmented into halves. The change in fine structures in melting profiles caused by the deletions of the b2 and b5 region agreed qualitatively well with the prediction based on the physical and the genetical maps of phage lambda chromosome. The combined results indicate that, first, the well-known linear relationship between melting temperature and G+C content may apply also to each of denaturation steps in polyphasic melting curves due to heterogeneity of nucleotide distribution in a single DNA species, and, second, the effect of molecular ends on melting fine structures can be neglected at moderate salt concentration (0.01 M less than or equal to Na+ less than or equal to 0.2 M) for such a high molecular weight DNA. The heterogeneous distribution of nucleotides was derived for lambdaDNA and for its b2 and b5 regions.
Collapse
|
44
|
|
45
|
Abstract
A doublet frequency count (set of frequencies of the 16 possible two-base sequences) can be calculated from the experimentally determined overall sequence of a nucleic acid. In this paper, a statistical methodology is developed for comparing such counts with random, with others of the same type or with doublet proportions found in whole DNAs. The methods are applied to two major categories of sequenced RNAs. It is found that vertebrate ribosomal and transfer RNAs show significant differences from the overall vertebrate DNA pattern, especially in the frequency of the doublet CG. Bacterial rRNA and tRNA, on the other hand, show less dissimilarity from total DNA. In the RNA of the small bacteriophage MS2, the doublet frequencies of the translated regions of the genome resemble those in the host E. coli, whereas those in the intercistronic regions differ substantially. All these findings are discussed in relation to the origin, evolution and selection of the nucleic acids concerned.
Collapse
|
46
|
|
47
|
Elton RA. Sites of cleavage by restriction enzymes in viral DNAs: comparison of observed and expected frequencies. Nucleic Acids Res 1974; 1:1343-50. [PMID: 10793694 PMCID: PMC344355 DOI: 10.1093/nar/1.10.1343] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Observed frequencies of restriction enzyme cleavage in viral DNAs are compared to those expected from the cleavage site sequence and the known size and nearest neighbour frequencies of the DNAs. It is found that a number of significant deviations from expectation occur, and possible explanations are put forward.
Collapse
|
48
|
|
49
|
Hayward GS. Unique double-stranded fragments of bacteriophage T5 DNA resulting from preferential shear-induced breakage at nicks. Proc Natl Acad Sci U S A 1974; 71:2108-12. [PMID: 4525320 PMCID: PMC388396 DOI: 10.1073/pnas.71.5.2108] [Citation(s) in RCA: 41] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Nicks within one strand of the bacteriophage T5 DNA molecule act as " weak points" for a novel kind mechanical breakage that can be utilized for "dissecting" the genome. The products from sheared T5(+) DNA include five unique double-stranded segments of the molecule and various combinations of adjacent segments. These specific fragments are not obtained after repair of the nicks with DNA ligase (EC 6.5.1.1). The duplex fragments and most of their single-stranded components have been separated, identified, and mapped by means of agarose gel electrophoresis. Even the complementary strands of the unique fragments separate in agarose gels; hence, there are now three useful classes of DNA fragments available from T5: the natural r-strand fragments, their complements from the normally intact l strand, and the corresponding duplex segments. By summing the apparent molecular weights of their single-stranded components, the unique duplex fragments from T5(+) DNA can be assigned molecular weights of approximately 6 x 10(6) (A), 8 x 10(6) (B), 11 x 10(6) (C), 27 x 10(6) (D), and 30 x 10(6) (E) from left to right along the genome. The most abundant overlapping fragments are segments AB (14 x 10(6)) and ABC (26 x 10(6)). Differences in the number and relative positions of nicks within two distinct groups of heatstable deletion mutants [represented by T5st(O) and T5b3] account for the large differences observed in their patterns of breakage products.
Collapse
|
50
|
Roizès G. A nuclease from Neurospora crassa specific for d(A-T) rich regions in double stranded DNA. Nucleic Acids Res 1974; 1:443-53. [PMID: 10793677 PMCID: PMC344027 DOI: 10.1093/nar/1.3.443] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A nuclease from N. crassa has been prepared to the hydroxylapatite stage of purification described by Rabin and Fraser (1). It degrades single stranded DNA in an essentially exonucleolytic process. It does not give any appreciable acid soluble material with double stranded DNA as substrate. This shows its high degree of specificity towards single stranded DNA. An extra activity which makes double strand breaks in the AT rich regions of DNA is described. A limited number of breaks are observed with calf or mouse DNAs, as judged by the decrease of sedimentation velocity in alkaline or neutral sucrose gradients. The number of breaks decreases with increasing the ionic strength. lambda DNA is broken in half at the AT rich middle region of the molecule. The circular replicative form of fd DNA is converted to linear pieces of a rather homogeneous length. It is concluded from these results that it is an endonuclease activity specific for d(A-T) rich regions in double stranded DNA.
Collapse
|