1
|
Jost D, Everaers R. Genome wide application of DNA melting analysis. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2009; 21:034108. [PMID: 21817253 DOI: 10.1088/0953-8984/21/3/034108] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Correspondences between functional and thermodynamic melting properties in a genome are being increasingly employed for ab initio gene finding and for the interpretation of the evolution of genomes. Here we present the first systematic genome wide comparison between biologically coding domains and thermodynamically stable regions. In particular, we develop statistical methods to estimate the reliability of the resulting predictions. Not surprisingly, we find that the success of the approach depends on the difference in GC content between the coding and the non-coding parts of the genome and on the percentage of coding base-pairs in the sequence. These prerequisites vary strongly between species, where we observe no systematic differences between eukaryotes and prokaryotes. We find a number of organisms in which the strong correlation of coding domains and thermodynamically stable regions allows us to identify putative exons or genes to complement existing approaches. In contrast to previous investigations along these lines we have not employed the Poland-Scheraga (PS) model of DNA melting but use the earlier Zimm-Bragg (ZB) model. The Ising-like form of the ZB model can be viewed as an approximation to the PS model, with averaged loop entropies included into the cooperative factor [Formula: see text]. This results in a speed-up by a factor of 20-100 compared to the Fixman-Freire algorithm for the solution of the PS model. We show that for genomic sequences the resulting systematic errors are negligible compared to the parameterization uncertainty of the models. We argue that for limited computing resources, available CPU power is better invested in broadening the statistical base for genomic investigations than in marginal improvements of the description of the physical melting behavior.
Collapse
Affiliation(s)
- Daniel Jost
- Laboratoire de Physique de l'École Normale Supérieure de Lyon, Université de Lyon, CNRS UMR 5672, 46 Allée d'Italie 69364 Lyon Cedex 07, France
| | | |
Collapse
|
2
|
Hanai R, Suyama A, Wada A. Characteristic features of thermal stability map of DNA in Escherichia coli and eukaryotic genes. J Biomol Struct Dyn 1988; 6:51-62. [PMID: 3078238 DOI: 10.1080/07391102.1988.10506482] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Distribution of double-helix thermal stability of Escherichia coli and eukaryotic DNAs was analyzed. The results confirmed the previous propositions based on the study of the stability distribution in phage DNAs: (1) stability fluctuation appears near the boundaries of protein coding regions (PCRs) and non protein coding regions (NPCRs); (2) PCRs have less fluctuation than NPCRs. The present analysis also revealed that the local G + C content is lower in the beginning of PCRs of E. coli than the average G + C content of PCR and that deviations in the amino acid composition and the third letter usage PCRs are involved in the low G + C content; the biological meaning of this is discussed in relation to mRNA structure.
Collapse
Affiliation(s)
- R Hanai
- Department of Physics, Faculty of Science, University of Tokyo, Japan
| | | | | |
Collapse
|
3
|
Adams RL, Davis T, Rinaldi A, Eason R. CpG deficiency, dinucleotide distributions and nucleosome positioning. EUROPEAN JOURNAL OF BIOCHEMISTRY 1987; 165:107-15. [PMID: 3569286 DOI: 10.1111/j.1432-1033.1987.tb11200.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The dinucleotide CpG is deficient in (A + T)-rich regions of vertebrate DNA in both coding and non-coding sequences and there is a corresponding increase above expectation in the occurrence of TpG and CpA. By contrast in (G + C)-rich regions no deficiency of CpG is found. Such (G + C)-rich sequences, containing the expected number of CpG dinucleotides, alternate along the genome with (A + T)-rich sequences which have a lower than expected CpG content. The G + C content of vertebrate DNA can oscillate with a period of 150-200 bp and this may be a factor in positioning nucleosomes. The role of mutagenesis in loss of CpG and increase of A + T, particularly in non-coding regions, is discussed.
Collapse
|
4
|
Broders F, Razin S, Farache G, Moreau J, Scherrer K. Correlations of repetitive and AT-rich DNA segments within the chicken globin gene domains. Mol Biol Rep 1986; 11:177-87. [PMID: 3020393 DOI: 10.1007/bf00419739] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The repetitive DNA segments were mapped within a 30 Kbp genomic domain including (in 5' to 3' order) the chicken embryonic pi and adult alpha D (minor) and alpha A (major) globin genes. Two repeats map 5 and 8 Kbp upstream from the embryonic pi gene and another 3 Kbp downstream of the adult alpha A gene. These repetitive DNA sequences are placed within, or immediately adjacent to the AT-rich DNA segments framing this domain. Similar correlations exist also within the chicken beta globin gene domain. The positions of these AT-rich and repetitive DNA segments framing the alpha globin gene domain also correlate with other already explored features of long range DNA organisation, as clusters of sites of DNAse I hypersensitivity and differential methylation, sites of Matrix-DNA attachment, and with the beginning and end of the transcribed domain.
Collapse
|
5
|
Sheflin LG, Kowalski D. Altered DNA conformations detected by mung bean nuclease occur in promoter and terminator regions of supercoiled pBR322 DNA. Nucleic Acids Res 1985; 13:6137-54. [PMID: 2995917 PMCID: PMC321943 DOI: 10.1093/nar/13.17.6137] [Citation(s) in RCA: 72] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Mung bean nuclease was used to probe for recognizable DNA unwinding and unpairing in the plasmid pBR322. In negatively supercoiled DNA, but not relaxed DNA, cleavages occurred preferentially in non-coding regions of the genome. The types of nucleotide sequences cleaved and which non-coding regions were cleaved depended upon environmental conditions. At 37 degrees C, cleavages occurred in an 84 bp A+T-rich sequence in the terminator region of the ampicillin-resistance gene. Recognition is likely based on a novel DNA conformation which occurs in the longest, most dA+dT-rich region of pBR322. In the presence of 1 mM Mg2+, cleavages occurred in inverted repeated sequences in the promoter regions of the RNA primer for DNA replication and ampicillin- and tetracycline-resistance genes as well as the terminator of RNA-1. Potential loops of hairpin (cruciform) structures were cleaved. At 27 degrees C, cleavages occurred near a promoter activated by cAMP receptor protein in vitro and in the 3' non-coding region of the tetracycline-resistance gene. Thus, in supercoiled pBR322 DNA, recognizable DNA unwinding and unpairing occurs preferentially in regulatory regions for transcription and DNA replication.
Collapse
|
6
|
Srivenugopal KS, Morris DR. Differential modulation by spermidine of reactions catalyzed by type 1 prokaryotic and eukaryotic topoisomerases. Biochemistry 1985; 24:4766-71. [PMID: 3000418 DOI: 10.1021/bi00339a009] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
In the absence of DNA aggregation, spermidine inhibited the relaxation of negatively supercoiled DNA by Escherichia coli topoisomerase I at concentrations of the polyamine normally found intracellularly. Spermidine also curtailed the cleavage of negatively supercoiled ColE1 DNA by the enzyme in the absence of Mg2+. On the contrary, knotting of M13 single-stranded DNA circles catalyzed by topoisomerase I was stimulated by the polyamine. Relaxation of supercoiled DNA by eukaryotic type 1 topoisomerases, such as calf thymus topoisomerase I and wheat germ topoisomerase, was significantly stimulated by spermidine in the same range of concentrations that inhibited the prokaryotic enzyme. In reactions catalyzed by S1 nuclease, the polyamine enhanced the digestion of single-stranded DNA and inhibited the nicking of negatively supercoiled DNA. These results suggest that spermidine modifies the supercoiled duplex substrate in these reactions by modulating the degree of single strandedness.
Collapse
|
7
|
Wada A, Suyama A. Stability distribution in the phage lambda-DNA double helix: a correlation between physical and genetic structure. J Biomol Struct Dyn 1984; 2:573-91. [PMID: 6242869 DOI: 10.1080/07391102.1984.10507592] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Statistical analyses on the positional correlation of physical-stability and base-sequence distribution maps with genetic map are made for the whole DNA (48502 bases) of lambda-phage. The susceptibility to a double-helix unfolding perturbation and the fraction of the transient opening of a particular region of the double helix are adopted to define this physical stability. The principal features obtained are: A) The DNA double strand of protein coding regions is found to have homostabilizing propensity around a defined stability which is characteristic to each individual gene. B) The stability of the double helix in non-protein coding region fluctuates, on average over the whole region, more than that in protein coding region. C) Boundary regions of protein coding and non-protein coding regions are regions of high stability-fluctuation. Stability especially fluctuates at the protein-coding-region side of the boundary. Contrary to the quiet feature of the interior part of protein coding region rather noisy part exists at its edge. D) One frequently opening region coincides with the attaching site for the site specific recombination between phage and bacterial DNA. There are two possible ways to explain the noisy feature in the stability distribution in non-protein coding regions: 1) The region has been used as the locus of recombination as evolution took place. Thus DNAs which were homostabilized around a different value characteristic to each individual DNA, have been joined there many times, so that the noise has accumulated as a remnant of evolutional history; and/or 2) the base-composition homogenizing or double-helix homostabilizing mechanism does not work in unneeded region such as non-protein coding region or introns. Since corresponding characteristics have been found in our previous analyses on other viral and globin-gene DNAs, the rules mentioned above may be comprehensively extended to other DNAs.
Collapse
Affiliation(s)
- A Wada
- Department of Physics, Faculty of Science, University of Tokyo, Japan
| | | |
Collapse
|
8
|
Sheflin LG, Kowalski D. Mung bean nuclease cleavage of a dA + dT-rich sequence or an inverted repeat sequence in supercoiled PM2 DNA depends on ionic environment. Nucleic Acids Res 1984; 12:7087-104. [PMID: 6091054 PMCID: PMC320144 DOI: 10.1093/nar/12.18.7087] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
We have determined the nucleotide sequences around two alternative sites cleaved in supercoiled PM2 DNA by single-strand-specific mung bean nuclease in different ionic environments. In 10 mM Tris-HC1 (pH 7.0, 37 degrees C), the major site is a dA+dT-rich sequence which maps with a known early denaturation region at 0.75 map units. About 30 cleavages occurred in a 135 bp region. Cleavages were largely excluded at (dA)n . (dT)n (n = 3-7) sequences. Cleavage patterns of this type have not been previously observed in dA+dT-rich sequences. With the addition of 0.1 M NaC1 the major alternative site occurred in a hyphenated inverted repeat sequence 500 bp away (0.70 map units) and did not map to an early denaturation region. One major and 4 minor cleavages occurred in the region between the repeats, suggesting that a hairpin containing at most a 12 bp stem and 10 base loop is recognized. The basis for nuclease recognition of the dA+dT-rich sequence is not clear. The differences in the sequences and cleavage patterns at the alternative sites indicate that their secondary structures differ.
Collapse
|
9
|
Suyama A, Wada A. Correlation between thermal stability maps and genetic maps of double-stranded DNAs. J Theor Biol 1983; 105:133-45. [PMID: 6317987 DOI: 10.1016/0022-5193(83)90429-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
The positional correlation of boundaries of cooperatively melting regions with boundaries of protein coding regions and mere open reading frames, and with cleavage sites of restriction enzymes and S1 nuclease are statistically examined for phi X174, G4, fd, SV40, BKV, and polyoma DNAs. A statistically significant correlation does exist in the case of boundaries of protein coding regions, but none is detected for boundaries of mere open reading frames or cleavage sites of restriction enzymes and S1 nuclease. The significant correlation disappears when the cooperativity of melting of the DNA double strand decreases to a nonphysiological condition. The result presents the first evidence showing that a physical property of DNAs correlates with their biological function.
Collapse
|
10
|
Klein RD, Wells RD. Effects of neighboring DNA homopolymers on the biochemical and physical properties of the Escherichia coli lactose promoter. I. Cloning and characterization studies. J Biol Chem 1982. [DOI: 10.1016/s0021-9258(18)33608-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
11
|
Otsuka J, Kunisawa T. Characteristic base sequence patterns of promoter and terminator sites in phi X174 and fd phage DNAs. J Theor Biol 1982; 97:415-36. [PMID: 6215541 DOI: 10.1016/0022-5193(82)90374-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
12
|
Gotoh O, Tagashira Y. Locations of frequently opening regions on natural DNAs and their relation to functional loci. Biopolymers 1981; 20:1043-58. [PMID: 7225530 DOI: 10.1002/bip.1981.360200514] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
13
|
Wells RD, Goodman TC, Hillen W, Horn GT, Klein RD, Larson JE, Müller UR, Neuendorf SK, Panayotatos N, Stirdivant SM. DNA structure and gene regulation. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 1980; 24:167-267. [PMID: 7005967 DOI: 10.1016/s0079-6603(08)60674-1] [Citation(s) in RCA: 175] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
14
|
Wada A, Yabuki S, Husimi Y. Fine structure in the thermal denaturation of DNA: high temperature-resolution spectrophotometric studies. CRC CRITICAL REVIEWS IN BIOCHEMISTRY 1980; 9:87-144. [PMID: 6777116 DOI: 10.3109/10409238009105432] [Citation(s) in RCA: 94] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Fine structures which appear in the optical melting profile of DNA are examined from both the experimental and theoretical aspects. After a brief historical survey of the DNA melting experiments during the pre-fine-structure era in Section II, the high temperature-resolution experimental techniques which are essential to the investigation of fine structure are described in Section III. Then, the current status of the high-resolution study is reviewed first by a phenomenological description of the melting profile (Section IV) and then of the refolding profile (Section V), where a general idea about the cooperatively melting region and several factors affecting it is given. Sections VI and VII are devoted to the review of current theoretical works. Several well-established theoretical frameworks which correlate the base sequence with the melting phenomena are examined in terms of their rigorousness and usefulness. The molecular thermodynamic parameters concerning the DNA melting which have been evaluated by several research groups are compared and discussed. Finally, in Section VIII, current ideas on the correlation between the fine structure and genetic functions and genetic maps are reviewed. Some future problems relating to the fine structure are also discussed.
Collapse
|
15
|
Funnell BE, Inman RB. Comparison of partial denaturation maps with the known sequence of simian virus 40 and phi X174 replicative form DNA. J Mol Biol 1979; 131:331-40. [PMID: 226718 DOI: 10.1016/0022-2836(79)90079-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|