1
|
Jabbari K, Chakraborty M, Wiehe T. DNA sequence-dependent chromatin architecture and nuclear hubs formation. Sci Rep 2019; 9:14646. [PMID: 31601866 PMCID: PMC6787200 DOI: 10.1038/s41598-019-51036-9] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Received: 06/20/2019] [Accepted: 09/18/2019] [Indexed: 02/08/2023] Open
Abstract
In this study, by exploring chromatin conformation capture data, we show that DNA sequence composition contributes to the nuclear segregation of Topologically Associated Domains (TADs). GC-peaks and valleys of TADs strongly influence interchromosomal interactions and chromatin 3D structure. To gain insight into the compositional and functional constraints associated with chromatin interactions and TAD formation, we analysed intra-TAD and intra-loop GC variations. This led to the identification of clear GC-gradients, along which the density of genes, super-enhancers, transcriptional activity, and CTCF binding site occupancy co-vary non-randomly. Further, the analysis of DNA base composition of nucleolar aggregates and nuclear speckles showed strong sequence-dependent effects. We conjecture that dynamic DNA binding affinity and flexibility underlie the emergence of chromatin condensates; their growth is likely promoted in mechanically soft (GC-rich) regions of the lowest chromatin and nucleosome densities. As a practical perspective, the strong linear association between sequence composition and interchromosomal contacts can help define consensus chromatin interactions, which in turn may be used to study alternative states of chromatin architecture.
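The GC peaks, valleys and gradients mentioned in this abstract are properties of a GC profile along the sequence. The following is only a minimal sketch of how such a profile could be computed with fixed windows; the function names, the window size and the toy sequence are illustrative assumptions and are not taken from the paper.

```python
def gc_fraction(seq):
    """Return the G+C fraction of seq, ignoring non-ACGT symbols (e.g. N)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return None  # no informative bases in this window
    return (seq.count("G") + seq.count("C")) / acgt


def windowed_gc(seq, window, step=None):
    """Return (start, gc_fraction) pairs for consecutive windows along seq.

    window and step are hypothetical parameters; step defaults to window,
    which gives non-overlapping windows.
    """
    step = step or window
    profile = []
    for start in range(0, len(seq) - window + 1, step):
        profile.append((start, gc_fraction(seq[start:start + window])))
    return profile


# Toy sequence: a GC-rich block, an AT-rich block, and a mixed block.
profile = windowed_gc("GCGCGCGCATATATATGCGCATAT", window=8)
for start, gc in profile:
    print(start, gc)
```

On real data the window would typically span kilobases rather than eight bases, and local maxima and minima of the profile would play the role of peaks and valleys.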
Affiliation(s)
- Kamel Jabbari
- Institute for Genetics, Biocenter Cologne, University of Cologne, Zülpicher Straße 47a, 50674, Köln, Germany.
- Maharshi Chakraborty
- Institute for Genetics, Biocenter Cologne, University of Cologne, Zülpicher Straße 47a, 50674, Köln, Germany
- Thomas Wiehe
- Institute for Genetics, Biocenter Cologne, University of Cologne, Zülpicher Straße 47a, 50674, Köln, Germany
|
2
|
Apostolou-Karampelis K, Polychronopoulos D, Almirantis Y. Introduction of 'Generalized Genomic Signatures' for the quantification of neighbour preferences leads to taxonomy- and functionality-based distinction among sequences. Sci Rep 2019; 9:1700. [PMID: 30737442 PMCID: PMC6368578 DOI: 10.1038/s41598-018-38157-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/02/2018] [Accepted: 12/06/2018] [Indexed: 11/16/2022] Open
Abstract
Analysis of DNA composition at several length scales constitutes the bulk of many early studies aimed at unravelling the complexity of the organization and functionality of genomes. Dinucleotide relative abundances are considered an idiosyncratic feature of genomes, regarded as a ‘genomic signature’. Motivated by this finding, we introduce the ‘Generalized Genomic Signatures’ (GGSs), composed of over- and under-abundances of all oligonucleotides of a given length, thus filtering out compositional trends and neighbour preferences at any shorter range. Previous works on alignment-free genomic comparisons mostly rely on k-mer frequencies and not on distance-dependent neighbour preferences. Therein, nucleotide composition and proximity preferences are combined, while in the present work they are strictly separated, focusing uniquely on neighbour relationships. GGSs retain the potential or even outperform genomic signatures defined at the dinucleotide level in distinguishing between taxonomic subdivisions of bacteria, and can be more effectively implemented in microbial phylogenetic reconstruction. Moreover, we compare DNA sequences from the human genome corresponding to protein coding segments, conserved non-coding elements and non-functional DNA stretches. These classes of sequences have distinctive GGSs according to their genomic role and degree of conservation. Overall, GGSs constitute a trait characteristic of the evolutionary origin and functionality of different genomic segments.
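The dinucleotide 'genomic signature' that this abstract starts from is commonly expressed as an odds ratio, the observed dinucleotide frequency divided by the product of the two mononucleotide frequencies. The sketch below computes that single-strand ratio only. It is not the GGS construction of the paper, whose exact definition is not given in the abstract, and the function name is a hypothetical choice.

```python
from collections import Counter
from itertools import product


def dinucleotide_odds(seq):
    """Return {dinucleotide: f_xy / (f_x * f_y)} for one strand of seq.

    Values above 1 indicate over-abundance and values below 1
    under-abundance relative to base composition alone.
    Dinucleotides containing a base absent from seq are omitted.
    """
    seq = seq.upper()
    mono = Counter(base for base in seq if base in "ACGT")
    pairs = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in "ACGT" and seq[i + 1] in "ACGT"
    )
    n_mono = sum(mono.values())
    n_pairs = sum(pairs.values())
    if n_mono == 0 or n_pairs == 0:
        return {}
    rho = {}
    for x, y in product("ACGT", repeat=2):
        expected = (mono[x] / n_mono) * (mono[y] / n_mono)
        if expected > 0:
            rho[x + y] = (pairs[x + y] / n_pairs) / expected
    return rho


rho = dinucleotide_odds("ACACACAC")
print(rho)
```

Published signatures are usually computed on the sequence concatenated with its reverse complement so that the result is strand-symmetric; that step is left out here for brevity.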
Affiliation(s)
- Yannis Almirantis
- Institute of Biosciences and Applications, National Center for Scientific Research "Demokritos", 15310, Athens, Greece.
|
3
|
Franck S, Strodtman KN, Qiu J, Emerich DW. Transcriptomic Characterization of Bradyrhizobium diazoefficiens Bacteroids Reveals a Post-Symbiotic, Hemibiotrophic-Like Lifestyle of the Bacteria within Senescing Soybean Nodules. Int J Mol Sci 2018; 19:E3918. [PMID: 30544498 PMCID: PMC6321122 DOI: 10.3390/ijms19123918] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 10/08/2018] [Revised: 11/26/2018] [Accepted: 11/28/2018] [Indexed: 12/23/2022] Open
Abstract
The transcriptional activity of Bradyrhizobium diazoefficiens isolated from soybean nodules was monitored over the period from symbiosis to late plant nodule senescence. The bacteria retained a near constant level of RNA throughout this period, and the variation in genes demonstrating increased, decreased, and/or patterned transcriptional activity indicates that the bacteria are responding to the changing environment within the nodule as the plant cells progress from an organized cellular structure to an unorganized state of internal decay. The transcriptional variation and persistence of the bacteria suggest that the bacteria are adapting to their environment and acting similarly to hemibiotrophs, which survive both as saprophytes on live plant tissues and then as necrophytes on decaying plant tissues. The host plant restrictions of symbiosis make B. diazoefficiens a highly specialized, restricted hemibiotroph.
Affiliation(s)
- Sooyoung Franck
- Division of Biochemistry, University of Missouri, Columbia, MO 65211, USA.
- Kent N Strodtman
- Division of Biochemistry, University of Missouri, Columbia, MO 65211, USA.
- Jing Qiu
- Applied Economics and Statistics, University of Delaware, Newark, DE 19716, USA.
- David W Emerich
- Division of Biochemistry, University of Missouri, Columbia, MO 65211, USA.
|
4
|
Brunet TDP, Doolittle WF. Multilevel Selection Theory and the Evolutionary Functions of Transposable Elements. Genome Biol Evol 2015; 7:2445-57. [PMID: 26253318 PMCID: PMC4558868 DOI: 10.1093/gbe/evv152] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Indexed: 01/08/2023] Open
Abstract
One of several issues at play in the renewed debate over “junk DNA” is the organizational level at which genomic features might be seen as selected, and thus to exhibit function, as etiologically defined. The intuition frequently expressed by molecular geneticists that junk DNA is functional because it serves to “speed evolution” or as an “evolutionary repository” could be recast as a claim about selection between species (or clades) rather than within them, but this is not often done. Here, we review general arguments for the importance of selection at levels above that of organisms in evolution, and develop them further for a common genomic feature: the carriage of transposable elements (TEs). In many species, not least our own, TEs comprise a large fraction of all nuclear DNA, and whether they individually or collectively contribute to fitness, or are instead junk, is a subject of ongoing contestation. Even if TEs generally owe their origin to selfish selection at the lowest level (that of genomes), their prevalence in extant organisms and the prevalence of extant organisms bearing them must also respond to selection within species (on organismal fitness) and between species (on rates of speciation and extinction). At an even higher level, the persistence of clades may be affected (positively or negatively) by TE carriage. If indeed TEs speed evolution, it is at these higher levels of selection that such a function might best be attributed to them as a class.
Affiliation(s)
- Tyler D P Brunet
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- W Ford Doolittle
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
|
5
|
|
6
|
Abstract
Do data from the Encyclopedia Of DNA Elements (ENCODE) project render the notion of junk DNA obsolete? Here, I review older arguments for junk grounded in the C-value paradox and propose a thought experiment to challenge ENCODE's ontology. Specifically, what would we expect for the number of functional elements (as ENCODE defines them) in genomes much larger than our own genome? If the number were to stay more or less constant, it would seem sensible to consider the rest of the DNA of larger genomes to be junk or, at least, assign it a different sort of role (structural rather than informational). If, however, the number of functional elements were to rise significantly with C-value then, (i) organisms with genomes larger than our genome are more complex phenotypically than we are, (ii) ENCODE's definition of functional element identifies many sites that would not be considered functional or phenotype-determining by standard uses in biology, or (iii) the same phenotypic functions are often determined in a more diffuse fashion in larger-genomed organisms. Good cases can be made for propositions ii and iii. A larger theoretical framework, embracing informational and structural roles for DNA, neutral as well as adaptive causes of complexity, and selection as a multilevel phenomenon, is needed.
Affiliation(s)
- W Ford Doolittle
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada B3H 4R2.
|
7
|
Mahale KN, Kempraj V, Dasgupta D. Does the growth temperature of a prokaryote influence the purine content of its mRNAs? Gene 2012; 497:83-9. [PMID: 22305982 DOI: 10.1016/j.gene.2012.01.040] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 11/22/2011] [Accepted: 01/19/2012] [Indexed: 11/20/2022]
Abstract
The formation and breaking of hydrogen bonds between nucleic acid bases are dependent on temperature. The high G+C content of organisms was surmised to be an adaptation for high temperature survival because of the thermal stability of G:C pairs. However, a survey of genomic GC% and optimum growth temperature (OGT) of several prokaryotes ruled out any direct relation between them. Significantly high purine (R = A or G) content in mRNAs is also seen as a selective response for survival among thermophiles. Nevertheless, the biological relevance of thermophiles loading their unstable mRNAs with excess purines (purine-loading or R-loading) is not persuasive. Here, we analysed the mRNA sequences from the genomes of 168 prokaryotes (as obtained from NCBI Genome database) with their OGTs ranging from -5 °C to 100 °C to verify the relation between R-loading and OGT. Our analysis fails to demonstrate any correlation between R-loading of the mRNA pool and OGT of a prokaryote. The percentage of purine-loaded mRNAs in prokaryotes is found to be in a rough negative correlation with the genomic GC% (r² = 0.655, slope = -1.478, P < 0.001). We conclude that genomic GC% and bias against certain combinations of nucleotides drive the mRNA-synonymous (sense) strands of DNA towards variations in R-loading.
|
8
|
Bassi P. Quantitative variations of nuclear DNA during plant development: a critical analysis. Biol Rev Camb Philos Soc 2008. [DOI: 10.1111/j.1469-185x.1990.tb01424.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.9] [Indexed: 11/28/2022]
|
9
|
Abstract
The vertebrate genome is a mosaic of GC-poor and GC-rich isochores, megabase-sized DNA regions of fairly homogeneous base composition that differ in relative amount, gene density, gene expression, replication timing, and recombination frequency. At the emergence of warm-blooded vertebrates, the gene-rich, moderately GC-rich isochores of the cold-blooded ancestors underwent a GC increase. This increase was similar in mammals and birds and was maintained during the evolution of mammalian and avian orders. Neither the GC increase nor its conservation can be accounted for by the random fixation of neutral or nearly neutral single-nucleotide changes (i.e., the vast majority of nucleotide substitutions) or by a biased gene conversion process occurring at random genome locations. Both phenomena can be explained, however, by the neoselectionist theory of genome evolution that is presented here. This theory fully accepts Ohta's nearly neutral view of point mutations but proposes in addition (i) that the AT-biased mutational input present in vertebrates pushes some DNA regions below a certain GC threshold; (ii) that these lower GC levels cause regional changes in chromatin structure that lead to deleterious effects on replication and transcription; and (iii) that the carriers of these changes undergo negative (purifying) selection, the final result being a compositional conservation of the original isochore pattern in the surviving population. Negative selection may also largely explain the GC increase accompanying the emergence of warm-blooded vertebrates. In conclusion, the neoselectionist theory not only provides a solution to the neutralist/selectionist debate but also introduces an epigenomic component in genome evolution.
Affiliation(s)
- Giorgio Bernardi
- Molecular Evolution Laboratory, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy.
|
10
|
Paz A, Mester D, Nevo E, Korol A. Looking for organization patterns of highly expressed genes: purine-pyrimidine composition of precursor mRNAs. J Mol Evol 2007; 64:248-60. [PMID: 17211550 DOI: 10.1007/s00239-006-0135-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 06/21/2006] [Accepted: 11/19/2006] [Indexed: 01/05/2023]
Abstract
We analyzed precursor messenger RNAs (pre-mRNAs) of 12 eukaryotic species. In each species, three groups of highly expressed genes, ribosomal proteins, heat shock proteins, and amino-acyl tRNA synthetases, were compared with a control group (randomly selected genes). The purine-pyrimidine (R-Y) composition of pre-mRNAs of the three targeted gene groups proved to differ significantly from the control. The exons of the three groups tested have higher purine contents and R-tract abundance and lower abundance of Y-tracts compared to the control (R-tract: a tract of sequential purines, Rn with n ≥ 5; Y-tract: a tract of sequential pyrimidines, Yn with n ≥ 5). In species widely employing "intron definition" in the splicing process, the Y content of introns of the three targeted groups appeared to be higher compared to the control group. Furthermore, in all examined species, the introns of the targeted genes have a lower abundance of R-tracts compared to the control. We hypothesized that the R-Y composition of the targeted gene groups contributes to high rate and efficiency of both splicing and translation, in addition to the mRNA coding role. This is presumably achieved by (1) reducing the possibility of the formation of secondary structures in the mRNA, (2) using the R-tracts and R-biased sequences as exonic splicing enhancers, (3) lowering the number of targets for pyrimidine tract binding protein in the exons, and (4) reducing the number of target sequences for binding of serine/arginine-rich (SR) proteins in the introns, thereby allowing SR proteins to bind to proper (exonic) targets.
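The abstract defines R-tracts and Y-tracts as runs of at least five consecutive purines or pyrimidines. A minimal sketch of counting such tracts with regular expressions follows; the function name, the returned keys and the toy sequence are illustrative assumptions, and the paper's own abundance normalisation is not reproduced.

```python
import re


def count_tracts(seq, min_len=5):
    """Count maximal purine (R) and pyrimidine (Y) runs of at least min_len bases.

    Purines are A/G; pyrimidines are C/T (or U for RNA input).
    """
    seq = seq.upper()
    r_runs = re.findall(rf"[AG]{{{min_len},}}", seq)
    y_runs = re.findall(rf"[CTU]{{{min_len},}}", seq)
    return {
        "R_tracts": len(r_runs),
        "Y_tracts": len(y_runs),
        "longest_R": max(map(len, r_runs), default=0),
        "longest_Y": max(map(len, y_runs), default=0),
    }


# Toy sequence: purine runs of 6 and 8 bases, one pyrimidine run of 7 bases.
result = count_tracts("AAGGAGCTTTTCCAGAGGGAAC")
print(result)
```

Because the quantifier is greedy, each match is a maximal run, so a long tract is counted once rather than as several overlapping tracts of length five.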
Affiliation(s)
- A Paz
- Institute of Evolution, Haifa University, Mount Carmel, Haifa, 31905, Israel
|
11
|
Paz A, Mester D, Baca I, Nevo E, Korol A. Adaptive role of increased frequency of polypurine tracts in mRNA sequences of thermophilic prokaryotes. Proc Natl Acad Sci U S A 2004; 101:2951-6. [PMID: 14973185 PMCID: PMC365726 DOI: 10.1073/pnas.0308594100] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.0] [Indexed: 11/18/2022] Open
Abstract
The mechanism of an organism's adaptation to high temperatures has been investigated intensively in recent years. It was suggested that the macromolecules of thermophilic microorganisms (especially proteins) have structural features that enhance their thermostability. We compared mRNA sequences of 72 fully sequenced prokaryotic proteomes (14 thermophilic and 58 mesophilic species). Although the differences between the percentage of adenine plus guanine content of whole mRNAs of different prokaryotic species are much lower than those of guanine plus cytosine content, the thermophile purine-pyrimidine (R/Y) ratio within their mRNAs is significantly higher than that of the mesophiles. The first and third codon positions of both thermophiles and mesophiles are purine-biased, with the bias more pronounced in the thermophiles. Thermophile mRNAs that display the highest R/Y ratio (1.43-1.69) are those of the ribosomal proteins, histone-like proteins, DNA-dependent RNA polymerase subunits, and heat-shock proteins. Within mesophilic prokaryotes and five eukaryotic species, the R/Y ratio of the mRNAs of heat-shock proteins is higher than their average over the coding part of the genome. Polypurine tracts (R)n (with n ≥ 5) are much more abundant within the thermophile mRNAs compared with mesophiles. Between two sequential pure-purinic codons of thermophile mRNAs, there is a rather strong tendency for the occurrence of adenine but not guanine tracts. The data suggest that mixed adenine·guanine and polyadenine tracts in mRNAs increase the thermostability beyond the contribution of amino acids encoded by purine tracts, which highlights the importance of ecological stress in the evolution of genome architecture.
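Two quantities in this abstract are simple to state in code: the R/Y ratio of an mRNA and the purine bias at each codon position. The sketch below assumes an in-frame coding sequence as input; both function names are hypothetical, and the example sequence is a toy, not data from the study.

```python
def ry_ratio(seq):
    """Return purines / pyrimidines for seq, or None if there are no pyrimidines."""
    seq = seq.upper().replace("U", "T")
    purines = seq.count("A") + seq.count("G")
    pyrimidines = seq.count("C") + seq.count("T")
    return purines / pyrimidines if pyrimidines else None


def purine_fraction_by_codon_position(cds):
    """Return [f1, f2, f3], the purine fraction at codon positions 1, 2 and 3.

    cds is assumed to start in frame; trailing bases that do not
    complete a codon are ignored.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    fractions = []
    for pos in range(3):
        bases = cds[pos:usable:3]
        if bases:
            fractions.append(sum(base in "AG" for base in bases) / len(bases))
        else:
            fractions.append(None)
    return fractions


cds = "ATGGCAGAA"  # toy coding sequence: ATG GCA GAA
ratio = ry_ratio(cds)
by_position = purine_fraction_by_codon_position(cds)
print(ratio, by_position)
```

A value above 0.5 at a given position corresponds to the purine bias described for the first and third codon positions.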
Affiliation(s)
- Arnon Paz
- Institute of Evolution, Haifa University, Mount Carmel, Haifa 31905, Israel
|
12
|
Nikolaou C, Almirantis Y. A study of the middle-scale nucleotide clustering in DNA sequences of various origin and functionality, by means of a method based on a modified standard deviation. J Theor Biol 2002; 217:479-92. [PMID: 12234754 DOI: 10.1006/jtbi.2002.3045] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Indexed: 11/22/2022]
Abstract
The deviation from randomness in the distribution of nucleotides in genomic sequences is quantified and studied, using a modified standard deviation (MSD). This method implies a "per block" computation of the standard deviation of the nucleotide frequencies of occurrence, using local means (means taken in a neighborhood of each block). This quantity may serve as a scale-dependent measure of the nucleotide clustering. In the present work, the meso-scale of tens of nucleotides is principally explored, by means of suitably adjusted filter parameters. This length scale is of an order of magnitude not directly affected by the grammar and syntax rules of the protein-coding procedure, remaining shorter than the scale of appearance of large-scale characteristics of the genome. MSD has been found to distinguish systematically between sequences of different origin and functionality. The most near-random are found to be coding sequences of prokaryotes, while in intronic and intergenic regions of eukaryotic genomes, extended clustering of similar nucleotides is observed. The distributions of MSD values of large collections of sequences are found to be in most cases characteristic of their biological role and origin. Protein-coding and non-coding, prokaryotic and eukaryotic DNA as well as promoter, rRNA, viral and organelle sequences have been examined. The presented results corroborate a recently proposed model for genome evolution. The method is also applied for an assessment of the annotation of ORFs taken from the complete genome of Saccharomyces cerevisiae.
Affiliation(s)
- Christoforos Nikolaou
- Institute of Biology, National Research Center for Physical Sciences, "Demokritos" 15310, Athens, Greece
|
13
|
Lao PJ, Forsdyke DR. Thermophilic bacteria strictly obey Szybalski's transcription direction rule and politely purine-load RNAs with both adenine and guanine. Genome Res 2000; 10:228-36. [PMID: 10673280 PMCID: PMC310832 DOI: 10.1101/gr.10.2.228] [Citation(s) in RCA: 83] [Impact Index Per Article: 3.3] [Received: 08/23/1999] [Accepted: 12/16/1999] [Indexed: 11/24/2022]
Abstract
When transcription is to the right of the promoter, the "top," mRNA-synonymous strand of DNA tends to be purine-rich. When transcription is to the left of the promoter, the top, mRNA-template strand tends to be pyrimidine-rich. This transcription-direction rule suggests that there has been an evolutionary selection pressure for the purine-loading of RNAs. The politeness hypothesis states that purine-loading prevents distracting RNA-RNA interactions and excessive formation of double-stranded RNA, which might trigger various intracellular alarms. Because RNA-RNA interactions have a distinct entropy-driven component, the pressure for the evolution of purine-loading might be greater in organisms living at high temperatures. In support of this, we find that Chargaff differences (a measure of purine-loading) are greater in thermophiles than in nonthermophiles and extend to both purine bases. In thermophiles the pressure to purine-load affects codon choice, indicating that some features of their amino acid composition (e.g., high levels of glutamic acid) might reflect purine-loading pressure (i.e., constraints on mRNA) rather than direct constraints on protein structure and function.
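Chargaff differences measure how far a single strand departs from A = T and G = C, and the abstract uses them as a measure of purine-loading. The abstract does not state the exact normalisation, so the sketch below assumes one plausible form, differences expressed per kilobase of sequence; the function name and the dictionary keys are likewise hypothetical.

```python
def chargaff_differences(seq):
    """Return strand-level base differences per kilobase (assumed normalisation).

    Positive values indicate an excess of the purine (A over T, G over C)
    on the strand supplied, i.e. purine-loading of that strand.
    """
    seq = seq.upper().replace("U", "T")
    a, c, g, t = (seq.count(base) for base in "ACGT")
    n = a + c + g + t
    if n == 0:
        return None
    a_minus_t = 1000 * (a - t) / n
    g_minus_c = 1000 * (g - c) / n
    return {
        "A_minus_T_per_kb": a_minus_t,
        "G_minus_C_per_kb": g_minus_c,
        "purine_excess_per_kb": a_minus_t + g_minus_c,
    }


# Toy mRNA-synonymous strand: 3 A, 3 G, 1 C, 1 T.
diffs = chargaff_differences("AAAGGGCT")
print(diffs)
```

Reporting the A and G components separately mirrors the abstract's point that purine-loading in thermophiles extends to both purine bases.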
Affiliation(s)
- P J Lao
- Department of Biochemistry, Queen's University, Kingston, Ontario, K7L 3N6, Canada
|
14
|
Brosius J. RNAs from all categories generate retrosequences that may be exapted as novel genes or regulatory elements. Gene 1999; 238:115-34. [PMID: 10570990 DOI: 10.1016/s0378-1119(99)00227-9] [Citation(s) in RCA: 275] [Impact Index Per Article: 10.6] [Indexed: 11/21/2022]
Abstract
While the significance of middle repetitive elements had been neglected for a long time, there are again tendencies to ascribe most members of a given middle repetitive sequence family a functional role--as if the discussion of SINE (short interspersed repetitive elements) function only can occupy extreme positions. In this article, I argue that differences between the various classes of retrosequences concern mainly their copy numbers. Consequently, the function of SINEs should be viewed as pragmatic such as, for example, mRNA-derived retrosequences, without underestimating the impact of retroposition for generation of novel protein coding genes or parts thereof (exon shuffling by retroposition) and in particular of SINEs (and retroelements) in modulating genes and their expression. Rapid genomic change by accumulating retrosequences may even facilitate speciation [McDonald, J.F., 1995. Transposable elements: possible catalysts of organismic evolution. Trends Ecol. Evol. 10, 123-126.] In addition to providing mobile regulatory elements, small RNA-derived retrosequences including SINEs can, in analogy to mRNA-derived retrosequences, also give rise to novel small RNA genes. Perhaps not representative for all SINE/master gene relationships, we gained significant knowledge by studying the small neuronal non-messenger RNAs, namely BC1 RNA in rodents and BC200 RNA in primates. BC1 is the first identified master gene generating a subclass of ID repetitive elements, and BC200 is the only known Alu element (monomeric) that was exapted as a novel small RNA encoding gene.
Affiliation(s)
- J Brosius
- Institute of Experimental Pathology/Molecular Neurobiology, ZMBE, University of Münster, Germany.
|
15
|
Matzke AJ, Matzke MA. Position effects and epigenetic silencing of plant transgenes. Curr Opin Plant Biol 1998; 1:142-8. [PMID: 10066569 DOI: 10.1016/s1369-5266(98)80016-2] [Citation(s) in RCA: 202] [Impact Index Per Article: 7.5] [Indexed: 05/03/2023]
Abstract
Nuclear processes that silence plant transgenes are being revealed by analyses of natural triggers of epigenetic modifications, particularly cytosine methylation, and by comparisons of the genomic environments of differentially expressed transgene loci. It is increasingly apparent that plant genomes can sense and respond to the presence of foreign DNA in certain sequence contexts and at multiple dispersed sites. Determining the basis of this sensitivity and how nuclear defense systems are activated poses major challenges for the future.
Affiliation(s)
- A J Matzke
- Institute of Molecular Biology, Austrian Academy of Sciences, Billrothstrasse 11, A-5020 Salzburg, Austria
|
16
|
Abstract
Transcriptional repression in eukaryotes often involves tens or hundreds of kilobase pairs, two to three orders of magnitude more than the bacterial operator/repressor model does. Classical repression, represented by this model, was maintained over the whole span of evolution under different guises, and consists of repressor factors interacting primarily with promoters and, in later evolution, also with enhancers. The use of much larger amounts of DNA in the other mode of repression, here called the sectorial mode ('superrepression'), results in the conceptual transfer of so-called junk DNA to the domain of functional DNA. This contribution to the solution of the c-value paradox involves perhaps 15% of genomic 'junk,' and encompasses the bulk of the introns, thought to fill a stabilizing role in sectorially repressed chromatin structures. In the case of developmental genes, such structures appear to be heterochromatoid in character. However, solid clues regarding general structural features of superrepressed terminal differentiation genes remain elusive. The competition among superrepressible DNA sectors for sectorially binding factors offers, in principle, a molecular mechanism for developmental switches. Position effect variegation may be considered an abnormal manifestation of normal processes that underlie development and involve heterochromatoid sectorial repression, which is apparently required for local elimination or modulation of morphological features (morpholysis). Sectorial repression of genes participating either in development or in terminal differentiation is considered instrumental in establishing stable cell types, and provides a basis for the distinction between determination and cell type specification. The gamut of possible stable cell types may have been broadened by the appearance in evolution of heavy isochores. Additional types of relatively frequent GC-rich cis-acting DNA motifs may offer reiterated binding sites to factors endowed with a selective (though not individually strong) affinity for these motifs. The majority of sequence motifs thought to be used in superrepression need not be individually maintained by natural selection. It is re-emphasized that the dispensability of sequences is not an indicator of their nonfunctionality and that in many cases, along noncoding sequences, nucleotides tend to fill functions collectively, rather than individually.
Affiliation(s)
- E Zuckerkandl
- Institute of Molecular Medical Sciences, Palo Alto, CA 94306, USA
|
17
|
von Sternberg R. The role of constrained self-organization in genome structural evolution. Acta Biotheor 1996; 44:95-118. [PMID: 9028019 DOI: 10.1007/bf00048418] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Indexed: 02/03/2023]
Abstract
A hypothesis of genome structural evolution is explored. Rapid and cohesive alterations in genome organization are viewed as resulting from the dynamic and constrained interactions of chromosomal subsystem components. A combination of macromolecular boundary conditions and DNA element involvement in far-from-equilibrium reactions is proposed to increase the complexity of genomic subsystems via the channelling of genome turnover; interactions between subsystems create higher-order subsystems expanding the phase space for further genetic evolution. The operation of generic constraints on structuration in genome evolution is suggested by i) universal, homoplasic features of chromosome organization and ii) the metastable nature of genome structures where lower-level flux is constrained by higher-order structures. Phenomena such as 'genomic shock', bursts of transposable element activity, concerted evolution, etc., are hypothesized to result from constrained systemic responses to endogenous/exogenous, micro/macro perturbations. The constraints operating on genome turnover are expected to increase with chromosomal structural complexity, the number of interacting subsystems, and the degree to which interactions between genomic components are tightly ordered.
Affiliation(s)
- R von Sternberg
- Center for Intelligent Systems, T.J. Watson School, State University of New York at Binghamton 13902, USA
|
18
|
Abstract
The Second International Workshop on Drosophila Heterochromatin, held in Honolulu from January 4-7, 1995, brought together about 70 scientists from the US, Canada, Germany, Italy, Russia, and the Netherlands. After the first of these international meetings, five years ago, Mary Lou Pardue and Wolfgang Hennig, in these columns, commented on its proceedings, and on heterochromatin in general. Although the questions that they raised cannot yet be answered exhaustively, important and sometimes surprising new observations have been made, some previously tentative answers have been firmed up, and some theoretical views underwent significant shifts. We wish to reflect here a few of the data presented at the second workshop, and express some thoughts suggested to us by these recent findings.
Affiliation(s)
- E Zuckerkandl
- Institute of Molecular Medical Sciences, 460 Page Mill Road, Palo Alto, CA 94306, USA
|
19
|
Abstract
The distribution of functions within genomes of higher organisms relative to processes that lead to the spread of mutations in populations is examined in its general outlines. A number of points are enumerated that collectively put in question the concept of junk DNA: the plausible compatibility of DNA function with rapid substitution rates; the likelihood of superimposed functions along much of eukaryotic DNA; the potential for a merely conditional functionality in sequence repeats; the apparent adoption of macromolecular waste as a strategy for maintaining a function without selective grooming of individual sequence repeats that carry out the function; the likely requirement that any DNA sequence must be "polite" vis-à-vis (compatible with) functional sequences in its genomic environment; the existence in germ-cell lineages of selective constraints that are not apparent in populations of individuals; and the fact that DNA techtonics - the appearance and disappearance of genomic DNA - are not incompatible with function. It is pointed out that the inverse correlation between functional constraints and rates of substitution cannot be claimed to be a pillar of the neutral theory, because it is also predicted from a selectionist viewpoint. The dispensability of functional structures is brought into relation with the concept of reproductive sufficiency, the survivability of genotypes in the absence of fitter alleles.
Affiliation(s)
- E Zuckerkandl
- Linus Pauling Institute of Science and Medicine, Palo Alto, CA 94306
|
20
|
|
21
|
von Sternberg RM, Novick GE, Gao GP, Herrera RJ. Genome canalization: the coevolution of transposable and interspersed repetitive elements with single copy DNA. Genetica 1992; 86:215-46. [PMID: 1334910 DOI: 10.1007/bf00133722] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.1] [Indexed: 12/26/2022]
Abstract
Transposable and interspersed repetitive elements (TIREs) are ubiquitous features of both prokaryotic and eukaryotic genomes. However, controversy has arisen as to whether these sequences represent useless 'selfish' DNA elements, with no cellular function, as opposed to useful genetic units. In this review, we selected two insect species, the Dipteran Drosophila and the Lepidopteran Bombyx mori (the silkmoth), in an attempt to resolve this debate. These two species were selected on the basis of the special interest that our laboratory has had over the years in Bombyx with its well known molecular and developmental biology, and the wealth of genetic data that exist for Drosophila. In addition, these two species represent contrasting repetitive element types and patterns of distribution. On one hand, Bombyx exhibits the short interspersion pattern in which Alu-like TIREs predominate while Drosophila possesses the long interspersion pattern in which retroviral-like TIREs are prevalent. In Bombyx, the main TIRE family is Bm-1 while the Drosophila group contains predominantly copia-like elements, non-LTR retroposons, bacterial-type retroposons and fold-back transposable elements sequences. Our analysis of the information revealed highly non-random patterns of both TIRE biology and evolution, more indicative of these sequences acting as genomic symbionts under cellular regulation rather than useless or selfish junk DNA. In addition, we extended our analysis of potential TIRE functionality to what is known from other eukaryotic systems. From this study, it became apparent that these DNA elements may have originated as innocuous or selfish sequences and then adopted functions. The mechanism for this conversion from non-functionality to specific roles is a process of coevolution between the repetitive element and other cellular DNA often times in close physical proximity. 
The resulting interdependence between repetitive elements and other cellular sequences restricts the number of evolutionarily successful mutational changes for a given function or cistron. This mutual limitation is what we call genome canalization. Well-documented examples are discussed to support this hypothesis and a mechanistic model is presented for how such genomic canalization can occur. Also proposed are empirical studies which would support or invalidate aspects of this hypothesis.
Affiliation(s)
- R M von Sternberg
- Department of Biological Sciences, Florida International University, Miami 33199
|
22
|
Martínez-Cruzado JC. Evolution of the autosomal chorion cluster in Drosophila. IV. The Hawaiian Drosophila: rapid protein evolution and constancy in the rate of DNA divergence. J Mol Evol 1990; 31:402-23. [PMID: 2124630 DOI: 10.1007/bf02106055] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.3] [Indexed: 12/30/2022]
Abstract
Autosomal chorion genes s18, s15, and s19 are shown to diverge at extremely rapid rates in closely related taxa of Hawaiian Drosophila. Their nucleotide divergence rates are at least as fast as those of intergenic regions that are known to evolve more extensively between distantly related species. Their amino acid divergence rates are the fastest known to date. There are two nucleotide replacement substitutions for every synonymous one. The molecular basis for observed length and substitution mutations is analyzed. Length mutations are strongly associated with direct repeats in general, and with tandem repeats in particular, whereas the rate for an average transition is twice that for an average transversion. The DNA sequence of the cluster was used to construct a phylogenetic tree for five taxa of the Hawaiian picture-winged species group of Drosophila. Assignment of observed base substitutions occurring in various branches of the tree reveals an excess of would-be homoplasies in a centrally localized 1.8-kb segment containing the s15 gene. This observation may be a reflection of ancestral excess polymorphisms in the segment. The chorion cluster appears to evolve at a constant rate regardless of whether the central 1.8-kb segment is included or not in the analysis. Assuming that the time of divergence of Drosophila grimshawi and the planitibia subgroup coincides with the emergence of the island of Kauai, the overall rate of base substitution in the cluster is estimated to be 0.8% per million years, whereas synonymous sites are substituted at a rate of 1.2% per million years.
Affiliation(s)
- J C Martínez-Cruzado
- Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts 02138
|
23
|
Oliver JL, Marín A, Martínez-Zapater JM. Chloroplast genes transferred to the nuclear plant genome have adjusted to nuclear base composition and codon usage. Nucleic Acids Res 1990; 18:65-73. [PMID: 2308837 PMCID: PMC330204 DOI: 10.1093/nar/18.1.65] [Citation(s) in RCA: 29] [Impact Index Per Article: 0.8] [Indexed: 12/31/2022] Open
Abstract
During plant evolution, some plastid genes have been moved to the nuclear genome. These transferred genes are now correctly expressed in the nucleus, their products being transported into the chloroplast. We compared the base compositions, the distributions of some dinucleotides and codon usages of transferred, nuclear and chloroplast genes in two dicot and two monocot plant species. Our results indicate that transferred genes have adjusted to nuclear base composition and codon usage, being now more similar to the nuclear genes than to the chloroplast ones in every species analyzed.
Affiliation(s)
- J L Oliver
- Unidad de Genética, Facultad de Ciencias, Universidad de Granada, Spain
|
24
|
Evolution of DNA Sequence Contributions of Mutational Bias and Selection to the Origin of Chromosomal Compartments. ACTA ACUST UNITED AC 1990. [DOI: 10.1007/978-3-642-75599-6_1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3]
|
25
|
Abstract
Nucleotide sequences carry genetic information of many different kinds, not just instructions for protein synthesis (triplet code). Several codes of nucleotide sequences are discussed including: (1) the translation framing code, responsible for correct triplet counting by the ribosome during protein synthesis; (2) the chromatin code, which provides instructions on appropriate placement of nucleosomes along the DNA molecules and their spatial arrangement; (3) a putative loop code for single-stranded RNA-protein interactions. The codes are degenerate and corresponding messages are not only interspersed but actually overlap, so that some nucleotides belong to several messages simultaneously. Tandemly repeated sequences frequently considered as functionless "junk" are found to be grouped into certain classes of repeat unit lengths. This indicates some functional involvement of these sequences. A hypothesis is formulated according to which the tandem repeats are given the role of weak enhancer-silencers that modulate, in a copy number-dependent way, the expression of proximal genes. Fast amplification and elimination of the repeats provides an attractive mechanism of species adaptation to a rapidly changing environment.
|
26
|
Abstract
Giemsa dark bands, G-bands, are a derived chromatin character that evolved along the chromosomes of early chordates. They are facultative heterochromatin reflecting acquisition of a late replication mechanism to repress tissue-specific genes. Subsequently, R-bands, the primitive chromatin state, became directionally GC rich as evidenced by Q-banding of mammalian and avian chromosomes. Contrary to predictions from the neutral mutation theory, noncoding DNA is positionally constrained along the banding pattern with short interspersed repeats in R-bands and long interspersed repeats in G-bands. Chromosomes seem dynamically stable: the banding pattern and gene arrangement along several human and murine autosomes has remained constant for 100 million years, whereas much of the noncoding DNA, especially retroposons, has changed. Several coding sequence attributes and probably mutation rates are determined more by where a gene lives than by what it does. R-band exons in homeotherms but not G-band exons have directionally acquired GC-rich wobble bases and the corresponding codon usage: CpG islands in mammals are specific to R-band exons, exons not facultatively heterochromatinized, and are independent of the tissue expression pattern of the gene. The dynamic organization of noncoding DNA suggests a feedback loop that could influence codon usage and stabilize the chromosome's chromatin pattern: DNA sequences determine affinities of → proteins that together form → a chromatin that modulates → rate constants for DNA modification that determine → DNA sequences. Theories of hierarchical selection and molecular ecology show how selection can act on Darwinian units of noncoding DNA at the genome level thus creating positionally constrained DNA and contributing minimal genetic load at the individual level.
Affiliation(s)
- G P Holmquist
- Beckman Research Institute of the City of Hope, Department of Biology, Duarte, California 91010
|
27
|
Bernardi G, Mouchiroud D, Gautier C, Bernardi G. Compositional patterns in vertebrate genomes: conservation and change in evolution. J Mol Evol 1988; 28:7-18. [PMID: 3148744 DOI: 10.1007/bf02143493] [Citation(s) in RCA: 107] [Impact Index Per Article: 2.9] [Indexed: 01/04/2023]
Abstract
The evolution of vertebrate genomes can be investigated by analyzing their regional compositional patterns, namely the compositional distributions of large DNA fragments (in the 30-100-kb size range), of coding sequences, and of their different codon positions. This approach has shown the existence of two evolutionary modes. In the conservative mode, compositional patterns are maintained over long times (many million years), in spite of the accumulation of enormous numbers of base substitutions. In the transitional, or shifting, mode, compositional patterns change into new ones over much shorter times. The conservation of compositional patterns, which has been investigated in mammalian genomes, appears to be due in part to some measure of compositional conservation in the base substitution process, and in part to negative selection acting at regional (isochore) levels in the genome and eliminating deviations from a narrow range of values, presumably corresponding to optimal functional properties. On the other hand, shifts of compositional patterns, such as those that occurred between cold-blooded and warm-blooded vertebrates, appear to be due essentially to both negative and positive selection again operating at the isochore level, largely under the influence of changes in environmental conditions, and possibly taking advantage of mutational biases in the replication/repair enzymes and/or in the enzyme make-up of nucleotide precursor pools. Other events (like translocations and changes in chromosomal structure) also play a role in the transitional mode of genome evolution. The present findings (1) indicate that isochores, which correspond to the DNA segments of individual or contiguous chromatin domains, represent selection units in the vertebrate genome; and (2) shed new light on the selectionist-neutralist controversy.
Affiliation(s)
- G Bernardi
- Laboratoire de Génétique Moléculaire, Institut Jacques Monod, Paris, France
|
28
|
Zuckerkandl E, Villet R. Generation of high specificity of effect through low-specificity binding of proteins to DNA. FEBS Lett 1988; 231:291-8. [PMID: 3360135 DOI: 10.1016/0014-5793(88)80836-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 0.8] [Indexed: 01/05/2023]
Abstract
It is proposed that proteins can bind with relatively low affinity and specificity to multiple sites, defined as sequence motifs, on polynucleotide chains, and that such binding can collectively be turned into high-affinity, high-specificity binding through cooperative effects, especially when the sequence motifs recur periodically. The selection of individual nucleotides has in general been thought to be the condition of the existence and conservation of function in most of the noncoding sequences. This condition seems unnecessary. Calculations are presented as a step in the direction of giving credibility to a model of stable gene repression.
Affiliation(s)
- E Zuckerkandl
- Linus Pauling Institute of Science and Medicine, Palo Alto, CA 94306
|
29
|
Abstract
The conceptual framework surrounding the origin of the molecular evolutionary clock and circumstances of this origin are described. In regard to the quest for the best available molecular clocks, a return to protein clocks is conditionally recommended. On the basis of recent data and certain considerations, it is pointed out that the realm of neutrality in evolution is probably less extensive than is now commonly thought, in the three distinct senses of the term neutrality--neutrality as nonfunctionality of mutations, neutrality as equifunctionality of mutations, and neutrality as a mode of fixation of mutations. The possibility is raised that complex sets of interacting components forming a system that is bounded with respect to its environment may quite generally display an intrinsic trend to a quasi-clockwise evolutionary behavior.
Affiliation(s)
- E Zuckerkandl
- Linus Pauling Institute of Science and Medicine, Palo Alto, California 94306
|
30
|
Abstract
Nucleotide sequences of all genomes are subject to compositional constraints that affect, to about the same extent, both coding and noncoding sequences; influence not only the structure and function of the genome, but also those of transcripts and proteins; are the result of environmental pressures; and largely control the fixation of mutations. These findings indicate that noncoding sequences are associated with biological functions; that the organismal phenotype comprises two components, the classical phenotype, corresponding to the "gene products," and a "genome phenotype," which is defined by the compositional constraints; and that natural selection plays a more important role in genome evolution than do random events.
|