1
|
Kyriacou RG, Mulhair PO, Holland PWH. GC Content Across Insect Genomes: Phylogenetic Patterns, Causes and Consequences. J Mol Evol 2024; 92:138-152. [PMID: 38491221 PMCID: PMC10978632 DOI: 10.1007/s00239-024-10160-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 02/06/2024] [Indexed: 03/18/2024]
Abstract
The proportions of A:T and G:C nucleotide pairs are often unequal and can vary greatly between animal species and along chromosomes. The causes and consequences of this variation are incompletely understood. The recent release of high-quality genome sequences from the Darwin Tree of Life and other large-scale genome projects provides an opportunity for GC heterogeneity to be compared across a large number of insect species. Here we analyse GC content along chromosomes, and within protein-coding genes and codons, of 150 insect species from four holometabolous orders: Coleoptera, Diptera, Hymenoptera, and Lepidoptera. We find that protein-coding sequences have higher GC content than the genome average, and that Lepidoptera generally have higher GC content than the other three insect orders examined. GC content is higher in small chromosomes in most Lepidoptera species, but this pattern is less consistent in other orders. GC content also increases towards subtelomeric regions within protein-coding genes in Diptera, Coleoptera and Lepidoptera. Two species of Diptera, Bombylius major and B. discolor, have very atypical genomes with ubiquitous increase in AT content, especially at third codon positions. Despite dramatic AT-biased codon usage, we find no evidence that this has driven divergent protein evolution. We argue that the GC landscape of Lepidoptera, Diptera and Coleoptera genomes is influenced by GC-biased gene conversion, strongest in Lepidoptera, with some outlier taxa affected drastically by counteracting processes.
Collapse
Affiliation(s)
- Riccardo G Kyriacou
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK
| | - Peter O Mulhair
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK
| | - Peter W H Holland
- Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK.
| |
Collapse
|
2
|
Auboeuf D. The Physics-Biology continuum challenges darwinism: Evolution is directed by the homeostasis-dependent bidirectional relation between genome and phenotype. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2021; 167:121-139. [PMID: 34097984 DOI: 10.1016/j.pbiomolbio.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 05/19/2021] [Accepted: 05/31/2021] [Indexed: 10/21/2022]
Abstract
The physics-biology continuum relies on the fact that life emerged from prebiotic molecules. Here, I argue that life emerged from the coupling between nucleic acid and protein synthesis during which proteins (or proto-phenotypes) maintained the physicochemical parameter equilibria (or proto-homeostasis) in the proximity of their encoding nucleic acids (or proto-genomes). This protected the proto-genome physicochemical integrity (i.e., atomic composition) from environmental physicochemical constraints, and therefore increased the probability of reproducing the proto-genome without variation. From there, genomes evolved depending on the biological activities they generated in response to environmental fluctuations. Thus, a genome maintaining homeostasis (i.e., internal physicochemical parameter equilibria), despite and in response to environmental fluctuations, maintains its physicochemical integrity and has therefore a higher probability to be reproduced without variation. Consequently, descendants have a higher probability to share the same phenotype than their parents. Otherwise, the genome is modified during replication as a consequence of the imbalance of the internal physicochemical parameters it generates, until new mutation-deriving biological activities maintain homeostasis in offspring. In summary, evolution depends on feedforward and feedback loops between genome and phenotype, as the internal physicochemical conditions that a genome generates ─ through its derived phenotype in response to environmental fluctuations ─ in turn either guarantee its stability or direct its variation. Evolution may not be explained by the Darwinism-derived, unidirectional principle (random mutations-phenotypes-natural selection) but rather by the bidirectional relationship between genome and phenotype, in which the phenotype in interaction with the environment directs the evolution of the genome it derives from.
Collapse
Affiliation(s)
- Didier Auboeuf
- ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée D'Italie, Site Jacques Monod, F-69007, Lyon, France.
| |
Collapse
|
3
|
Guiblet WM, Cremona MA, Harris RS, Chen D, Eckert KA, Chiaromonte F, Huang YF, Makova KD. Non-B DNA: a major contributor to small- and large-scale variation in nucleotide substitution frequencies across the genome. Nucleic Acids Res 2021; 49:1497-1516. [PMID: 33450015 PMCID: PMC7897504 DOI: 10.1093/nar/gkaa1269] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 12/14/2020] [Accepted: 01/11/2021] [Indexed: 12/12/2022] Open
Abstract
Approximately 13% of the human genome can fold into non-canonical (non-B) DNA structures (e.g. G-quadruplexes, Z-DNA, etc.), which have been implicated in vital cellular processes. Non-B DNA also hinders replication, increasing errors and facilitating mutagenesis, yet its contribution to genome-wide variation in mutation rates remains unexplored. Here, we conducted a comprehensive analysis of nucleotide substitution frequencies at non-B DNA loci within noncoding, non-repetitive genome regions, their ±2 kb flanking regions, and 1-Megabase windows, using human-orangutan divergence and human single-nucleotide polymorphisms. Functional data analysis at single-base resolution demonstrated that substitution frequencies are usually elevated at non-B DNA, with patterns specific to each non-B DNA type. Mirror, direct and inverted repeats have higher substitution frequencies in spacers than in repeat arms, whereas G-quadruplexes, particularly stable ones, have higher substitution frequencies in loops than in stems. Several non-B DNA types also affect substitution frequencies in their flanking regions. Finally, non-B DNA explains more variation than any other predictor in multiple regression models for diversity or divergence at 1-Megabase scale. Thus, non-B DNA substantially contributes to variation in substitution frequencies at small and large scales. Our results highlight the role of non-B DNA in germline mutagenesis with implications to evolution and genetic diseases.
Collapse
Affiliation(s)
- Wilfried M Guiblet
- Bioinformatics and Genomics Graduate Program, Penn State University, UniversityPark, PA 16802, USA
| | - Marzia A Cremona
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Department of Operations and Decision Systems, Université Laval, Canada
- CHU de Québec – Université Laval Research Center, Canada
| | - Robert S Harris
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Di Chen
- Intercollege Graduate Degree Program in Genetics, Huck Institutes of the Life Sciences, Penn State University, UniversityPark, PA 16802, USA
| | - Kristin A Eckert
- Department of Pathology, Penn State University, College of Medicine, Hershey, PA 17033, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| | - Francesca Chiaromonte
- Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
- EMbeDS, Sant’Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Yi-Fei Huang
- Department of Biology, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park and Hershey, PA, USA
| |
Collapse
|
4
|
Possible Physical Basis of Mirror Symmetry Effect in Racemic Mixtures of Enantiomers: From Wallach’s Rule, Nonlinear Effects, B–Z DNA Transition, and Similar Phenomena to Mirror Symmetry Effects of Chiral Objects. Symmetry (Basel) 2020. [DOI: 10.3390/sym12060889] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Effects associated with mirror symmetry may be underlying for a number of phenomena in chemistry and physics. Increase in the density and melting point of the 50%L/50%D collection of enantiomers of a different sign (Wallach’s rule) is probably based on a physical effect of the mirror image. The catalytic activity of metal complexes with racemic ligands differs from the corresponding complexes with enantiomers as well (nonlinear effect). A similar difference in the physical properties of enantiomers and racemate underlies L/D inversion points of linear helical macromolecules, helical nanocrystals of magnetite and boron nitride etc., B–Z DNA transition and phenomenon of mirror neurons may have a similar nature. Here we propose an explanation of the Wallach effect along with some similar chemical, physical, and biological phenomena related to mirror image.
Collapse
|
5
|
Auboeuf D. Physicochemical Foundations of Life that Direct Evolution: Chance and Natural Selection are not Evolutionary Driving Forces. Life (Basel) 2020; 10:life10020007. [PMID: 31973071 PMCID: PMC7175370 DOI: 10.3390/life10020007] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 01/15/2020] [Accepted: 01/16/2020] [Indexed: 12/11/2022] Open
Abstract
The current framework of evolutionary theory postulates that evolution relies on random mutations generating a diversity of phenotypes on which natural selection acts. This framework was established using a top-down approach as it originated from Darwinism, which is based on observations made of complex multicellular organisms and, then, modified to fit a DNA-centric view. In this article, it is argued that based on a bottom-up approach starting from the physicochemical properties of nucleic and amino acid polymers, we should reject the facts that (i) natural selection plays a dominant role in evolution and (ii) the probability of mutations is independent of the generated phenotype. It is shown that the adaptation of a phenotype to an environment does not correspond to organism fitness, but rather corresponds to maintaining the genome stability and integrity. In a stable environment, the phenotype maintains the stability of its originating genome and both (genome and phenotype) are reproduced identically. In an unstable environment (i.e., corresponding to variations in physicochemical parameters above a physiological range), the phenotype no longer maintains the stability of its originating genome, but instead influences its variations. Indeed, environment- and cellular-dependent physicochemical parameters define the probability of mutations in terms of frequency, nature, and location in a genome. Evolution is non-deterministic because it relies on probabilistic physicochemical rules, and evolution is driven by a bidirectional interplay between genome and phenotype in which the phenotype ensures the stability of its originating genome in a cellular and environmental physicochemical parameter-depending manner.
Collapse
Affiliation(s)
- Didier Auboeuf
- Laboratory of Biology and Modelling of the Cell, Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, 46 Allée d'Italie, Site Jacques Monod, F-69007, Lyon, France
| |
Collapse
|
6
|
Сhiral and Racemic Fields Concept for Understanding of the Homochirality Origin, Asymmetric Catalysis, Chiral Superstructure Formation from Achiral Molecules, and B-Z DNA Conformational Transition. Symmetry (Basel) 2019. [DOI: 10.3390/sym11050649] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The four most important and well-studied phenomena of mirror symmetry breaking of molecules were analyzed for the first time in terms of available common features and regularities. Mirror symmetry breaking of the primary origin of biological homochirality requires the involvement of an external chiral inductor (environmental chirality). All reviewed mirror symmetry breaking phenomena were considered from that standpoint. A concept of chiral and racemic fields was highly helpful in this analysis. A chiral gravitational field in combination with a static magnetic field (Earth’s environmental conditions) may be regarded as a hypothetical long-term chiral inductor. Experimental evidences suggest a possible effect of the environmental chiral inductor as a chiral trigger on the mirror symmetry breaking effect. Also, this effect explains a conformational transition of the right-handed double DNA helix to the left-handed double DNA helix (B-Z DNA transition) as possible DNA damage.
Collapse
|
7
|
Contributions of Zea mays subspecies mexicana haplotypes to modern maize. Nat Commun 2017; 8:1874. [PMID: 29187731 PMCID: PMC5707364 DOI: 10.1038/s41467-017-02063-5] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2017] [Accepted: 11/03/2017] [Indexed: 11/09/2022] Open
Abstract
Maize was domesticated from lowland teosinte (Zea mays ssp. parviglumis), but the contribution of highland teosinte (Zea mays ssp. mexicana, hereafter mexicana) to modern maize is not clear. Here, two genomes for Mo17 (a modern maize inbred) and mexicana are assembled using a meta-assembly strategy after sequencing of 10 lines derived from a maize-teosinte cross. Comparative analyses reveal a high level of diversity between Mo17, B73, and mexicana, including three Mb-size structural rearrangements. The maize spontaneous mutation rate is estimated to be 2.17 × 10-8 ~3.87 × 10-8 per site per generation with a nonrandom distribution across the genome. A higher deleterious mutation rate is observed in the pericentromeric regions, and might be caused by differences in recombination frequency. Over 10% of the maize genome shows evidence of introgression from the mexicana genome, suggesting that mexicana contributed to maize adaptation and improvement. Our data offer a rich resource for constructing the pan-genome of Zea mays and genetic improvement of modern maize varieties.
Collapse
|
8
|
Rosenbloom DIS, Allen B. Frequency-dependent selection can lead to evolution of high mutation rates. Am Nat 2014; 183:E131-53. [PMID: 24739203 DOI: 10.1086/675505] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Theoretical and experimental studies have shown that high mutation rates can be advantageous, especially in novel or fluctuating environments. Here we examine how frequency-dependent competition may lead to fluctuations in trait frequencies that exert upward selective pressure on mutation rates. We use a mathematical model to show that cyclical trait dynamics generated by "rock-paper-scissors" competition can cause the mutation rate in a population to converge to a high evolutionarily stable mutation rate, reflecting a trade-off between generating novelty and reproducing past success. Introducing recombination lowers the evolutionarily stable mutation rate but allows stable coexistence between mutation rates above and below the evolutionarily stable rate. Even considering strong mutational load and ignoring the costs of faithful replication, evolution favors positive mutation rates if the selective advantage of prevailing in competition exceeds the ratio of recombining to nonrecombining offspring. We discuss a number of genomic mechanisms that may meet our theoretical requirements for the adaptive evolution of mutation. Overall, our results suggest that local mutation rates may be higher on genes influencing cyclical competition and that global mutation rates in asexual species may be higher in populations subject to strong cyclical competition.
Collapse
Affiliation(s)
- Daniel I S Rosenbloom
- Program for Evolutionary Dynamics, Harvard University, Cambridge, Massachusetts 02138
| | | |
Collapse
|
9
|
H2A.Z nucleosome positioning has no impact on genetic variation in Drosophila genome. PLoS One 2013; 8:e58295. [PMID: 23472174 PMCID: PMC3589275 DOI: 10.1371/journal.pone.0058295] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 02/01/2013] [Indexed: 11/20/2022] Open
Abstract
Nucleosome occupancy results in complex sequence variation rate heterogeneity by either increasing mutation rate or inhibiting DNA repair in yeast, fish, and human. H2A.Z nucleosome is extensively involved in gene transcription activation and regulation. To test whether H2A.Z nucleosome has the similar impact on sequence variability in the Drosophila genome, we profiled the H2A.Z nucleosome occupancy and sequence variation rate at gene ends and splicing sites. Consistent with previous studies, H2A.Z nucleosome positioning helps to demarcate the borders of exons. Nucleosome occupancy is anticorrelated with sequence divergence rate in the regions flanking transcription start sites and splicing sites. However, there is no rate heterogeneity between the linker DNA and H2A.Z nucleosomal DNA regardless of nucleosome occupancy, fuzziness, positioning in promoter, coding, and intergenic regions, young or old genes. But the rate at intergenic nucleosomes and the flanking linker regions is higher than that at the genic counterparts. Further analyses found that the high sequence divergence rate in the promoter regions that are usually nucleosome depleted regions may be likely resulted from the high mutation rate in the enriched tandem repeats. Interestingly, within nucleosomes spanning splicing sites, sequence variability of nucleosomal DNA significantly increases from the end within exons to the other end protruding into introns. The relaxed functional constraint in introns contributes to the high rate of nucleosomal DNA residing in introns while the strict functional constraint in exons maintains the low rate of nucleosomal DNA residing in exons. Taken together, H2A.Z nucleosome occupancy has no effect on sequence variability of Drosophila genome, which is likely determined by local sequence composition and the concomitant selection pressure.
Collapse
|
10
|
Crow KD, Smith CD, Cheng JF, Wagner GP, Amemiya CT. An independent genome duplication inferred from Hox paralogs in the American paddlefish--a representative basal ray-finned fish and important comparative reference. Genome Biol Evol 2012; 4:937-53. [PMID: 22851613 PMCID: PMC3509897 DOI: 10.1093/gbe/evs067] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Vertebrates have experienced two rounds of whole-genome duplication (WGD) in the stem lineages of deep nodes within the group and a subsequent duplication event in the stem lineage of the teleosts—a highly diverse group of ray-finned fishes. Here, we present the first full Hox gene sequences for any member of the Acipenseriformes, the American paddlefish, and confirm that an independent WGD occurred in the paddlefish lineage, approximately 42 Ma based on sequences spanning the entire HoxA cluster and eight genes on the HoxD gene cluster. These clusters comprise different HOX loci and maintain conserved synteny relative to bichir, zebrafish, stickleback, and pufferfish, as well as human, mouse, and chick. We also provide a gene genealogy for the duplicated fzd8 gene in paddlefish and present evidence for the first Hox14 gene in any ray-finned fish. Taken together, these data demonstrate that the American paddlefish has an independently duplicated genome. Substitution patterns of the “alpha” paralogs on both the HoxA and HoxD gene clusters suggest transcriptional inactivation consistent with functional diploidization. Further, there are similarities in the pattern of sequence divergence among duplicated Hox genes in paddlefish and teleost lineages, even though they occurred independently approximately 200 Myr apart. We highlight implications on comparative analyses in the study of the “fin-limb transition” as well as gene and genome duplication in bony fishes, which includes all ray-finned fishes as well as the lobe-finned fishes and tetrapod vertebrates.
Collapse
Affiliation(s)
- Karen D Crow
- Department of Biology, San Francisco State University, CA, USA.
| | | | | | | | | |
Collapse
|
11
|
Abstract
It has been known for many years that the mutation rate varies across the genome. However, only with the advent of large genomic data sets is the full extent of this variation becoming apparent. The mutation rate varies over many different scales, from adjacent sites to whole chromosomes, with the strongest variation seen at the smallest scales. Some of these patterns have clear mechanistic bases, but much of the rate variation remains unexplained, and some of it is deeply perplexing. Variation in the mutation rate has important implications in evolutionary biology and underexplored implications for our understanding of hereditary disease and cancer.
Collapse
|
12
|
Brázda V, Laister RC, Jagelská EB, Arrowsmith C. Cruciform structures are a common DNA feature important for regulating biological processes. BMC Mol Biol 2011; 12:33. [PMID: 21816114 PMCID: PMC3176155 DOI: 10.1186/1471-2199-12-33] [Citation(s) in RCA: 177] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2011] [Accepted: 08/05/2011] [Indexed: 04/10/2023] Open
Abstract
DNA cruciforms play an important role in the regulation of natural processes involving DNA. These structures are formed by inverted repeats, and their stability is enhanced by DNA supercoiling. Cruciform structures are fundamentally important for a wide range of biological processes, including replication, regulation of gene expression, nucleosome structure and recombination. They also have been implicated in the evolution and development of diseases including cancer, Werner's syndrome and others. Cruciform structures are targets for many architectural and regulatory proteins, such as histones H1 and H5, topoisomerase IIβ, HMG proteins, HU, p53, the proto-oncogene protein DEK and others. A number of DNA-binding proteins, such as the HMGB-box family members, Rad54, BRCA1 protein, as well as PARP-1 polymerase, possess weak sequence specific DNA binding yet bind preferentially to cruciform structures. Some of these proteins are, in fact, capable of inducing the formation of cruciform structures upon DNA binding. In this article, we review the protein families that are involved in interacting with and regulating cruciform structures, including (a) the junction-resolving enzymes, (b) DNA repair proteins and transcription factors, (c) proteins involved in replication and (d) chromatin-associated proteins. The prevalence of cruciform structures and their roles in protein interactions, epigenetic regulation and the maintenance of cell homeostasis are also discussed.
Collapse
Affiliation(s)
- Václav Brázda
- Institute of Biophysics, Academy of Sciences of the Czech Republic, v,v,i,, Královopolská 135, Brno, 612 65, Czech Republic.
| | | | | | | |
Collapse
|
13
|
Ananda G, Chiaromonte F, Makova KD. A genome-wide view of mutation rate co-variation using multivariate analyses. Genome Biol 2011; 12:R27. [PMID: 21426544 PMCID: PMC3129677 DOI: 10.1186/gb-2011-12-3-r27] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2010] [Revised: 02/21/2011] [Accepted: 03/22/2011] [Indexed: 01/03/2023] Open
Abstract
Background While the abundance of available sequenced genomes has led to many studies of regional heterogeneity in mutation rates, the co-variation among rates of different mutation types remains largely unexplored, hindering a deeper understanding of mutagenesis and genome dynamics. Here, utilizing primate and rodent genomic alignments, we apply two multivariate analysis techniques (principal components and canonical correlations) to investigate the structure of rate co-variation for four mutation types and simultaneously explore the associations with multiple genomic features at different genomic scales and phylogenetic distances. Results We observe a consistent, largely linear co-variation among rates of nucleotide substitutions, small insertions and small deletions, with some non-linear associations detected among these rates on chromosome X and near autosomal telomeres. This co-variation appears to be shaped by a common set of genomic features, some previously investigated and some novel to this study (nuclear lamina binding sites, methylated non-CpG sites and nucleosome-free regions). Strong non-linear relationships are also detected among genomic features near the centromeres of large chromosomes. Microsatellite mutability co-varies with other mutation rates at finer scales, but not at 1 Mb, and shows varying degrees of association with genomic features at different scales. Conclusions Our results allow us to speculate about the role of different molecular mechanisms, such as replication, recombination, repair and local chromatin environment, in mutagenesis. The software tools developed for our analyses are available through Galaxy, an open-source genomics portal, to facilitate the use of multivariate techniques in future large-scale genomics studies.
Collapse
Affiliation(s)
- Guruprasad Ananda
- Center for Medical Genomics, Penn State University, University Park, PA 16802, USA
| | | | | |
Collapse
|
14
|
Tolstorukov MY, Volfovsky N, Stephens RM, Park PJ. Impact of chromatin structure on sequence variability in the human genome. Nat Struct Mol Biol 2011; 18:510-5. [PMID: 21399641 DOI: 10.1038/nsmb.2012] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2009] [Accepted: 12/10/2010] [Indexed: 02/02/2023]
Abstract
DNA sequence variations in individual genomes give rise to different phenotypes within the same species. One mechanism in this process is the alteration of chromatin structure due to sequence variation that influences gene regulation. We composed a high-confidence collection of human single-nucleotide polymorphisms and indels based on analysis of publicly available sequencing data and investigated whether the DNA loci associated with stable nucleosome positions are protected against mutations. We addressed how the sequence variation reflects the occupancy profiles of nucleosomes bearing different epigenetic modifications on genome scale. We found that indels are depleted around nucleosome positions of all considered types, whereas single-nucleotide polymorphisms are enriched around the positions of bulk nucleosomes but depleted around the positions of epigenetically modified nucleosomes. These findings indicate an increased level of conservation for the sequences associated with epigenetically modified nucleosomes, highlighting complex organization of the human chromatin.
Collapse
Affiliation(s)
- Michael Y Tolstorukov
- Center for Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | | | | | | |
Collapse
|
15
|
Ying H, Epps J, Williams R, Huttley G. Evidence that localized variation in primate sequence divergence arises from an influence of nucleosome placement on DNA repair. Mol Biol Evol 2010; 27:637-49. [PMID: 19843619 PMCID: PMC2822288 DOI: 10.1093/molbev/msp253] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Understanding the origins of localized substitution rate heterogeneity has important implications for identifying functional genomic sequences. Outside of gene regions, the origins of rate heterogeneity remain unclear. Experimental studies establish that chromatin compaction affects rates of both DNA lesion formation and repair. A functional association between chromatin status and 5-methyl-cytosine also exists. These suggest that both the total rate and the type of substitution will be affected by chromatin status. Regular positioning of nucleosomes, the building block of chromatin, further predicts that substitution rate and type should vary spatially in an oscillating manner. We addressed chromatin's influence on substitution rate and type in primates. Matched numbers of sites were sampled from Dnase I hypersensitive (DHS) and closed chromatin control flank (Flank). Likelihood ratio tests revealed significant excesses of total and of transition substitutions in Flank compared with matched DHS for both intergenic and intronic samples. An additional excess of CpG transitions was evident for the intergenic, but not intronic, regions. Fluctuation in substitution rate along approximately 1,800 primate promoters was measured using phylogenetic footprinting. Significant positive correlations were evident between the substitution rate and a nucleosome score from resting human T-cells, with up to approximately 50% of the variance in substitution rate accounted for. Using signal processing techniques, a dominant oscillation at approximately 200 bp was evident in both the substitution rate and the nucleosome score. Our results support a role for differential DNA repair rates between open and closed chromatin in the spatial distribution of rate heterogeneity.
Collapse
Affiliation(s)
- Hua Ying
- John Curtin School of Medical Research, Australian National University, Canberra, ACT, Australia
| | - Julian Epps
- School of Electrical Engineering and Telecommunications, The University of New South Wales, Sydney, New South Wales, Australia
| | - Rohan Williams
- John Curtin School of Medical Research, Australian National University, Canberra, ACT, Australia
| | - Gavin Huttley
- John Curtin School of Medical Research, Australian National University, Canberra, ACT, Australia
| |
Collapse
|
16
|
Chojnowski JL, Braun EL. Turtle isochore structure is intermediate between amphibians and other amniotes. Integr Comp Biol 2008; 48:454-62. [PMID: 21669806 DOI: 10.1093/icb/icn062] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Vertebrate genomes are comprised of isochores that are relatively long (>100 kb) regions with a relatively homogenous (either GC-rich or AT-rich) base composition and with rather sharp boundaries with neighboring isochores. Mammals and living archosaurs (birds and crocodilians) have heterogeneous genomes that include very GC-rich isochores. In sharp contrast, the genomes of amphibians and fishes are more homogeneous and they have a lower overall GC content. Because DNA with higher GC content is more thermostable, the elevated GC content of mammalian and archosaurian DNA has been hypothesized to be an adaptation to higher body temperatures. This hypothesis can be tested by examining structure of isochores across the reptilian clade, which includes the archosaurs, testudines (turtles), and lepidosaurs (lizards and snakes), because reptiles exhibit diverse body sizes, metabolic rates, and patterns of thermoregulation. This study focuses on a comparative analysis of a new set of expressed genes of the red-eared slider turtle and orthologs of the turtle genes in mammalian (human, mouse, dog, and opossum), archosaurian (chicken and alligator), and amphibian (western clawed frog) genomes. EST (expressed sequence tag) data from a turtle cDNA library enriched for genes that have specialized functions (developmental genes) revealed using the GC content of the third-codon-position to examine isochore structure requires careful consideration of the types of genes examined. The more highly expressed genes (e.g., housekeeping genes) are more likely to be GC-rich than are genes with specialized functions. However, the set of highly expressed turtle genes demonstrated that the turtle genome has a GC content that is intermediate between the GC-poor amphibians and the GC-rich mammals and archosaurs. There was a strong correlation between the GC content of all turtle genes and the GC content of other vertebrate genes, with the slope of the line describing this relationship also indicating that the isochore structure of turtles is intermediate between that of amphibians and other amniotes. These data are consistent with some thermal hypotheses of isochore evolution, but we believe that the credible set of models for isochore evolution still includes a variety of models. These data expand the amount of genomic data available from reptiles upon which future studies of reptilian genomics can build.
Collapse
Affiliation(s)
- Jena L Chojnowski
- Department of Zoology, University of Florida, 223 Bartram Hall, PO Box 118525, Gainesville, FL 32611, USA
| | | |
Collapse
|
17
|
Baer CF, Miyamoto MM, Denver DR. Mutation rate variation in multicellular eukaryotes: causes and consequences. Nat Rev Genet 2007; 8:619-31. [PMID: 17637734 DOI: 10.1038/nrg2158] [Citation(s) in RCA: 294] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A basic knowledge about mutation rates is central to our understanding of a myriad of evolutionary phenomena, including the maintenance of sex and rates of molecular evolution. Although there is substantial evidence that mutation rates vary among taxa, relatively little is known about the factors that underlie this variation at an empirical level, particularly in multicellular eukaryotes. Here we integrate several disparate lines of theoretical and empirical inquiry into a unified framework to guide future studies that are aimed at understanding why and how mutation rates evolve in multicellular species.
Collapse
Affiliation(s)
- Charles F Baer
- Department of Zoology, University of Florida, Gainesville, Florida 32611, USA.
| | | | | |
Collapse
|
18
|
Manova V, Gecheff K, Stoilov L. Efficient repair of bleomycin-induced double-strand breaks in barley ribosomal genes. Mutat Res 2006; 601:179-90. [PMID: 16930631 DOI: 10.1016/j.mrfmmm.2006.07.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2005] [Revised: 06/28/2006] [Accepted: 07/11/2006] [Indexed: 05/11/2023]
Abstract
Ability of barley ribosomal genes to cope with damage produced in vivo by the radiomimetic agent bleomycin was investigated. Repair kinetics of bleomycin-induced double-strand breaks in ribosomal and total genomic DNA was compared. Induction and repair of double-strand breaks in defined regions of the ribosomal genes was also analyzed. Preferential sensitivity of barley linker DNA towards bleomycin treatment in vivo was established. Relatively higher yield of initially induced double-strand breaks in genomic DNA in comparison to ribosomal DNA was also found. Fragments containing intergenic spacers of barley rRNA genes displayed higher sensitivity to bleomycin than the coding sequences. No heterogeneity in the repair of DSB between transcribed and non-transcribed regions of ribosomal genes was detected. Data indicate that DSB repair in barley rDNA, although more efficient than in genomic DNA, does not correlate with the activity of nucleolus organizer regions.
Collapse
Affiliation(s)
- Vasilissa Manova
- Department of Molecular Genetics, Institute of Genetics, Acad D Kostoff, BAS, Sofia, Bulgaria
| | | | | |
Collapse
|
19
|
Vieira A. Atypical mutational properties of human chromosome 21 suggested by comparative genome-scale analyses. Cytogenet Genome Res 2006; 115:30-4. [PMID: 16974081 DOI: 10.1159/000094798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2006] [Accepted: 02/03/2006] [Indexed: 11/19/2022] Open
Abstract
Mutation of genetic material is a necessary component of evolutionary change. There is evidence for both intragenome and intergenome heterogeneity in terms of mutation frequencies. Reported comparisons of DNA sequence differences between human and chimpanzee (Pan troglodytes) suggest that human chromosome 21 may exhibit mutational hypervariability relative to the other autosomes. In the present study, further evidence is provided for such hypervariability based on large-scale analyses of amino acid composition of (translated) human genes and pseudogenes. A comparison of the variation in the above cases (i.e., DNA sequence differences and amino acid composition differences) yields similar ratios (1.2-1.4) for chromosome 21 relative to the other autosomes, e.g., human chromosome 22 - an autosome that is more typical in this respect and is of similar size to 21. Human chromosome 21 is also presented in this study as being atypical in terms of reported associations between mutation rates and GC content or CpG dinucleotides. In terms of GC distribution patterns, a comparison of NT_011512 and NT_011520 contigs revealed a lower heterogeneity for human chromosome 21 relative to 22. Possible hypermutability of chromosome 21 is further discussed in the context of GC patterns, reported long interspersed nuclear element content (LINE1s), and the implications of these parameters for chromatin structure.
Collapse
Affiliation(s)
- A Vieira
- Endocrine and Metabolic Research Laboratory, Faculty of Applied Sciences, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|
20
|
Kejnovský E, Nejedlý K, Kypr J. Factors influencing resistance of UV-irradiated DNA to the restriction endonuclease cleavage. Int J Biol Macromol 2005; 34:213-22. [PMID: 15225994 DOI: 10.1016/j.ijbiomac.2004.04.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
DNA molecules of pUC19, pBR322 and PhiX174 were irradiated by various doses of UV light and the irradiated molecules were cleaved by about two dozen type II restrictases. The irradiation generally blocked the cleavage in a dose-dependent way. In accordance with previous studies, the (A + T)-richness and the (PyPy) dimer content of the restriction site belongs among the factors that on average, cause an increase in the resistance of UV damaged DNA to the restrictase cleavage. However, we observed strong effects of UV irradiation even with (G + C)-rich and (PyPy)-poor sites. In addition, sequences flanking the restriction site influenced the protection in some cases (e.g. HindIII), but not in others (e.g. SalI), whereas neoschizomer couples SmaI and AvaI, or SacI and Ecl136II, cleaved the UV-irradiated DNA similarly. Hence the intrastrand thymine dimers located in the recognition site are not the only photoproduct blocking the restrictases. UV irradiation of the A-form generally made the irradiated DNA less resistant to restrictase cleavage than irradiation in the B-form and in some cases, the A-form completely protected the UV-irradiated DNA against the damage recognized by the restrictases. The present results also demonstrate that the UV irradiation approach used to generate partial digests in genomic DNA studies, can be extended to the (G + C)-rich and (PyPy)-poor restriction sites. The present extensive and quantitative data can be used in genomic applications of UV damage probing by restrictases.
Collapse
Affiliation(s)
- Eduard Kejnovský
- Institute of Biophysics, Academy of Sciences of the Czech Republic, Královopolská 135, CZ-612 65 Brno, Czech Republic
| | | | | |
Collapse
|
21
|
Bardeleben C, Moore RL, Wayne RK. Isolation and Molecular Evolution of the Selenocysteine tRNA (Cf TRSP) and RNase P RNA (Cf RPPH1) Genes in the Dog Family, Canidae. Mol Biol Evol 2004; 22:347-59. [PMID: 15496554 DOI: 10.1093/molbev/msi022] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
In an effort to identify rapidly evolving nuclear sequences useful for phylogenetic analyses of closely related species, we isolated two genes transcribed by RNA polymerase III (pol III), the selenocysteine tRNA gene (TRSP) and an RNase P RNA (RPPH1) gene from the domestic dog (Canis familiaris). We focus on genes transcribed by pol III because their coding regions are small (generally 100-300 base pairs [bp]) and their essential promoter elements are located within a couple of hundred bps upstream of the coding region. Therefore, we predicted that regions flanking the coding region and outside of the promoter elements would be free of constraint and would evolve rapidly. We amplified TRSP from 23 canids and RPPH1 from 12 canids and analyzed the molecular evolution of these genes and their utility as phylogenetic markers for resolving relationships among species in Canidae. We compared the rate of evolution of the gene-flanking regions to other noncoding regions of nuclear DNA (introns) and to the mitochondrial encoded COII gene. Alignment of TRSP from 23 canids revealed that regions directly adjacent to the coding region display high sequence variability. We discuss this pattern in terms of functional mechanisms of transcription. Although the flanking regions evolve no faster than introns, both genes were found to be useful phylogenetic markers, in part, because of the synapomorphic indels found in the flanking regions. Gene trees generated from the TRSP and RPPH1 loci were generally in agreement with the published mtDNA phylogeny and are the first phylogeny of Canidae based on nuclear sequences.
Collapse
Affiliation(s)
- Carolyne Bardeleben
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, USA.
| | | | | |
Collapse
|
22
|
Abstract
We develop techniques to estimate the statistical significance of gap-free alignments between two genomic DNA sequences, using human-mouse alignments as an example. The sequences are assumed to be sufficiently similar that some but not all of the neutrally evolving regions (i.e., those under no evolutionary constraint) can be reliably aligned. Our goal is to model the situation in which the neutral rate of evolution, and hence the extent of the aligning intervals, varies across the genome. In some cases, this permits the weaker of two matches to be judged as less likely to have arisen by chance, provided it lies in a genomic interval with a high level of background divergence. We employ a hidden Markov model to capture variations in divergence rates and assign probability values to gap-free alignments using techniques of Dembo and Karlin, which are related to those used for the same purpose by BLAST. Our methods are illustrated in detail using a 1.49 Mb genomic region. Results obtained from the analysis of human chromosome 22 using these techniques are also provided.
Collapse
Affiliation(s)
- Jia Li
- Statistics Department, Penn State, University Park, PA 16802, USA.
| | | |
Collapse
|
23
|
Abstract
We studied the dependence of the rate of short deletions and insertions on their contexts using the data on mutations within coding exons at 19 human loci that cause mendelian diseases. We confirm that periodic sequences consisting of three to five or more nucleotides are mutagenic. Mutability of sequences with strongly biased nucleotide composition is also elevated, even when mutations within homonucleotide runs longer than three nucleotides are ignored. In contrast, no elevated mutation rates have been detected for imperfect direct or inverted repeats. Among known candidate contexts, the indel context GTAAGT and regions with purine-pyrimidine imbalance between the two DNA strands are mutagenic in our sample, and many others are not mutagenic. Data on mutation hot spots suggest two novel contexts that increase the deletion rate. Comprehensive analysis of mutability of all possible contexts of lengths four, six, and eight indicates a substantially elevated deletion rate within YYYTG and similar sequences, which is one of the two contexts revealed by the hot spots. Possible contexts that increase the insertion rate (AT(A/C)(A/C)GCC and TACCRC) and decrease deletion (TATCGC) or insertion (GCGG) rates have also been identified. Two-thirds of deletions remove a repeat, and over 80% of insertions create a repeat, i.e., they are duplications.
Collapse
Affiliation(s)
- Alexey S Kondrashov
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland
| | - Igor B Rogozin
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
24
|
Abstract
Barley nucleolus organizing regions (NORs) were previously found to behave as prominent aberration hot-spots after treatment with some restriction endonucleases. The ability of MspI for directed induction of double-strand breaks in barley ribosomal DNA was further analyzed. Ionizing radiation-produced strand breakage within the ribosomal gene clusters was also a subject of investigation. Reconstructed barley karyotypes T1586 and T35 with normal and increased expression of rRNA genes were utilized to evaluate the relationship between transcriptional activity and damage induction. Scanning densitometry of the hybridization profiles revealed that MspI is generating double-strand breaks in barley rDNA with efficiency being independent from the NOR activity. Damage induction observed after treatment with gamma-rays was also not influenced by the transcriptional status of the ribosomal genes. A tendency towards restoration of rDNA integrity after irradiation of both germinating and dry seeds was observed which is indicative for the efficient recovery of double-strand breaks in barley ribosomal DNA.
Collapse
Affiliation(s)
- Vasilissa I Manova
- Department of Molecular Genetics, Bulgarian Academy of Sciences, Institute of Genetics Acad. D. Kostoff, Sofia 1113, Bulgaria
| | | |
Collapse
|
25
|
Rogozin IB, Pavlov YI. Theoretical analysis of mutation hotspots and their DNA sequence context specificity. Mutat Res 2003; 544:65-85. [PMID: 12888108 DOI: 10.1016/s1383-5742(03)00032-2] [Citation(s) in RCA: 123] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Mutation frequencies vary significantly along nucleotide sequences such that mutations often concentrate at certain positions called hotspots. Mutation hotspots in DNA reflect intrinsic properties of the mutation process, such as sequence specificity, that manifests itself at the level of interaction between mutagens, DNA, and the action of the repair and replication machineries. The hotspots might also reflect structural and functional features of the respective DNA sequences. When mutations in a gene are identified using a particular experimental system, resulting hotspots could reflect the properties of the gene product and the mutant selection scheme. Analysis of the nucleotide sequence context of hotspots can provide information on the molecular mechanisms of mutagenesis. However, the determinants of mutation frequency and specificity are complex, and there are many analytical methods for their study. Here we review computational approaches for analyzing mutation spectra (distribution of mutations along the target genes) that include many mutable (detectable) positions. The following methods are reviewed: derivation of a consensus sequence, application of regression approaches to correlate nucleotide sequence features with mutation frequency, mutation hotspot prediction, analysis of oligonucleotide composition of regions containing mutations, pairwise comparison of mutation spectra, analysis of multiple spectra, and analysis of "context-free" characteristics. The advantages and pitfalls of these methods are discussed and illustrated by examples from the literature. The most reliable analyses were obtained when several methods were combined and information from theoretical analysis and experimental observations was considered simultaneously. Simple, robust approaches should be used with small samples of mutations, whereas combinations of simple and complex approaches may be required for large samples. We discuss several well-documented studies where analysis of mutation spectra has substantially contributed to the current understanding of molecular mechanisms of mutagenesis. The nucleotide sequence context of mutational hotspots is a fingerprint of interactions between DNA and DNA repair, replication, and modification enzymes, and the analysis of hotspot context provides evidence of such interactions.
Collapse
Affiliation(s)
- Igor B Rogozin
- Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk, Russia
| | | |
Collapse
|
26
|
von Sternberg R. On the roles of repetitive DNA elements in the context of a unified genomic-epigenetic system. Ann N Y Acad Sci 2002; 981:154-88. [PMID: 12547679 DOI: 10.1111/j.1749-6632.2002.tb04917.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Repetitive DNA sequences comprise a substantial portion of most eukaryotic and some prokaryotic chromosomes. Despite nearly forty years of research, the functions of various sequence families as a whole and their monomer units remain largely unknown. The inability to map specific functional roles onto many repetitive DNA elements (REs), coupled with the taxon-specificity of sequence families, have led many to speculate that these genomic components are "selfish" replicators generating genomic "junk." The purpose of this paper is to critically examine the selfishness, evolutionary effects, and functionality of REs. First, a brief overview of the range of ideas pertaining to RE function is presented. Second, the argument is presented that the selfish DNA "hypothesis" is actually a narrative scheme, that it serves to protect neo-Darwinian assumptions from criticism, and that this story is untestable and therefore not a hypothesis. Third, attempts to synthesize the selfish DNA concept with complex systems models of the genome and RE functionality are critiqued. Fourth, the supposed connection between RE-induced mutations and macroevolutionary events are stated to be at variance with empirical evidence and theoretical considerations. Hypotheses that base phylogenetic transitions in repetitive sequence changes thus remain speculative. Fifth and finally, the case is made for viewing REs as integrally functional components of chromosomes, genomes, and cells. It is argued throughout that a new conceptual framework is needed for understanding the roles of repetitive DNA in genomic/epigenetic systems, and that neo-Darwinian "narratives" have been the primary obstacle to elucidating the effects of these enigmatic components of chromosomes.
Collapse
Affiliation(s)
- Richard von Sternberg
- Department of Systematic Biology, NHB-163, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA.
| |
Collapse
|
27
|
Silva JC, Kondrashov AS. Patterns in spontaneous mutation revealed by human-baboon sequence comparison. Trends Genet 2002; 18:544-7. [PMID: 12414177 DOI: 10.1016/s0168-9525(02)02757-9] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
We have analyzed the alignment of a long homologous region of the human and baboon genomes (approximately 1.5 Mb). We show that the frequency of gaps between aligned segments decreases slowly with gap length, indicating that several successive nucleotides are often deleted or inserted in one event. By contrast, runs of consecutive mismatches decrease rapidly in frequency with increasing length, following an exponential distribution, indicating that nucleotides are mostly substituted one at a time. Nucleotide substitutions are clumped at the scales of <10 and 1000-10,000 nucleotides, but show almost no aggregation at the scales of <10-100 and over approximately 50,000 nucleotides. Apparently, two rather different factors make the substitution rate not exactly uniform along the DNA sequence. Comparison of regions of very similar genomes that are approximately selectively neutral makes it possible to study spontaneous mutation at a new level of resolution.
Collapse
Affiliation(s)
- Joana C Silva
- The Institute for Genomic Research, Medical Center Drive, Rockville, MD 20850, USA.
| | | |
Collapse
|
28
|
Castillo-Davis CI, Mekhedov SL, Hartl DL, Koonin EV, Kondrashov FA. Selection for short introns in highly expressed genes. Nat Genet 2002; 31:415-8. [PMID: 12134150 DOI: 10.1038/ng940] [Citation(s) in RCA: 374] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Transcription is a slow and expensive process: in eukaryotes, approximately 20 nucleotides can be transcribed per second at the expense of at least two ATP molecules per nucleotide. Thus, at least for highly expressed genes, transcription of long introns, which are particularly common in mammals, is costly. Using data on the expression of genes that encode proteins in Caenorhabditis elegans and Homo sapiens, we show that introns in highly expressed genes are substantially shorter than those in genes that are expressed at low levels. This difference is greater in humans, such that introns are, on average, 14 times shorter in highly expressed genes than in genes with low expression, whereas in C. elegans the difference in intron length is only twofold. In contrast, the density of introns in a gene does not strongly depend on the level of gene expression. Thus, natural selection appears to favor short introns in highly expressed genes to minimize the cost of transcription and other molecular processes, such as splicing.
Collapse
Affiliation(s)
- Cristian I Castillo-Davis
- Department of Organismic and Evolutionary Biology, Harvard University, 16 Divinity Avenue, Cambridge, Massachusetts 02138, USA
| | | | | | | | | |
Collapse
|
29
|
Abstract
Within-intron difference of correlation with base composition of the adjacent exons was studied in the genomes of 34 species. For this purpose, GC-percent was determined for segments of 50 bp in length taken at both intron margins and in the internal part of the intron. It was found that in certain genomes the coefficient of correlation with GC-percent of the adjacent exon was significantly higher for the intron margin than for the internal part of the intron (homeotherms, cereals). Only part of this difference can be explained by unequal probability of insertion of transposable elements. Those multicellular organisms which have a low or no within-intron difference in correlation with the adjacent exons (anamniotes, invertebrates, dicots) show a higher local compositional heterogeneity (a greater exon/intron contrast in the GC-content). These results are evidence against the mutational bias being a possible explanation for the compositional genome heterogeneity. Thus, in the genomes with a high global heterogeneity there seems to be a selective force for compliance of intron base composition with the adjacent exons. This force is stronger in those parts of the intron that are closer to exons. In addition, the previously found positive general correlation between the genome size and average intron length was confirmed with a much larger dataset. However, within separate phylogenetic groups this rule can be broken, as it occurs in the cereals (family Poaceae), where a negative correlation was found.
Collapse
Affiliation(s)
- A E Vinogradov
- Institute of Cytology, Russian Academy of Sciences, Tikhoretsky Avenue 4, 194064, St. Petersburg, Russia.
| |
Collapse
|
30
|
Abstract
One of the most striking features of mammalian chromosomes is the variation in G+C content that occurs over scales of hundreds of kilobases to megabases, the so-called 'isochore' structure of the human genome. This variation in base composition affects both coding and non-coding sequences and seems to reflect a fundamental level of genome organization. However, although we have known about isochores for over 25 years, we still have a poor understanding of why they exist. In this article, we review the current evidence for the three main hypotheses.
Collapse
Affiliation(s)
- A Eyre-Walker
- Centre for the Study of Evolution and School of Biological Sciences, University of Sussex, Brighton BN1 9QG, UK.
| | | |
Collapse
|
31
|
Nejedlý K, Kittner R, Kypr J. Genomic DNA regions whose complementary strands are prone to UV light-induced crosslinking. Arch Biochem Biophys 2001; 388:216-24. [PMID: 11368157 DOI: 10.1006/abbi.2001.2280] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We prepared an extensive set of DNA restriction fragments, irradiated them with UV light, and detected crosslinked complementary strands by electrophoresis in denaturing agarose gels. These experimental data were quantified by densitometry to determine tetranucleotide contributions to crosslinking. The tetranucleotide contributions were used to predict genomic maps of the crosslinking probability that permitted us to identify two strongly crosslinking genomic regions having 295 and 389 base pairs in length. The two sequences shared the (ATTTTATA).(TATAAAAT) octamer, which is a candidate for the hotspot of UV light-induced crosslinking between the complementary strands of DNA.
Collapse
Affiliation(s)
- K Nejedlý
- Institute of Biophysics of the Academy of Sciences of the Czech Republic, Brno
| | | | | |
Collapse
|
32
|
Nejedlý K, Kittner R, Pospísilová S, Kypr J. Crosslinking of the complementary strands of DNA by UV light: dependence on the oligonucleotide composition of the UV irradiated DNA. BIOCHIMICA ET BIOPHYSICA ACTA 2001; 1517:365-75. [PMID: 11342214 DOI: 10.1016/s0167-4781(00)00299-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
UV light crosslinks the complementary strands of DNA. The interstrand crosslinks may contribute to the biological and pathological effects that UV irradiation is known to bring about. Here alkaline agarose gel electrophoresis was used to assess the crosslinked fraction of 31 selected restriction fragments of six viral and plasmid DNA molecules exposed to UVC light irradiation. As many as 17 independent experiments were performed with the particular DNA fragments to get sufficiently precise data suitable for quantitative analyses. The data were used to determine how the crosslinked fraction depended on the dinucleotide, trinucleotide and tetranucleotide contents of the irradiated DNA fragments. This analysis demonstrated that DNA conformation and/or flexibility, rather than the local double helix thermostability, governed the phenomenon of crosslinking. For example, (GA).(TC) suppressed the crosslink formation in DNA more than any dinucleotide composed of only G and C. In addition, (CTAG).(CTAG) promoted crosslinking much more than any other tetranucleotide, including e.g. (TATA).(TATA), whereas the closely related (CATG).(CATG) belonged among the tetranucleotides that most suppressed the UV light induced crosslinks between the complementary strands of DNA. The present data reproduced crosslinking of the analyzed 31 restriction fragments with a correlation coefficient exceeding 0.90. This result will be useful to predict crosslinking along the whole human genome.
Collapse
Affiliation(s)
- K Nejedlý
- Institute of Biophysics of the Academy of Sciences of the Czech Republic, Královopolská 135, CZ-612 65, Brno, Czech Republic
| | | | | | | |
Collapse
|
33
|
Abstract
The study and comparison of mutation(al) spectra is an important problem in molecular biology, because these spectra often reflect on important features of mutations and their fixation. Such features include the interaction of DNA with various mutagens, the function of repair/replication enzymes, and properties of target proteins. It is known that mutability varies significantly along nucleotide sequences, such that mutations often concentrate at certain positions, called "hotspots," in a sequence. In this paper, we discuss in detail two approaches for mutation spectra analysis: the comparison of mutation spectra with a HG-PUBL program, (FTP: sunsite.unc.edu/pub/academic/biology/dna-mutations/hyperg) and hotspot prediction with the CLUSTERM program (www.itba.mi.cnr.it/webmutation; ftp.bionet.nsc.ru/pub/biology/dbms/clusterm.zip). Several other approaches for mutational spectra analysis, such as the analysis of a target protein structure, hotspot context revealing, multiple spectra comparisons, as well as a number of mutation databases are briefly described. Mutation spectra in the lacI gene of E. coli and the human p53 gene are used for illustration of various difficulties of such analysis.
Collapse
Affiliation(s)
- I Rogozin
- Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk, Russia.
| | | | | |
Collapse
|
34
|
Silva JM, Garcia JM, Dominguez G, Silva J, Rodriguez R, Portero JL, Corbacho C, Provencio M, España P, Bonilla F. DNA damage after chemotherapy correlates with tumor response and survival in small cell lung cancer patients. Mutat Res 2000; 456:65-71. [PMID: 11087897 DOI: 10.1016/s0027-5107(00)00113-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
To explore the induction of chemotherapy (CT) DNA damage and its correlation with tumor response and patient survival, we undertook the present study in 20 small cell lung cancer (SCLC) patients. All patients underwent the same treatment based on CT courses of carboplatin and etoposide. Blood samples were taken before and immediately after CT and every 12 weeks during follow-up. Nuclear DNA damage was determined through the variations in three mitochondrial pseudogene mutations in DNA of peripheral blood mononuclear cells. They were detected by mutation-specific PCR and assessed by a semiquantitative method. The relative level of mutation rose after chemotherapy in all cases. Among the 11 patients (55%) with higher relative levels of mutations, 9 (82%) of them achieved a complete response. In contrast, of the 9 patients (45%) with lower relative levels of mutations, only 2 (18%) achieved a complete response, displaying a statistically significant difference (P=0.02). The overall survival for patients with marked genomic damage was 18 months (range 10-24), and for patients with low degree of DNA damage, it was 12 months (range 5-15) (P=0.002). Genomic damage detected after chemotherapy treatment correlates positively with tumor response and patient survival.
Collapse
Affiliation(s)
- J M Silva
- Department of Medical Oncology, Clinica Puerta de Hierro, Madrid, Spain
| | | | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Abstract
The human genome is not a uniform structure but, instead, is a mosaic of bands. Some of these bands can be seen by the eye. Stained with Giemsa and viewed under the microscope each human chromosome has a prototypical pattern of light and dark bands (G and R bands respectively). Other bands are not so easily viewed. The human genome is, for example, a mosaic of isochores, blocks of DNA within which the proportion of the bases G and C at silent sites (introns, third positions in codons, intergene spacer) is fairly uniform. Recent work by Matassi and colleagues(1) has revealed what might be a new and unexpected banding pattern. They have found that the genes which are close together on the chromosome have similar rates of evolution. BioEssays 22:105-107, 2000.
Collapse
Affiliation(s)
- L D Hurst
- Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath BA2 7AY, UK
| | | |
Collapse
|
36
|
Abstract
The terrestrial pulmonate mollusks were found to have the significantly larger genomes than the aquatic pulmonates. Being shown in the independent phylogenetic branch, this phenomenon suggests that the previously observed genome enlargement in the vertebrate land pioneers (amphibians and lungfishes) was not casual. As in the vertebrates, the larger molluskan genomes are also more GC-rich.
Collapse
Affiliation(s)
- A E Vinogradov
- Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russia
| |
Collapse
|
37
|
Fullerton SM, Bond J, Schneider JA, Hamilton B, Harding RM, Boyce AJ, Clegg JB. Polymorphism and divergence in the beta-globin replication origin initiation region. Mol Biol Evol 2000; 17:179-88. [PMID: 10666717 DOI: 10.1093/oxfordjournals.molbev.a026230] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
DNA sequence polymorphism and divergence was examined in the vicinity of the human beta-globin gene cluster origin of replication initiation region (IR), a 1.3-kb genomic region located immediately 5' of the adult-expressed beta-globin gene. DNA sequence variation in the replication origin IR and 5 kb of flanking DNA was surveyed in samples drawn from two populations, one African (from the Gambia, West Africa) and the other European (from Oxford, England). In these samples, levels of nucleotide and length polymorphism in the IR were found to be more than two times as high as adjacent non-IR-associated regions (estimates of per-nucleotide heterozygosity were 0.30% and 0.12%, respectively). Most polymorphic positions identified in the origin IR fall within or just adjacent to a 52-bp alternating purine-pyrimidine ((RY)n) sequence repeat. Within- and between-populations divergence is highest in this portion of the IR, and interspecific divergence in the same region, determined by comparison with an orthologous sequence from the chimpanzee, is also pronounced. Higher levels of diversity in this subregion are not, however, primarily attributable to slippage-mediated repeat unit changes, as nucleotide substitution contributes disproportionately to allelic heterogeneity. An estimate of helical stability in the sequenced region suggests that the hypervariable (RY)n constitutes the major DNA unwinding element (DUE) of the replication origin IR, the location at which the DNA duplex first unwinds and new strand synthesis begins. These findings suggest that the beta-globin IR experiences a higher underlying rate of neutral mutation than do adjacent genomic regions and that enzyme fidelity associated with the initiation of DNA replication at this origin may be compromised. The significance of these findings for our understanding of eukaryotic replication origin biology is discussed.
Collapse
Affiliation(s)
- S M Fullerton
- Department of Biology, Pennsylvania State University, University Park 16802, USA.
| | | | | | | | | | | | | |
Collapse
|
38
|
Duret L, Mouchiroud D. Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. Mol Biol Evol 2000; 17:68-74. [PMID: 10666707 DOI: 10.1093/oxfordjournals.molbev.a026239] [Citation(s) in RCA: 390] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
To determine whether gene expression patterns affect mutation rates and/or selection intensity in mammalian genes, we studied the relationships between substitution rates and tissue distribution of gene expression. For this purpose, we analyzed 2,400 human/rodent and 834 mouse/rat orthologous genes, and we measured (using expressed sequence tag data) their expression patterns in 19 tissues from three development states. We show that substitution rates at nonsynonymous sites are strongly negatively correlated with tissue distribution breadth: almost threefold lower in ubiquitous than in tissue-specific genes. Nonsynonymous substitution rates also vary considerably according to the tissues: the average rate is twofold lower in brain-, muscle-, retina- and neuron-specific genes than in lymphocyte-, lung-, and liver-specific genes. Interestingly, 5' and 3' untranslated regions (UTRs) show exactly the same trend. These results demonstrate that the expression pattern is an essential factor in determining the selective pressure on functional sites in both coding and noncoding regions. Conversely, silent substitution rates do not vary with expression pattern, even in ubiquitously expressed genes. This latter result thus suggests that synonymous codon usage is not constrained by selection in mammals. Furthermore, this result also indicates that there is no reduction of mutation rates in genes expressed in the germ line, contrary to what had been hypothesized based on the fact that transcribed DNA is more efficiently repaired than nontranscribed DNA.
Collapse
Affiliation(s)
- L Duret
- Laboratoire de Biométrie, Génétique et Biologie des Populations, Université Claude Bernard, Villeurbanne, France.
| | | |
Collapse
|
39
|
Abstract
Ultraviolet radiation has provided an evolutionary challenge to life on Earth. Recent increases in surficial ultraviolet B fluxes have focused attention on the role of UV radiation in protistan ecology, cancer, and DNA damage. Exploiting this new wealth of data, I examine the possibility that ultraviolet radiation may have played a significant role in the evolution of the first eukaryotes, that is, protists. Protists probably arose well before the formation of a significant ozone shield, and thus were probably subjected to substantial ultraviolet A, ultraviolet B, and ultraviolet C fluxes early in their evolution. Evolution consists of the generation of heritable variations and the subsequent selection of these variants. Ultraviolet radiation has played a role both as a mutagen and as a selective agent. In its role as a mutagen, it may have been crucial in the origin of sex and as a driver of molecular evolution. As a selective agent, its influence has been broad. Discussed in this paper are the influence of ultraviolet radiation on biogeography, photosynthesis, and desiccation resistance.
Collapse
Affiliation(s)
- L J Rothschild
- Ecosystem Science and Technology Branch, NASA Ames Research Center, Moffett Field, California 94035-1000, USA.
| |
Collapse
|
40
|
Abstract
BACKGROUND Nucleotide substitution rates and G + C content vary considerably among mammalian genes. It has been proposed that the mammalian genome comprises a mosaic of regions - termed isochores - with differing G + C content. The regional variation in gene G + C content might therefore be a reflection of the isochore structure of chromosomes, but the factors influencing the variation of nucleotide substitution rate are still open to question. RESULTS To examine whether nucleotide substitution rates and gene G + C content are influenced by the chromosomal location of genes, we compared human and murid (mouse or rat) orthologues known to belong to one of the chromosomal (autosomal) segments conserved between these species. Multiple members of gene families were excluded from the dataset. Sets of neighbouring genes were defined as those lying within 1 centiMorgan (cM) of each other on the mouse genetic map. For both synonymous substitution rates and G + C content at silent sites, neighbouring genes were found to be significantly more similar to each other than sets of genes randomly drawn from the dataset. Moreover, we demonstrated that the regional similarities in G + C content (isochores) and synonymous substitution rate were independent of each other. CONCLUSIONS Our results provide the first substantial statistical evidence for the existence of a regional variation in the synonymous substitution rate within the mammalian genome, indicating that different chromosomal regions evolve at different rates. This regional phenomenon which shapes gene evolution could reflect the existence of 'evolutionary rate units' along the chromosome.
Collapse
Affiliation(s)
- G Matassi
- Institute of Genetics, University of Nottingham, Queens Medical Centre, Nottingham, NG7 2UH, UK.
| | | | | |
Collapse
|
41
|
Abstract
Selection of a pathway of differentiation in multipotent progenitor cells may depend on the amount of histone H1 or H1 zero relative to the core histones. With low levels of these linker histones, it is proposed that an evolutionarily more ancient cell differentiation occurs. Greater repetition of AT-rich regulatory motifs allows more frequent, hence earlier, transcription of genes, accounting for this type of cell differentiation. It is further proposed that a decrease of total cell protein accumulation leads to an increase of histone H1 or H1 zero relative to the core histones. It is suggested that these linker histones preferentially bind the more AT-rich regulatory sequences, thereby restricting the phylogenetically more ancient differentiation potency. This allows differentiation of an evolutionarily younger cell type.
Collapse
Affiliation(s)
- R A Flickinger
- Department of Biological Sciences, State University of New York, Buffalo, USA
| |
Collapse
|
42
|
Schumacher A, Lichtarge O, Schwartz S, Magnuson T. The murine Polycomb-group gene eed and its human orthologue: functional implications of evolutionary conservation. Genomics 1998; 54:79-88. [PMID: 9806832 DOI: 10.1006/geno.1998.5509] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Similar to Drosophila, murine Polycomb-group (PcG) genes regulate anterior-posterior patterning of segmented axial structures by transcriptional repression of homeotic gene expression. The murine PcG gene eed (embryonic ectoderm development) encodes a 441-amino-acid protein with five WD motifs which, except for the amino terminus, is highly homologous to Drosophila ESC (Extra Sex Combs). Here, sequence and expression analysis as well as chromosomal mapping of the human orthologue of eed is described. Absolute conservation of the human eed protein along with significant divergence at the nucleotide level reveals functional constraints operating on all residues. The human orthologue appears to be ubiquitously expressed and maps to chromsome 11q14.2-q22.3. Using the first WD motif of the beta-subunit of the bovine G protein as a structural reference, the predicted locations of two previously identified eed point mutations (A. Schumacher et al., 1996, Nature 383: 250-253) are also reported herein. The proline substitution (L196P) in the second WD motif of the l7Rn5(3354SB) null allele maps to the internal core of the inner end of the beta-propeller blade and is likely to disrupt protein folding. In contrast, the asparagine substitution (I193N) in the second WD motif of the hypomorphic l7Rn5(1989SB) allele maps onto the surface of the beta-propeller blade near the central cavity and may affect surface interactions without compromising propeller packing. These results illustrate the critical importance of all residues for eed function in mammals and support a model whereby the amino terminus might implement function(s) related to embryonic development in higher organisms.
Collapse
Affiliation(s)
- A Schumacher
- Department of Genetics, Case Western Reserve University, Cleveland, Ohio, 44106-4955, USA
| | | | | | | |
Collapse
|
43
|
PospíŠlová Š, Kypr J. UV Light-induced Duplex-to-duplex Crosslinking of DNA Molecules in Aqueous Ethanol Solutions. Photochem Photobiol 1998. [DOI: 10.1111/j.1751-1097.1998.tb05215.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
44
|
Señorale-Pose M, Rougeon F. The mouse Vcs2 gene is a composite structure which evolved by gene fusion and encodes five distinct salivary mRNA species. Gene 1997; 201:75-85. [PMID: 9409774 DOI: 10.1016/s0378-1119(97)00430-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Genes of the VCS (variable coding sequence) family are characterized by an extensive evolutionary divergence in the protein-coding sequence. The VCS family has been characterized by cDNA cloning from submandibular glands in the rat, mouse and humans. At the genomic level, the sequences of two members of this family are known in the rat Rattus norvegicus: the VCSA1 gene, encoding the prohormone-like polypeptide SMR1, and the VCSB1 gene, encoding a salivary Pro-rich polypeptide. No genomic data were available for the VCS genes of other species. To understand the evolution of the VCS gene family better, we have now sequenced 23 kilobases (kb) of the mouse Vcs2 gene. The Vcs2 sequence reveals numerous genomic reorganizations such as an inversion, insertions of short elements and an unusually high number of long interspersed repeated elements (LINEs), which make up 42% of this region. Interestingly, Vcs2 is composed of three different VCS-like regions. The first of these regions contains all the exons necessary to encode the previously described mouse submandibular gland polypeptide MSG2alpha. This region aligns with the entire genomic sequences of rat VCSA1 and VCSB1 genes. The two other regions align with fragments of these rat sequences. The three regions are arrayed in tandem and flanked by LINEs. In particular, the third region also contains exons that were found in mRNA species from the submandibular gland. In total, we have characterized five mRNAs from mouse submandibular glands which have in common their first exon, and are produced by alternative splicing. Vcs2 is thus a single gene that arose by the fusion of three genes (or pseudogenes) of the VCS multigene family.
Collapse
Affiliation(s)
- M Señorale-Pose
- Unité de Génétique et Biochimie du Développement, URA CNRS 1960, Département d'Immunologie, Institut Pasteur, Paris, France
| | | |
Collapse
|
45
|
Abstract
For over a century, actuaries and biologists working independently of each other have presented arguments for why total mortality needs to be partitioned into biologically meaningful subcomponents. These mortality partitions tended to overlook genetic diseases that are inherited because the partitions were motivated by a paradigm focused on aging. In this article, we combine and extend the concepts from these disciplines to develop a conceptual partitioning of total mortality into extrinsic and intrinsic causes of death. An extrinsic death is either caused or initiated by something that orginates outside the body of an individual, while an intrinsic death is either caused or initiated by processes that originate within the body. It is argued that extrinsic mortality has been a driving force in determining why we die when we do from intrinsic causes of death. This biologically motivated partitioning of mortality provides a useful perspective for researchers interested in comparative mortality analyses, the consequences of population aging, limits to human life expectancy, the progress made by the biomedical sciences against lethal diseases, and demographic models that predict the life expectancy of future populations.
Collapse
Affiliation(s)
- B A Carnes
- Center for Mechanistic Biology and Biotechnology, Argonne National Laboratory 60439-4833, Illinois, USA
| | | |
Collapse
|
46
|
Widlak P, Bykov VJ, Hemminki K, Rzeszowska-Wolny J. The non-random distribution of UV-induced photoproducts in the nuclear matrix and non-matrix DNA fractions. Cancer Lett 1996; 108:215-23. [PMID: 8973598 DOI: 10.1016/s0304-3835(96)04426-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The formation of UV-induced photoproducts in the chromatin fractions of human lymphocytes was studied by 32P-post-labeling. A higher level of DNA lesions was found in the matrix-attached DNA fraction as compared to non-matrix DNA of irradiated cells (about 150 and 110 adducts per 10(6) nucleotides, respectively, at a 500 J/m2 254 nm-UV dose). Formation of photoproducts in a MAR (matrix attached region) sequence from the mouse kappa immunoglobulin gene irradiated in vitro was examined as well. The MAR sequence showed a two-fold higher level of adducts as compared to non-MAR DNA. The effect of photoproducts on complex-formation between MAR DNA and proteins of the nuclear matrix was studied in vitro. The amount of UV-induced adducts was 1.5-fold higher in matrix-bound fraction as compared to non-fractionated DNA (and five-fold higher as compared to unbound fraction), which possibly resulted from preferential binding of lesion-containing DNA fragments to the nuclear matrix proteins.
Collapse
Affiliation(s)
- P Widlak
- Department of Tumor Biology, Institute of Oncology, Gliwice, Poland
| | | | | | | |
Collapse
|
47
|
Ogata H, Fujibuchi W, Kanehisa M. The size differences among mammalian introns are due to the accumulation of small deletions. FEBS Lett 1996; 390:99-103. [PMID: 8706839 DOI: 10.1016/0014-5793(96)00636-9] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
In order to investigate the molecular mechanisms that alter intron size, we conducted an extensive interspecies comparison of homologous introns among three mammalian groups: human, artiodactyls, and rodents. The size differences of introns were statistically significant among all three groups (longest intron was for human and shortest for rodents), and appear to be due to the accumulation of small deletions, according to the separate count of insertion and deletion frequencies. The distribution of intron size differences also has a shape similar to that for the distribution of insertion/deletion sizes found in pseudogenes. It is suggested that introns are selectively neutral to small-scale changes of the genome size, which inherently contain the bias of favoring short deletions against short insertions.
Collapse
Affiliation(s)
- H Ogata
- Institute for Chemical Research, Kyoto University, Japan
| | | | | |
Collapse
|
48
|
von Sternberg R. The role of constrained self-organization in genome structural evolution. Acta Biotheor 1996; 44:95-118. [PMID: 9028019 DOI: 10.1007/bf00048418] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
A hypothesis of genome structural evolution is explored. Rapid and cohesive alterations in genome organization are viewed as resulting from the dynamic and constrained interactions of chromosomal subsystem components. A combination of macromolecular boundary conditions and DNA element involvement in far-from-equilibrium reactions is proposed to increase the complexity of genomic subsystems via the channelling of genome turnover; interactions between subsystems create higher-order subsystems expanding the phase space for further genetic evolution. The operation of generic constraints on structuration in genome evolution is suggested by i) universal, homoplasic features of chromosome organization and ii) the metastable nature of genome structures where lower-level flux is constrained by higher-order structures. Phenomena such as 'genomic shock', bursts of transposable element activity, concerted evolution, etc., are hypothesized to result from constrained systemic responses to endogenous/exogenous, micro/macro perturbations. The constraints operating on genome turnover are expected to increase with chromosomal structural complexity, the number of interacting subsystems, and the degree to which interactions between genomic components are tightly ordered.
Collapse
Affiliation(s)
- R von Sternberg
- Center for Intelligent Systems, T.J. Watson School, State University of New York at Binghamton 13902, USA
| |
Collapse
|
49
|
Boulikas T. Chromatin domains and prediction of MAR sequences. INTERNATIONAL REVIEW OF CYTOLOGY 1996; 162A:279-388. [PMID: 8575883 DOI: 10.1016/s0074-7696(08)61234-6] [Citation(s) in RCA: 90] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Polynuceosomes are constrained into loops or domains and are insulated from the effects of chromatin structure and torsional strain from flanking domains by the cross-complexation of matrix-attached regions (MARs) and matrix proteins. MARs or SARs have an average size of 500 bp, are spaced about every 30 kb, and are control elements maintaining independent realms of gene activity. A fraction of MARs may cohabit with core origin replication (ORIs) and another fraction might cohabit with transcriptional enhancers. DNA replication, transcription, repair, splicing, and recombination seem to take place on the nuclear matrix. Classical AT-rich MARs have been proposed to anchor the core enhancers and core origins complexed with low abundancy transcription factors to the nuclear matrix via the cooperative binding to MARs of abundant classical matrix proteins (topoisomerase II, histone H1, lamins, SP120, ARBP, SATB1); this creates a unique nuclear microenvironment rich in regulatory proteins able to sustain transcription, replication, repair, and recombination. Theoretical searches and experimental data strongly support a model of activation of MARs and ORIs by transcription factors. A set of 21 characteristics are deduced or proposed for MAR/ORI sequences including their enrichment in inverted repeats, AT tracts, DNA unwinding elements, replication initiator protein sites, homooligonucleotide repeats (i.e., AAA, TTT, CCC), curved DNA, DNase I-hypersensitive sites, nucleosome-free stretches, polypurine stretches, and motifs with a potential for left-handed and triplex structures. We are establishing Banks of ORI and MAR sequences and have undertaken a large project of sequencing a large number of MARs in an effort to determine classes of DNA sequences in these regulatory elements and to understand their role at the origins of replication and transcriptional enhancers.
Collapse
Affiliation(s)
- T Boulikas
- Institute of Molecular Medical Sciences, Palo Alto, California 94306, USA
| |
Collapse
|
50
|
Epp TA, Wang R, Sole MJ, Liew CC. Concerted evolution of mammalian cardiac myosin heavy chain genes. J Mol Evol 1995. [DOI: 10.1007/bf01215175] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
|