1
|
Abstract
The p-arms of the five human acrocentric chromosomes bear nucleolar organizer regions (NORs) comprising ribosomal gene (rDNA) repeats that are organized in a homogeneous tandem array and transcribed in a telomere-to-centromere direction. Precursor ribosomal RNA transcripts are processed and assembled into ribosomal subunits, the nucleolus being the physical manifestation of this process. I review current understanding of nucleolar chromosome biology and describe current exploration into a role for the NOR chromosomal context. Full DNA sequences for acrocentric p-arms are now emerging, aided by the current revolution in long-read sequencing and genome assembly. Acrocentric p-arms vary from 10.1 to 16.7 Mb, accounting for ∼2.2% of the genome. Bordering rDNA arrays, distal junctions, and proximal junctions are shared among the p-arms, with distal junctions showing evidence of functionality. The remaining p-arm sequences comprise multiple satellite DNA classes and segmental duplications that facilitate recombination between heterologous chromosomes, which is likely also involved in Robertsonian translocations.
Collapse
Affiliation(s)
- Brian McStay
- Centre for Chromosome Biology, College of Science and Engineering, University of Galway, Galway, Ireland;
| |
Collapse
|
2
|
Guarracino A, Buonaiuto S, de Lima LG, Potapova T, Rhie A, Koren S, Rubinstein B, Fischer C, Gerton JL, Phillippy AM, Colonna V, Garrison E. Recombination between heterologous human acrocentric chromosomes. Nature 2023; 617:335-343. [PMID: 37165241 PMCID: PMC10172130 DOI: 10.1038/s41586-023-05976-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 03/17/2023] [Indexed: 05/12/2023]
Abstract
The short arms of the human acrocentric chromosomes 13, 14, 15, 21 and 22 (SAACs) share large homologous regions, including ribosomal DNA repeats and extended segmental duplications1,2. Although the resolution of these regions in the first complete assembly of a human genome-the Telomere-to-Telomere Consortium's CHM13 assembly (T2T-CHM13)-provided a model of their homology3, it remained unclear whether these patterns were ancestral or maintained by ongoing recombination exchange. Here we show that acrocentric chromosomes contain pseudo-homologous regions (PHRs) indicative of recombination between non-homologous sequences. Utilizing an all-to-all comparison of the human pangenome from the Human Pangenome Reference Consortium4 (HPRC), we find that contigs from all of the SAACs form a community. A variation graph5 constructed from centromere-spanning acrocentric contigs indicates the presence of regions in which most contigs appear nearly identical between heterologous acrocentric chromosomes in T2T-CHM13. Except on chromosome 15, we observe faster decay of linkage disequilibrium in the pseudo-homologous regions than in the corresponding short and long arms, indicating higher rates of recombination6,7. The pseudo-homologous regions include sequences that have previously been shown to lie at the breakpoint of Robertsonian translocations8, and their arrangement is compatible with crossover in inverted duplications on chromosomes 13, 14 and 21. The ubiquity of signals of recombination between heterologous acrocentric chromosomes seen in the HPRC draft pangenome suggests that these shared sequences form the basis for recurrent Robertsonian translocations, providing sequence and population-based confirmation of hypotheses first developed from cytogenetic studies 50 years ago9.
Collapse
Affiliation(s)
- Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Genomics Research Centre, Human Technopole, Milan, Italy
| | - Silvia Buonaiuto
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
| | | | - Tamara Potapova
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Christian Fischer
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | | | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Vincenza Colonna
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA.
| |
Collapse
|
3
|
Somatic Hypomethylation of Pericentromeric SST1 Repeats and Tetraploidization in Human Colorectal Cancer Cells. Cancers (Basel) 2021; 13:cancers13215353. [PMID: 34771515 PMCID: PMC8582499 DOI: 10.3390/cancers13215353] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 10/06/2021] [Accepted: 10/21/2021] [Indexed: 01/08/2023] Open
Abstract
Somatic DNA hypomethylation and aneuploidy are hallmarks of cancer, and there is evidence for a causal relationship between them in knockout mice but not in human cancer. The non-mobile pericentromeric repetitive elements SST1 are hypomethylated in about 17% of human colorectal cancers (CRC) with some 5-7% exhibiting strong age-independent demethylation. We studied the frequency of genome doubling, a common event in solid tumors linked to aneuploidy, in randomly selected single cell clones of near-diploid LS174T human CRC cells differing in their level of SST1 demethylation. Near-diploid LS174T cells underwent frequent genome-doubling events generating near-tetraploid clones with lower levels of SST1 methylation. In primary CRC, strong SST1 hypomethylation was significantly associated with global genomic hypomethylation and mutations in TP53. This work uncovers the association of the naturally occurring demethylation of the SST1 pericentromeric repeat with the onset of spontaneous tetraploidization in human CRC cells in culture and with TP53 mutations in primary CRCs. Altogether, our findings provide further support for an oncogenic pathway linking somatic hypomethylation and genetic copy number alterations in a subset of human CRC.
Collapse
|
4
|
Tremblay DC, Alexander G, Moseley S, Chadwick BP. Expression, tandem repeat copy number variation and stability of four macrosatellite arrays in the human genome. BMC Genomics 2010; 11:632. [PMID: 21078170 PMCID: PMC3018141 DOI: 10.1186/1471-2164-11-632] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2010] [Accepted: 11/15/2010] [Indexed: 11/13/2022] Open
Abstract
Background Macrosatellites are some of the largest variable number tandem repeats in the human genome, but what role these unusual sequences perform is unknown. Their importance to human health is clearly demonstrated by the 4q35 macrosatellite D4Z4 that is associated with the onset of the muscle degenerative disease facioscapulohumeral muscular dystrophy. Nevertheless, many other macrosatellite arrays in the human genome remain poorly characterized. Results Here we describe the organization, tandem repeat copy number variation, transmission stability and expression of four macrosatellite arrays in the human genome: the TAF11-Like array located on chromosomes 5p15.1, the SST1 arrays on 4q28.3 and 19q13.12, the PRR20 array located on chromosome 13q21.1, and the ZAV array at 9q32. All are polymorphic macrosatellite arrays that at least for TAF11-Like and SST1 show evidence of meiotic instability. With the exception of the SST1 array that is ubiquitously expressed, all are expressed at high levels in the testis and to a lesser extent in the brain. Conclusions Our results extend the number of characterized macrosatellite arrays in the human genome and provide the foundation for formulation of hypotheses to begin assessing their functional role in the human genome.
Collapse
Affiliation(s)
- Deanna C Tremblay
- Department of Biological Sciences, Florida State University, King Life Science Building, Tallahassee, FL 32306-4295, USA
| | | | | | | |
Collapse
|
5
|
Cohen S, Agmon N, Sobol O, Segal D. Extrachromosomal circles of satellite repeats and 5S ribosomal DNA in human cells. Mob DNA 2010; 1:11. [PMID: 20226008 PMCID: PMC3225859 DOI: 10.1186/1759-8753-1-11] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2009] [Accepted: 03/08/2010] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Extrachomosomal circular DNA (eccDNA) is ubiquitous in eukaryotic organisms and was detected in every organism tested, including in humans. A two-dimensional gel electrophoresis facilitates the detection of eccDNA in preparations of genomic DNA. Using this technique we have previously demonstrated that most of eccDNA consists of exact multiples of chromosomal tandemly repeated DNA, including both coding genes and satellite DNA. RESULTS Here we report the occurrence of eccDNA in every tested human cell line. It has heterogeneous mass ranging from less than 2 kb to over 20 kb. We describe eccDNA homologous to human alpha satellite and the SstI mega satellite. Moreover, we show, for the first time, circular multimers of the human 5S ribosomal DNA (rDNA), similar to previous findings in Drosophila and plants. We further demonstrate structures that correspond to intermediates of rolling circle replication, which emerge from the circular multimers of 5S rDNA and SstI satellite. CONCLUSIONS These findings, and previous reports, support the general notion that every chromosomal tandem repeat is prone to generate eccDNA in eukryoric organisms including humans. They suggest the possible involvement of eccDNA in the length variability observed in arrays of tandem repeats. The implications of eccDNA on genome biology may include mechanisms of centromere evolution, concerted evolution and homogenization of tandem repeats and genomic plasticity.
Collapse
Affiliation(s)
- Sarit Cohen
- Department of Molecular Microbiology & Biotechnology Tel-Aviv University, Tel-Aviv 69978, Israel.
| | | | | | | |
Collapse
|
6
|
Illingworth RS, Bird AP. CpG islands--'a rough guide'. FEBS Lett 2009; 583:1713-20. [PMID: 19376112 DOI: 10.1016/j.febslet.2009.04.012] [Citation(s) in RCA: 571] [Impact Index Per Article: 38.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2009] [Revised: 04/04/2009] [Accepted: 04/06/2009] [Indexed: 02/07/2023]
Abstract
Mammalian genomes are punctuated by DNA sequences containing an atypically high frequency of CpG sites termed CpG islands (CGIs). CGIs generally lack DNA methylation and associate with the majority of annotated gene promoters. Many studies, however, have identified examples of CGI methylation in malignant cells, leading to improper gene silencing. CGI methylation also occurs in normal tissues and is known to function in X-inactivation and genomic imprinting. More recently, differential methylation has been shown between tissues, suggesting a potential role in transcriptional regulation during cell specification. Many of these tissue-specific methylated CGIs localise to regions distal to promoters, the regulatory function of which remains to be determined.
Collapse
Affiliation(s)
- Robert S Illingworth
- Wellcome Trust Centre for Cell Biology, Michael Swann Building, University of Edinburgh, Mayfield Road, Edinburgh EH9 3JR, United Kingdom.
| | | |
Collapse
|
7
|
Warburton PE, Hasson D, Guillem F, Lescale C, Jin X, Abrusan G. Analysis of the largest tandemly repeated DNA families in the human genome. BMC Genomics 2008; 9:533. [PMID: 18992157 PMCID: PMC2588610 DOI: 10.1186/1471-2164-9-533] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2008] [Accepted: 11/07/2008] [Indexed: 01/26/2023] Open
Abstract
Background Tandemly Repeated DNA represents a large portion of the human genome, and accounts for a significant amount of copy number variation. Here we present a genome wide analysis of the largest tandem repeats found in the human genome sequence. Results Using Tandem Repeats Finder (TRF), tandem repeat arrays greater than 10 kb in total size were identified, and classified into simple sequence e.g. GAATG, classical satellites e.g. alpha satellite DNA, and locus specific VNTR arrays. Analysis of these large sequenced regions revealed that several "simple sequence" arrays actually showed complex domain and/or higher order repeat organization. Using additional methods, we further identified a total of 96 additional arrays with tandem repeat units greater than 2 kb (the detection limit of TRF), 53 of which contained genes or repeated exons. The overall size of an array of tandem 12 kb repeats which spanned a gap on chromosome 8 was found to be 600 kb to 1.7 Mbp in size, representing one of the largest non-centromeric arrays characterized. Several novel megasatellite tandem DNA families were observed that are characterized by repeating patterns of interspersed transposable elements that have expanded presumably by unequal crossing over. One of these families is found on 11 different chromosomes in >25 arrays, and represents one of the largest most widespread megasatellite DNA families. Conclusion This study represents the most comprehensive genome wide analysis of large tandem repeats in the human genome, and will serve as an important resource towards understanding the organization and copy number variation of these complex DNA families.
Collapse
Affiliation(s)
- Peter E Warburton
- Deptartment of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, NY 10029, USA.
| | | | | | | | | | | |
Collapse
|
8
|
Abstract
Cancer-associated DNA hypomethylation is as prevalent as cancer-linked hypermethylation, but these two types of epigenetic abnormalities usually seem to affect different DNA sequences. Much more of the genome is generally subject to undermethylation rather than overmethylation. Genomic hypermethylation in cancer has been observed most often in CpG islands in gene regions. In contrast, very frequent hypomethylation is seen in both highly and moderately repeated DNA sequences in cancer, including heterochromatic DNA repeats, dispersed retrotransposons, and endogenous retroviral elements. Also, unique sequences, including transcription control sequences, are often subject to cancer-associated undermethylation. The high frequency of cancer-linked DNA hypomethylation, the nature of the affected sequences, and the absence of associations with DNA hypermethylation are consistent with an independent role for DNA undermethylation in cancer formation or tumor progression. Increased karyotypic instability and activation of tumor-promoting genes by cis or trans effects, that might include altered heterochromatin-euchromatin interactions, may be important consequences of DNA hypomethylation which favor oncogenesis. The relationship of DNA hypomethylation to tumorigenesis is important to be considered in the light of cancer therapies involving decreasing DNA methylation. Inducing DNA hypomethylation may have short-term anticancer effects, but might also help speed tumor progression from cancer cells surviving the DNA demethylation chemotherapy.
Collapse
Affiliation(s)
- Melanie Ehrlich
- Human Genetics Program/SL31, Department of Biochemistry, Tulane Medical School, New Orleans, Louisiana, LA 70122, USA.
| |
Collapse
|
9
|
Nagai H, Kim YS, Yasuda T, Ohmachi Y, Yokouchi H, Monden M, Emi M, Konishi N, Nogami M, Okumura K, Matsubara K. A novel sperm-specific hypomethylation sequence is a demethylation hotspot in human hepatocellular carcinomas. Gene 1999; 237:15-20. [PMID: 10524231 DOI: 10.1016/s0378-1119(99)00322-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Certain human DNA regions are strikingly undermethylated at CpG sites in sperm compared to adult somatic tissues. These sperm-specific hypomethylation sequences are thought to function early in embryogenesis or gametogenesis. By using the restriction landmark genomic scanning (RLGS) cloning method, we have isolated a novel sperm-specific hypomethylation sequence, the status of which changes during spermatogenesis, embryonal growth and differentiation. This sequence is a part of a new 'NotI repeat' consisting of a 1.4 kb repetitive unit sequence named DE-1. The sequence is GC-rich and has high homology to a CpG DNA clone that was isolated by a methyl CpG protein binding column, indicating that it was normally highly methylated. We investigated the methylation status of this sequence. In the normal genome the sequence was methylated, but in the human hepatocellular carcinoma (HCC) genome, the target sequence was demethylated at the cytosine residue of the CpG dinucleotides with high frequency (75% in the previous study). These data suggest that this regional DNA hypomethylation may play a role in both cell differentiation and hepatocarcinogenesis.
Collapse
Affiliation(s)
- H Nagai
- Institute for Molecular and Cellular Biology, Osaka University, Japan
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Gondo Y, Okada T, Matsuyama N, Saitoh Y, Yanagisawa Y, Ikeda JE. Human megasatellite DNA RS447: copy-number polymorphisms and interspecies conservation. Genomics 1998; 54:39-49. [PMID: 9806828 DOI: 10.1006/geno.1998.5545] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
We previously isolated a novel 4.7-kb RS447 sequence, which tandemly repeated approximately 50-70 copies and resided on human chromosome 4p15 (M. Kogi et al., 1997, Genomics 42: 278-283). Another tandem array (or arrays) of several RS447 copies was hereby identified on the distal part of chromosome 8p. To analyze copy-number polymorphisms of the RS447 repeats, genomic DNA samples of eight nonkindred Japanese were subjected to pulsed-field gel electrophoresis. The copy numbers of the RS447 tandem arrays on 4p15 varied drastically from allele to allele and ranged from approximately 34 to 94 copies. All eight Japanese subjects were apparently heterozygous for the RS447 copy number, and 12 copy-number-different alleles have been at least clearly distinguished. The RS447 tandem repeats were thus found to be hypervariable and highly polymorphic in a human population. The RS447 sequences, however, do not appear to be either "selfish" or "junk" DNA. The unit size and sequence of RS447 were found to be very similar between members in the human genome. The unit size of 4746 bp comprises a putative open reading frame of 1590 bp. The RS447 sequence was well conserved in all the tested mammalian species. The head-to-tail tandem repetitive structure in the RS447 homologs was also confirmed in those species. The RS447 sequence is, therefore, considered to consist of a new class of tandemly repeated satellite DNA elements in the mammalian genome, which may thus be called "megasatellite DNA."
Collapse
Affiliation(s)
- Y Gondo
- The Institute of Medical Sciences, Tokai University, Isehara 259-1193, Japan
| | | | | | | | | | | |
Collapse
|
11
|
Thoraval D, Asakawa J, Wimmer K, Kuick R, Lamb B, Richardson B, Ambros P, Glover T, Hanash S. Demethylation of repetitive DNA sequences in neuroblastoma. Genes Chromosomes Cancer 1996; 17:234-44. [PMID: 8946205 DOI: 10.1002/(sici)1098-2264(199612)17:4<234::aid-gcc5>3.0.co;2-4] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Altered genomic methylcytosine content has been described for a number of tumor types, including neuroblastoma. However, it remains to be determined for different tumor types whether specific loci or chromosomal regions are affected by a methylation change or whether the change is random. We have implemented a computer-based approach for the analysis of two-dimensional separations of human genomic restriction fragments. Through the use of methylation-sensitive restriction enzymes, methylation differences in genomic DNA between tumor and normal tissues can be detected. We report the cloning and sequencing of two fragments detectable in two-dimensional separations of genomic DNA of neuroblastomas. These fragments were found to be a part of repetitive units that exhibited demethylation in neuroblastoma relative to other tumor types. Our finding of a distinct pattern of methylation of repetitive units in neuroblastoma suggests that altered methylation at certain loci may contribute to the biology of this tumor.
Collapse
Affiliation(s)
- D Thoraval
- Department of Pediatrics, University of Michigan School of Medicine, Ann Arbor 48109-0510, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Zhu Z, Chao J, Yu H, Waggoner AS. Directly labeled DNA probes using fluorescent nucleotides with different length linkers. Nucleic Acids Res 1994; 22:3418-22. [PMID: 8078779 PMCID: PMC523738 DOI: 10.1093/nar/22.16.3418] [Citation(s) in RCA: 113] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Directly labeled fluorescent DNA probes have been made by nick translation and PCR using dUTP attached to the fluorescent label, Cy3, with different length linkers. With preparation of probes by PCR we find that linker length affects the efficiency of incorporation of Cy3-dUTP, the yield of labeled probe, and the signal intensity of labeled probes hybridized to chromosome target sequences. For nick translation and PCR, both the level of incorporation and the hybridization fluorescence signal increased in parallel when the length of the linker arm is increased. Under optimal conditions, PCR yielded more densely labeled probes, however, the yield of PCR labeled probe decreased with greater linear density of labeling. By using a Cy3-modified dUTP with the longest linker under optimal conditions it was possible to label up to 28% of the possible substitution sites on the target DNA with reasonable yield by PCR and 18% by nick translation. A mechanism involving steric interactions between the polymerase, cyanine-labeled sites on template and extending chains and the modified dUTP substrate is proposed to explain the inverse correlation between the labeling efficiency and the yield of DNA probe synthesis by PCR.
Collapse
Affiliation(s)
- Z Zhu
- Center for Light Microscope Imaging and Biotechnology, Carnegie Mellon University, Pittsburgh, PA 15213
| | | | | | | |
Collapse
|
13
|
Yu H, Chao J, Patek D, Mujumdar R, Mujumdar S, Waggoner AS. Cyanine dye dUTP analogs for enzymatic labeling of DNA probes. Nucleic Acids Res 1994; 22:3226-32. [PMID: 8065939 PMCID: PMC310300 DOI: 10.1093/nar/22.15.3226] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Fluorescence in situ hybridization (FISH) has become and indispensable tool in a variety of areas of research and clinical diagnostics. Many applications demand an approach for simultaneous detection of multiple target sequences that is rapid and simple, yet sensitive. In this work, we describe the synthesis of two new cyanine dye-labeled dUTP analogs, Cy3-dUTP and Cy5-dUTP. They are efficient substrates for DNA polymerases and can be incorporated into DNA probes by standard nick translation, random priming and polymerase chain reactions. Optimal labeling conditions have been identified which yield probes with 20-40 dyes per kilobase. The directly labeled DNA probes obtained with these analogs offer a simple approach for multicolor multisequence analysis that requires no secondary detection reagents and steps.
Collapse
Affiliation(s)
- H Yu
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213
| | | | | | | | | | | |
Collapse
|
14
|
Abstract
We report a collection of 53 prototypic sequences representing known families of repetitive elements from the human genome. The prototypic sequences are either consensus sequences or selected examples of repetitive sequences. The collection includes: prototypes for high and medium reiteration frequency interspersed repeats, long terminal repeats of endogenous retroviruses, alphoid repeats, telomere-associated repeats, and some miscellaneous repeats. The collection is annotated and available electronically.
Collapse
Affiliation(s)
- J Jurka
- Linus Pauling Institute of Science and Medicine, Palo Alto, CA 94306
| | | | | |
Collapse
|
15
|
Abstract
CpG islands are short, dispersed regions of unmethylated DNA with a high frequency of CpG dinucleotides relative to the bulk genome. We have analyzed 375 genes and 58 pseudogenes from the human entries in the EMBL Database for the presence of CpG islands. All 240 islands identified are associated with genes, and almost all cover at least a part of one exon; i.e., they are useful landmarks in the genome for identifying genes. More than half of the genes analyzed were associated with islands. All housekeeping and widely expressed genes have a CpG island covering the transcription start, whereas 40% of the genes with a tissue-specific or limited expression are associated with islands. In this latter group of genes, the position of the islands was not biased toward the 5' end of the transcription unit.
Collapse
Affiliation(s)
- F Larsen
- Biotechnology Centre of Oslo, University of Oslo, Norway
| | | | | | | |
Collapse
|
16
|
Giacalone J, Friedes J, Francke U. A novel GC-rich human macrosatellite VNTR in Xq24 is differentially methylated on active and inactive X chromosomes. Nat Genet 1992; 1:137-43. [PMID: 1302007 DOI: 10.1038/ng0592-137] [Citation(s) in RCA: 70] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
A new X chromosome-specific repetitive sequence, a 3 kilobase HindIII clone with a base composition of 63% C+G, has been isolated. The sequence is organized as a hypervariable tandem repeat cluster ranging in size from 150-350 kilobases, with outlying single copies. This locus, designated DXZ4 and mapped to chromosome band Xq24, may consist of as many as 50 variable-length alleles. It represents a class of variable number of tandem repeat polymorphism which may be termed 'macrosatellite'. The cluster is highly methylated on the active X chromosome and hypomethylated on the inactive X.
Collapse
Affiliation(s)
- J Giacalone
- Department of Genetics, Howard Hughes Medical Institute, Stanford University School of Medicine, California 94305
| | | | | |
Collapse
|
17
|
Kotin RM, Menninger JC, Ward DC, Berns KI. Mapping and direct visualization of a region-specific viral DNA integration site on chromosome 19q13-qter. Genomics 1991; 10:831-4. [PMID: 1653762 DOI: 10.1016/0888-7543(91)90470-y] [Citation(s) in RCA: 226] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
A human parvovirus, adeno-associated virus (AAV), is unique among eukaryotic DNA viruses in its ability to integrate site specifically into a defined region of human chromosome 19. In this study we used in situ hybridization to visualized directly the site of AAV DNA integration in latently infected human cell lines and normal human cells.
Collapse
Affiliation(s)
- R M Kotin
- Department of Microbiology, Hearst Microbiology Research Center, Cornell University Medical College, New York, New York 10021
| | | | | | | |
Collapse
|
18
|
Brook JD, Knight SJ, Roberts SH, Harley HG, Walsh KV, Rundle SA, Freyne K, Koch MC, Epstein ND, Wieringa B. The physical map of chromosome arm 19q: some new assignments, confirmations and re-assessments. Hum Genet 1991; 87:65-72. [PMID: 2037284 DOI: 10.1007/bf01213095] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
We have constructed and analysed somatic cell hybrids from cell lines containing balanced reciprocal translocations involving chromosome 19 and providing two new breakpoints on 19q. These and other hybrids have been tested with a series of markers from 19q to enhance the existing map. Several new cloned DNA sequences that map to 19q13.3-19qter are reported; the locus D19Z1 has been analysed by CHEF gel electrophoresis.
Collapse
Affiliation(s)
- J D Brook
- Institute of Medical Genetics, University of Wales College of Medicine, Heath Park, Cardiff, UK
| | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Vogt P. Potential genetic functions of tandem repeated DNA sequence blocks in the human genome are based on a highly conserved "chromatin folding code". Hum Genet 1990; 84:301-36. [PMID: 2407640 DOI: 10.1007/bf00196228] [Citation(s) in RCA: 99] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
This review is based on a thorough description of the structure and sequence organization of tandemly organized repetitive DNA sequence families in the human genome; it is aimed at revealing the locus-specific sequence organization of tandemly repetitive sequence structures as a highly conserved DNA sequence code. These repetitive so-called "super-structures" or "higher-order" structures are able to attract specific nuclear proteins. I shall define this code therefore as a "chromatin folding code". Since locus-specific superstructures of tandemly repetitive sequence units are present not only in the chromosome centromere or telomere region but also on the arms of the chromosomes, I assume that their chromatin folding code may contribute to, or even organize, the folding pathway of the chromatin chain in the nucleus. The "chromatin folding code" is based on its specific "chromatin code", which describes the sequence dependence of the helical pathway of the DNA primary sequence (i.e., secondary structure) entrapping the histone octamers in preferential positions. There is no periodicity in the distribution of the nucleosomes along the DNA chain. The folding pathway of the nucleosomal chromatin chain is however still flexible and determined by e.g., the length of the DNA chain between the nucleosomes. The fixation and stabilization of the chromatin chain in the space of the nucleus (i.e., its "functional state") may be mediated by additionally unique DNA protein interactions that are dictated by the "chromatin folding code". The unique DNA-protein interactions around the centromeres of human chromosomes are revealed for example by their "C-banding". I wish to stress that it is not my aim to relate each block of repetitive DNA sequences to a specific "chromatin folding code", but I shall demonstrate that there is an inherent potential for tandem repeated sequence units to develop a locus-specific repetitive higher order structure; this potential may create a specific chromatin folding code whenever a selection force exists at the position of this repetitive DNA structure in the genome.
Collapse
Affiliation(s)
- P Vogt
- Institut für Humangenetik und Anthropologie der Universität, Heidelberg, Federal Republic of Germany
| |
Collapse
|
20
|
Abstract
The clone designated hMF #1 represents a clustered DNA family, located on chromosome 1, consisting of tandem arrays displaying a monomeric length of 40 bp and a repetition frequency of approximately 7 x 10(3) copies per haploid genome. The sequence hMF #1 reveals multiple restriction fragment length polymorphisms (RFLPs) when human genomic DNA is digested with a variety of 4-6-bp recognition sequence restriction enzymes (i.e., Taq I, Eco RI, Pst I, etc.). When hamster and mouse genomic DNA was digested and analyzed, no cross-species homology could be observed. Further investigation revealed considerable hybridization in the higher primates (chimpanzee, gorilla, and orangutan) as well as some monkey species. The evolutionary relationship of this repetitive DNA sequence, found in humans, to that of other primates was explored using two hybridization methods: DNA dot blot to establish copy number and Southern DNA analysis to examine the complexity of the RFLPs. Homology to the hMF #1 sequence was found throughout the suborder Anthropoidea in 14 ape and New and Old World monkey species. However the sequence was absent in one species of the suborder Prosimii. Several discrepancies between "established" evolutionary relationships and those predicted by hMF #1 exist, which suggests that repetitive elements of this type are not reliable indicators of phylogenetic branching patterns. The phenomenon of marked diversity between sequence homologies and copy numbers of dispersed repetitive DNA of closely related species has been observed in Drosophila, mice, Galago, and higher primates. We report here a similar phenomenon for a clustered repeat that may have originated at an early stage of primate evolution.
Collapse
Affiliation(s)
- K M Tynan
- Department of Medical Biochemistry, University of Calgary, Alberta Childrens Hospital Research Centre, Canada
| | | |
Collapse
|
21
|
Tyler-Smith C, Taylor L, Müller U. Structure of a hypervariable tandemly repeated DNA sequence on the short arm of the human Y chromosome. J Mol Biol 1988; 203:837-48. [PMID: 3210241 DOI: 10.1016/0022-2836(88)90110-6] [Citation(s) in RCA: 78] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The structure of a repeated DNA sequence located on the short arm of the human Y chromosome is described. Genomic mapping and cloning in lambda or cosmid vectors show that the repeated sequence consists of units 20.3 x 10(3) base-pairs long that contain the three previously described DNA sequences: Y-156, Y-190 and Y-223a. Analysis of male genomic DNA by pulsed-field gel electrophoresis shows that the units are tandemly arranged and are organized into two blocks. The major block is hypervariable in size and alleles in the range approximately 540 x 10(3) to 800 x 10(3) base-pairs were detected. The minor block is not variable in size and is approximately 60 x 10(3) base-pairs long. Analysis of rearranged Y chromosomes shows that both blocks are located on the short arm of the chromosome. Most commonly, the major block is distal to the minor block, but the opposite arrangement is also found.
Collapse
Affiliation(s)
- C Tyler-Smith
- Department of Biochemistry, University of Oxford, U.K
| | | | | |
Collapse
|
22
|
Zhang XY, Loflin PT, Gehrke CW, Andrews PA, Ehrlich M. Hypermethylation of human DNA sequences in embryonal carcinoma cells and somatic tissues but not in sperm. Nucleic Acids Res 1987; 15:9429-49. [PMID: 2825135 PMCID: PMC306478 DOI: 10.1093/nar/15.22.9429] [Citation(s) in RCA: 34] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Certain human DNA sequences are much less methylated at CpG sites in sperm than in various adult somatic tissues. The DNA of term placenta displays intermediate levels of methylation at these sequences (Sp-0.3 sequences). We report here that pluripotent embryonal carcinoma (EC) cells derived from testicular germ cell tumors are hypermethylated at the three previously cloned Sp-0.3 sequences and seven newly isolated sequences that exhibit sperm-specific hypomethylation. In contrast to their hypermethylation in EC cells, the Sp-0.3 sequences are hypomethylated in a line of yolk sac carcinoma cells, which like placenta, represent an extraembryonic lineage. These DNA sequences, therefore, appear to be subject to coordinate changes in their methylation during differentiation, probably early in embryogenesis, despite their diversity in copy number (1 to 10(4] and primary structure. Two of these Sp-0.3 sequences are highly homologous to DNA sequences in human chromosomal regions that might be recombination hotspots, namely, a cryptic satellite DNA sequence at a fragile site and the downstream region of the beta-globin gene cluster.
Collapse
Affiliation(s)
- X Y Zhang
- Department of Biochemistry, Tulane Medical School, New Orleans, LA 70112
| | | | | | | | | |
Collapse
|