1
|
A classical revival: Human satellite DNAs enter the genomics era. Semin Cell Dev Biol 2022; 128:2-14. [PMID: 35487859 DOI: 10.1016/j.semcdb.2022.04.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 04/11/2022] [Accepted: 04/12/2022] [Indexed: 12/30/2022]
Abstract
The classical human satellite DNAs, also referred to as human satellites 1, 2 and 3 (HSat1, HSat2, HSat3, or collectively HSat1-3), occur on most human chromosomes as large, pericentromeric tandem repeat arrays, which together constitute roughly 3% of the human genome (100 megabases, on average). Even though HSat1-3 were among the first human DNA sequences to be isolated and characterized at the dawn of molecular biology, they have remained almost entirely missing from the human genome reference assembly for 20 years, hindering studies of their sequence, regulation, and potential structural roles in the nucleus. Recently, the Telomere-to-Telomere Consortium produced the first truly complete assembly of a human genome, paving the way for new studies of HSat1-3 with modern genomic tools. This review provides an account of the history and current understanding of HSat1-3, with a view towards future studies of their evolution and roles in health and disease.
Collapse
|
2
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
3
|
Jarmuz M, Glotzbach CD, Bailey KA, Bandyopadhyay R, Shaffer LG. The Evolution of satellite III DNA subfamilies among primates. Am J Hum Genet 2007; 80:495-501. [PMID: 17273970 PMCID: PMC1821104 DOI: 10.1086/512132] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2006] [Accepted: 12/16/2006] [Indexed: 01/30/2023] Open
Abstract
We demonstrate that satellite III (SatIII) DNA subfamilies cloned from human acrocentric chromosomes arose in the Hominoidea superfamily. Two groups, distinguished by sequence composition, evolved nonconcurrently, with group 2 evolving 16-23 million years ago (MYA) and the more recent group 1 sequences emerging approximately 4.5 MYA. We also show the relative order of emergence of each group 2 subfamily in the various primate species. Our results show that each SatIII subfamily is an independent evolutionary unit, that the rate of evolution is not uniform between species, and that the evolution within a species is not uniform between chromosomes.
Collapse
Affiliation(s)
- Malgorzata Jarmuz
- Health Research and Education Center, Washington State University, Spokane 99210, USA
| | | | | | | | | |
Collapse
|
4
|
Fowler JC, Burgoyne LA, Baker EG, Ringenbergs ML, Callen DF. Human satellite III DNA: genomic location and sequence homogeneity of the TaqI-deficient polymorphic sequences. Chromosoma 1989; 98:266-72. [PMID: 2575486 DOI: 10.1007/bf00327312] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Human Satellite III DNA is a major tandem repeat in the human genome and presents a TaqI-specific hypervariable restriction fragment length polymorphism when a Satellite III related sequence (228S) is used as a probe. In situ examination shows this sequence to be near specific for the region 9qh on chromosome 9 when it is used at low probe concentrations. However the region 9qh does not appear to be the only or even the primary source of the TaqI-deficient polymorphic sequences (TDPS). Rather, such sequences appear to be mostly present in chromosomes 20, 21, and 22, and these represent the largest regions of homogeneous Satellite III in the genome; they are also resistant to digestion with a range of other restriction endonucleases. The TDPS do not arise from either of the two currently recognized Satellite III-enriched genomic regions, namely autosomal 'K-domains', which form part of 15p in chromosome 15 or the heterochromatin of chromosome Y.
Collapse
Affiliation(s)
- J C Fowler
- School of Biological Sciences, Flinders University of South Australia, Bedford Park
| | | | | | | | | |
Collapse
|
6
|
Miller DA, Sharma V, Mitchell AR. A human-derived probe, p82H, hybridizes to the centromeres of gorilla, chimpanzee, and orangutan. Chromosoma 1988; 96:270-4. [PMID: 3133178 DOI: 10.1007/bf00286913] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
A human-derived centromeric sequence, p82H, hybridizes to DNA from gorilla, chimpanzee, pygmy chimpanzee, and orangutan. On DNA blots, multimeric ladders based on 170 or 340 bp repeat units are seen. In metaphase chromosome preparations from these species, p82H hybridizes to the centromeric region of every chromosome. p82H forms less stable hybrids with DNA from the lion-tailed macaque and does not hybridize to DNA or chromosomes of the owl monkey or the mouse.
Collapse
Affiliation(s)
- D A Miller
- Department of Molecular Biology and Genetics, Wayne State University, Detroit, MI 48201
| | | | | |
Collapse
|
7
|
Collet C, Westerman M. Interspecies comparison of the highly-repeated DNA of Australasian Luzula (Juncaceae). Genetica 1987; 74:95-103. [PMID: 2465202 DOI: 10.1007/bf00055219] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The woodrush genus Luzula is characterised by having holocentric chromosomes. DNA of nine related Australasian species shows similar satellite DNAs which re very similar in nucleotide sequence content and unit length. Differences between the repetitive DNAs are evident as either the presence or absence of particular restriction enzyme sites. Sequence variants have probably been introduced into the repeated DNA components of ancestral species and particular variants reamplified during the evolution of the genus. Sequence amplification appears to be restricted to sequences already present in the genome rather than the de novo generation of repeats. The evolution of highly-repeated DNA sequences dispersed throughout the holocentric chromosomes of Luzula thus appears to be very similar to that known in eukaryotes with the more normal monocentric chromosome organisation.
Collapse
Affiliation(s)
- C Collet
- CSIRO Wildlife, Lyncham, A.C.T., Australia
| | | |
Collapse
|
9
|
Abstract
The simple sequence components of three human classical satellite DNAs have been defined, and some segments of each satellite have been sequenced. Each of the classical satellites I, II and III was found to contain, as a major component, a single family of simple repeated sequences. The three simple-sequence families have been called satellites 1, 2 and 3, to indicate the enrichment of each in one of the classical satellites I, II and III, and to differentiate them from these classical satellites, which also contain other repeated components. Satellite 3, the simple sequence component of classical satellite III, when digested with the restriction endonuclease HinfI, forms a ladder based on a repeat of five base-pairs, 5' A-T-T-C-C. The HinfI ladder was shown to be composed of repeated elements with the general sequence 5' (A-T-T-C-C)n-A-TC-T-C-G-G-G-T-T-G. Satellite 2, the simple sequence component of classical satellite II, is digested by HinfI into a large number of very small fragments, of length 10 to 80 base-pairs. These were found to contain the simple repeat 5' A-T-T-C-C, in a highly diverged form. Analysis of satellite 2 sequences suggested that the five base-pair repeat was originally amplified as a higher-order repeat like that of satellite 3. However, the main tandemly repeated segments of satellite 2 in the human genome are much longer, and the simple sequence elements on which they are based are quite degenerate. Satellite 1, the simple sequence component of classical satellite I, is digested by the restriction endonuclease RsaI into a ladder of fragments less than 150 base-pairs in length. These ladder fragments were found to be formed by the loss of RsaI sites from two related A + T-rich sequences, A (17 base-pairs) and B (25 base-pairs), arranged in alternating arrays, -A-B-A-B-A-. Analysis of a large number of cloned fragments from the RsaI ladder of satellite 1 showed that the tandem arrays, -A-B-A-B-A, have a more complex arrangement, with apparent amplification of segments containing particular sequence variants of the repeat units, A and B. No sequence relationship was evident between the repeat elements of satellite 1 and those of satellites 2 and 3.
Collapse
|
12
|
Sakaki Y, Kurata Y, Miyake T, Saigo K. Two-dimensional gel electrophoretic analysis of the HindIII 1.8-kb repetitive-sequence family in the human genome. Gene X 1983; 24:179-90. [PMID: 6315535 DOI: 10.1016/0378-1119(83)90078-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
The structure and organization of a human repetitive DNA family containing the HindIII 1.8-kb repetitive sequence were studied, using two-dimensional (2D) gel electrophoresis. The HindIII 1.8-kb sequence proved to be part of a repetitive sequence about 5 kb long and interspersed on the genome. The long repetitive sequence family contained several subgroups, as based on polymorphism of the restriction site. Recombinant phages containing the long repetitive sequence were isolated from the human genomic DNA library. Heteroduplex and restriction analysis showed that the structure of the repetitive sequence carried by the phages was close to that expected from 2D gel electrophoretic analysis. The 2D gel electrophoretic analysis was shown to be a reliable and useful approach for surveying and mass analysis of repetitive sequence families.
Collapse
|