1
Louzada S, Lopes M, Ferreira D, Adega F, Escudeiro A, Gama-Carvalho M, Chaves R. Decoding the Role of Satellite DNA in Genome Architecture and Plasticity-An Evolutionary and Clinical Affair. Genes (Basel) 2020; 11:E72. [PMID: 31936645 PMCID: PMC7017282 DOI: 10.3390/genes11010072] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Received: 12/16/2019] [Revised: 12/29/2019] [Accepted: 01/08/2020] [Indexed: 12/11/2022] Open
Abstract
Repetitive DNA is a major organizational component of eukaryotic genomes, being intrinsically related with their architecture and evolution. Tandemly repeated satellite DNAs (satDNAs) can be found clustered in specific heterochromatin-rich chromosomal regions, building vital structures like functional centromeres and also dispersed within euchromatin. Interestingly, despite their association to critical chromosomal structures, satDNAs are widely variable among species due to their high turnover rates. This dynamic behavior has been associated with genome plasticity and chromosome rearrangements, leading to the reshaping of genomes. Here we present the current knowledge regarding satDNAs in the light of new genomic technologies, and the challenges in the study of these sequences. Furthermore, we discuss how these sequences, together with other repeats, influence genome architecture, impacting its evolution and association with disease.
Affiliation(s)
- Sandra Louzada
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (S.L.); (M.L.); (D.F.); (F.A.); (A.E.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal;
- Mariana Lopes
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (S.L.); (M.L.); (D.F.); (F.A.); (A.E.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal;
- Daniela Ferreira
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (S.L.); (M.L.); (D.F.); (F.A.); (A.E.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal;
- Filomena Adega
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (S.L.); (M.L.); (D.F.); (F.A.); (A.E.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal;
- Ana Escudeiro
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (S.L.); (M.L.); (D.F.); (F.A.); (A.E.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal;
- Margarida Gama-Carvalho
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal;
- Raquel Chaves
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (S.L.); (M.L.); (D.F.); (F.A.); (A.E.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal;
2
Satović E, Vojvoda Zeljko T, Luchetti A, Mantovani B, Plohl M. Adjacent sequences disclose potential for intra-genomic dispersal of satellite DNA repeats and suggest a complex network with transposable elements. BMC Genomics 2016; 17:997. [PMID: 27919246 PMCID: PMC5139131 DOI: 10.1186/s12864-016-3347-1] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Received: 08/31/2016] [Accepted: 11/25/2016] [Indexed: 11/14/2022] Open
Abstract
Background: Satellite DNA (satDNA) sequences are typically arranged as arrays of tandemly repeated monomers. Due to the similarity among monomers, their organizational pattern and abundance, satDNAs are hardly accessible to structural and functional studies and still represent the most obscure genome component. Although many satDNA arrays of diverse length and even single monomers exist in the genome, surprisingly little is known about transition from satDNAs to other sequences. Studying satDNA monomers at junctions and identifying DNA sequences adjacent to them can help to understand the processes that (re)distribute satDNAs and significance that evolution of these sequence elements might have in creating the genomic landscape.
Results: We explored sets of randomly selected satDNA-harboring genomic fragments in four mollusc species to examine satDNA transition sites, and the nature of adjacent sequences. All examined junctions are characterized by abrupt transitions from satDNAs to other sequences. Among them, junctions of only one examined satDNA mapped non-randomly (within the palindrome), indicating that well-defined sequence feature is not a necessary prerequisite in the junction formation. In the studied sample, satDNA flanking sequences can be roughly classified into two groups. The first group is composed of anonymous DNA sequences which occasionally include short segments of transposable elements (TEs) as well as segments of other satDNA sequences. In the second group, satDNA repeats and the array flanking sequences are identified as parts of TEs of the Helitron superfamily. There, some array flanking regions hold fragmented satDNA monomers alternating with anonymous sequences of comparable length as missing monomer parts, suggesting a process of sequence reorganization by a mechanism able to excise short monomer parts and replace them with unrelated sequences.
Conclusions: The observed architecture of satDNA transition sites can be explained as a result of insertion and/or recombination events involving short arrays of satDNA monomers and TEs, in combination with hypothetical transposition-related ability of satDNA monomers to be shuffled independently in the genome. We conclude that satDNAs and TEs can form a complex network of sequences which essentially share the propagation mechanisms and in synergy shape the genome.
Affiliation(s)
- Eva Satović
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
- Andrea Luchetti
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali-Università di Bologna, Bologna, Italy
- Barbara Mantovani
- Dipartimento di Scienze Biologiche, Geologiche e Ambientali-Università di Bologna, Bologna, Italy
- Miroslav Plohl
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia.
3
Sharma A, Wolfgruber TK, Presting GG. Tandem repeats derived from centromeric retrotransposons. BMC Genomics 2013; 14:142. [PMID: 23452340 PMCID: PMC3648361 DOI: 10.1186/1471-2164-14-142] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Received: 11/22/2012] [Accepted: 02/23/2013] [Indexed: 12/26/2022] Open
Abstract
Background: Tandem repeats are ubiquitous and abundant in higher eukaryotic genomes and constitute, along with transposable elements, much of DNA underlying centromeres and other heterochromatic domains. In maize, centromeric satellite repeat (CentC) and centromeric retrotransposons (CR), a class of Ty3/gypsy retrotransposons, are enriched at centromeres. Some satellite repeats have homology to retrotransposons and several mechanisms have been proposed to explain the expansion, contraction as well as homogenization of tandem repeats. However, the origin and evolution of tandem repeat loci remain largely unknown.
Results: CRM1TR and CRM4TR are novel tandem repeats that we show to be entirely derived from CR elements belonging to two different subfamilies, CRM1 and CRM4. Although these tandem repeats clearly originated in at least two separate events, they are derived from similar regions of their respective parent element, namely the long terminal repeat (LTR) and untranslated region (UTR). The 5′ ends of the monomer repeat units of CRM1TR and CRM4TR map to different locations within their respective LTRs, while their 3′ ends map to the same relative position within a conserved region of their UTRs. Based on the insertion times of heterologous retrotransposons that have inserted into these tandem repeats, amplification of the repeats is estimated to have begun at least ~4 (CRM1TR) and ~1 (CRM4TR) million years ago. Distinct CRM1TR sequence variants occupy the two CRM1TR loci, indicating that there is little or no movement of repeats between loci, even though they are separated by only ~1.4 Mb.
Conclusions: The discovery of two novel retrotransposon derived tandem repeats supports the conclusions from earlier studies that retrotransposons can give rise to tandem repeats in eukaryotic genomes. Analysis of monomers from two different CRM1TR loci shows that gene conversion is the major cause of sequence variation. We propose that successive intrastrand deletions generated the initial repeat structure, and gene conversions increased the size of each tandem repeat locus.
4
The life in science. Mol Biol 2008. [DOI: 10.1134/s0026893308050026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]
5
Plohl M, Luchetti A, Mestrović N, Mantovani B. Satellite DNAs between selfishness and functionality: structure, genomics and evolution of tandem repeats in centromeric (hetero)chromatin. Gene 2007; 409:72-82. [PMID: 18182173 DOI: 10.1016/j.gene.2007.11.013] [Citation(s) in RCA: 234] [Impact Index Per Article: 13.0] [Received: 08/06/2007] [Revised: 11/08/2007] [Accepted: 11/20/2007] [Indexed: 12/21/2022]
Abstract
Satellite DNAs (tandemly repeated, non-coding DNA sequences) stretch over almost all native centromeres and surrounding pericentromeric heterochromatin. Although they were once considered inert by-products of genome dynamics in heterochromatic regions, recent studies showed that satellite DNA evolution is an interplay of stochastic events and selective pressure. This points to a functional significance of satellite sequences, which in (peri)centromeres may play some fundamental functional roles. First, specific interactions with DNA-binding proteins are proposed to complement sequence-independent epigenetic processes. The second role is achieved through an RNAi mechanism, in which transcripts of satellite sequences initialize heterochromatin formation. In addition, satellite DNAs in (peri)centromeric regions affect chromosomal dynamics and genome plasticity. Paradoxically, while centromeric function is conserved through eukaryotes, the profile of satellite DNAs in this region is almost always species-specific. We argue that tandem repeats may be advantageous forms of DNA sequences in (peri)centromeres due to concerted evolution, which maintains high intra-array and intrapopulation sequence homogeneity of satellite arrays, while allowing rapid changes in nucleotide sequence and/or composition of satellite repeats. This feature may be crucial for long-term stability of DNA-protein interactions in centromeric regions.
Affiliation(s)
- Miroslav Plohl
- Department of Molecular Genetics, Ruđer Bošković Institute, Bijenička 54, HR-10002 Zagreb, Croatia.
6
Alkan C, Ventura M, Archidiacono N, Rocchi M, Sahinalp SC, Eichler EE. Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data. PLoS Comput Biol 2007; 3:1807-18. [PMID: 17907796 PMCID: PMC1994983 DOI: 10.1371/journal.pcbi.0030181] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Received: 05/01/2007] [Accepted: 07/31/2007] [Indexed: 11/18/2022] Open
Abstract
The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%–5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution. Centromeric DNA has been described as the last frontier of genomic sequencing; such regions are typically poorly assembled during the whole-genome shotgun sequence assembly process due to their repetitive complexity. This paper develops a computational algorithm to systematically extract data regarding primate centromeric DNA structure and organization from that ∼5% of sequence that is not included as part of standard genome sequence assemblies. Using this computational approach, we identify and reconstruct published human higher-order alpha satellite arrays and discover new families in human, chimpanzee, and Old World monkeys. Experimental validation confirms the utility of this computational approach to understanding the centromere organization of other nonhuman primates. 
An evolutionary analysis in diverse primate genomes supports fundamental differences in the structure and organization of centromere DNA between ape and Old World monkey lineages. The ability to extract meaningful biological data from random shotgun sequence data helps to fill an important void in large-scale sequencing of primate genomes, with implications for other genome sequencing projects.
Affiliation(s)
- Can Alkan
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, United States of America
- Mario Ventura
- Department of Genetics and Microbiology, University of Bari, Bari, Italy
- Mariano Rocchi
- Department of Genetics and Microbiology, University of Bari, Bari, Italy
- S. Cenk Sahinalp
- Department of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada
- Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
7
Luchetti A, Scanabissi F, Mantovani B. Evolution of LEP150 sub-repeat array within the ribosomal IGS of the clam shrimp Leptestheria dahalacensis (Crustacea Branchiopoda Conchostraca). Gene 2007; 400:174-80. [PMID: 17651923 DOI: 10.1016/j.gene.2007.06.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/20/2007] [Revised: 06/14/2007] [Accepted: 06/19/2007] [Indexed: 11/28/2022]
Abstract
The Leptestheria dahalacensis genome harbours repeats of the LEP150 satellite DNA family linked to the 5S gene, within the ribosomal intergenic spacer. In genetically isolated samples, the sequence analysis of the region (5S, flanking region, first satellite monomer: unit A, second satellite monomer: unit B) evidenced three 5S variants. The alpha and gamma variants share a greater homology. They co-occur in the Central European samples, while in the Italian one, the highly divergent alpha and beta variants are present. In phylogenetic analyses, A and B LEP150 monomers show a peculiar clustering; this was further confirmed through the sequencing for the alpha variant of four monomers at the 5' and 3' tails (units A, B, C, D and D', C', B', A', respectively). Horizontal homogenisation was observed only across C, D, C' and D' units. Furthermore, repeat sequence diversity decreases toward terminal repeats, at variance with literature data. The pattern of variation observed is explained taking into account the presence at the LEP150 array borders of two loci under natural selection: the 5S rRNA gene, upstream, and the rDNA transcription promoter, downstream. These elements may drive the dynamics of flanking regions and linked repeats in a process similar to selective sweep. At variance with classical genetic hitchhiking, the selective sweep here scored should be realized and maintained through an interplay of selection and molecular drive.
Affiliation(s)
- Andrea Luchetti
- Dipartimento di Biologia E. S., Università degli Studi di Bologna, Bologna, Italy.
8
Li J, Wang X, Leung FC. The intragenomic polymorphism of a partially inverted repeat (PIR) in Gallus gallus domesticus, potential role of inverted repeats in satellite DNAs evolution. Gene 2006; 387:118-25. [PMID: 17113248 DOI: 10.1016/j.gene.2006.08.033] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Received: 05/10/2006] [Revised: 08/08/2006] [Accepted: 08/22/2006] [Indexed: 11/17/2022]
Abstract
We report here the molecular characterization of the basic repeating unit of a novel repetitive family, partially inverted repeat (PIR), previously identified from chicken genome. This repetitive DNA family shares a close evolutionary relationship with XhoI/EcoRI repeats and chicken nuclear-membrane-associated (CNM) repeat. Sequence analyses reveal the 1430 bp basic repeating unit can be divided into two regions: the central region (approximately 1000 bp) and the flanking region (approximately 430 bp). Within the central region, a pair of repeats (86 bp) flanks the central core (approximately 828 bp) in inversed orientation. Due to the tandem array feature shared by the repeating units, the inverted repeats fall between the central core and flanking region. Southern blot analyses further reveal the intragenomic polymorphism of PIR, and the molecular size of repeating units ranges from 1.1 kb to 1.6 kb. The identified monomer variants may result from multiple crossing-over events, implying the potential roles of inverted repeats in satellite DNAs variation.
Affiliation(s)
- Juan Li
- Department of Zoology, The University of Hong Kong, Pokfulam road, Hong Kong SAR, China
9
Bulazel K, Metcalfe C, Ferreri GC, Yu J, Eldridge MDB, O'Neill RJ. Cytogenetic and molecular evaluation of centromere-associated DNA sequences from a marsupial (Macropodidae: Macropus rufogriseus) X chromosome. Genetics 2005; 172:1129-37. [PMID: 16387881 PMCID: PMC1456211 DOI: 10.1534/genetics.105.047654] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Indexed: 11/18/2022] Open
Abstract
The constitution of the centromeric portions of the sex chromosomes of the red-necked wallaby, Macropus rufogriseus (family Macropodidae, subfamily Macropodinae), was investigated to develop an overview of the sequence composition of centromeres in a marsupial genome that harbors large amounts of centric and pericentric heterochromatin. The large, C-band-positive centromeric region of the X chromosome was microdissected and the isolated DNA was microcloned. Further sequence and cytogenetic analyses of three representative clones show that all chromosomes in this species carry a 178-bp satellite sequence containing a CENP-B DNA binding domain (CENP-B box) shown herein to selectively bind marsupial CENP-B protein. Two other repeats isolated in this study localize specifically to the sex chromosomes yet differ in copy number and intrachromosomal distribution. Immunocytohistochemistry assays with anti-CENP-E, anti-CREST, anti-CENP-B, and anti-trimethyl-H3K9 antibodies defined a restricted point localization of the outer kinetochore at the functional centromere within an enlarged pericentric and heterochromatic region. The distribution of these repeated sequences within the karyotype of this species, coupled with the apparent high copy number of these sequences, indicates a capacity for retention of large amounts of centromere-associated DNA in the genome of M. rufogriseus.
Affiliation(s)
- Kira Bulazel
- Department of Molecular and Cell Biology, University of Connecticut, Storrs 06269, USA
10
Schueler MG, Dunn JM, Bird CP, Ross MT, Viggiano L, Rocchi M, Willard HF, Green ED. Progressive proximal expansion of the primate X chromosome centromere. Proc Natl Acad Sci U S A 2005; 102:10563-8. [PMID: 16030148 PMCID: PMC1180780 DOI: 10.1073/pnas.0503346102] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.6] [Indexed: 11/18/2022] Open
Abstract
Previous studies of the pericentromeric region of the human X chromosome short arm (Xp) revealed an age gradient from ancient DNA that contains expressed genes to recent human-specific DNA at the functional centromere. We analyzed the finished sequence of this human genomic region to investigate its evolutionary history. Phylogenetic analysis of >1,500 alpha-satellite monomers from the region revealed the presence of five physical domains, each containing monomers from a distinct phylogenetic clade. The most distal domain contains long interspersed nucleotide element repeats that were active >35 million years ago, whereas the four proximal domains contain more recently active long interspersed nucleotide element repeats. An out-of-register, unequal recombination (i.e., crossover) detected at the edge of the X chromosome-specific alpha-satellite array (DXZ1) may reflect the most recent of a series of punctuating events during evolution that resulted in a proximal physical expansion of the X centromere. The first 18 kb of this array has 97-99% pairwise identity among all 2-kb repeat units. To perform more detailed evolutionary comparisons, we sequenced the junction between the ancient DNA of Xp and the primate-specific alpha satellite in chimpanzee, gorilla, orangutan, vervet, macaque, and baboon. The striking conservation found in all cases supports the ancestral nature of the alpha satellite at this location. These studies demonstrate that the primate X centromere appears to have evolved through repeated expansion events occurring within the central, active region of centromeric DNA, with the newly added sequences then conferring centromere function.
Affiliation(s)
- Mary G Schueler
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
11
Luchetti A, Marino A, Scanabissi F, Mantovani B. Genomic dynamics of a low-copy-number satellite DNA family in Leptestheria dahalacensis (Crustacea, Branchiopoda, Conchostraca). Gene 2004; 342:313-20. [PMID: 15527990 DOI: 10.1016/j.gene.2004.08.018] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Received: 06/21/2004] [Revised: 08/06/2004] [Accepted: 08/19/2004] [Indexed: 10/26/2022]
Abstract
The LEP150 satellite DNA (satDNA) family found in Leptestheria dahalacensis (Ruppel, 1837) (Conchostraca) is a low-copy-number satellite with a canonical monomer of 150 bp. Nucleotide variation analyses suggest a 14-bp palindromic region as a possible protein binding site with constraints acting on the whole sequence but a 25-bp variable box. Besides the head-to-tail arrangement of 150 bp monomers, multimers analyses evidenced incomplete monomers, one duplication event, and three inversions. Both observed rearrangements and the higher values of sequence variability scored suggest that rearranged monomers reside in regions with a lower degree of homogenisation efficiency. Sixty-seven percent of the breakpoints occur at kinkable dinucleotides, thus supporting their role in rearrangements as documented in alphoid satDNA recombination events. Monomers of different lengths may result from crossing over between repeats misaligned through the direct and inverted subrepeats of LEP150 monomers. ANOVA results indicate that the same range of sequence diversity is experienced at the individual and population ranks; therefore, the evolution of the L. dahalacensis satDNA is concerted.
Affiliation(s)
- Andrea Luchetti
- Dipartimento di Biologia Evoluzionistica Sperimentale, Università di Bologna, Via Selmi 3, Bologna 40126, Italy
12
Alkan C, Eichler EE, Bailey JA, Sahinalp SC, Tüzün E. The Role of Unequal Crossover in Alpha-Satellite DNA Evolution: A Computational Analysis. J Comput Biol 2004; 11:933-44. [PMID: 15700410 DOI: 10.1089/cmb.2004.11.933] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Indexed: 11/13/2022] Open
Abstract
Human DNA consists of a large number of tandem repeat sequences. Such sequences are usually called satellites, with the primary example being the centromeric alpha-satellite DNA. The basic repeat unit of the alpha-satellite DNA is a 171 bp monomer. Arbitrary monomer pairs usually have considerable sequence divergence (20-40%). However, with the exception of peripheral alpha-satellite DNA, monomers can be grouped into blocks of k-monomers (4 ≤ k ≤ 20) between which the divergence rate is much smaller (e.g., 5%). Perhaps the simplest and best understood mechanism for tandem repeat array evolution is unequal crossover. Although it is possible that alpha-satellite sequences developed as a result of subsequent unequal crossovers only, no formal computational framework seems to have been developed to verify this possibility. In this paper, we develop such a framework and report on experiments which imply that pericentromeric alpha-satellite segments (which are devoid of higher order structure) are evolutionarily distinct from the higher order repeat segments. It is likely that the higher order repeats developed independently in distinct regions of the genome and were carried into their current locations through an unknown mechanism of transposition.
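Illustrative note: the unequal crossover mechanism named in the abstract above can be pictured with a toy model. The sketch below is not the computational framework of the cited paper; it only assumes that a tandem array can be represented as a list of monomer labels and that two copies of the array pair out of register by a whole number of monomers. The function name `unequal_crossover` and its parameters `offset` and `breakpoint` are hypothetical names chosen for this illustration.

```python
from typing import List, Tuple


def unequal_crossover(
    array: List[str], offset: int, breakpoint: int
) -> Tuple[List[str], List[str]]:
    """Toy model of one out-of-register exchange between two copies of an array.

    The second copy is assumed to be shifted by `offset` monomers relative to
    the first, and the exchange is placed just before index `breakpoint` of
    the first copy. Returns (expanded, contracted) recombinant arrays.
    """
    if not 0 < offset <= breakpoint <= len(array):
        raise ValueError("need 0 < offset <= breakpoint <= len(array)")
    # Left part of copy 1 joined to the right part of the shifted copy 2:
    # `offset` monomers end up duplicated.
    expanded = array[:breakpoint] + array[breakpoint - offset:]
    # Reciprocal product: the same `offset` monomers are lost.
    contracted = array[:breakpoint - offset] + array[breakpoint:]
    return expanded, contracted


monomers = list("ABCDEF")  # six labelled monomers in tandem
expanded, contracted = unequal_crossover(monomers, offset=2, breakpoint=4)
print("".join(expanded))    # ABCDCDEF
print("".join(contracted))  # ABEF
```

In this toy model one product gains a tandem duplication of the misaligned monomers while the reciprocal product loses them, so the total monomer count across the two products is unchanged.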
Affiliation(s)
- Can Alkan
- Department of EECS, Case Western Reserve University, Cleveland, OH 44106, USA
13
Daniels LM, Delany ME. Molecular and cytogenetic organization of the 5S ribosomal DNA array in chicken (Gallus gallus). Chromosome Res 2004; 11:305-17. [PMID: 12906126 DOI: 10.1023/a:1024008522122] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Indexed: 11/12/2022]
Abstract
The 5S ribosomal (r) RNA genes encode a small (approximately 120-bp) highly-conserved component of the large ribosomal subunit. The objective of the present research was to study the molecular and cytogenetic organization of the chicken 5S rDNA. A predominant 2.2-kb gene (5Salpha) consisting of a coding and intergenic spacer (IGS) region was identified in ten research and commercial populations. A variant gene repeat of 0.6 kb (5Sbeta) was observed in some of the populations. Genetic linkage analysis and cytogenetic localization by fluorescence in-situ hybridization assigned the 5S rDNA to chromosome 9. The 5S rDNA array was determined to be 80.2 ± 7.0 kb upon electrophoretic sizing following EcoRV digestion. Sequence analysis of 5Salpha IGS regions revealed considerable conservation between chicken subspecies (98.4% identity) as well as homology with vertebrate Pol III promoter and regulatory sequence motifs. Minor intraindividual sequence variation within 1000 bp of IGS was observed in four cloned Red Jungle Fowl (Gallus gallus gallus) 5Salpha repeats (95.5% identity in this region). Sequence comparisons between IGS regions of 5Salpha and 5Sbeta genes indicated two short continuous (>20 bp) and many short non-continuous homologous regions as well as other conserved features such as promoter and termination motifs.
Affiliation(s)
- Laura M Daniels
- Department of Animal Science, 2131D Meyer Hall, One Shields Avenue, University of California, Davis, CA 95616, USA
14
Kazakov AE, Shepelev VA, Tumeneva IG, Alexandrov AA, Yurov YB, Alexandrov IA. Interspersed repeats are found predominantly in the “old” α satellite families. Genomics 2003; 82:619-27. [PMID: 14611803 DOI: 10.1016/s0888-7543(03)00182-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Indexed: 11/26/2022]
Abstract
The biased distribution of dispersed repeat insertions in various types of primate specific alpha satellites (AS) is being discussed in the literature in relation to the modes of AS evolution and their possible roles in maintenance and disruption of functional centromeres. However, such a bias has not been properly documented on a genome-wide scale so far. In this work, using a representative sample of about 100 insertions we show that the "old" AS contains at least 10 times more dispersed repeats than the "new" one. In the new arrays insertions accumulate mostly in poorly homogenized areas, presumably in the edges, and in the old AS, throughout the whole array length. Dating of L1 insertions in the old AS revealed that their massive accumulation started at or after the time when the new AS emerged and expanded in the genome and the centromere function had shifted to the new AS arrays.
Affiliation(s)
- Alexei E Kazakov
- Mental Health Research Center, Russian Academy of Medical Sciences, Zagorodnoe sh.2, Moscow 113152, Russia
15
Schindelhauer D, Schwarz T. Evidence for a fast, intrachromosomal conversion mechanism from mapping of nucleotide variants within a homogeneous alpha-satellite DNA array. Genome Res 2002; 12:1815-26. [PMID: 12466285 PMCID: PMC187568 DOI: 10.1101/gr.451502] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.3] [Indexed: 11/25/2022]
Abstract
Assuming that patterns of sequence variants within highly homogeneous centromeric tandem repeat arrays can tell us which molecular turnover mechanisms are presently at work, we analyzed the alpha-satellite tandem repeat array DXZ1 of one human X chromosome. Here we present accurate snapshots from this dark matter of the genome. We demonstrate stable and representative cloning of the array in a P1 artificial chromosome (PAC) library, use samples of higher-order repeats subcloned from five unmapped PACs (120-160 kb) to identify common variants, and show that such variants are presently in a fixed transition state. To characterize patterns of variant spread throughout homogeneous array segments, we use a novel partial restriction and pulsed-field gel electrophoresis mapping approach. We find an older large-scale (35-50 kb) duplication event supporting the evolutionarily important unequal crossing-over hypothesis, but generally find independent variant occurrence and a paucity of potential de novo mutations within segments of highest homogeneity (99.1%-99.3%). Within such segments, a highly nonrandom variant clustering within adjacent higher-order repeats was found in the absence of haplotypic repeats. Such variant clusters are hardly explained by interchromosomal, fixation-driving mechanisms and likely reflect a fast, localized, intrachromosomal sequence conversion mechanism.
Affiliation(s)
- Dirk Schindelhauer
- Institute of Human Genetics, Technical University of Munich, Munich, Germany.
|
16
|
Mashkova TD, Oparina NY, Lacroix MH, Fedorova LI, Tumeneva IG, Zinovieva OL, Kisselev LL. Structural rearrangements and insertions of dispersed elements in pericentromeric alpha satellites occur preferably at kinkable DNA sites. J Mol Biol 2001; 305:33-48. [PMID: 11114245 DOI: 10.1006/jmbi.2000.4270] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Indexed: 11/22/2022]
Abstract
The centromeric region of human chromosome 21 comprises two long alphoid DNA arrays: the well homogenized and CENP-B box-rich alpha21-I and the alpha21-II, containing a set of less homogenized and CENP-B box-poor subfamilies located closer to the short arm of the chromosome. A continuous alphoid fragment of 100 monomers bordering the non-satellite sequences in human chromosome 21 was mapped to the pericentromeric short arm region by fluorescence in situ hybridization (alpha21-II locus). The alphoid sequence contained several rearrangements including five large deletions within monomers and insertions of three truncated L1 elements. No binding sites for centromeric protein CENP-B were found. We analyzed sequences with alphoid/non-alphoid junctions selectively screened from current databases and revealed various rearrangements disrupting the regular tandem alphoid structure, namely, deletions, duplications, inversions, expansions of short oligonucleotide motifs and insertions of different dispersed elements. The detailed analysis of more than 1100 alphoid monomers from junction regions showed that the vast majority of structural alterations and joinings with non-alphoid DNAs occur in alpha satellite families lacking CENP-B boxes. Most analyzed events were found in sequences located toward the edges of the centromeric alphoid arrays. Different dispersed elements were inserted into alphoid DNA at kinkable dinucleotides (TG, CA or TA) situated between pyrimidine/purine tracks. DNA rearrangements resulting from different processes such as recombination and replication occur at kinkable DNA sites, as insertions do, but irrespective of the occurrence of pyrimidine/purine tracks. It seems that kinkable dinucleotides TG, CA and TA are part of recognition signals for many proteins involved in recombination, replication, and insertional events. Alphoid DNA is a good model for studying these processes.
MESH Headings
- Alu Elements/genetics
- Autoantigens
- Base Sequence
- Binding Sites
- Centromere/chemistry
- Centromere/genetics
- Centromere/metabolism
- Centromere Protein B
- Chromosomal Proteins, Non-Histone/metabolism
- Chromosome Deletion
- Chromosome Inversion
- Chromosomes, Human, Pair 21/chemistry
- Chromosomes, Human, Pair 21/genetics
- Chromosomes, Human, Pair 21/metabolism
- Computational Biology
- Crossing Over, Genetic/genetics
- DNA Replication/genetics
- DNA, Satellite/chemistry
- DNA, Satellite/genetics
- DNA, Satellite/metabolism
- DNA-Binding Proteins
- Databases as Topic
- Dinucleotide Repeats/genetics
- Humans
- In Situ Hybridization, Fluorescence
- Lymphocytes
- Mutagenesis, Insertional/genetics
- Mutation/genetics
- Nucleic Acid Conformation
- Polymerase Chain Reaction
- Recombination, Genetic/genetics
Affiliation(s)
- T D Mashkova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 32 Vavilov Str., Moscow, 117984, Russia.
|
17
|
Donev RM. The type of DNA attachment sites recovered from nuclear matrix depends on isolation procedure used. Mol Cell Biochem 2000; 214:103-10. [PMID: 11195781 DOI: 10.1023/a:1007159421204] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Indexed: 11/12/2022]
Abstract
A large variety of DNA sequences have been described in nuclear matrix attachment regions. This is most likely a result of the different methods used for their isolation. This study tested how different types of known DNA sequences (strongly attached to the nuclear matrix, weakly attached, or not attached) directly participate in anchoring DNA loops to nuclear matrices isolated by different experimental procedures. Matrix-attached (M) and matrix-independent or loop (L) fractions as well as nuclear matrices were isolated using extractions of nuclei with buffers containing 25 mM lithium 3,5-diiodosalicylate (LIS), 2 M NaCl, or 0.65 M ammonium sulphate, followed by DNase I/RNase A digestion, or according to the so-called conventional method. Using PCR-based and in vitro binding assays it was established that LIS and ammonium sulphate extractions gave similar results for the type of attachment of the sequences investigated. The harsh extraction with 2 M NaCl or the conventional procedure led to some rearrangements in the attachment of DNA loops. As a result, a large proportion of matrix-attached sequences were found detached in the loop fractions. However, the in vitro binding abilities of the MARs to the nuclear matrices isolated by different methods did not change.
Affiliation(s)
- R M Donev
- Institute of Molecular Biology, Bulgarian Academy of Sciences, Sofia
|
18
|
Bassi C, Magnani I, Sacchi N, Saccone S, Ventura A, Rocchi M, Marozzi A, Ginelli E, Meneveri R. Molecular structure and evolution of DNA sequences located at the alpha satellite boundary of chromosome 20. Gene 2000; 256:43-50. [PMID: 11054534 DOI: 10.1016/s0378-1119(00)00354-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Indexed: 11/22/2022]
Abstract
We have isolated and characterised one PAC clone (dJ233C1) containing a linkage between alphoid and non-alphoid DNA. The non-alphoid DNA was found to map at the pericentromeric region of chromosome 20, both on p and q sides, and to contain homologies with one contig (ctg176, Sanger Centre), also located in the same chromosome region. At variance with the chromosome specificity shown by the majority of non-alphoid DNA, a subset of alphoid repeats derived from the PAC yielded FISH hybridisation signals located at the centromeric region of several human chromosomes, belonging to three different suprachromosomal families. The evolutionary conservation of this boundary region was investigated by comparative FISH experiments on chromosomes from great apes. The non-alphoid DNA was found to have undergone events of expansion and transposition to different pericentromeric regions of great ape chromosomes. Alphoid sequences revealed a very wide distribution of FISH signals in the great apes. The pattern was substantially discordant with the data available in the literature, which is essentially derived from the central alphoid subset. These results add further support to the emerging opinion that the pericentromeric regions are highly plastic, and that the alpha satellite junctions do not share the evolutionary history of the main subsets.
Affiliation(s)
- C Bassi
- Dipartimento di Biologia e Genetica per le Scienze Mediche, Università di Milano, 20133, Milan, Italy
|
19
|
Lo AW, Liao GC, Rocchi M, Choo KH. Extreme reduction of chromosome-specific alpha-satellite array is unusually common in human chromosome 21. Genome Res 1999; 9:895-908. [PMID: 10523519 DOI: 10.1101/gr.9.10.895] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.8] [Indexed: 11/24/2022]
Abstract
Human centromeres contain large arrays of alpha-satellite DNA that are thought to provide centromere function. The arrays show size and sequence variation, but the extent to which extremely low levels of this DNA can occur on normal centromeres is unclear. Using a set of chromosome-specific alpha-satellite probes for each of the human chromosomes, we performed interphase fluorescence in situ hybridization (FISH) in a population-screening study. Our results demonstrate that extreme reduction of chromosome-specific alpha satellite is unusually common in chromosome 21 (screened with the alphaRI probe), with a prevalence of 3.70%, compared to ≤0.12% for each of chromosomes 13 and 17, and 0% for the other chromosomes. No analphoid centromere was identified in >17,000 morphologically normal chromosomes studied. All of the low-alphoid centromeres are fully functional as indicated by their mitotic stability and binding to centromere proteins CENP-B, CENP-C, and CENP-E. Sensitive metaphase FISH analysis of the low-alphoid chromosome 21 centromeres established the presence of residual alphaRI as well as other non-alphaRI alpha-satellite DNA suggesting that centromere function may be provided by (1) the residual alphaRI DNA, (2) other non-alphaRI alpha-satellite sequences, (3) a combination of 1 and 2, or (4) an activated neocentromere DNA. The low-alphoid centromeres, in particular those of chromosome 21, should provide unique opportunities for the study of the evolution and the minimal DNA requirement of the human centromere.
Affiliation(s)
- A W Lo
- The Murdoch Institute, Royal Children's Hospital, Parkville, Victoria 3052, Australia
|