1
|
Morrissey A, Shi J, James DQ, Mahony S. Accurate allocation of multimapped reads enables regulatory element analysis at repeats. Genome Res 2024; 34:937-951. [PMID: 38986578 PMCID: PMC11293539 DOI: 10.1101/gr.278638.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 06/14/2024] [Indexed: 07/12/2024]
Abstract
Transposable elements (TEs) and other repetitive regions have been shown to contain gene regulatory elements, including transcription factor binding sites. However, regulatory elements harbored by repeats have proven difficult to characterize using short-read sequencing assays such as ChIP-seq or ATAC-seq. Most regulatory genomics analysis pipelines discard "multimapped" reads that align equally well to multiple genomic locations. Because multimapped reads arise predominantly from repeats, current analysis pipelines fail to detect a substantial portion of regulatory events that occur in repetitive regions. To address this shortcoming, we developed Allo, a new approach to allocate multimapped reads in an efficient, accurate, and user-friendly manner. Allo combines probabilistic mapping of multimapped reads with a convolutional neural network that recognizes the read distribution features of potential peaks, offering enhanced accuracy in multimapping read assignment. Allo also provides read-level output in the form of a corrected alignment file, making it compatible with existing regulatory genomics analysis pipelines and downstream peak-finders. In a demonstration application on CTCF ChIP-seq data, we show that Allo results in the discovery of thousands of new CTCF peaks. Many of these peaks contain the expected cognate motif and/or serve as TAD boundaries. We additionally apply Allo to a diverse collection of ENCODE ChIP-seq data sets, resulting in multiple previously unidentified interactions between transcription factors and repetitive element families. Finally, we show that Allo may be particularly beneficial in identifying ChIP-seq peaks at centromeres, near segmentally duplicated genes, and in younger TEs, enabling new regulatory analyses in these regions.
Collapse
Affiliation(s)
- Alexis Morrissey
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Jeffrey Shi
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Daniela Q James
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
2
|
Shukla HG, Chakraborty M, Emerson J. Genetic variation in recalcitrant repetitive regions of the Drosophila melanogaster genome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.11.598575. [PMID: 38915508 PMCID: PMC11195212 DOI: 10.1101/2024.06.11.598575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Many essential functions of organisms are encoded in highly repetitive genomic regions, including histones involved in DNA packaging, centromeres that are core components of chromosome segregation, ribosomal RNA comprising the protein translation machinery, telomeres that ensure chromosome integrity, piRNA clusters encoding host defenses against selfish elements, and virtually the entire Y chromosome. These regions, formed by highly similar tandem arrays, pose significant challenges for experimental and informatic study, impeding sequence-level descriptions essential for understanding genetic variation. Here, we report the assembly and variation analysis of such repetitive regions in Drosophila melanogaster, offering significant improvements to the existing community reference assembly. Our work successfully recovers previously elusive segments, including complete reconstructions of the histone locus and the pericentric heterochromatin of the X chromosome, spanning the Stellate locus to the distal flank of the rDNA cluster. To infer structural changes in these regions where alignments are often not practicable, we introduce landmark anchors based on unique variants that are putatively orthologous. These regions display considerable structural variation between different D. melanogaster strains, exhibiting differences in copy number and organization of homologous repeat units between haplotypes. In the histone cluster, although we observe minimal genetic exchange indicative of crossing over, the variation patterns suggest mechanisms such as unequal sister chromatid exchange. We also examine the prevalence and scale of concerted evolution in the histone and Stellate clusters and discuss the mechanisms underlying these observed patterns.
Collapse
Affiliation(s)
- Harsh G. Shukla
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Graduate Program in Mathematical, Computational and Systems Biology, University of California Irvine, Irvine, California 92697, USA
| | - Mahul Chakraborty
- Department of Biology, Texas A&M University, College Station, Texas 77843, USA
| | - J.J. Emerson
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
| |
Collapse
|
3
|
Vitale M, Kranjc N, Leigh J, Kyrou K, Courty T, Marston L, Grilli S, Crisanti A, Bernardini F. Y chromosome shredding in Anopheles gambiae: Insight into the cellular dynamics of a novel synthetic sex ratio distorter. PLoS Genet 2024; 20:e1011303. [PMID: 38848445 PMCID: PMC11189259 DOI: 10.1371/journal.pgen.1011303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 06/20/2024] [Accepted: 05/14/2024] [Indexed: 06/09/2024] Open
Abstract
Despite efforts to explore the genome of the malaria vector Anopheles gambiae, the Y chromosome of this species remains enigmatic. The large number of repetitive and heterochromatic DNA sequences makes the Y chromosome exceptionally difficult to fully assemble, hampering the progress of gene editing techniques and functional studies for this chromosome. In this study, we made use of a bioinformatic platform to identify Y-specific repetitive DNA sequences that served as a target site for a CRISPR/Cas9 system. The activity of Cas9 in the reproductive organs of males caused damage to Y-bearing sperm without affecting their fertility, leading to a strong female bias in the progeny. Cytological investigation allowed us to identify meiotic defects and investigate sperm selection in this new synthetic sex ratio distorter system. In addition, alternative promoters enable us to target the Y chromosome in specific tissues and developmental stages of male mosquitoes, enabling studies that shed light on the role of this chromosome in male gametogenesis. This work paves the way for further insight into the poorly characterised Y chromosome of Anopheles gambiae. Moreover, the sex distorter strain we have generated promises to be a valuable tool for the advancement of studies in the field of developmental biology, with the potential to support the progress of genetic strategies aimed at controlling malaria mosquitoes and other pest species.
Collapse
Affiliation(s)
- Matteo Vitale
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Nace Kranjc
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Jessica Leigh
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Kyrous Kyrou
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Thomas Courty
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Louise Marston
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Silvia Grilli
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Andrea Crisanti
- Department of Life Sciences, Imperial College London, London, United Kingdom
| | - Federica Bernardini
- Department of Life Sciences, Imperial College London, London, United Kingdom
| |
Collapse
|
4
|
Chen E, Trajkovski M, Lee H, Nyovanie S, Martin K, Dean W, Tahiliani M, Plavec J, Yatsunyk L. Structure of native four-repeat satellite III sequence with non-canonical base interactions. Nucleic Acids Res 2024; 52:3390-3405. [PMID: 38381082 PMCID: PMC11014236 DOI: 10.1093/nar/gkae113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 01/31/2024] [Accepted: 02/06/2024] [Indexed: 02/22/2024] Open
Abstract
Tandem-repetitive DNA (where two or more DNA bases are repeated numerous times) can adopt non-canonical secondary structures. Many of these structures are implicated in important biological processes. Human Satellite III (HSat3) is enriched for tandem repeats of the sequence ATGGA and is located in pericentromeric heterochromatin in many human chromosomes. Here, we investigate the secondary structure of the four-repeat HSat3 sequence 5'-ATGGA ATGGA ATGGA ATGGA-3' using X-ray crystallography, NMR, and biophysical methods. Circular dichroism spectroscopy, thermal stability, native PAGE, and analytical ultracentrifugation indicate that this sequence folds into a monomolecular hairpin with non-canonical base pairing and B-DNA characteristics at concentrations below 0.9 mM. NMR studies at 0.05-0.5 mM indicate that the hairpin is likely folded-over into a compact structure with high dynamics. Crystallographic studies at 2.5 mM reveal an antiparallel self-complementary duplex with the same base pairing as in the hairpin, extended into an infinite polymer. The non-canonical base pairing includes a G-G intercalation sandwiched by sheared A-G base pairs, leading to a cross-strand four guanine stack, so called guanine zipper. The guanine zippers are spaced throughout the structure by A-T/T-A base pairs. Our findings lend further insight into recurring structural motifs associated with the HSat3 and their potential biological functions.
Collapse
Affiliation(s)
- Erin Chen
- Department of Chemistry and Biochemistry, Swarthmore College, 500 College Ave, Swarthmore, PA 19081, USA
| | - Marko Trajkovski
- Slovenian NMR Centre, National Institute of Chemistry, Hajdrihova 19, 1000 Ljubljana, Slovenia
| | - Hyun Kyung Lee
- Department of Chemistry and Biochemistry, Swarthmore College, 500 College Ave, Swarthmore, PA 19081, USA
| | - Samantha Nyovanie
- Department of Chemistry and Biochemistry, Swarthmore College, 500 College Ave, Swarthmore, PA 19081, USA
| | - Kailey N Martin
- Department of Chemistry and Biochemistry, Swarthmore College, 500 College Ave, Swarthmore, PA 19081, USA
| | - William L Dean
- Structural Biology Program JG Brown Cancer Center, University of Louisville, Louisville, KY 40202, USA
| | - Mamta Tahiliani
- Department of Biology, New York University, New York, NY 10003, USA
| | - Janez Plavec
- Slovenian NMR Centre, National Institute of Chemistry, Hajdrihova 19, 1000 Ljubljana, Slovenia
| | - Liliya A Yatsunyk
- Department of Chemistry and Biochemistry, Swarthmore College, 500 College Ave, Swarthmore, PA 19081, USA
| |
Collapse
|
5
|
de Oliveira AM, Souza GM, Toma GA, Dos Santos N, Dos Santos RZ, Goes CAG, Deon GA, Setti PG, Porto-Foresti F, Utsunomia R, Gunski RJ, Del Valle Garnero A, Herculano Correa de Oliveira E, Kretschmer R, Cioffi MDB. Satellite DNAs, heterochromatin, and sex chromosomes of the wattled jacana (Charadriiformes; Jacanidae): a species with highly rearranged karyotype. Genome 2024; 67:109-118. [PMID: 38316150 DOI: 10.1139/gen-2023-0082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2024]
Abstract
Charadriiformes, which comprises shorebirds and their relatives, is one of the most diverse avian orders, with over 390 species showing a wide range of karyotypes. Here, we isolated and characterized the whole collection of satellite DNAs (satDNAs) at both molecular and cytogenetic levels of one of its representative species, named the wattled jacana (Jacana jacana), a species that contains a typical ZZ/ZW sex chromosome system and a highly rearranged karyotype. In addition, we also investigate the in situ location of telomeric and microsatellite repeats. A small catalog of 11 satDNAs was identified that typically accumulated on microchromosomes and on the W chromosome. The latter also showed a significant accumulation of telomeric signals, being (GA)10 the only microsatellite with positive hybridization signals among all the 16 tested ones. These current findings contribute to our understanding of the genomic organization of repetitive DNAs in a bird species with high degree of chromosomal reorganization contrary to the majority of bird species that have stable karyotypes.
Collapse
Affiliation(s)
- Alan Moura de Oliveira
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| | - Guilherme Mota Souza
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| | - Gustavo Akira Toma
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| | | | | | | | - Geize Aparecida Deon
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| | - Princia Grejo Setti
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| | | | | | | | | | | | - Rafael Kretschmer
- Departamento de Ecologia, Zoologia e Genética, Instituto de Biologia, Universidade Federal de Pelotas, Pelotas, Rio Grande do Sul, Brazil
| | - Marcelo de Bello Cioffi
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| |
Collapse
|
6
|
Rico-Porras JM, Mora P, Palomeque T, Montiel EE, Cabral-de-Mello DC, Lorite P. Heterochromatin Is Not the Only Place for satDNAs: The High Diversity of satDNAs in the Euchromatin of the Beetle Chrysolina americana (Coleoptera, Chrysomelidae). Genes (Basel) 2024; 15:395. [PMID: 38674330 PMCID: PMC11049206 DOI: 10.3390/genes15040395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 03/16/2024] [Accepted: 03/21/2024] [Indexed: 04/28/2024] Open
Abstract
The satellitome of the beetle Chrysolina americana Linneo, 1758 has been characterized through chromosomal analysis, genomic sequencing, and bioinformatics tools. C-banding reveals the presence of constitutive heterochromatin blocks enriched in A+T content, primarily located in pericentromeric regions. Furthermore, a comprehensive satellitome analysis unveils the extensive diversity of satellite DNA families within the genome of C. americana. Using fluorescence in situ hybridization techniques and the innovative CHRISMAPP approach, we precisely map the localization of satDNA families on assembled chromosomes, providing insights into their organization and distribution patterns. Among the 165 identified satDNA families, only three of them exhibit a remarkable amplification and accumulation, forming large blocks predominantly in pericentromeric regions. In contrast, the remaining, less abundant satDNA families are dispersed throughout euchromatic regions, challenging the traditional association of satDNA with heterochromatin. Overall, our findings underscore the complexity of repetitive DNA elements in the genome of C. americana and emphasize the need for further exploration to elucidate their functional significance and evolutionary implications.
Collapse
Affiliation(s)
- José M. Rico-Porras
- Department of Experimental Biology, Genetics Area, University of Jaén, Paraje las Lagunillas s/n, 23071 Jaén, Spain; (J.M.R.-P.); (P.M.); (T.P.)
| | - Pablo Mora
- Department of Experimental Biology, Genetics Area, University of Jaén, Paraje las Lagunillas s/n, 23071 Jaén, Spain; (J.M.R.-P.); (P.M.); (T.P.)
| | - Teresa Palomeque
- Department of Experimental Biology, Genetics Area, University of Jaén, Paraje las Lagunillas s/n, 23071 Jaén, Spain; (J.M.R.-P.); (P.M.); (T.P.)
| | - Eugenia E. Montiel
- Department of Biology, Genetics, Faculty of Sciences, Autonomous University of Madrid, 28049 Madrid, Spain;
- Center for Research in Biodiversity and Global Change, Autonomous University of Madrid, 28049 Madrid, Spain
| | - Diogo C. Cabral-de-Mello
- Department of General and Applied Biology, Institute of Biosciences/IB, UNESP—São Paulo State University, Rio Claro 13506-900, SP, Brazil;
| | - Pedro Lorite
- Department of Experimental Biology, Genetics Area, University of Jaén, Paraje las Lagunillas s/n, 23071 Jaén, Spain; (J.M.R.-P.); (P.M.); (T.P.)
| |
Collapse
|
7
|
Flynn JM, Yamashita YM. The implications of satellite DNA instability on cellular function and evolution. Semin Cell Dev Biol 2024; 156:152-159. [PMID: 37852904 DOI: 10.1016/j.semcdb.2023.10.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/21/2023] [Accepted: 10/11/2023] [Indexed: 10/20/2023]
Abstract
Abundant tandemly repeated satellite DNA is present in most eukaryotic genomes. Previous limitations including a pervasive view that it was uninteresting junk DNA, combined with challenges in studying it, are starting to dissolve - and recent studies have found important functions for satellite DNAs. The observed rapid evolution and implied instability of satellite DNA now has important significance for their functions and maintenance within the genome. In this review, we discuss the processes that lead to satellite DNA copy number instability, and the importance of mechanisms to manage the potential negative effects of instability. Satellite DNA is vulnerable to challenges during replication and repair, since it forms difficult-to-process secondary structures and its homology within tandem arrays can result in various types of recombination. Satellite DNA instability may be managed by DNA or chromatin-binding proteins ensuring proper nuclear localization and repair, or by proteins that process aberrant structures that satellite DNAs tend to form. We also discuss the pattern of satellite DNA mutations from recent mutation accumulation (MA) studies that have tracked changes in satellite DNA for up to 1000 generations with minimal selection. Finally, we highlight examples of satellite evolution from studies that have characterized satellites across millions of years of Drosophila fruit fly evolution, and discuss possible ways that selection might act on the satellite DNA composition.
Collapse
Affiliation(s)
- Jullien M Flynn
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA.
| | - Yukiko M Yamashita
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA; Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
8
|
Grishanin A. Chromatin diminution as a tool to study some biological problems. COMPARATIVE CYTOGENETICS 2024; 18:27-49. [PMID: 38369988 PMCID: PMC10870232 DOI: 10.3897/compcytogen.17.112152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Accepted: 01/21/2024] [Indexed: 02/20/2024]
Abstract
This work reveals the opportunities to obtain additional information about some biological problems through studying species that possess chromatin diminution. A brief review of the hypothesized biological significance of chromatin diminution is discussed. This article analyzes the biological role of chromatin diminution as it relates to the C-value enigma. It is proposed to consider chromatin diminution as a universal mechanism of genome reduction, reducing the frequency of recombination events in the genome, which leads to specialization and adaptation of the species to more narrow environmental conditions. A hypothesis suggesting the role of non-coding DNA in homologous recombination in eukaryotes is proposed. Cyclopskolensis Lilljeborg, 1901 (Copepoda, Crustacea) is proposed as a model species for studying the mechanisms of transformation of the chromosomes and interphase nuclei structure of somatic line cells due to chromatin diminution. Chromatin diminution in copepods is considered as a stage of irreversible differentiation of embryonic cells during ontogenesis. The process of speciation in cyclopoids with chromatin diminution is considered.
Collapse
Affiliation(s)
- Andrey Grishanin
- Papanin Institute for Biology of Inland Waters, Russian Academy of Sciences, 152742 Borok, Yaroslavl Prov., RussiaRussian Academy of SciencesBorokRussia
- Department of Biophisics, Faculty of Natural and Engineering Sciences, Dubna State University, Universitetskaya 19, 141980, Dubna, Moscow Prov., RussiaDubna State UniversityDubnaRussia
| |
Collapse
|
9
|
Liu J, Lin X, Wang X, Feng L, Zhu S, Tian R, Fang J, Tao A, Fang P, Qi J, Zhang L, Huang Y, Xu J. Genomic and cytogenetic analyses reveal satellite repeat signature in allotetraploid okra (Abelmoschus esculentus). BMC PLANT BIOLOGY 2024; 24:71. [PMID: 38267860 PMCID: PMC10809672 DOI: 10.1186/s12870-024-04739-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 01/10/2024] [Indexed: 01/26/2024]
Abstract
BACKGROUND Satellite repeats are one of the most rapidly evolving components in eukaryotic genomes and play vital roles in genome regulation, genome evolution, and speciation. As a consequence, the composition, abundance and chromosome distribution of satellite repeats often exhibit variability across various species, genome, and even individual chromosomes. However, we know little about the satellite repeat evolution in allopolyploid genomes. RESULTS In this study, we investigated the satellite repeat signature in five okra (Abelmoschus esculentus) accessions using genomic and cytogenetic methods. In each of the five accessions, we identified eight satellite repeats, which exhibited a significant level of intraspecific conservation. Through fluorescence in situ hybridization (FISH) experiments, we observed that the satellite repeats generated multiple signals and exhibited variations in copy number across chromosomes. Intriguingly, we found that five satellite repeats were interspersed with centromeric retrotransposons, signifying their involvement in centromeric satellite repeat identity. We confirmed subgenome-biased amplification patterns of these satellite repeats through existing genome assemblies or dual-color FISH, indicating their distinct dynamic evolution in the allotetraploid okra subgenome. Moreover, we observed the presence of multiple chromosomes harboring the 35 S rDNA loci, alongside another chromosomal pair carrying the 5 S rDNA loci in okra using FISH assay. Remarkably, the intensity of 35 S rDNA hybridization signals varied among chromosomes, with the signals predominantly localized within regions of relatively weak DAPI staining, associated with GC-rich heterochromatin regions. Finally, we observed a similar localization pattern between 35 S rDNA and three satellite repeats with high GC content and confirmed their origin in the intergenic spacer region of the 35 S rDNA. CONCLUSIONS Our findings uncover a unique satellite repeat signature in the allotetraploid okra, contributing to our understanding of the composition, abundance, and chromosomal distribution of satellite repeats in allopolyploid genomes, further enriching our understanding of their evolutionary dynamics in complex allopolyploid genomes.
Collapse
Affiliation(s)
- Jiarui Liu
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Xinyi Lin
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Xiaojie Wang
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Liqing Feng
- College of Life Science, Fujian Normal University, Fuzhou, 350117, China
| | - Shixin Zhu
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Runmeng Tian
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Jingping Fang
- College of Life Science, Fujian Normal University, Fuzhou, 350117, China
| | - Aifen Tao
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Pingping Fang
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Jianmin Qi
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Liwu Zhang
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China
| | - Yongji Huang
- Ministerial and Provincial Joint Innovation Centre for Safety Production of Cross-Strait Crops, College of Geography and Oceanography, Minjiang University, Fuzhou, 350108, China.
| | - Jiantang Xu
- Scientific Observing and Experimental Station of Southeastern kenaf & jute, Ministry of Agriculture and Rural Affairs of the People's Republic of China, Key Laboratory of Ministry of Education for Genetics, Breeding and Multiple Utilization of Crops, Fujian Provincial Key Laboratory of Crop Breeding by Design, National Engineering Research Center for Sugarcane, College of Agriculture, Fujian Agriculture and Forestry University, Fuzhou, 350002, China.
| |
Collapse
|
10
|
Zhang Y, Chu J, Cheng H, Li H. De novo reconstruction of satellite repeat units from sequence data. Genome Res 2023; 33:1994-2001. [PMID: 37918962 PMCID: PMC10760446 DOI: 10.1101/gr.278005.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 10/18/2023] [Indexed: 11/04/2023]
Abstract
Satellite DNA are long tandemly repeating sequences in a genome and may be organized as high-order repeats (HORs). They are enriched in centromeres and are challenging to assemble. Existing algorithms for identifying satellite repeats either require the complete assembly of satellites or only work for simple repeat structures without HORs. Here we describe Satellite Repeat Finder (SRF), a new algorithm for reconstructing satellite repeat units and HORs from accurate reads or assemblies without prior knowledge on repeat structures. Applying SRF to real sequence data, we show that SRF could reconstruct known satellites in human and well-studied model organisms. We also find satellite repeats are pervasive in various other species, accounting for up to 12% of their genome contents but are often underrepresented in assemblies. With the rapid progress in genome sequencing, SRF will help the annotation of new genomes and the study of satellite DNA evolution even if such repeats are not fully assembled.
Collapse
Affiliation(s)
- Yujie Zhang
- Harvard School of Public Health, Boston, Massachusetts 02115, USA
| | - Justin Chu
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Haoyu Cheng
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA;
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
| |
Collapse
|
11
|
Morrissey A, Shi J, James DQ, Mahony S. Allo: Accurate allocation of multi-mapped reads enables regulatory element analysis at repeats. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.12.556916. [PMID: 37745557 PMCID: PMC10515862 DOI: 10.1101/2023.09.12.556916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Transposable elements (TEs) and other repetitive regions have been shown to contain gene regulatory elements, including transcription factor binding sites. Unfortunately, regulatory elements harbored by repeats have proven difficult to characterize using short-read sequencing assays such as ChIP-seq or ATAC-seq. Most regulatory genomics analysis pipelines discard "multi-mapped" reads that align equally well to multiple genomic locations. Since multi-mapped reads arise predominantly from repeats, current analysis pipelines fail to detect a substantial portion of regulatory events that occur in repetitive regions. To address this shortcoming, we developed Allo, a new approach to allocate multi-mapped reads in an efficient, accurate, and user-friendly manner. Allo combines probabilistic mapping of multi-mapped reads with a convolutional neural network that recognizes the read distribution features of potential peaks, offering enhanced accuracy in multi-mapping read assignment. Allo also provides read-level output in the form of a corrected alignment file, making it compatible with existing regulatory genomics analysis pipelines and downstream peak-finders. In a demonstration application on CTCF ChIP-seq data, we show that Allo results in the discovery of thousands of new CTCF peaks. Many of these peaks contain the expected cognate motif and/or serve as TAD boundaries. We additionally apply Allo to a diverse collection of ENCODE ChIP-seq datasets, resulting in multiple previously unidentified interactions between transcription factors and repetitive element families. Finally, we show that Allo may be particularly effective in identifying ChIP-seq peaks in younger TEs, which hold evolutionary significance due to their emergence during human evolution from primates.
Collapse
Affiliation(s)
- Alexis Morrissey
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Jeffrey Shi
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Daniela Q. James
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - Shaun Mahony
- Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
12
|
Gaskill MM, Soluri IV, Branks AE, Boka AP, Stadler MR, Vietor K, Huang HYS, Gibson TJ, Mukherjee A, Mir M, Blythe SA, Harrison MM. Localization of the Drosophila pioneer factor GAF to subnuclear foci is driven by DNA binding and required to silence satellite repeat expression. Dev Cell 2023; 58:1610-1624.e8. [PMID: 37478844 PMCID: PMC10528433 DOI: 10.1016/j.devcel.2023.06.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 04/19/2023] [Accepted: 06/29/2023] [Indexed: 07/23/2023]
Abstract
The eukaryotic genome is organized to enable the precise regulation of gene expression. This organization is established as the embryo transitions from a fertilized gamete to a totipotent zygote. To understand the factors and processes that drive genomic organization, we focused on the pioneer factor GAGA factor (GAF) that is required for early development in Drosophila. GAF transcriptionally activates the zygotic genome and is localized to subnuclear foci. This non-uniform distribution is driven by binding to highly abundant GA repeats. At GA repeats, GAF is necessary to form heterochromatin and silence transcription. Thus, GAF is required to establish both active and silent regions. We propose that foci formation enables GAF to have opposing transcriptional roles within a single nucleus. Our data support a model in which the subnuclear concentration of transcription factors acts to organize the nucleus into functionally distinct domains essential for the robust regulation of gene expression.
Collapse
Affiliation(s)
- Marissa M Gaskill
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Isabella V Soluri
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA
| | - Annemarie E Branks
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Alan P Boka
- Biochemistry and Molecular Biophysics Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA; Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Michael R Stadler
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Katherine Vietor
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Hao-Yu S Huang
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Tyler J Gibson
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Apratim Mukherjee
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Mustafa Mir
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA; Institute for Regenerative, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Shelby A Blythe
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA
| | - Melissa M Harrison
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
13
|
de Moraes RLR, de Menezes Cavalcante Sassi F, Vidal JAD, Goes CAG, dos Santos RZ, Stornioli JHF, Porto-Foresti F, Liehr T, Utsunomia R, de Bello Cioffi M. Chromosomal Rearrangements and Satellite DNAs: Extensive Chromosome Reshuffling and the Evolution of Neo-Sex Chromosomes in the Genus Pyrrhulina (Teleostei; Characiformes). Int J Mol Sci 2023; 24:13654. [PMID: 37686460 PMCID: PMC10563077 DOI: 10.3390/ijms241713654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 08/31/2023] [Accepted: 09/02/2023] [Indexed: 09/10/2023] Open
Abstract
Chromosomal rearrangements play a significant role in the evolution of fish genomes, being important forces in the rise of multiple sex chromosomes and in speciation events. Repetitive DNAs constitute a major component of the genome and are frequently found in heterochromatic regions, where satellite DNA sequences (satDNAs) usually represent their main components. In this work, we investigated the association of satDNAs with chromosome-shuffling events, as well as their potential relevance in both sex and karyotype evolution, using the well-known Pyrrhulina fish model. Pyrrhulina species have a conserved karyotype dominated by acrocentric chromosomes present in all examined species up to date. However, two species, namely P. marilynae and P. semifasciata, stand out for exhibiting unique traits that distinguish them from others in this group. The first shows a reduced diploid number (with 2n = 32), while the latter has a well-differentiated multiple X1X2Y sex chromosome system. In addition to isolating and characterizing the full collection of satDNAs (satellitomes) of both species, we also in situ mapped these sequences in the chromosomes of both species. Moreover, the satDNAs that displayed signals on the sex chromosomes of P. semifasciata were also mapped in some phylogenetically related species to estimate their potential accumulation on proto-sex chromosomes. Thus, a large collection of satDNAs for both species, with several classes being shared between them, was characterized for the first time. In addition, the possible involvement of these satellites in the karyotype evolution of P. marilynae and P. semifasciata, especially sex-chromosome formation and karyotype reduction in P. marilynae, could be shown.
Collapse
Affiliation(s)
- Renata Luiza Rosa de Moraes
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos 13565-905, SP, Brazil; (R.L.R.d.M.); (F.d.M.C.S.); (J.A.D.V.)
- Institute of Human Genetics, University Hospital Jena, 07747 Jena, Germany
| | - Francisco de Menezes Cavalcante Sassi
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos 13565-905, SP, Brazil; (R.L.R.d.M.); (F.d.M.C.S.); (J.A.D.V.)
- Institute of Human Genetics, University Hospital Jena, 07747 Jena, Germany
| | - Jhon Alex Dziechciarz Vidal
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos 13565-905, SP, Brazil; (R.L.R.d.M.); (F.d.M.C.S.); (J.A.D.V.)
| | - Caio Augusto Gomes Goes
- Faculdade de Ciências, UNESP, Bauru 17033-36, SP, Brazil; (C.A.G.G.); (R.Z.d.S.); (F.P.-F.); (R.U.)
| | - Rodrigo Zeni dos Santos
- Faculdade de Ciências, UNESP, Bauru 17033-36, SP, Brazil; (C.A.G.G.); (R.Z.d.S.); (F.P.-F.); (R.U.)
| | - José Henrique Forte Stornioli
- Institute of Biological Sciences and Health, Universidade Federal Rural do Rio de Janeiro, Seropédica 23890-000, RJ, Brazil;
| | - Fábio Porto-Foresti
- Faculdade de Ciências, UNESP, Bauru 17033-36, SP, Brazil; (C.A.G.G.); (R.Z.d.S.); (F.P.-F.); (R.U.)
| | - Thomas Liehr
- Institute of Human Genetics, University Hospital Jena, 07747 Jena, Germany
| | - Ricardo Utsunomia
- Faculdade de Ciências, UNESP, Bauru 17033-36, SP, Brazil; (C.A.G.G.); (R.Z.d.S.); (F.P.-F.); (R.U.)
| | - Marcelo de Bello Cioffi
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos 13565-905, SP, Brazil; (R.L.R.d.M.); (F.d.M.C.S.); (J.A.D.V.)
- Institute of Human Genetics, University Hospital Jena, 07747 Jena, Germany
| |
Collapse
|
14
|
Zhang Y, Chu J, Cheng H, Li H. De novo reconstruction of satellite repeat units from sequence data. ARXIV 2023:arXiv:2304.09729v1. [PMID: 37131874 PMCID: PMC10153287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Satellite DNA are long tandemly repeating sequences in a genome and may be organized as high-order repeats (HORs). They are enriched in centromeres and are challenging to assemble. Existing algorithms for identifying satellite repeats either require the complete assembly of satellites or only work for simple repeat structures without HORs. Here we describe Satellite Repeat Finder (SRF), a new algorithm for reconstructing satellite repeat units and HORs from accurate reads or assemblies without prior knowledge on repeat structures. Applying SRF to real sequence data, we showed that SRF could reconstruct known satellites in human and well-studied model organisms. We also found satellite repeats are pervasive in various other species, accounting for up to 12% of their genome contents but are often underrepresented in assemblies. With the rapid progress on genome sequencing, SRF will help the annotation of new genomes and the study of satellite DNA evolution even if such repeats are not fully assembled.
Collapse
Affiliation(s)
- Yujie Zhang
- Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115, USA
| | - Justin Chu
- Department of Data Science, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA 02115, USA
| | - Haoyu Cheng
- Department of Data Science, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA 02115, USA
| | - Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA 02115, USA
| |
Collapse
|
15
|
Šatović-Vukšić E, Plohl M. Satellite DNAs-From Localized to Highly Dispersed Genome Components. Genes (Basel) 2023; 14:genes14030742. [PMID: 36981013 PMCID: PMC10048060 DOI: 10.3390/genes14030742] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 03/15/2023] [Accepted: 03/16/2023] [Indexed: 03/30/2023] Open
Abstract
According to the established classical view, satellite DNAs are defined as abundant non-coding DNA sequences repeated in tandem that build long arrays located in heterochromatin. Advances in sequencing methodologies and development of specialized bioinformatics tools enabled defining a collection of all repetitive DNAs and satellite DNAs in a genome, the repeatome and the satellitome, respectively, as well as their reliable annotation on sequenced genomes. Supported by various non-model species included in recent studies, the patterns of satellite DNAs and satellitomes as a whole showed much more diversity and complexity than initially thought. Differences are not only in number and abundance of satellite DNAs but also in their distribution across the genome, array length, interspersion patterns, association with transposable elements, localization in heterochromatin and/or in euchromatin. In this review, we compare characteristic organizational features of satellite DNAs and satellitomes across different animal and plant species in order to summarize organizational forms and evolutionary processes that may lead to satellitomes' diversity and revisit some basic notions regarding repetitive DNA landscapes in genomes.
Collapse
Affiliation(s)
- Eva Šatović-Vukšić
- Division of Molecular Biology, Ruđer Bošković Institute, 10000 Zagreb, Croatia
| | - Miroslav Plohl
- Division of Molecular Biology, Ruđer Bošković Institute, 10000 Zagreb, Croatia
| |
Collapse
|
16
|
Gutiérrez J, Aleix-Mata G, Montiel EE, Cabral-de-Mello DC, Marchal JA, Sánchez A. Satellitome Analysis on Talpa aquitania Genome and Inferences about the satDNAs Evolution on Some Talpidae. Genes (Basel) 2022; 14:117. [PMID: 36672858 PMCID: PMC9859602 DOI: 10.3390/genes14010117] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 12/27/2022] [Accepted: 12/28/2022] [Indexed: 01/04/2023] Open
Abstract
In the genus Talpa a new species, named Talpa aquitania, has been recently described. Only cytogenetic data are available for the nuclear genome of this species. In this work, we characterize the satellitome of the T. aquitania genome that presents 16 different families, including telomeric sequences, and they represent 1.24% of the genome. The first satellite DNA family (TaquSat1-183) represents 0.558%, and six more abundant families, including TaquSat1-183, comprise 1.13%, while the remaining 11 sat-DNAs represent only 0.11%. The average A + T content of the SatDNA families was 50.43% and the median monomer length was 289.24 bp. The analysis of these SatDNAs indicated that they have different grades of clusterization, homogenization, and degeneration. Most of the satDNA families are present in the genomes of the other Talpa species analyzed, while in the genomes of other more distant species of Talpidae, only some of them are present, in accordance with the library hypothesis. Moreover, chromosomal localization by FISH revealed that some satDNAs are localized preferentially on centromeric and non-centromeric heterochromatin in T. aquitania and also in the sister species T. occidentalis karyotype. The differences observed between T. aquitania and the close relative T. occidentalis and T. europaea suggested that the satellitome is a very dynamic component of the genomes and that the satDNAs could be responsible for chromosomal differences between the species. Finally, in a broad context, these data contribute to the understanding of the evolution of satellitomes on mammals.
Collapse
Affiliation(s)
- Juana Gutiérrez
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Paraje de las Lagunillas s/n, 23071 Jaén, Spain
| | - Gaël Aleix-Mata
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Paraje de las Lagunillas s/n, 23071 Jaén, Spain
| | - Eugenia E. Montiel
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Paraje de las Lagunillas s/n, 23071 Jaén, Spain
| | - Diogo C. Cabral-de-Mello
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Paraje de las Lagunillas s/n, 23071 Jaén, Spain
- Departamento de Biologia Geral e Aplicada, Instituto de Biociências/IB, UNESP—Universidade Estadual Paulista, Rio Claro, São Paulo 13506-900, Brazil
| | - Juan Alberto Marchal
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Paraje de las Lagunillas s/n, 23071 Jaén, Spain
| | - Antonio Sánchez
- Departamento de Biología Experimental, Área de Genética, Universidad de Jaén, Paraje de las Lagunillas s/n, 23071 Jaén, Spain
| |
Collapse
|
17
|
Goes CAG, dos Santos N, Rodrigues PHDM, Stornioli JHF, da Silva AB, dos Santos RZ, Vidal JAD, Silva DMZDA, Artoni RF, Foresti F, Hashimoto DT, Porto-Foresti F, Utsunomia R. The Satellite DNA Catalogues of Two Serrasalmidae (Teleostei, Characiformes): Conservation of General satDNA Features over 30 Million Years. Genes (Basel) 2022; 14:91. [PMID: 36672835 PMCID: PMC9859320 DOI: 10.3390/genes14010091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 12/08/2022] [Accepted: 12/24/2022] [Indexed: 12/31/2022] Open
Abstract
Satellite DNAs (satDNAs) are tandemly repeated sequences that are usually located on the heterochromatin, and the entire collection of satDNAs within a genome is called satellitome. Primarily, these sequences are not under selective pressure and evolve by concerted evolution, resulting in elevated rates of divergence between the satDNA profiles of reproductive isolated species/populations. Here, we characterized two additional satellitomes of Characiformes fish (Colossoma macropomum and Piaractus mesopotamicus) that diverged approximately 30 million years ago, while still retaining conserved karyotype features. The results we obtained indicated that several satDNAs (50% of satellite sequences in P. mesopotamicus and 43% in C. macropomum) show levels of conservation between the analyzed species, in the nucleotide and chromosomal levels. We propose that long-life cycles and few genomic changes could slow down rates of satDNA differentiation.
Collapse
Affiliation(s)
| | - Natalia dos Santos
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
| | | | - José Henrique Forte Stornioli
- Institute of Biological Sciences and Health, Federal Rural University of Rio de Janeiro, Seropédica 23890-000, RJ, Brazil
| | - Amanda Bueno da Silva
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
| | | | - Jhon Alex Dziechciarz Vidal
- Department of Structural, Molecular and Genetic Biology, State University of Ponta Grossa, Ponta Grossa 84030-900, PR, Brazil
- Department of Genetics and Evolution, Federal University of São Carlos, São Carlos 13565-905, SP, Brazil
| | | | - Roberto Ferreira Artoni
- Department of Structural, Molecular and Genetic Biology, State University of Ponta Grossa, Ponta Grossa 84030-900, PR, Brazil
- Department of Genetics and Evolution, Federal University of São Carlos, São Carlos 13565-905, SP, Brazil
| | - Fausto Foresti
- Department of Structural and Functional Biology, Institute of Biosciences, São Paulo State University, Botucatu 18618-970, SP, Brazil
| | - Diogo Teruo Hashimoto
- Aquaculture Center of UNESP, São Paulo State University, Jaboticabal 14884-900, SP, Brazil
| | - Fábio Porto-Foresti
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
- Aquaculture Center of UNESP, São Paulo State University, Jaboticabal 14884-900, SP, Brazil
| | - Ricardo Utsunomia
- Faculty of Sciences, São Paulo State University (UNESP), Bauru 17033-360, SP, Brazil
- Institute of Biological Sciences and Health, Federal Rural University of Rio de Janeiro, Seropédica 23890-000, RJ, Brazil
- Aquaculture Center of UNESP, São Paulo State University, Jaboticabal 14884-900, SP, Brazil
| |
Collapse
|
18
|
Telomeres and Their Neighbors. Genes (Basel) 2022; 13:genes13091663. [PMID: 36140830 PMCID: PMC9498494 DOI: 10.3390/genes13091663] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 09/08/2022] [Accepted: 09/09/2022] [Indexed: 11/21/2022] Open
Abstract
Telomeres are essential structures formed from satellite DNA repeats at the ends of chromosomes in most eukaryotes. Satellite DNA repeat sequences are useful markers for karyotyping, but have a more enigmatic role in the eukaryotic cell. Much work has been done to investigate the structure and arrangement of repetitive DNA elements in classical models with implications for species evolution. Still more is needed until there is a complete picture of the biological function of DNA satellite sequences, particularly when considering non-model organisms. Celebrating Gregor Mendel’s anniversary by going to the roots, this review is designed to inspire and aid new research into telomeres and satellites with a particular focus on non-model organisms and accessible experimental and in silico methods that do not require specialized equipment or expensive materials. We describe how to identify telomere (and satellite) repeats giving many examples of published (and some unpublished) data from these techniques to illustrate the principles behind the experiments. We also present advice on how to perform and analyse such experiments, including details of common pitfalls. Our examples are a selection of recent developments and underexplored areas of research from the past. As a nod to Mendel’s early work, we use many examples from plants and insects, especially as much recent work has expanded beyond the human and yeast models traditional in telomere research. We give a general introduction to the accepted knowledge of telomere and satellite systems and include references to specialized reviews for the interested reader.
Collapse
|
19
|
Arora UP, Dumont BL. Meiotic drive in house mice: mechanisms, consequences, and insights for human biology. Chromosome Res 2022; 30:165-186. [PMID: 35829972 PMCID: PMC9509409 DOI: 10.1007/s10577-022-09697-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Revised: 04/20/2022] [Accepted: 04/27/2022] [Indexed: 11/27/2022]
Abstract
Meiotic drive occurs when one allele at a heterozygous site cheats its way into a disproportionate share of functional gametes, violating Mendel's law of equal segregation. This genetic conflict typically imposes a fitness cost to individuals, often by disrupting the process of gametogenesis. The evolutionary impact of meiotic drive is substantial, and the phenomenon has been associated with infertility and reproductive isolation in a wide range of organisms. However, cases of meiotic drive in humans remain elusive, a finding that likely reflects the inherent challenges of detecting drive in our species rather than unique features of human genome biology. Here, we make the case that house mice (Mus musculus) present a powerful model system to investigate the mechanisms and consequences of meiotic drive and facilitate translational inferences about the scope and potential mechanisms of drive in humans. We first detail how different house mouse resources have been harnessed to identify cases of meiotic drive and the underlying mechanisms utilized to override Mendel's rules of inheritance. We then summarize the current state of knowledge of meiotic drive in the mouse genome. We profile known mechanisms leading to transmission bias at several established drive elements. We discuss how a detailed understanding of meiotic drive in mice can steer the search for drive elements in our own species. Lastly, we conclude with a prospective look into how new technologies and molecular tools can help resolve lingering mysteries about the prevalence and mechanisms of selfish DNA transmission in mammals.
Collapse
Affiliation(s)
- Uma P Arora
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME, 04609, USA
- Graduate School of Biomedical Sciences, Tufts University, 136 Harrison Ave, Boston, MA, 02111, USA
| | - Beth L Dumont
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME, 04609, USA.
- Graduate School of Biomedical Sciences, Tufts University, 136 Harrison Ave, Boston, MA, 02111, USA.
| |
Collapse
|
20
|
Identification and characterization of a new family of long satellite DNA, specific of true toads (Anura, Amphibia, Bufonidae). Sci Rep 2022; 12:13960. [PMID: 35978080 PMCID: PMC9385698 DOI: 10.1038/s41598-022-18051-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 08/04/2022] [Indexed: 11/08/2022] Open
Abstract
Amphibians have some of the most variable genome sizes among vertebrates. Genome size variation has been attributed to repetitive and noncoding DNA, including satellite repeats, transposable elements, introns, and nuclear insertions of viral and organelle DNA. In vertebrates, satellite DNAs have been widely described in mammals, but few molecular studies have been carried out in amphibians. Here, we provide a detailed characterization of a new family of satellite DNA, present in all 15 examined species of the family Bufonidae. Southern-blot analysis and PCR reveal that this satellite is formed by monomers of 807 bp, is organized in tandem arrays, and has an AT-content of 57.4%. Phylogenetic analyses show that most clades exhibit species-specific variances, indicating that this satellite DNA has evolved by concerted evolution. The homogenization/fixation process is heterogeneous in Bufonidae, where the genera Bufo and Bufotes do not show species-specific differences, while populations from Rhinella marina exhibit population-specific changes. Additionally, variants of this satellite DNA have been identified in Duttaphrynus melanostictus and R. marina, supporting the 'library hypothesis' (a set, 'library', of satellite DNAs is shared by a species group). Physical mapping in Bufo bufo, Bufo spinosus, Epidalea calamita and Bufotes viridis provides evidence that this repetitive DNA is not dispersed in the karyotype, but accumulated in pericentromeric regions of some chromosomal pairs. This location, together with its presence in the transcriptomes of bufonids, could indicate a role in centromere function or heterochromatin formation and maintenance.
Collapse
|
21
|
Kirov I, Kolganova E, Dudnikov M, Yurkevich OY, Amosova AV, Muravenko OV. A Pipeline NanoTRF as a New Tool for De Novo Satellite DNA Identification in the Raw Nanopore Sequencing Reads of Plant Genomes. PLANTS (BASEL, SWITZERLAND) 2022; 11:2103. [PMID: 36015406 PMCID: PMC9413040 DOI: 10.3390/plants11162103] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/08/2022] [Accepted: 08/11/2022] [Indexed: 06/15/2023]
Abstract
High-copy tandemly organized repeats (TRs), or satellite DNA, is an important but still enigmatic component of eukaryotic genomes. TRs comprise arrays of multi-copy and highly similar tandem repeats, which makes the elucidation of TRs a very challenging task. Oxford Nanopore sequencing data provide a valuable source of information on TR organization at the single molecule level. However, bioinformatics tools for de novo identification of TRs in raw Nanopore data have not been reported so far. We developed NanoTRF, a new python pipeline for TR repeat identification, characterization and consensus monomer sequence assembly. This new pipeline requires only a raw Nanopore read file from low-depth (<1×) genome sequencing. The program generates an informative html report and figures on TR genome abundance, monomer sequence and monomer length. In addition, NanoTRF performs annotation of transposable elements (TEs) sequences within or near satDNA arrays, and the information can be used to elucidate how TR−TE co-evolve in the genome. Moreover, we validated by FISH that the NanoTRF report is useful for the evaluation of TR chromosome organization—clustered or dispersed. Our findings showed that NanoTRF is a robust method for the de novo identification of satellite repeats in raw Nanopore data without prior read assembly. The obtained sequences can be used in many downstream analyses including genome assembly assistance and gap estimation, chromosome mapping and cytogenetic marker development.
Collapse
Affiliation(s)
- Ilya Kirov
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, Moscow 127550, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Elizaveta Kolganova
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, Moscow 127550, Russia
| | - Maxim Dudnikov
- All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Str. 42, Moscow 127550, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny 141701, Russia
| | - Olga Yu. Yurkevich
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Alexandra V. Amosova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| | - Olga V. Muravenko
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow 119991, Russia
| |
Collapse
|
22
|
Delihas N. An ancestral genomic sequence that serves as a nucleation site for de novo gene birth. PLoS One 2022; 17:e0267864. [PMID: 35552551 PMCID: PMC9097989 DOI: 10.1371/journal.pone.0267864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 04/17/2022] [Indexed: 11/24/2022] Open
Abstract
The process of gene birth is of major interest with current excitement concerning de novo gene formation. We report a new and different mechanism of de novo gene birth based on the finding and the characteristics of a short non-coding sequence situated between two protein genes, termed a spacer sequence. This non-coding sequence is present in genomes of Mus musculus, the house mouse and Philippine tarsier, a primitive ancestral primate. The ancestral sequence is highly conserved during primate evolution with certain base pairs totally invariant from mouse to humans. By following the birth of the sequence of human lincRNA BCRP3 (BCR activator of RhoGEF and GTPase 3 pseudogene) during primate evolution, we find diverse genes, long non-coding RNA and protein genes (and sequences that do not appear to encode a gene) that all stem from the 3’ end of the spacer, and all begin with a similar sequence. During primate evolution, part of the BCRP3 sequence initially formed in the Old World Monkeys and developed into different primate genes before evolving into the BCRP3 gene in humans. The gene developmental process consists of the initiation of DNA synthesis at spacer 3’ ends, addition of a complex of tandem transposable elements and the addition of a segment of another gene. The findings support the concept of the spacer sequence as a starting site for DNA synthesis that leads to formation of different genes with the addition of other sequences. These data suggest a new process of de novo gene birth.
Collapse
Affiliation(s)
- Nicholas Delihas
- Department of Microbiology and Immunology, Renaissance School of Medicine, Stony Brook University, Stony Brook, New York, United States of America
- * E-mail:
| |
Collapse
|
23
|
Montiel EE, Mora P, Rico-Porras JM, Palomeque T, Lorite P. Satellitome of the Red Palm Weevil, Rhynchophorus ferrugineus (Coleoptera: Curculionidae), the Most Diverse Among Insects. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.826808] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The red palm weevil, Rhynchophorus ferrugineus, is the most harmful species among those pests affecting palm trees. Its impact causes important economic losses around the World. Nevertheless, the genetic information of Rh. ferrugineus is very scarce. Last year, the first genome assembly was published including a rough description of its repeatome. However, no information has been added about one of the main components of repeated DNA, the satellite DNA. Herein, we presented the characterization of the satellitome of this important species that includes 112 satellite DNA families, the largest number in an insect genome. These satellite DNA families made up around 25% of the genome while the most abundant family, RferSat01-169, alone represented 20.4%. Chromosomal location of most abundant satellite DNA families performed by fluorescence in situ hybridization showed that all of them are dispersed in the euchromatin on all chromosomes but some of them are also specifically accumulated either on the pericentromeric heterochromatic regions of all chromosomes or on specific chromosomes. Finally, the transcription of satellitome families was analyzed through Rh. ferrugineus development. It was found that 55 out of 112 satellite DNA families showed transcription, some families seemed to be transcribed across all stages while a few appeared to be stage-specific, indicating a possible role of those satellite DNA sequences in the development of this species.
Collapse
|
24
|
Kretschmer R, Goes CAG, Bertollo LAC, Ezaz T, Porto-Foresti F, Toma GA, Utsunomia R, de Bello Cioffi M. Satellitome analysis illuminates the evolution of ZW sex chromosomes of Triportheidae fishes (Teleostei: Characiformes). Chromosoma 2022; 131:29-45. [PMID: 35099570 DOI: 10.1007/s00412-022-00768-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 01/09/2022] [Accepted: 01/12/2022] [Indexed: 12/14/2022]
Abstract
Satellites are an abundant source of repetitive DNAs that play an essential role in the chromosomal organization and are tightly linked with the evolution of sex chromosomes. Among fishes, Triportheidae stands out as the only family where almost all species have a homeologous ZZ/ZW sex chromosomes system. While the Z chromosome is typically conserved, the W is always smaller, with variations in size and morphology between species. Here, we report an analysis of the satellitome of Triportheus auritus (TauSat) by integrating genomic and chromosomal data, with a special focus on the highly abundant and female-biased satDNAs. In addition, we investigated the evolutionary trajectories of the ZW sex chromosomes in the Triportheidae family by mapping satDNAs in selected representative species of this family. The satellitome of T. auritus comprised 53 satDNA families of which 24 were also hybridized by FISH. Most satDNAs differed significantly between sexes, with 19 out of 24 being enriched on the W chromosome of T. auritus. The number of satDNAs hybridized into the W chromosomes of T. signatus and T. albus decreased to six and four, respectively, in accordance with the size of their W chromosomes. No TauSat probes produced FISH signals on the chromosomes of Agoniates halecinus. Despite its apparent conservation, our results indicate that each species differs in the satDNA accumulation on the Z chromosome. Minimum spanning trees (MSTs), generated for three satDNA families with different patterns of FISH mapping data, revealed different homogenization rates between the Z and W chromosomes. These results were linked to different levels of recombination between them. The most abundant satDNA family (TauSat01) was exclusively hybridized in the centromeres of all 52 chromosomes of T. auritus, and its putative role in the centromere evolution was also highlighted. Our results identified a high differentiation of both ZW chromosomes regarding satellites composition, highlighting their dynamic role in the sex chromosomes evolution.
Collapse
Affiliation(s)
- Rafael Kretschmer
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| | | | | | - Tariq Ezaz
- Institute for Applied Ecology, University of Canberra, Canberra, Australia
| | | | - Gustavo Akira Toma
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil
| | - Ricardo Utsunomia
- Instituto de Ciências Biológicas e da Saúde, ICBS, Universidade Federal Rural do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Marcelo de Bello Cioffi
- Departamento de Genética e Evolução, Universidade Federal de São Carlos, São Carlos, São Paulo, Brazil.
| |
Collapse
|
25
|
Affiliation(s)
| | - Francisco J. Ruiz-Ruano
- Department of Organismal Biology – Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- School of Biological Sciences, Norwich Research Park University of East Anglia, Norwich, UK
| |
Collapse
|
26
|
Programmed DNA elimination: silencing genes and repetitive sequences in somatic cells. Biochem Soc Trans 2021; 49:1891-1903. [PMID: 34665225 DOI: 10.1042/bst20190951] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 09/25/2021] [Accepted: 09/28/2021] [Indexed: 12/30/2022]
Abstract
In a multicellular organism, the genomes of all cells are in general the same. Programmed DNA elimination is a notable exception to this genome constancy rule. DNA elimination removes genes and repetitive elements in the germline genome to form a reduced somatic genome in various organisms. The process of DNA elimination within an organism is highly accurate and reproducible; it typically occurs during early embryogenesis, coincident with germline-soma differentiation. DNA elimination provides a mechanism to silence selected genes and repeats in somatic cells. Recent studies in nematodes suggest that DNA elimination removes all chromosome ends, resolves sex chromosome fusions, and may also promote the birth of novel genes. Programmed DNA elimination processes are diverse among species, suggesting DNA elimination likely has evolved multiple times in different taxa. The growing list of organisms that undergo DNA elimination indicates that DNA elimination may be more widespread than previously appreciated. These various organisms will serve as complementary and comparative models to study the function, mechanism, and evolution of programmed DNA elimination in metazoans.
Collapse
|
27
|
Subirana JA, Messeguer X. DNA Satellites Are Transcribed as Part of the Non-Coding Genome in Eukaryotes and Bacteria. Genes (Basel) 2021; 12:genes12111651. [PMID: 34828257 PMCID: PMC8625621 DOI: 10.3390/genes12111651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Revised: 10/16/2021] [Accepted: 10/17/2021] [Indexed: 12/01/2022] Open
Abstract
It has been shown in recent years that many repeated sequences in the genome are expressed as RNA transcripts, although the role of such RNAs is poorly understood. Some isolated and tandem repeats (satellites) have been found to be transcribed, such as mammalian Alu sequences and telomeric/centromeric satellites in different species. However, there is no detailed study on the eventual transcription of the interspersed satellites found in many species. Therefore, we decided to study for the first time the transcription of the abundant DNA satellites in the bacterium Bacillus coagulans and in the nematode Caenorhabditis elegans. We have updated the data for C. elegans satellites using the latest version of the genome. We analyzed the transcription of satellites in both species in available RNA-seq results and found that they are widely transcribed. Our demonstration that satellite RNAs are transcribed adds a new family of non-coding RNAs. This is a field that requires further investigation and will provide a deeper understanding of gene expression and control.
Collapse
|
28
|
Valeri MP, Dias GB, do Espírito Santo AA, Moreira CN, Yonenaga-Yassuda Y, Sommer IB, Kuhn GCS, Svartman M. First Description of a Satellite DNA in Manatees' Centromeric Regions. Front Genet 2021; 12:694866. [PMID: 34504514 PMCID: PMC8421680 DOI: 10.3389/fgene.2021.694866] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 07/30/2021] [Indexed: 11/18/2022] Open
Abstract
Trichechus manatus and Trichechus inunguis are the two Sirenia species that occur in the Americas. Despite their increasing extinction risk, many aspects of their biology remain understudied, including the repetitive DNA fraction of their genomes. Here we used the sequenced genome of T. manatus and TAREAN to identify satellite DNAs (satDNAs) in this species. We report the first description of TMAsat, a satDNA comprising ~0.87% of the genome, with ~684bp monomers and centromeric localization. In T. inunguis, TMAsat showed similar monomer length, chromosome localization and conserved CENP-B box-like motifs as in T. manatus. We also detected this satDNA in the Dugong dugon and in the now extinct Hydrodamalis gigas genomes. The neighbor-joining tree shows that TMAsat sequences from T. manatus, T. inunguis, D. dugon, and H. gigas lack species-specific clusters, which disagrees with the predictions of concerted evolution. We detected a divergent TMAsat-like homologous sequence in elephants and hyraxes, but not in other mammals, suggesting this sequence was already present in the common ancestor of Paenungulata, and later became a satDNA in the Sirenians. This is the first description of a centromeric satDNA in manatees and will facilitate the inclusion of Sirenia in future studies of centromeres and satDNA biology.
Collapse
Affiliation(s)
- Mirela Pelizaro Valeri
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Guilherme Borges Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, United States
| | - Alice Alves do Espírito Santo
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Camila Nascimento Moreira
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Yatiyo Yonenaga-Yassuda
- Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, Brazil
| | - Iara Braga Sommer
- Centro Nacional de Pesquisa e Conservação da Biodiversidade Marinha do Nordeste, Instituto Chico Mendes de Conservação da Biodiversidade, Brasília, Brazil
| | - Gustavo C. S. Kuhn
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Marta Svartman
- Laboratório de Citogenômica Evolutiva, Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| |
Collapse
|
29
|
Zhuravlev AV, Zakharov GA, Anufrieva EV, Medvedeva AV, Nikitina EA, Savvateeva-Popova EV. Chromatin Structure and "DNA Sequence View": The Role of Satellite DNA in Ectopic Pairing of the Drosophila X Polytene Chromosome. Int J Mol Sci 2021; 22:8713. [PMID: 34445413 PMCID: PMC8395981 DOI: 10.3390/ijms22168713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 08/11/2021] [Indexed: 11/16/2022] Open
Abstract
Chromatin 3D structure plays a crucial role in regulation of gene activity. Previous studies have envisioned spatial contact formations between chromatin domains with different epigenetic properties, protein compositions and transcription activity. This leaves specific DNA sequences that affect chromosome interactions. The Drosophila melanogaster polytene chromosomes are involved in non-allelic ectopic pairing. The mutant strain agnts3, a Drosophila model for Williams-Beuren syndrome, has an increased frequency of ectopic contacts (FEC) compared to the wild-type strain Canton-S (CS). Ectopic pairing can be mediated by some specific DNA sequences. In this study, using our Homology Segment Analysis software, we estimated the correlation between FEC and frequency of short matching DNA fragments (FMF) for all sections of the X chromosome of Drosophila CS and agnts3 strains. With fragment lengths of 50 nucleotides (nt), CS showed a specific FEC-FMF correlation for 20% of the sections involved in ectopic contacts. The correlation was unspecific in agnts3, which may indicate the alternative epigenetic mechanisms affecting FEC in the mutant strain. Most of the fragments that specifically contributed to FMF were related to 1.688 or 372-bp middle repeats. Thus, middle repetitive DNA may serve as an organizer of ectopic pairing.
Collapse
Affiliation(s)
- Aleksandr V. Zhuravlev
- Pavlov Institute of Physiology, Russian Academy of Sciences, 199034 Saint Petersburg, Russia; (G.A.Z.); (A.V.M.); (E.A.N.); (E.V.S.-P.)
| | - Gennadii A. Zakharov
- Pavlov Institute of Physiology, Russian Academy of Sciences, 199034 Saint Petersburg, Russia; (G.A.Z.); (A.V.M.); (E.A.N.); (E.V.S.-P.)
- EPAM Systems Inc., Saint Petersburg 197110, Russia
| | - Ekaterina V. Anufrieva
- Faculty of Biology, Herzen State Pedagogical University of Russia, 191186 Saint Petersburg, Russia;
| | - Anna V. Medvedeva
- Pavlov Institute of Physiology, Russian Academy of Sciences, 199034 Saint Petersburg, Russia; (G.A.Z.); (A.V.M.); (E.A.N.); (E.V.S.-P.)
| | - Ekaterina A. Nikitina
- Pavlov Institute of Physiology, Russian Academy of Sciences, 199034 Saint Petersburg, Russia; (G.A.Z.); (A.V.M.); (E.A.N.); (E.V.S.-P.)
- Faculty of Biology, Herzen State Pedagogical University of Russia, 191186 Saint Petersburg, Russia;
| | - Elena V. Savvateeva-Popova
- Pavlov Institute of Physiology, Russian Academy of Sciences, 199034 Saint Petersburg, Russia; (G.A.Z.); (A.V.M.); (E.A.N.); (E.V.S.-P.)
| |
Collapse
|
30
|
Montiel EE, Panzera F, Palomeque T, Lorite P, Pita S. Satellitome Analysis of Rhodnius prolixus, One of the Main Chagas Disease Vector Species. Int J Mol Sci 2021; 22:6052. [PMID: 34205189 PMCID: PMC8199985 DOI: 10.3390/ijms22116052] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 05/31/2021] [Accepted: 06/01/2021] [Indexed: 12/13/2022] Open
Abstract
The triatomine Rhodnius prolixus is the main vector of Chagas disease in countries such as Colombia and Venezuela, and the first kissing bug whose genome has been sequenced and assembled. In the repetitive genome fraction (repeatome) of this species, the transposable elements represented 19% of R. prolixus genome, being mostly DNA transposon (Class II elements). However, scarce information has been published regarding another important repeated DNA fraction, the satellite DNA (satDNA), or satellitome. Here, we offer, for the first time, extended data about satellite DNA families in the R. prolixus genome using bioinformatics pipeline based on low-coverage sequencing data. The satellitome of R. prolixus represents 8% of the total genome and it is composed by 39 satDNA families, including four satDNA families that are shared with Triatoma infestans, as well as telomeric (TTAGG)n and (GATA)n repeats, also present in the T. infestans genome. Only three of them exceed 1% of the genome. Chromosomal hybridization with these satDNA probes showed dispersed signals over the euchromatin of all chromosomes, both in autosomes and sex chromosomes. Moreover, clustering analysis revealed that most abundant satDNA families configured several superclusters, indicating that R. prolixus satellitome is complex and that the four most abundant satDNA families are composed by different subfamilies. Additionally, transcription of satDNA families was analyzed in different tissues, showing that 33 out of 39 satDNA families are transcribed in four different patterns of expression across samples.
Collapse
Affiliation(s)
- Eugenia E. Montiel
- Department of Experimental Biology, Genetics, University of Jaén. Paraje las Lagunillas sn., 23071 Jaén, Spain; (E.E.M.); (T.P.)
| | - Francisco Panzera
- Evolutionary Genetic Section, Faculty of Science, University of the Republic, Iguá 4225, Montevideo 11400, Uruguay;
| | - Teresa Palomeque
- Department of Experimental Biology, Genetics, University of Jaén. Paraje las Lagunillas sn., 23071 Jaén, Spain; (E.E.M.); (T.P.)
| | - Pedro Lorite
- Department of Experimental Biology, Genetics, University of Jaén. Paraje las Lagunillas sn., 23071 Jaén, Spain; (E.E.M.); (T.P.)
| | - Sebastián Pita
- Evolutionary Genetic Section, Faculty of Science, University of the Republic, Iguá 4225, Montevideo 11400, Uruguay;
| |
Collapse
|
31
|
Tandem Repeats in Bacillus: Unique Features and Taxonomic Distribution. Int J Mol Sci 2021; 22:ijms22105373. [PMID: 34065296 PMCID: PMC8161180 DOI: 10.3390/ijms22105373] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 05/14/2021] [Accepted: 05/18/2021] [Indexed: 11/16/2022] Open
Abstract
Little is known about DNA tandem repeats across prokaryotes. We have recently described an enigmatic group of tandem repeats in bacterial genomes with a constant repeat size but variable sequence. These findings strongly suggest that tandem repeat size in some bacteria is under strong selective constraints. Here, we extend these studies and describe tandem repeats in a large set of Bacillus. Some species have very few repeats, while other species have a large number. Most tandem repeats have repeats with a constant size (either 52 or 20-21 nt), but a variable sequence. We characterize in detail these intriguing tandem repeats. Individual species have several families of tandem repeats with the same repeat length and different sequence. This result is in strong contrast with eukaryotes, where tandem repeats of many sizes are found in any species. We discuss the possibility that they are transcribed as small RNA molecules. They may also be involved in the stabilization of the nucleoid through interaction with proteins. We also show that the distribution of tandem repeats in different species has a taxonomic significance. The data we present for all tandem repeats and their families in these bacterial species will be useful for further genomic studies.
Collapse
|
32
|
Lopes M, Louzada S, Gama-Carvalho M, Chaves R. Genomic Tackling of Human Satellite DNA: Breaking Barriers through Time. Int J Mol Sci 2021; 22:4707. [PMID: 33946766 PMCID: PMC8125562 DOI: 10.3390/ijms22094707] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 04/24/2021] [Accepted: 04/27/2021] [Indexed: 12/12/2022] Open
Abstract
(Peri)centromeric repetitive sequences and, more specifically, satellite DNA (satDNA) sequences, constitute a major human genomic component. SatDNA sequences can vary on a large number of features, including nucleotide composition, complexity, and abundance. Several satDNA families have been identified and characterized in the human genome through time, albeit at different speeds. Human satDNA families present a high degree of sub-variability, leading to the definition of various subfamilies with different organization and clustered localization. Evolution of satDNA analysis has enabled the progressive characterization of satDNA features. Despite recent advances in the sequencing of centromeric arrays, comprehensive genomic studies to assess their variability are still required to provide accurate and proportional representation of satDNA (peri)centromeric/acrocentric short arm sequences. Approaches combining multiple techniques have been successfully applied and seem to be the path to follow for generating integrated knowledge in the promising field of human satDNA biology.
Collapse
Affiliation(s)
- Mariana Lopes
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Sandra Louzada
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Margarida Gama-Carvalho
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Raquel Chaves
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| |
Collapse
|
33
|
Lauria Sneideman MP, Meller VH. Drosophila Satellite Repeats at the Intersection of Chromatin, Gene Regulation and Evolution. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2021; 60:1-26. [PMID: 34386870 DOI: 10.1007/978-3-030-74889-0_1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Satellite repeats make up a large fraction of the genomes of many higher eukaryotes. Until recently these sequences were viewed as molecular parasites with few functions. Drosophila melanogaster and related species have a wealth of diverse satellite repeats. Comparative studies of Drosophilids have been instrumental in understanding how these rapidly evolving sequences change and move. Remarkably, satellite repeats have been found to modulate gene expression and mediate genetic conflicts between chromosomes and between closely related fly species. This suggests that satellites play a key role in speciation. We have taken advantage of the depth of research on satellite repeats in flies to review the known functions of these sequences and consider their central role in evolution and gene expression.
Collapse
Affiliation(s)
| | - Victoria H Meller
- Department of Biological Sciences, Wayne State University, Detroit, MI, USA.
| |
Collapse
|
34
|
Waminal NE, Pellerin RJ, Kang SH, Kim HH. Chromosomal Mapping of Tandem Repeats Revealed Massive Chromosomal Rearrangements and Insights Into Senna tora Dysploidy. FRONTIERS IN PLANT SCIENCE 2021; 12:629898. [PMID: 33643358 PMCID: PMC7902697 DOI: 10.3389/fpls.2021.629898] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Accepted: 01/21/2021] [Indexed: 05/16/2023]
Abstract
Tandem repeats can occupy a large portion of plant genomes and can either cause or result from chromosomal rearrangements, which are important drivers of dysploidy-mediated karyotype evolution and speciation. To understand the contribution of tandem repeats in shaping the extant Senna tora dysploid karyotype, we analyzed the composition and abundance of tandem repeats in the S. tora genome and compared the chromosomal distribution of these repeats between S. tora and a closely related euploid, Senna occidentalis. Using a read clustering algorithm, we identified the major S. tora tandem repeats and visualized their chromosomal distribution by fluorescence in situ hybridization. We identified eight independent repeats covering ~85 Mb or ~12% of the S. tora genome. The unit lengths and copy numbers had ranges of 7-5,833 bp and 325-2.89 × 106, respectively. Three short duplicated sequences were found in the 45S rDNA intergenic spacer, one of which was also detected at an extra-NOR locus. The canonical plant telomeric repeat (TTTAGGG)n was also detected as very intense signals in numerous pericentromeric and interstitial loci. StoTR05_180, which showed subtelomeric distribution in Senna occidentalis, was predominantly pericentromeric in S. tora. The unusual chromosomal distribution of tandem repeats in S. tora not only enabled easy identification of individual chromosomes but also revealed the massive chromosomal rearrangements that have likely played important roles in shaping its dysploid karyotype.
Collapse
Affiliation(s)
- Nomar Espinosa Waminal
- Department of Chemistry and Life Science, BioScience Institute, Sahmyook University, Seoul, South Korea
| | - Remnyl Joyce Pellerin
- Department of Chemistry and Life Science, BioScience Institute, Sahmyook University, Seoul, South Korea
| | - Sang-Ho Kang
- Genomics Division, National Institute of Agricultural Sciences, Rural Development Administration, Jeonju, South Korea
| | - Hyun Hee Kim
- Department of Chemistry and Life Science, BioScience Institute, Sahmyook University, Seoul, South Korea
- *Correspondence: Hyun Hee Kim
| |
Collapse
|