1
|
Pokrovac I, Rohner N, Pezer Ž. The prevalence of copy number increase at multiallelic copy number variants associated with cave colonization. Mol Ecol 2024; 33:e17339. [PMID: 38556927 DOI: 10.1111/mec.17339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 03/16/2024] [Accepted: 03/22/2024] [Indexed: 04/02/2024]
Abstract
Copy number variation is a common contributor to phenotypic diversity, yet its involvement in ecological adaptation is not easily discerned. Instances of parallelly evolving populations of the same species in a similar environment marked by strong selective pressures present opportunities to study the role of copy number variants (CNVs) in adaptation. By identifying CNVs that repeatedly occur in multiple populations of the derived ecotype and are not (or are rarely) present in the populations of the ancestral ecotype, the association of such CNVs with adaptation to the novel environment can be inferred. We used this paradigm to identify CNVs associated with recurrent adaptation of the Mexican tetra (Astyanax mexicanus) to cave environment. Using a read-depth approach, we detected CNVs from previously re-sequenced genomes of 44 individuals belonging to two ancestral surfaces and three derived cave populations. We identified 102 genes and 292 genomic regions that repeatedly diverge in copy number between the two ecotypes and occupy 0.8% of the reference genome. Functional analysis revealed their association with processes previously recognized to be relevant for adaptation, such as vision, immunity, oxygen consumption, metabolism, and neural function and we propose that these variants have been selected for in the cave or surface waters. The majority of the ecotype-divergent CNVs are multiallelic and display copy number increases in cavefish compared to surface fish. Our findings suggest that multiallelic CNVs - including gene duplications - and divergence in copy number provide a fast route to produce novel phenotypes associated with adaptation to subterranean life.
Collapse
Affiliation(s)
| | - Nicolas Rohner
- Stowers Institute for Medical Research, Kansas City, Missouri, USA
| | | |
Collapse
|
2
|
Pokrovac I, Pezer Ž. Recent advances and current challenges in population genomics of structural variation in animals and plants. Front Genet 2022; 13:1060898. [PMID: 36523759 PMCID: PMC9745067 DOI: 10.3389/fgene.2022.1060898] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 11/15/2022] [Indexed: 05/02/2024] Open
Abstract
The field of population genomics has seen a surge of studies on genomic structural variation over the past two decades. These studies witnessed that structural variation is taxonomically ubiquitous and represent a dominant form of genetic variation within species. Recent advances in technology, especially the development of long-read sequencing platforms, have enabled the discovery of structural variants (SVs) in previously inaccessible genomic regions which unlocked additional structural variation for population studies and revealed that more SVs contribute to evolution than previously perceived. An increasing number of studies suggest that SVs of all types and sizes may have a large effect on phenotype and consequently major impact on rapid adaptation, population divergence, and speciation. However, the functional effect of the vast majority of SVs is unknown and the field generally lacks evidence on the phenotypic consequences of most SVs that are suggested to have adaptive potential. Non-human genomes are heavily under-represented in population-scale studies of SVs. We argue that more research on other species is needed to objectively estimate the contribution of SVs to evolution. We discuss technical challenges associated with SV detection and outline the most recent advances towards more representative reference genomes, which opens a new era in population-scale studies of structural variation.
Collapse
Affiliation(s)
| | - Željka Pezer
- Laboratory for Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| |
Collapse
|
3
|
Vojvoda Zeljko T, Ugarković Đ, Pezer Ž. Differential enrichment of H3K9me3 at annotated satellite DNA repeats in human cell lines and during fetal development in mouse. Epigenetics Chromatin 2021; 14:47. [PMID: 34663449 PMCID: PMC8524813 DOI: 10.1186/s13072-021-00423-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 10/05/2021] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Trimethylation of histone H3 on lysine 9 (H3K9me3) at satellite DNA sequences has been primarily studied at (peri)centromeric regions, where its level shows differences associated with various processes such as development and malignant transformation. However, the dynamics of H3K9me3 at distal satellite DNA repeats has not been thoroughly investigated. RESULTS We exploit the sets of publicly available data derived from chromatin immunoprecipitation combined with massively parallel DNA sequencing (ChIP-Seq), produced by the The Encyclopedia of DNA Elements (ENCODE) project, to analyze H3K9me3 at assembled satellite DNA repeats in genomes of human cell lines and during mouse fetal development. We show that annotated satellite elements are generally enriched for H3K9me3, but its level in cancer cell lines is on average lower than in normal cell lines. We find 407 satellite DNA instances with differential H3K9me3 enrichment between cancer and normal cells including a large 115-kb cluster of GSATII elements on chromosome 12. Differentially enriched regions are not limited to satellite DNA instances, but instead encompass a wider region of flanking sequences. We found no correlation between the levels of H3K9me3 and noncoding RNA at corresponding satellite DNA loci. The analysis of data derived from multiple tissues identified 864 instances of satellite DNA sequences in the mouse reference genome that are differentially enriched between fetal developmental stages. CONCLUSIONS Our study reveals significant differences in H3K9me3 level at a subset of satellite repeats between biological states and as such contributes to understanding of the role of satellite DNA repeats in epigenetic regulation during development and carcinogenesis.
Collapse
Affiliation(s)
| | | | - Željka Pezer
- Ruđer Bošković Institute, Bijenička 54, 10000, Zagreb, Croatia.
| |
Collapse
|
4
|
Karn RC, Yazdanifar G, Pezer Ž, Boursot P, Laukaitis CM. Androgen-Binding Protein (Abp) Evolutionary History: Has Positive Selection Caused Fixation of Different Paralogs in Different Taxa of the Genus Mus? Genome Biol Evol 2021; 13:6377336. [PMID: 34581786 PMCID: PMC8525912 DOI: 10.1093/gbe/evab220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/20/2021] [Indexed: 11/14/2022] Open
Abstract
Comparison of the androgen-binding protein (Abp) gene regions of six Mus genomes provides insights into the evolutionary history of this large murid rodent gene family. We identified 206 unique Abp sequences and mapped their physical relationships. At least 48 are duplicated and thus present in more than two identical copies. All six taxa have substantially elevated LINE1 densities in Abp regions compared with flanking regions, similar to levels in mouse and rat genomes, although nonallelic homologous recombination seems to have only occurred in Mus musculus domesticus. Phylogenetic and structural relationships support the hypothesis that the extensive Abp expansion began in an ancestor of the genus Mus. We also found duplicated Abpa27's in two taxa, suggesting that previously reported selection on a27 alleles may have actually detected selection on haplotypes wherein different paralogs were lost in each. Other studies reported that a27 gene and species trees were incongruent, likely because of homoplasy. However, L1MC3 phylogenies, supposed to be homoplasy-free compared with coding regions, support our paralog hypothesis because the L1MC3 phylogeny was congruent with the a27 topology. This paralog hypothesis provides an alternative explanation for the origin of the a27 gene that is suggested to be fixed in the three different subspecies of Mus musculus and to mediate sexual selection and incipient reinforcement between at least two of them. Finally, we ask why there are so many Abp genes, especially given the high frequency of pseudogenes and suggest that relaxed selection operates over a large part of the gene clusters.
Collapse
Affiliation(s)
- Robert C Karn
- Gene Networks in Neural and Developmental Plasticity, Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| | | | - Željka Pezer
- Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Pierre Boursot
- Institut des Sciences de l'Evolution Montpellier, Université de Montpellier, CNRS, IRD, France
| | - Christina M Laukaitis
- Carle Health and Carle Illinois College of Medicine, University of Illinois, Urbana-Champaign, USA
| |
Collapse
|
5
|
Feliciello I, Pezer Ž, Kordiš D, Bruvo Mađarić B, Ugarković Đ. Evolutionary History of Alpha Satellite DNA Repeats Dispersed within Human Genome Euchromatin. Genome Biol Evol 2021; 12:2125-2138. [PMID: 33078196 PMCID: PMC7719264 DOI: 10.1093/gbe/evaa224] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/14/2020] [Indexed: 01/03/2023] Open
Abstract
Major human alpha satellite DNA repeats are preferentially assembled within (peri)centromeric regions but are also dispersed within euchromatin in the form of clustered or short single repeat arrays. To study the evolutionary history of single euchromatic human alpha satellite repeats (ARs), we analyzed their orthologous loci across the primate genomes. The continuous insertion of euchromatic ARs throughout the evolutionary history of primates starting with the ancestors of Simiformes (45-60 Ma) and continuing up to the ancestors of Homo is revealed. Once inserted, the euchromatic ARs were stably transmitted to the descendant species, some exhibiting copy number variation, whereas their sequence divergence followed the species phylogeny. Many euchromatic ARs have sequence characteristics of (peri)centromeric alpha repeats suggesting heterochromatin as a source of dispersed euchromatic ARs. The majority of euchromatic ARs are inserted in the vicinity of other repetitive elements such as L1, Alu, and ERV or are embedded within them. Irrespective of the insertion context, each AR insertion seems to be unique and once inserted, ARs do not seem to be subsequently spread to new genomic locations. In spite of association with (retro)transposable elements, there is no indication that such elements play a role in ARs proliferation. The presence of short duplications at most of ARs insertion sites suggests site-directed recombination between homologous motifs in ARs and in the target genomic sequence, probably mediated by extrachromosomal circular DNA, as a mechanism of spreading within euchromatin.
Collapse
Affiliation(s)
- Isidoro Feliciello
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia.,Dipartimento di Medicina Clinica e Chirurgia, Universita' degli Studi di Napoli Federico II, Italy
| | - Željka Pezer
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Dušan Kordiš
- Department of Molecular and Biomedical Sciences, Jožef Stefan Institute, Ljubljana, Slovenia
| | | | - Đurđica Ugarković
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| |
Collapse
|
6
|
Feliciello I, Pezer Ž, Sermek A, Bruvo Mađarić B, Ljubić S, Ugarković Đ. Satellite DNA-Mediated Gene Expression Regulation: Physiological and Evolutionary Implication. Prog Mol Subcell Biol 2021; 60:145-167. [PMID: 34386875 DOI: 10.1007/978-3-030-74889-0_6] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Satellite DNAs are tandemly repeated sequences organized in large clusters within (peri)centromeric and/or subtelomeric heterochromatin. However, in many species, satellite DNAs are not restricted to heterochromatin but are also dispersed as short arrays within euchromatin. Such genomic organization together with transcriptional activity seems to be a prerequisite for the gene-modulatory effect of satellite DNAs which was first demonstrated in the beetle Tribolium castaneum upon heat stress. Namely, enrichment of a silent histone mark at euchromatic repeats of a major beetle satellite DNA results in epigenetic silencing of neighboring genes. In addition, human satellite III transcripts induced by heat shock contribute to genome-wide gene silencing, providing protection against stress-induced cell death. Gene silencing mediated by satellite RNA was also shown to be fundamental for the early embryonic development of the mosquito Aedes aegypti. Apart from a physiological role during embryogenesis and heat stress response, activation of satellite DNAs in terms of transcription and proliferation can have an evolutionary impact. Spreading of satellite repeats throughout euchromatin promotes the variation of epigenetic landscapes and gene expression diversity, contributing to the evolution of gene regulatory networks and to genome adaptation in fluctuating environmental conditions.
Collapse
Affiliation(s)
- Isidoro Feliciello
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia.,Dipartimento di Medicina Clinica e Chirurgia, Universita' degli Studi di Napoli Federico II, Naples, Italy
| | - Željka Pezer
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Antonio Sermek
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | | | - Sven Ljubić
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Đurđica Ugarković
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia.
| |
Collapse
|
7
|
Pezer Ž, Chung AG, Karn RC, Laukaitis CM. Analysis of Copy Number Variation in the Abp Gene Regions of Two House Mouse Subspecies Suggests Divergence during the Gene Family Expansions. Genome Biol Evol 2018; 9:3858091. [PMID: 28575204 PMCID: PMC5513543 DOI: 10.1093/gbe/evx099] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/26/2017] [Indexed: 12/26/2022] Open
Abstract
The Androgen-binding protein (Abp) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus (Mmd) and Mus musculus musculus (Mmm), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd, primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm, Mus musculus castaneus and an outgroup, Mus spretus, although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice.
Collapse
Affiliation(s)
- Željka Pezer
- Max Planck Institute for Evolutionary Biology, Plön, Germany.,Ruđer Bošković Institute, Zagreb, Croatia
| | - Amanda G Chung
- Department of Medicine, College of Medicine, University of Arizona
| | - Robert C Karn
- Department of Medicine, College of Medicine, University of Arizona
| | | |
Collapse
|
8
|
Harr B, Karakoc E, Neme R, Teschke M, Pfeifle C, Pezer Ž, Babiker H, Linnenbrink M, Montero I, Scavetta R, Abai MR, Molins MP, Schlegel M, Ulrich RG, Altmüller J, Franitza M, Büntge A, Künzel S, Tautz D. Genomic resources for wild populations of the house mouse, Mus musculus and its close relative Mus spretus. Sci Data 2016; 3:160075. [PMID: 27622383 PMCID: PMC5020872 DOI: 10.1038/sdata.2016.75] [Citation(s) in RCA: 92] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2016] [Accepted: 07/29/2016] [Indexed: 12/20/2022] Open
Abstract
Wild populations of the house mouse (Mus musculus) represent the raw genetic material for the classical inbred strains in biomedical research and are a major model system for evolutionary biology. We provide whole genome sequencing data of individuals representing natural populations of M. m. domesticus (24 individuals from 3 populations), M. m. helgolandicus (3 individuals), M. m. musculus (22 individuals from 3 populations) and M. spretus (8 individuals from one population). We use a single pipeline to map and call variants for these individuals and also include 10 additional individuals of M. m. castaneus for which genomic data are publically available. In addition, RNAseq data were obtained from 10 tissues of up to eight adult individuals from each of the three M. m. domesticus populations for which genomic data were collected. Data and analyses are presented via tracks viewable in the UCSC or IGV genome browsers. We also provide information on available outbred stocks and instructions on how to keep them in the laboratory.
Collapse
Affiliation(s)
- Bettina Harr
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Emre Karakoc
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Rafik Neme
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Meike Teschke
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Christine Pfeifle
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Željka Pezer
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Hiba Babiker
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Miriam Linnenbrink
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Inka Montero
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Rick Scavetta
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Mohammad Reza Abai
- Department of Medical Entomology and Vector Control, School of Public Health, Tehran University of Medical Sciences, Tehran 1417613151, Iran
| | - Marta Puente Molins
- Laboratorio de Anatomía Animal, Departamento de Biología Animal, Facultad de Ciencias, Universidad de Vigo, 36200 Vigo, Spain
| | - Mathias Schlegel
- Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health, Institute for Novel and Emerging Infectious Diseases, Südufer 10, 17493 Greifswald-Insel Riems, Germany
| | - Rainer G Ulrich
- Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health, Institute for Novel and Emerging Infectious Diseases, Südufer 10, 17493 Greifswald-Insel Riems, Germany
| | - Janine Altmüller
- Cologne Center for Genomics (CCG), University of Cologne, Weyertal 115b, 50931 Cologne, Germany.,Institute of Human Genetics, Universitätsklinik Köln, Kerpener Str. 34, 50931 Köln, Germany
| | - Marek Franitza
- Cologne Center for Genomics (CCG), University of Cologne, Weyertal 115b, 50931 Cologne, Germany.,Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne, Joseph-Stelzmann-Str. 26, 50931 Cologne, Germany
| | - Anna Büntge
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Sven Künzel
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| | - Diethard Tautz
- Max-Planck Institute for Evolutionary Biology, August-Thienemanstrasse 2, 24306 Plön, Germany
| |
Collapse
|
9
|
Pezer Ž, Harr B, Teschke M, Babiker H, Tautz D. Divergence patterns of genic copy number variation in natural populations of the house mouse (Mus musculus domesticus) reveal three conserved genes with major population-specific expansions. Genome Res 2015; 25:1114-24. [PMID: 26149421 PMCID: PMC4509996 DOI: 10.1101/gr.187187.114] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 06/05/2015] [Indexed: 11/29/2022]
Abstract
Copy number variation represents a major source of genetic divergence, yet the evolutionary dynamics of genic copy number variation in natural populations during differentiation and adaptation remain unclear. We applied a read depth approach to genome resequencing data to detect copy number variants (CNVs) ≥1 kb in wild-caught mice belonging to four populations of Mus musculus domesticus. We complemented the bioinformatics analyses with experimental validation using droplet digital PCR. The specific focus of our analysis is CNVs that include complete genes, as these CNVs could be expected to contribute most directly to evolutionary divergence. In total, 1863 transcription units appear to be completely encompassed within CNVs in at least one individual when compared to the reference assembly. Further, 179 of these CNVs show population-specific copy number differences, and 325 are subject to complete deletion in multiple individuals. Among the most copy-number variable genes are three highly conserved genes that encode the splicing factor CWC22, the spindle protein SFI1, and the Holliday junction recognition protein HJURP. These genes exhibit population-specific expansion patterns that suggest involvement in local adaptations. We found that genes that overlap with large segmental duplications are generally more copy-number variable. These genes encode proteins that are relevant for environmental and behavioral interactions, such as vomeronasal and olfactory receptors, as well as major urinary proteins and several proteins of unknown function. The overall analysis shows that genic CNVs contribute more to population differentiation in mice than in humans and may promote and speed up population divergence.
Collapse
Affiliation(s)
- Željka Pezer
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Bettina Harr
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Meike Teschke
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Hiba Babiker
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Diethard Tautz
- Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| |
Collapse
|
10
|
|