1
|
Khandia R, Garg R, Pandey MK, Khan AA, Dhanda SK, Romashchenko V, Malik A, Gurjar P. Determination of codon pattern and evolutionary forces acting on genes linked to inflammatory bowel disease. Int J Biol Macromol 2024:134480. [PMID: 39116987 DOI: 10.1016/j.ijbiomac.2024.134480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 06/25/2024] [Accepted: 07/31/2024] [Indexed: 08/10/2024]
Abstract
Inflammatory bowel disease (IBD) is an inflammatory disorder of the gastrointestinal tract. The present study attempted to understand the codon usage preferences in genes associated with IBD progression. Compositional analysis, codon usage bias (CUB), Relative synonymous codon usage (RSCU), RNA structure, and expression analysis were performed to obtain a comprehensive picture of codon usage in IBD genes. Compositional analysis of 62 IBD-associated genes revealed that G and T are the most and least abundant nucleotides, respectively. ApG, CpA, and TpG dinucleotides were overrepresented or randomly used, while ApC, CpG, GpT, and TpA dinucleotides were either underrepresented or randomly used in genes related to IBD. The codons influencing the codon usage the most in IBD genes were CGC and AGG. A comparison of codon usage between IBD, and pancreatitis (non-IBD inflammatory disease) indicated that only codon CTG codon usage was significantly different between IBD and pancreatitis. At the same time, there were codons ATA, ACA, CGT, CAA, GTA, CCT, ATT, GCT, CGG, TTG, and CAG for whom codon usage was significantly different for IBD and housekeeping gene sets. The results suggest similar codon usage in at least two inflammatory disorders, IBD and pancreatitis. The analysis helps understand the codon biology, factors affecting gene expression of IBD-associated genes, and the evolution of these genes. The study helps reveal the molecular patterns associated with IBD.
Collapse
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal 462026, MP, India.
| | - Rajkumar Garg
- Department of Biosciences, Barkatullah University, Bhopal 462026, MP, India
| | - Megha Katare Pandey
- Translational Medicine Center, All India Institute of Medical Sciences, Bhopal 462020, MP, India.
| | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia.
| | - Sandeep Kumar Dhanda
- Department of Oncology, St Jude Children's Research Hospital, Memphis, TN 38105, USA
| | | | - Abdul Malik
- Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia.
| | - Pankaj Gurjar
- Centre for Global Health Research, Saveetha Medical College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, India; Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, Australia.
| |
Collapse
|
2
|
Kaj I, Mugal CF, Müller-Widmann R. A Wright-Fisher graph model and the impact of directional selection on genetic variation. Theor Popul Biol 2024:S0040-5809(24)00077-7. [PMID: 39019334 DOI: 10.1016/j.tpb.2024.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 07/06/2024] [Accepted: 07/12/2024] [Indexed: 07/19/2024]
Abstract
We introduce a multi-allele Wright-Fisher model with mutation and selection such that allele frequencies at a single locus are traced by the path of a hybrid jump-diffusion process. The state space of the process is given by the vertices and edges of a topological graph, i.e. edges are unit intervals. Vertices represent monomorphic population states and positions on the edges mark the biallelic proportions of ancestral and derived alleles during polymorphic segments. In this setting, mutations can only occur at monomorphic loci. We derive the stationary distribution in mutation-selection-drift equilibrium and obtain the expected allele frequency spectrum under large population size scaling. For the extended model with multiple independent loci we derive rigorous upper bounds for a wide class of associated measures of genetic variation. Within this framework we present mathematically precise arguments to conclude that the presence of directional selection reduces the magnitude of genetic variation, as constrained by the bounds for neutral evolution.
Collapse
Affiliation(s)
- Ingemar Kaj
- Department of Mathematics, Uppsala University, Uppsala, Sweden.
| | - Carina F Mugal
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden; Laboratory of Biometry and Evolutionary Biology, University of Lyon 1, UMR CNRS 5558, Villeurbanne, France
| | | |
Collapse
|
3
|
Joseph J, Prentout D, Laverré A, Tricou T, Duret L. High prevalence of PRDM9-independent recombination hotspots in placental mammals. Proc Natl Acad Sci U S A 2024; 121:e2401973121. [PMID: 38809707 PMCID: PMC11161765 DOI: 10.1073/pnas.2401973121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 04/26/2024] [Indexed: 05/31/2024] Open
Abstract
In many mammals, recombination events are concentrated in hotspots directed by a sequence-specific DNA-binding protein named PRDM9. Intriguingly, PRDM9 has been lost several times in vertebrates, and notably among mammals, it has been pseudogenized in the ancestor of canids. In the absence of PRDM9, recombination hotspots tend to occur in promoter-like features such as CpG islands. It has thus been proposed that one role of PRDM9 could be to direct recombination away from PRDM9-independent hotspots. However, the ability of PRDM9 to direct recombination hotspots has been assessed in only a handful of species, and a clear picture of how much recombination occurs outside of PRDM9-directed hotspots in mammals is still lacking. In this study, we derived an estimator of past recombination activity based on signatures of GC-biased gene conversion in substitution patterns. We quantified recombination activity in PRDM9-independent hotspots in 52 species of boreoeutherian mammals. We observe a wide range of recombination rates at these loci: several species (such as mice, humans, some felids, or cetaceans) show a deficit of recombination, while a majority of mammals display a clear peak of recombination. Our results demonstrate that PRDM9-directed and PRDM9-independent hotspots can coexist in mammals and that their coexistence appears to be the rule rather than the exception. Additionally, we show that the location of PRDM9-independent hotspots is relatively more stable than that of PRDM9-directed hotspots, but that PRDM9-independent hotspots nevertheless evolve slowly in concert with DNA hypomethylation.
Collapse
Affiliation(s)
- Julien Joseph
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne69100, France
| | - Djivan Prentout
- Department of Biological Sciences, Columbia University, New York, NY10027
| | - Alexandre Laverré
- Department of Ecology and Evolution, University of Lausanne, LausanneCH-1015, Switzerland
- Swiss Institute of Bioinformatics, LausanneCH-1015, Switzerland
| | - Théo Tricou
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne69100, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne69100, France
| |
Collapse
|
4
|
Joseph J. Increased Positive Selection in Highly Recombining Genes Does not Necessarily Reflect an Evolutionary Advantage of Recombination. Mol Biol Evol 2024; 41:msae107. [PMID: 38829800 PMCID: PMC11173204 DOI: 10.1093/molbev/msae107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/08/2024] [Accepted: 05/28/2024] [Indexed: 06/05/2024] Open
Abstract
It is commonly thought that the long-term advantage of meiotic recombination is to dissipate genetic linkage, allowing natural selection to act independently on different loci. It is thus theoretically expected that genes with higher recombination rates evolve under more effective selection. On the other hand, recombination is often associated with GC-biased gene conversion (gBGC), which theoretically interferes with selection by promoting the fixation of deleterious GC alleles. To test these predictions, several studies assessed whether selection was more effective in highly recombining genes (due to dissipation of genetic linkage) or less effective (due to gBGC), assuming a fixed distribution of fitness effects (DFE) for all genes. In this study, I directly derive the DFE from a gene's evolutionary history (shaped by mutation, selection, drift, and gBGC) under empirical fitness landscapes. I show that genes that have experienced high levels of gBGC are less fit and thus have more opportunities for beneficial mutations. Only a small decrease in the genome-wide intensity of gBGC leads to the fixation of these beneficial mutations, particularly in highly recombining genes. This results in increased positive selection in highly recombining genes that is not caused by more effective selection. Additionally, I show that the death of a recombination hotspot can lead to a higher dN/dS than its birth, but with substitution patterns biased towards AT, and only at selected positions. This shows that controlling for a substitution bias towards GC is therefore not sufficient to rule out the contribution of gBGC to signatures of accelerated evolution. Finally, although gBGC does not affect the fixation probability of GC-conservative mutations, I show that by altering the DFE, gBGC can also significantly affect nonsynonymous GC-conservative substitution patterns.
Collapse
Affiliation(s)
- Julien Joseph
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne, France
| |
Collapse
|
5
|
Kotari I, Kosiol C, Borges R. The Patterns of Codon Usage between Chordates and Arthropods are Different but Co-evolving with Mutational Biases. Mol Biol Evol 2024; 41:msae080. [PMID: 38667829 PMCID: PMC11108087 DOI: 10.1093/molbev/msae080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 03/22/2024] [Accepted: 04/15/2024] [Indexed: 05/22/2024] Open
Abstract
Different frequencies amongst codons that encode the same amino acid (i.e. synonymous codons) have been observed in multiple species. Studies focused on uncovering the forces that drive such codon usage showed that a combined effect of mutational biases and translational selection works to produce different frequencies of synonymous codons. However, only few have been able to measure and distinguish between these forces that may leave similar traces on the coding regions. Here, we have developed a codon model that allows the disentangling of mutation, selection on amino acids and synonymous codons, and GC-biased gene conversion (gBGC) which we employed on an extensive dataset of 415 chordates and 191 arthropods. We found that chordates need 15 more synonymous codon categories than arthropods to explain the empirical codon frequencies, which suggests that the extent of codon usage can vary greatly between animal phyla. Moreover, methylation at CpG sites seems to partially explain these patterns of codon usage in chordates but not in arthropods. Despite the differences between the two phyla, our findings demonstrate that in both, GC-rich codons are disfavored when mutations are GC-biased, and the opposite is true when mutations are AT-biased. This indicates that selection on the genomic coding regions might act primarily to stabilize its GC/AT content on a genome-wide level. Our study shows that the degree of synonymous codon usage varies considerably among animals, but is likely governed by a common underlying dynamic.
Collapse
Affiliation(s)
- Ioanna Kotari
- Institut für Populationsgenetik, University of Veterinary Medicine, Veterinärplatz 1, Vienna 1210, Austria
- Vienna Graduate School of Population Genetics, Vienna, Austria
| | - Carolin Kosiol
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
| | - Rui Borges
- Institut für Populationsgenetik, University of Veterinary Medicine, Veterinärplatz 1, Vienna 1210, Austria
| |
Collapse
|
6
|
Rodrigues MF, Kern AD, Ralph PL. Shared evolutionary processes shape landscapes of genomic variation in the great apes. Genetics 2024; 226:iyae006. [PMID: 38242701 PMCID: PMC10990428 DOI: 10.1093/genetics/iyae006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 10/26/2023] [Accepted: 01/03/2024] [Indexed: 01/21/2024] Open
Abstract
For at least the past 5 decades, population genetics, as a field, has worked to describe the precise balance of forces that shape patterns of variation in genomes. The problem is challenging because modeling the interactions between evolutionary processes is difficult, and different processes can impact genetic variation in similar ways. In this paper, we describe how diversity and divergence between closely related species change with time, using correlations between landscapes of genetic variation as a tool to understand the interplay between evolutionary processes. We find strong correlations between landscapes of diversity and divergence in a well-sampled set of great ape genomes, and explore how various processes such as incomplete lineage sorting, mutation rate variation, GC-biased gene conversion and selection contribute to these correlations. Through highly realistic, chromosome-scale, forward-in-time simulations, we show that the landscapes of diversity and divergence in the great apes are too well correlated to be explained via strictly neutral processes alone. Our best fitting simulation includes both deleterious and beneficial mutations in functional portions of the genome, in which 9% of fixations within those regions is driven by positive selection. This study provides a framework for modeling genetic variation in closely related species, an approach which can shed light on the complex balance of forces that have shaped genetic variation.
Collapse
Affiliation(s)
- Murillo F Rodrigues
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
| | - Andrew D Kern
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
| | - Peter L Ralph
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
- Department of Mathematics, University of Oregon, Eugene, OR 97403, USA
| |
Collapse
|
7
|
Rodrigues MF, Kern AD, Ralph PL. Shared evolutionary processes shape landscapes of genomic variation in the great apes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.07.527547. [PMID: 36798346 PMCID: PMC9934647 DOI: 10.1101/2023.02.07.527547] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
For at least the past five decades population genetics, as a field, has worked to describe the precise balance of forces that shape patterns of variation in genomes. The problem is challenging because modelling the interactions between evolutionary processes is difficult, and different processes can impact genetic variation in similar ways. In this paper, we describe how diversity and divergence between closely related species change with time, using correlations between landscapes of genetic variation as a tool to understand the interplay between evolutionary processes. We find strong correlations between landscapes of diversity and divergence in a well sampled set of great ape genomes, and explore how various processes such as incomplete lineage sorting, mutation rate variation, GC-biased gene conversion and selection contribute to these correlations. Through highly realistic, chromosome-scale, forward-in-time simulations we show that the landscapes of diversity and divergence in the great apes are too well correlated to be explained via strictly neutral processes alone. Our best fitting simulation includes both deleterious and beneficial mutations in functional portions of the genome, in which 9% of fixations within those regions is driven by positive selection. This study provides a framework for modelling genetic variation in closely related species, an approach which can shed light on the complex balance of forces that have shaped genetic variation.
Collapse
Affiliation(s)
- Murillo F. Rodrigues
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
| | - Andrew D. Kern
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
| | - Peter L. Ralph
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
- Department of Mathematics, University of Oregon
| |
Collapse
|
8
|
Liu A, Wang N, Xie G, Li Y, Yan X, Li X, Zhu Z, Li Z, Yang J, Meng F, Dou M, Chen W, Ma N, Jiang Y, Gao Y, Wang Y. GC-biased gene conversion drives accelerated evolution of ultraconserved elements in mammalian and avian genomes. Genome Res 2023; 33:1673-1689. [PMID: 37884342 PMCID: PMC10691551 DOI: 10.1101/gr.277784.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 08/23/2023] [Indexed: 10/28/2023]
Abstract
Ultraconserved elements (UCEs) are the most conserved regions among the genomes of evolutionarily distant species and are thought to play critical biological functions. However, some UCEs rapidly evolved in specific lineages, and whether they contributed to adaptive evolution is still controversial. Here, using an increased number of sequenced genomes with high taxonomic coverage, we identified 2191 mammalian UCEs and 5938 avian UCEs from 95 mammal and 94 bird genomes, respectively. Our results show that these UCEs are functionally constrained and that their adjacent genes are prone to widespread expression with low expression diversity across tissues. Functional enrichment of mammalian and avian UCEs shows different trends indicating that UCEs may contribute to adaptive evolution of taxa. Focusing on lineage-specific accelerated evolution, we discover that the proportion of fast-evolving UCEs in nine mammalian and 10 avian test lineages range from 0.19% to 13.2%. Notably, up to 62.1% of fast-evolving UCEs in test lineages are much more likely to result from GC-biased gene conversion (gBGC). A single cervid-specific gBGC region embracing the uc.359 allele significantly alters the expression of Nova1 and other neural-related genes in the rat brain. Combined with the altered regulatory activity of ancient gBGC-induced fast-evolving UCEs in eutherians, our results provide evidence that synergy between gBGC and selection shaped lineage-specific substitution patterns, even in the most constrained regulatory elements. In summary, our results show that gBGC played an important role in facilitating lineage-specific accelerated evolution of UCEs, and further support the idea that a combination of multiple evolutionary forces shapes adaptive evolution.
Collapse
Affiliation(s)
- Anguo Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Nini Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Faculty of Mathematics and Natural Sciences, University of Cologne, and Cologne Excellence Cluster for Cellular Stress Responses in Aging-Associated Diseases (CECAD), University Hospital Cologne, Cologne 50931, Germany
| | - Guoxiang Xie
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yang Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xixi Yan
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xinmei Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Zhenliang Zhu
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Zhuohui Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Jing Yang
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Fanxin Meng
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Mingle Dou
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Weihuang Chen
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Nange Ma
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
- Center for Functional Genomics, Institute of Future Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yuanpeng Gao
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China;
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China
- Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yu Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, China;
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, Shaanxi 712100, China
| |
Collapse
|
9
|
Brovkina MV, Chapman MA, Holding ML, Clowney EJ. Emergence and influence of sequence bias in evolutionarily malleable, mammalian tandem arrays. BMC Biol 2023; 21:179. [PMID: 37612705 PMCID: PMC10463633 DOI: 10.1186/s12915-023-01673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/01/2023] [Indexed: 08/25/2023] Open
Abstract
BACKGROUND The radiation of mammals at the extinction of the dinosaurs produced a plethora of new forms-as diverse as bats, dolphins, and elephants-in only 10-20 million years. Behind the scenes, adaptation to new niches is accompanied by extensive innovation in large families of genes that allow animals to contact the environment, including chemosensors, xenobiotic enzymes, and immune and barrier proteins. Genes in these "outward-looking" families are allelically diverse among humans and exhibit tissue-specific and sometimes stochastic expression. RESULTS Here, we show that these tandem arrays of outward-looking genes occupy AT-biased isochores and comprise the "tissue-specific" gene class that lack CpG islands in their promoters. Models of mammalian genome evolution have not incorporated the sharply different functions and transcriptional patterns of genes in AT- versus GC-biased regions. To examine the relationship between gene family expansion, sequence content, and allelic diversity, we use population genetic data and comparative analysis. First, we find that AT bias can emerge during evolutionary expansion of gene families in cis. Second, human genes in AT-biased isochores or with GC-poor promoters experience relatively low rates of de novo point mutation today but are enriched for non-synonymous variants. Finally, we find that isochores containing gene clusters exhibit low rates of recombination. CONCLUSIONS Our analyses suggest that tolerance of non-synonymous variation and low recombination are two forces that have produced the depletion of GC bases in outward-facing gene arrays. In turn, high AT content exerts a profound effect on their chromatin organization and transcriptional regulation.
Collapse
Affiliation(s)
- Margarita V Brovkina
- Graduate Program in Cellular and Molecular Biology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Margaret A Chapman
- Neurosciences Graduate Program, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - E Josephine Clowney
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI, USA.
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
10
|
Ragsdale AP, Thornton KR. Multiple Sources of Uncertainty Confound Inference of Historical Human Generation Times. Mol Biol Evol 2023; 40:msad160. [PMID: 37450583 PMCID: PMC10404577 DOI: 10.1093/molbev/msad160] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 07/05/2023] [Accepted: 07/07/2023] [Indexed: 07/18/2023] Open
Abstract
Wang et al. (2023) recently proposed an approach to infer the history of human generation intervals from changes in mutation profiles over time. As the relative proportions of different mutation types depend on the ages of parents, binning variants by the time they arose allows for the inference of changes in average paternal and maternal generation intervals. Applying this approach to published allele age estimates, Wang et al. (2023) inferred long-lasting sex differences in average generation times and surprisingly found that ancestral generation times of West African populations remained substantially higher than those of Eurasian populations extending tens of thousands of generations into the past. Here, we argue that the results and interpretations in Wang et al. (2023) are primarily driven by noise and biases in input data and a lack of validation using independent approaches for estimating allele ages. With the recent development of methods to reconstruct genome-wide gene genealogies, coalescence times, and allele ages, we caution that downstream analyses may be strongly influenced by uncharacterized biases in their output.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
| | - Kevin R Thornton
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
| |
Collapse
|
11
|
Näsvall K, Boman J, Talla V, Backström N. Base Composition, Codon Usage, and Patterns of Gene Sequence Evolution in Butterflies. Genome Biol Evol 2023; 15:evad150. [PMID: 37565492 PMCID: PMC10462419 DOI: 10.1093/gbe/evad150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 07/17/2023] [Accepted: 08/08/2023] [Indexed: 08/12/2023] Open
Abstract
Coding sequence evolution is influenced by both natural selection and neutral evolutionary forces. In many species, the effects of mutation bias, codon usage, and GC-biased gene conversion (gBGC) on gene sequence evolution have not been detailed. Quantification of how these forces shape substitution patterns is therefore necessary to understand the strength and direction of natural selection. Here, we used comparative genomics to investigate the association between base composition and codon usage bias on gene sequence evolution in butterflies and moths (Lepidoptera), including an in-depth analysis of underlying patterns and processes in one species, Leptidea sinapis. The data revealed significant G/C to A/T substitution bias at third codon position with some variation in the strength among different butterfly lineages. However, the substitution bias was lower than expected from previously estimated mutation rate ratios, partly due to the influence of gBGC. We found that A/T-ending codons were overrepresented in most species, but there was a positive association between the magnitude of codon usage bias and GC-content in third codon positions. In addition, the tRNA-gene population in L. sinapis showed higher GC-content at third codon positions compared to coding sequences in general and less overrepresentation of A/T-ending codons. There was an inverse relationship between synonymous substitutions and codon usage bias indicating selection on synonymous sites. We conclude that the evolutionary rate in Lepidoptera is affected by a complex interaction between underlying G/C -> A/T mutation bias and partly counteracting fixation biases, predominantly conferred by overall purifying selection, gBGC, and selection on codon usage.
Collapse
Affiliation(s)
- Karin Näsvall
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| | - Jesper Boman
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| | - Venkat Talla
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| | - Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| |
Collapse
|
12
|
Johri P, Pfeifer SP, Jensen JD. Developing an evolutionary baseline model for humans: jointly inferring purifying selection with population history. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.11.536488. [PMID: 37090533 PMCID: PMC10120674 DOI: 10.1101/2023.04.11.536488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Building evolutionarily appropriate baseline models for natural populations is not only important for answering fundamental questions in population genetics - including quantifying the relative contributions of adaptive vs. non-adaptive processes - but it is also essential for identifying candidate loci experiencing relatively rare and episodic forms of selection ( e.g., positive or balancing selection). Here, a baseline model was developed for a human population of West African ancestry, the Yoruba, comprising processes constantly operating on the genome ( i.e. , purifying and background selection, population size changes, recombination rate heterogeneity, and gene conversion). Specifically, to perform joint inference of selective effects with demography, an approximate Bayesian approach was employed that utilizes the decay of background selection effects around functional elements, taking into account genomic architecture. This approach inferred a recent 6-fold population growth together with a distribution of fitness effects that is skewed towards effectively neutral mutations. Importantly, these results further suggest that, while strong and/or frequent recurrent positive selection is inconsistent with observed data, weak to moderate positive selection is consistent but unidentifiable if rare.
Collapse
|
13
|
Liao K, Carlson J, Zöllner S. The effect of mutation subtypes on the allele frequency spectrum and population genetics inference. G3 (BETHESDA, MD.) 2023; 13:jkad035. [PMID: 36759699 PMCID: PMC10085755 DOI: 10.1093/g3journal/jkad035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 01/23/2023] [Accepted: 01/26/2023] [Indexed: 02/11/2023]
Abstract
Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distribution of allele frequencies in a sample. While the frequency spectrum serves to summarize patterns of genetic variation, it implicitly assumes mutation types (A→C vs C→T) as interchangeable. However, mutations of different types arise and spread due to spatial and temporal variation in forces such as mutation rate and biased gene conversion that result in heterogeneity in the distribution of allele frequencies across sites. In this work, we explore the impact of this simplification on multiple aspects of population genetic modeling. As a site's mutation rate is strongly affected by flanking nucleotides, we defined a mutation subtype by the base pair change and adjacent nucleotides (e.g. AAA→ATA) and systematically assessed the heterogeneity in the frequency spectrum across 96 distinct 3-mer mutation subtypes using n = 3556 whole-genome sequenced individuals of European ancestry. We observed substantial variation across the subtype-specific frequency spectra, with some of the variation being influenced by molecular factors previously identified for single base mutation types. Estimates of model parameters from demographic inference performed for each mutation subtype's AFS individually varied drastically across the 96 subtypes. In local patterns of variation, a combination of regional subtype composition and local genomic factors shaped the regional frequency spectrum across genomic regions. Our results illustrate how treating variants in large sequencing samples as interchangeable may confound population genetic frameworks and encourages us to consider the unique evolutionary mechanisms of analyzed polymorphisms.
Collapse
Affiliation(s)
- Kevin Liao
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jedidiah Carlson
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA
- Department of Population Health, University of Texas at Austin, Austin, TX 78712, USA
| | - Sebastian Zöllner
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Psychiatry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
14
|
Gao Z, Zhang Y, Cramer N, Przeworski M, Moorjani P. Limited role of generation time changes in driving the evolution of the mutation spectrum in humans. eLife 2023; 12:e81188. [PMID: 36779395 PMCID: PMC10014080 DOI: 10.7554/elife.81188] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 02/02/2023] [Indexed: 02/14/2023] Open
Abstract
Recent studies have suggested that the human germline mutation rate and spectrum evolve rapidly. Variation in generation time has been linked to these changes, though its contribution remains unclear. We develop a framework to characterize temporal changes in polymorphisms within and between populations, while controlling for the effects of natural selection and biased gene conversion. Application to the 1000 Genomes Project dataset reveals multiple independent changes that arose after the split of continental groups, including a previously reported, transient elevation in TCC>TTC mutations in Europeans and novel signals of divergence in C>Gand T>A mutation rates among population samples. We also find a significant difference between groups sampled in and outside of Africa in old T>C polymorphisms that predate the out-of-Africa migration. This surprising signal is driven by TpG>CpG mutations and stems in part from mis-polarized CpG transitions, which are more likely to undergo recurrent mutations. Finally, by relating the mutation spectrum of polymorphisms to parental age effects on de novo mutations, we show that plausible changes in the generation time cannot explain the patterns observed for different mutation types jointly. Thus, other factors - genetic modifiers or environmental exposures - must have had a non-negligible impact on the human mutation landscape.
Collapse
Affiliation(s)
- Ziyue Gao
- Department of Genetics, University of Pennsylvania, Perelman School of MedicinePhiladelphiaUnited States
| | - Yulin Zhang
- Center for Computational Biology, University of California, BerkeleyBerkeleyUnited States
| | - Nathan Cramer
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - Molly Przeworski
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Systems Biology, Columbia UniversityNew YorkUnited States
| | - Priya Moorjani
- Center for Computational Biology, University of California, BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| |
Collapse
|
15
|
Wang RJ, Al-Saffar SI, Rogers J, Hahn MW. Human generation times across the past 250,000 years. SCIENCE ADVANCES 2023; 9:eabm7047. [PMID: 36608127 PMCID: PMC9821931 DOI: 10.1126/sciadv.abm7047] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The generation times of our recent ancestors can tell us about both the biology and social organization of prehistoric humans, placing human evolution on an absolute time scale. We present a method for predicting historical male and female generation times based on changes in the mutation spectrum. Our analyses of whole-genome data reveal an average generation time of 26.9 years across the past 250,000 years, with fathers consistently older (30.7 years) than mothers (23.2 years). Shifts in sex-averaged generation times have been driven primarily by changes to the age of paternity, although we report a substantial increase in female generation times in the recent past. We also find a large difference in generation times among populations, reaching back to a time when all humans occupied Africa.
Collapse
Affiliation(s)
- Richard J. Wang
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
- Corresponding author.
| | - Samer I. Al-Saffar
- Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Matthew W. Hahn
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Department of Computer Science, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
16
|
Duchemin L, Lanore V, Veber P, Boussau B. Evaluation of Methods to Detect Shifts in Directional Selection at the Genome Scale. Mol Biol Evol 2022; 40:6889995. [PMID: 36510704 PMCID: PMC9940701 DOI: 10.1093/molbev/msac247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 10/24/2022] [Accepted: 10/26/2022] [Indexed: 12/15/2022] Open
Abstract
Identifying the footprints of selection in coding sequences can inform about the importance and function of individual sites. Analyses of the ratio of nonsynonymous to synonymous substitutions (dN/dS) have been widely used to pinpoint changes in the intensity of selection, but cannot distinguish them from changes in the direction of selection, that is, changes in the fitness of specific amino acids at a given position. A few methods that rely on amino-acid profiles to detect changes in directional selection have been designed, but their performances have not been well characterized. In this paper, we investigate the performance of six of these methods. We evaluate them on simulations along empirical phylogenies in which transition events have been annotated and compare their ability to detect sites that have undergone changes in the direction or intensity of selection to that of a widely used dN/dS approach, codeml's branch-site model A. We show that all methods have reduced performance in the presence of biased gene conversion but not CpG hypermutability. The best profile method, Pelican, a new implementation of Tamuri AU, Hay AJ, Goldstein RA. (2009. Identifying changes in selective constraints: host shifts in influenza. PLoS Comput Biol. 5(11):e1000564), performs as well as codeml in a range of conditions except for detecting relaxations of selection, and performs better when tree length increases, or in the presence of persistent positive selection. It is fast, enabling genome-scale searches for site-wise changes in the direction of selection associated with phenotypic changes.
Collapse
Affiliation(s)
| | - Vincent Lanore
- Laboratoire de Biométrie et Biologie Evolutive, Univ Lyon, Univ Lyon 1, CNRS, VetAgro Sup, UMR5558, Villeurbanne, France
| | - Philippe Veber
- Laboratoire de Biométrie et Biologie Evolutive, Univ Lyon, Univ Lyon 1, CNRS, VetAgro Sup, UMR5558, Villeurbanne, France
| | | |
Collapse
|
17
|
Bergman J, Schierup MH. Evolutionary dynamics of pseudoautosomal region 1 in humans and great apes. Genome Biol 2022; 23:215. [PMID: 36253794 PMCID: PMC9575207 DOI: 10.1186/s13059-022-02784-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 09/30/2022] [Indexed: 12/03/2022] Open
Abstract
Background The pseudoautosomal region 1 (PAR1) is a 2.7 Mb telomeric region of human sex chromosomes. PAR1 has a crucial role in ensuring proper segregation of sex chromosomes during male meiosis, exposing it to extreme recombination and mutation processes. We investigate PAR1 evolution using population genomic datasets of extant humans, eight populations of great apes, and two archaic human genome sequences. Results We find that PAR1 is fast evolving and closer to evolutionary nucleotide equilibrium than autosomal telomeres. We detect a difference between substitution patterns and extant diversity in PAR1, mainly driven by the conflict between strong mutation and recombination-associated fixation bias at CpG sites. We detect excess C-to-G mutations in PAR1 of all great apes, specific to the mutagenic effect of male recombination. Despite recent evidence for Y chromosome introgression from humans into Neanderthals, we find that the Neanderthal PAR1 retained similarity to the Denisovan sequence. We find differences between substitution spectra of these archaics suggesting rapid evolution of PAR1 in recent hominin history. Frequency analysis of alleles segregating in females and males provided no evidence for recent sexual antagonism in this region. We study repeat content and double-strand break hotspot regions in PAR1 and find that they may play roles in ensuring the obligate X-Y recombination event during male meiosis. Conclusions Our study provides an unprecedented quantification of population genetic forces governing PAR1 biology across extant and extinct hominids. PAR1 evolutionary dynamics are predominantly governed by recombination processes with a strong impact on mutation patterns across all species. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02784-x.
Collapse
Affiliation(s)
- Juraj Bergman
- Bioinformatics Research Centre, Aarhus University, DK-8000, Aarhus C, Denmark.
| | | |
Collapse
|
18
|
Lee B, Cyrill SL, Lee W, Melchiotti R, Andiappan AK, Poidinger M, Rötzschke O. Analysis of archaic human haplotypes suggests that 5hmC acts as an epigenetic guide for NCO recombination. BMC Biol 2022; 20:173. [PMID: 35927700 PMCID: PMC9354366 DOI: 10.1186/s12915-022-01353-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 06/17/2022] [Indexed: 11/17/2022] Open
Abstract
Background Non-crossover (NCO) refers to a mechanism of homologous recombination in which short tracks of DNA are copied between homologue chromatids. The allelic changes are typically restricted to one or few SNPs, which potentially allow for the gradual adaptation and maturation of haplotypes. It is assumed to be a stochastic process but the analysis of archaic and modern human haplotypes revealed a striking variability in local NCO recombination rates. Methods NCO recombination rates of 1.9 million archaic SNPs shared with Denisovan hominids were defined by a linkage study and correlated with functional and genomic annotations as well as ChIP-Seq data from modern humans. Results We detected a strong correlation between NCO recombination rates and the function of the respective region: low NCO rates were evident in introns and quiescent intergenic regions but high rates in splice sites, exons, 5′- and 3′-UTRs, as well as CpG islands. Correlations with ChIP-Seq data from ENCODE and other public sources further identified epigenetic modifications that associated directly with these recombination events. A particularly strong association was observed for 5-hydroxymethylcytosine marks (5hmC), which were enriched in virtually all of the functional regions associated with elevated NCO rates, including CpG islands and ‘poised’ bivalent regions. Conclusion Our results suggest that 5hmC marks may guide the NCO machinery specifically towards functionally relevant regions and, as an intermediate of oxidative demethylation, may open a pathway for environmental influence by specifically targeting recently opened gene loci. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01353-9.
Collapse
Affiliation(s)
- Bernett Lee
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.,Present address: Lee Kong Chian School of Medicine, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
| | - Samantha Leeanne Cyrill
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.,Present address: Cold Spring Harbor Laboratory, One Bungtown Road, NY, 11724, Cold Spring Harbor, USA
| | - Wendy Lee
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
| | - Rossella Melchiotti
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
| | - Anand Kumar Andiappan
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore
| | - Michael Poidinger
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.,Present address: Murdoch Children's Research Institute, Royal Children's Hospital, Flemington Road, Parkville, Victoria, 3052, Australia
| | - Olaf Rötzschke
- Singapore Immunology Network (SIgN), Agency of Science Technology and Research (A*STAR), 8A Biomedical Drive, Singapore, 138648, Singapore.
| |
Collapse
|
19
|
Matsen FA, Ralph PL. Enabling Inference for Context-Dependent Models of Mutation by Bounding the Propagation of Dependency. J Comput Biol 2022; 29:802-824. [PMID: 35776513 PMCID: PMC9419934 DOI: 10.1089/cmb.2021.0644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Although the rates at which positions in the genome mutate are known to depend not only on the nucleotide to be mutated, but also on neighboring nucleotides, it remains challenging to do phylogenetic inference using models of context-dependent mutation. In these models, the effects of one mutation may in principle propagate to faraway locations, making it difficult to compute exact likelihoods. This article shows how to use bounds on the propagation of dependency to compute likelihoods of mutation of a given segment of genome by marginalizing over sufficiently long flanking sequence. This can be used for maximum likelihood or Bayesian inference. Protocols examining residuals and iterative model refinement are also discussed. Tools for efficiently working with these models are provided in an R package, which could be used in other applications. The method is used to examine context dependence of mutations since the common ancestor of humans and chimpanzee.
Collapse
Affiliation(s)
- Frederick A. Matsen
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Department of Genome Sciences, and University of Washington, Seattle, Washington, USA
- Department of Statistics, University of Washington, Seattle, Washington, USA
- Howard Hughes Medical Institute, Chevy Chase, Maryland, USA
| | - Peter L. Ralph
- Departments of Biology and Mathematics, Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, USA
| |
Collapse
|
20
|
Johri P, Eyre-Walker A, Gutenkunst RN, Lohmueller KE, Jensen JD. On the prospect of achieving accurate joint estimation of selection with population history. Genome Biol Evol 2022; 14:6604401. [PMID: 35675379 PMCID: PMC9254643 DOI: 10.1093/gbe/evac088] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/02/2022] [Indexed: 11/15/2022] Open
Abstract
As both natural selection and population history can affect genome-wide patterns of variation, disentangling the contributions of each has remained as a major challenge in population genetics. We here discuss historical and recent progress towards this goal—highlighting theoretical and computational challenges that remain to be addressed, as well as inherent difficulties in dealing with model complexity and model violations—and offer thoughts on potentially fruitful next steps.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | | | - Ryan N Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, USA.,Department of Human Genetics, University of California, Los Angeles, CA, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
21
|
Ho AT, Hurst LD. Unusual mammalian usage of TGA stop codons reveals that sequence conservation need not imply purifying selection. PLoS Biol 2022; 20:e3001588. [PMID: 35550630 PMCID: PMC9129041 DOI: 10.1371/journal.pbio.3001588] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 05/24/2022] [Accepted: 04/20/2022] [Indexed: 11/18/2022] Open
Abstract
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.
Collapse
Affiliation(s)
- Alexander Thomas Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- * E-mail:
| | | |
Collapse
|
22
|
Borges R, Boussau B, Szöllősi GJ, Kosiol C. Nucleotide Usage Biases Distort Inferences of the Species Tree. Genome Biol Evol 2022; 14:6496956. [PMID: 34983052 PMCID: PMC8829901 DOI: 10.1093/gbe/evab290] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/27/2021] [Indexed: 12/15/2022] Open
Abstract
Despite the importance of natural selection in species’ evolutionary history, phylogenetic methods that take into account population-level processes typically ignore selection. The assumption of neutrality is often based on the idea that selection occurs at a minority of loci in the genome and is unlikely to compromise phylogenetic inferences significantly. However, genome-wide processes like GC-bias and some variation segregating at the coding regions are known to evolve in the nearly neutral range. As we are now using genome-wide data to estimate species trees, it is natural to ask whether weak but pervasive selection is likely to blur species tree inferences. We developed a polymorphism-aware phylogenetic model tailored for measuring signatures of nucleotide usage biases to test the impact of selection in the species tree. Our analyses indicate that although the inferred relationships among species are not significantly compromised, the genetic distances are systematically underestimated in a node-height-dependent manner: that is, the deeper nodes tend to be more underestimated than the shallow ones. Such biases have implications for molecular dating. We dated the evolutionary history of 30 worldwide fruit fly populations, and we found signatures of GC-bias considerably affecting the estimated divergence times (up to 23%) in the neutral model. Our findings call for the need to account for selection when quantifying divergence or dating species evolution.
Collapse
Affiliation(s)
- Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | - Bastien Boussau
- Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5558, LBBE, Villeurbanne, France
| | - Gergely J Szöllősi
- Department of Biological Physics, Eötvös University, Budapest , Hungary.,MTA-ELTE "Lendület" Evolutionary Genomics Research Group, Budapest, Hungary.,Evolutionary Systems Research Group, Centre for Ecological Research, Hungarian Academy of Sciences, Tihany, Hungary
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.,Centre for Biological Diversity, University of St Andrews, St Andrews, United Kingdom
| |
Collapse
|
23
|
Agarwal I, Przeworski M. Mutation saturation for fitness effects at human CpG sites. eLife 2021; 10:e71513. [PMID: 34806592 PMCID: PMC8683084 DOI: 10.7554/elife.71513] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 11/21/2021] [Indexed: 01/06/2023] Open
Abstract
Whole exome sequences have now been collected for millions of humans, with the related goals of identifying pathogenic mutations in patients and establishing reference repositories of data from unaffected individuals. As a result, we are approaching an important limit, in which datasets are large enough that, in the absence of natural selection, every highly mutable site will have experienced at least one mutation in the genealogical history of the sample. Here, we focus on CpG sites that are methylated in the germline and experience mutations to T at an elevated rate of ~10-7 per site per generation; considering synonymous mutations in a sample of 390,000 individuals, ~ 99 % of such CpG sites harbor a C/T polymorphism. Methylated CpG sites provide a natural mutation saturation experiment for fitness effects: as we show, at nt sample sizes, not seeing a non-synonymous polymorphism is indicative of strong selection against that mutation. We rely on this idea in order to directly identify a subset of CpG transitions that are likely to be highly deleterious, including ~27 % of possible loss-of-function mutations, and up to 20 % of possible missense mutations, depending on the type of functional site in which they occur. Unlike methylated CpGs, most mutation types, with rates on the order of 10-8 or 10-9, remain very far from saturation. We discuss what these findings imply for interpreting the potential clinical relevance of mutations from their presence or absence in reference databases and for inferences about the fitness effects of new mutations.
Collapse
Affiliation(s)
- Ipsita Agarwal
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
| | - Molly Przeworski
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Systems Biology, Columbia UniversityNew YorkUnited States
| |
Collapse
|
24
|
Daron J, Bravo IG. Variability in Codon Usage in Coronaviruses Is Mainly Driven by Mutational Bias and Selective Constraints on CpG Dinucleotide. Viruses 2021; 13:v13091800. [PMID: 34578381 PMCID: PMC8473333 DOI: 10.3390/v13091800] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/18/2022] Open
Abstract
The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the third human-emerged virus of the 21st century from the Coronaviridae family, causing the ongoing coronavirus disease 2019 (COVID-19) pandemic. Due to the high zoonotic potential of coronaviruses, it is critical to unravel their evolutionary history of host species breadth, host-switch potential, adaptation and emergence, to identify viruses posing a pandemic risk in humans. We present here a comprehensive analysis of the composition and codon usage bias of the 82 Orthocoronavirinae members, infecting 47 different avian and mammalian hosts. Our results clearly establish that synonymous codon usage varies widely among viruses, is only weakly dependent on their primary host, and is dominated by mutational bias towards AU-enrichment and by CpG avoidance. Indeed, variation in GC3 explains around 34%, while variation in CpG frequency explains around 14% of total variation in codon usage bias. Further insight on the mutational equilibrium within Orthocoronavirinae revealed that most coronavirus genomes are close to their neutral equilibrium, the exception being the three recently infecting human coronaviruses, which lie further away from the mutational equilibrium than their endemic human coronavirus counterparts. Finally, our results suggest that, while replicating in humans, SARS-CoV-2 is slowly becoming AU-richer, likely until attaining a new mutational equilibrium.
Collapse
Affiliation(s)
- Josquin Daron
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
- Correspondence:
| | - Ignacio G. Bravo
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
- Center for Research on the Ecology and Evolution of Diseases (CREES), 34394 Montpellier, France
| |
Collapse
|
25
|
Jackson B, Charlesworth B. Evidence for a force favoring GC over AT at short intronic sites in Drosophila simulans and Drosophila melanogaster. G3 GENES|GENOMES|GENETICS 2021; 11:6321237. [PMID: 34544137 PMCID: PMC8496279 DOI: 10.1093/g3journal/jkab240] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Accepted: 07/06/2021] [Indexed: 11/13/2022]
Abstract
Population genetics studies often make use of a class of nucleotide site free from selective pressures, in order to make inferences about population size changes or natural selection at other sites. If such neutral sites can be identified, they offer the opportunity to avoid any confounding effects of selection. Here, we investigate evolution at putatively neutrally evolving short intronic sites in natural populations of Drosophila melanogaster and Drosophila simulans, in order to understand the properties of spontaneous mutations and the extent of GC-biased gene conversion in these species. Use of data on the genetics of natural populations is advantageous because it integrates information from large numbers of individuals over long timescales. In agreement with direct evidence from observations of spontaneous mutations in Drosophila, we find a bias in the spectrum of mutations toward AT basepairs. In addition, we find that this bias is stronger in the D. melanogaster lineage than in the D. simulans lineage. The evidence for GC-biased gene conversion in Drosophila has been equivocal. Here, we provide evidence for a weak force favoring GC in both species, which is correlated with the GC content of introns and is stronger in D. simulans than in D. melanogaster.
Collapse
Affiliation(s)
- Ben Jackson
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Brian Charlesworth
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
26
|
Bergman J, Schierup MH. Population dynamics of GC-changing mutations in humans and great apes. Genetics 2021; 218:6291657. [PMID: 34081117 DOI: 10.1093/genetics/iyab083] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 05/27/2021] [Indexed: 11/14/2022] Open
Abstract
The nucleotide composition of the genome is a balance between origin and fixation rates of different mutations. For example, it is well-known that transitions occur more frequently than transversions, particularly at CpG sites. Differences in fixation rates of mutation types are less explored. Specifically, recombination-associated GC-biased gene conversion (gBGC) may differentially impact GC-changing mutations, due to differences in their genomic distributions and efficiency of mismatch repair mechanisms. Given that recombination evolves rapidly across species, we explore gBGC of different mutation types across human populations and great ape species. We report a stronger correlation between segregating GC frequency and recombination for transitions than for transversions. Notably, CpG transitions are most strongly affected by gBGC in humans and chimpanzees. We show that the overall strength of gBGC is generally correlated with effective population sizes in humans, with some notable exceptions, such as a stronger effect of gBGC on non-CpG transitions in populations of European descent. Furthermore, species of the Gorilla and Pongo genus have a greatly reduced gBGC effect on CpG sites. We also study the dependence of gBGC dynamics on flanking nucleotides and show that some mutation types evolve in opposition to the gBGC expectation, likely due to hypermutability of specific nucleotide contexts. Our results highlight the importance of different gBGC dynamics experienced by GC-changing mutations and their impact on nucleotide composition evolution.
Collapse
Affiliation(s)
- Juraj Bergman
- Bioinformatics Research Institute, Aarhus University, DK-8000 Aarhus C, Denmark
| | | |
Collapse
|
27
|
Sweet DR, Lam C, Jain MK. Evolutionary Protection of Krüppel-Like Factors 2 and 4 in the Development of the Mature Hemovascular System. Front Cardiovasc Med 2021; 8:645719. [PMID: 34079826 PMCID: PMC8165158 DOI: 10.3389/fcvm.2021.645719] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 04/21/2021] [Indexed: 02/02/2023] Open
Abstract
A properly functioning hemovascular system, consisting of circulating innate immune cells and endothelial cells (ECs), is essential in the distribution of nutrients to distant tissues while ensuring protection from invading pathogens. Professional phagocytes (e.g., macrophages) and ECs have co-evolved in vertebrates to adapt to increased physiological demands. Intercellular interactions between components of the hemovascular system facilitate numerous functions in physiology and disease in part through the utilization of shared signaling pathways and factors. Krüppel-like factors (KLFs) 2 and 4 are two such transcription factors with critical roles in both cellular compartments. Decreased expression of either factor in myeloid or endothelial cells increases susceptibility to a multitude of inflammatory diseases, underscoring the essential role for their expression in maintaining cellular quiescence. Given the close evolutionary relationship between macrophages and ECs, along with their shared utilization of KLF2 and 4, we hypothesize that KLF genes evolved in such a way that protected their expression in myeloid and endothelial cells. Within this Perspective, we review the roles of KLF2 and 4 in the hemovascular system and explore evolutionary trends in their nucleotide composition that suggest a coordinated protection that corresponds with the development of mature myeloid and endothelial systems.
Collapse
Affiliation(s)
- David R Sweet
- Case Cardiovascular Research Institute, Case Western Reserve University, Cleveland, OH, United States.,Harrington Heart and Vascular Institute, University Hospitals Cleveland Medical Center, Cleveland, OH, United States.,Department of Pathology, Case Western Reserve University, Cleveland, OH, United States
| | - Cherry Lam
- Department of Biology, New York University, New York, NY, United States
| | - Mukesh K Jain
- Case Cardiovascular Research Institute, Case Western Reserve University, Cleveland, OH, United States.,Harrington Heart and Vascular Institute, University Hospitals Cleveland Medical Center, Cleveland, OH, United States
| |
Collapse
|
28
|
Boman J, Mugal CF, Backström N. The Effects of GC-Biased Gene Conversion on Patterns of Genetic Diversity among and across Butterfly Genomes. Genome Biol Evol 2021; 13:evab064. [PMID: 33760095 PMCID: PMC8175052 DOI: 10.1093/gbe/evab064] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/22/2021] [Indexed: 12/28/2022] Open
Abstract
Recombination reshuffles the alleles of a population through crossover and gene conversion. These mechanisms have considerable consequences on the evolution and maintenance of genetic diversity. Crossover, for example, can increase genetic diversity by breaking the linkage between selected and nearby neutral variants. Bias in favor of G or C alleles during gene conversion may instead promote the fixation of one allele over the other, thus decreasing diversity. Mutation bias from G or C to A and T opposes GC-biased gene conversion (gBGC). Less recognized is that these two processes may-when balanced-promote genetic diversity. Here, we investigate how gBGC and mutation bias shape genetic diversity patterns in wood white butterflies (Leptidea sp.). This constitutes the first in-depth investigation of gBGC in butterflies. Using 60 resequenced genomes from six populations of three species, we find substantial variation in the strength of gBGC across lineages. When modeling the balance of gBGC and mutation bias and comparing analytical results with empirical data, we reject gBGC as the main determinant of genetic diversity in these butterfly species. As alternatives, we consider linked selection and GC content. We find evidence that high values of both reduce diversity. We also show that the joint effects of gBGC and mutation bias can give rise to a diversity pattern which resembles the signature of linked selection. Consequently, gBGC should be considered when interpreting the effects of linked selection on levels of genetic diversity.
Collapse
Affiliation(s)
- Jesper Boman
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
| | - Carina F Mugal
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
| | - Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
| |
Collapse
|
29
|
de Oliveira JL, Morales AC, Hurst LD, Urrutia AO, Thompson CRL, Wolf JB. Inferring Adaptive Codon Preference to Understand Sources of Selection Shaping Codon Usage Bias. Mol Biol Evol 2021; 38:3247-3266. [PMID: 33871580 PMCID: PMC8321536 DOI: 10.1093/molbev/msab099] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Alternative synonymous codons are often used at unequal frequencies. Classically, studies of such codon usage bias (CUB) attempted to separate the impact of neutral from selective forces by assuming that deviations from a predicted neutral equilibrium capture selection. However, GC-biased gene conversion (gBGC) can also cause deviation from a neutral null. Alternatively, selection has been inferred from CUB in highly expressed genes, but the accuracy of this approach has not been extensively tested, and gBGC can interfere with such extrapolations (e.g., if expression and gene conversion rates covary). It is therefore critical to examine deviations from a mutational null in a species with no gBGC. To achieve this goal, we implement such an analysis in the highly AT rich genome of Dictyostelium discoideum, where we find no evidence of gBGC. We infer neutral CUB under mutational equilibrium to quantify "adaptive codon preference," a nontautologous genome wide quantitative measure of the relative selection strength driving CUB. We observe signatures of purifying selection consistent with selection favoring adaptive codon preference. Preferred codons are not GC rich, underscoring the independence from gBGC. Expression-associated "preference" largely matches adaptive codon preference but does not wholly capture the influence of selection shaping patterns across all genes, suggesting selective constraints associated specifically with high expression. We observe patterns consistent with effects on mRNA translation and stability shaping adaptive codon preference. Thus, our approach to quantifying adaptive codon preference provides a framework for inferring the sources of selection that shape CUB across different contexts within the genome.
Collapse
Affiliation(s)
- Janaina Lima de Oliveira
- Instituto de Biologia, Universidade Federal da Bahia, Salvador, Bahia, 40170-115, Brazil.,Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | - Atahualpa Castillo Morales
- Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | - Laurence D Hurst
- Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| | - Araxi O Urrutia
- Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK.,Instituto de Ecologia, UNAM, Ciudad de Mexico 04510, Mexico
| | - Christopher R L Thompson
- Centre for Life's Origins and Evolution, Department of Genetics, Evolution and Environment, University College London, Darwin Building, Gower Street, London, WC1E 6BT, UK
| | - Jason B Wolf
- Milner Centre for Evolution and Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, BA2 7AY, UK
| |
Collapse
|
30
|
Abstract
Recombination increases the local GC-content in genomic regions through GC-biased gene conversion (gBGC). The recent discovery of a large genomic region with extreme GC-content in the fat sand rat Psammomys obesus provides a model to study the effects of gBGC on chromosome evolution. Here, we compare the GC-content and GC-to-AT substitution patterns across protein-coding genes of four gerbil species and two murine rodents (mouse and rat). We find that the known high-GC region is present in all the gerbils, and is characterized by high substitution rates for all mutational categories (AT-to-GC, GC-to-AT, and GC-conservative) both at synonymous and nonsynonymous sites. A higher AT-to-GC than GC-to-AT rate is consistent with the high GC-content. Additionally, we find more than 300 genes outside the known region with outlying values of AT-to-GC synonymous substitution rates in gerbils. Of these, over 30% are organized into at least 17 large clusters observable at the megabase-scale. The unusual GC-skewed substitution pattern suggests the evolution of genomic regions with very high recombination rates in the gerbil lineage, which can lead to a runaway increase in GC-content. Our results imply that rapid evolution of GC-content is possible in mammals, with gerbil species providing a powerful model to study the mechanisms of gBGC.
Collapse
Affiliation(s)
- Rodrigo Pracana
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| | | | - John F Mulley
- School of Natural Sciences, Bangor University, Bangor, Gwynedd, United Kingdom
| | | |
Collapse
|
31
|
Suurväli J, Whiteley AR, Zheng Y, Gharbi K, Leptin M, Wiehe T. The Laboratory Domestication of Zebrafish: From Diverse Populations to Inbred Substrains. Mol Biol Evol 2021; 37:1056-1069. [PMID: 31808937 PMCID: PMC7086173 DOI: 10.1093/molbev/msz289] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
We know from human genetic studies that practically all aspects of biology are strongly influenced by the genetic background, as reflected in the advent of “personalized medicine.” Yet, with few exceptions, this is not taken into account when using laboratory populations as animal model systems for research in these fields. Laboratory strains of zebrafish (Danio rerio) are widely used for research in vertebrate developmental biology, behavior, and physiology, for modeling diseases, and for testing pharmaceutic compounds in vivo. However, all of these strains are derived from artificial bottleneck events and therefore are likely to represent only a fraction of the genetic diversity present within the species. Here, we use restriction site-associated DNA sequencing to genetically characterize wild populations of zebrafish from India, Nepal, and Bangladesh, and to compare them to previously published data on four common laboratory strains. We measured nucleotide diversity, heterozygosity, and allele frequency spectra, and find that wild zebrafish are much more diverse than laboratory strains. Further, in wild zebrafish, there is a clear signal of GC-biased gene conversion that is missing in laboratory strains. We also find that zebrafish populations in Nepal and Bangladesh are most distinct from all other strains studied, making them an attractive subject for future studies of zebrafish population genetics and molecular ecology. Finally, isolates of the same strains kept in different laboratories show a pattern of ongoing differentiation into genetically distinct substrains. Together, our findings broaden the basis for future genetic, physiological, pharmaceutic, and evolutionary studies in Danio rerio.
Collapse
Affiliation(s)
- Jaanus Suurväli
- Institute for Genetics, University of Cologne, Cologne, Germany
| | - Andrew R Whiteley
- Wildlife Biology Program, Department of Ecosystem and Conservation Sciences, College of Forestry and Conservation, University of Montana, Missoula, MT
| | - Yichen Zheng
- Institute for Genetics, University of Cologne, Cologne, Germany
| | - Karim Gharbi
- Edinburgh Genomics, Ashworth Laboratories, University of Edinburgh, Edinburgh, United Kingdom.,Earlham Institute, Norwich Research Park, Norwich, United Kingdom
| | - Maria Leptin
- Institute for Genetics, University of Cologne, Cologne, Germany
| | - Thomas Wiehe
- Institute for Genetics, University of Cologne, Cologne, Germany
| |
Collapse
|
32
|
Hayes K, Barton HJ, Zeng K. A Study of Faster-Z Evolution in the Great Tit (Parus major). Genome Biol Evol 2021; 12:210-222. [PMID: 32119100 PMCID: PMC7144363 DOI: 10.1093/gbe/evaa044] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/26/2020] [Indexed: 12/17/2022] Open
Abstract
Sex chromosomes contribute substantially to key evolutionary processes such as speciation and adaptation. Several theories suggest that evolution could occur more rapidly on sex chromosomes, but currently our understanding of whether and how this occurs is limited. Here, we present an analysis of the great tit (Parus major) genome, aiming to detect signals of faster-Z evolution. We find mixed evidence of faster divergence on the Z chromosome than autosomes, with significantly higher divergence being found in ancestral repeats, but not at 4- or 0-fold degenerate sites. Interestingly, some 4-fold sites appear to be selectively constrained, which may mislead analyses that use these sites as the neutral reference (e.g., dN/dS). Consistent with other studies in birds, the mutation rate is significantly higher in males than females, and the long-term Z-to-autosome effective population size ratio is only 0.5, significantly lower than the expected value of 0.75. These are indicative of male-driven evolution and high variance in male reproductive success, respectively. We find no evidence for an increased efficacy of positive selection on the Z chromosome. In contrast, the Z chromosome in great tits appears to be affected by increased genetic drift, which has led to detectable signals of weakened intensity of purifying selection. These results provide further evidence that the Z chromosome often has a low effective population size, and that this has important consequences for its evolution. They also highlight the importance of considering multiple factors that can affect the rate of evolution and effective population sizes of sex chromosomes.
Collapse
Affiliation(s)
- Kai Hayes
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| | - Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom.,Organismal and Evolutionary Biology Research Program, University of Helsinki, Finland
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| |
Collapse
|
33
|
Abstract
Drosophila melanogaster, a small dipteran of African origin, represents one of the best-studied model organisms. Early work in this system has uniquely shed light on the basic principles of genetics and resulted in a versatile collection of genetic tools that allow to uncover mechanistic links between genotype and phenotype. Moreover, given its worldwide distribution in diverse habitats and its moderate genome-size, Drosophila has proven very powerful for population genetics inference and was one of the first eukaryotes whose genome was fully sequenced. In this book chapter, we provide a brief historical overview of research in Drosophila and then focus on recent advances during the genomic era. After describing different types and sources of genomic data, we discuss mechanisms of neutral evolution including the demographic history of Drosophila and the effects of recombination and biased gene conversion. Then, we review recent advances in detecting genome-wide signals of selection, such as soft and hard selective sweeps. We further provide a brief introduction to background selection, selection of noncoding DNA and codon usage and focus on the role of structural variants, such as transposable elements and chromosomal inversions, during the adaptive process. Finally, we discuss how genomic data helps to dissect neutral and adaptive evolutionary mechanisms that shape genetic and phenotypic variation in natural populations along environmental gradients. In summary, this book chapter serves as a starting point to Drosophila population genomics and provides an introduction to the system and an overview to data sources, important population genetic concepts and recent advances in the field.
Collapse
|
34
|
Abstract
Mutation of the human genome results in three classes of genomic variation: single nucleotide variants; short insertions or deletions; and large structural variants (SVs). Some mutations occur during normal processes, such as meiotic recombination or B cell development, and others result from DNA replication or aberrant repair of breaks in sequence-specific contexts. Regardless of mechanism, mutations are subject to selection, and some hotspots can manifest in disease. Here, we discuss genomic regions prone to mutation, mechanisms contributing to mutation susceptibility, and the processes leading to their accumulation in normal and somatic genomes. With further, more accurate human genome sequencing, additional mutation hotspots, mechanistic details of their formation, and the relevance of hotspots to evolution and disease are likely to be discovered.
Collapse
|
35
|
Lin ZJ, Wang X, Wang J, Tan Y, Tang X, Werren JH, Zhang D, Wang X. Comparative analysis reveals the expansion of mitochondrial DNA control region containing unusually high G-C tandem repeat arrays in Nasonia vitripennis. Int J Biol Macromol 2020; 166:1246-1257. [PMID: 33159940 DOI: 10.1016/j.ijbiomac.2020.11.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2020] [Revised: 11/01/2020] [Accepted: 11/02/2020] [Indexed: 11/25/2022]
Abstract
Insect mitochondrial DNA (mtDNA) ranges from 14 to 19 kbp, and the size difference is attributed to the AT-rich control region. Jewel wasps have a parasitoid lifestyle, which may affect mitochondria function and evolution. We sequenced, assembled, and annotated mitochondrial genomes in Nasonia and outgroup species. Gene composition and order are conserved within Nasonia, but they differ from other parasitoids by two large inversion events that were not reported before. We observed a much higher substitution rate relative to the nuclear genome and mitochondrial introgression between N. giraulti and N. oneida, which is consistent with previous studies. Most strikingly, N. vitripennis mtDNA has an extremely long control region (7665 bp), containing twenty-nine 217 bp tandem repeats and can fold into a super-cruciform structure. In contrast to tandem repeats commonly found in other mitochondria, these high-copy repeats are highly conserved (98.7% sequence identity), much longer in length (approximately 8 Kb), extremely GC-rich (50.7%), and CpG-rich (percent CpG 19.4% vs. 1.1% in coding region), resulting in a 23 kbp mtDNA beyond the typical size range in insects. These N. vitripennis-specific mitochondrial repeats are not related to any known sequences in insect mitochondria. Their evolutionary origin and functional consequences warrant further investigations.
Collapse
Affiliation(s)
- Zi Jie Lin
- Department of Chemistry, Columbus State University, Columbus, GA 31909, United States of America
| | - Xiaozhu Wang
- Department of Pathobiology, College of Veterinary Medicine, Auburn University, Auburn, AL 36849, United States of America
| | - Jinbin Wang
- Institute of Biotechnology Research, Shanghai Academy of Agricultural Sciences, Shanghai 201106, China
| | - Yongjun Tan
- Department of Biology, College of Arts & Sciences, Saint Louis University, St. Louis, MO 63103, United States of America
| | - Xueming Tang
- Institute of Biotechnology Research, Shanghai Academy of Agricultural Sciences, Shanghai 201106, China
| | - John H Werren
- Department of Biology, University of Rochester, Rochester, NY 14627, United States of America
| | - Dapeng Zhang
- Department of Biology, College of Arts & Sciences, Saint Louis University, St. Louis, MO 63103, United States of America
| | - Xu Wang
- Department of Pathobiology, College of Veterinary Medicine, Auburn University, Auburn, AL 36849, United States of America; HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States of America; Alabama Agricultural Experiment Station, Auburn University, Auburn, AL 36849, United States of America; Department of Entomology and Plant Pathology, Auburn University, Auburn, AL 36849, United States of America.
| |
Collapse
|
36
|
Hämälä T, Tiffin P. Biased Gene Conversion Constrains Adaptation in Arabidopsis thaliana. Genetics 2020; 215:831-846. [PMID: 32414868 PMCID: PMC7337087 DOI: 10.1534/genetics.120.303335] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Accepted: 05/14/2020] [Indexed: 02/01/2023] Open
Abstract
Reduction of fitness due to deleterious mutations imposes a limit to adaptive evolution. By characterizing features that influence this genetic load we may better understand constraints on responses to both natural and human-mediated selection. Here, using whole-genome, transcriptome, and methylome data from >600 Arabidopsis thaliana individuals, we set out to identify important features influencing selective constraint. Our analyses reveal that multiple factors underlie the accumulation of maladaptive mutations, including gene expression level, gene network connectivity, and gene-body methylation. We then focus on a feature with major effect, nucleotide composition. The ancestral vs. derived status of segregating alleles suggests that GC-biased gene conversion, a recombination-associated process that increases the frequency of G and C nucleotides regardless of their fitness effects, shapes sequence patterns in A. thaliana Through estimation of mutational effects, we present evidence that biased gene conversion hinders the purging of deleterious mutations and contributes to a genome-wide signal of decreased efficacy of selection. By comparing these results to two outcrossing relatives, Arabidopsis lyrata and Capsella grandiflora, we find that protein evolution in A. thaliana is as strongly affected by biased gene conversion as in the outcrossing species. Last, we perform simulations to show that natural levels of outcrossing in A. thaliana are sufficient to facilitate biased gene conversion despite increased homozygosity due to selfing. Together, our results show that even predominantly selfing taxa are susceptible to biased gene conversion, suggesting that it may constitute an important constraint to adaptation among plant species.
Collapse
Affiliation(s)
- Tuomas Hämälä
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota 55108
| | - Peter Tiffin
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota 55108
| |
Collapse
|
37
|
Wilken SE, Seppälä S, Lankiewicz TS, Saxena M, Henske JK, Salamov AA, Grigoriev IV, O’Malley MA. Genomic and proteomic biases inform metabolic engineering strategies for anaerobic fungi. Metab Eng Commun 2020; 10:e00107. [PMID: 31799118 PMCID: PMC6883316 DOI: 10.1016/j.mec.2019.e00107] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2019] [Revised: 10/24/2019] [Accepted: 11/04/2019] [Indexed: 12/22/2022] Open
Abstract
Anaerobic fungi (Neocallimastigomycota) are emerging non-model hosts for biotechnology due to their wealth of biomass-degrading enzymes, yet tools to engineer these fungi have not yet been established. Here, we show that the anaerobic gut fungi have the most GC depleted genomes among 443 sequenced organisms in the fungal kingdom, which has ramifications for heterologous expression of genes as well as for emerging CRISPR-based genome engineering approaches. Comparative genomic analyses suggest that anaerobic fungi may contain cellular machinery to aid in sexual reproduction, yet a complete mating pathway was not identified. Predicted proteomes of the anaerobic fungi also contain an unusually large fraction of proteins with homopolymeric amino acid runs consisting of five or more identical consecutive amino acids. In particular, threonine runs are especially enriched in anaerobic fungal carbohydrate active enzymes (CAZymes) and this, together with a high abundance of predicted N-glycosylation motifs, suggests that gut fungal CAZymes are heavily glycosylated, which may impact heterologous production of these biotechnologically useful enzymes. Finally, we present a codon optimization strategy to aid in the development of genetic engineering tools tailored to these early-branching anaerobic fungi.
Collapse
Affiliation(s)
- St. Elmo Wilken
- Department of Chemical Engineering, University of California, Santa Barbara, CA, 93106, USA
| | - Susanna Seppälä
- Department of Chemical Engineering, University of California, Santa Barbara, CA, 93106, USA
| | - Thomas S. Lankiewicz
- Department of Chemical Engineering, University of California, Santa Barbara, CA, 93106, USA
- Department of Evolution Ecology and Marine Biology, University of California, Santa Barbara, CA, 93106, USA
| | - Mohan Saxena
- Department of Chemical Engineering, University of California, Santa Barbara, CA, 93106, USA
| | - John K. Henske
- Department of Chemical Engineering, University of California, Santa Barbara, CA, 93106, USA
| | - Asaf A. Salamov
- US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Igor V. Grigoriev
- US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Michelle A. O’Malley
- Department of Chemical Engineering, University of California, Santa Barbara, CA, 93106, USA
| |
Collapse
|
38
|
Vu HT, Tran N, Nguyen TD, Vu QL, Bui MH, Le MT, Le L. Complete Chloroplast Genome of Paphiopedilum delenatii and Phylogenetic Relationships among Orchidaceae. PLANTS (BASEL, SWITZERLAND) 2020; 9:E61. [PMID: 31906501 PMCID: PMC7020410 DOI: 10.3390/plants9010061] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Revised: 12/23/2019] [Accepted: 12/25/2019] [Indexed: 02/05/2023]
Abstract
Paphiopedilum delenatii is a native orchid of Vietnam with highly attractive floral traits. Unfortunately, it is now listed as a critically endangered species with a few hundred individuals remaining in nature. In this study, we performed next-generation sequencing of P. delenatii and assembled its complete chloroplast genome. The whole chloroplast genome of P. delenatii was 160,955 bp in size, 35.6% of which was GC content, and exhibited typical quadripartite structure of plastid genomes with four distinct regions, including the large and small single-copy regions and a pair of inverted repeat regions. There were, in total, 130 genes annotated in the genome: 77 coding genes, 39 tRNA genes, 8 rRNA genes, and 6 pseudogenes. The loss of ndh genes and variation in inverted repeat (IR) boundaries as well as data of simple sequence repeats (SSRs) and divergent hotspots provided useful information for identification applications and phylogenetic studies of Paphiopedilum species. Whole chloroplast genomes could be used as an effective super barcode for species identification or for developing other identification markers, which subsequently serves the conservation of Paphiopedilum species.
Collapse
Affiliation(s)
- Huyen-Trang Vu
- Faculty of Biotechnology, Nguyen Tat Thanh University, District 4, Hochiminh City 72820, Vietnam; (H.-T.V.); (T.-D.N.); (M.-H.B.)
- Faculty of Biotechnology, International University-Vietnam National University, Linh Trung Ward, Thu Duc District, Hochiminh City 7000000, Vietnam; (N.T.); (M.-T.L.)
| | - Ngan Tran
- Faculty of Biotechnology, International University-Vietnam National University, Linh Trung Ward, Thu Duc District, Hochiminh City 7000000, Vietnam; (N.T.); (M.-T.L.)
| | - Thanh-Diem Nguyen
- Faculty of Biotechnology, Nguyen Tat Thanh University, District 4, Hochiminh City 72820, Vietnam; (H.-T.V.); (T.-D.N.); (M.-H.B.)
| | - Quoc-Luan Vu
- Tay Nguyen Institute for Scientific Research, Vietnam Academy of Science and Technology, Dalat 670000, Vietnam;
| | - My-Huyen Bui
- Faculty of Biotechnology, Nguyen Tat Thanh University, District 4, Hochiminh City 72820, Vietnam; (H.-T.V.); (T.-D.N.); (M.-H.B.)
| | - Minh-Tri Le
- Faculty of Biotechnology, International University-Vietnam National University, Linh Trung Ward, Thu Duc District, Hochiminh City 7000000, Vietnam; (N.T.); (M.-T.L.)
| | - Ly Le
- Faculty of Biotechnology, International University-Vietnam National University, Linh Trung Ward, Thu Duc District, Hochiminh City 7000000, Vietnam; (N.T.); (M.-T.L.)
- Vingroup Big Data Institute, Hai Ba Trung District, Hanoi 100000, Vietnam
| |
Collapse
|
39
|
Subramanian S. Population size influences the type of nucleotide variations in humans. BMC Genet 2019; 20:93. [PMID: 31805852 PMCID: PMC6894472 DOI: 10.1186/s12863-019-0798-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Accepted: 12/01/2019] [Indexed: 11/24/2022] Open
Abstract
Background It is well known that the effective size of a population (Ne) is one of the major determinants of the amount of genetic variation within the population. However, it is unclear whether the types of genetic variations are also dictated by the effective population size. To examine this, we obtained whole genome data from over 100 populations of the world and investigated the patterns of mutational changes. Results Our results revealed that for low frequency variants, the ratio of AT→GC to GC→AT variants (β) was similar across populations, suggesting the similarity of the pattern of mutation in various populations. However, for high frequency variants, β showed a positive correlation with the effective population size of the populations. This suggests a much higher proportion of high frequency AT→GC variants in large populations (e.g. Africans) compared to those with small population sizes (e.g. Asians). These results imply that the substitution patterns vary significantly between populations. These findings could be explained by the effect of GC-biased gene conversion (gBGC), which favors the fixation of G/C over A/T variants in populations. In large population, gBGC causes high β. However, in small populations, genetic drift reduces the effect of gBGC resulting in reduced β. This was further confirmed by a positive relationship between Ne and β for homozygous variants. Conclusions Our results highlight the huge variation in the types of homozygous and high frequency polymorphisms between world populations. We observed the same pattern for deleterious variants, implying that the homozygous polymorphisms associated with recessive genetic diseases will be more enriched with G or C in populations with large Ne (e.g. Africans) than in populations with small Ne (e.g. Europeans).
Collapse
Affiliation(s)
- Sankar Subramanian
- GeneCology Research Centre, University of the Sunshine Coast, 90 Sippy Downs Drive, Sippy Downs, QLD 4556, Australia.
| |
Collapse
|
40
|
Massip F, Laurent M, Brossas C, Fernández-Justel JM, Gómez M, Prioleau MN, Duret L, Picard F. Evolution of replication origins in vertebrate genomes: rapid turnover despite selective constraints. Nucleic Acids Res 2019; 47:5114-5125. [PMID: 30916335 PMCID: PMC6547456 DOI: 10.1093/nar/gkz182] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Revised: 02/28/2019] [Accepted: 03/11/2019] [Indexed: 01/03/2023] Open
Abstract
The replication program of vertebrate genomes is driven by the chromosomal distribution and timing of activation of tens of thousands of replication origins. Genome-wide studies have shown the association of origins with promoters and CpG islands, and their enrichment in G-quadruplex motifs (G4). However, the genetic determinants driving their activity remain poorly understood. To gain insight on the constraints operating on origins, we conducted the first evolutionary comparison of origins across vertebrates. We generated a genome-wide map of chicken origins (the first of a bird genome), and performed a comparison with human and mouse maps. The analysis of intra-species polymorphism revealed a strong depletion of genetic diversity at the core of replication initiation loci. This depletion is not linked to the presence of G4 motifs, promoters or CpG islands. In contrast, we show that origins experienced a rapid turnover during vertebrate evolution, since pairwise comparisons of origin maps revealed that <24% of them are conserved among vertebrates. This study unravels the existence of a novel determinant of origins, the precise functional role of which remains to be determined. Despite the importance of replication initiation for the fitness of organisms, the distribution of origins along vertebrate chromosomes is highly flexible.
Collapse
Affiliation(s)
- Florian Massip
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villleurbanne, France.,Berlin Institute for Medical Systems Biology, Max Delbrueck Center for Molecular Medicine, Berlin, Germany
| | - Marc Laurent
- Institut Jacques Monod, CNRS UMR7592, Université Paris Diderot, Equipe Labellisée Association pour la Recherche sur le Cancer, Paris, France
| | - Caroline Brossas
- Institut Jacques Monod, CNRS UMR7592, Université Paris Diderot, Equipe Labellisée Association pour la Recherche sur le Cancer, Paris, France
| | | | - María Gómez
- Centro de Biología Molecular Severo Ochoa CBMSO (CSIC/UAM). Nicolás Cabrera 1, 28049 Madrid, Spain
| | - Marie-Noelle Prioleau
- Institut Jacques Monod, CNRS UMR7592, Université Paris Diderot, Equipe Labellisée Association pour la Recherche sur le Cancer, Paris, France
| | - Laurent Duret
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villleurbanne, France
| | - Franck Picard
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villleurbanne, France
| |
Collapse
|
41
|
V. Barroso G, Puzović N, Dutheil JY. Inference of recombination maps from a single pair of genomes and its application to ancient samples. PLoS Genet 2019; 15:e1008449. [PMID: 31725722 PMCID: PMC6879166 DOI: 10.1371/journal.pgen.1008449] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 11/26/2019] [Accepted: 09/30/2019] [Indexed: 12/11/2022] Open
Abstract
Understanding the causes and consequences of recombination landscape evolution is a fundamental goal in genetics that requires recombination maps from across the tree of life. Such maps can be obtained from population genomic datasets, but require large sample sizes. Alternative methods are therefore necessary to research organisms where such datasets cannot be generated easily, such as non-model or ancient species. Here we extend the sequentially Markovian coalescent model to jointly infer demography and the spatial variation in recombination rate. Using extensive simulations and sequence data from humans, fruit-flies and a fungal pathogen, we demonstrate that iSMC accurately infers recombination maps under a wide range of scenarios-remarkably, even from a single pair of unphased genomes. We exploit this possibility and reconstruct the recombination maps of ancient hominins. We report that the ancient and modern maps are correlated in a manner that reflects the established phylogeny of Neanderthals, Denisovans, and modern human populations.
Collapse
Affiliation(s)
- Gustavo V. Barroso
- Max Planck Institute for Evolutionary Biology, Department of Evolutionary Genetics, August-Thienemann-Straße , Plön–GERMANY
| | - Nataša Puzović
- Max Planck Institute for Evolutionary Biology, Department of Evolutionary Genetics, August-Thienemann-Straße , Plön–GERMANY
| | - Julien Y. Dutheil
- Max Planck Institute for Evolutionary Biology, Department of Evolutionary Genetics, August-Thienemann-Straße , Plön–GERMANY
| |
Collapse
|
42
|
Castellano D, Macià MC, Tataru P, Bataillon T, Munch K. Comparison of the Full Distribution of Fitness Effects of New Amino Acid Mutations Across Great Apes. Genetics 2019; 213:953-966. [PMID: 31488516 PMCID: PMC6827385 DOI: 10.1534/genetics.119.302494] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 08/29/2019] [Indexed: 12/31/2022] Open
Abstract
The distribution of fitness effects (DFE) is central to many questions in evolutionary biology. However, little is known about the differences in DFE between closely related species. We use >9000 coding genes orthologous one-to-one across great apes, gibbons, and macaques to assess the stability of the DFE across great apes. We use the unfolded site frequency spectrum of polymorphic mutations (n = 8 haploid chromosomes per population) to estimate the DFE. We find that the shape of the deleterious DFE is strikingly similar across great apes. We confirm that effective population size (Ne ) is a strong predictor of the strength of negative selection, consistent with the nearly neutral theory. However, we also find that the strength of negative selection varies more than expected given the differences in Ne between species. Across species, mean fitness effects of new deleterious mutations covaries with Ne , consistent with positive epistasis among deleterious mutations. We find that the strength of negative selection for the smallest populations, bonobos and western chimpanzees, is higher than expected given their Ne This may result from a more efficient purging of strongly deleterious recessive variants in these populations. Forward simulations confirm that these findings are not artifacts of the way we are inferring Ne and DFE parameters. All findings are replicated using only GC-conservative mutations, thereby confirming that GC-biased gene conversion is not affecting our conclusions.
Collapse
Affiliation(s)
- David Castellano
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Moisès Coll Macià
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Paula Tataru
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Thomas Bataillon
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Kasper Munch
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| |
Collapse
|
43
|
Kawakami T, Wallberg A, Olsson A, Wintermantel D, de Miranda JR, Allsopp M, Rundlöf M, Webster MT. Substantial Heritable Variation in Recombination Rate on Multiple Scales in Honeybees and Bumblebees. Genetics 2019; 212:1101-1119. [PMID: 31152071 PMCID: PMC6707477 DOI: 10.1534/genetics.119.302008] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Accepted: 05/30/2019] [Indexed: 12/30/2022] Open
Abstract
Meiotic recombination shuffles genetic variation and promotes correct segregation of chromosomes. Rates of recombination vary on several scales, both within genomes and between individuals, and this variation is affected by both genetic and environmental factors. Social insects have extremely high rates of recombination, although the evolutionary causes of this are not known. Here, we estimate rates of crossovers and gene conversions in 22 colonies of the honeybee, Apis mellifera, and 9 colonies of the bumblebee, Bombus terrestris, using direct sequencing of 299 haploid drone offspring. We confirm that both species have extremely elevated crossover rates, with higher rates measured in the highly eusocial honeybee than the primitively social bumblebee. There are also significant differences in recombination rate between subspecies of honeybee. There is substantial variation in genome-wide recombination rate between individuals of both A. mellifera and B. terrestris and the distribution of these rates overlap between species. A large proportion of interindividual variation in recombination rate is heritable, which indicates the presence of variation in trans-acting factors that influence recombination genome-wide. We infer that levels of crossover interference are significantly lower in honeybees compared to bumblebees, which may be one mechanism that contributes to higher recombination rates in honeybees. We also find a significant increase in recombination rate with distance from the centromere, mirrored by methylation differences. We detect a strong transmission bias due to GC-biased gene conversion associated with noncrossover gene conversions. Our results shed light on the mechanistic causes of extreme rates of recombination in social insects and the genetic architecture of recombination rate variation.
Collapse
Affiliation(s)
- Takeshi Kawakami
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, 752 36, Sweden
- Department of Animal and Plant Sciences, University of Sheffield, S10 2TN, United Kingdom
| | - Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05. Sweden
| | - Anna Olsson
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05. Sweden
| | - Dimitry Wintermantel
- INRA, UE 1255 APIS, Le Magneraud, 17700 Surgères, France
- Centre d'Etudes Biologiques de Chizé, UMR 7372, CNRS and Université de La Rochelle, 79360 Villiers-en-Bois, France
| | - Joachim R de Miranda
- Department of Ecology, Swedish University of Agricultural Sciences, Uppsala 750 07, Sweden
| | - Mike Allsopp
- Plant Protection Research Institute, Agricultural Research Council, Stellenbosch, 7608, South Africa
| | - Maj Rundlöf
- Department of Biology, Lund University, 223 62, Sweden
| | - Matthew T Webster
- Department of Medical Biochemistry and Microbiology, Science for Life Laboratory, Uppsala University, 751 05. Sweden
| |
Collapse
|
44
|
Borges R, Szöllősi GJ, Kosiol C. Quantifying GC-Biased Gene Conversion in Great Ape Genomes Using Polymorphism-Aware Models. Genetics 2019; 212:1321-1336. [PMID: 31147380 PMCID: PMC6707462 DOI: 10.1534/genetics.119.302074] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Accepted: 05/20/2019] [Indexed: 11/18/2022] Open
Abstract
As multi-individual population-scale data become available, more complex modeling strategies are needed to quantify genome-wide patterns of nucleotide usage and associated mechanisms of evolution. Recently, the multivariate neutral Moran model was proposed. However, it was shown insufficient to explain the distribution of alleles in great apes. Here, we propose a new model that includes allelic selection. Our theoretical results constitute the basis of a new Bayesian framework to estimate mutation rates and selection coefficients from population data. We apply the new framework to a great ape dataset, where we found patterns of allelic selection that match those of genome-wide GC-biased gene conversion (gBGC). In particular, we show that great apes have patterns of allelic selection that vary in intensity-a feature that we correlated with great apes' distinct demographies. We also demonstrate that the AT/GC toggling effect decreases the probability of a substitution, promoting more polymorphisms in the base composition of great ape genomes. We further assess the impact of GC-bias in molecular analysis, and find that mutation rates and genetic distances are estimated under bias when gBGC is not properly accounted for. Our results contribute to the discussion on the tempo and mode of gBGC evolution, while stressing the need for gBGC-aware models in population genetics and phylogenetics.
Collapse
Affiliation(s)
- Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
| | - Gergely J Szöllősi
- Department of Biological Physics, MTA-ELTE "Lendulet" Evolutionary Genomics Research Group, Eötvös University, Pázmány P. stny. 1A, Budapest 1117, Hungary
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Wien, Wien, Austria
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
| |
Collapse
|
45
|
Davydov II, Salamin N, Robinson-Rechavi M. Large-Scale Comparative Analysis of Codon Models Accounting for Protein and Nucleotide Selection. Mol Biol Evol 2019; 36:1316-1332. [PMID: 30847475 PMCID: PMC6526913 DOI: 10.1093/molbev/msz048] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
There are numerous sources of variation in the rate of synonymous substitutions inside genes, such as direct selection on the nucleotide sequence, or mutation rate variation. Yet scans for positive selection rely on codon models which incorporate an assumption of effectively neutral synonymous substitution rate, constant between sites of each gene. Here we perform a large-scale comparison of approaches which incorporate codon substitution rate variation and propose our own simple yet effective modification of existing models. We find strong effects of substitution rate variation on positive selection inference. More than 70% of the genes detected by the classical branch-site model are presumably false positives caused by the incorrect assumption of uniform synonymous substitution rate. We propose a new model which is strongly favored by the data while remaining computationally tractable. With the new model we can capture signatures of nucleotide level selection acting on translation initiation and on splicing sites within the coding region. Finally, we show that rate variation is highest in the highly recombining regions, and we propose that recombination and mutation rate variation, such as high CpG mutation rate, are the two main sources of nucleotide rate variation. Although we detect fewer genes under positive selection in Drosophila than without rate variation, the genes which we detect contain a stronger signal of adaptation of dynein, which could be associated with Wolbachia infection. We provide software to perform positive selection analysis using the new model.
Collapse
Affiliation(s)
- Iakov I Davydov
- Department of Computational Biology, Biophore, University of Lausanne, Lausanne, Switzerland.,Department of Ecology and Evolution, Biophore, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Nicolas Salamin
- Department of Computational Biology, Biophore, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, Biophore, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
46
|
Booker TR, Keightley PD. Understanding the Factors That Shape Patterns of Nucleotide Diversity in the House Mouse Genome. Mol Biol Evol 2019; 35:2971-2988. [PMID: 30295866 PMCID: PMC6278861 DOI: 10.1093/molbev/msy188] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
A major goal of population genetics has been to determine the extent by which selection at linked sites influences patterns of neutral nucleotide diversity in the genome. Multiple lines of evidence suggest that diversity is influenced by both positive and negative selection. For example, in many species there are troughs in diversity surrounding functional genomic elements, consistent with the action of either background selection (BGS) or selective sweeps. In this study, we investigated the causes of the diversity troughs that are observed in the wild house mouse genome. Using the unfolded site frequency spectrum, we estimated the strength and frequencies of deleterious and advantageous mutations occurring in different functional elements in the genome. We then used these estimates to parameterize forward-in-time simulations of chromosomes, using realistic distributions of functional elements and recombination rate variation in order to determine whether selection at linked sites can explain the observed patterns of nucleotide diversity. The simulations suggest that BGS alone cannot explain the dips in diversity around either exons or conserved noncoding elements. A combination of BGS and selective sweeps produces deeper dips in diversity than BGS alone, but the inferred parameters of selection cannot fully explain the patterns observed in the genome. Our results provide evidence of sweeps shaping patterns of nucleotide diversity across the mouse genome and also suggest that infrequent, strongly advantageous mutations play an important role in this. The limitations of using the unfolded site frequency spectrum for inferring the frequency and effects of advantageous mutations are discussed.
Collapse
Affiliation(s)
- Tom R Booker
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom.,Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Peter D Keightley
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
| |
Collapse
|
47
|
Zhao D, Hamilton JP, Vaillancourt B, Zhang W, Eizenga GC, Cui Y, Jiang J, Buell CR, Jiang N. The unique epigenetic features of Pack-MULEs and their impact on chromosomal base composition and expression spectrum. Nucleic Acids Res 2019; 46:2380-2397. [PMID: 29365184 PMCID: PMC5861414 DOI: 10.1093/nar/gky025] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Accepted: 01/18/2018] [Indexed: 12/11/2022] Open
Abstract
Acquisition and rearrangement of host genes by transposable elements (TEs) is an important mechanism to increase gene diversity as exemplified by the ∼3000 Pack-Mutator-like TEs in the rice genome which have acquired gene sequences (Pack-MULEs), yet remain enigmatic. To identify signatures of functioning Pack-MULEs and Pack-MULE evolution, we generated transcriptome, translatome, and epigenome datasets and compared Pack-MULEs to genes and other TE families. Approximately 40% of Pack-MULEs were transcribed with 9% having translation evidence, clearly distinguishing them from other TEs. Pack-MULEs exhibited a unique expression profile associated with specificity in reproductive tissues that may be associated with seed traits. Expressed Pack-MULEs resemble regular protein-coding genes as exhibited by a low level of DNA methylation, association with active histone marks and DNase I hypersensitive sites, and an absence of repressive histone marks, suggesting that a substantial fraction of Pack-MULEs are potentially functional in vivo. Interestingly, the expression capacity of Pack-MULEs is independent of the local genomic environment, and the insertion and expression of Pack-MULEs may have altered the local chromosomal expression pattern as well as counteracted the impact of recombination on chromosomal base composition, which has profound consequences on the evolution of chromosome structure.
Collapse
Affiliation(s)
- Dongyan Zhao
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - John P Hamilton
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | | | - Wenli Zhang
- Department of Horticulture, University of Wisconsin, Madison, WI 53705, USA.,State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agriculture University, Nanjing, Jiangsu 210095, China
| | - Georgia C Eizenga
- USDA-ARS Dale Bumpers National Rice Research Center, 2890 Highway 130 East, Stuttgart, AR 72160, USA
| | - Yuehua Cui
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
| | - Jiming Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - C Robin Buell
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Program in Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
48
|
Laurin-Lemay S, Rodrigue N, Lartillot N, Philippe H. Conditional Approximate Bayesian Computation: A New Approach for Across-Site Dependency in High-Dimensional Mutation-Selection Models. Mol Biol Evol 2019; 35:2819-2834. [PMID: 30203003 DOI: 10.1093/molbev/msy173] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
A key question in molecular evolutionary biology concerns the relative roles of mutation and selection in shaping genomic data. Moreover, features of mutation and selection are heterogeneous along the genome and over time. Mechanistic codon substitution models based on the mutation-selection framework are promising approaches to separating these effects. In practice, however, several complications arise, since accounting for such heterogeneities often implies handling models of high dimensionality (e.g., amino acid preferences), or leads to across-site dependence (e.g., CpG hypermutability), making the likelihood function intractable. Approximate Bayesian Computation (ABC) could address this latter issue. Here, we propose a new approach, named Conditional ABC (CABC), which combines the sampling efficiency of MCMC and the flexibility of ABC. To illustrate the potential of the CABC approach, we apply it to the study of mammalian CpG hypermutability based on a new mutation-level parameter implying dependence across adjacent sites, combined with site-specific purifying selection on amino-acids captured by a Dirichlet process. Our proof-of-concept of the CABC methodology opens new modeling perspectives. Our application of the method reveals a high level of heterogeneity of CpG hypermutability across loci and mild heterogeneity across taxonomic groups; and finally, we show that CpG hypermutability is an important evolutionary factor in rendering relative synonymous codon usage. All source code is available as a GitHub repository (https://github.com/Simonll/LikelihoodFreePhylogenetics.git).
Collapse
Affiliation(s)
- Simon Laurin-Lemay
- Robert-Cedergren Center for Bioinformatics and Genomics, Department of Biochemistry and Molecular Medicine, Faculty of Medicine, Université de Montréal, Montréal, QC, Canada
| | - Nicolas Rodrigue
- Department of Biology, Institute of Biochemistry, and School of Mathematics and Statistics, Carleton University, Ottawa, ON, Canada
| | - Nicolas Lartillot
- Laboratoire de Biométrie et Biologie Évolutive, UMR CNRS 5558, Université Lyon 1, Lyon, France
| | - Hervé Philippe
- Robert-Cedergren Center for Bioinformatics and Genomics, Department of Biochemistry and Molecular Medicine, Faculty of Medicine, Université de Montréal, Montréal, QC, Canada.,Centre de Théorisation et de Modélisation de la Biodiversité, Station d'Écologie Théorique et Expérimentale, UMR CNRS 5321, Moulis, France
| |
Collapse
|
49
|
Zeng K, Jackson BC, Barton HJ. Methods for Estimating Demography and Detecting Between-Locus Differences in the Effective Population Size and Mutation Rate. Mol Biol Evol 2019; 36:423-433. [PMID: 30428070 PMCID: PMC6409433 DOI: 10.1093/molbev/msy212] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
It is known that the effective population size (Ne) and the mutation rate (u) vary across the genome. Here, we show that ignoring this heterogeneity may lead to biased estimates of past demography. To solve the problem, we develop new methods for jointly inferring past changes in population size and detecting variation in Ne and u between loci. These methods rely on either polymorphism data alone or both polymorphism and divergence data. In addition to inferring demography, we can use the methods to study a variety of questions: 1) comparing sex chromosomes with autosomes (for finding evidence for male-driven evolution, an unequal sex ratio, or sex-biased demographic changes) and 2) analyzing multilocus data from within autosomes or sex chromosomes (for studying determinants of variability in Ne and u). Simulations suggest that the methods can provide accurate parameter estimates and have substantial statistical power for detecting difference in Ne and u. As an example, we use the methods to analyze a polymorphism data set from Drosophila simulans. We find clear evidence for rapid population expansion. The results also indicate that the autosomes have a higher mutation rate than the X chromosome and that the sex ratio is probably female-biased. The new methods have been implemented in a user-friendly package.
Collapse
Affiliation(s)
- Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Benjamin C Jackson
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|
50
|
Bast J, Parker DJ, Dumas Z, Jalvingh KM, Tran Van P, Jaron KS, Figuet E, Brandt A, Galtier N, Schwander T. Consequences of Asexuality in Natural Populations: Insights from Stick Insects. Mol Biol Evol 2019; 35:1668-1677. [PMID: 29659991 PMCID: PMC5995167 DOI: 10.1093/molbev/msy058] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Recombination is a fundamental process with significant impacts on genome evolution. Predicted consequences of the loss of recombination include a reduced effectiveness of selection, changes in the amount of neutral polymorphisms segregating in populations, and an arrest of GC-biased gene conversion. Although these consequences are empirically well documented for nonrecombining genome portions, it remains largely unknown if they extend to the whole genome scale in asexual organisms. We identify the consequences of asexuality using de novo transcriptomes of five independently derived, obligately asexual lineages of stick insects, and their sexual sister-species. We find strong evidence for higher rates of deleterious mutation accumulation, lower levels of segregating polymorphisms and arrested GC-biased gene conversion in asexuals as compared with sexuals. Taken together, our study conclusively shows that predicted consequences of genome evolution under asexuality can indeed be found in natural populations.
Collapse
Affiliation(s)
- Jens Bast
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Darren J Parker
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Zoé Dumas
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Kirsten M Jalvingh
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Patrick Tran Van
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Kamil S Jaron
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Emeric Figuet
- Institute of Evolutionary Sciences, University of Montpellier, CNRS, IRD, EPHE, Montpellier, France
| | - Alexander Brandt
- Johann-Friedrich-Blumenbach Institute of Zoology and Anthropology, University of Goettingen, Goettingen, Germany
| | - Nicolas Galtier
- Institute of Evolutionary Sciences, University of Montpellier, CNRS, IRD, EPHE, Montpellier, France
| | - Tanja Schwander
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|