1
|
Calò L, Gentile F, Baio E, Raschellà C, Capelli C, Marino A. Variability and forensic efficiency of 12 X-STR markers in Namibian populations. Int J Legal Med 2024:10.1007/s00414-024-03299-9. [PMID: 39105780 DOI: 10.1007/s00414-024-03299-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 07/16/2024] [Indexed: 08/07/2024]
Abstract
STR loci localized on the X chromosome provide information additional to the autosomal markers routinely analyzed in forensic genetics, integrating genetic systems as Y-STRs and mitochondrial DNA in the investigation of complex kinship scenarios and mass disaster cases.In this study we genotyped 12 X-STR loci in 251 male samples from four populations of Namibia in southern Africa using the Investigator Argus X-12 kit (Qiagen, Hilden, Germany). Forensic efficiency parameters indicated high power of discrimination in the considered populations. As part of our investigation, we highlighted partial linkage associations between loci within known linkage groups (LGs) and identified several occurrences of previously unreported out-of-ladder (OL) alleles.Genetic distances between the Namibian populations here investigated and other African (Eritrea, Ethiopia, Somalia, Guinea, Cape Verde) and non-African (Germany, China, Philippines) populations using loci grouped in LGs mirrored their biogeographical distribution differently for each linkage group. Haplotype sharing within each LG revealed a high degree of population-specific types, hinting to the potential of these markers for ancestry applications.These results highlight the importance to produce specific and freely available population databases especially for multi-ethnic countries. This novel dataset is expected to be of interest for population studies that need an accessible reference dataset of African regions not currently well represented, as well as possible relevance for forensic applications focusing on the biogeographic origin of samples.
Collapse
Affiliation(s)
- Luca Calò
- Reparto Carabinieri Investigazioni Scientifiche di Parma - Sezione Biologia, Parma, Italy
| | - Fabiano Gentile
- Reparto Carabinieri Investigazioni Scientifiche di Parma - Sezione Biologia, Parma, Italy
| | - Elisa Baio
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parma, Italy
| | | | - Cristian Capelli
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Parma, Italy.
| | - Alberto Marino
- Reparto Carabinieri Investigazioni Scientifiche di Parma - Sezione Biologia, Parma, Italy.
| |
Collapse
|
2
|
Lynch M, Ye Z, Urban L, Maruki T, Wei W. The Linkage-Disequilibrium and Recombinational Landscape in Daphnia pulex. Genome Biol Evol 2022; 14:evac145. [PMID: 36170345 PMCID: PMC9642108 DOI: 10.1093/gbe/evac145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/20/2022] [Indexed: 11/24/2022] Open
Abstract
By revealing the influence of recombinational activity beyond what can be achieved with controlled crosses, measures of linkage disequilibrium (LD) in natural populations provide a powerful means of defining the recombinational landscape within which genes evolve. In one of the most comprehensive studies of this sort ever performed, involving whole-genome analyses on nearly 1,000 individuals of the cyclically parthenogenetic microcrustacean Daphnia pulex, the data suggest a relatively uniform pattern of recombination across the genome. Patterns of LD are quite consistent among populations; average rates of recombination are quite similar for all chromosomes; and although some chromosomal regions have elevated recombination rates, the degree of inflation is not large, and the overall spatial pattern of recombination is close to the random expectation. Contrary to expectations for models in which crossing-over is the primary mechanism of recombination, and consistent with data for other species, the distance-dependent pattern of LD indicates excessively high levels at both short and long distances and unexpectedly low levels of decay at long distances, suggesting significant roles for factors such as nonindependent mutation, population subdivision, and recombination mechanisms unassociated with crossing over. These observations raise issues regarding the classical LD equilibrium model widely applied in population genetics to infer recombination rates across various length scales on chromosomes.
Collapse
Affiliation(s)
- Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Zhiqiang Ye
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Lina Urban
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Takahiro Maruki
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Wen Wei
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
3
|
Setter D, Ebdon S, Jackson B, Lohse K. Estimating the rates of crossover and gene conversion from individual genomes. Genetics 2022; 222:iyac100. [PMID: 35771626 PMCID: PMC9434185 DOI: 10.1093/genetics/iyac100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 06/01/2022] [Indexed: 11/14/2022] Open
Abstract
Recombination can occur either as a result of crossover or gene conversion events. Population genetic methods for inferring the rate of recombination from patterns of linkage disequilibrium generally assume a simple model of recombination that only involves crossover events and ignore gene conversion. However, distinguishing the 2 processes is not only necessary for a complete description of recombination, but also essential for understanding the evolutionary consequences of inversions and other genomic partitions in which crossover (but not gene conversion) is reduced. We present heRho, a simple composite likelihood scheme for coestimating the rate of crossover and gene conversion from individual diploid genomes. The method is based on analytic results for the distance-dependent probability of heterozygous and homozygous states at 2 loci. We apply heRho to simulations and data from the house mouse Mus musculus castaneus, a well-studied model. Our analyses show (1) that the rates of crossover and gene conversion can be accurately coestimated at the level of individual chromosomes and (2) that previous estimates of the population scaled rate of recombination ρ=4Ner under a pure crossover model are likely biased.
Collapse
Affiliation(s)
- Derek Setter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Sam Ebdon
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Ben Jackson
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Konrad Lohse
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
4
|
Wall JD, Robinson JA, Cox LA. High-Resolution Estimates of Crossover and Noncrossover Recombination from a Captive Baboon Colony. Genome Biol Evol 2022; 14:evac040. [PMID: 35325119 PMCID: PMC9048888 DOI: 10.1093/gbe/evac040] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/02/2022] [Indexed: 11/17/2022] Open
Abstract
Homologous recombination has been extensively studied in humans and a handful of model organisms. Much less is known about recombination in other species, including nonhuman primates. Here, we present a study of crossovers (COs) and noncrossover (NCO) recombination in olive baboons (Papio anubis) from two pedigrees containing a total of 20 paternal and 17 maternal meioses, and compare these results to linkage disequilibrium (LD) based recombination estimates from 36 unrelated olive baboons. We demonstrate how COs, combined with LD-based recombination estimates, can be used to identify genome assembly errors. We also quantify sex-specific differences in recombination rates, including elevated male CO and reduced female CO rates near telomeres. Finally, we add to the increasing body of evidence suggesting that while most NCO recombination tracts in mammals are short (e.g., <500 bp), there is a non-negligible fraction of longer (e.g., >1 kb) NCO tracts. For NCO tracts shorter than 10 kb, we fit a mixture of two (truncated) geometric distributions model to the NCO tract length distribution and estimate that >99% of all NCO tracts are very short (mean 24 bp), but the remaining tracts can be quite long (mean 4.3 kb). A single geometric distribution model for NCO tract lengths is incompatible with the data, suggesting that LD-based methods for estimating NCO recombination rates that make this assumption may need to be modified.
Collapse
Affiliation(s)
- Jeffrey D. Wall
- Institute for Human Genetics, University of California San Francisco, USA
| | | | - Laura A. Cox
- Center for Precision Medicine, Department of Internal Medicine, Wake Forest School of Medicine, Winston-Salem, USA
| |
Collapse
|
5
|
Johri P, Charlesworth B, Howell EK, Lynch M, Jensen JD. Revisiting the notion of deleterious sweeps. Genetics 2021; 219:iyab094. [PMID: 34125884 PMCID: PMC9101445 DOI: 10.1093/genetics/iyab094] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Accepted: 06/08/2021] [Indexed: 11/14/2022] Open
Abstract
It has previously been shown that, conditional on its fixation, the time to fixation of a semi-dominant deleterious autosomal mutation in a randomly mating population is the same as that of an advantageous mutation. This result implies that deleterious mutations could generate selective sweep-like effects. Although their fixation probabilities greatly differ, the much larger input of deleterious relative to beneficial mutations suggests that this phenomenon could be important. We here examine how the fixation of mildly deleterious mutations affects levels and patterns of polymorphism at linked sites-both in the presence and absence of interference amongst deleterious mutations-and how this class of sites may contribute to divergence between-populations and species. We find that, while deleterious fixations are unlikely to represent a significant proportion of outliers in polymorphism-based genomic scans within populations, minor shifts in the frequencies of deleterious mutations can influence the proportions of private variants and the value of FST after a recent population split. As sites subject to deleterious mutations are necessarily found in functional genomic regions, interpretations in terms of recurrent positive selection may require reconsideration.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Emma K Howell
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Michael Lynch
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
- Center for Mechanisms of Evolution, The Biodesign Institute, Arizona State University, Tempe, AZ 85287, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
6
|
Charlesworth B. How Good Are Predictions of the Effects of Selective Sweeps on Levels of Neutral Diversity? Genetics 2020; 216:1217-1238. [PMID: 33106248 PMCID: PMC7768247 DOI: 10.1534/genetics.120.303734] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 10/22/2020] [Indexed: 11/18/2022] Open
Abstract
Selective sweeps are thought to play a significant role in shaping patterns of variability across genomes; accurate predictions of their effects are, therefore, important for understanding these patterns. A commonly used model of selective sweeps assumes that alleles sampled at the end of a sweep, and that fail to recombine with wild-type haplotypes during the sweep, coalesce instantaneously, leading to a simple expression for sweep effects on diversity. It is shown here that there can be a significant probability that a pair of alleles sampled at the end of a sweep coalesce during the sweep before a recombination event can occur, reducing their expected coalescent time below that given by the simple approximation. Expressions are derived for the expected reductions in pairwise neutral diversities caused by both single and recurrent sweeps in the presence of such within-sweep coalescence, although the effects of multiple recombination events during a sweep are only treated heuristically. The accuracies of the resulting expressions were checked against the results of simulations. For even moderate ratios of the recombination rate to the selection coefficient, the simple approximation can be substantially inaccurate. The selection model used here can be applied to favorable mutations with arbitrary dominance coefficients, to sex-linked loci with sex-specific selection coefficients, and to inbreeding populations. Using the results from this model, the expected differences between the levels of variability on X chromosomes and autosomes with selection at linked sites are discussed, and compared with data on a population of Drosophila melanogaster.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| |
Collapse
|
7
|
Pizarro M, Landi V, Navas F, León J, Martínez A, Fernández J, Delgado J. Nonparametric analysis of casein complex genes' epistasis and their effects on phenotypic expression of milk yield and composition in Murciano-Granadina goats. J Dairy Sci 2020; 103:8274-8291. [DOI: 10.3168/jds.2019-17833] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Accepted: 04/07/2020] [Indexed: 01/17/2023]
|
8
|
Na R, Ni W, E G, Zeng Y, Han Y, Huang Y. SNP screening of the MSTN gene and correlation analysis between genetic polymorphisms and growth traits in Dazu black goat. Anim Biotechnol 2020; 32:558-565. [PMID: 32633187 DOI: 10.1080/10495398.2020.1727915] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
The aim of this study was to detect SNPs in myostatin (MSTN) gene of four goat breeds, and analyze the correlation of these markers on body measurement traits in the Dazu black goat breed. In total, twenty polymorphic sites were found in one hundred forty-eight individuals, and all SNPs were distributed in introns 1 and 2, except g. 425 C > T, which was found in the regulatory region. Three SNPs (g. 2732 C > T, g. 2752 G > A and g. 4552 A > C) were polymorphic in all four breeds. None of the tag SNPs (g. 425 C > T, g. 1583 A > G, 2732 C > T, g. 4552 A > C and g. 5167 C > T) were significantly correlated with body measurement traits (p > 0.05) in the Dazu black goat. However, individuals with genotype combination 3 (GtC 3) of tag SNPs had higher birth weight and weaning weight than individuals with the other genotype combinations (p < 0.05). Moreover, the genotype combination 4 (GtC 4) was significantly associated with body length and height at the age of 2 months (p < 0.05), and genotype combination 13 (GtC 13) was significantly correlated with body height at 6 months (p < 0.05). Briefly, the combined tag SNP markers might be useful for goat marker-associated selective breeding.
Collapse
Affiliation(s)
- Risu Na
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Weiwei Ni
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Guangxin E
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Yan Zeng
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Yanguo Han
- College of Animal Science and Technology, Southwest University, Chongqing, China
| | - Yongfu Huang
- College of Animal Science and Technology, Southwest University, Chongqing, China
| |
Collapse
|
9
|
Footprints of natural selection at the mannose-6-phosphate isomerase locus in barnacles. Proc Natl Acad Sci U S A 2020; 117:5376-5385. [PMID: 32098846 PMCID: PMC7071928 DOI: 10.1073/pnas.1918232117] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The rocky intertidal is a natural laboratory to study how natural selection acts on the genes and proteins responsible for organismal survival and reproduction. Alternative forms of enzymes that differ across the intertidal have been known for decades and have provided examples of selection, but the genetic basis of such enzyme variation is known in only a few cases. In this paper, we present molecular evidence of natural selection at the Mpi gene, a key enzyme in energy metabolism that alters survival of barnacles living across the stress gradient imposed by the intertidal. Our study demonstrates how natural selection can facilitate survival in highly heterogeneous environments through the maintenance of multiple molecular solutions to ecological stresses. The mannose-6-phosphate isomerase (Mpi) locus in Semibalanus balanoides has been studied as a candidate gene for balancing selection for more than two decades. Previous work has shown that Mpi allozyme genotypes (fast and slow) have different frequencies across Atlantic intertidal zones due to selection on postsettlement survival (i.e., allele zonation). We present the complete gene sequence of the Mpi locus and quantify nucleotide polymorphism in S. balanoides, as well as divergence to its sister taxon Semibalanus cariosus. We show that the slow allozyme contains a derived charge-altering amino acid polymorphism, and both allozyme classes correspond to two haplogroups with multiple internal haplotypes. The locus shows several footprints of balancing selection around the fast/slow site: an enrichment of positive Tajima’s D for nonsynonymous mutations, an excess of polymorphism, and a spike in the levels of silent polymorphism relative to silent divergence, as well as a site frequency spectrum enriched for midfrequency mutations. We observe other departures from neutrality across the locus in both coding and noncoding regions. These include a nonsynonymous trans-species polymorphism and a recent mutation under selection within the fast haplogroup. The latter suggests ongoing allelic replacement of functionally relevant amino acid variants. Moreover, predicted models of Mpi protein structure provide insight into the functional significance of the putatively selected amino acid polymorphisms. While footprints of selection are widespread across the range of S. balanoides, our data show that intertidal zonation patterns are variable across both spatial and temporal scales. These data provide further evidence for heterogeneous selection on Mpi.
Collapse
|
10
|
Belsare S, Levy-Sakin M, Mostovoy Y, Durinck S, Chaudhuri S, Xiao M, Peterson AS, Kwok PY, Seshagiri S, Wall JD. Evaluating the quality of the 1000 genomes project data. BMC Genomics 2019; 20:620. [PMID: 31416423 PMCID: PMC6696682 DOI: 10.1186/s12864-019-5957-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2018] [Accepted: 07/04/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Data from the 1000 Genomes project is quite often used as a reference for human genomic analysis. However, its accuracy needs to be assessed to understand the quality of predictions made using this reference. We present here an assessment of the genotyping, phasing, and imputation accuracy data in the 1000 Genomes project. We compare the phased haplotype calls from the 1000 Genomes project to experimentally phased haplotypes for 28 of the same individuals sequenced using the 10X Genomics platform. RESULTS We observe that phasing and imputation for rare variants are unreliable, which likely reflects the limited sample size of the 1000 Genomes project data. Further, it appears that using a population specific reference panel does not improve the accuracy of imputation over using the entire 1000 Genomes data set as a reference panel. We also note that the error rates and trends depend on the choice of definition of error, and hence any error reporting needs to take these definitions into account. CONCLUSIONS The quality of the 1000 Genomes data needs to be considered while using this database for further studies. This work presents an analysis that can be used for these assessments.
Collapse
Affiliation(s)
- Saurabh Belsare
- Institute for Human Genetics, University of California, San Francisco, CA, 94143, USA.
| | - Michal Levy-Sakin
- Department of Dermatology, University of California, San Francisco, CA, 94143, USA
| | - Yulia Mostovoy
- Department of Dermatology, University of California, San Francisco, CA, 94143, USA
| | - Steffen Durinck
- Department of Molecular Biology, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Subhra Chaudhuri
- Department of Molecular Biology, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Ming Xiao
- School of Biomedical Science, Engineering, and Health Systems, Drexel University, Philadelphia, PA, 19104, USA
| | - Andrew S Peterson
- Department of Molecular Biology, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Pui-Yan Kwok
- Institute for Human Genetics, University of California, San Francisco, CA, 94143, USA
- Department of Dermatology, University of California, San Francisco, CA, 94143, USA
- Cardiovascular Research Institute, San Francisco, CA, 94143, USA
| | - Somasekar Seshagiri
- Department of Molecular Biology, Genentech Inc, 1 DNA Way, South San Francisco, CA, 94080, USA
| | - Jeffrey D Wall
- Institute for Human Genetics, University of California, San Francisco, CA, 94143, USA.
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, 94143, USA.
| |
Collapse
|
11
|
The Effects on Neutral Variability of Recurrent Selective Sweeps and Background Selection. Genetics 2019; 212:287-303. [PMID: 30923166 DOI: 10.1534/genetics.119.301951] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 03/19/2019] [Indexed: 12/11/2022] Open
Abstract
Levels of variability and rates of adaptive evolution may be affected by hitchhiking, the effect of selection on evolution at linked sites. Hitchhiking can be caused either by "selective sweeps" or by background selection, involving the spread of new favorable alleles or the elimination of deleterious mutations, respectively. Recent analyses of population genomic data have fitted models where both these processes act simultaneously, to infer the parameters of selection. Here, we investigate the consequences of relaxing a key assumption of some of these studies, that the time occupied by a selective sweep is negligible compared with the neutral coalescent time. We derive a new expression for the expected level of neutral variability in the presence of recurrent selective sweeps and background selection. We also derive approximate integral expressions for the effects of recurrent selective sweeps. The accuracy of the theoretical predictions was tested against multilocus simulations, with selection, recombination, and mutation parameters that are realistic for Drosophila melanogaster In the presence of crossing over, there is approximate agreement between the theoretical and simulation results. We show that the observed relationships between the rate of crossing over, and the level of synonymous site diversity and rate of adaptive evolution in Drosophila are probably mainly caused by background selection, whereas selective sweeps and population size changes are needed to produce the observed distortions of the site frequency spectrum.
Collapse
|
12
|
Dutta R, Saha-Mandal A, Cheng X, Qiu S, Serpen J, Fedorova L, Fedorov A. 1000 human genomes carry widespread signatures of GC biased gene conversion. BMC Genomics 2018; 19:256. [PMID: 29661137 PMCID: PMC5902838 DOI: 10.1186/s12864-018-4593-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Accepted: 03/12/2018] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND GC-Biased Gene Conversion (gBGC) is one of the important theories put forward to explain profound long-range non-randomness in nucleotide compositions along mammalian chromosomes. Nucleotide changes due to gBGC are hard to distinguish from regular mutations. Here, we present an algorithm for analysis of millions of known SNPs that detects a subset of so-called "SNP flip-over" events representing recent gBGC nucleotide changes, which occurred in previous generations via non-crossover meiotic recombination. RESULTS This algorithm has been applied in a large-scale analysis of 1092 sequenced human genomes. Altogether, 56,328 regions on all autosomes have been examined, which revealed 223,955 putative gBGC cases leading to SNP flip-overs. We detected a strong bias (11.7% ± 0.2% excess) in AT- > GC over GC- > AT base pair changes within the entire set of putative gBGC cases. CONCLUSIONS On average, a human gamete acquires 7 SNP flip-over events, in which one allele is replaced by its complementary allele during the process of meiotic non-crossover recombination. In each meiosis event, on average, gBGC results in replacement of 7 AT base pairs by GC base pairs, while only 6 GC pairs are replaced by AT pairs. Therefore, every human gamete is enriched by one GC pair. Happening over millions of years of evolution, this bias may be a noticeable force in changing the nucleotide composition landscape along chromosomes.
Collapse
Affiliation(s)
- Rajib Dutta
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Present Address: Center for Cardiovascular and Pulmonary Research, Nationwide Children’s Hospital, 700 Children’s Dr, Columbus, OH USA
| | - Arnab Saha-Mandal
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Present Address: Biochemistry and Molecular Biology Graduate Program, Cumming School of Medicine, University of Calgary, Calgary, AB T2N4N1 Canada
| | - Xi Cheng
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
| | - Shuhao Qiu
- Program in Biomedical Sciences, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
| | - Jasmine Serpen
- SURF Program, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- College of Arts and Sciences, Washington University in St. Louis, 1 Brookings Dr, St. Louis, MO 63130 USA
| | | | - Alexei Fedorov
- Department of Medicine, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614 USA
| |
Collapse
|
13
|
Fagundes NJR, Tagliani-Ribeiro A, Rubicz R, Tarskaia L, Crawford MH, Salzano FM, Bonatto SL. How strong was the bottleneck associated to the peopling of the Americas? New insights from multilocus sequence data. Genet Mol Biol 2018; 41:206-214. [PMID: 29668018 PMCID: PMC5913727 DOI: 10.1590/1678-4685-gmb-2017-0087] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Accepted: 05/23/2017] [Indexed: 12/31/2022] Open
Abstract
In spite of many genetic studies that contributed for a deep knowledge about the peopling of the Americas, no consensus has emerged about important parameters such as the effective size of the Native Americans founder population. Previous estimates based on genomic datasets may have been biased by the use of admixed individuals from Latino populations, while other recent studies using samples from Native American individuals relied on approximated analytical approaches. In this study we use resequencing data for nine independent regions in a set of Native American and Siberian individuals and a full-likelihood approach based on isolation-with-migration scenarios accounting for recent flow between Asian and Native American populations. Our results suggest that, in agreement with previous studies, the effective size of the Native American population was small, most likely in the order of a few hundred individuals, with point estimates close to 250 individuals, even though credible intervals include a number as large as ~4,000 individuals. Recognizing the size of the genetic bottleneck during the peopling of the Americas is important for determining the extent of genetic markers needed to characterize Native American populations in genome-wide studies and to evaluate the adaptive potential of genetic variants in this population.
Collapse
Affiliation(s)
- Nelson J R Fagundes
- Departamento de Genética, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Alice Tagliani-Ribeiro
- Fertilitat Centro de Medicina Reprodutiva, Centro Clínico da Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, RS, Brazil
| | - Rohina Rubicz
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Larissa Tarskaia
- Laboratory of Biological Anthropology, University of Kansas, Lawrence, KS, USA
| | - Michael H Crawford
- Laboratory of Biological Anthropology, University of Kansas, Lawrence, KS, USA
| | - Francisco M Salzano
- Departamento de Genética, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - Sandro L Bonatto
- Faculdade de Biociências, Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Porto Alegre, RS, Brazil
| |
Collapse
|
14
|
Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories. G3-GENES GENOMES GENETICS 2017; 7:3605-3620. [PMID: 28893846 PMCID: PMC5677151 DOI: 10.1534/g3.117.300259] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Inference of demographic history from genetic data is a primary goal of population genetics of model and nonmodel organisms. Whole genome-based approaches such as the pairwise/multiple sequentially Markovian coalescent methods use genomic data from one to four individuals to infer the demographic history of an entire population, while site frequency spectrum (SFS)-based methods use the distribution of allele frequencies in a sample to reconstruct the same historical events. Although both methods are extensively used in empirical studies and perform well on data simulated under simple models, there have been only limited comparisons of them in more complex and realistic settings. Here we use published demographic models based on data from three human populations (Yoruba, descendants of northwest-Europeans, and Han Chinese) as an empirical test case to study the behavior of both inference procedures. We find that several of the demographic histories inferred by the whole genome-based methods do not predict the genome-wide distribution of heterozygosity, nor do they predict the empirical SFS. However, using simulated data, we also find that the whole genome methods can reconstruct the complex demographic models inferred by SFS-based methods, suggesting that the discordant patterns of genetic variation are not attributable to a lack of statistical power, but may reflect unmodeled complexities in the underlying demography. More generally, our findings indicate that demographic inference from a small number of genomes, routine in genomic studies of nonmodel organisms, should be interpreted cautiously, as these models cannot recapitulate other summaries of the data.
Collapse
|
15
|
Jennings WB. On the independent gene trees assumption in phylogenomic studies. Mol Ecol 2017; 26:4862-4871. [PMID: 28752599 DOI: 10.1111/mec.14274] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Revised: 07/13/2017] [Accepted: 07/24/2017] [Indexed: 11/28/2022]
Abstract
Multilocus coalescent methods for inferring species trees or historical demographic parameters typically require the assumption that gene trees for sampled SNPs or DNA sequence loci are conditionally independent given their species tree. In practice, researchers have used different criteria to delimit "independent loci." One criterion identifies sampled loci as being independent of each other if they undergo Mendelian independent assortment (IA criterion). O'Neill et al. (2013, Molecular Ecology, 22, 111-129) used this approach in their phylogeographic study of North American tiger salamander species complex. In two other studies, researchers developed a pair of related methods that employ an independent genealogies criterion (IG criterion), which considers the effects of population-level recombination on correlations between the gene trees of intrachromosomal loci. Here, I explain these three methods, illustrate their use with example data, and evaluate their efficacies. I show that the IA approach is more conservative, is simpler to use and requires fewer assumptions than the IG approaches. However, IG approaches can identify much larger numbers of independent loci than the IA method, which, in turn, allows researchers to obtain more precise and accurate estimates of species trees and historical demographic parameters. A disadvantage of the IG methods is that they require an estimate of the population recombination rate. Despite their drawbacks, IA and IG approaches provide molecular ecologists with promising a priori methods for selecting SNPs or DNA sequence loci that likely meet the independence assumption in coalescent-based phylogenomic studies.
Collapse
Affiliation(s)
- W Bryan Jennings
- Departamento de Vertebrados, Museu Nacional, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| |
Collapse
|
16
|
Silva-Junior OB, Grattapaglia D. Genome-wide patterns of recombination, linkage disequilibrium and nucleotide diversity from pooled resequencing and single nucleotide polymorphism genotyping unlock the evolutionary history of Eucalyptus grandis. THE NEW PHYTOLOGIST 2015; 208:830-45. [PMID: 26079595 DOI: 10.1111/nph.13505] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Accepted: 05/06/2015] [Indexed: 05/03/2023]
Abstract
We used high-density single nucleotide polymorphism (SNP) data and whole-genome pooled resequencing to examine the landscape of population recombination (ρ) and nucleotide diversity (ϴw ), assess the extent of linkage disequilibrium (r(2) ) and build the highest density linkage maps for Eucalyptus. At the genome-wide level, linkage disequilibrium (LD) decayed within c. 4-6 kb, slower than previously reported from candidate gene studies, but showing considerable variation from absence to complete LD up to 50 kb. A sharp decrease in the estimate of ρ was seen when going from short to genome-wide inter-SNP distances, highlighting the dependence of this parameter on the scale of observation adopted. Recombination was correlated with nucleotide diversity, gene density and distance from the centromere, with hotspots of recombination enriched for genes involved in chemical reactions and pathways of the normal metabolic processes. The high nucleotide diversity (ϴw = 0.022) of E. grandis revealed that mutation is more important than recombination in shaping its genomic diversity (ρ/ϴw = 0.645). Chromosome-wide ancestral recombination graphs allowed us to date the split of E. grandis (1.7-4.8 million yr ago) and identify a scenario for the recent demographic history of the species. Our results have considerable practical importance to Genome Wide Association Studies (GWAS), while indicating bright prospects for genomic prediction of complex phenotypes in eucalypt breeding.
Collapse
Affiliation(s)
- Orzenil B Silva-Junior
- Laboratório de Genética Vegetal, EMBRAPA Recursos Genéticos e Biotecnologia, PqEB, Brasilia, 70770-970, DF, Brazil
- Laboratório de Bioinformática, EMBRAPA Recursos Genéticos e Biotecnologia, PqEB, Brasilia, DF, 70770-970, Brazil
- Programa de Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, SGAN 916, Brasilia, DF, 70790-160, Brazil
| | - Dario Grattapaglia
- Laboratório de Genética Vegetal, EMBRAPA Recursos Genéticos e Biotecnologia, PqEB, Brasilia, 70770-970, DF, Brazil
- Programa de Ciências Genômicas e Biotecnologia, Universidade Católica de Brasília, SGAN 916, Brasilia, DF, 70790-160, Brazil
| |
Collapse
|
17
|
Corbin LJ, Blott SC, Swinburne JE, Vaudin M, Bishop SC, Woolliams JA. Linkage disequilibrium and historical effective population size in the Thoroughbred horse. Anim Genet 2015; 41 Suppl 2:8-15. [PMID: 21070270 DOI: 10.1111/j.1365-2052.2010.02092.x] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Many genomic methodologies rely on the presence and extent of linkage disequilibrium (LD) between markers and genetic variants underlying traits of interest, but the extent of LD in the horse has yet to be comprehensively characterized. In this study, we evaluate the extent and decay of LD in a sample of 817 Thoroughbreds. Horses were genotyped for over 50,000 single nucleotide polymorphism (SNP) markers across the genome, with 34,848 autosomal SNPs used in the final analysis. Linkage disequilibrium, as measured by the squared correlation coefficient (r(2)), was found to be relatively high between closely linked markers (>0.6 at 5 kb) and to extend over long distances, with average r(2) maintained above non-syntenic levels for single nucleotide polymorphisms (SNPs) up to 20 Mb apart. Using formulae which relate expected LD to effective population size (N(e)), and assuming a constant actual population size, N(e) was estimated to be 100 in our population. Values of historical N(e), calculated assuming linear population growth, suggested a decrease in N(e) since the distant past, reaching a minimum twenty generations ago, followed by a subsequent increase until the present time. The qualitative trends observed in N(e) can be rationalized by current knowledge of the history of the Thoroughbred breed, and inbreeding statistics obtained from published pedigree analyses are in agreement with observed values of N(e). Given the high LD observed and the small estimated N(e), genomic methodologies such as genomic selection could feasibly be applied to this population using the existing SNP marker set.
Collapse
Affiliation(s)
- L J Corbin
- Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Roslin Biocentre, EH25 9PS, UK.
| | | | | | | | | | | |
Collapse
|
18
|
Williams AL, Genovese G, Dyer T, Altemose N, Truax K, Jun G, Patterson N, Myers SR, Curran JE, Duggirala R, Blangero J, Reich D, Przeworski M. Non-crossover gene conversions show strong GC bias and unexpected clustering in humans. eLife 2015; 4. [PMID: 25806687 PMCID: PMC4404656 DOI: 10.7554/elife.04637] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2014] [Accepted: 03/20/2015] [Indexed: 12/15/2022] Open
Abstract
Although the past decade has seen tremendous progress in our understanding of fine-scale recombination, little is known about non-crossover (NCO) gene conversion. We report the first genome-wide study of NCO events in humans. Using SNP array data from 98 meioses, we identified 103 sites affected by NCO, of which 50/52 were confirmed in sequence data. Overlap with double strand break (DSB) hotspots indicates that most of the events are likely of meiotic origin. We estimate that a site is involved in a NCO at a rate of 5.9 × 10(-6)/bp/generation, consistent with sperm-typing studies, and infer that tract lengths span at least an order of magnitude. Observed NCO events show strong allelic bias at heterozygous AT/GC SNPs, with 68% (58-78%) transmitting GC alleles (p = 5 × 10(-4)). Strikingly, in 4 of 15 regions with resequencing data, multiple disjoint NCO tracts cluster in close proximity (∼20-30 kb), a phenomenon not previously seen in mammals.
Collapse
Affiliation(s)
- Amy L Williams
- Department of Biological Sciences, Columbia University, New York, United States
| | - Giulio Genovese
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, United States
| | - Thomas Dyer
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
| | - Nicolas Altemose
- Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Katherine Truax
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
| | - Goo Jun
- Department of Biostatistics, University of Michigan, Ann Arbor, United States
| | - Nick Patterson
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, United States
| | - Simon R Myers
- Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Joanne E Curran
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
| | - Ravi Duggirala
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
| | - John Blangero
- Department of Genetics, Texas Biomedical Research Institute, San Antonio, United States
| | - David Reich
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, United States
| | - Molly Przeworski
- Department of Biological Sciences, Columbia University, New York, United States
| | | |
Collapse
|
19
|
Lachance J, Tishkoff SA. Biased gene conversion skews allele frequencies in human populations, increasing the disease burden of recessive alleles. Am J Hum Genet 2014; 95:408-20. [PMID: 25279983 PMCID: PMC4185123 DOI: 10.1016/j.ajhg.2014.09.008] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2014] [Revised: 08/21/2014] [Accepted: 09/10/2014] [Indexed: 10/25/2022] Open
Abstract
Gene conversion results in the nonreciprocal transfer of genetic information between two recombining sequences, and there is evidence that this process is biased toward G and C alleles. However, the strength of GC-biased gene conversion (gBGC) in human populations and its effects on hereditary disease have yet to be assessed on a genomic scale. Using high-coverage whole-genome sequences of African hunter-gatherers, agricultural populations, and primate outgroups, we quantified the effects of GC-biased gene conversion on population genomic data sets. We find that genetic distances (FST and population branch statistics) are modified by gBGC. In addition, the site frequency spectrum is left-shifted when ancestral alleles are favored by gBGC and right-shifted when derived alleles are favored by gBGC. Allele frequency shifts due to gBGC mimic the effects of natural selection. As expected, these effects are strongest in high-recombination regions of the human genome. By comparing the relative rates of fixation of unbiased and biased sites, the strength of gene conversion was estimated to be on the order of Nb ≈ 0.05 to 0.09. We also find that derived alleles favored by gBGC are much more likely to be homozygous than derived alleles at unbiased SNPs (+42.2% to 62.8%). This results in a curse of the converted, whereby gBGC causes substantial increases in hereditary disease risks. Taken together, our findings reveal that GC-biased gene conversion has important population genetic and public health implications.
Collapse
MESH Headings
- Bias
- Evolution, Molecular
- Gene Conversion
- Gene Frequency
- Genes, Recessive/genetics
- Genetic Diseases, Inborn/genetics
- Genetics, Population
- Genome, Human/genetics
- Humans
- Models, Genetic
- Models, Theoretical
- Polymorphism, Single Nucleotide/genetics
- Recombination, Genetic
- Selection, Genetic/genetics
Collapse
Affiliation(s)
- Joseph Lachance
- Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | - Sarah A Tishkoff
- Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
20
|
Dixit J, Arunyawat U, Huong NT, Das A. Multilocus nuclear DNA markers reveal population structure and demography of Anopheles minimus. Mol Ecol 2014; 23:5599-618. [PMID: 25266341 DOI: 10.1111/mec.12943] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2013] [Revised: 09/21/2014] [Accepted: 09/22/2014] [Indexed: 01/11/2023]
Abstract
Utilization of multiple putatively neutral DNA markers for inferring evolutionary history of species population is considered to be the most robust approach. Molecular population genetic studies have been conducted in many species of Anopheles genus, but studies based on single nucleotide polymorphism (SNP) data are still very scarce. Anopheles minimus is one of the principal malaria vectors of Southeast (SE) Asia including the Northeastern (NE) India. Although population genetic studies with mitochondrial genetic variation data have been utilized to infer phylogeography of the SE Asian populations of this species, limited information on the population structure and demography of Indian An. minimus is available. We herewith have developed multilocus nuclear genetic approach with SNP markers located in X chromosome of An. minimus in eight Indian and two SE Asian population samples (121 individual mosquitoes in total) to infer population history and test several hypotheses on the phylogeography of this species. While the Thai population sample of An. minimus presented the highest nucleotide diversity, majority of the Indian samples were also fairly diverse. In general, An. minimus populations were moderately substructured in the distribution range covering SE Asia and NE India, largely falling under three distinct genetic clusters. Moreover, demographic expansion events could be detected in the majority of the presently studied populations of An. minimus. Additional DNA sequencing of the mitochondrial COII region in a subset of the samples (40 individual mosquitoes) corroborated the existing hypothesis of Indian An. minimus falling under the earlier reported mitochondrial lineage B.
Collapse
Affiliation(s)
- Jyotsana Dixit
- Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, Sector-8, Dwarka, New Delhi, 110077, India
| | | | | | | |
Collapse
|
21
|
Bayesian inferences suggest that Amazon Yunga Natives diverged from Andeans less than 5000 ybp: implications for South American prehistory. BMC Evol Biol 2014; 14:174. [PMID: 25266366 PMCID: PMC4189748 DOI: 10.1186/s12862-014-0174-3] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Accepted: 07/24/2014] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Archaeology reports millenary cultural contacts between Peruvian Coast-Andes and the Amazon Yunga, a rainforest transitional region between Andes and Lower Amazonia. To clarify the relationships between cultural and biological evolution of these populations, in particular between Amazon Yungas and Andeans, we used DNA-sequence data, a model-based Bayesian approach and several statistical validations to infer a set of demographic parameters. RESULTS We found that the genetic diversity of the Shimaa (an Amazon Yunga population) is a subset of that of Quechuas from Central-Andes. Using the Isolation-with-Migration population genetics model, we inferred that the Shimaa ancestors were a small subgroup that split less than 5300 years ago (after the development of complex societies) from an ancestral Andean population. After the split, the most plausible scenario compatible with our results is that the ancestors of Shimaas moved toward the Peruvian Amazon Yunga and incorporated the culture and language of some of their neighbors, but not a substantial amount of their genes. We validated our results using Approximate Bayesian Computations, posterior predictive tests and the analysis of pseudo-observed datasets. CONCLUSIONS We presented a case study in which model-based Bayesian approaches, combined with necessary statistical validations, shed light into the prehistoric demographic relationship between Andeans and a population from the Amazon Yunga. Our results offer a testable model for the peopling of this large transitional environmental region between the Andes and the Lower Amazonia. However, studies on larger samples and involving more populations of these regions are necessary to confirm if the predominant Andean biological origin of the Shimaas is the rule, and not the exception.
Collapse
|
22
|
Yang J, Shikano T, Li MH, Merilä J. Genome-wide linkage disequilibrium in nine-spined stickleback populations. G3 (BETHESDA, MD.) 2014; 4:1919-29. [PMID: 25122668 PMCID: PMC4199698 DOI: 10.1534/g3.114.013334] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2014] [Accepted: 08/09/2014] [Indexed: 11/18/2022]
Abstract
Variation in the extent and magnitude of genome-wide linkage disequilibrium (LD) among populations residing in different habitats has seldom been studied in wild vertebrates. We used a total of 109 microsatellite markers to quantify the level and patterns of genome-wide LD in 13 Fennoscandian nine-spined stickleback (Pungitius pungitius) populations from four (viz. marine, lake, pond, and river) different habitat types. In general, high magnitude (D' > 0.5) of LD was found both in freshwater and marine populations, and the magnitude of LD was significantly greater in inland freshwater than in marine populations. Interestingly, three coastal freshwater populations located in close geographic proximity to the marine populations exhibited similar LD patterns and genetic diversity as their marine neighbors. The greater levels of LD in inland freshwater compared with marine and costal freshwater populations can be explained in terms of their contrasting demographic histories: founder events, long-term isolation, small effective sizes, and population bottlenecks are factors likely to have contributed to the high levels of LD in the inland freshwater populations. In general, these findings shed new light on the patterns and extent of variation in genome-wide LD, as well as the ecological and evolutionary factors driving them.
Collapse
Affiliation(s)
- Ji Yang
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences (CAS), Beijing 100101, China
| | - Takahito Shikano
- Ecological Genetics Research Unit, Department of Biosciences, FIN-00014 University of Helsinki, Finland
| | - Meng-Hua Li
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences (CAS), Beijing 100101, China
| | - Juha Merilä
- Ecological Genetics Research Unit, Department of Biosciences, FIN-00014 University of Helsinki, Finland
| |
Collapse
|
23
|
Abstract
The great ape families are the species most closely related to our own, comprising chimpanzees, bonobos, gorillas, and orangutans. They live exclusively in tropical rainforests in Central Africa and the islands of Southeast Asia. Due to their close evolutionary relationship with humans, great apes share many cognitive, physiological, and morphological similarities with humans. The members of the great ape family make obvious models to facilitate the further understanding about humans' biology and history. This review will discuss how the recent addition of genome-wide data from great apes has furthered humans' understanding of these species and humanity, especially in the realm of evolutionary genetics.
Collapse
|
24
|
Abstract
Although the analysis of linkage disequilibrium (LD) plays a central role in many areas of population genetics, the sampling variance of LD is known to be very large with high sensitivity to numbers of nucleotide sites and individuals sampled. Here we show that a genome-wide analysis of the distribution of heterozygous sites within a single diploid genome can yield highly informative patterns of LD as a function of physical distance. The proposed statistic, the correlation of zygosity, is closely related to the conventional population-level measure of LD, but is agnostic with respect to allele frequencies and hence likely less prone to outlier artifacts. Application of the method to several vertebrate species leads to the conclusion that >80% of recombination events are typically resolved by gene-conversion-like processes unaccompanied by crossovers, with the average lengths of conversion patches being on the order of one to several kilobases in length. Thus, contrary to common assumptions, the recombination rate between sites does not scale linearly with distance, often even up to distances of 100 kb. In addition, the amount of LD between sites separated by <200 bp is uniformly much greater than can be explained by the conventional neutral model, possibly because of the nonindependent origin of mutations within this spatial scale. These results raise questions about the application of conventional population-genetic interpretations to LD on short spatial scales and also about the use of spatial patterns of LD to infer demographic histories.
Collapse
|
25
|
Abstract
The "LD curve" relates the linkage disequilibrium (LD) between pairs of nucleotide sites to the distance that separates them along the chromosome. The shape of this curve reflects natural selection, admixture between populations, and the history of population size. This article derives new results about the last of these effects. When a population expands in size, the LD curve grows steeper, and this effect is especially pronounced following a bottleneck in population size. When a population shrinks, the LD curve rises but remains relatively flat. As LD converges toward a new equilibrium, its time path may not be monotonic. Following an episode of growth, for example, it declines to a low value before rising toward the new equilibrium. These changes happen at different rates for different LD statistics. They are especially slow for estimates of [Formula: see text], which therefore allow inferences about ancient population history. For the human population of Europe, these results suggest a history of population growth.
Collapse
|
26
|
Interplay of interlocus gene conversion and crossover in segmental duplications under a neutral scenario. G3-GENES GENOMES GENETICS 2014; 4:1479-89. [PMID: 24906640 PMCID: PMC4132178 DOI: 10.1534/g3.114.012435] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Interlocus gene conversion is a major evolutionary force that drives the concerted evolution of duplicated genomic regions. Theoretical models successfully have addressed the effects of interlocus gene conversion and the importance of crossover in the evolutionary fate of gene families and duplications but have not considered complex recombination scenarios, such as the presence of hotspots. To study the interplay between interlocus gene conversion and crossover, we have developed a forward-time simulator that allows the exploration of a wide range of interlocus gene conversion rates under different crossover models. Using it, we have analyzed patterns of nucleotide variation and linkage disequilibrium within and between duplicate regions, focusing on a neutral scenario with constant population size and validating our results with the existing theoretical models. We show that the interaction of gene conversion and crossover is nontrivial and that the location of crossover junctions is a fundamental determinant of levels of variation and linkage disequilibrium in duplicated regions. We also show that if crossover activity between duplications is strong enough, recurrent interlocus gene conversion events can break linkage disequilibrium within duplicates. Given the complex nature of interlocus gene conversion and crossover, we provide a framework to explore their interplay to help increase knowledge on molecular evolution within segmental duplications under more complex scenarios, such as demographic changes or natural selection.
Collapse
|
27
|
Genome-wide estimation of linkage disequilibrium from population-level high-throughput sequencing data. Genetics 2014; 197:1303-13. [PMID: 24875187 DOI: 10.1534/genetics.114.165514] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Rapidly improving sequencing technologies provide unprecedented opportunities for analyzing genome-wide patterns of polymorphisms. In particular, they have great potential for linkage-disequilibrium analyses on both global and local genetic scales, which will substantially improve our ability to derive evolutionary inferences. However, there are some difficulties with analyzing high-throughput sequencing data, including high error rates associated with base reads and complications from the random sampling of sequenced chromosomes in diploid organisms. To overcome these difficulties, we developed a maximum-likelihood estimator of linkage disequilibrium for use with error-prone sampling data. Computer simulations indicate that the estimator is nearly unbiased with a sampling variance at high coverage asymptotically approaching the value expected when all relevant information is accurately estimated. The estimator does not require phasing of haplotypes and enables the estimation of linkage disequilibrium even when all individual reads cover just single polymorphic sites.
Collapse
|
28
|
Pilot M, Greco C, vonHoldt BM, Jędrzejewska B, Randi E, Jędrzejewski W, Sidorovich VE, Ostrander EA, Wayne RK. Genome-wide signatures of population bottlenecks and diversifying selection in European wolves. Heredity (Edinb) 2013; 112:428-42. [PMID: 24346500 DOI: 10.1038/hdy.2013.122] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Revised: 10/14/2013] [Accepted: 10/16/2013] [Indexed: 12/21/2022] Open
Abstract
Genomic resources developed for domesticated species provide powerful tools for studying the evolutionary history of their wild relatives. Here we use 61K single-nucleotide polymorphisms (SNPs) evenly spaced throughout the canine nuclear genome to analyse evolutionary relationships among the three largest European populations of grey wolves in comparison with other populations worldwide, and investigate genome-wide effects of demographic bottlenecks and signatures of selection. European wolves have a discontinuous range, with large and connected populations in Eastern Europe and relatively smaller, isolated populations in Italy and the Iberian Peninsula. Our results suggest a continuous decline in wolf numbers in Europe since the Late Pleistocene, and long-term isolation and bottlenecks in the Italian and Iberian populations following their divergence from the Eastern European population. The Italian and Iberian populations have low genetic variability and high linkage disequilibrium, but relatively few autozygous segments across the genome. This last characteristic clearly distinguishes them from populations that underwent recent drastic demographic declines or founder events, and implies long-term bottlenecks in these two populations. Although genetic drift due to spatial isolation and bottlenecks seems to be a major evolutionary force diversifying the European populations, we detected 35 loci that are putatively under diversifying selection. Two of these loci flank the canine platelet-derived growth factor gene, which affects bone growth and may influence differences in body size between wolf populations. This study demonstrates the power of population genomics for identifying genetic signals of demographic bottlenecks and detecting signatures of directional selection in bottlenecked populations, despite their low background variability.
Collapse
Affiliation(s)
- M Pilot
- 1] School of Life Sciences, University of Lincoln, Lincoln, UK [2] Museum and Institute of Zoology, Polish Academy of Sciences, Warsaw, Poland
| | - C Greco
- Istituto Superiore per la Protezione e la Ricerca Ambientale (ISPRA), Rome and Ozzano Emilia (BO), Italy
| | - B M vonHoldt
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, USA
| | - B Jędrzejewska
- Mammal Research Institute, Polish Academy of Sciences, Białowieża, Poland
| | - E Randi
- 1] Istituto Superiore per la Protezione e la Ricerca Ambientale (ISPRA), Rome and Ozzano Emilia (BO), Italy [2] Aalborg University, Department 18, Section of Environmental Engineering, Aalborg, Denmark
| | - W Jędrzejewski
- Mammal Research Institute, Polish Academy of Sciences, Białowieża, Poland
| | - V E Sidorovich
- Institute of Zoology, National Academy of Sciences of Belarus, Minsk, Belarus
| | - E A Ostrander
- Cancer Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - R K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, USA
| |
Collapse
|
29
|
Wan Q, Su J, Chen X, Yang C. Gene-based polymorphisms, genomic organization of interferon-β promoter stimulator 1 (IPS-1) gene and association study with the natural resistance to grass carp reovirus in grass carp Ctenopharyngodon idella. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2013; 41:756-765. [PMID: 24001582 DOI: 10.1016/j.dci.2013.08.022] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Revised: 08/19/2013] [Accepted: 08/24/2013] [Indexed: 06/02/2023]
Abstract
Interferon-β promoter stimulator 1 (IPS-1) plays a pivotal role in the production of type I interferon (IFN) and pro-inflammatory cytokines. Though its function in innate immunity was widely studied, its genetic polymorphisms and evolution information were rarely known. Herein, the present study firstly identified and characterized the genomic organization of CiIPS-1 (IPS-1 gene of grass carp, Ctenopharyngodon idella) and its nucleotide polymorphisms. CiIPS-1 consists of 8789 bp, with a 314 bp 5'-flanking region, 6 exons and 5 introns. After a challenge experiment, statistical analysis was performed to assess the association between the 17 polymorphisms of CiIPS-1 and the natural resistance of C. idella to grass carp reovirus (GCRV), which revealed that -3741 C/T in the intron 1, 933 C/G in the coding sequence and 2299 G/T in the last intron were significant (P < 0.05). The subsequent secondary challenge test further confirmed that -3741TT group were more resistant than -3741CC group to GCRV infection, while 933GG and 2299GG stocks were respectively more susceptible than 933CC and 2299TT stocks (P < 0.05). In addition, haplotype and polymorphism-polymorphism interaction analyses uncovered four significant haplotypes (P < 0.05) and two notable polymorphisms. These findings may provide basic data for the further functional research of CiIPS-1, and genetic markers for molecular and transgenic breeding of C. idella.
Collapse
Affiliation(s)
- Quanyuan Wan
- College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
| | | | | | | |
Collapse
|
30
|
Chan YC, Roos C, Inoue-Murayama M, Inoue E, Shih CC, Pei KJC, Vigilant L. Inferring the evolutionary histories of divergences in Hylobates and Nomascus gibbons through multilocus sequence data. BMC Evol Biol 2013; 13:82. [PMID: 23586586 PMCID: PMC3637282 DOI: 10.1186/1471-2148-13-82] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Accepted: 04/08/2013] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Gibbons (Hylobatidae) are the most diverse group of living apes. They exist as geographically-contiguous species which diverged more rapidly than did their close relatives, the great apes (Hominidae). Of the four extant gibbon genera, the evolutionary histories of two polyspecific genera, Hylobates and Nomascus, have been the particular focus of research but the DNA sequence data used was largely derived from the maternally inherited mitochondrial DNA (mtDNA) locus. RESULTS To investigate the evolutionary relationships and divergence processes of gibbon species, particularly those of the Hylobates genus, we produced and analyzed a total of 11.5 kb DNA of sequence at 14 biparentally inherited autosomal loci. We find that on average gibbon genera have a high average sequence diversity but a lower degree of genetic differentiation as compared to great ape genera. Our multilocus species tree features H. pileatus in a basal position and a grouping of the four Sundaic island species (H. agilis, H. klossii, H. moloch and H. muelleri). We conducted pairwise comparisons based on an isolation-with-migration (IM) model and detect signals of asymmetric gene flow between H. lar and H. moloch, between H. agilis and H. muelleri, and between N. leucogenys and N. siki. CONCLUSIONS Our multilocus analyses provide inferences of gibbon evolutionary histories complementary to those based on single gene data. The results of IM analyses suggest that the divergence processes of gibbons may be accompanied by gene flow. Future studies using analyses of multi-population model with samples of known provenance for Hylobates and Nomascus species would expand the understanding of histories of gene flow during divergences for these two gibbon genera.
Collapse
Affiliation(s)
- Yi-Chiao Chan
- Department of Primatology, Max-Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, Leipzig 04103, Germany.
| | | | | | | | | | | | | |
Collapse
|
31
|
Padhukasahasram B, Rannala B. Meiotic gene-conversion rate and tract length variation in the human genome. Eur J Hum Genet 2013:ejhg201330. [PMID: 23443031 DOI: 10.1038/ejhg.2013.30] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2012] [Revised: 12/17/2012] [Accepted: 01/10/2013] [Indexed: 01/11/2023] Open
Abstract
Meiotic recombination occurs in the form of two different mechanisms called crossing-over and gene-conversion and both processes have an important role in shaping genetic variation in populations. Although variation in crossing-over rates has been studied extensively using sperm-typing experiments, pedigree studies and population genetic approaches, our knowledge of variation in gene-conversion parameters (ie, rates and mean tract lengths) remains far from complete. To explore variability in population gene-conversion rates and its relationship to crossing-over rate variation patterns, we have developed and validated using coalescent simulations a comprehensive Bayesian full-likelihood method that can jointly infer crossing-over and gene-conversion rates as well as tract lengths from population genomic data under general variable rate models with recombination hotspots. Here, we apply this new method to SNP data from multiple human populations and attempt to characterize for the first time the fine-scale variation in gene-conversion parameters along the human genome. We find that the estimated ratio of gene-conversion to crossing-over rates varies considerably across genomic regions as well as between populations. However, there is a great degree of uncertainty associated with such estimates. We also find substantial evidence for variation in the mean conversion tract length. The estimated tract lengths did not show any negative relationship with the local heterozygosity levels in our analysis.European Journal of Human Genetics advance online publication, 27 February 2013; doi:10.1038/ejhg.2013.30.
Collapse
Affiliation(s)
- Badri Padhukasahasram
- 1] Center for Health Policy and Health Services Research, Henry Ford Health System, Detroit, MI, USA [2] Genome Center and Department of Evolution and Ecology, University of California, Davis, Davis, CA, USA
| | - Bruce Rannala
- Genome Center and Department of Evolution and Ecology, University of California, Davis, Davis, CA, USA
| |
Collapse
|
32
|
Moleirinho A, Seixas S, Lopes AM, Bento C, Prata MJ, Amorim A. Evolutionary constraints in the β-globin cluster: the signature of purifying selection at the δ-globin (HBD) locus and its role in developmental gene regulation. Genome Biol Evol 2013; 5:559-71. [PMID: 23431002 PMCID: PMC3622298 DOI: 10.1093/gbe/evt029] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/16/2013] [Indexed: 12/15/2022] Open
Abstract
Human hemoglobins, the oxygen carriers in the blood, are composed by two α-like and two β-like globin monomers. The β-globin gene cluster located at 11p15.5 comprises one pseudogene and five genes whose expression undergoes two critical switches: the embryonic-to-fetal and fetal-to-adult transition. HBD encodes the δ-globin chain of the minor adult hemoglobin (HbA2), which is assumed to be physiologically irrelevant. Paradoxically, reduced diversity levels have been reported for this gene. In this study, we sought a detailed portrait of the genetic variation within the β-globin cluster in a large human population panel from different geographic backgrounds. We resequenced the coding and noncoding regions of the two adult β-globin genes (HBD and HBB) in European and African populations, and analyzed the data from the β-globin cluster (HBE, HBG2, HBG1, HBBP1, HBD, and HBB) in 1,092 individuals representing 14 populations sequenced as part of the 1000 Genomes Project. Additionally, we assessed the diversity levels in nonhuman primates using chimpanzee sequence data provided by the PanMap Project. Comprehensive analyses, based on classic neutrality tests, empirical and haplotype-based studies, revealed that HBD and its neighbor pseudogene HBBP1 have mainly evolved under purifying selection, suggesting that their roles are essential and nonredundant. Moreover, in the light of recent studies on the chromatin conformation of the β-globin cluster, we present evidence sustaining that the strong functional constraints underlying the decreased contemporary diversity at these two regions were not driven by protein function but instead are likely due to a regulatory role in ontogenic switches of gene expression.
Collapse
Affiliation(s)
- Ana Moleirinho
- Institute of Molecular Pathology and Immunology of University of Porto (IPATIMUP), Porto, Portugal.
| | | | | | | | | | | |
Collapse
|
33
|
Comeron JM, Ratnappan R, Bailin S. The many landscapes of recombination in Drosophila melanogaster. PLoS Genet 2012; 8:e1002905. [PMID: 23071443 PMCID: PMC3469467 DOI: 10.1371/journal.pgen.1002905] [Citation(s) in RCA: 332] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2012] [Accepted: 07/02/2012] [Indexed: 01/06/2023] Open
Abstract
Recombination is a fundamental biological process with profound evolutionary implications. Theory predicts that recombination increases the effectiveness of selection in natural populations. Yet, direct tests of this prediction have been restricted to qualitative trends due to the lack of detailed characterization of recombination rate variation across genomes and within species. The use of imprecise recombination rates can also skew population genetic analyses designed to assess the presence and mode of selection across genomes. Here we report the first integrated high-resolution description of genomic and population variation in recombination, which also distinguishes between the two outcomes of meiotic recombination: crossing over (CO) and gene conversion (GC). We characterized the products of 5,860 female meioses in Drosophila melanogaster by genotyping a total of 139 million informative SNPs and mapped 106,964 recombination events at a resolution down to 2 kilobases. This approach allowed us to generate whole-genome CO and GC maps as well as a detailed description of variation in recombination among individuals of this species. We describe many levels of variation in recombination rates. At a large-scale (100 kb), CO rates exhibit extreme and highly punctuated variation along chromosomes, with hot and coldspots. We also show extensive intra-specific variation in CO landscapes that is associated with hotspots at low frequency in our sample. GC rates are more uniformly distributed across the genome than CO rates and detectable in regions with reduced or absent CO. At a local scale, recombination events are associated with numerous sequence motifs and tend to occur within transcript regions, thus suggesting that chromatin accessibility favors double-strand breaks. All these non-independent layers of variation in recombination across genomes and among individuals need to be taken into account in order to obtain relevant estimates of recombination rates, and should be included in a new generation of population genetic models of the interaction between selection and linkage.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, Iowa, USA.
| | | | | |
Collapse
|
34
|
Gene conversion occurs within the mating-type locus of Cryptococcus neoformans during sexual reproduction. PLoS Genet 2012; 8:e1002810. [PMID: 22792079 PMCID: PMC3390403 DOI: 10.1371/journal.pgen.1002810] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2012] [Accepted: 05/23/2012] [Indexed: 12/30/2022] Open
Abstract
Meiotic recombination of sex chromosomes is thought to be repressed in organisms with heterogametic sex determination (e.g. mammalian X/Y chromosomes), due to extensive divergence and chromosomal rearrangements between the two chromosomes. However, proper segregation of sex chromosomes during meiosis requires crossing-over occurring within the pseudoautosomal regions (PAR). Recent studies reveal that recombination, in the form of gene conversion, is widely distributed within and may have played important roles in the evolution of some chromosomal regions within which recombination was thought to be repressed, such as the centromere cores of maize. Cryptococcus neoformans, a major human pathogenic fungus, has an unusually large mating-type locus (MAT, >100 kb), and the MAT alleles from the two opposite mating-types show extensive nucleotide sequence divergence and chromosomal rearrangements, mirroring characteristics of sex chromosomes. Meiotic recombination was assumed to be repressed within the C. neoformans MAT locus. A previous study identified recombination hot spots flanking the C. neoformans MAT, and these hot spots are associated with high GC content. Here, we investigated a GC-rich intergenic region located within the MAT locus of C. neoformans to establish if this region also exhibits unique recombination behavior during meiosis. Population genetics analysis of natural C. neoformans isolates revealed signals of homogenization spanning this GC-rich intergenic region within different C. neoformans lineages, consistent with a model in which gene conversion of this region during meiosis prevents it from diversifying within each lineage. By analyzing meiotic progeny from laboratory crosses, we found that meiotic recombination (gene conversion) occurs around the GC-rich intergenic region at a frequency equal to or greater than the meiotic recombination frequency observed in other genomic regions. We discuss the implications of these findings with regards to the possible functional and evolutionary importance of gene conversion within the C. neoformans MAT locus and, more generally, in fungi. Recombination has been thought to be repressed within sex chromosomes, as well as within the mating-type (MAT) loci in many fungi, due to the highly diverged and rearranged nature between alleles defining opposite sexes or mating-types. However, it has long been appreciated that recombination can occur within these presumptive recombinational “cold spots,” and recent studies reveal that recombination, including gene conversion, can occur at a frequency higher than previously appreciated and could play important roles in shaping evolution of these chromosomal regions. Here, we provide evidence that, during sexual reproduction of the human pathogenic fungus Cryptococcus neoformans, recombination (gene conversion) occurs across a GC-rich intergenic region within the MAT locus. The frequency of this gene conversion is comparable to those of typical meiotic recombination events observed in other chromosomal regions. This is in accord with population genetics analyses, which indicate homogenization between alleles of opposite mating-types within the intergenic region. Gene conversion within these highly rearranged chromosomal regions may serve to ensure proper meiosis and/or rejuvenate genes/chromosomal regions within MAT that are otherwise facing irreversible evolutionary decay. In conclusion, our study provides further experimental evidence that at least some recombinational “cold spots” are not that cold, after all.
Collapse
|
35
|
Iskow RC, Gokcumen O, Lee C. Exploring the role of copy number variants in human adaptation. Trends Genet 2012; 28:245-57. [PMID: 22483647 PMCID: PMC3533238 DOI: 10.1016/j.tig.2012.03.002] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2011] [Revised: 03/05/2012] [Accepted: 03/06/2012] [Indexed: 11/18/2022]
Abstract
Over the past decade, the ubiquity of copy number variants (CNVs, the gain or loss of genomic material) in the genomes of healthy humans has become apparent. Although some of these variants are associated with disorders, a handful of studies documented an adaptive advantage conferred by CNVs. In this review, we propose that CNVs are substrates for human evolution and adaptation. We discuss the possible mechanisms and evolutionary processes in which CNVs are selected, outline the current challenges in identifying these loci, and highlight that copy number variable regions allow for the creation of novel genes that may diversify the repertoire of such genes in response to rapidly changing environments. We expect that many more adaptive CNVs will be discovered in the coming years, and we believe that these new findings will contribute to our understanding of human-specific phenotypes.
Collapse
Affiliation(s)
- Rebecca C Iskow
- Department of Pathology, Brigham and Women's Hospital, Boston, MA 02115, USA
| | | | | |
Collapse
|
36
|
Corbin L, Liu A, Bishop S, Woolliams J. Estimation of historical effective population size using linkage disequilibria with marker data. J Anim Breed Genet 2012; 129:257-70. [DOI: 10.1111/j.1439-0388.2012.01003.x] [Citation(s) in RCA: 92] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
37
|
Seixas S, Ivanova N, Ferreira Z, Rocha J, Victor BL. Loss and gain of function in SERPINB11: an example of a gene under selection on standing variation, with implications for host-pathogen interactions. PLoS One 2012; 7:e32518. [PMID: 22393410 PMCID: PMC3290568 DOI: 10.1371/journal.pone.0032518] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2011] [Accepted: 02/01/2012] [Indexed: 12/26/2022] Open
Abstract
Serine protease inhibitors (SERPINs) are crucial in the regulation of diverse biological processes including inflammation and immune response. SERPINB11, located in the 18q21 gene cluster, is a polymorphic gene/pseudogene coding for a non-inhibitory SERPIN. In a genome-wide scan for recent selection, SERPINB11 was identified as a potential candidate gene for adaptive evolution in Yoruba. The present study sought a better understanding of the evolutionary history of SERPINB11, with special focus on evaluating its selective signature. Through the resequencing of coding and noncoding regions of SERPINB11 in 20 Yorubans and analyzing primate orthologous sequences, we identified a full-length SERPINB11 variant encoding a non-inhibitory SERPIN as the putative candidate of selection – probably driven to higher frequencies by an adaptive response using preexisting variation. In addition, we detected contrasting evolutionary features of SERPINB11 in primates: While primate phylogeny as a whole is under purifying selection, the human lineage shows evidence of positive selection in a few codons, all associated with the active SERPINB11. Comparative modeling studies suggest that positively selected codons reduce SERPINB11's ability to undergo the conformational changes typical of inhibitory SERPINs – suggesting that it is evolving towards a new non-inhibitory function in humans. Significant correlations between SERPINB11 variants and the environmental variables, pastoralism and pathogen richness, have led us to propose a selective advantage through host-pathogen interactions, possibly linked to an adaptive response combating the emergence of infectious diseases in recent human evolution. This work represents the first description of a resurrected gene in humans, and may well exemplify selection on standing variation triggered by drastic ecological shifts.
Collapse
Affiliation(s)
- Susana Seixas
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal.
| | | | | | | | | |
Collapse
|
38
|
Scliar MO, Soares-Souza GB, Chevitarese J, Lemos L, Magalhães WCS, Fagundes NJ, Bonatto SL, Yeager M, Chanock SJ, Tarazona-Santos E. The population genetics of Quechuas, the largest native South American group: autosomal sequences, SNPs, and microsatellites evidence high level of diversity. AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY 2012; 147:443-51. [PMID: 22282032 DOI: 10.1002/ajpa.22013] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2011] [Accepted: 12/09/2011] [Indexed: 11/12/2022]
Abstract
Elucidating the pattern of genetic diversity for non-European populations is necessary to make the benefits of human genetics research available to individuals from these groups. In the era of large human genomic initiatives, Native American populations have been neglected, in particular, the Quechua, the largest South Amerindian group settled along the Andes. We characterized the genetic diversity of a Quechua population in a global setting, using autosomal noncoding sequences (nine unlinked loci for a total of 16 kb), 351 unlinked SNPs and 678 microsatellites and tested predictions of the model of the evolution of Native Americans proposed by (Tarazona-Santos et al.: Am J Hum Genet 68 (2001) 1485-1496). European admixture is <5% and African ancestry is barely detectable in the studied population. The largest genetic distances were between African versus Quechua or Melanesian populations, which is concordant with the African origin of modern humans and the fact that South America was the last part of the world to be peopled. The diversity in the Quechua population is comparable with that of Eurasian populations, and the allele frequency spectrum based on resequencing data does not reflect a reduction in the proportion of rare alleles. Thus, the Quechua population is a large reservoir of common and rare genetic variants of South Amerindians. These results are consistent with and complement our evolutionary model of South Amerindians (Tarazona-Santos et al.: Am J Hum Genet 68 (2001) 1485-1496), proposed based on Y-chromosome data, which predicts high genomic diversity due to the high level of gene flow between Andean populations and their long-term effective population size.
Collapse
Affiliation(s)
- Marilia O Scliar
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais. Av. Antonio Carlos 6627, Pampulha. Caixa Postal 486, Belo Horizonte, Minas Gerais, CEP 31270-910, Brazil
| | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Cagliani R, Guerini FR, Fumagalli M, Riva S, Agliardi C, Galimberti D, Pozzoli U, Goris A, Dubois B, Fenoglio C, Forni D, Sanna S, Zara I, Pitzalis M, Zoledziewska M, Cucca F, Marini F, Comi GP, Scarpini E, Bresolin N, Clerici M, Sironi M. A trans-specific polymorphism in ZC3HAV1 is maintained by long-standing balancing selection and may confer susceptibility to multiple sclerosis. Mol Biol Evol 2012; 29:1599-613. [PMID: 22319148 PMCID: PMC7187542 DOI: 10.1093/molbev/mss002] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
The human ZC3HAV1 gene encodes an antiviral protein. The longest splicing isoform of ZC3HAV1 contains a C-terminal PARP-like domain, which has evolved under positive selection in primates. We analyzed the evolutionary history of this same domain in humans and in Pan troglodytes. We identified two variants that segregate in both humans and chimpanzees; one of them (rs3735007) does not occur at a hypermutable site and accounts for a nonsynonymous substitution (Thr851Ile). The probability that the two trans-specific polymorphisms have occurred independently in the two lineages was estimated to be low (P = 0.0054), suggesting that at least one of them has arisen before speciation and has been maintained by selection. Population genetic analyses in humans indicated that the region surrounding the shared variants displays strong evidences of long-standing balancing selection. Selection signatures were also observed in a chimpanzee population sample. Inspection of 1000 Genomes data confirmed these findings but indicated that search for selection signatures using low-coverage whole-genome data may need masking of repetitive sequences. A case–control study of more than 1,000 individuals from mainland Italy indicated that the Thr851Ile SNP is significantly associated with susceptibility to multiple sclerosis (MS) (odds ratio [OR] = 1.47, 95% confidence intervals [CI]: 1.08–1.99, P = 0.011). This finding was confirmed in a larger sample of 4,416 Sardinians cases/controls (OR = 1.18, 95% CI: 1.037–1.344, P = 0.011), but not in a population from Belgium. We provide one of the first instances of human/chimpanzee trans-specific coding variant located outside the major histocompatibility complex region. The selective pressure is likely to be virus driven; in modern populations, this variant associates with susceptibility to MS, possibly via the interaction with environmental factors.
Collapse
Affiliation(s)
- R Cagliani
- Scientific Institute IRCCS E. Medea, Bosisio Parini, Lecco, Italy
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Polymorphism and Divergence in Two Willow Species, Salix viminalis L. and Salix schwerinii E. Wolf. G3-GENES GENOMES GENETICS 2011; 1:387-400. [PMID: 22384349 PMCID: PMC3276148 DOI: 10.1534/g3.111.000539] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/07/2011] [Accepted: 08/24/2011] [Indexed: 12/20/2022]
Abstract
We investigated species divergence, present and past gene flow, levels of nucleotide polymorphism, and linkage disequilibrium in two willows from the plant genus Salix. Salix belongs together with Populus to the Salicaceae family; however, most population genetic studies of Salicaceae have been performed in Populus, the model genus in forest biology. Here we present a study on two closely related willow species Salix viminalis and S. schwerinii, in which we have resequenced 33 and 32 nuclear gene segments representing parts of 18 nuclear loci in 24 individuals for each species. We used coalescent simulations and estimated the split time to around 600,000 years ago and found that there is currently limited gene flow between the species. Mean intronic nucleotide diversity across gene segments was slightly higher in S. schwerinii (πi = 0.00849) than in S. viminalis (πi = 0.00655). Compared with other angiosperm trees, the two willows harbor intermediate levels of silent polymorphisms. The decay of linkage disequilibrium was slower in S. viminalis compared with S. schwerinii, and we speculate that this is due to different demographic histories as S. viminalis has been partly domesticated in Europe.
Collapse
|
41
|
Abstract
Meiotic recombination is a fundamental cellular mechanism in sexually reproducing organisms and its different forms, crossing over and gene conversion both play an important role in shaping genetic variation in populations. Here, we describe a coalescent-based full-likelihood Markov chain Monte Carlo (MCMC) method for jointly estimating the crossing-over, gene-conversion, and mean tract length parameters from population genomic data under a Bayesian framework. Although computationally more expensive than methods that use approximate likelihoods, the relative efficiency of our method is expected to be optimal in theory. Furthermore, it is also possible to obtain a posterior sample of genealogies for the data using this method. We first check the performance of the new method on simulated data and verify its correctness. We also extend the method for inference under models with variable gene-conversion and crossing-over rates and demonstrate its ability to identify recombination hotspots. Then, we apply the method to two empirical data sets that were sequenced in the telomeric regions of the X chromosome of Drosophila melanogaster. Our results indicate that gene conversion occurs more frequently than crossing over in the su-w and su-s gene sequences while the local rates of crossing over as inferred by our program are not low. The mean tract lengths for gene-conversion events are estimated to be ∼70 bp and 430 bp, respectively, for these data sets. Finally, we discuss ideas and optimizations for reducing the execution time of our algorithm.
Collapse
|
42
|
|
43
|
Ko WY, Kaercher KA, Giombini E, Marcatili P, Froment A, Ibrahim M, Lema G, Nyambo TB, Omar SA, Wambebe C, Ranciaro A, Hirbo JB, Tishkoff SA. Effects of natural selection and gene conversion on the evolution of human glycophorins coding for MNS blood polymorphisms in malaria-endemic African populations. Am J Hum Genet 2011; 88:741-754. [PMID: 21664997 DOI: 10.1016/j.ajhg.2011.05.005] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2011] [Revised: 04/22/2011] [Accepted: 05/05/2011] [Indexed: 11/17/2022] Open
Abstract
Malaria has been a very strong selection pressure in recent human evolution, particularly in Africa. Of the one million deaths per year due to malaria, more than 90% are in sub-Saharan Africa, a region with high levels of genetic variation and population substructure. However, there have been few studies of nucleotide variation at genetic loci that are relevant to malaria susceptibility across geographically and genetically diverse ethnic groups in Africa. Invasion of erythrocytes by Plasmodium falciparum parasites is central to the pathology of malaria. Glycophorin A (GYPA) and B (GYPB), which determine MN and Ss blood types, are two major receptors that are expressed on erythrocyte surfaces and interact with parasite ligands. We analyzed nucleotide diversity of the glycophorin gene family in 15 African populations with different levels of malaria exposure. High levels of nucleotide diversity and gene conversion were found at these genes. We observed divergent patterns of genetic variation between these duplicated genes and between different extracellular domains of GYPA. Specifically, we identified fixed adaptive changes at exons 3-4 of GYPA. By contrast, we observed an allele frequency spectrum skewed toward a significant excess of intermediate-frequency alleles at GYPA exon 2 in many populations; the degree of spectrum distortion is correlated with malaria exposure, possibly because of the joint effects of gene conversion and balancing selection. We also identified a haplotype causing three amino acid changes in the extracellular domain of glycophorin B. This haplotype might have evolved adaptively in five populations with high exposure to malaria.
Collapse
Affiliation(s)
- Wen-Ya Ko
- Department of Genetics and Biology, School of Medicine and School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kristin A Kaercher
- Department of Genetics and Biology, School of Medicine and School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Emanuela Giombini
- Department of Biochemical Sciences "Rossi Fanelli" University of Rome "La Sapienza" P.ale Aldo Moro, 5 - 00185 Rome, Italy
| | - Paolo Marcatili
- Department of Biochemical Sciences "Rossi Fanelli" University of Rome "La Sapienza" P.ale Aldo Moro, 5 - 00185 Rome, Italy
| | - Alain Froment
- UMR 208, Institut de Recherche pour le Développement, Muséum National d'Histoire Naturelle, Musée de l'Homme, 75116 Paris, France
| | - Muntaser Ibrahim
- Department of Molecular Biology, Institute of Endemic Diseases, University of Khartoum, 15-Khartoum, Sudan
| | - Godfrey Lema
- Department of Biochemistry, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
| | - Thomas B Nyambo
- Department of Biochemistry, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
| | - Sabah A Omar
- Kenya Medical Research Institute, Center for Biotechnology Research and Development, 54840-00200 Nairobi, Kenya
| | | | - Alessia Ranciaro
- Department of Genetics and Biology, School of Medicine and School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jibril B Hirbo
- Department of Genetics and Biology, School of Medicine and School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sarah A Tishkoff
- Department of Genetics and Biology, School of Medicine and School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
44
|
Ferreira Z, Hurle B, Rocha J, Seixas S. Differing evolutionary histories of WFDC8 (short-term balancing) in Europeans and SPINT4 (incomplete selective sweep) in Africans. Mol Biol Evol 2011; 28:2811-22. [PMID: 21536719 DOI: 10.1093/molbev/msr106] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
The whey acidic protein four-disulfide core (WFDC) gene cluster on human chromosome 20q13, harbors 15 small serine protease inhibitor genes with roles in innate immunity, reproduction, and regulation of endogenous proteases kallikreins. The WFDC cluster has emerged as a prime example of rapid diversification and adaptive evolution in primates. This study sought a better understanding of the evolutionary history of WFDC genes in humans and focused on exploring the adaptive selection signatures found in populations of European (Utah residents with ancestry from northern and western Europe [CEU]) and African (Yoruba from Ibadan, in Nigeria [YRI]) ancestry in a genome-wide scan for putative targets of recent adaptive selection. Our approach included resequencing coding and noncoding regions of WFDC6, EPPIN, and WFDC8 in 20 CEU and of SPINT4 in 20 YRI individuals. We generated 302 kb and 60 kb of high-quality sequence data from CEU and of YRI populations, respectively, enabling the identification of 72 single nucleotide polymorphisms. Using classic neutrality tests, empirical and haplotype-based analysis, we pinpointed WFDC8 and SPINT4 as the likely targets of short-term balancing selection in the CEU population, and recent positive selection (incomplete selective sweep) in the YRI population. Putative candidate variants targeted by selection include 44A (rs7273669A) for WFDC8, which may downregulate gene expression by abolishing the binding site of two transcription factors; and a haplotype configuration [Ser73+98A] (rs6017667A-rs6032474A) for SPINT4, which may simultaneously affect protein function and gene regulation. We propose that the evolution of WFDC8 and SPINT4 has been shaped by complex selective scenarios due to the interdependence of variant fitness and ecological variables.
Collapse
Affiliation(s)
- Zélia Ferreira
- Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
| | | | | | | |
Collapse
|
45
|
Abstract
Performing inference on contemporary samples of DNA sequence data is an important and challenging task. Computationally intensive methods such as importance sampling (IS) are attractive because they make full use of the available data, but in the presence of recombination the large state space of genealogies can be prohibitive. In this article, we make progress by developing an efficient IS proposal distribution for a two-locus model of sequence data. We show that the proposal developed here leads to much greater efficiency, outperforming existing IS methods that could be adapted to this model. Among several possible applications, the algorithm can be used to find maximum likelihood estimates for mutation and crossover rates, and to perform ancestral inference. We illustrate the method on previously reported sequence data covering two loci either side of the well-studied TAP2 recombination hotspot. The two loci are themselves largely non-recombining, so we obtain a gene tree at each locus and are able to infer in detail the effect of the hotspot on their joint ancestry. We summarize this joint ancestry by introducing the gene graph, a summary of the well-known ancestral recombination graph.
Collapse
Affiliation(s)
- Paul A Jenkins
- Department of Statistics, University of Oxford, Oxford, United Kingdom.
| | | |
Collapse
|
46
|
Dixit J, Srivastava H, Singh O, Saksena D, Das A. Multilocus nuclear DNA markers and genetic parameters in an Indian Anopheles minimus population. INFECTION GENETICS AND EVOLUTION 2011; 11:572-9. [DOI: 10.1016/j.meegid.2011.01.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2010] [Revised: 01/03/2011] [Accepted: 01/06/2011] [Indexed: 01/26/2023]
|
47
|
Clark AG, Wang X, Matise T. Contrasting methods of quantifying fine structure of human recombination. Annu Rev Genomics Hum Genet 2010; 11:45-64. [PMID: 20690817 DOI: 10.1146/annurev-genom-082908-150031] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
There has been considerable excitement over the ability to construct linkage maps based only on genome-wide genotype data for single nucleotide polymorphic sites (SNPs) in a population sample. These maps, which are derived from estimates of linkage disequilibrium (LD), rely on population genetics theory to relate the decay of LD to the local rate of recombination, but other population processes also come into play. Here we contrast these LD maps to the classically derived, pedigree-based human recombination maps. The LD maps have a level of resolution greatly exceeding that of the pedigree maps, and at this fine scale, sperm typing allows a means of validation. While at a gross level both the pedigree maps and the sperm typing methods generally agree with LD maps, there are significant local differences between them, and the fact that these maps measure different genetic features should be remembered when using them for other genetic inferences.
Collapse
Affiliation(s)
- Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.
| | | | | |
Collapse
|
48
|
Katzman S, Kern AD, Pollard KS, Salama SR, Haussler D. GC-biased evolution near human accelerated regions. PLoS Genet 2010; 6:e1000960. [PMID: 20502635 PMCID: PMC2873926 DOI: 10.1371/journal.pgen.1000960] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2009] [Accepted: 04/20/2010] [Indexed: 12/30/2022] Open
Abstract
Regions of the genome that have been the target of positive selection specifically along the human lineage are of special importance in human biology. We used high throughput sequencing combined with methods to enrich human genomic samples for particular targets to obtain the sequence of 22 chromosomal samples at high depth in 40 kb neighborhoods of 49 previously identified 100–400 bp elements that show evidence for human accelerated evolution. In addition to selection, the pattern of nucleotide substitutions in several of these elements suggested an historical bias favoring the conversion of weak (A or T) alleles into strong (G or C) alleles. Here we found strong evidence in the derived allele frequency spectra of many of these 40 kb regions for ongoing weak-to-strong fixation bias. Comparison of the nucleotide composition at polymorphic loci to the composition at sites of fixed substitutions additionally reveals the signature of historical weak-to-strong fixation bias in a subset of these regions. Most of the regions with evidence for historical bias do not also have signatures of ongoing bias, suggesting that the evolutionary forces generating weak-to-strong bias are not constant over time. To investigate the role of selection in shaping these regions, we analyzed the spatial pattern of polymorphism in our samples. We found no significant evidence for selective sweeps, possibly because the signal of such sweeps has decayed beyond the power of our tests to detect them. Together, these results do not rule out functional roles for the observed changes in these regions—indeed there is good evidence that the first two are functional elements in humans—but they suggest that a fixation process (such as biased gene conversion) that is biased at the nucleotide level, but is otherwise selectively neutral, could be an important evolutionary force at play in them, both historically and at present. The search for functional regions in the human genome, beyond the protein-coding portion, often relies on signals of conservation across species. The Human Accelerated Regions (HARs) are strongly conserved elements, ranging in size from 100–400 bp, that show an unexpected number of human-specific changes. This pattern suggests that HARs may be functional elements that have significantly changed during human evolution. To analyze the evolutionary forces that led these changes, we studied 40 kb neighborhoods of the top 49 HARs. We took advantage of recently developed DNA sequencing technology, coupled with methods to isolate genomic DNA for our target regions only, to determine the genotypes in 22 chromosomal samples. This polymorphism data showed no significant evidence for adaptive selective sweeps in HAR regions. By contrast, we found strong evidence for a nucleotide bias in the fixation of mutations from A or T to G or C basepairs. Our work reveals that this bias in the HAR neighborhoods is not just an historic phenomenon, but is ongoing in the present day human population. This finding adds credence to the possibility that non-selective forces, such as biased gene conversion, could have contributed to the evolution of several of these regions.
Collapse
Affiliation(s)
- Sol Katzman
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Andrew D. Kern
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Katherine S. Pollard
- Gladstone Institutes, University of California San Francisco, San Francisco, California, United States of America
| | - Sofie R. Salama
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - David Haussler
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, California, United States of America
- * E-mail:
| |
Collapse
|
49
|
Formulating a historical and demographic model of recent human evolution based on resequencing data from noncoding regions. PLoS One 2010; 5:e10284. [PMID: 20421973 PMCID: PMC2858654 DOI: 10.1371/journal.pone.0010284] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2009] [Accepted: 03/27/2010] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Estimating the historical and demographic parameters that characterize modern human populations is a fundamental part of reconstructing the recent history of our species. In addition, the development of a model of human evolution that can best explain neutral genetic diversity is required to identify confidently regions of the human genome that have been targeted by natural selection. METHODOLOGY/PRINCIPAL FINDINGS We have resequenced 20 independent noncoding autosomal regions dispersed throughout the genome in 213 individuals from different continental populations, corresponding to a total of approximately 6 Mb of diploid resequencing data. We used these data to explore and co-estimate an extensive range of historical and demographic parameters with a statistical framework that combines the evaluation of multiple models of human evolution via a best-fit approach, followed by an Approximate Bayesian Computation (ABC) analysis. From a methodological standpoint, evaluating the accuracy of the parameter co-estimation allowed us to identify the most accurate set of statistics to be used for the estimation of each of the different historical and demographic parameters characterizing recent human evolution. CONCLUSIONS/SIGNIFICANCE Our results support a model in which modern humans left Africa through a single major dispersal event occurring approximately 60,000 years ago, corresponding to a drastic reduction of approximately 5 times the effective population size of the ancestral African population of approximately 13,800 individuals. Subsequently, the ancestors of modern Europeans and East Asians diverged much later, approximately 22,500 years ago, from the population of ancestral migrants. This late diversification of Eurasians after the African exodus points to the occurrence of a long maturation phase in which the ancestral Eurasian population was not yet diversified.
Collapse
|
50
|
Arguello JR, Zhang Y, Kado T, Fan C, Zhao R, Innan H, Wang W, Long M. Recombination yet inefficient selection along the Drosophila melanogaster subgroup's fourth chromosome. Mol Biol Evol 2010; 27:848-61. [PMID: 20008457 PMCID: PMC2877538 DOI: 10.1093/molbev/msp291] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A central goal of evolutionary genetics is an understanding of the forces responsible for the observed variation, both within and between species. Theoretical and empirical work have demonstrated that genetic recombination contributes to this variation by breaking down linkage between nucleotide sites, thus allowing them to behave independently and for selective forces to act efficiently on them. The Drosophila fourth chromosome, which is believed to experience no-or very low-rates of recombination has been an important model for investigating these effects. Despite previous efforts, central questions regarding the extent of recombination and the predominant modes of selection acting on it remain open. In order to more comprehensively test hypotheses regarding recombination and its potential influence on selection along the fourth chromosome, we have resequenced regions from most of its genes from Drosophila melanogaster, D. simulans, and D. yakuba. These data, along with available outgroup sequence, demonstrate that recombination is low but significantly greater than zero for the three species. Despite there being recombination, there is strong evidence that its frequency is low enough to have rendered selection relatively inefficient. The signatures of relaxed constraint can be detected at both the level of polymorphism and divergence.
Collapse
Affiliation(s)
- J. Roman Arguello
- Committee on Evolutionary Biology, University of Chicago
- Department of Ecology and Evolution, University of Chicago
| | - Yue Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Tomoyuki Kado
- Hayama Center for Advanced Studies, The Graduate University for Advanced Studies, Hayama, Kanagawa, Japan
| | - Chuanzhu Fan
- Department of Ecology and Evolution, University of Chicago
| | - Ruoping Zhao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Hideki Innan
- Hayama Center for Advanced Studies, The Graduate University for Advanced Studies, Hayama, Kanagawa, Japan
| | - Wen Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Manyuan Long
- Committee on Evolutionary Biology, University of Chicago
- Department of Ecology and Evolution, University of Chicago
| |
Collapse
|