
For: McVean GAT. A genealogical interpretation of linkage disequilibrium. Genetics 2002;162:987-91. [PMID: 12399406 PMCID: PMC1462283 DOI: 10.1093/genetics/162.2.987] [Citation(s) in RCA: 142] [Impact Index Per Article: 6.5] [Indexed: 11/14/2022]

Cited by Other Article(s)

Genomic prediction in animals and plants: simulation of data, validation, reporting, and benchmarking. Genetics 2012;193:347-65. [PMID: 23222650 DOI: 10.1534/genetics.112.147983] [Citation(s) in RCA: 239] [Impact Index Per Article: 19.9] [Indexed: 12/26/2022]

Abstract

The genomic prediction of phenotypes and breeding values in animals and plants has developed rapidly into its own research field. Results of genomic prediction studies are often difficult to compare because data simulation varies, real or simulated data are not fully described, and not all relevant results are reported. In addition, some new methods have been compared only in limited genetic architectures, leading to potentially misleading conclusions. In this article we review simulation procedures, discuss validation and reporting of results, and apply benchmark procedures for a variety of genomic prediction methods in simulated and real example data. Plant and animal breeding programs are being transformed by the use of genomic data, which are becoming widely available and cost-effective to predict genetic merit. A large number of genomic prediction studies have been published using both simulated and real data. The relative novelty of this area of research has made the development of scientific conventions difficult with regard to description of the real data, simulation of genomes, validation and reporting of results, and forward in time methods. In this review article we discuss the generation of simulated genotype and phenotype data, using approaches such as the coalescent and forward in time simulation. We outline ways to validate simulated data and genomic prediction results, including cross-validation. The accuracy and bias of genomic prediction are highlighted as performance indicators that should be reported. We suggest that a measure of relatedness between the reference and validation individuals be reported, as its impact on the accuracy of genomic prediction is substantial. A large number of methods were compared in example simulated and real (pine and wheat) data sets, all of which are publicly available. 
In our limited simulations, most methods performed similarly in traits with a large number of quantitative trait loci (QTL), whereas in traits with fewer QTL variable selection did have some advantages. In the real data sets examined here all methods had very similar accuracies. We conclude that no single method can serve as a benchmark for genomic prediction. We recommend comparing accuracy and bias of new methods to results from genomic best linear prediction and a variable selection approach (e.g., BayesB), because, together, these methods are appropriate for a range of genetic architectures. An accompanying article in this issue provides a comprehensive review of genomic prediction methods and discusses a selection of topics related to application of genomic prediction in plants and animals.

Elhaik E. Empirical distributions of F(ST) from large-scale human polymorphism data. PLoS One 2012;7:e49837. [PMID: 23185452 PMCID: PMC3504095 DOI: 10.1371/journal.pone.0049837] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Received: 02/27/2012] [Accepted: 10/12/2012] [Indexed: 12/19/2022]

Abstract

Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright's F(ST) that apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high-F(ST) may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchical F(ST) analysis showed that only a paucity (12%) of the total genetic variation is distributed between continental populations and even less genetic variation (1%) is found between intra-continental populations. Second, the global F(ST) distribution closely follows an exponential distribution. Third, although the overall F(ST) distribution is similarly shaped (inverse J), F(ST) distributions vary markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean-F(ST) of these groups is linear in allele frequency. These results suggest that investigating the extremes of the F(ST) distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection.

An ancestral recombination graph for diploid populations with skewed offspring distribution. Genetics 2012;193:255-90. [PMID: 23150600 DOI: 10.1534/genetics.112.144329] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Indexed: 11/18/2022]

Abstract

A large offspring-number diploid biparental multilocus population model of Moran type is our object of study. At each time step, a pair of diploid individuals drawn uniformly at random contributes offspring to the population. The number of offspring can be large relative to the total population size. Similar "heavily skewed" reproduction mechanisms have been recently considered by various authors (cf. e.g., Eldon and Wakeley 2006, 2008) and reviewed by Hedgecock and Pudovkin (2011). Each diploid parental individual contributes exactly one chromosome to each diploid offspring, and hence ancestral lineages can coalesce only when in distinct individuals. A separation-of-timescales phenomenon is thus observed. A result of Möhle (1998) is extended to obtain convergence of the ancestral process to an ancestral recombination graph necessarily admitting simultaneous multiple mergers of ancestral lineages. The usual ancestral recombination graph is obtained as a special case of our model when the parents contribute only one offspring to the population each time. Due to diploidy and large offspring numbers, novel effects appear. For example, the marginal genealogy at each locus admits simultaneous multiple mergers in up to four groups, and different loci remain substantially correlated even as the recombination rate grows large. Thus, genealogies for loci far apart on the same chromosome remain correlated. Correlation in coalescence times for two loci is derived and shown to be a function of the coalescence parameters of our model. Extending the observations by Eldon and Wakeley (2008), predictions of linkage disequilibrium are shown to be functions of the reproduction parameters of our model, in addition to the recombination rate. 
Correlations in ratios of coalescence times between loci can be high, even when the recombination rate is high and sample size is large, in large offspring-number populations, as suggested by simulations, hinting at how to distinguish between different population models.

Gautier M, Gharbi K, Cezard T, Foucaud J, Kerdelhué C, Pudlo P, Cornuet JM, Estoup A. The effect of RAD allele dropout on the estimation of genetic variation within and between populations. Mol Ecol 2012;22:3165-78. [DOI: 10.1111/mec.12089] [Citation(s) in RCA: 219] [Impact Index Per Article: 18.3] [Received: 07/12/2012] [Revised: 09/04/2012] [Accepted: 09/12/2012] [Indexed: 12/17/2022]

Rikalainen K, Aspi J, Galarza JA, Koskela E, Mappes T. Maintenance of genetic diversity in cyclic populations-a longitudinal analysis in Myodes glareolus. Ecol Evol 2012;2:1491-502. [PMID: 22957157 PMCID: PMC3434924 DOI: 10.1002/ece3.277] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Received: 02/12/2012] [Revised: 04/05/2012] [Accepted: 04/11/2012] [Indexed: 11/08/2022]

Griswold CK, Henry TA. Epistasis can increase multivariate trait diversity in haploid non-recombining populations. Theor Popul Biol 2012;82:209-21. [PMID: 22771491 DOI: 10.1016/j.tpb.2012.06.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 04/27/2012] [Revised: 06/21/2012] [Accepted: 06/23/2012] [Indexed: 11/18/2022]

Corbin L, Liu A, Bishop S, Woolliams J. Estimation of historical effective population size using linkage disequilibria with marker data. J Anim Breed Genet 2012;129:257-70. [DOI: 10.1111/j.1439-0388.2012.01003.x] [Citation(s) in RCA: 92] [Impact Index Per Article: 7.7] [Indexed: 11/30/2022]

Evolutionary history of synthesis pathway genes for phloroglucinol and cyanide antimicrobials in plant-associated fluorescent pseudomonads. Mol Phylogenet Evol 2012;63:877-90. [PMID: 22426436 DOI: 10.1016/j.ympev.2012.02.030] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Received: 09/28/2011] [Revised: 02/24/2012] [Accepted: 02/29/2012] [Indexed: 11/22/2022]

Bürger R, Akerman A. The effects of linkage and gene flow on local adaptation: a two-locus continent-island model. Theor Popul Biol 2011;80:272-88. [PMID: 21801739 PMCID: PMC3257863 DOI: 10.1016/j.tpb.2011.07.002] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Received: 04/21/2011] [Revised: 07/08/2011] [Accepted: 07/11/2011] [Indexed: 11/24/2022]

Abstract

Population subdivision and migration are generally considered to be important causes of linkage disequilibrium (LD). We explore the combined effects of recombination and gene flow on the amount of LD, the maintenance of polymorphism, and the degree of local adaptation in a subdivided population by analyzing a diploid, deterministic continent-island model with genic selection on two linked loci (i.e., no dominance or epistasis). For this simple model, we characterize explicitly all possible equilibrium configurations. Simple and intuitive approximations for many quantities of interest are obtained in limiting cases, such as weak migration, weak selection, weak or strong recombination. For instance, we derive explicit expressions for the measures D(=p(AB)-p(A)p(B)) and r(2) (the squared correlation in allelic state) of LD. They depend in qualitatively different ways on the migration rate. Remarkably high values of r(2) are maintained between weakly linked loci, especially if gene flow is low. We determine how the maximum amount of gene flow that admits preservation of the locally adapted haplotype, hence of polymorphism at both loci, depends on recombination rate and selection coefficients. We also investigate the evolution of differentiation by examining the invasion of beneficial mutants of small effect that are linked to an already present, locally adapted allele. Mutants of much smaller effect can invade successfully than predicted by naive single-locus theory provided they are at least weakly linked. Finally, the influence of linkage on the degree of local adaptation, the migration load, and the effective migration rate at a neutral locus is explored. We discuss possible consequences for the evolution of genetic architecture, in particular, for the emergence of clusters of tightly linked, slightly beneficial mutations and the evolution of recombination and chromosome inversions.
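The two LD measures named in this abstract, D = p(AB) - p(A)p(B) and r(2), can be illustrated with a minimal sketch. The function name and the example frequencies below are hypothetical and are not taken from the cited article; r(2) is computed with the standard definition D^2 / [p(A)(1 - p(A)) p(B)(1 - p(B))], which is assumed here to match the authors' usage.

```python
def linkage_disequilibrium(p_ab, p_a, p_b):
    """Return (D, r2) for two biallelic loci.

    p_ab: frequency of the AB haplotype (hypothetical input)
    p_a, p_b: frequencies of allele A and allele B
    """
    # D is the excess of the AB haplotype over its expectation under independence.
    d = p_ab - p_a * p_b
    # r2 is the squared correlation in allelic state; undefined if a locus is fixed.
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    return d, d * d / denom


# Illustrative values only: p(AB) = 0.4, p(A) = p(B) = 0.5.
d, r2 = linkage_disequilibrium(0.4, 0.5, 0.5)
print(round(d, 4), round(r2, 4))
```

With these illustrative inputs, D is about 0.15 and r(2) about 0.36, so a modest excess of the AB haplotype already corresponds to a fairly strong correlation between the loci.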

Linkage disequilibrium under recurrent bottlenecks. Genetics 2011;190:217-29. [PMID: 22048021 DOI: 10.1534/genetics.111.134437] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Indexed: 11/18/2022]

Abstract

To model deviations from selectively neutral genetic variation caused by different forms of selection, it is necessary to first understand patterns of neutral variation. Best understood is neutral genetic variation at a single locus. But, as is well known, additional insights can be gained by investigating multiple loci. The resulting patterns reflect the degree of association (linkage) between loci and provide information about the underlying multilocus gene genealogies. The statistical properties of two-locus gene genealogies have been intensively studied for populations of constant size, as well as for simple demographic histories such as exponential population growth and single bottlenecks. By contrast, the combined effect of recombination and sustained demographic fluctuations is poorly understood. Addressing this issue, we study a two-locus Wright-Fisher model of a population subject to recurrent bottlenecks. We derive coalescent approximations for the covariance of the times to the most recent common ancestor at two loci in samples of two chromosomes. This covariance reflects the degree of association and thus linkage disequilibrium between these loci. We find, first, that an effective population-size approximation describes the numerically observed association between two loci provided that recombination occurs either much faster or much more slowly than the population-size fluctuations. Second, when recombination occurs frequently between but rarely within bottlenecks, we observe that the association of gene histories becomes independent of physical distance over a certain range of distances. Third, we show that in this case, a commonly used measure of linkage disequilibrium, σ(2)(d) (closely related to r(2)), fails to capture the long-range association between two loci. The reason is that constituent terms, each reflecting the long-range association, cancel. 
Fourth, we analyze a limiting case in which the long-range association can be described in terms of a Xi coalescent allowing for simultaneous multiple mergers of ancestral lines.

Takuno S, Kado T, Sugino RP, Nakhleh L, Innan H. Population genomics in bacteria: a case study of Staphylococcus aureus. Mol Biol Evol 2011;29:797-809. [PMID: 22009061 DOI: 10.1093/molbev/msr249] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Indexed: 12/30/2022]

The joint effects of background selection and genetic recombination on local gene genealogies. Genetics 2011;189:251-66. [PMID: 21705759 DOI: 10.1534/genetics.111.130575] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Indexed: 11/18/2022]

Agudo R, Alcaide M, Rico C, Lemus JA, Blanco G, Hiraldo F, Donázar JA. Major histocompatibility complex variation in insular populations of the Egyptian vulture: inferences about the roles of genetic drift and selection. Mol Ecol 2011;20:2329-40. [PMID: 21535276 DOI: 10.1111/j.1365-294x.2011.05107.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Indexed: 01/01/2023]

McEvoy BP, Powell JE, Goddard ME, Visscher PM. Human population dispersal "Out of Africa" estimated from linkage disequilibrium and allele frequencies of SNPs. Genome Res 2011;21:821-9. [PMID: 21518737 DOI: 10.1101/gr.119636.110] [Citation(s) in RCA: 123] [Impact Index Per Article: 9.5] [Indexed: 11/25/2022]

Barton NH, Kelleher J, Etheridge AM. A new model for extinction and recolonization in two dimensions: quantifying phylogeography. Evolution 2011;64:2701-15. [PMID: 20408876 DOI: 10.1111/j.1558-5646.2010.01019.x] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Indexed: 11/28/2022]

Rose CJ, Chapman JR, Marshall SDG, Lee SF, Batterham P, Ross HA, Newcomb RD. Selective sweeps at the organophosphorus insecticide resistance locus, Rop-1, have affected variation across and beyond the α-esterase gene cluster in the Australian sheep blowfly, Lucilia cuprina. Mol Biol Evol 2011;28:1835-46. [PMID: 21228400 DOI: 10.1093/molbev/msr006] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Indexed: 12/13/2022]

Zeng K, Charlesworth B. The effects of demography and linkage on the estimation of selection and mutation parameters. Genetics 2010;186:1411-24. [PMID: 20923980 PMCID: PMC2998320 DOI: 10.1534/genetics.110.122150] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Received: 08/13/2010] [Accepted: 09/27/2010] [Indexed: 11/18/2022]

Peng B, Amos CI. Forward-time simulation of realistic samples for genome-wide association studies. BMC Bioinformatics 2010;11:442. [PMID: 20809983 PMCID: PMC2939614 DOI: 10.1186/1471-2105-11-442] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Received: 04/08/2010] [Accepted: 09/01/2010] [Indexed: 12/21/2022]

Geneva A, Garrigan D. Population genomics of secondary contact. Genes (Basel) 2010;1:124-42. [PMID: 24710014 PMCID: PMC3960861 DOI: 10.3390/genes1010124] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 05/11/2010] [Revised: 06/23/2010] [Accepted: 06/23/2010] [Indexed: 11/16/2022]

Barton NH. Estimating linkage disequilibria. Heredity (Edinb) 2010;106:205-6. [PMID: 20502479 DOI: 10.1038/hdy.2010.67] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Indexed: 11/09/2022]

Patterns of neutral genetic variation on recombining sex chromosomes. Genetics 2010;184:1141-52. [PMID: 20124026 DOI: 10.1534/genetics.109.113555] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Indexed: 01/15/2023]

Rexroad CE, Vallejo RL. Estimates of linkage disequilibrium and effective population size in rainbow trout. BMC Genet 2009;10:83. [PMID: 20003428 PMCID: PMC2800115 DOI: 10.1186/1471-2156-10-83] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Received: 09/08/2009] [Accepted: 12/14/2009] [Indexed: 12/19/2022]

Abstract

Background

The use of molecular genetic technologies for broodstock management and selective breeding of aquaculture species is becoming increasingly more common with the continued development of genome tools and reagents. Several laboratories have produced genetic maps for rainbow trout to aid in the identification of loci affecting phenotypes of interest. These maps have resulted in the identification of many quantitative/qualitative trait loci affecting phenotypic variation in traits associated with albinism, disease resistance, temperature tolerance, sex determination, embryonic development rate, spawning date, condition factor and growth. Unfortunately, the elucidation of the precise allelic variation and/or genes underlying phenotypic diversity has yet to be achieved in this species having low marker densities and lacking a whole genome reference sequence. Experimental designs which integrate segregation analyses with linkage disequilibrium (LD) approaches facilitate the discovery of genes affecting important traits. To date the extent of LD has been characterized for humans and several agriculturally important livestock species but not for rainbow trout.

Results

We observed that the level of LD between syntenic loci decayed rapidly at distances greater than 2 cM which is similar to observations of LD in other agriculturally important species including cattle, sheep, pigs and chickens. However, in some cases significant LD was also observed up to 50 cM. Our estimate of effective population size based on genome wide estimates of LD for the NCCCWA broodstock population was 145, indicating that this population will respond well to high selection intensity. However, the range of effective population size based on individual chromosomes was 75.51 - 203.35, possibly indicating that suites of genes on each chromosome are disproportionately under selection pressures.

Conclusions

Our results indicate that large numbers of markers, more than are currently available for this species, will be required to enable the use of genome-wide integrated mapping approaches aimed at identifying genes of interest in rainbow trout.

A genealogical interpretation of principal components analysis. PLoS Genet 2009;5:e1000686. [PMID: 19834557 PMCID: PMC2757795 DOI: 10.1371/journal.pgen.1000686] [Citation(s) in RCA: 334] [Impact Index Per Article: 22.3] [Received: 06/02/2009] [Accepted: 09/16/2009] [Indexed: 11/24/2022]

Abstract

Principal components analysis, PCA, is a statistical method commonly used in population genetics to identify structure in the distribution of genetic variation across geographical location and ethnic background. However, while the method is often used to inform about historical demographic processes, little is known about the relationship between fundamental demographic parameters and the projection of samples onto the primary axes. Here I show that for SNP data the projection of samples onto the principal components can be obtained directly from considering the average coalescent times between pairs of haploid genomes. The result provides a framework for interpreting PCA projections in terms of underlying processes, including migration, geographical isolation, and admixture. I also demonstrate a link between PCA and Wright's f_st and show that SNP ascertainment has a largely simple and predictable effect on the projection of samples. Using examples from human genetics, I discuss the application of these results to empirical data and the implications for inference.

Genetic variation in natural populations typically demonstrates structure arising from diverse processes including geographical isolation, founder events, migration, and admixture. One technique commonly used to uncover such structure is principal components analysis, which identifies the primary axes of variation in data and projects the samples onto these axes in a graphically appealing and intuitive manner. However, as the method is non-parametric, it can be hard to relate PCA to underlying process. Here, I show that the underlying genealogical history of the samples can be related directly to the PC projection. The result is useful because it is straightforward to predict the effects of different demographic processes on the sample genealogy. However, the result also reveals the limitations of PCA, in that multiple processes can give the same projections, it is strongly influenced by uneven sampling, and it discards important information in the spatial structure of genetic variation along chromosomes.

Rosenberg NA, Vanliere JM. Replication of genetic associations as pseudoreplication due to shared genealogy. Genet Epidemiol 2009;33:479-87. [PMID: 19191270 DOI: 10.1002/gepi.20400] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Indexed: 11/06/2022]

Correlation measures for linkage disequilibrium within and between populations. Genet Res (Camb) 2009;91:183-92. [PMID: 19589188 DOI: 10.1017/s0016672309000159] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Indexed: 11/07/2022]

Strasburg JL, Rieseberg LH. How robust are "isolation with migration" analyses to violations of the IM model? A simulation study. Mol Biol Evol 2009;27:297-310. [PMID: 19793831 DOI: 10.1093/molbev/msp233] [Citation(s) in RCA: 207] [Impact Index Per Article: 13.8] [Indexed: 12/15/2022]

Sved JA. Linkage disequilibrium and its expectation in human populations. Twin Res Hum Genet 2009;12:35-43. [PMID: 19210178 DOI: 10.1375/twin.12.1.35] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Indexed: 11/05/2022]

Depaulis F, Orlando L, Hänni C. Using classical population genetics tools with heterochroneous data: time matters! PLoS One 2009;4:e5541. [PMID: 19440242 PMCID: PMC2678253 DOI: 10.1371/journal.pone.0005541] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Received: 10/31/2008] [Accepted: 04/15/2009] [Indexed: 11/19/2022]

Abstract

BACKGROUND

New polymorphism datasets from heterochroneous data have arisen thanks to recent advances in experimental and microbial molecular evolution, and the sequencing of ancient DNA (aDNA). However, classical tools for population genetics analyses do not take into account heterochrony between subsets, despite potential bias on neutrality and population structure tests. Here, we characterize the extent of such possible biases using serial coalescent simulations.

METHODOLOGY/PRINCIPAL FINDINGS

We first use a coalescent framework to generate datasets assuming no or different levels of heterochrony and contrast most classical population genetic statistics. We show that even weak levels of heterochrony (approximately 10% of the average depth of a standard population tree) affect the distribution of polymorphism substantially, leading to overestimation of the level of polymorphism theta and to star-like trees, with an excess of rare mutations and a deficit of linkage disequilibrium, which are the hallmark of, e.g., population expansion (possibly after a drastic bottleneck). Substantial departures of the tests are detected in the opposite direction for more heterochroneous and equilibrated datasets, with balanced trees mimicking in particular population contraction, balancing selection, and population differentiation. We therefore introduce simple corrections to classical estimators of polymorphism and of the genetic distance between populations, in order to remove heterochrony-driven bias. Finally, we show that these effects do occur on real aDNA datasets, taking advantage of the currently available sequence data for Cave Bears (Ursus spelaeus), for which large mtDNA haplotypes have been reported over a substantial time period (22-130 thousand years ago (KYA)).

CONCLUSIONS/SIGNIFICANCE

Considering serial sampling changed the conclusion of several tests, indicating that neglecting heterochrony could provide significant support for false past history of populations and inappropriate conservation decisions. We therefore argue for systematically considering heterochroneous models when analyzing heterochroneous samples covering a large time scale.

Eriksson A, Mahjani B, Mehlig B. Sequential Markov coalescent algorithms for population models with demographic structure. Theor Popul Biol 2009;76:84-91. [PMID: 19433100 DOI: 10.1016/j.tpb.2009.05.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Received: 01/19/2009] [Revised: 05/04/2009] [Accepted: 05/04/2009] [Indexed: 10/24/2022]

Tooming-Klunderud A, Fewer DP, Rohrlack T, Jokela J, Rouhiainen L, Sivonen K, Kristensen T, Jakobsen KS. Evidence for positive selection acting on microcystin synthetase adenylation domains in three cyanobacterial genera. BMC Evol Biol 2008;8:256. [PMID: 18808704 PMCID: PMC2564945 DOI: 10.1186/1471-2148-8-256] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Received: 03/25/2008] [Accepted: 09/22/2008] [Indexed: 11/30/2022]

Abstract

Background

Cyanobacteria produce a wealth of secondary metabolites, including the group of small cyclic heptapeptide hepatotoxins that constitutes the microcystin family. The enzyme complex that directs the biosynthesis of microcystin is encoded in a single large gene cluster (mcy). mcy genes have a widespread distribution among cyanobacteria and are likely to have an ancient origin. The notable diversity within some of the Mcy modules is generated through various recombination events including horizontal gene transfer.

Results

A comparative analysis of the adenylation domains from the first module of McyB (McyB1) and McyC in the microcystin synthetase complex was performed on a large number of microcystin-producing strains from the Anabaena, Microcystis and Planktothrix genera. We found no decisive evidence for recombination between strains from different genera. However, we detected frequent recombination events in the mcyB and mcyC genes between strains within the same genus. Frequent interdomain recombination events were also observed between mcyB and mcyC sequences in Anabaena and Microcystis. Recombination and mutation rate ratios suggest that the diversification of mcyB and mcyC genes is driven by recombination events as well as point mutations in all three genera. Sequence analysis suggests that generally the adenylation domains of the first domain of McyB and McyC are under purifying selection. However, we found clear evidence for positive selection acting on a number of amino acid residues within these adenylation domains. These include residues important for active site selectivity of the adenylation domain, strongly suggesting selection for novel microcystin variants.

Conclusion

We provide the first clear evidence for positive selection acting on amino acid residues involved directly in the recognition and activation of amino acids incorporated into microcystin, indicating that the microcystin complement of a given strain may influence the ability of a particular strain to interact with its environment.

Jensen JD, Thornton KR, Andolfatto P. An approximate Bayesian estimator suggests strong, recurrent selective sweeps in Drosophila. PLoS Genet 2008;4:e1000198. [PMID: 18802463 PMCID: PMC2529407 DOI: 10.1371/journal.pgen.1000198] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Received: 01/24/2008] [Accepted: 08/13/2008] [Indexed: 11/18/2022] Open

The influence of gene conversion on linkage disequilibrium around a selective sweep. Genetics 2008;180:1251-9. [PMID: 18757941 DOI: 10.1534/genetics.108.092270] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Indexed: 11/18/2022] Open

Temporal and spatial dynamics of human immunodeficiency virus type 1 circulating recombinant forms 08_BC and 07_BC in Asia. J Virol 2008;82:9206-15. [PMID: 18596096 DOI: 10.1128/jvi.00399-08] [Citation(s) in RCA: 132] [Impact Index Per Article: 8.3] [Indexed: 11/20/2022] Open

Hellmann I, Mang Y, Gu Z, Li P, de la Vega FM, Clark AG, Nielsen R. Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals. Genome Res 2008;18:1020-9. [PMID: 18411405 PMCID: PMC2493391 DOI: 10.1101/gr.074187.107] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Received: 11/08/2007] [Accepted: 04/07/2008] [Indexed: 01/25/2023]

Jakobsson M, Scholz SW, Scheet P, Gibbs JR, VanLiere JM, Fung HC, Szpiech ZA, Degnan JH, Wang K, Guerreiro R, Bras JM, Schymick JC, Hernandez DG, Traynor BJ, Simon-Sanchez J, Matarin M, Britton A, van de Leemput J, Rafferty I, Bucan M, Cann HM, Hardy JA, Rosenberg NA, Singleton AB. Genotype, haplotype and copy-number variation in worldwide human populations. Nature 2008;451:998-1003. [PMID: 18288195 DOI: 10.1038/nature06742] [Citation(s) in RCA: 613] [Impact Index Per Article: 38.3] [Received: 12/02/2007] [Accepted: 01/29/2008] [Indexed: 11/09/2022]

Linkage disequilibrium under skewed offspring distribution among individuals in a population. Genetics 2008;178:1517-32. [PMID: 18245371 DOI: 10.1534/genetics.107.075200] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Indexed: 11/18/2022] Open

Tenaillon MI, Austerlitz F, Tenaillon O. Apparent mutational hotspots and long distance linkage disequilibrium resulting from a bottleneck. J Evol Biol 2008;21:541-50. [PMID: 18205779 DOI: 10.1111/j.1420-9101.2007.01490.x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Indexed: 12/31/2022]

Macpherson JM, González J, Witten DM, Davis JC, Rosenberg NA, Hirsh AE, Petrov DA. Nonadaptive explanations for signatures of partial selective sweeps in Drosophila. Mol Biol Evol 2008;25:1025-42. [PMID: 18199829 DOI: 10.1093/molbev/msn007] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Indexed: 12/22/2022] Open

Slate J, Pemberton JM. Admixture and patterns of linkage disequilibrium in a free-living vertebrate population. J Evol Biol 2007;20:1415-27. [PMID: 17584236 DOI: 10.1111/j.1420-9101.2007.01339.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Indexed: 12/20/2022]

Kamau E, Charlesworth B, Charlesworth D. Linkage disequilibrium and recombination rate estimates in the self-incompatibility region of Arabidopsis lyrata. Genetics 2007;176:2357-69. [PMID: 17565949 PMCID: PMC1950637 DOI: 10.1534/genetics.107.072231] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.4] [Received: 02/16/2007] [Accepted: 05/17/2007] [Indexed: 11/18/2022] Open

Zauner H, Mayer WE, Herrmann M, Weller A, Erwig M, Sommer RJ. Distinct patterns of genetic variation in Pristionchus pacificus and Caenorhabditis elegans, two partially selfing nematodes with cosmopolitan distribution. Mol Ecol 2007;16:1267-80. [PMID: 17391412 DOI: 10.1111/j.1365-294x.2006.03222.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Indexed: 11/29/2022]

Abstract

Hermaphroditism has evolved several times independently in nematodes. The model organism Caenorhabditis elegans and Pristionchus pacificus are self-fertile hermaphrodites with rare facultative males. Both species are members of different families: C. elegans belongs to the Rhabditidae and P. pacificus to the Diplogastridae. Also, both species differ in their ecology: C. elegans is a soil-dwelling nematode that is often found in compost heaps. In contrast, field studies in Europe and North America indicate that Pristionchus nematodes are closely associated with scarab beetles. In C. elegans, several recent studies have found low genetic diversity and rare out-crossing events. Little is known about diversity levels and population structure in free-living hermaphroditic nematodes outside the genus Caenorhabditis. Taking a comparative approach, we analyse patterns of molecular diversity and linkage disequilibrium in 18 strains of P. pacificus from eight countries and four continents. Mitochondrial sequence data of P. pacificus isolates reveal a substantially higher genetic diversity on a global scale when compared to C. elegans. A mitochondrial-derived hermaphrodite phylogeny shows little geographic structuring, indicating several worldwide dispersal events. Amplified fragment length polymorphism and single strand conformation polymorphism analyses demonstrate a high degree of genome-wide linkage disequilibrium, which also extends to the mitochondrial genome. Together, these findings indicate distinct patterns of genetic variation of the two species. The low level of genetic diversity observed in C. elegans might reflect a recent human-associated dispersal, whereas the P. pacificus diversity might reflect a long-lasting and ongoing insect association. Thus, despite similar lifestyle characteristics in the laboratory, the reproductive mode of hermaphroditism with rare facultative males can result in distinct genetic variability patterns in different ecological settings.

Thornton KR, Jensen JD, Becquet C, Andolfatto P. Progress and prospects in mapping recent selection in the genome. Heredity (Edinb) 2007;98:340-8. [PMID: 17473869 DOI: 10.1038/sj.hdy.6800967] [Citation(s) in RCA: 112] [Impact Index Per Article: 6.6] [Indexed: 11/09/2022] Open

Visscher PM. Variation of estimates of SNP and haplotype diversity and linkage disequilibrium in samples from the same population due to experimental and evolutionary sample size. Ann Hum Genet 2007;71:119-26. [PMID: 17227482 DOI: 10.1111/j.1469-1809.2006.00305.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/29/2022]

Tenesa A, Navarro P, Hayes BJ, Duffy DL, Clarke GM, Goddard ME, Visscher PM. Recent human effective population size estimated from linkage disequilibrium. Genome Res 2007;17:520-6. [PMID: 17351134 PMCID: PMC1832099 DOI: 10.1101/gr.6023607] [Citation(s) in RCA: 291] [Impact Index Per Article: 17.1] [Indexed: 11/24/2022]

McVean G. The structure of linkage disequilibrium around a selective sweep. Genetics 2006;175:1395-406. [PMID: 17194788 PMCID: PMC1840056 DOI: 10.1534/genetics.106.062828] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.8] [Indexed: 11/18/2022] Open

Minichiello MJ, Durbin R. Mapping trait loci by use of inferred ancestral recombination graphs. Am J Hum Genet 2006;79:910-22. [PMID: 17033967 PMCID: PMC1698562 DOI: 10.1086/508901] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.4] [Received: 03/31/2006] [Accepted: 09/01/2006] [Indexed: 12/26/2022] Open

Fraser DJ, Jones MW, McParland TL, Hutchings JA. Loss of historical immigration and the unsuccessful rehabilitation of extirpated salmon populations. Conserv Genet 2006. [DOI: 10.1007/s10592-006-9188-8] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Indexed: 11/29/2022]

Song YS, Song JS. Analytic computation of the expectation of the linkage disequilibrium coefficient r². Theor Popul Biol 2006;71:49-60. [PMID: 17069867 DOI: 10.1016/j.tpb.2006.09.001] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Received: 02/14/2006] [Revised: 06/25/2006] [Accepted: 09/13/2006] [Indexed: 11/19/2022]

Pennings PS, Hermisson J. Soft sweeps III: the signature of positive selection from recurrent mutation. PLoS Genet 2006;2:e186. [PMID: 17173482 PMCID: PMC1698945 DOI: 10.1371/journal.pgen.0020186] [Citation(s) in RCA: 200] [Impact Index Per Article: 11.1] [Received: 05/19/2006] [Accepted: 09/14/2006] [Indexed: 11/18/2022] Open

Abstract

Polymorphism data can be used to identify loci at which a beneficial allele has recently gone to fixation, given that an accurate description of the signature of selection is available. In the classical model that is used, a favored allele derives from a single mutational origin. This ignores the fact that beneficial alleles can enter a population recurrently by mutation during the selective phase. In this study, we present a combination of analytical and simulation results to demonstrate the effect of adaptation from recurrent mutation on summary statistics for polymorphism data from a linked neutral locus. We also analyze the power of standard neutrality tests based on the frequency spectrum or on linkage disequilibrium (LD) under this scenario. For recurrent beneficial mutation at biologically realistic rates, we find substantial deviations from the classical pattern of a selective sweep from a single new mutation. Deviations from neutrality in the level of polymorphism and in the frequency spectrum are much less pronounced than in the classical sweep pattern. In contrast, for levels of LD, the signature is even stronger if recurrent beneficial mutation plays a role. We suggest a variant of existing LD tests that increases their power to detect this signature.

Populations adapt to their environment through fixation of beneficial alleles. Such fixation events leave a signature in neutral DNA variation of the population. An accurate description of this signature, also called a selective sweep, can be used to identify genes that have been involved in recent adaptations. The classical model of a selective sweep assumes that the beneficial allele was created only once by mutation, whereas the authors have shown, in a previous paper, that this assumption does not always hold. If a substitution involves multiple copies of an allele that have originated by independent mutation, it leads to a different signature, which the authors call a soft selective sweep. In this study, Pennings and Hermisson use analytical tools and coalescent simulations to describe this soft-sweep pattern. They show that this pattern is characterized by strong linkage disequilibrium. They also analyze the power of standard tests of neutrality to detect this pattern and suggest a variant of existing linkage-disequilibrium–based tests that increases the power to detect positive selection in the form of a soft selective sweep.

Ruderfer DM, Pratt SC, Seidel HS, Kruglyak L. Population genomic analysis of outcrossing and recombination in yeast. Nat Genet 2006;38:1077-81. [PMID: 16892060 DOI: 10.1038/ng1859] [Citation(s) in RCA: 168] [Impact Index Per Article: 9.3] [Received: 05/03/2006] [Accepted: 07/10/2006] [Indexed: 11/09/2022]