Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Gazave E, Ma L, Chang D, Coventry A, Gao F, Muzny D, Boerwinkle E, Gibbs RA, Sing CF, Clark AG, Keinan A. Neutral genomic regions refine models of recent rapid human population growth. Proc Natl Acad Sci U S A 2014;111:757-62. [PMID: 24379384 PMCID: PMC3896169 DOI: 10.1073/pnas.1310398110] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

For:	Gazave E, Ma L, Chang D, Coventry A, Gao F, Muzny D, Boerwinkle E, Gibbs RA, Sing CF, Clark AG, Keinan A. Neutral genomic regions refine models of recent rapid human population growth. Proc Natl Acad Sci U S A 2014;111:757-62. [PMID: 24379384 PMCID: PMC3896169 DOI: 10.1073/pnas.1310398110] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Number

Cited by Other Article(s)

Fan WTL, Wakeley J. Latent mutations in the ancestries of alleles under selection. Theor Popul Biol 2024;158:1-20. [PMID: 38697365 DOI: 10.1016/j.tpb.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 04/23/2024] [Accepted: 04/29/2024] [Indexed: 05/05/2024]

Schraiber JG, Edge MD, Pennell M. Unifying approaches from statistical genetics and phylogenetics for mapping phenotypes in structured populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.10.579721. [PMID: 38496530 PMCID: PMC10942266 DOI: 10.1101/2024.02.10.579721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]

Abstract

In both statistical genetics and phylogenetics, a major goal is to identify correlations between genetic loci or other aspects of the phenotype or environment and a focal trait. In these two fields, there are sophisticated but disparate statistical traditions aimed at these tasks. The disconnect between their respective approaches is becoming untenable as questions in medicine, conservation biology, and evolutionary biology increasingly rely on integrating data from within and among species, and once-clear conceptual divisions are becoming increasingly blurred. To help bridge this divide, we derive a general model describing the covariance between the genetic contributions to the quantitative phenotypes of different individuals. Taking this approach shows that standard models in both statistical genetics (e.g., Genome-Wide Association Studies; GWAS) and phylogenetic comparative biology (e.g., phylogenetic regression) can be interpreted as special cases of this more general quantitative-genetic model. The fact that these models share the same core architecture means that we can build a unified understanding of the strengths and limitations of different methods for controlling for genetic structure when testing for associations. We develop intuition for why and when spurious correlations may occur using analytical theory and conduct population-genetic and phylogenetic simulations of quantitative traits. The structural similarity of problems in statistical genetics and phylogenetics enables us to take methodological advances from one field and apply them in the other. We demonstrate this by showing how a standard GWAS technique-including both the genetic relatedness matrix (GRM) as well as its leading eigenvectors, corresponding to the principal components of the genotype matrix, in a regression model-can mitigate spurious correlations in phylogenetic analyses. As a case study of this, we re-examine an analysis testing for co-evolution of expression levels between genes across a fungal phylogeny, and show that including covariance matrix eigenvectors as covariates decreases the false positive rate while simultaneously increasing the true positive rate. More generally, this work provides a foundation for more integrative approaches for understanding the genetic architecture of phenotypes and how evolutionary processes shape it.

Collapse

Zurita AMI, Kyriazis CC, Lohmueller KE. The impact of non-neutral synonymous mutations when inferring selection on non-synonymous mutations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.07.579314. [PMID: 38370782 PMCID: PMC10871344 DOI: 10.1101/2024.02.07.579314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]

Mah JC, Lohmueller KE, Garud N. Inference of the demographic histories and selective effects of human gut commensal microbiota over the course of human history. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.09.566454. [PMID: 38014007 PMCID: PMC10680615 DOI: 10.1101/2023.11.09.566454] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]

Antinucci M, Comas D, Calafell F. Population history modulates the fitness effects of Copy Number Variation in the Roma. Hum Genet 2023;142:1327-1343. [PMID: 37311904 PMCID: PMC10449987 DOI: 10.1007/s00439-023-02579-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 06/02/2023] [Indexed: 06/15/2023]

Wakeley J, Fan WT(L, Koch E, Sunyaev S. Recurrent mutation in the ancestry of a rare variant. Genetics 2023;224:iyad049. [PMID: 36967220 PMCID: PMC10324944 DOI: 10.1093/genetics/iyad049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 01/30/2023] [Accepted: 03/08/2023] [Indexed: 03/28/2023] Open

Townsend C, Ferraro JV, Habecker H, Flinn MV. Human cooperation and evolutionary transitions in individuality. Philos Trans R Soc Lond B Biol Sci 2023;378:20210414. [PMID: 36688393 PMCID: PMC9869453 DOI: 10.1098/rstb.2021.0414] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open

Wehbi SS, Zu Dohna H. A comparative analysis of L1 retrotransposition activities in human genomes suggests an ongoing increase in L1 number despite an evolutionary trend towards lower activity. Mob DNA 2021;12:26. [PMID: 34782009 PMCID: PMC8594186 DOI: 10.1186/s13100-021-00255-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Accepted: 10/26/2021] [Indexed: 11/18/2022] Open

Helmstetter AJ, Cable S, Rakotonasolo F, Rabarijaona R, Rakotoarinivo M, Eiserhardt WL, Baker WJ, Papadopulos AST. The demographic history of Madagascan micro-endemics: have rare species always been rare? Proc Biol Sci 2021;288:20210957. [PMID: 34547905 PMCID: PMC8456134 DOI: 10.1098/rspb.2021.0957] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 08/25/2021] [Indexed: 01/25/2023] Open

Yazar M, Özbek P. In Silico Tools and Approaches for the Prediction of Functional and Structural Effects of Single-Nucleotide Polymorphisms on Proteins: An Expert Review. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2020;25:23-37. [PMID: 33058752 DOI: 10.1089/omi.2020.0141] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Valencia-Montoya WA, Elfekih S, North HL, Meier JI, Warren IA, Tay WT, Gordon KHJ, Specht A, Paula-Moraes SV, Rane R, Walsh TK, Jiggins CD. Adaptive Introgression across Semipermeable Species Boundaries between Local Helicoverpa zea and Invasive Helicoverpa armigera Moths. Mol Biol Evol 2020;37:2568-2583. [PMID: 32348505 PMCID: PMC7475041 DOI: 10.1093/molbev/msaa108] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open

The exhaustive genomic scan approach, with an application to rare-variant association analysis. Eur J Hum Genet 2020;28:1283-1291. [PMID: 32415273 PMCID: PMC7608423 DOI: 10.1038/s41431-020-0639-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 02/28/2020] [Accepted: 04/07/2020] [Indexed: 12/12/2022] Open

Chen H. A Computational Approach for Modeling the Allele Frequency Spectrum of Populations with Arbitrarily Varying Size. GENOMICS PROTEOMICS & BIOINFORMATICS 2020;17:635-644. [PMID: 32173599 PMCID: PMC7212486 DOI: 10.1016/j.gpb.2019.06.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 06/04/2019] [Accepted: 08/02/2019] [Indexed: 11/25/2022]

Jay F, Boitard S, Austerlitz F. An ABC Method for Whole-Genome Sequence Data: Inferring Paleolithic and Neolithic Human Expansions. Mol Biol Evol 2020;36:1565-1579. [PMID: 30785202 DOI: 10.1093/molbev/msz038] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Kamm J, Terhorst J, Durbin R, Song YS. Efficiently inferring the demographic history of many populations with allele count data. J Am Stat Assoc 2019;115:1472-1487. [PMID: 33012903 PMCID: PMC7531012 DOI: 10.1080/01621459.2019.1635482] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2018] [Revised: 04/14/2019] [Accepted: 06/08/2019] [Indexed: 01/06/2023]

Flagel L, Brandvain Y, Schrider DR. The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference. Mol Biol Evol 2019;36:220-238. [PMID: 30517664 PMCID: PMC6367976 DOI: 10.1093/molbev/msy224] [Citation(s) in RCA: 95] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open

Tournebize R, Poncet V, Jakobsson M, Vigouroux Y, Manel S. McSwan: A joint site frequency spectrum method to detect and date selective sweeps across multiple population genomes. Mol Ecol Resour 2018;19:283-295. [PMID: 30358170 DOI: 10.1111/1755-0998.12957] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2018] [Revised: 10/17/2018] [Accepted: 10/18/2018] [Indexed: 01/01/2023]

Beichman AC, Huerta-Sanchez E, Lohmueller KE. Using Genomic Data to Infer Historic Population Dynamics of Nonmodel Organisms. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2018. [DOI: 10.1146/annurev-ecolsys-110617-062431] [Citation(s) in RCA: 89] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Ragsdale AP, Moreau C, Gravel S. Genomic inference using diffusion models and the allele frequency spectrum. Curr Opin Genet Dev 2018;53:140-147. [PMID: 30366252 DOI: 10.1016/j.gde.2018.10.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2018] [Revised: 09/14/2018] [Accepted: 10/07/2018] [Indexed: 01/25/2023]

Reppell M, Zöllner S. An efficient algorithm for generating the internal branches of a Kingman coalescent. Theor Popul Biol 2018;122:57-66. [PMID: 28709926 PMCID: PMC5764821 DOI: 10.1016/j.tpb.2017.05.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Revised: 05/19/2017] [Accepted: 05/26/2017] [Indexed: 01/16/2023]

Schrider DR, Ayroles J, Matute DR, Kern AD. Supervised machine learning reveals introgressed loci in the genomes of Drosophila simulans and D. sechellia. PLoS Genet 2018;14:e1007341. [PMID: 29684059 PMCID: PMC5933812 DOI: 10.1371/journal.pgen.1007341] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2017] [Revised: 05/03/2018] [Accepted: 03/28/2018] [Indexed: 12/30/2022] Open

Abstract

Hybridization and gene flow between species appears to be common. Even though it is clear that hybridization is widespread across all surveyed taxonomic groups, the magnitude and consequences of introgression are still largely unknown. Thus it is crucial to develop the statistical machinery required to uncover which genomic regions have recently acquired haplotypes via introgression from a sister population. We developed a novel machine learning framework, called FILET (Finding Introgressed Loci via Extra-Trees) capable of revealing genomic introgression with far greater power than competing methods. FILET works by combining information from a number of population genetic summary statistics, including several new statistics that we introduce, that capture patterns of variation across two populations. We show that FILET is able to identify loci that have experienced gene flow between related species with high accuracy, and in most situations can correctly infer which population was the donor and which was the recipient. Here we describe a data set of outbred diploid Drosophila sechellia genomes, and combine them with data from D. simulans to examine recent introgression between these species using FILET. Although we find that these populations may have split more recently than previously appreciated, FILET confirms that there has indeed been appreciable recent introgression (some of which might have been adaptive) between these species, and reveals that this gene flow is primarily in the direction of D. simulans to D. sechellia.

Understanding the extent to which species or diverged populations hybridize in nature is crucially important if we are to understand the speciation process. Accordingly numerous research groups have developed methodology for finding the genetic evidence of such introgression. In this report we develop a supervised machine learning approach for uncovering loci which have introgressed across species boundaries. We show that our method, FILET, has greater accuracy and power than competing methods in discovering introgression, and in addition can detect the directionality associated with the gene flow between species. Using whole genome sequences from Drosophila simulans and Drosophila sechellia we show that FILET discovers quite extensive introgression between these species that has occurred mostly from D. simulans to D. sechellia. Our work highlights the complex process of speciation even within a well-studied system and points to the growing importance of supervised machine learning in population genetics.

Collapse

Browning SR, Browning BL, Zhou Y, Tucci S, Akey JM. Analysis of Human Sequence Data Reveals Two Pulses of Archaic Denisovan Admixture. Cell 2018;173:53-61.e9. [PMID: 29551270 PMCID: PMC5866234 DOI: 10.1016/j.cell.2018.02.031] [Citation(s) in RCA: 175] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Revised: 11/21/2017] [Accepted: 02/12/2018] [Indexed: 01/27/2023]

Population genomic analysis of elongated skulls reveals extensive female-biased immigration in Early Medieval Bavaria. Proc Natl Acad Sci U S A 2018. [PMID: 29531040 PMCID: PMC5879695 DOI: 10.1073/pnas.1719880115] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open

Abstract

Many modern European states trace their roots back to a period known as the Migration Period that spans from Late Antiquity to the early Middle Ages. We have conducted the first population-level analysis of people from this era, generating genomic data from 41 graves from archaeological sites in present-day Bavaria in southern Germany mostly dating to around 500 AD. While they are predominantly of northern/central European ancestry, we also find significant evidence for a nonlocal genetic provenance that is highly enriched among resident Early Medieval women, demonstrating artificial skull deformation. We infer that the most likely origin of the majority of these women was southeastern Europe, resolving a debate that has lasted for more than half a century.

Modern European genetic structure demonstrates strong correlations with geography, while genetic analysis of prehistoric humans has indicated at least two major waves of immigration from outside the continent during periods of cultural change. However, population-level genome data that could shed light on the demographic processes occurring during the intervening periods have been absent. Therefore, we generated genomic data from 41 individuals dating mostly to the late 5th/early 6th century AD from present-day Bavaria in southern Germany, including 11 whole genomes (mean depth 5.56×). In addition we developed a capture array to sequence neutral regions spanning a total of 5 Mb and 486 functional polymorphic sites to high depth (mean 72×) in all individuals. Our data indicate that while men generally had ancestry that closely resembles modern northern and central Europeans, women exhibit a very high genetic heterogeneity; this includes signals of genetic ancestry ranging from western Europe to East Asia. Particularly striking are women with artificial skull deformations; the analysis of their collective genetic ancestry suggests an origin in southeastern Europe. In addition, functional variants indicate that they also differed in visible characteristics. This example of female-biased migration indicates that complex demographic processes during the Early Medieval period may have contributed in an unexpected way to shape the modern European genetic landscape. Examination of the panel of functional loci also revealed that many alleles associated with recent positive selection were already at modern-like frequencies in European populations ∼1,500 years ago.

Collapse

Baharian S, Gravel S. On the decidability of population size histories from finite allele frequency spectra. Theor Popul Biol 2018;120:42-51. [PMID: 29305873 DOI: 10.1016/j.tpb.2017.12.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Revised: 12/15/2017] [Accepted: 12/20/2017] [Indexed: 10/18/2022]

Comparison of Single Genome and Allele Frequency Data Reveals Discordant Demographic Histories. G3-GENES GENOMES GENETICS 2017;7:3605-3620. [PMID: 28893846 PMCID: PMC5677151 DOI: 10.1534/g3.117.300259] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]

Amorim CEG, Gao Z, Baker Z, Diesel JF, Simons YB, Haque IS, Pickrell J, Przeworski M. The population genetics of human disease: The case of recessive, lethal mutations. PLoS Genet 2017;13:e1006915. [PMID: 28957316 PMCID: PMC5619689 DOI: 10.1371/journal.pgen.1006915] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2016] [Accepted: 07/09/2017] [Indexed: 01/08/2023] Open

Abstract

Do the frequencies of disease mutations in human populations reflect a simple balance between mutation and purifying selection? What other factors shape the prevalence of disease mutations? To begin to answer these questions, we focused on one of the simplest cases: recessive mutations that alone cause lethal diseases or complete sterility. To this end, we generated a hand-curated set of 417 Mendelian mutations in 32 genes reported to cause a recessive, lethal Mendelian disease. We then considered analytic models of mutation-selection balance in infinite and finite populations of constant sizes and simulations of purifying selection in a more realistic demographic setting, and tested how well these models fit allele frequencies estimated from 33,370 individuals of European ancestry. In doing so, we distinguished between CpG transitions, which occur at a substantially elevated rate, and three other mutation types. Intriguingly, the observed frequency for CpG transitions is slightly higher than expectation but close, whereas the frequencies observed for the three other mutation types are an order of magnitude higher than expected, with a bigger deviation from expectation seen for less mutable types. This discrepancy is even larger when subtle fitness effects in heterozygotes or lethal compound heterozygotes are taken into account. In principle, higher than expected frequencies of disease mutations could be due to widespread errors in reporting causal variants, compensation by other mutations, or balancing selection. It is unclear why these factors would have a greater impact on disease mutations that occur at lower rates, however. We argue instead that the unexpectedly high frequency of disease mutations and the relationship to the mutation rate likely reflect an ascertainment bias: of all the mutations that cause recessive lethal diseases, those that by chance have reached higher frequencies are more likely to have been identified and thus to have been included in this study. Beyond the specific application, this study highlights the parameters likely to be important in shaping the frequencies of Mendelian disease alleles.

What determines the frequencies of disease mutations in human populations? To begin to answer this question, we focus on one of the simplest cases: mutations that cause completely recessive, lethal Mendelian diseases. We first review theory about what to expect from mutation and selection in a population of finite size and generate predictions based on simulations using a plausible demographic scenario of recent human evolution. For a highly mutable type of mutation, transitions at CpG sites, we find that the predictions are close to the observed frequencies of recessive lethal disease mutations. For less mutable types, however, predictions substantially under-estimate the observed frequency. We discuss possible explanations for the discrepancy and point to a complication that, to our knowledge, is not widely appreciated: that there exists ascertainment bias in disease mutation discovery. Specifically, we suggest that alleles that have been identified to date are likely the ones that by chance have reached higher frequencies and are thus more likely to have been mapped. More generally, our study highlights the factors that influence the frequencies of Mendelian disease alleles.

Collapse

Mostafavi H, Berisa T, Day FR, Perry JRB, Przeworski M, Pickrell JK. Identifying genetic variants that affect viability in large cohorts. PLoS Biol 2017;15:e2002458. [PMID: 28873088 PMCID: PMC5584811 DOI: 10.1371/journal.pbio.2002458] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 08/03/2017] [Indexed: 12/20/2022] Open

Abstract

A number of open questions in human evolutionary genetics would become tractable if we were able to directly measure evolutionary fitness. As a step towards this goal, we developed a method to examine whether individual genetic variants, or sets of genetic variants, currently influence viability. The approach consists in testing whether the frequency of an allele varies across ages, accounting for variation in ancestry. We applied it to the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort and to the parents of participants in the UK Biobank. Across the genome, we found only a few common variants with large effects on age-specific mortality: tagging the APOE ε4 allele and near CHRNA3. These results suggest that when large, even late-onset effects are kept at low frequency by purifying selection. Testing viability effects of sets of genetic variants that jointly influence 1 of 42 traits, we detected a number of strong signals. In participants of the UK Biobank of British ancestry, we found that variants that delay puberty timing are associated with a longer parental life span (P~6.2 × 10⁻⁶ for fathers and P~2.0 × 10⁻³ for mothers), consistent with epidemiological studies. Similarly, variants associated with later age at first birth are associated with a longer maternal life span (P~1.4 × 10⁻³). Signals are also observed for variants influencing cholesterol levels, risk of coronary artery disease (CAD), body mass index, as well as risk of asthma. These signals exhibit consistent effects in the GERA cohort and among participants of the UK Biobank of non-British ancestry. We also found marked differences between males and females, most notably at the CHRNA3 locus, and variants associated with risk of CAD and cholesterol levels. Beyond our findings, the analysis serves as a proof of principle for how upcoming biomedical data sets can be used to learn about selection effects in contemporary humans.

Our global understanding of adaptation in humans is limited to indirect statistical inferences from patterns of genetic variation, which are sensitive to past selection pressures. We introduced a method that allowed us to directly observe ongoing selection in humans by identifying genetic variants that affect survival to a given age (i.e., viability selection). We applied our approach to the GERA cohort and parents of the UK Biobank participants. We found viability effects of variants near the APOE and CHRNA3 genes, which are associated with the risk of Alzheimer disease and smoking behavior, respectively. We also tested for the joint effect of sets of genetic variants that influence quantitative traits. We uncovered an association between longer life span and genetic variants that delay puberty timing and age at first birth. We also detected detrimental effects of higher genetically predicted cholesterol levels, body mass index, risk of coronary artery disease (CAD), and risk of asthma on survival. Some of the observed effects differ between males and females, most notably those at the CHRNA3 gene and variants associated with risk of CAD and cholesterol levels. Beyond this application, our analysis shows how large biomedical data sets can be used to study natural selection in humans.

Collapse

Schrider DR, Kern AD. Soft Sweeps Are the Dominant Mode of Adaptation in the Human Genome. Mol Biol Evol 2017;34:1863-1877. [PMID: 28482049 PMCID: PMC5850737 DOI: 10.1093/molbev/msx154] [Citation(s) in RCA: 102] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Dietary adaptation of FADS genes in Europe varied across time and geography. Nat Ecol Evol 2017;1:167. [PMID: 29094686 PMCID: PMC5672832 DOI: 10.1038/s41559-017-0167] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2016] [Accepted: 04/18/2017] [Indexed: 11/08/2022]

Accuracy of Demographic Inferences from the Site Frequency Spectrum: The Case of the Yoruba Population. Genetics 2017;206:439-449. [PMID: 28341655 DOI: 10.1534/genetics.116.192708] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 03/23/2017] [Indexed: 01/23/2023] Open

Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples. Genetics 2017;206:345-361. [PMID: 28249985 PMCID: PMC5419480 DOI: 10.1534/genetics.116.197145] [Citation(s) in RCA: 115] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2016] [Accepted: 02/14/2017] [Indexed: 12/23/2022] Open

Kamm JA, Terhorst J, Song YS. Efficient computation of the joint sample frequency spectra for multiple populations. J Comput Graph Stat 2017;26:182-194. [PMID: 28239248 DOI: 10.1080/10618600.2016.1159212] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Bagley RK, Sousa VC, Niemiller ML, Linnen CR. History, geography and host use shape genomewide patterns of genetic variation in the redheaded pine sawfly ( Neodiprion lecontei ). Mol Ecol 2017;26:1022-1044. [DOI: 10.1111/mec.13972] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Revised: 11/10/2016] [Accepted: 12/01/2016] [Indexed: 01/03/2023]

A Model of Compound Heterozygous, Loss-of-Function Alleles Is Broadly Consistent with Observations from Complex-Disease GWAS Datasets. PLoS Genet 2017;13:e1006573. [PMID: 28103232 PMCID: PMC5289629 DOI: 10.1371/journal.pgen.1006573] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Revised: 02/02/2017] [Accepted: 01/05/2017] [Indexed: 12/17/2022] Open

Abstract

The genetic component of complex disease risk in humans remains largely unexplained. A corollary is that the allelic spectrum of genetic variants contributing to complex disease risk is unknown. Theoretical models that relate population genetic processes to the maintenance of genetic variation for quantitative traits may suggest profitable avenues for future experimental design. Here we use forward simulation to model a genomic region evolving under a balance between recurrent deleterious mutation and Gaussian stabilizing selection. We consider multiple genetic and demographic models, and several different methods for identifying genomic regions harboring variants associated with complex disease risk. We demonstrate that the model of gene action, relating genotype to phenotype, has a qualitative effect on several relevant aspects of the population genetic architecture of a complex trait. In particular, the genetic model impacts genetic variance component partitioning across the allele frequency spectrum and the power of statistical tests. Models with partial recessivity closely match the minor allele frequency distribution of significant hits from empirical genome-wide association studies without requiring homozygous effect sizes to be small. We highlight a particular gene-based model of incomplete recessivity that is appealing from first principles. Under that model, deleterious mutations in a genomic region partially fail to complement one another. This model of gene-based recessivity predicts the empirically observed inconsistency between twin and SNP based estimated of dominance heritability. Furthermore, this model predicts considerable levels of unexplained variance associated with intralocus epistasis. Our results suggest a need for improved statistical tools for region based genetic association and heritability estimation.

Gene action determines how mutations affect phenotype. When placed in an evolutionary context, the details of the genotype-to-phenotype model can impact the maintenance of genetic variation for complex traits. Likewise, non-equilibrium demographic history may affect patterns of genetic variation. Here, we explore the impact of genetic model and population growth on distribution of genetic variance across the allele frequency spectrum underlying risk for a complex disease. Using forward-in-time population genetic simulations, we show that the genetic model has important impacts on the composition of variation for complex disease risk in a population. We explicitly simulate genome-wide association studies (GWAS) and perform heritability estimation on population samples. A particular model of gene-based partial recessivity, based on allelic non-complementation, aligns well with empirical results. This model is congruent with the dominance variance estimates from both SNPs and twins, and the minor allele frequency distribution of GWAS hits.

Collapse

Harpak A, Bhaskar A, Pritchard JK. Mutation Rate Variation is a Primary Determinant of the Distribution of Allele Frequencies in Humans. PLoS Genet 2016;12:e1006489. [PMID: 27977673 PMCID: PMC5157949 DOI: 10.1371/journal.pgen.1006489] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Accepted: 11/16/2016] [Indexed: 01/06/2023] Open

Gao F, Keinan A. Explosive genetic evidence for explosive human population growth. Curr Opin Genet Dev 2016;41:130-139. [PMID: 27710906 PMCID: PMC5161661 DOI: 10.1016/j.gde.2016.09.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2016] [Revised: 08/26/2016] [Accepted: 09/11/2016] [Indexed: 11/19/2022]

Field Y, Boyle EA, Telis N, Gao Z, Gaulton KJ, Golan D, Yengo L, Rocheleau G, Froguel P, McCarthy MI, Pritchard JK. Detection of human adaptation during the past 2000 years. Science 2016;354:760-764. [PMID: 27738015 PMCID: PMC5182071 DOI: 10.1126/science.aag0776] [Citation(s) in RCA: 246] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Accepted: 10/03/2016] [Indexed: 12/22/2022]

Schrider DR, Shanku AG, Kern AD. Effects of Linked Selective Sweeps on Demographic Inference and Model Selection. Genetics 2016;204:1207-1223. [PMID: 27605051 PMCID: PMC5105852 DOI: 10.1534/genetics.116.190223] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Accepted: 09/02/2016] [Indexed: 01/06/2023] Open

Mathias RA, Taub MA, Gignoux CR, Fu W, Musharoff S, O'Connor TD, Vergara C, Torgerson DG, Pino-Yanes M, Shringarpure SS, Huang L, Rafaels N, Boorgula MP, Johnston HR, Ortega VE, Levin AM, Song W, Torres R, Padhukasahasram B, Eng C, Mejia-Mejia DA, Ferguson T, Qin ZS, Scott AF, Yazdanbakhsh M, Wilson JG, Marrugo J, Lange LA, Kumar R, Avila PC, Williams LK, Watson H, Ware LB, Olopade C, Olopade O, Oliveira R, Ober C, Nicolae DL, Meyers D, Mayorga A, Knight-Madden J, Hartert T, Hansel NN, Foreman MG, Ford JG, Faruque MU, Dunston GM, Caraballo L, Burchard EG, Bleecker E, Araujo MI, Herrera-Paz EF, Gietzen K, Grus WE, Bamshad M, Bustamante CD, Kenny EE, Hernandez RD, Beaty TH, Ruczinski I, Akey J, Barnes KC. A continuum of admixture in the Western Hemisphere revealed by the African Diaspora genome. Nat Commun 2016;7:12522. [PMID: 27725671 PMCID: PMC5062574 DOI: 10.1038/ncomms12522] [Citation(s) in RCA: 102] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2016] [Accepted: 07/12/2016] [Indexed: 01/20/2023] Open

Affiliation(s)

Rasika Ann Mathias Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA Department of Epidemiology, Bloomberg School of Public Health, JHU, Baltimore, Maryland 21205, USA
Margaret A. Taub Department of Biostatistics, Bloomberg School of Public Health, JHU, Baltimore, Maryland 21205, USA
Christopher R. Gignoux Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
Wenqing Fu Department of Genomic Sciences, University of Washington, Seattle, Washington 98195, USA
Shaila Musharoff Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
Timothy D. O'Connor Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
Candelaria Vergara Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA
Dara G. Torgerson Department of Medicine, University of California, San Francisco, San Francisco, California 94143, USA
Maria Pino-Yanes Department of Medicine, University of California, San Francisco, San Francisco, California 94143, USA CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid 28029, Spain
Suyash S. Shringarpure Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
Lili Huang Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA
Nicholas Rafaels Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA
Meher Preethi Boorgula Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA
Henry Richard Johnston Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia 30322, USA
Victor E. Ortega Center for Human Genomics and Personalized Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina 27157, USA
Albert M. Levin Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan 48202, USA
Wei Song Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA Program in Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA Department of Medicine, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA
Raul Torres Biomedical Sciences Graduate Program, University of California, San Francisco, San Francisco, California 94158, USA
Badri Padhukasahasram Center for Health Policy and Health Services Research, Henry Ford Health System, Detroit, Michigan 48202, USA
Celeste Eng Department of Medicine, University of California, San Francisco, San Francisco, California 94143, USA
Delmy-Aracely Mejia-Mejia Centro de Neumologia y Alergias, San Pedro Sula 21102, Honduras Faculty of Medicine, Centro Medico de la Familia, San Pedro Sula 21102, Honduras
Trevor Ferguson Tropical Medicine Research Institute, The University of the West Indies, St. Michael BB11115, Barbados
Zhaohui S. Qin Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia 30322, USA
Alan F. Scott Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA
Maria Yazdanbakhsh Department of Parasitology, Leiden University Medical Center, Leiden 2333ZA, The Netherlands
James G. Wilson Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, Mississippi 39216, USA
Javier Marrugo Instituto de Investigaciones Immunologicas, Universidad de Cartagena, Cartagena 130000, Colombia
Leslie A. Lange Department of Genetics, University of North Carolina, Chapel Hill, North Carolina 27599, USA
Rajesh Kumar Department of Pediatrics, Northwestern University, Chicago, Illinois 60637, USA The Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, Illinois 60637, USA
Pedro C. Avila Department of Medicine, Northwestern University, Chicago, Illinois 60637, USA
L. Keoki Williams Center for Health Policy and Health Services Research, Henry Ford Health System, Detroit, Michigan 48202, USA Department of Internal Medicine, Henry Ford Health System, Detroit, Michigan 48202, USA
Harold Watson Faculty of Medical Sciences Cave Hill Campus, The University of the West Indies, Bridgetown BB11000, Barbados Queen Elizabeth Hospital, The University of the West Indies, St. Michael BB11115, Barbados
Lorraine B. Ware Department of Medicine, Vanderbilt University, Nashville, Tennessee 37232, USA Department of Pathology, Microbiology and Immunology, Vanderbilt University, Nashville, Tennessee 37232, USA
Christopher Olopade Department of Medicine and Center for Global Health, University of Chicago, Chicago, Illinois 60637, USA
Olufunmilayo Olopade Department of Medicine, University of Chicago, Chicago, Illinois 60637, USA
Ricardo Oliveira Laboratório de Patologia Experimental, Centro de Pesquisas Gonçalo Moniz, Salvador 40296-710, Brazil
Carole Ober Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
Dan L. Nicolae Department of Medicine, University of Chicago, Chicago, Illinois 60637, USA Department of Statistics, University of Chicago, Chicago, Illinois 60637, USA
Deborah Meyers Center for Human Genomics and Personalized Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina 27157, USA
Alvaro Mayorga Centro de Neumologia y Alergias, San Pedro Sula 21102, Honduras
Jennifer Knight-Madden Tropical Medicine Research Institute, The University of the West Indies, St. Michael BB11115, Barbados
Tina Hartert Department of Medicine, Vanderbilt University, Nashville, Tennessee 37232, USA
Nadia N. Hansel Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA
Marilyn G. Foreman Pulmonary and Critical Care Medicine, Morehouse School of Medicine, Atlanta, Georgia 30310, USA
Jean G. Ford Department of Epidemiology, Bloomberg School of Public Health, JHU, Baltimore, Maryland 21205, USA Department of Medicine, The Brooklyn Hospital Center, Brooklyn, New York 11201, USA
Mezbah U. Faruque National Human Genome Center, Howard University College of Medicine, Washington DC 20059, USA
Georgia M. Dunston National Human Genome Center, Howard University College of Medicine, Washington DC 20059, USA Department of Microbiology, Howard University College of Medicine, Washington DC 20059, USA
Luis Caraballo Institute for Immunological Research, Universidad de Cartagena, Cartagena 130000, Colombia
Esteban G. Burchard Department of Medicine, University of California, San Francisco, San Francisco, California 94143, USA Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94158, USA
Eugene Bleecker Center for Human Genomics and Personalized Medicine, Wake Forest School of Medicine, Winston-Salem, North Carolina 27157, USA
Maria Ilma Araujo Immunology Service, Universidade Federal da Bahia, Salvador 401110170, Brazil
Edwin Francisco Herrera-Paz Centro de Neumologia y Alergias, San Pedro Sula 21102, Honduras Faculty of Medicine, Centro Medico de la Familia, San Pedro Sula 21102, Honduras Facultad de Medicina, Universidad Catolica de Honduras, San Pedro Sula 21102, Honduras
Kimberly Gietzen Illumina, Inc., San Diego, California 92122, USA
Wendy E. Grus Knome Inc., Cambridge, Massachusetts 02141, USA
Michael Bamshad Department of Pediatrics, University of Washington, Seattle, Washington 98195, USA
Carlos D. Bustamante Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA
Eimear E. Kenny Department of Genetics, Stanford University School of Medicine, Stanford, California 94305, USA Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA
Ryan D. Hernandez Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94158, USA Institute for Human Genetics, University of California, San Francisco, San Francisco, California 94143, USA California Institute for Quantitative Biosciences, University of California, San Francisco, California 94143, USA
Terri H. Beaty Department of Epidemiology, Bloomberg School of Public Health, JHU, Baltimore, Maryland 21205, USA
Ingo Ruczinski Department of Biostatistics, Bloomberg School of Public Health, JHU, Baltimore, Maryland 21205, USA
Joshua Akey Department of Genomic Sciences, University of Washington, Seattle, Washington 98195, USA
Kathleen C. Barnes Department of Medicine, Johns Hopkins University, Baltimore, Maryland 21224, USA Department of Epidemiology, Bloomberg School of Public Health, JHU, Baltimore, Maryland 21205, USA

Collapse

Auer PL, Reiner AP, Wang G, Kang HM, Abecasis GR, Altshuler D, Bamshad MJ, Nickerson DA, Tracy RP, Rich SS, Leal SM, Leal SM. Guidelines for Large-Scale Sequence-Based Complex Trait Association Studies: Lessons Learned from the NHLBI Exome Sequencing Project. Am J Hum Genet 2016;99:791-801. [PMID: 27666372 DOI: 10.1016/j.ajhg.2016.08.012] [Citation(s) in RCA: 69] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2016] [Accepted: 08/08/2016] [Indexed: 12/11/2022] Open

Xue C, Chen H, Yu F. Base-Biased Evolution of Disease-Associated Mutations in the Human Genome. Hum Mutat 2016;37:1209-1214. [PMID: 27507420 DOI: 10.1002/humu.23065] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Revised: 08/02/2016] [Accepted: 08/07/2016] [Indexed: 11/08/2022]

Goldberg A, Mychajliw AM, Hadly EA. Post-invasion demography of prehistoric humans in South America. Nature 2016;532:232-5. [PMID: 27049941 DOI: 10.1038/nature17176] [Citation(s) in RCA: 72] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 01/26/2016] [Indexed: 01/25/2023]

Li B, Wang GT, Leal SM. Generation of sequence-based data for pedigree-segregating Mendelian or Complex traits. Bioinformatics 2015;31:3706-8. [PMID: 26177964 DOI: 10.1093/bioinformatics/btv412] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Accepted: 07/07/2015] [Indexed: 01/05/2023] Open

Methods and models for unravelling human evolutionary history. Nat Rev Genet 2015;16:727-40. [DOI: 10.1038/nrg4005] [Citation(s) in RCA: 136] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Inference of Super-exponential Human Population Growth via Efficient Computation of the Site Frequency Spectrum for Generalized Models. Genetics 2015;202:235-45. [PMID: 26450922 PMCID: PMC4701087 DOI: 10.1534/genetics.115.180570] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 09/28/2015] [Indexed: 01/08/2023] Open

Lohmueller KE. The distribution of deleterious genetic variation in human populations. Curr Opin Genet Dev 2015;29:139-46. [PMID: 25461617 DOI: 10.1016/j.gde.2014.09.005] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Revised: 08/28/2014] [Accepted: 09/05/2014] [Indexed: 11/19/2022]

Chen H, Hey J, Chen K. Inferring Very Recent Population Growth Rate from Population-Scale Sequencing Data: Using a Large-Sample Coalescent Estimator. Mol Biol Evol 2015;32:2996-3011. [PMID: 26187437 DOI: 10.1093/molbev/msv158] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

Fundamental limits on the accuracy of demographic inference based on the sample frequency spectrum. Proc Natl Acad Sci U S A 2015;112:7677-82. [PMID: 26056264 DOI: 10.1073/pnas.1503717112] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open

Fregel R, Cabrera V, Larruga JM, Abu-Amero KK, González AM. Carriers of Mitochondrial DNA Macrohaplogroup N Lineages Reached Australia around 50,000 Years Ago following a Northern Asian Route. PLoS One 2015;10:e0129839. [PMID: 26053380 PMCID: PMC4460043 DOI: 10.1371/journal.pone.0129839] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2014] [Accepted: 05/13/2015] [Indexed: 01/17/2023] Open

Abstract

Background

The modern human colonization of Eurasia and Australia is mostly explained by a single-out-of-Africa exit following a southern coastal route throughout Arabia and India. However, dispersal across the Levant would better explain the introgression with Neanderthals, and more than one exit would fit better with the different ancient genomic components discovered in indigenous Australians and in ancient Europeans. The existence of an additional Northern route used by modern humans to reach Australia was previously deduced from the phylogeography of mtDNA macrohaplogroup N. Here, we present new mtDNA data and new multidisciplinary information that add more support to this northern route.

Methods

MtDNA hypervariable segments and haplogroup diagnostic coding positions were analyzed in 2,278 Saudi Arabs, from which 1,725 are new samples. Besides, we used 623 published mtDNA genomes belonging to macrohaplogroup N, but not R, to build updated phylogenetic trees to calculate their coalescence ages, and more than 70,000 partial mtDNA sequences were screened to establish their respective geographic ranges.

Results

The Saudi mtDNA profile confirms the absence of autochthonous mtDNA lineages in Arabia with coalescence ages deep enough to support population continuity in the region since the out-of-Africa episode. In contrast to Australia, where N(xR) haplogroups are found in high frequency and with deep coalescence ages, there are not autochthonous N(xR) lineages in India nor N(xR) branches with coalescence ages as deep as those found in Australia. These patterns are at odds with the supposition that Australian colonizers harboring N(xR) lineages used a route involving India as a stage. The most ancient N(xR) lineages in Eurasia are found in China, and inconsistently with the coastal route, N(xR) haplogroups with the southernmost geographical range have all more recent radiations than the Australians.

Conclusions

Apart from a single migration event via a southern route, phylogeny and phylogeography of N(xR) lineages support that people carrying mtDNA N lineages could have reach Australia following a northern route through Asia. Data from other disciplines also support this scenario.

Collapse

Yu F, Lu J, Liu X, Gazave E, Chang D, Raj S, Hunter-Zinck H, Blekhman R, Arbiza L, Van Hout C, Morrison A, Johnson AD, Bis J, Cupples LA, Psaty BM, Muzny D, Yu J, Gibbs RA, Keinan A, Clark AG, Boerwinkle E. Population genomic analysis of 962 whole genome sequences of humans reveals natural selection in non-coding regions. PLoS One 2015;10:e0121644. [PMID: 25807536 PMCID: PMC4373932 DOI: 10.1371/journal.pone.0121644] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Accepted: 08/14/2014] [Indexed: 12/13/2022] Open

Affiliation(s)

Fuli Yu Human Genome Sequencing Center, Molecular and Human Genetics Department, Baylor College of Medicine, Houston, Texas, United States of America Institute of Neurology, Tianjin Medical University General Hospital, Tianjin, China * E-mail: (FY); (EB)
Jian Lu Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America College of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
Xiaoming Liu Human Genetic Center, University of Texas Health Science Center, Houston, Texas, United States of America
Elodie Gazave Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Diana Chang Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Srilakshmi Raj Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Haley Hunter-Zinck Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Ran Blekhman Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Leonardo Arbiza Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Cris Van Hout Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Alanna Morrison Human Genetic Center, University of Texas Health Science Center, Houston, Texas, United States of America
Andrew D. Johnson National Heart, Lung and Blood Institute (NHLBI) Framingham Heart Study, Framingham, Massachusetts, United States of America
Joshua Bis Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services, University of Washington, Seattle, Washington, United States of America
L. Adrienne Cupples National Heart, Lung and Blood Institute (NHLBI) Framingham Heart Study, Framingham, Massachusetts, United States of America
Bruce M. Psaty Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services, University of Washington, Seattle, Washington, United States of America
Donna Muzny Human Genome Sequencing Center, Molecular and Human Genetics Department, Baylor College of Medicine, Houston, Texas, United States of America
Jin Yu Human Genome Sequencing Center, Molecular and Human Genetics Department, Baylor College of Medicine, Houston, Texas, United States of America
Richard A. Gibbs Human Genome Sequencing Center, Molecular and Human Genetics Department, Baylor College of Medicine, Houston, Texas, United States of America
Alon Keinan Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Andrew G. Clark Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, New York, United States of America
Eric Boerwinkle Human Genome Sequencing Center, Molecular and Human Genetics Department, Baylor College of Medicine, Houston, Texas, United States of America Human Genetic Center, University of Texas Health Science Center, Houston, Texas, United States of America * E-mail: (FY); (EB)

Collapse