151
|
Andersen MK, Pedersen CET, Moltke I, Hansen T, Albrechtsen A, Grarup N. Genetics of Type 2 Diabetes: the Power of Isolated Populations. Curr Diab Rep 2016; 16:65. [PMID: 27189761 DOI: 10.1007/s11892-016-0757-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Type 2 diabetes (T2D) affects millions of people worldwide. Improving the understanding of the underlying mechanisms and ultimately improving the treatment strategies are, thus, of great interest. To achieve this, identification of genetic variation predisposing to T2D is important. A large number of variants have been identified in large outbred populations, mainly from Europe and Asia. However, to elucidate additional variation, isolated populations have a number of advantageous properties, including increased amounts of linkage disequilibrium, and increased probability for presence of high frequency disease-associated variants due to genetic drift. Collectively, this increases the statistical power to detect association signals in isolated populations compared to large outbred populations. In this review, we elaborate on why isolated populations are a powerful resource for the identification of complex disease variants and describe their contributions to the understanding of the genetics of T2D.
Collapse
Affiliation(s)
- Mette Korre Andersen
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Section of Metabolic Genetics, Faculty of Health and Medical Sciences, University of Copenhagen, Universitetsparken 1, 2100, Copenhagen, Denmark
| | - Casper-Emil Tingskov Pedersen
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
| | - Ida Moltke
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
| | - Torben Hansen
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Section of Metabolic Genetics, Faculty of Health and Medical Sciences, University of Copenhagen, Universitetsparken 1, 2100, Copenhagen, Denmark
- Faculty of Health Sciences, University of Southern Denmark, J.B. Winsløws Vej 19, 3, 5000, Odense, Denmark
| | - Anders Albrechtsen
- The Bioinformatics Centre, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
| | - Niels Grarup
- The Novo Nordisk Foundation Center for Basic Metabolic Research, Section of Metabolic Genetics, Faculty of Health and Medical Sciences, University of Copenhagen, Universitetsparken 1, 2100, Copenhagen, Denmark.
| |
Collapse
|
152
|
Beissinger TM, Wang L, Crosby K, Durvasula A, Hufford MB, Ross-Ibarra J. Recent demography drives changes in linked selection across the maize genome. NATURE PLANTS 2016; 2:16084. [PMID: 27294617 DOI: 10.1038/nplants.2016.84] [Citation(s) in RCA: 72] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2016] [Accepted: 05/12/2016] [Indexed: 05/14/2023]
Abstract
Genetic diversity is shaped by the interaction of drift and selection, but the details of this interaction are not well understood. The impact of genetic drift in a population is largely determined by its demographic history, typically summarized by its long-term effective population size (Ne). Rapidly changing population demographics complicate this relationship, however. To better understand how changing demography impacts selection, we used whole-genome sequencing data to investigate patterns of linked selection in domesticated and wild maize (teosinte). We produce the first whole-genome estimate of the demography of maize domestication, showing that maize was reduced to approximately 5% the population size of teosinte before it experienced rapid expansion post-domestication to population sizes much larger than its ancestor. Evaluation of patterns of nucleotide diversity in and near genes shows little evidence of selection on beneficial amino acid substitutions, and that the domestication bottleneck led to a decline in the efficiency of purifying selection in maize. Young alleles, however, show evidence of much stronger purifying selection in maize, reflecting the much larger effective size of present day populations. Our results demonstrate that recent demographic change-a hall-mark of many species including both humans and crops-can have immediate and wide-ranging impacts on diversity that conflict with expectations based on long-term Ne alone.
Collapse
Affiliation(s)
- Timothy M Beissinger
- Department of Plant Sciences, University of California, Davis, California 95616, USA
- US Department of Agriculture, Agricultural Research Service, Columbia, Missouri 65211, USA
- Division of Plant Sciences, University of Missouri, Columbia, Missouri 65211, USA
| | - Li Wang
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011, USA
| | - Kate Crosby
- Department of Plant Sciences, University of California, Davis, California 95616, USA
| | - Arun Durvasula
- Department of Plant Sciences, University of California, Davis, California 95616, USA
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011, USA
| | - Jeffrey Ross-Ibarra
- Department of Plant Sciences, University of California, Davis, California 95616, USA
- Genome Center and Center for Population Biology, University of California, Davis, California 95616, USA
| |
Collapse
|
153
|
Uricchio LH, Zaitlen NA, Ye CJ, Witte JS, Hernandez RD. Selection and explosive growth alter genetic architecture and hamper the detection of causal rare variants. Genome Res 2016; 26:863-73. [PMID: 27197206 PMCID: PMC4937562 DOI: 10.1101/gr.202440.115] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 05/16/2016] [Indexed: 12/20/2022]
Abstract
The role of rare alleles in complex phenotypes has been hotly debated, but most rare variant association tests (RVATs) do not account for the evolutionary forces that affect genetic architecture. Here, we use simulation and numerical algorithms to show that explosive population growth, as experienced by human populations, can dramatically increase the impact of very rare alleles on trait variance. We then assess the ability of RVATs to detect causal loci using simulations and human RNA-seq data. Surprisingly, we find that statistical performance is worst for phenotypes in which genetic variance is due mainly to rare alleles, and explosive population growth decreases power. Although many studies have attempted to identify causal rare variants, few have reported novel associations. This has sometimes been interpreted to mean that rare variants make negligible contributions to complex trait heritability. Our work shows that RVATs are not robust to realistic human evolutionary forces, so general conclusions about the impact of rare variants on complex traits may be premature.
Collapse
Affiliation(s)
- Lawrence H Uricchio
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, USA; Graduate Program in Bioinformatics, University of California, San Francisco, San Francisco, California 94143, USA
| | - Noah A Zaitlen
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, USA; Institute for Human Genetics, University of California, San Francisco, San Francisco, California 94143, USA; Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California 94143, USA
| | - Chun Jimmie Ye
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, USA; Institute for Human Genetics, University of California, San Francisco, San Francisco, California 94143, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, California 94143, USA
| | - John S Witte
- Institute for Human Genetics, University of California, San Francisco, San Francisco, California 94143, USA; Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, California 94143, USA
| | - Ryan D Hernandez
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, USA; Institute for Human Genetics, University of California, San Francisco, San Francisco, California 94143, USA; Institute for Quantitative Biosciences (QB3), University of California, San Francisco, San Francisco, California 94143, USA
| |
Collapse
|
154
|
Silver AJ, Larson JL, Silver MJ, Lim RM, Borroto C, Spurrier B, Morriss A, Silver LM. Carrier Screening is a Deficient Strategy for Determining Sperm Donor Eligibility and Reducing Risk of Disease in Recipient Children. Genet Test Mol Biomarkers 2016; 20:276-84. [PMID: 27104957 PMCID: PMC4892196 DOI: 10.1089/gtmb.2016.0014] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Aims: DNA-based carrier screening is a standard component of donor eligibility protocols practiced by U.S. sperm banks. Applicants who test positive for carrying a recessive disease mutation are typically disqualified. The aim of our study was to examine the utility of a range of screening panels adopted by the industry and the effectiveness of the screening paradigm in reducing a future child's risk of inheriting disease. Methods: A cohort of 27 donor applicants, who tested negative on an initial cystic fibrosis carrier test, was further screened with three expanded commercial carrier testing panels. These results were then compared to a systematic analysis of the applicants' DNA using next-generation sequencing (NGS) data. Results: The carrier panels detected serious pediatric disease mutations in one, four, or six donor applicants. Because each panel screens distinct regions of the genome, no single donor was uniformly identified as carrier positive by all three panels. In contrast, systematic NGS analysis identified all donors as carriers of one or more mutations associated with severe monogenic pediatric disease. These included 30 variants classified as “pathogenic” based on clinical observation and 66 with a high likelihood of causing gene dysfunction. Conclusion: Despite tremendous advances in variant identification, understanding, and analysis, the vast majority of disease-causing mutation combinations remain undetected by commercial carrier screening panels, which cover a narrow, and often distinct, subset of genes and mutations. The biological reality is that all donors and recipients carry serious recessive disease mutations. This challenges the utility of any screening protocol that anchors donor eligibility to carrier status. A more effective approach to reducing recessive disease risk would consider joint comprehensive analysis of both donor and recipient disease mutations. This type of high-resolution recessive disease risk analysis is now available and affordable, but industry practice must be modified to incorporate its use.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Anne Morriss
- 1 GenePeeks, Inc. , Cambridge, Massachusetts.,2 GenePeeks, Inc. , New York, New York
| | - Lee M Silver
- 1 GenePeeks, Inc. , Cambridge, Massachusetts.,2 GenePeeks, Inc. , New York, New York.,3 Department of Molecular Biology, Princeton University , Princeton, New Jersey
| |
Collapse
|
155
|
Abstract
Approximately 2-4% of genetic material in human populations outside Africa is derived from Neanderthals who interbred with anatomically modern humans. Recent studies have shown that this Neanderthal DNA is depleted around functional genomic regions; this has been suggested to be a consequence of harmful epistatic interactions between human and Neanderthal alleles. However, using published estimates of Neanderthal inbreeding and the distribution of mutational fitness effects, we infer that Neanderthals had at least 40% lower fitness than humans on average; this increased load predicts the reduction in Neanderthal introgression around genes without the need to invoke epistasis. We also predict a residual Neanderthal mutational load in non-Africans, leading to a fitness reduction of at least 0.5%. This effect of Neanderthal admixture has been left out of previous debate on mutation load differences between Africans and non-Africans. We also show that if many deleterious mutations are recessive, the Neanderthal admixture fraction could increase over time due to the protective effect of Neanderthal haplotypes against deleterious alleles that arose recently in the human population. This might partially explain why so many organisms retain gene flow from other species and appear to derive adaptive benefits from introgression.
Collapse
|
156
|
Narasimhan VM, Xue Y, Tyler-Smith C. Human Knockout Carriers: Dead, Diseased, Healthy, or Improved? Trends Mol Med 2016; 22:341-351. [PMID: 26988438 PMCID: PMC4826344 DOI: 10.1016/j.molmed.2016.02.006] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Revised: 02/18/2016] [Accepted: 02/19/2016] [Indexed: 01/11/2023]
Abstract
Whole-genome and whole-exome sequence data from large numbers of individuals reveal that we all carry many variants predicted to inactivate genes (knockouts). This discovery raises questions about the phenotypic consequences of these knockouts and potentially allows us to study human gene function through the investigation of homozygous loss-of-function carriers. Here, we discuss strategies, recent results, and future prospects for large-scale human knockout studies. We examine their relevance to studying gene function, population genetics, and importantly, the implications for accurate clinical interpretations.
Collapse
Affiliation(s)
| | - Yali Xue
- Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | | |
Collapse
|
157
|
Abstract
Deleterious alleles can reach high frequency in small populations because of random fluctuations in allele frequency. This may lead, over time, to reduced average fitness. In this sense, selection is more "effective" in larger populations. Recent studies have considered whether the different demographic histories across human populations have resulted in differences in the number, distribution, and severity of deleterious variants, leading to an animated debate. This article first seeks to clarify some terms of the debate by identifying differences in definitions and assumptions used in recent studies. We argue that variants of Morton, Crow, and Muller's "total mutational damage" provide the soundest and most practical basis for such comparisons. Using simulations, analytical calculations, and 1000 Genomes Project data, we provide an intuitive and quantitative explanation for the observed similarity in genetic load across populations. We show that recent demography has likely modulated the effect of selection and still affects it, but the net result of the accumulated differences is small. Direct observation of differential efficacy of selection for specific allele classes is nevertheless possible with contemporary data sets. By contrast, identifying average genome-wide differences in the efficacy of selection across populations will require many modeling assumptions and is unlikely to provide much biological insight about human populations.
Collapse
|
158
|
Brandvain Y, Wright SI. The Limits of Natural Selection in a Nonequilibrium World. Trends Genet 2016; 32:201-210. [PMID: 26874998 DOI: 10.1016/j.tig.2016.01.004] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Revised: 12/23/2015] [Accepted: 01/19/2016] [Indexed: 11/19/2022]
Abstract
Evolutionary theory predicts that factors such as a small population size or low recombination rate can limit the action of natural selection. The emerging field of comparative population genomics offers an opportunity to evaluate these hypotheses. However, classical theoretical predictions assume that populations are at demographic equilibrium. This assumption is likely to be violated in the very populations researchers use to evaluate selection's limits: populations that have experienced a recent shift in population size and/or effective recombination rates. Here we highlight theory and data analyses concerning limitations on the action of natural selection in nonequilibrial populations and argue that substantial care is needed to appropriately test whether species and populations show meaningful differences in selection efficacy. A move toward model-based inferences that explicitly incorporate nonequilibrium dynamics provides a promising approach to more accurately contrast selection efficacy across populations and interpret its significance.
Collapse
Affiliation(s)
- Yaniv Brandvain
- Department of Plant Biology, University of Minnesota, St Paul, MN 55108, USA
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON M5S 3B2, Canada.
| |
Collapse
|
159
|
Patterns of deleterious variation between human populations reveal an unbalanced load. Proc Natl Acad Sci U S A 2016; 113:809-11. [PMID: 26787891 DOI: 10.1073/pnas.1524016113] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|
160
|
Marsden CD, Ortega-Del Vecchyo D, O'Brien DP, Taylor JF, Ramirez O, Vilà C, Marques-Bonet T, Schnabel RD, Wayne RK, Lohmueller KE. Bottlenecks and selective sweeps during domestication have increased deleterious genetic variation in dogs. Proc Natl Acad Sci U S A 2016; 113:152-7. [PMID: 26699508 PMCID: PMC4711855 DOI: 10.1073/pnas.1512501113] [Citation(s) in RCA: 187] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Population bottlenecks, inbreeding, and artificial selection can all, in principle, influence levels of deleterious genetic variation. However, the relative importance of each of these effects on genome-wide patterns of deleterious variation remains controversial. Domestic and wild canids offer a powerful system to address the role of these factors in influencing deleterious variation because their history is dominated by known bottlenecks and intense artificial selection. Here, we assess genome-wide patterns of deleterious variation in 90 whole-genome sequences from breed dogs, village dogs, and gray wolves. We find that the ratio of amino acid changing heterozygosity to silent heterozygosity is higher in dogs than in wolves and, on average, dogs have 2-3% higher genetic load than gray wolves. Multiple lines of evidence indicate this pattern is driven by less efficient natural selection due to bottlenecks associated with domestication and breed formation, rather than recent inbreeding. Further, we find regions of the genome implicated in selective sweeps are enriched for amino acid changing variants and Mendelian disease genes. To our knowledge, these results provide the first quantitative estimates of the increased burden of deleterious variants directly associated with domestication and have important implications for selective breeding programs and the conservation of rare and endangered species. Specifically, they highlight the costs associated with selective breeding and question the practice favoring the breeding of individuals that best fit breed standards. Our results also suggest that maintaining a large population size, rather than just avoiding inbreeding, is a critical factor for preventing the accumulation of deleterious variants.
Collapse
Affiliation(s)
- Clare D Marsden
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095
| | | | - Dennis P O'Brien
- Department of Veterinary Medicine and Surgery, University of Missouri, Columbia, MO 65211
| | - Jeremy F Taylor
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211
| | - Oscar Ramirez
- Institut Catala de Recerca i Estudis Avançats, Institut de Biologia Evolutiva (Centro Superior de Investigaciones Cientificas-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Carles Vilà
- Conservation and Evolutionary Genetics Group, Estación Biológica de Doñana-Consejo Superior de Investigaciones Cientificas, 41092, Seville, Spain
| | - Tomas Marques-Bonet
- Institut Catala de Recerca i Estudis Avançats, Institut de Biologia Evolutiva (Centro Superior de Investigaciones Cientificas-Universitat Pompeu Fabra), 08003 Barcelona, Spain; Centro Nacional Analasis Genomico, 08023, Barcelona, Spain
| | - Robert D Schnabel
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211; Informatics Institute, University of Missouri, Columbia, MO 65211
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095; Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA 90095; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095
| |
Collapse
|
161
|
Subramanian S. Europeans have a higher proportion of high‑frequency deleterious variants than Africans. Hum Genet 2016; 135:1-7. [PMID: 26462918 DOI: 10.1007/s00439-015-1604-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Accepted: 10/01/2015] [Indexed: 12/11/2022]
Abstract
Recent studies have shown that a high proportion of rare variants in European and African populations are deleterious in nature. However, the deleterious fraction of high-frequency variants is unclear. Using more than 6500 exomes we show a much higher fraction (11 %) of relatively high-frequency nonsynonymous (amino acid altering) variants (DAF 0.1–10 %) in European Americans (EA) compared to those from African Americans (AA). In contrast, this difference was only marginal (<2 %) for low-frequency nonsynonymous variants (DAF <0.1 %). Our results also revealed that the proportion of high-frequency deleterious nonsynonymous variants in EA was much higher (24 %) than that in AA and this difference was very small (4 %) for the low-frequency deleterious amino acid altering variants. We also show that EA have significantly more number of high-frequency deleterious nonsynonymous variants per genome than AA. The high proportion of high-frequency deleterious variants in EA could be the result of the well-known bottleneck experienced by European populations in which harmful mutations may have drifted to high frequencies. The estimated ages of deleterious variants support this prediction. Our results suggest that high-frequency variants could be relatively more likely to be associated with diseases in Europeans than in Africans and hence emphasize the need for population-specific strategies in genomic medicine studies.
Collapse
Affiliation(s)
- Sankar Subramanian
- Environmental Futures Research Institute, Griffith University, 170 Kessels Road, Nathan, QLD 4111, Australia.
| |
Collapse
|
162
|
Distance from sub-Saharan Africa predicts mutational load in diverse human genomes. Proc Natl Acad Sci U S A 2015; 113:E440-9. [PMID: 26712023 DOI: 10.1073/pnas.1510805112] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The Out-of-Africa (OOA) dispersal ∼ 50,000 y ago is characterized by a series of founder events as modern humans expanded into multiple continents. Population genetics theory predicts an increase of mutational load in populations undergoing serial founder effects during range expansions. To test this hypothesis, we have sequenced full genomes and high-coverage exomes from seven geographically divergent human populations from Namibia, Congo, Algeria, Pakistan, Cambodia, Siberia, and Mexico. We find that individual genomes vary modestly in the overall number of predicted deleterious alleles. We show via spatially explicit simulations that the observed distribution of deleterious allele frequencies is consistent with the OOA dispersal, particularly under a model where deleterious mutations are recessive. We conclude that there is a strong signal of purifying selection at conserved genomic positions within Africa, but that many predicted deleterious mutations have evolved as if they were neutral during the expansion out of Africa. Under a model where selection is inversely related to dominance, we show that OOA populations are likely to have a higher mutation load due to increased allele frequencies of nearly neutral variants that are recessive or partially recessive.
Collapse
|
163
|
Gaut BS, Díez CM, Morrell PL. Genomics and the Contrasting Dynamics of Annual and Perennial Domestication. Trends Genet 2015; 31:709-719. [PMID: 26603610 DOI: 10.1016/j.tig.2015.10.002] [Citation(s) in RCA: 93] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Revised: 09/08/2015] [Accepted: 10/15/2015] [Indexed: 12/20/2022]
Abstract
Plant domestication modifies a wild species genetically for human use. Among thousands of domesticated plants, a major distinction is the difference between annual and perennial life cycles. The domestication of perennials is expected to follow different processes than annuals, with distinct genetic outcomes. Here we examine domestication from a population genetics perspective, with a focus on three issues: genetic bottlenecks during domestication, introgression as a source of local adaptation, and genetic load. These three issues have been studied nominally in major annual crops but even less extensively in perennials. Here we highlight lessons from annual plants, motivations to study these issues in perennial plants, and new approaches that may lead to further progress.
Collapse
Affiliation(s)
- Brandon S Gaut
- Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, CA 92697, USA.
| | - Concepción M Díez
- Departamento de Agronomía, Universidad de Córdoba - Campus de Excelencia Internacional Agroalimentario ceiA3, Edificio C4, Campus de Rabanales, 14014 Córdoba, Spain
| | - Peter L Morrell
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN 55108, USA
| |
Collapse
|
164
|
Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, Marchini JL, McCarthy S, McVean GA, Abecasis GR. A global reference for human genetic variation. Nature 2015; 526:68-74. [PMID: 26432245 PMCID: PMC4750478 DOI: 10.1038/nature15393] [Citation(s) in RCA: 10858] [Impact Index Per Article: 1206.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2015] [Accepted: 08/20/2015] [Indexed: 12/04/2022]
Abstract
The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies. Results for the final phase of the 1000 Genomes Project are presented including whole-genome sequencing, targeted exome sequencing, and genotyping on high-density SNP arrays for 2,504 individuals across 26 populations, providing a global reference data set to support biomedical genetics. The 1000 Genomes Project has sought to comprehensively catalogue human genetic variation across populations, providing a valuable public genomic resource. The data obtained so far have found applications ranging from association studies and fine mapping studies to the filtering of likely neutral variants in rare-disease cohorts. The authors now report on the final phase of the project, phase 3, which covers previously uncharacterized areas of human genetic diversity in terms of the populations sampled and categories of characterized variation. The sample now includes more than 2,500 individuals from 26 global populations, with low coverage whole-genome and deep exome sequencing, as well as dense microarray genotyping. They find that while most common variants are shared across populations, rarer variants are often restricted to closely related populations. The authors also demonstrate the use of the phase 3 dataset as a reference panel for imputation to improve the resolution in genetic association studies.
Collapse
|
165
|
The Nature of Genetic Variation for Complex Traits Revealed by GWAS and Regional Heritability Mapping Analyses. Genetics 2015; 201:1601-13. [PMID: 26482794 DOI: 10.1534/genetics.115.177220] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2015] [Accepted: 10/09/2015] [Indexed: 02/08/2023] Open
Abstract
We use computer simulations to investigate the amount of genetic variation for complex traits that can be revealed by single-SNP genome-wide association studies (GWAS) or regional heritability mapping (RHM) analyses based on full genome sequence data or SNP chips. We model a large population subject to mutation, recombination, selection, and drift, assuming a pleiotropic model of mutations sampled from a bivariate distribution of effects of mutations on a quantitative trait and fitness. The pleiotropic model investigated, in contrast to previous models, implies that common mutations of large effect are responsible for most of the genetic variation for quantitative traits, except when the trait is fitness itself. We show that GWAS applied to the full sequence increases the number of QTL detected by as much as 50% compared to the number found with SNP chips but only modestly increases the amount of additive genetic variance explained. Even with full sequence data, the total amount of additive variance explained is generally below 50%. Using RHM on the full sequence data, a slightly larger number of QTL are detected than by GWAS if the same probability threshold is assumed, but these QTL explain a slightly smaller amount of genetic variance. Our results also suggest that most of the missing heritability is due to the inability to detect variants of moderate effect (∼0.03-0.3 phenotypic SDs) segregating at substantial frequencies. Very rare variants, which are more difficult to detect by GWAS, are expected to contribute little genetic variation, so their eventual detection is less relevant for resolving the missing heritability problem.
Collapse
|
166
|
Gu W, Gurguis CI, Zhou JJ, Zhu Y, Ko EA, Ko JH, Wang T, Zhou T. Functional and Structural Consequence of Rare Exonic Single Nucleotide Polymorphisms: One Story, Two Tales. Genome Biol Evol 2015; 7:2929-40. [PMID: 26454016 PMCID: PMC4684694 DOI: 10.1093/gbe/evv191] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/05/2015] [Indexed: 01/01/2023] Open
Abstract
Genetic variation arising from single nucleotide polymorphisms (SNPs) is ubiquitously found among human populations. While disease-causing variants are known in some cases, identifying functional or causative variants for most human diseases remains a challenging task. Rare SNPs, rather than common ones, are thought to be more important in the pathology of most human diseases. We propose that rare SNPs should be divided into two categories dependent on whether the minor alleles are derived or ancestral. Derived alleles are less likely to have been purified by evolutionary processes and may be more likely to induce deleterious effects. We therefore hypothesized that the rare SNPs with derived minor alleles would be more important for human diseases and predicted that these variants would have larger functional or structural consequences relative to the rare variants for which the minor alleles are ancestral. We systematically investigated the consequences of the exonic SNPs on protein function, mRNA structure, and translation. We found that the functional and structural consequences are more significant for the rare exonic variants for which the minor alleles are derived. However, this pattern is reversed when the minor alleles are ancestral. Thus, the rare exonic SNPs with derived minor alleles are more likely to be deleterious. Age estimation of rare SNPs confirms that these potentially deleterious SNPs are recently evolved in the human population. These results have important implications for understanding the function of genetic variations in human exonic regions and for prioritizing functional SNPs in genome-wide association studies of human diseases.
Collapse
Affiliation(s)
- Wanjun Gu
- Research Center for Learning Sciences, Southeast University, Nanjing, Jiangsu, China
| | | | - Jin J Zhou
- Department of Epidemiology and Biostatistics, The University of Arizona
| | - Yihua Zhu
- School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu, China College of Information Science and Technology, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Eun-A Ko
- Department of Pharmacology, The University of Nevada School of Medicine, Reno
| | - Jae-Hong Ko
- Department of Physiology, College of Medicine, Chung-Ang University, Seoul, South Korea
| | - Ting Wang
- Department of Medicine, The University of Arizona
| | - Tong Zhou
- Department of Medicine, The University of Arizona
| |
Collapse
|
167
|
Edge MD, Rosenberg NA. A General Model of the Relationship between the Apportionment of Human Genetic Diversity and the Apportionment of Human Phenotypic Diversity. Hum Biol 2015; 87:313-337. [PMID: 27737590 PMCID: PMC8504698 DOI: 10.13110/humanbiology.87.4.0313] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Models that examine genetic differences between populations alongside a genotype-phenotype map can provide insight about phenotypic variation among groups. We generalize a simple model of a completely heritable, additive, selectively neutral quantitative trait to examine the relationship between single-locus genetic differentiation and phenotypic differentiation on quantitative traits. In agreement with similar efforts using different models, we show that the expected degree to which two groups differ on a neutral quantitative trait is not strongly affected by the number of genetic loci that influence the trait: neutral trait differences are expected to have a magnitude comparable to the genetic differences at a single neutral locus. We discuss this result with respect to population differences in disease phenotypes, arguing that although neutral genetic differences between populations can contribute to specific differences between populations in health outcomes, systematic patterns of difference that run in the same direction for many genetically independent health conditions are unlikely to be explained by neutral genetic differentiation.
Collapse
Affiliation(s)
- Michael D. Edge
- Department of Biology, Stanford University, Stanford, California
| | | |
Collapse
|
168
|
Yang J, Bakshi A, Zhu Z, Hemani G, Vinkhuyzen AAE, Lee SH, Robinson MR, Perry JRB, Nolte IM, van Vliet-Ostaptchouk JV, Snieder H, Esko T, Milani L, Mägi R, Metspalu A, Hamsten A, Magnusson PKE, Pedersen NL, Ingelsson E, Soranzo N, Keller MC, Wray NR, Goddard ME, Visscher PM. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat Genet 2015; 47:1114-20. [PMID: 26323059 PMCID: PMC4589513 DOI: 10.1038/ng.3390] [Citation(s) in RCA: 511] [Impact Index Per Article: 56.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Accepted: 07/31/2015] [Indexed: 12/17/2022]
Abstract
We propose a method (GREML-LDMS) to estimate heritability for human complex traits in unrelated individuals using whole-genome sequencing data. We demonstrate using simulations based on whole-genome sequencing data that ∼97% and ∼68% of variation at common and rare variants, respectively, can be captured by imputation. Using the GREML-LDMS method, we estimate from 44,126 unrelated individuals that all ∼17 million imputed variants explain 56% (standard error (s.e.) = 2.3%) of variance for height and 27% (s.e. = 2.5%) of variance for body mass index (BMI), and we find evidence that height- and BMI-associated variants have been under natural selection. Considering the imperfect tagging of imputation and potential overestimation of heritability from previous family-based studies, heritability is likely to be 60-70% for height and 30-40% for BMI. Therefore, the missing heritability is small for both traits. For further discovery of genes associated with complex traits, a study design with SNP arrays followed by imputation is more cost-effective than whole-genome sequencing at current prices.
Collapse
Affiliation(s)
- Jian Yang
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
- University of Queensland Diamantina Institute, Translation Research Institute, Brisbane, Queensland, Australia
| | - Andrew Bakshi
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
| | - Zhihong Zhu
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
| | - Gibran Hemani
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
- Medical Research Council (MRC) Integrative Epidemiology Unit (IEU) at the University of Bristol, School of Social and Community Medicine, Bristol, UK
| | - Anna A E Vinkhuyzen
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
| | - Sang Hong Lee
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
- School of Environmental and Rural Science, University of New England, Armidale, New South Wales, Australia
| | - Matthew R Robinson
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
| | - John R B Perry
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge Biomedical Campus, Cambridge, UK
| | - Ilja M Nolte
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Jana V van Vliet-Ostaptchouk
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Endocrinology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Harold Snieder
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Tonu Esko
- Estonian Genome Center, University of Tartu, Tartu, Estonia
- Division of Endocrinology, Boston Children's Hospital, Cambridge, Massachusetts, USA
- Program in Medical and Populational Genetics, Broad Institute, Cambridge, Massachusetts, USA
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA
| | - Lili Milani
- Estonian Genome Center, University of Tartu, Tartu, Estonia
| | - Reedik Mägi
- Estonian Genome Center, University of Tartu, Tartu, Estonia
| | - Andres Metspalu
- Estonian Genome Center, University of Tartu, Tartu, Estonia
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Anders Hamsten
- Cardiovascular Genetics and Genomics Group, Atherosclerosis Research Unit, Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden
| | - Patrik K E Magnusson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Nancy L Pedersen
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Erik Ingelsson
- Department of Medical Sciences, Molecular Epidemiology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Nicole Soranzo
- Department of Human Genetics, Wellcome Trust Sanger Institute, Genome Campus, Hinxton, UK
- Department of Haematology, University of Cambridge, Cambridge, UK
| | - Matthew C Keller
- Department of Psychology and Neuroscience, University of Colorado, Boulder, Colorado, USA
- Institute for Behavioral Genetics, University of Colorado, Boulder, Colorado, USA
| | - Naomi R Wray
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
| | - Michael E Goddard
- Faculty of Veterinary and Agricultural Science, University of Melbourne, Parkville, Victoria, Australia
- Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, Victoria, Australia
| | - Peter M Visscher
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland, Australia
- University of Queensland Diamantina Institute, Translation Research Institute, Brisbane, Queensland, Australia
| |
Collapse
|
169
|
Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, Zhang Y, Ye K, Jun G, Hsi-Yang Fritz M, Konkel MK, Malhotra A, Stütz AM, Shi X, Paolo Casale F, Chen J, Hormozdiari F, Dayama G, Chen K, Malig M, Chaisson MJP, Walter K, Meiers S, Kashin S, Garrison E, Auton A, Lam HYK, Jasmine Mu X, Alkan C, Antaki D, Bae T, Cerveira E, Chines P, Chong Z, Clarke L, Dal E, Ding L, Emery S, Fan X, Gujral M, Kahveci F, Kidd JM, Kong Y, Lameijer EW, McCarthy S, Flicek P, Gibbs RA, Marth G, Mason CE, Menelaou A, Muzny DM, Nelson BJ, Noor A, Parrish NF, Pendleton M, Quitadamo A, Raeder B, Schadt EE, Romanovitch M, Schlattl A, Sebra R, Shabalin AA, Untergasser A, Walker JA, Wang M, Yu F, Zhang C, Zhang J, Zheng-Bradley X, Zhou W, Zichner T, Sebat J, Batzer MA, McCarroll SA, Mills RE, Gerstein MB, Bashir A, Stegle O, Devine SE, Lee C, Eichler EE, Korbel JO. An integrated map of structural variation in 2,504 human genomes. Nature 2015; 526:75-81. [PMID: 26432246 PMCID: PMC4617611 DOI: 10.1038/nature15394] [Citation(s) in RCA: 1455] [Impact Index Per Article: 161.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2015] [Accepted: 08/20/2015] [Indexed: 12/11/2022]
Abstract
Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.
Collapse
Affiliation(s)
- Peter H. Sudmant
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
| | - Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
| | - Eugene J. Gardner
- Institute for Genome Sciences, University of Maryland School of Medicine, 801 W Baltimore Street, Baltimore, 21201 Maryland USA
| | - Robert E. Handsaker
- Department of Genetics, Harvard Medical School, 25 Shattuck Street, Boston, Boston, 02115 Massachusetts USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, 02142 Massachusetts USA
| | - Alexej Abyzov
- Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, 200 First Street SW, Rochester, 55905 Minnesota USA
| | - John Huddleston
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
- Howard Hughes Medical Institute, University of Washington, Seattle, 98195 Washington USA
| | - Yan Zhang
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 & 437, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- Department of Molecular Biophysics and Biochemistry, School of Medicine, Yale University, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
| | - Kai Ye
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
- Department of Genetics, Washington University in St Louis, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
| | - Goo Jun
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan, 1415 Washington Heights, Ann Arbor, 48109 Michigan USA
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, 1200 Pressler St., Houston, 77030 Texas USA
| | - Markus Hsi-Yang Fritz
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
| | - Miriam K. Konkel
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, 70803 Louisiana USA
| | - Ankit Malhotra
- The Jackson Laboratory for Genomic Medicine, 10 Discovery 263 Farmington Avenue, Farmington, 06030 Connecticut USA
| | - Adrian M. Stütz
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
| | - Xinghua Shi
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, 28223 North Carolina USA
| | - Francesco Paolo Casale
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD Cambridge UK
| | - Jieming Chen
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 & 437, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, 06520 Connecticut USA
| | - Fereydoun Hormozdiari
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
| | - Gargi Dayama
- Department of Computational Medicine & Bioinformatics, University of Michigan, 500 S. State Street, Ann Arbor, 48109 Michigan USA
| | - Ken Chen
- The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, 77030 Texas USA
| | - Maika Malig
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
| | - Mark J. P. Chaisson
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
| | - Klaudia Walter
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge UK
| | - Sascha Meiers
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
| | - Seva Kashin
- Department of Genetics, Harvard Medical School, 25 Shattuck Street, Boston, Boston, 02115 Massachusetts USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, 02142 Massachusetts USA
| | - Erik Garrison
- Department of Biology, Boston College, 355 Higgins Hall, 140 Commonwealth Avenue, Chestnut Hill, 02467 Massachusetts USA
| | - Adam Auton
- Department of Genetics, Albert Einstein College of Medicine, 1301 Morris Park Avenue, Bronx, 10461 New York USA
| | - Hugo Y. K. Lam
- Bina Technologies, Roche Sequencing, 555 Twin Dolphin Drive, Redwood City, 94065 California USA
| | - Xinmeng Jasmine Mu
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 & 437, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- Cancer Program, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, 02142 Massachusetts USA
| | - Can Alkan
- Department of Computer Engineering, Bilkent University, Ankara, 06800 Turkey
| | - Danny Antaki
- University of California San Diego (UCSD), 9500 Gilman Drive, La Jolla, 92093 California USA
| | - Taejeong Bae
- Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, 200 First Street SW, Rochester, 55905 Minnesota USA
| | - Eliza Cerveira
- The Jackson Laboratory for Genomic Medicine, 10 Discovery 263 Farmington Avenue, Farmington, 06030 Connecticut USA
| | - Peter Chines
- National Human Genome Research Institute, National Institutes of Health, Bethesda, 20892 Maryland USA
| | - Zechen Chong
- The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, 77030 Texas USA
| | - Laura Clarke
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD Cambridge UK
| | - Elif Dal
- Department of Computer Engineering, Bilkent University, Ankara, 06800 Turkey
| | - Li Ding
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
- Department of Genetics, Washington University in St Louis, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
- Department of Medicine, Washington University in St Louis, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
- Siteman Cancer Center, 660 South Euclid Avenue, St Louis, 63110 Missouri USA
| | - Sarah Emery
- Department of Human Genetics, University of Michigan, 1241 Catherine Street, Ann Arbor, 48109 Michigan USA
| | - Xian Fan
- The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, 77030 Texas USA
| | - Madhusudan Gujral
- University of California San Diego (UCSD), 9500 Gilman Drive, La Jolla, 92093 California USA
| | - Fatma Kahveci
- Department of Computer Engineering, Bilkent University, Ankara, 06800 Turkey
| | - Jeffrey M. Kidd
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan, 1415 Washington Heights, Ann Arbor, 48109 Michigan USA
- Department of Human Genetics, University of Michigan, 1241 Catherine Street, Ann Arbor, 48109 Michigan USA
| | - Yu Kong
- Department of Genetics, Albert Einstein College of Medicine, 1301 Morris Park Avenue, Bronx, 10461 New York USA
| | - Eric-Wubbo Lameijer
- Molecular Epidemiology, Leiden University Medical Center, Leiden, 2300RA The Netherlands
| | - Shane McCarthy
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD Cambridge UK
| | - Richard A. Gibbs
- Baylor College of Medicine, 1 Baylor Plaza, Houston, 77030 Texas USA
| | - Gabor Marth
- Department of Biology, Boston College, 355 Higgins Hall, 140 Commonwealth Avenue, Chestnut Hill, 02467 Massachusetts USA
| | - Christopher E. Mason
- The Department of Physiology and Biophysics and the HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, 1305 York Avenue, Weill Cornell Medical College, New York, 10065 New York USA
- The Feil Family Brain and Mind Research Institute, 413 East 69th St, Weill Cornell Medical College, New York, 10065 New York USA
| | - Androniki Menelaou
- University of Oxford, 1 South Parks Road, Oxford, OX3 9DS UK
- Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht, 3584 CG The Netherlands
| | - Donna M. Muzny
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York School of Natural Sciences, 1428 Madison Avenue, New York, 10029 New York USA
| | - Bradley J. Nelson
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
| | - Amina Noor
- University of California San Diego (UCSD), 9500 Gilman Drive, La Jolla, 92093 California USA
| | - Nicholas F. Parrish
- Institute for Virus Research, Kyoto University, 53 Shogoin Kawahara-cho, Sakyo-ku, 606-8507 Kyoto Japan
| | - Matthew Pendleton
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York School of Natural Sciences, 1428 Madison Avenue, New York, 10029 New York USA
| | - Andrew Quitadamo
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, 28223 North Carolina USA
| | - Benjamin Raeder
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
| | - Eric E. Schadt
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York School of Natural Sciences, 1428 Madison Avenue, New York, 10029 New York USA
| | - Mallory Romanovitch
- The Jackson Laboratory for Genomic Medicine, 10 Discovery 263 Farmington Avenue, Farmington, 06030 Connecticut USA
| | - Andreas Schlattl
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
| | - Robert Sebra
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York School of Natural Sciences, 1428 Madison Avenue, New York, 10029 New York USA
| | - Andrey A. Shabalin
- Center for Biomarker Research and Precision Medicine, Virginia Commonwealth University, 1112 East Clay Street, McGuire Hall, Richmond, 23298-0581 Virginia USA
| | - Andreas Untergasser
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
- Zentrum für Molekulare Biologie, University of Heidelberg, Im Neuenheimer Feld 282, Heidelberg, 69120 Germany
| | - Jerilyn A. Walker
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, 70803 Louisiana USA
| | - Min Wang
- Baylor College of Medicine, 1 Baylor Plaza, Houston, 77030 Texas USA
| | - Fuli Yu
- Baylor College of Medicine, 1 Baylor Plaza, Houston, 77030 Texas USA
| | - Chengsheng Zhang
- The Jackson Laboratory for Genomic Medicine, 10 Discovery 263 Farmington Avenue, Farmington, 06030 Connecticut USA
| | - Jing Zhang
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 & 437, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- Department of Molecular Biophysics and Biochemistry, School of Medicine, Yale University, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
| | - Xiangqun Zheng-Bradley
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD Cambridge UK
| | - Wanding Zhou
- The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, 77030 Texas USA
| | - Thomas Zichner
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
| | - Jonathan Sebat
- University of California San Diego (UCSD), 9500 Gilman Drive, La Jolla, 92093 California USA
| | - Mark A. Batzer
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, 70803 Louisiana USA
| | - Steven A. McCarroll
- Department of Genetics, Harvard Medical School, 25 Shattuck Street, Boston, Boston, 02115 Massachusetts USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, 02142 Massachusetts USA
| | - The 1000 Genomes Project Consortium
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
- Institute for Genome Sciences, University of Maryland School of Medicine, 801 W Baltimore Street, Baltimore, 21201 Maryland USA
- Department of Genetics, Harvard Medical School, 25 Shattuck Street, Boston, Boston, 02115 Massachusetts USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, 02142 Massachusetts USA
- Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, 200 First Street SW, Rochester, 55905 Minnesota USA
- Howard Hughes Medical Institute, University of Washington, Seattle, 98195 Washington USA
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 & 437, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- Department of Molecular Biophysics and Biochemistry, School of Medicine, Yale University, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- The Genome Institute, Washington University School of Medicine, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
- Department of Genetics, Washington University in St Louis, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan, 1415 Washington Heights, Ann Arbor, 48109 Michigan USA
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, 1200 Pressler St., Houston, 77030 Texas USA
- Department of Biological Sciences, Louisiana State University, 202 Life Sciences Building, Baton Rouge, 70803 Louisiana USA
- The Jackson Laboratory for Genomic Medicine, 10 Discovery 263 Farmington Avenue, Farmington, 06030 Connecticut USA
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, 9201 University City Blvd., Charlotte, 28223 North Carolina USA
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD Cambridge UK
- Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, 06520 Connecticut USA
- Department of Computational Medicine & Bioinformatics, University of Michigan, 500 S. State Street, Ann Arbor, 48109 Michigan USA
- The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, 77030 Texas USA
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, CB10 1SA Cambridge UK
- Department of Biology, Boston College, 355 Higgins Hall, 140 Commonwealth Avenue, Chestnut Hill, 02467 Massachusetts USA
- Department of Genetics, Albert Einstein College of Medicine, 1301 Morris Park Avenue, Bronx, 10461 New York USA
- Bina Technologies, Roche Sequencing, 555 Twin Dolphin Drive, Redwood City, 94065 California USA
- Cancer Program, Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, 02142 Massachusetts USA
- Department of Computer Engineering, Bilkent University, Ankara, 06800 Turkey
- University of California San Diego (UCSD), 9500 Gilman Drive, La Jolla, 92093 California USA
- National Human Genome Research Institute, National Institutes of Health, Bethesda, 20892 Maryland USA
- Department of Medicine, Washington University in St Louis, 4444 Forest Park Avenue, St Louis, 63108 Missouri USA
- Siteman Cancer Center, 660 South Euclid Avenue, St Louis, 63110 Missouri USA
- Department of Human Genetics, University of Michigan, 1241 Catherine Street, Ann Arbor, 48109 Michigan USA
- Molecular Epidemiology, Leiden University Medical Center, Leiden, 2300RA The Netherlands
- Baylor College of Medicine, 1 Baylor Plaza, Houston, 77030 Texas USA
- The Department of Physiology and Biophysics and the HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, 1305 York Avenue, Weill Cornell Medical College, New York, 10065 New York USA
- The Feil Family Brain and Mind Research Institute, 413 East 69th St, Weill Cornell Medical College, New York, 10065 New York USA
- University of Oxford, 1 South Parks Road, Oxford, OX3 9DS UK
- Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht, 3584 CG The Netherlands
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York School of Natural Sciences, 1428 Madison Avenue, New York, 10029 New York USA
- Institute for Virus Research, Kyoto University, 53 Shogoin Kawahara-cho, Sakyo-ku, 606-8507 Kyoto Japan
- Center for Biomarker Research and Precision Medicine, Virginia Commonwealth University, 1112 East Clay Street, McGuire Hall, Richmond, 23298-0581 Virginia USA
- Zentrum für Molekulare Biologie, University of Heidelberg, Im Neuenheimer Feld 282, Heidelberg, 69120 Germany
- Department of Computer Science, Yale University, 51 Prospect Street, New Haven, 06511 Connecticut USA
- Department of Graduate Studies – Life Sciences, Ewha Womans University, Ewhayeodae-gil, Seodaemun-gu, 120-750 Seoul South Korea
| | - Ryan E. Mills
- Department of Computational Medicine & Bioinformatics, University of Michigan, 500 S. State Street, Ann Arbor, 48109 Michigan USA
- Department of Human Genetics, University of Michigan, 1241 Catherine Street, Ann Arbor, 48109 Michigan USA
| | - Mark B. Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 & 437, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- Department of Molecular Biophysics and Biochemistry, School of Medicine, Yale University, 266 Whitney Avenue, New Haven, 06520 Connecticut USA
- Department of Computer Science, Yale University, 51 Prospect Street, New Haven, 06511 Connecticut USA
| | - Ali Bashir
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York School of Natural Sciences, 1428 Madison Avenue, New York, 10029 New York USA
| | - Oliver Stegle
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD Cambridge UK
| | - Scott E. Devine
- Institute for Genome Sciences, University of Maryland School of Medicine, 801 W Baltimore Street, Baltimore, 21201 Maryland USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, 10 Discovery 263 Farmington Avenue, Farmington, 06030 Connecticut USA
- Department of Graduate Studies – Life Sciences, Ewha Womans University, Ewhayeodae-gil, Seodaemun-gu, 120-750 Seoul South Korea
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington, 3720 15th Avenue NE, Seattle, 98195-5065 Washington USA
- Howard Hughes Medical Institute, University of Washington, Seattle, 98195 Washington USA
| | - Jan O. Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstrasse 1, Heidelberg, 69117 Germany
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, CB10 1SD Cambridge UK
| |
Collapse
|
170
|
Abstract
The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7×) or exomes (high read depth, 80×) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results. Low read depth sequencing of whole genomes and high read depth exomes of nearly 10,000 extensively phenotyped individuals are combined to help characterize novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with lipid-related traits; in addition to describing population structure and providing functional annotation of rare and low-frequency variants the authors use the data to estimate the benefits of sequencing for association studies. This paper, combining data and initial findings from the different arms of the UK10K project, describes insights from low-read-depth sequencing of whole genomes or high-read-depth exome sequencing of nearly 10,000 individuals sampled from a range of disease collections, as well as participants from healthy population based cohorts. The authors characterize novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with lipid-related traits. In addition to describing population structure and providing functional annotation of rare and low frequency variants, they use the data to estimate the benefits of sequencing for association studies.
Collapse
|
171
|
Sudmant PH, Mallick S, Nelson BJ, Hormozdiari F, Krumm N, Huddleston J, Coe BP, Baker C, Nordenfelt S, Bamshad M, Jorde LB, Posukh OL, Sahakyan H, Watkins WS, Yepiskoposyan L, Abdullah MS, Bravi CM, Capelli C, Hervig T, Wee JTS, Tyler-Smith C, van Driem G, Romero IG, Jha AR, Karachanak-Yankova S, Toncheva D, Comas D, Henn B, Kivisild T, Ruiz-Linares A, Sajantila A, Metspalu E, Parik J, Villems R, Starikovskaya EB, Ayodo G, Beall CM, Di Rienzo A, Hammer MF, Khusainova R, Khusnutdinova E, Klitz W, Winkler C, Labuda D, Metspalu M, Tishkoff SA, Dryomov S, Sukernik R, Patterson N, Reich D, Eichler EE. Global diversity, population stratification, and selection of human copy-number variation. Science 2015; 349:aab3761. [PMID: 26249230 PMCID: PMC4568308 DOI: 10.1126/science.aab3761] [Citation(s) in RCA: 231] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2015] [Accepted: 07/29/2015] [Indexed: 12/14/2022]
Abstract
In order to explore the diversity and selective signatures of duplication and deletion human copy-number variants (CNVs), we sequenced 236 individuals from 125 distinct human populations. We observed that duplications exhibit fundamentally different population genetic and selective signatures than deletions and are more likely to be stratified between human populations. Through reconstruction of the ancestral human genome, we identify megabases of DNA lost in different human lineages and pinpoint large duplications that introgressed from the extinct Denisova lineage now found at high frequency exclusively in Oceanic populations. We find that the proportion of CNV base pairs to single-nucleotide-variant base pairs is greater among non-Africans than it is among African populations, but we conclude that this difference is likely due to unique aspects of non-African population history as opposed to differences in CNV load.
Collapse
Affiliation(s)
- Peter H Sudmant
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Swapan Mallick
- Broad Institute of Massachusetts Institute of Technology (MIT) and Harvard, Cambridge, MA 02142, USA. Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Bradley J Nelson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | | | - Niklas Krumm
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - John Huddleston
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Bradley P Coe
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Carl Baker
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Susanne Nordenfelt
- Broad Institute of Massachusetts Institute of Technology (MIT) and Harvard, Cambridge, MA 02142, USA. Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Michael Bamshad
- Department of Pediatrics, University of Washington, Seattle, WA 98119, USA
| | - Lynn B Jorde
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112, USA
| | - Olga L Posukh
- Institute of Cytology and Genetics, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia. Novosibirsk State University, Novosibirsk 630090, Russia
| | - Hovhannes Sahakyan
- Estonian Biocentre, Evolutionary Biology Group, Tartu 51010, Estonia. Laboratory of Ethnogenomics, Institute of Molecular Biology, National Academy of Sciences of Armenia, Yerevan 0014, Armenia
| | - W Scott Watkins
- Department of Human Genetics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
| | - Levon Yepiskoposyan
- Laboratory of Ethnogenomics, Institute of Molecular Biology, National Academy of Sciences of Armenia, Yerevan 0014, Armenia
| | - M Syafiq Abdullah
- Raja Isteri Pengiran Anak Saleha (RIPAS) Hospital, Bandar Seri Begawan, Brunei Darussalam
| | - Claudio M Bravi
- Laboratorio de Genética Molecular Poblacional, Instituto Multidisciplinario de Biología Celular (IMBICE), Centro Científico y Tecnológico-Consejo Nacional de Investigaciones Científicas y Técnicas (CCT-CONICET) and Comisión de Investigaciones Científicas de la Provincia de Buenos Aires (CICPBA), La Plata B1906APO, Argentina
| | | | - Tor Hervig
- Department of Clinical Science, University of Bergen, Bergen 5021, Norway
| | | | - Chris Tyler-Smith
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
| | - George van Driem
- Institute of Linguistics, University of Bern, Bern CH-3012, Switzerland
| | | | - Aashish R Jha
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Sena Karachanak-Yankova
- Department of Medical Genetics, National Human Genome Center, Medical University Sofia, Sofia 1431, Bulgaria
| | - Draga Toncheva
- Department of Medical Genetics, National Human Genome Center, Medical University Sofia, Sofia 1431, Bulgaria
| | - David Comas
- Institut de Biologia Evolutiva [Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra (CSIC-UPF)], Departament de Ciències Experimentals i de la Salut, UPF, Barcelona 08003, Spain
| | - Brenna Henn
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
| | - Toomas Kivisild
- Division of Biological Anthropology, University of Cambridge, Fitzwilliam Street, Cambridge CB2 1QH, UK
| | - Andres Ruiz-Linares
- Department of Genetics, Evolution and Environment, University College London, WC1E 6BT, UK
| | - Antti Sajantila
- University of Helsinki, Department of Forensic Medicine, Helsinki 00014, Finland
| | - Ene Metspalu
- Estonian Biocentre, Evolutionary Biology Group, Tartu 51010, Estonia. University of Tartu, Department of Evolutionary Biology, Tartu 5101, Estonia
| | - Jüri Parik
- Estonian Biocentre, Evolutionary Biology Group, Tartu 51010, Estonia
| | - Richard Villems
- Estonian Biocentre, Evolutionary Biology Group, Tartu 51010, Estonia
| | - Elena B Starikovskaya
- Laboratory of Human Molecular Genetics, Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - George Ayodo
- Center for Global Health and Child Development, Kisumu 40100, Kenya
| | - Cynthia M Beall
- Department of Anthropology, Case Western Reserve University, Cleveland, OH 44106-7125, USA
| | - Anna Di Rienzo
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Michael F Hammer
- Arizona Research Laboratories Division of Biotechnology, University of Arizona, Tucson, AZ 85721, USA
| | - Rita Khusainova
- Institute of Biochemistry and Genetics, Ufa Research Centre, Russian Academy of Sciences, Ufa 450054, Russia. Department of Genetics and Fundamental Medicine, Bashkir State University, Ufa 450074, Russia
| | - Elza Khusnutdinova
- Institute of Biochemistry and Genetics, Ufa Research Centre, Russian Academy of Sciences, Ufa 450054, Russia. Department of Genetics and Fundamental Medicine, Bashkir State University, Ufa 450074, Russia
| | - William Klitz
- Integrative Biology, University of California, Berkeley, CA 94720-3140, USA
| | - Cheryl Winkler
- Basic Research Laboratory, Center for Cancer Research, National Cancer Institute, Leidos Biomedical Research, Incorporated, Frederick National Laboratory, Frederick, MD 21702, USA
| | - Damian Labuda
- Centre Hospitalier Universitaire (CHU) Sainte-Justine, Département de Pédiatrie, Université de Montréal, QC H3T 1C5, Canada
| | - Mait Metspalu
- Estonian Biocentre, Evolutionary Biology Group, Tartu 51010, Estonia
| | - Sarah A Tishkoff
- Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Stanislav Dryomov
- Laboratory of Human Molecular Genetics, Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia. Department of Paleolithic Archaeology, Institute of Archaeology and Ethnography, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - Rem Sukernik
- Laboratory of Human Molecular Genetics, Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, Novosibirsk 630090, Russia. Altai State University, Barnaul 656000, Russia
| | - Nick Patterson
- Broad Institute of Massachusetts Institute of Technology (MIT) and Harvard, Cambridge, MA 02142, USA. Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - David Reich
- Broad Institute of Massachusetts Institute of Technology (MIT) and Harvard, Cambridge, MA 02142, USA. Department of Genetics, Harvard Medical School, Boston, MA 02115, USA. Howard Hughes Medical Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA. Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
172
|
Balick DJ, Do R, Cassa CA, Reich D, Sunyaev SR. Dominance of Deleterious Alleles Controls the Response to a Population Bottleneck. PLoS Genet 2015; 11:e1005436. [PMID: 26317225 PMCID: PMC4552954 DOI: 10.1371/journal.pgen.1005436] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2014] [Accepted: 07/09/2015] [Indexed: 11/30/2022] Open
Abstract
Population bottlenecks followed by re-expansions have been common throughout history of many populations. The response of alleles under selection to such demographic perturbations has been a subject of great interest in population genetics. On the basis of theoretical analysis and computer simulations, we suggest that this response qualitatively depends on dominance. The number of dominant or additive deleterious alleles per haploid genome is expected to be slightly increased following the bottleneck and re-expansion. In contrast, the number of completely or partially recessive alleles should be sharply reduced. Changes of population size expose differences between recessive and additive selection, potentially providing insight into the prevalence of dominance in natural populations. Specifically, we use a simple statistic, BR≡∑xipop1/∑xjpop2, where xi represents the derived allele frequency, to compare the number of mutations in different populations, and detail its functional dependence on the strength of selection and the intensity of the population bottleneck. We also provide empirical evidence showing that gene sets associated with autosomal recessive disease in humans may have a BR indicative of recessive selection. Together, these theoretical predictions and empirical observations show that complex demographic history may facilitate rather than impede inference of parameters of natural selection. Dominance has played a central role in classical genetics since its inception. However, the effect of dominance introduces substantial technical complications into theoretical models describing dynamics of alleles in populations. As a result, dominance is often ignored in population genetic models. Statistical tests for selection built on these models do not discriminate between recessive and additive alleles. We show that historical changes in population size can provide a way to differentiate between recessive and additive selection. Our analysis compares two sub-populations with different demographic histories. History of our own species provides plenty of examples of sub-populations that went through population bottlenecks followed by re-expansions. We show that demographic differences, which generally complicate the analysis, can instead aid in the inference of features of natural selection.
Collapse
Affiliation(s)
- Daniel J. Balick
- Division of Genetics, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
| | - Ron Do
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
- The Center for Statistical Genetics, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Christopher A. Cassa
- Division of Genetics, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
| | - David Reich
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- Howard Hughes Medical Institute, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Shamil R. Sunyaev
- Division of Genetics, Brigham and Women’s Hospital, Boston, Massachusetts, United States of America
- Department of Medicine, Harvard Medical School, Boston, Massachusetts, United States of America
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
173
|
Sánchez-Quinto F, Lalueza-Fox C. Almost 20 years of Neanderthal palaeogenetics: adaptation, admixture, diversity, demography and extinction. Philos Trans R Soc Lond B Biol Sci 2015; 370:20130374. [PMID: 25487326 DOI: 10.1098/rstb.2013.0374] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Nearly two decades since the first retrieval of Neanderthal DNA, recent advances in next-generation sequencing technologies have allowed the generation of high-coverage genomes from two archaic hominins, a Neanderthal and a Denisovan, as well as a complete mitochondrial genome from remains which probably represent early members of the Neanderthal lineage. This genomic information, coupled with diversity exome data from several Neanderthal specimens is shedding new light on evolutionary processes such as the genetic basis of Neanderthal and modern human-specific adaptations-including morphological and behavioural traits-as well as the extent and nature of the admixture events between them. An emerging picture is that Neanderthals had a long-term small population size, lived in small and isolated groups and probably practised inbreeding at times. Deleterious genetic effects associated with these demographic factors could have played a role in their extinction. The analysis of DNA from further remains making use of new large-scale hybridization-capture-based methods as well as of new approaches to discriminate contaminant DNA sequences will provide genetic information in spatial and temporal scales that could help clarify the Neanderthal's-and our very own-evolutionary history.
Collapse
Affiliation(s)
- Federico Sánchez-Quinto
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Carles Lalueza-Fox
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Dr. Aiguader 88, 08003 Barcelona, Spain
| |
Collapse
|
174
|
Lohmueller KE. The distribution of deleterious genetic variation in human populations. Curr Opin Genet Dev 2015; 29:139-46. [PMID: 25461617 DOI: 10.1016/j.gde.2014.09.005] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Revised: 08/28/2014] [Accepted: 09/05/2014] [Indexed: 11/19/2022]
Abstract
Population genetic studies suggest that most amino-acid changing mutations are deleterious. Such mutations are of tremendous interest in human population genetics as they are important for the evolutionary process and may contribute risk to common disease. Genomic studies over the past 5 years have documented differences across populations in the number of heterozygous deleterious genotypes, number of homozygous derived deleterious genotypes, number of deleterious segregating sites and proportion of sites that are potentially deleterious. These differences have been attributed to population history affecting the ability of natural selection to remove deleterious variants from the population. However, recent studies have suggested that the genetic load is the same across populations and that the efficacy of natural selection has not differed across human populations. Here I show that these observations are not incompatible with each other and that the apparent differences are due to examining different features of the genetic data and differing definitions of terms.
Collapse
|
175
|
The role of climate and out-of-Africa migration in the frequencies of risk alleles for 21 human diseases. BMC Genet 2015; 16:81. [PMID: 26170196 PMCID: PMC4501093 DOI: 10.1186/s12863-015-0239-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2015] [Accepted: 06/24/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Demography and environmental adaptation can affect the global distribution of genetic variants and possibly the distribution of disease. Population heterozygosity of single nucleotide polymorphisms has been shown to decrease strongly with distance from Africa and this has been attributed to the effect of serial founding events during the migration of humans out of Africa. Additionally, population allele frequencies have been shown to change due to environmental adaptation. Here, we investigate the relationship of Out-of-Africa migration and climatic variables to the distribution of risk alleles for 21 diseases. RESULTS For each disease, we computed the regression of average heterozygosity and average allele frequency of the risk alleles with distance from Africa and 9 environmental variables. We compared these regressions to a null distribution created by regressing statistics for SNPs not associated with disease on distance from Africa and these environmental variables. Additionally, we used Bayenv 2.0 to assess the signal of environmental adaptation associated with individual risk SNPs. For those SNPs in HGDP and HapMap that are risk alleles for type 2 diabetes, we cannot reject that their distribution is as expected from Out-of-Africa migration. However, the allelic statistics for many other diseases correlate more closely with environmental variables than would be expected from the serial founder effect and show signals of environmental adaptation. We report strong environmental interactions with several autoimmune diseases, and note a particularly strong interaction between asthma and summer humidity. Additionally, we identified several risk genes with strong environmental associations. CONCLUSIONS For most diseases, migration does not explain the distribution of risk alleles and the worldwide pattern of allele frequencies for some diseases may be better explained by environmental associations, which suggests that some selection has acted on these diseases.
Collapse
|
176
|
Kehdy FSG, Gouveia MH, Machado M, Magalhães WCS, Horimoto AR, Horta BL, Moreira RG, Leal TP, Scliar MO, Soares-Souza GB, Rodrigues-Soares F, Araújo GS, Zamudio R, Sant Anna HP, Santos HC, Duarte NE, Fiaccone RL, Figueiredo CA, Silva TM, Costa GNO, Beleza S, Berg DE, Cabrera L, Debortoli G, Duarte D, Ghirotto S, Gilman RH, Gonçalves VF, Marrero AR, Muniz YC, Weissensteiner H, Yeager M, Rodrigues LC, Barreto ML, Lima-Costa MF, Pereira AC, Rodrigues MR, Tarazona-Santos E. Origin and dynamics of admixture in Brazilians and its effect on the pattern of deleterious mutations. Proc Natl Acad Sci U S A 2015; 112:8696-701. [PMID: 26124090 PMCID: PMC4507185 DOI: 10.1073/pnas.1504447112] [Citation(s) in RCA: 171] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
While South Americans are underrepresented in human genomic diversity studies, Brazil has been a classical model for population genetics studies on admixture. We present the results of the EPIGEN Brazil Initiative, the most comprehensive up-to-date genomic analysis of any Latin-American population. A population-based genome-wide analysis of 6,487 individuals was performed in the context of worldwide genomic diversity to elucidate how ancestry, kinship, and inbreeding interact in three populations with different histories from the Northeast (African ancestry: 50%), Southeast, and South (both with European ancestry >70%) of Brazil. We showed that ancestry-positive assortative mating permeated Brazilian history. We traced European ancestry in the Southeast/South to a wider European/Middle Eastern region with respect to the Northeast, where ancestry seems restricted to Iberia. By developing an approximate Bayesian computation framework, we infer more recent European immigration to the Southeast/South than to the Northeast. Also, the observed low Native-American ancestry (6-8%) was mostly introduced in different regions of Brazil soon after the European Conquest. We broadened our understanding of the African diaspora, the major destination of which was Brazil, by revealing that Brazilians display two within-Africa ancestry components: one associated with non-Bantu/western Africans (more evident in the Northeast and African Americans) and one associated with Bantu/eastern Africans (more present in the Southeast/South). Furthermore, the whole-genome analysis of 30 individuals (42-fold deep coverage) shows that continental admixture rather than local post-Columbian history is the main and complex determinant of the individual amount of deleterious genotypes.
Collapse
Affiliation(s)
- Fernanda S G Kehdy
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Mateus H Gouveia
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Moara Machado
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Wagner C S Magalhães
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Andrea R Horimoto
- Instituto do Coração, Universidade de São Paulo, 05403-900, São Paulo, Sao Paulo, Brazil
| | - Bernardo L Horta
- Programa de Pós-Graduação em Epidemiologia, Universidade Federal de Pelotas, 464, 96001-970 Pelotas, Rio Grande do Sul, Brazil
| | - Rennan G Moreira
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Thiago P Leal
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Marilia O Scliar
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Giordano B Soares-Souza
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Fernanda Rodrigues-Soares
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Gilderlanio S Araújo
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Roxana Zamudio
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Hanaisa P Sant Anna
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Hadassa C Santos
- Instituto do Coração, Universidade de São Paulo, 05403-900, São Paulo, Sao Paulo, Brazil
| | - Nubia E Duarte
- Instituto do Coração, Universidade de São Paulo, 05403-900, São Paulo, Sao Paulo, Brazil
| | - Rosemeire L Fiaccone
- Departamento de Estatística, Instituto de Matemática, Universidade Federal da Bahia, 40170-110, Salvador, Bahia, Brazil
| | - Camila A Figueiredo
- Departamento de Ciências da Biointeração, Instituto de Ciências da Saúde, Universidade Federal da Bahia, 40110-100, Salvador, Bahia, Brazil
| | - Thiago M Silva
- Instituto de Saúde Coletiva, Universidade Federal da Bahia, 40110-040, Salvador, Bahia, Brazil
| | - Gustavo N O Costa
- Instituto de Saúde Coletiva, Universidade Federal da Bahia, 40110-040, Salvador, Bahia, Brazil
| | - Sandra Beleza
- Department of Genetics, University of Leicester, LE1 7RH, Leicester, United Kingdom
| | - Douglas E Berg
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, MO 63110; Department of Medicine, University of California, San Diego, CA 92093
| | - Lilia Cabrera
- Biomedical Research Unit, Asociación Benéfica Proyectos en Informática, Salud, Medicina y Agricultura (AB PRISMA), 170070, Lima, Peru
| | - Guilherme Debortoli
- Departamento de Biologia Celular, Embriologia e Genética, Universidade Federal de Santa Catarina, 88040-900, Florianópolis, Santa Catarina, Brazil
| | - Denise Duarte
- Departamento de Estatística, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Silvia Ghirotto
- Dipartimento di Scienze della Vita e Biotecnologie, Università di Ferrara, 44121 Ferrara, Italy
| | - Robert H Gilman
- Bloomberg School of Public Health, International Health, Johns Hopkins University, Baltimore, MD 21205; Laboratorio de Investigación de Enfermedades Infecciosas, Universidade Peruana Cayetano Heredia, 15102, Lima, Peru
| | - Vanessa F Gonçalves
- Department of Psychiatry and Neuroscience Section, Center for Addiction and Mental Health, University of Toronto, Toronto, ON, Canada M5T 1R8
| | - Andrea R Marrero
- Departamento de Biologia Celular, Embriologia e Genética, Universidade Federal de Santa Catarina, 88040-900, Florianópolis, Santa Catarina, Brazil
| | - Yara C Muniz
- Departamento de Biologia Celular, Embriologia e Genética, Universidade Federal de Santa Catarina, 88040-900, Florianópolis, Santa Catarina, Brazil
| | - Hansi Weissensteiner
- Division of Genetic Epidemiology, Department of Medical Genetics, Molecular and Clinical Pharmacology, Innsbruck Medical University, 6020 Innsbruck, Austria
| | - Meredith Yeager
- Cancer Genomics Research Laboratory, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, Frederick, MD 20850
| | - Laura C Rodrigues
- Department of Infectious Disease Epidemiology, Faculty of Epidemiology, London School of Hygiene and Tropical Medicine, London WC1E 7HT, United Kingdom
| | - Mauricio L Barreto
- Instituto de Saúde Coletiva, Universidade Federal da Bahia, 40110-040, Salvador, Bahia, Brazil
| | - M Fernanda Lima-Costa
- Instituto de Pesquisa Rene Rachou, Fundação Oswaldo Cruz, 30190-002, Belo Horizonte, Minas Gerais, Brazil
| | - Alexandre C Pereira
- Instituto do Coração, Universidade de São Paulo, 05403-900, São Paulo, Sao Paulo, Brazil
| | - Maíra R Rodrigues
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil
| | - Eduardo Tarazona-Santos
- Departamento de Biologia Geral, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, 31270-901, Belo Horizonte, Minas Gerais, Brazil;
| |
Collapse
|
177
|
Siegert S, Wolf A, Cooper DN, Krawczak M, Nothnagel M. Mutations Causing Complex Disease May under Certain Circumstances Be Protective in an Epidemiological Sense. PLoS One 2015; 10:e0132150. [PMID: 26161957 PMCID: PMC4498598 DOI: 10.1371/journal.pone.0132150] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Accepted: 06/10/2015] [Indexed: 01/19/2023] Open
Abstract
Guided by the practice of classical epidemiology, research into the genetic basis of complex disease has usually taken for granted the dictum that causative mutations are invariably over-represented among clinically affected as compared to unaffected individuals. However, we show that this supposition is not true and that a mutation contributing to the etiology of a complex disease can, under certain circumstances, be depleted among patients. Populations with defined disease prevalence were repeatedly simulated under a Wright-Fisher model, assuming various types of population history and genotype-phenotype relationship. For each simulation, the resulting mutation-specific population frequencies and odds ratios (ORs) were evaluated. In addition, the relationship between mutation frequency and OR was studied using real data from the NIH GWAS catalogue of reported phenotype associations of single-nucleotide polymorphisms (SNPs). While rare diseases (prevalence <1%) were found to be consistently caused by rare mutations with ORs>1, up to 20% of mutations causing a pandemic disease (prevalence 10-20%) had ORs<1, and their population frequency ranged from 0% to 100%. Moreover, simulation-based ORs exhibited a wide distribution, irrespective of mutation frequency. In conclusion, a substantial proportion of mutations causing common complex diseases may appear 'protective' in genetic epidemiological studies and hence would normally tend to be excluded, albeit erroneously, from further study. This apparently paradoxical result is explicable in terms of mutual confounding of the respective genotype-phenotype relationships due to a negative correlation between causal mutations induced by their common gene genealogy. As would be predicted by our findings, a significant negative correlation became apparent in published genome-wide association studies between the OR of genetic variants associated with a particular disease and the prevalence of that disease.
Collapse
Affiliation(s)
- Sabine Siegert
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
- Institute of Epidemiology, Christian-Albrechts University, Kiel, Germany
| | - Andreas Wolf
- Institute of Medical Informatics and Statistics, Christian-Albrechts University, Kiel, Germany
| | - David N. Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | - Michael Krawczak
- Institute of Medical Informatics and Statistics, Christian-Albrechts University, Kiel, Germany
| | - Michael Nothnagel
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
- Institute of Medical Informatics and Statistics, Christian-Albrechts University, Kiel, Germany
- * E-mail:
| |
Collapse
|
178
|
Connallon T, Clark AG. The distribution of fitness effects in an uncertain world. Evolution 2015; 69:1610-1618. [PMID: 25913128 DOI: 10.1111/evo.12673] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2014] [Accepted: 04/17/2015] [Indexed: 12/25/2022]
Abstract
The distribution of fitness effects (DFE) among new mutations plays a critical role in adaptive evolution and the maintenance of genetic variation. Although fitness landscape models predict several key features of the DFE, most theory to date focuses on predictable environmental conditions, while ignoring stochastic environmental fluctuations that feature prominently in the ecology of many organisms. Here, we derive an extension of Fisher's geometric model that incorporates two common effects of environmental variation: (1) nonadaptive genotype-by-environment interactions (G × E), in which the phenotype of a given genotype varies across environmental contexts; and (2) random fluctuation of the fitness optimum, which generates fluctuating selection. We show that both factors cause a mismatch between the DFE within single generations and the distribution of geometric mean fitness effects (averaged over multiple generations) that governs long-term evolutionary change. Such mismatches permit strong evolutionary constraints-despite an abundance of beneficial fitness variation within single environmental contexts-and to conflicting DFE estimates from direct versus indirect inference methods. Finally, our results suggest an intriguing parallel between the genetics and ecology of evolutionary constraints, with environmental fluctuations and pleiotropy placing qualitatively similar limits on the availability of adaptive genetic variation.
Collapse
Affiliation(s)
- Tim Connallon
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, 14853-2703.,School of Biological Sciences, Monash University, Clayton, Victoria, 3800, Australia
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, 14853-2703
| |
Collapse
|
179
|
Abstract
Next-generation sequencing technology has facilitated the discovery of millions of genetic variants in human genomes. A sizeable fraction of these variants are predicted to be deleterious. Here, we review the pattern of deleterious alleles as ascertained in genome sequencing data sets and ask whether human populations differ in their predicted burden of deleterious alleles - a phenomenon known as mutation load. We discuss three demographic models that are predicted to affect mutation load and relate these models to the evidence (or the lack thereof) for variation in the efficacy of purifying selection in diverse human genomes. We also emphasize why accurate estimation of mutation load depends on assumptions regarding the distribution of dominance and selection coefficients - quantities that remain poorly characterized for current genomic data sets.
Collapse
|
180
|
Renaut S, Rieseberg LH. The Accumulation of Deleterious Mutations as a Consequence of Domestication and Improvement in Sunflowers and Other Compositae Crops. Mol Biol Evol 2015; 32:2273-83. [PMID: 25939650 DOI: 10.1093/molbev/msv106] [Citation(s) in RCA: 97] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
For populations to maintain optimal fitness, harmful mutations must be efficiently purged from the genome. Yet, under circumstances that diminish the effectiveness of natural selection, such as the process of plant and animal domestication, deleterious mutations are predicted to accumulate. Here, we compared the load of deleterious mutations in 21 accessions from natural populations and 19 domesticated accessions of the common sunflower using whole-transcriptome single nucleotide polymorphism data. Although we find that genetic diversity has been greatly reduced during domestication, the remaining mutations were disproportionally biased toward nonsynonymous substitutions. Bioinformatically predicted deleterious mutations affecting protein function were especially strongly over-represented. We also identify similar patterns in two other domesticated species of the sunflower family (globe artichoke and cardoon), indicating that this phenomenon is not due to idiosyncrasies of sunflower domestication or the sunflower genome. Finally, we provide unequivocal evidence that deleterious mutations accumulate in low recombining regions of the genome, due to the reduced efficacy of purifying selection. These results represent a conundrum for crop improvement efforts. Although the elimination of harmful mutations should be a long-term goal of plant and animal breeding programs, it will be difficult to weed them out because of limited recombination.
Collapse
Affiliation(s)
- Sebastien Renaut
- Biodiversity Research Centre and Department of Botany, University of British Columbia, Vancouver, BC, Canada
| | - Loren H Rieseberg
- Biodiversity Research Centre and Department of Botany, University of British Columbia, Vancouver, BC, Canada Department of Biology, Indiana University - Bloomington
| |
Collapse
|
181
|
Hadjixenofontos A, Gourraud PA, Bakthavachalam V, Foco L, Ticca A, Bitti P, Pastorino R, Bernardinelli L, McCauley JL. Enrichment for Northern European-derived multiple sclerosis risk alleles in Sardinia. Mult Scler 2015; 21:1396-403. [DOI: 10.1177/1352458515581872] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2014] [Accepted: 03/15/2015] [Indexed: 12/25/2022]
Abstract
Background: The list of genomic loci associated with multiple sclerosis (MS) susceptibility outside the major histocompatibility complex (MHC) in patients of Northern European (NE) ancestry has increased to 103. Despite the extraordinarily high MS prevalence in the isolated Sardinian population, the contribution of genetic risk factors to MS in Sardinia is largely not understood. Objective: The objective of this paper is to examine the relevance of non-MHC MS susceptibility variants in Sardinia. Methods: We examined a log-additive MS-specific genetic burden score (MSGB) using 110 NE-derived risk alleles in a dataset of 75 Sardinian cases, 346 Sardinian controls and 177 cases and 1967 controls from the United States (US). Results: Sardinian cases demonstrate a heavier non-MHC MSGB load than Sardinian controls and US cases ( p = 2E-06, p = 1E-06, respectively). Furthermore, Sardinian controls carry a heavier burden than US controls ( p = 2E-14). Our results confirm the limited ability of the 110-SNP MSGB to predict disease status in Sardinia (AUROC = 0.629). Conclusions: Risk alleles discovered in samples of NE ancestry are relevant to MS in Sardinia. Our results suggest a general enrichment of MS susceptibility alleles in Sardinians, encouraging the pursuit of further studies of MS in this population.
Collapse
Affiliation(s)
- A Hadjixenofontos
- John P. Hussman Institute for Human Genomics and Dr John T Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, USA
| | - PA Gourraud
- Department of Neurology, School of Medicine, University of California at San Francisco, USA
| | - V Bakthavachalam
- Department of Neurology, School of Medicine, University of California at San Francisco, USA
| | - L Foco
- Department of Brain and Behavioral Sciences, University of Pavia, Italy
| | - A Ticca
- Divisione di Neurologia, Ospedale S. Francesco, Italy
| | - P Bitti
- Immunohaematology and Blood Transfusion Department, Ospedale S. Francesco, Italy
| | - R Pastorino
- Department of Brain and Behavioral Sciences, University of Pavia, Italy
| | - L Bernardinelli
- Department of Brain and Behavioral Sciences, University of Pavia, Italy/Department of Brain and Behavioral Sciences, University of Pavia, Italy Centre for Biostatistics, Institute of Population Health, University of Manchester, UK
| | - JL McCauley
- John P. Hussman Institute for Human Genomics and Dr John T Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, USA
| |
Collapse
|
182
|
Peischl S, Excoffier L. Expansion load: recessive mutations and the role of standing genetic variation. Mol Ecol 2015; 24:2084-94. [DOI: 10.1111/mec.13154] [Citation(s) in RCA: 121] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Revised: 03/11/2015] [Accepted: 03/11/2015] [Indexed: 12/13/2022]
Affiliation(s)
- Stephan Peischl
- Institute of Ecology and Evolution; University of Berne; Berne 3012 Switzerland
- Swiss Institute of Bioinformatics; Lausanne 1015 Switzerland
| | - Laurent Excoffier
- Institute of Ecology and Evolution; University of Berne; Berne 3012 Switzerland
- Swiss Institute of Bioinformatics; Lausanne 1015 Switzerland
| |
Collapse
|
183
|
McCoy RC, Demko Z, Ryan A, Banjevic M, Hill M, Sigurjonsson S, Rabinowitz M, Fraser HB, Petrov DA. Common variants spanning PLK4 are associated with mitotic-origin aneuploidy in human embryos. Science 2015; 348:235-8. [PMID: 25859044 DOI: 10.1126/science.aaa3337] [Citation(s) in RCA: 89] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Aneuploidy, the inheritance of an atypical chromosome complement, is common in early human development and is the primary cause of pregnancy loss. By screening day-3 embryos during in vitro fertilization cycles, we identified an association between aneuploidy of putative mitotic origin and linked genetic variants on chromosome 4 of maternal genomes. This associated region contains a candidate gene, Polo-like kinase 4 (PLK4), that plays a well-characterized role in centriole duplication and has the ability to alter mitotic fidelity upon minor dysregulation. Mothers with the high-risk genotypes contributed fewer embryos for testing at day 5, suggesting that their embryos are less likely to survive to blastocyst formation. The associated region coincides with a signature of a selective sweep in ancient humans, suggesting that the causal variant was either the target of selection or hitchhiked to substantial frequency.
Collapse
Affiliation(s)
- Rajiv C McCoy
- Department of Biology, Stanford University, Stanford, CA, USA
| | | | | | | | | | | | | | - Hunter B Fraser
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| |
Collapse
|
184
|
Osada N, Hettiarachchi N, Adeyemi Babarinde I, Saitou N, Blancher A. Whole-genome sequencing of six Mauritian Cynomolgus macaques (Macaca fascicularis) reveals a genome-wide pattern of polymorphisms under extreme population bottleneck. Genome Biol Evol 2015; 7:821-30. [PMID: 25805843 PMCID: PMC5322541 DOI: 10.1093/gbe/evv033] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Cynomolgus macaques (Macaca fascicularis) were introduced to the island of Mauritius by humans around the 16th century. The unique demographic history of the Mauritian cynomolgus macaques provides the opportunity to not only examine the genetic background of well-established nonhuman primates for biomedical research but also understand the effect of an extreme population bottleneck on the pattern of polymorphisms in genomes. We sequenced the whole genomes of six Mauritian cynomolgus macaques and obtained an average of 20-fold coverage of the genome sequences for each individual. The overall level of nucleotide diversity was 23% smaller than that of the Malaysian cynomolgus macaques, and a reduction of low-frequency polymorphisms was observed. In addition, we also confirmed that the Mauritian cynomolgus macaques were genetically closer to a representative of the Malaysian population than to a representative of the Indochinese population. Excess of nonsynonymous polymorphisms in low frequency, which has been observed in many other species, was not very strong in the Mauritian samples, and the proportion of heterozygous nonsynonymous polymorphisms relative to synonymous polymorphisms is higher within individuals in Mauritian than Malaysian cynomolgus macaques. Those patterns indicate that the extreme population bottleneck made purifying selection overwhelmed by the power of genetic drift in the population. Finally, we estimated the number of founding individuals by using the genome-wide site frequency spectrum of the six samples. Assuming a simple demographic scenario with a single bottleneck followed by exponential growth, the estimated number of founders (∼20 individuals) is largely consistent with previous estimates.
Collapse
Affiliation(s)
- Naoki Osada
- Division of Evolutionary Genetics, National Institute of Genetics, Mishima, Japan Department of Genetics, SOKENDAI (The Graduate University for Advanced Studies), Mishima, Japan
| | - Nilmini Hettiarachchi
- Department of Genetics, SOKENDAI (The Graduate University for Advanced Studies), Mishima, Japan Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| | - Isaac Adeyemi Babarinde
- Department of Genetics, SOKENDAI (The Graduate University for Advanced Studies), Mishima, Japan Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| | - Naruya Saitou
- Department of Genetics, SOKENDAI (The Graduate University for Advanced Studies), Mishima, Japan Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| | - Antoine Blancher
- Laboratoire d'Immunogénétique Moléculaire (LIMT, EA3034), Faculté de Médecine Purpan, Université Paul Sabatier, Toulouse III, France
| |
Collapse
|
185
|
Complex trait architecture: the pleiotropic model revisited. Sci Rep 2015; 5:9351. [PMID: 25792462 PMCID: PMC4366851 DOI: 10.1038/srep09351] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Accepted: 02/20/2015] [Indexed: 01/30/2023] Open
Abstract
There is currently much debate about how much the genetic heritability of complex traits is due to very rare alleles. This issue is important because it determines sampling strategies for genetic association studies. Several recent theoretical papers based on a pleiotropic model for trait evolution suggest that it is possible that a large proportion of the genetic variance could be explained by rare alleles. This model assumes that mutations with a large effect on fitness also tend to have large positive or negative effects on phenotypic traits. We show that conclusions based on standard diffusion results are generally applicable to simulations of whole genomes with overlapping generations in a finite population, although the variance contribution of rare alleles is somewhat smaller than theoretical predictions. We show that under many scenarios the pleiotropic model predicts trait distributions that are unrealistically leptokurtic. We argue that this imposes a limit on the relationship between fitness and trait effects.
Collapse
|
186
|
Tung J, Zhou X, Alberts SC, Stephens M, Gilad Y. The genetic architecture of gene expression levels in wild baboons. eLife 2015; 4. [PMID: 25714927 PMCID: PMC4383332 DOI: 10.7554/elife.04729] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 02/03/2015] [Indexed: 12/19/2022] Open
Abstract
Primate evolution has been argued to result, in part, from changes in how genes are regulated. However, we still know little about gene regulation in natural primate populations. We conducted an RNA sequencing (RNA-seq)-based study of baboons from an intensively studied wild population. We performed complementary expression quantitative trait locus (eQTL) mapping and allele-specific expression analyses, discovering substantial evidence for, and surprising power to detect, genetic effects on gene expression levels in the baboons. eQTL were most likely to be identified for lineage-specific, rapidly evolving genes; interestingly, genes with eQTL significantly overlapped between baboons and a comparable human eQTL data set. Our results suggest that genes vary in their tolerance of genetic perturbation, and that this property may be conserved across species. Further, they establish the feasibility of eQTL mapping using RNA-seq data alone, and represent an important step towards understanding the genetic architecture of gene expression in primates.
Collapse
Affiliation(s)
- Jenny Tung
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - Xiang Zhou
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - Susan C Alberts
- Institute of Primate Research, National Museums of Kenya, Nairobi, Kenya
| | - Matthew Stephens
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, United States
| |
Collapse
|
187
|
An estimate of the average number of recessive lethal mutations carried by humans. Genetics 2015; 199:1243-54. [PMID: 25697177 PMCID: PMC4391560 DOI: 10.1534/genetics.114.173351] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 02/11/2015] [Indexed: 12/27/2022] Open
Abstract
The effects of inbreeding on human health depend critically on the number and severity of recessive, deleterious mutations carried by individuals. In humans, existing estimates of these quantities are based on comparisons between consanguineous and nonconsanguineous couples, an approach that confounds socioeconomic and genetic effects of inbreeding. To overcome this limitation, we focused on a founder population that practices a communal lifestyle, for which there is almost complete Mendelian disease ascertainment and a known pedigree. Focusing on recessive lethal diseases and simulating allele transmissions, we estimated that each haploid set of human autosomes carries on average 0.29 (95% credible interval [0.10, 0.84]) recessive alleles that lead to complete sterility or death by reproductive age when homozygous. Comparison to existing estimates in humans suggests that a substantial fraction of the total burden imposed by recessive deleterious variants is due to single mutations that lead to sterility or death between birth and reproductive age. In turn, comparison to estimates from other eukaryotes points to a surprising constancy of the average number of recessive lethal mutations across organisms with markedly different genome sizes.
Collapse
|
188
|
Hussin JG, Hodgkinson A, Idaghdour Y, Grenier JC, Goulet JP, Gbeha E, Hip-Ki E, Awadalla P. Recombination affects accumulation of damaging and disease-associated mutations in human populations. Nat Genet 2015; 47:400-4. [DOI: 10.1038/ng.3216] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2014] [Accepted: 01/14/2015] [Indexed: 01/17/2023]
|
189
|
No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans. Nat Genet 2015; 47:126-31. [PMID: 25581429 PMCID: PMC4310772 DOI: 10.1038/ng.3186] [Citation(s) in RCA: 139] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2014] [Accepted: 12/09/2014] [Indexed: 01/05/2023]
Abstract
Non-African populations have experienced size reductions in the time since their split from West Africans, leading to the hypothesis that natural selection to remove weakly deleterious mutations has been less effective in the history of non-Africans. To test this hypothesis, we measured the per-genome accumulation of non-synonymous substitutions across diverse pairs of populations. We find no evidence for a higher load of deleterious mutations in non-Africans. However, we detect significant differences among more divergent populations, as archaic Denisovans have accumulated non-synonymous mutations faster than either modern humans or Neanderthals. To reconcile these findings with patterns that have been interpreted as evidence of less effective removal of deleterious mutations in non-Africans than in West Africans, we use simulations to show that the observed patterns are not likely to reflect changes in the effectiveness of selection after the populations split, and instead are likely to be driven by other population genetic factors.
Collapse
|
190
|
Bhaskar A, Wang YXR, Song YS. Efficient inference of population size histories and locus-specific mutation rates from large-sample genomic variation data. Genome Res 2015; 25:268-79. [PMID: 25564017 PMCID: PMC4315300 DOI: 10.1101/gr.178756.114] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
With the recent increase in study sample sizes in human genetics, there has been growing interest in inferring historical population demography from genomic variation data. Here, we present an efficient inference method that can scale up to very large samples, with tens or hundreds of thousands of individuals. Specifically, by utilizing analytic results on the expected frequency spectrum under the coalescent and by leveraging the technique of automatic differentiation, which allows us to compute gradients exactly, we develop a very efficient algorithm to infer piecewise-exponential models of the historical effective population size from the distribution of sample allele frequencies. Our method is orders of magnitude faster than previous demographic inference methods based on the frequency spectrum. In addition to inferring demography, our method can also accurately estimate locus-specific mutation rates. We perform extensive validation of our method on simulated data and show that it can accurately infer multiple recent epochs of rapid exponential growth, a signal that is difficult to pick up with small sample sizes. Lastly, we use our method to analyze data from recent sequencing studies, including a large-sample exome-sequencing data set of tens of thousands of individuals assayed at a few hundred genic regions.
Collapse
Affiliation(s)
- Anand Bhaskar
- Simons Institute for the Theory of Computing, Berkeley, California 94720, USA; Computer Science Division, University of California, Berkeley, California 94720, USA
| | - Y X Rachel Wang
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Yun S Song
- Simons Institute for the Theory of Computing, Berkeley, California 94720, USA; Computer Science Division, University of California, Berkeley, California 94720, USA; Department of Statistics, University of California, Berkeley, California 94720, USA; Department of Integrative Biology, University of California, Berkeley, California 94720, USA
| |
Collapse
|
191
|
The evolution of selfing is accompanied by reduced efficacy of selection and purging of deleterious mutations. Genetics 2014; 199:817-29. [PMID: 25552275 DOI: 10.1534/genetics.114.172809] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The transition from outcrossing to selfing is predicted to reduce the genome-wide efficacy of selection because of the lower effective population size (Ne) that accompanies this change in mating system. However, strongly recessive deleterious mutations exposed in the homozygous backgrounds of selfers should be under strong purifying selection. Here, we examine estimates of the distribution of fitness effects (DFE) and changes in the magnitude of effective selection coefficients (Nes) acting on mutations during the transition from outcrossing to selfing. Using forward simulations, we investigated the ability of a DFE inference approach to detect the joint influence of mating system and the dominance of deleterious mutations on selection efficacy. We investigated predictions from our simulations in the annual plant Eichhornia paniculata, in which selfing has evolved from outcrossing on multiple occasions. We used range-wide sampling to generate population genomic datasets and identified nonsynonymous and synonymous polymorphisms segregating in outcrossing and selfing populations. We found that the transition to selfing was accompanied by a change in the DFE, with a larger fraction of effectively neutral sites (Nes < 1), a result consistent with the effects of reduced Ne in selfers. Moreover, an increased proportion of sites in selfers were under strong purifying selection (Nes > 100), and simulations suggest that this is due to the exposure of recessive deleterious mutations. We conclude that the transition to selfing has been accompanied by the genome-wide influences of reduced Ne and strong purifying selection against deleterious recessive mutations, an example of purging at the molecular level.
Collapse
|
192
|
Zaitlen N, Pasaniuc B, Sankararaman S, Bhatia G, Zhang J, Gusev A, Young T, Tandon A, Pollack S, Vilhjálmsson BJ, Assimes TL, Berndt SI, Blot WJ, Chanock S, Franceschini N, Goodman PG, He J, Hennis AJM, Hsing A, Ingles SA, Isaacs W, Kittles RA, Klein EA, Lange LA, Nemesure B, Patterson N, Reich D, Rybicki BA, Stanford JL, Stevens VL, Strom SS, Whitsel EA, Witte JS, Xu J, Haiman C, Wilson JG, Kooperberg C, Stram D, Reiner AP, Tang H, Price AL. Leveraging population admixture to characterize the heritability of complex traits. Nat Genet 2014; 46:1356-62. [PMID: 25383972 PMCID: PMC4244251 DOI: 10.1038/ng.3139] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2014] [Accepted: 10/10/2014] [Indexed: 12/12/2022]
Abstract
Despite recent progress on estimating the heritability explained by genotyped SNPs (h(2)g), a large gap between h(2)g and estimates of total narrow-sense heritability (h(2)) remains. Explanations for this gap include rare variants or upward bias in family-based estimates of h(2) due to shared environment or epistasis. We estimate h(2) from unrelated individuals in admixed populations by first estimating the heritability explained by local ancestry (h(2)γ). We show that h(2)γ = 2FSTCθ(1 - θ)h(2), where FSTC measures frequency differences between populations at causal loci and θ is the genome-wide ancestry proportion. Our approach is not susceptible to biases caused by epistasis or shared environment. We applied this approach to the analysis of 13 phenotypes in 21,497 African-American individuals from 3 cohorts. For height and body mass index (BMI), we obtained h(2) estimates of 0.55 ± 0.09 and 0.23 ± 0.06, respectively, which are larger than estimates of h(2)g in these and other data but smaller than family-based estimates of h(2).
Collapse
Affiliation(s)
- Noah Zaitlen
- Department of Medicine, University of California, San Francisco, San Francisco, California, USA
| | - Bogdan Pasaniuc
- Department of Pathology and Laboratory Medicine, University of California, Los Angeles, Los Angeles, California, USA
| | - Sriram Sankararaman
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA
| | - Gaurav Bhatia
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA. [3] Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, USA
| | - Jianqi Zhang
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Alexander Gusev
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA. [3] Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, USA
| | - Taylor Young
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
| | - Arti Tandon
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA
| | - Samuela Pollack
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA. [3] Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, USA
| | - Bjarni J Vilhjálmsson
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA. [3] Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, USA
| | - Themistocles L Assimes
- Department of Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, US National Institutes of Health, Bethesda, Maryland, USA
| | - William J Blot
- 1] Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt University School of Medicine, Nashville, Tennessee, USA. [2] Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, Tennessee, USA. [3] International Epidemiology Institute, Rockville, Maryland, USA
| | - Stephen Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, US National Institutes of Health, Bethesda, Maryland, USA
| | - Nora Franceschini
- Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina, USA
| | | | - Jing He
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Anselm J M Hennis
- 1] Department of Preventive Medicine, Stony Brook University, Stony Brook, New York, USA. [2] Chronic Disease Research Centre, University of the West Indies, Bridgetown, Barbados. [3] Faculty of Medical Sciences, University of the West Indies, Bridgetown, Barbados. [4] Ministry of Health, Bridgetown, Barbados
| | - Ann Hsing
- 1] Cancer Prevention Institute of California, Fremont, California, USA. [2] Division of Epidemiology, Stanford University School of Medicine, Stanford, California, USA
| | - Sue A Ingles
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - William Isaacs
- James Buchanan Brady Urological Institute, Johns Hopkins Hospital and Medical Institutions, Baltimore, Maryland, USA
| | - Rick A Kittles
- Department of Medicine, University of Illinois at Chicago, Chicago, Illinois, USA
| | - Eric A Klein
- Glickman Urologic and Kidney Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | - Leslie A Lange
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Barbara Nemesure
- Department of Preventive Medicine, Stony Brook University, Stony Brook, New York, USA
| | - Nick Patterson
- Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
| | - David Reich
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA. [3] Howard Hughes Medical Institute, Harvard Medical School, Boston, Massachusetts, USA
| | - Benjamin A Rybicki
- Department of Public Health Sciences, Henry Ford Hospital, Detroit, Michigan, USA
| | - Janet L Stanford
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Victoria L Stevens
- Epidemiology Research Program, American Cancer Society, Atlanta, Georgia, USA
| | - Sara S Strom
- Department of Epidemiology, Division of Cancer Prevention and Population Sciences, University of Texas MD Anderson Cancer Center, Houston, Texas, USA
| | - Eric A Whitsel
- Department of Epidemiology, University of North Carolina, Chapel Hill, North Carolina, USA
| | - John S Witte
- Institute for Human Genetics, University of California, San Francisco, San Francisco, California, USA
| | - Jianfeng Xu
- Center for Cancer Genomics, Wake Forest University School of Medicine, Winston-Salem, North Carolina, USA
| | - Christopher Haiman
- 1] Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA. [2] Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California, USA
| | - James G Wilson
- Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, Mississippi, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Daniel Stram
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Alex P Reiner
- Department of Epidemiology, University of Washington School of Public Health, Seattle, Washington, USA
| | - Hua Tang
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Alkes L Price
- 1] Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA. [2] Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, USA. [3] Department of Epidemiology, Harvard School of Public Health, Boston, Massachusetts, USA
| |
Collapse
|
193
|
Uricchio LH, Torres R, Witte JS, Hernandez RD. Population genetic simulations of complex phenotypes with implications for rare variant association tests. Genet Epidemiol 2014; 39:35-44. [PMID: 25417809 DOI: 10.1002/gepi.21866] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2014] [Revised: 09/09/2014] [Accepted: 09/26/2014] [Indexed: 12/12/2022]
Abstract
Demographic events and natural selection alter patterns of genetic variation within populations and may play a substantial role in shaping the genetic architecture of complex phenotypes and disease. However, the joint impact of these basic evolutionary forces is often ignored in the assessment of statistical tests of association. Here, we provide a simulation-based framework for generating DNA sequences that incorporates selection and demography with flexible models for simulating phenotypic variation (sfs_coder). This tool also allows the user to perform locus-specific simulations by automatically querying annotated genomic functional elements and genetic maps. We demonstrate the effects of evolutionary forces on patterns of genetic variation by simulating recently inferred models of human selection and demography. We use these simulations to show that the demographic model and locus-specific features, such as the proportion of sites under selection, may have practical implications for estimating the statistical power of sequencing-based rare variant association tests. In particular, for some phenotype models, there may be higher power to detect rare variant associations in African populations compared to non-Africans, but power is considerably reduced in regions of the genome with rampant negative selection. Furthermore, we show that existing methods for simulating large samples based on resampling from a small set of observed haplotypes fail to recapitulate the distribution of rare variants in the presence of rapid population growth (as has been observed in several human populations).
Collapse
Affiliation(s)
- Lawrence H Uricchio
- Graduate Program in Bioinformatics, University of California, San Francisco, California, United States of America
| | | | | | | |
Collapse
|
194
|
Fu W, Gittelman RM, Bamshad MJ, Akey JM. Characteristics of neutral and deleterious protein-coding variation among individuals and populations. Am J Hum Genet 2014; 95:421-36. [PMID: 25279984 PMCID: PMC4185119 DOI: 10.1016/j.ajhg.2014.09.006] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Accepted: 09/11/2014] [Indexed: 01/27/2023] Open
Abstract
Whole-genome and exome data sets continue to be produced at a frenetic pace, resulting in massively large catalogs of human genomic variation. However, a clear picture of the characteristics and patterns of neutral and deleterious variation within and between populations has yet to emerge, given that recent large-scale sequencing studies have often emphasized different aspects of the data and sometimes appear to have conflicting conclusions. Here, we comprehensively studied characteristics of protein-coding variation in high-coverage exome sequence data from 6,515 European American (EA) and African American (AA) individuals. We developed an unbiased approach to identify putatively deleterious variants and investigated patterns of neutral and deleterious single-nucleotide variants and alleles between individuals and populations. We show that there are substantial differences in the composition of genotypes between EA and AA populations and that small but statistically significant differences exist in the average number of deleterious alleles carried by EA and AA individuals. Furthermore, we performed extensive simulations to delineate the temporal dynamics of deleterious alleles for a broad range of demographic models and use these data to inform the interpretation of empirical patterns of deleterious variation. Finally, we illustrate that the effects of demographic perturbations, such as bottlenecks and expansions, often manifest in opposing patterns of neutral and deleterious variation depending on whether the focus is on populations or individuals. Our results clarify seemingly disparate empirical characteristics of protein-coding variation and provide substantial insights into how natural selection and demographic history have patterned neutral and deleterious variation within and between populations.
Collapse
Affiliation(s)
- Wenqing Fu
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| | - Rachel M Gittelman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Michael J Bamshad
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Department of Pediatrics, University of Washington, Seattle, WA 98195, USA
| | - Joshua M Akey
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
195
|
The effects of demography and long-term selection on the accuracy of genomic prediction with sequence data. Genetics 2014; 198:1671-84. [PMID: 25233989 DOI: 10.1534/genetics.114.168344] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The use of dense SNPs to predict the genetic value of an individual for a complex trait is often referred to as "genomic selection" in livestock and crops, but is also relevant to human genetics to predict, for example, complex genetic disease risk. The accuracy of prediction depends on the strength of linkage disequilibrium (LD) between SNPs and causal mutations. If sequence data were used instead of dense SNPs, accuracy should increase because causal mutations are present, but demographic history and long-term negative selection also influence accuracy. We therefore evaluated genomic prediction, using simulated sequence in two contrasting populations: one reducing from an ancestrally large effective population size (Ne) to a small one, with high LD common in domestic livestock, while the second had a large constant-sized Ne with low LD similar to that in some human or outbred plant populations. There were two scenarios in each population; causal variants were either neutral or under long-term negative selection. For large Ne, sequence data led to a 22% increase in accuracy relative to ∼600K SNP chip data with a Bayesian analysis and a more modest advantage with a BLUP analysis. This advantage increased when causal variants were influenced by negative selection, and accuracy persisted when 10 generations separated reference and validation populations. However, in the reducing Ne population, there was little advantage for sequence even with negative selection. This study demonstrates the joint influence of demography and selection on accuracy of prediction and improves our understanding of how best to exploit sequence for genomic prediction.
Collapse
|
196
|
Sequencing an Ashkenazi reference panel supports population-targeted personal genomics and illuminates Jewish and European origins. Nat Commun 2014; 5:4835. [PMID: 25203624 PMCID: PMC4164776 DOI: 10.1038/ncomms5835] [Citation(s) in RCA: 112] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Accepted: 07/28/2014] [Indexed: 12/17/2022] Open
Abstract
The Ashkenazi Jewish (AJ) population is a genetic isolate close to European and Middle Eastern groups, with genetic diversity patterns conducive to disease mapping. Here we report high-depth sequencing of 128 complete genomes of AJ controls. Compared with European samples, our AJ panel has 47% more novel variants per genome and is eightfold more effective at filtering benign variants out of AJ clinical genomes. Our panel improves imputation accuracy for AJ SNP arrays by 28%, and covers at least one haplotype in ≈67% of any AJ genome with long, identical-by-descent segments. Reconstruction of recent AJ history from such segments confirms a recent bottleneck of merely ≈350 individuals. Modelling of ancient histories for AJ and European populations using their joint allele frequency spectrum determines AJ to be an even admixture of European and likely Middle Eastern origins. We date the split between the two ancestral populations to ≈12–25 Kyr, suggesting a predominantly Near Eastern source for the repopulation of Europe after the Last Glacial Maximum. Ashkenazi Jews are a genetically isolated population with distinct patterns of genetic diversity. Here, the authors sequence the genomes of 128 Ashkenazi Jewish individuals and use the sequence information to provide insight into the population's European and Middle Eastern origins.
Collapse
|
197
|
Li X, Battle A, Karczewski K, Zappala Z, Knowles D, Smith K, Kukurba K, Wu E, Simon N, Montgomery S. Transcriptome sequencing of a large human family identifies the impact of rare noncoding variants. Am J Hum Genet 2014; 95:245-56. [PMID: 25192044 PMCID: PMC4157143 DOI: 10.1016/j.ajhg.2014.08.004] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Accepted: 08/12/2014] [Indexed: 12/26/2022] Open
Abstract
Recent and rapid human population growth has led to an excess of rare genetic variants that are expected to contribute to an individual's genetic burden of disease risk. To date, much of the focus has been on rare protein-coding variants, for which potential impact can be estimated from the genetic code, but determining the impact of rare noncoding variants has been more challenging. To improve our understanding of such variants, we combined high-quality genome sequencing and RNA sequencing data from a 17-individual, three-generation family to contrast expression quantitative trait loci (eQTLs) and splicing quantitative trait loci (sQTLs) within this family to eQTLs and sQTLs within a population sample. Using this design, we found that eQTLs and sQTLs with large effects in the family were enriched with rare regulatory and splicing variants (minor allele frequency < 0.01). They were also more likely to influence essential genes and genes involved in complex disease. In addition, we tested the capacity of diverse noncoding annotation to predict the impact of rare noncoding variants. We found that distance to the transcription start site, evolutionary constraint, and epigenetic annotation were considerably more informative for predicting the impact of rare variants than for predicting the impact of common variants. These results highlight that rare noncoding variants are important contributors to individual gene-expression profiles and further demonstrate a significant capability for genomic annotation to predict the impact of rare noncoding variants.
Collapse
|
198
|
Impact of range expansions on current human genomic diversity. Curr Opin Genet Dev 2014; 29:22-30. [PMID: 25156518 DOI: 10.1016/j.gde.2014.07.007] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2014] [Revised: 07/09/2014] [Accepted: 07/25/2014] [Indexed: 12/19/2022]
Abstract
The patterns of population genetic diversity depend to a large extent on past demographic history. Most human populations are known to have gone recently through a series of range expansions within and out of Africa, but these spatial expansions are rarely taken into account when interpreting observed genomic diversity, possibly because they are difficult to model. Here we review available evidence in favour of range expansions out of Africa, and we discuss several of their consequences on neutral and selected diversity, including some recent observations on an excess of rare neutral and selected variants in large samples. We further show that in spatially subdivided populations, the sampling strategy can severely impact the resulting genetic diversity and be confounded by past demography. We conclude that ignoring the spatial structure of human population can lead to some misinterpretations of extant genetic diversity.
Collapse
|
199
|
Lim ET, Würtz P, Havulinna AS, Palta P, Tukiainen T, Rehnström K, Esko T, Mägi R, Inouye M, Lappalainen T, Chan Y, Salem RM, Lek M, Flannick J, Sim X, Manning A, Ladenvall C, Bumpstead S, Hämäläinen E, Aalto K, Maksimow M, Salmi M, Blankenberg S, Ardissino D, Shah S, Horne B, McPherson R, Hovingh GK, Reilly MP, Watkins H, Goel A, Farrall M, Girelli D, Reiner AP, Stitziel NO, Kathiresan S, Gabriel S, Barrett JC, Lehtimäki T, Laakso M, Groop L, Kaprio J, Perola M, McCarthy MI, Boehnke M, Altshuler DM, Lindgren CM, Hirschhorn JN, Metspalu A, Freimer NB, Zeller T, Jalkanen S, Koskinen S, Raitakari O, Durbin R, MacArthur DG, Salomaa V, Ripatti S, Daly MJ, Palotie A. Distribution and medical impact of loss-of-function variants in the Finnish founder population. PLoS Genet 2014; 10:e1004494. [PMID: 25078778 PMCID: PMC4117444 DOI: 10.1371/journal.pgen.1004494] [Citation(s) in RCA: 284] [Impact Index Per Article: 28.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2014] [Accepted: 05/14/2014] [Indexed: 01/19/2023] Open
Abstract
Exome sequencing studies in complex diseases are challenged by the allelic heterogeneity, large number and modest effect sizes of associated variants on disease risk and the presence of large numbers of neutral variants, even in phenotypically relevant genes. Isolated populations with recent bottlenecks offer advantages for studying rare variants in complex diseases as they have deleterious variants that are present at higher frequencies as well as a substantial reduction in rare neutral variation. To explore the potential of the Finnish founder population for studying low-frequency (0.5-5%) variants in complex diseases, we compared exome sequence data on 3,000 Finns to the same number of non-Finnish Europeans and discovered that, despite having fewer variable sites overall, the average Finn has more low-frequency loss-of-function variants and complete gene knockouts. We then used several well-characterized Finnish population cohorts to study the phenotypic effects of 83 enriched loss-of-function variants across 60 phenotypes in 36,262 Finns. Using a deep set of quantitative traits collected on these cohorts, we show 5 associations (p<5×10⁻⁸) including splice variants in LPA that lowered plasma lipoprotein(a) levels (P = 1.5×10⁻¹¹⁷). Through accessing the national medical records of these participants, we evaluate the LPA finding via Mendelian randomization and confirm that these splice variants confer protection from cardiovascular disease (OR = 0.84, P = 3×10⁻⁴), demonstrating for the first time the correlation between very low levels of LPA in humans with potential therapeutic implications for cardiovascular diseases. More generally, this study articulates substantial advantages for studying the role of rare variation in complex phenotypes in founder populations like the Finns and by combining a unique population genetic history with data from large population cohorts and centralized research access to National Health Registers.
Collapse
Affiliation(s)
- Elaine T. Lim
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Peter Würtz
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Department of Chronic Disease Prevention, National Institute for Health and Welfare, Helsinki, Finland
- Computational Medicine, Institute of Health Sciences, University of Oulu, Oulu, Finland
| | - Aki S. Havulinna
- Department of Chronic Disease Prevention, National Institute for Health and Welfare, Helsinki, Finland
| | - Priit Palta
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Taru Tukiainen
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Karola Rehnström
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Tõnu Esko
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- Estonian Genome Center, University of Tartu, Tartu, Estonia
- Divisions of Endocrinology and Genetics and Center for Basic and Translational Obesity Research, Children's Hospital Boston, Boston, Massachusetts, United States of America
| | - Reedik Mägi
- Estonian Genome Center, University of Tartu, Tartu, Estonia
| | - Michael Inouye
- Medical Systems Biology, Department of Pathology and Department of Microbiology & Immunology, The University of Melbourne, Parkville, Victoria, Australia
| | - Tuuli Lappalainen
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Stanford Center for Computational, Evolutionary and Human Genomics, Stanford, California, United States of America
| | - Yingleong Chan
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, Massachusetts, United States of America
- Divisions of Endocrinology and Genetics and Center for Basic and Translational Obesity Research, Children's Hospital Boston, Boston, Massachusetts, United States of America
| | - Rany M. Salem
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Divisions of Endocrinology and Genetics and Center for Basic and Translational Obesity Research, Children's Hospital Boston, Boston, Massachusetts, United States of America
| | - Monkol Lek
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Jason Flannick
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Xueling Sim
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Alisa Manning
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | - Claes Ladenvall
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Lund University Diabetes Center, Department of Clinical Sciences, Diabetes & Endocrinology, Skåne University Hospital, Lund University, Malmö, Sweden
| | | | - Eija Hämäläinen
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | | | | | - Marko Salmi
- Department of Medical Microbiology and Immunology, University of Turku and National Institute for Health and Welfare, Turku, Finland
| | - Stefan Blankenberg
- University Heart Centre Hamburg, Clinic for General and Interventional Cardiology, Hamburg, Germany
- DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
| | - Diego Ardissino
- Division of Cardiology, Azienda Ospedaliero-Universitaria di Parma, Parma, Italy
| | - Svati Shah
- Department of Medicine, Duke University Medical Center, Durham, North Carolina, United States of America
| | - Benjamin Horne
- Intermountain Heart Institute, Intermountain Medical Center, Salt Lake City, Utah, United States of America
| | - Ruth McPherson
- Division of Cardiology, University of Ottawa Heart Institute, Ottawa, Ontario, Canada
| | - Gerald K. Hovingh
- Department of Vascular Medicine, Academic Medical Center, Amsterdam, The Netherlands
| | - Muredach P. Reilly
- Cardiovascular Institute, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Hugh Watkins
- Division of Cardiovascular Medicine, Radcliffe Department of Medicine, The Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Anuj Goel
- Division of Cardiovascular Medicine, Radcliffe Department of Medicine, The Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Martin Farrall
- Division of Cardiovascular Medicine, Radcliffe Department of Medicine, The Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Domenico Girelli
- University of Verona School of Medicine, Department of Medicine, Verona, Italy
| | - Alex P. Reiner
- Department of Epidemiology, University of Washington, Seattle, Washington, United States of America
| | - Nathan O. Stitziel
- Cardiovascular Division, Department of Medicine, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Sekar Kathiresan
- Center for Human Genetic Research, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Stacey Gabriel
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
| | | | - Terho Lehtimäki
- Department of Clinical Chemistry, Fimlab Laboratories, University of Tampere School of Medicine, Tampere, Finland
| | - Markku Laakso
- Department of Medicine, University of Eastern Finland, Kuopio, Finland
| | - Leif Groop
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Lund University Diabetes Center, Department of Clinical Sciences, Diabetes & Endocrinology, Skåne University Hospital, Lund University, Malmö, Sweden
| | - Jaakko Kaprio
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- University of Helsinki, Hjelt Institute, Dept of Public Health, Helsinki, Finland
- National Institute for Health and Welfare, Dept of Mental Health and Substance Abuse Services, Helsinki, Finland
| | - Markus Perola
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Mark I. McCarthy
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Headington, Oxford, United Kingdom
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Oxford NIHR Biomedical Research Centre, Churchill Hospital, Headington, Oxford, United Kingdom
| | - Michael Boehnke
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - David M. Altshuler
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Cecilia M. Lindgren
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Joel N. Hirschhorn
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Divisions of Endocrinology and Genetics and Center for Basic and Translational Obesity Research, Children's Hospital Boston, Boston, Massachusetts, United States of America
| | | | - Nelson B. Freimer
- University of California Los Angeles Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California, United States of America
| | - Tanja Zeller
- University Heart Centre Hamburg, Clinic for General and Interventional Cardiology, Hamburg, Germany
- DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
| | - Sirpa Jalkanen
- Department of Medical Microbiology and Immunology, University of Turku and National Institute for Health and Welfare, Turku, Finland
| | - Seppo Koskinen
- Department of Health, Functional Capacity and Welfare, National Institute for Health and Welfare, Helsinki, Finland
| | - Olli Raitakari
- Research Centre of Applied and Preventive Cardiovascular Medicine, University of Turku, Turku, Finland
- Department of Clinical Physiology and Nuclear Medicine, Turku University Hospital, Turku, Finland
| | - Richard Durbin
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Daniel G. MacArthur
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Veikko Salomaa
- Department of Chronic Disease Prevention, National Institute for Health and Welfare, Helsinki, Finland
| | - Samuli Ripatti
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Department of Chronic Disease Prevention, National Institute for Health and Welfare, Helsinki, Finland
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
- University of Helsinki, Hjelt Institute, Dept of Public Health, Helsinki, Finland
- Department of Biometry, Hjelt Institute, University of Helsinki, Helsinki, Finland
| | - Mark J. Daly
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail: (MJD); (AP)
| | - Aarno Palotie
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts, United States of America
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Psychiatric & Neurodevelopmental Genetics Unit, Department of Psychiatry, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- * E-mail: (MJD); (AP)
| | | |
Collapse
|
200
|
Abstract
Evolutionary processes of natural selection may be expected to leave their mark on age patterns of survival and reproduction. Demographic theory includes three main strands--mutation accumulation, stochastic vitality, and optimal life histories. This paper reviews the three strands and, concentrating on mutation accumulation, extends a mathematical result with broad implications concerning the effect of interactions between small age-specific effects of deleterious mutant alleles. Empirical data from genomic sequencing along with prospects for combining strands of theory hold hope for future progress.
Collapse
|