201
|
Rafajlović M, Klassmann A, Eriksson A, Wiehe T, Mehlig B. Demography-adjusted tests of neutrality based on genome-wide SNP data. Theor Popul Biol 2014; 95:1-12. [PMID: 24911258 DOI: 10.1016/j.tpb.2014.05.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2013] [Revised: 05/28/2014] [Accepted: 05/29/2014] [Indexed: 12/15/2022]
Abstract
Tests of the neutral evolution hypothesis are usually built on the standard null model which assumes that mutations are neutral and the population size remains constant over time. However, it is unclear how such tests are affected if the last assumption is dropped. Here, we extend the unifying framework for tests based on the site frequency spectrum, introduced by Achaz and Ferretti, to populations of varying size. Key ingredients are the first two moments of the site frequency spectrum. We show how these moments can be computed analytically if a population has experienced two instantaneous size changes in the past. We apply our method to data from ten human populations gathered in the 1000 genomes project, estimate their demographies and define demography-adjusted versions of Tajima's D, Fay & Wu's H, and Zeng's E. Our results show that demography-adjusted test statistics facilitate the direct comparison between populations and that most of the differences among populations seen in the original unadjusted tests can be explained by their underlying demographies. Upon carrying out whole-genome screens for deviations from neutrality, we identify candidate regions of recent positive selection. We provide track files with values of the adjusted and unadjusted tests for upload to the UCSC genome browser.
Collapse
Affiliation(s)
- M Rafajlović
- Department of Physics, University of Gothenburg, SE-412 96 Gothenburg, Sweden; The Linnaeus Centre for Marine Evolutionary Biology, University of Gothenburg, SE-405 30 Gothenburg, Sweden
| | - A Klassmann
- Institut für Genetik, Universität zu Köln, 50674 Köln, Germany
| | - A Eriksson
- Department of Zoology, University of Cambridge, CB2 3EJ Cambridge, UK; Integrative Systems Biology Lab, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia
| | - T Wiehe
- Institut für Genetik, Universität zu Köln, 50674 Köln, Germany
| | - B Mehlig
- Department of Physics, University of Gothenburg, SE-412 96 Gothenburg, Sweden; The Linnaeus Centre for Marine Evolutionary Biology, University of Gothenburg, SE-405 30 Gothenburg, Sweden.
| |
Collapse
|
202
|
Abstract
Research into when and where modern humans originated and how they differ from, and interacted with, other now-extinct forms of human has so far been the realm of archaeologists and paleoanthropologists. However, over the past decade, molecular geneticists have begun to study genomes of extinct humans. Here, I discuss where we stand today with respect to understanding how modern humans came to differ from Neandertals and other human forms that existed until about 30,000 years ago.
Collapse
Affiliation(s)
- Svante Pääbo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, D-04103 Leipzig, Germany.
| |
Collapse
|
203
|
Stunnenberg HG, Hubner NC. Genomics meets proteomics: identifying the culprits in disease. Hum Genet 2014; 133:689-700. [PMID: 24135908 PMCID: PMC4021166 DOI: 10.1007/s00439-013-1376-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Accepted: 10/01/2013] [Indexed: 12/20/2022]
Abstract
Genome-wide association studies (GWAS) revealed genomic risk loci that potentially have an impact on disease and phenotypic traits. This extensive resource holds great promise in providing novel directions for personalized medicine, including disease risk prediction, prevention and targeted medication. One of the major challenges that researchers face on the path between the initial identification of an association and precision treatment of patients is the comprehension of the biological mechanisms that underlie these associations. Currently, the focus to solve these questions lies on the integrative analysis of system-wide data on global genome variation, gene expression, transcription factor binding, epigenetic profiles and chromatin conformation. The generation of this data mainly relies on next-generation sequencing. However, due to multiple recent developments, mass spectrometry-based proteomics now offers additional, by the GWAS field so far hardly recognized possibilities for the identification of functional genome variants and, in particular, for the identification and characterization of (differentially) bound protein complexes as well as physiological target genes. In this review, we introduce these proteomics advances and suggest how they might be integrated in post-GWAS workflows. We argue that the combination of highly complementary techniques is powerful and can provide an unbiased, detailed picture of GWAS loci and their mechanistic involvement in disease.
Collapse
Affiliation(s)
- Hendrik G. Stunnenberg
- Department of Molecular Biology, Faculty of Science, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen, 6525 GA Nijmegen, The Netherlands
| | - Nina C. Hubner
- Department of Molecular Biology, Faculty of Science, Nijmegen Centre for Molecular Life Sciences, Radboud University Nijmegen, 6525 GA Nijmegen, The Netherlands
| |
Collapse
|
204
|
Fumagalli M, Sironi M. Human genome variability, natural selection and infectious diseases. Curr Opin Immunol 2014; 30:9-16. [PMID: 24880709 DOI: 10.1016/j.coi.2014.05.001] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2014] [Revised: 04/29/2014] [Accepted: 05/02/2014] [Indexed: 01/04/2023]
Abstract
The recent availability of large-scale sequencing DNA data allowed researchers to investigate how genomic variation is distributed among populations. While demographic factors explain genome-wide population genetic diversity levels, scans for signatures of natural selection pinpointed several regions under non-neutral evolution. Recent studies found an enrichment of immune-related genes subjected to natural selection, suggesting that pathogens and infectious diseases have imposed a strong selective pressure throughout human history. Pathogen-mediated selection often targeted regulatory sites of genes belonging to the same biological pathway. Results from these studies have the potential to identify mutations that modulate infection susceptibility by integrating a population genomic approach with molecular immunology data and large-scale functional annotations.
Collapse
Affiliation(s)
- Matteo Fumagalli
- UCL Genetics Institute, Department of Genetics, Evolution and Environment, University College London, Gower Street, London WC1E 6BT, United Kingdom.
| | - Manuela Sironi
- Bioinformatics - Scientific Institute IRCCS E.MEDEA, 23842 Bosisio Parini, Italy
| |
Collapse
|
205
|
Sjöstrand AE, Sjödin P, Jakobsson M. Private haplotypes can reveal local adaptation. BMC Genet 2014; 15:61. [PMID: 24885734 PMCID: PMC4040116 DOI: 10.1186/1471-2156-15-61] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 05/07/2014] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Genome-wide scans for regions that demonstrate deviating patterns of genetic variation have become common approaches for finding genes targeted by selection. Several genomic patterns have been utilized for this purpose, including deviations in haplotype homozygosity, frequency spectra and genetic differentiation between populations. RESULTS We describe a novel approach based on the Maximum Frequency of Private Haplotypes--MFPH--to search for signals of recent population-specific selection. The MFPH statistic is straightforward to compute for phased SNP- and sequence-data. Using both simulated and empirical data, we show that MFPH can be a powerful statistic to detect recent population-specific selection, that it performs at the same level as other commonly used summary statistics (e.g. FST, iHS and XP-EHH), and that MFPH in some cases capture signals of selection that are missed by other statistics. For instance, in the Maasai, MFPH reveals a strong signal of selection in a region where other investigated statistics fail to pick up a clear signal that contains the genes DOCK3, MAPKAPK3 and CISH. This region has been suggested to affect height in many populations based on phenotype-genotype association studies. It has specifically been suggested to be targeted by selection in Pygmy groups, which are on the opposite end of the human height spectrum compared to the Maasai. CONCLUSIONS From the analysis of both simulated and publicly available empirical data, we show that MFPH represents a summary statistic that can provide further insight concerning population-specific adaptation.
Collapse
Affiliation(s)
- Agnès E Sjöstrand
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- UMR 7206 Eco-anthropologie et Ethnobiologie, CNRS-MNHN-Université Paris VII, Paris, France
- Laboratoire TIMC-IMAG, Centre National de la Recherche Scientifique, Université Joseph Fourier, Grenoble, France
| | - Per Sjödin
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Mattias Jakobsson
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| |
Collapse
|
206
|
Wright FA, Sullivan PF, Brooks AI, Zou F, Sun W, Xia K, Madar V, Jansen R, Chung W, Zhou YH, Abdellaoui A, Batista S, Butler C, Chen G, Chen TH, D'Ambrosio D, Gallins P, Ha MJ, Hottenga JJ, Huang S, Kattenberg M, Kochar J, Middeldorp CM, Qu A, Shabalin A, Tischfield J, Todd L, Tzeng JY, van Grootheest G, Vink JM, Wang Q, Wang W, Wang W, Willemsen G, Smit JH, de Geus EJ, Yin Z, Penninx BWJH, Boomsma DI. Heritability and genomics of gene expression in peripheral blood. Nat Genet 2014; 46:430-7. [PMID: 24728292 PMCID: PMC4012342 DOI: 10.1038/ng.2951] [Citation(s) in RCA: 255] [Impact Index Per Article: 23.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2012] [Accepted: 03/14/2014] [Indexed: 12/14/2022]
Abstract
We assessed gene expression profiles in 2,752 twins, using a classic twin design to quantify expression heritability and quantitative trait loci (eQTLs) in peripheral blood. The most highly heritable genes (∼777) were grouped into distinct expression clusters, enriched in gene-poor regions, associated with specific gene function or ontology classes, and strongly associated with disease designation. The design enabled a comparison of twin-based heritability to estimates based on dizygotic identity-by-descent sharing and distant genetic relatedness. Consideration of sampling variation suggests that previous heritability estimates have been upwardly biased. Genotyping of 2,494 twins enabled powerful identification of eQTLs, which we further examined in a replication set of 1,895 unrelated subjects. A large number of non-redundant local eQTLs (6,756) met replication criteria, whereas a relatively small number of distant eQTLs (165) met quality control and replication standards. Our results provide a new resource toward understanding the genetic control of transcription.
Collapse
Affiliation(s)
- Fred A Wright
- 1] Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA. [2] Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA. [3] Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA. [4]
| | - Patrick F Sullivan
- 1] Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA. [2]
| | - Andrew I Brooks
- Department of Genetics, Rutgers University, New Brunswick, New Jersey, USA
| | - Fei Zou
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Wei Sun
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Kai Xia
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Vered Madar
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Rick Jansen
- Department of Psychiatry, VU Medical Center, Amsterdam, The Netherlands
| | - Wonil Chung
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Yi-Hui Zhou
- 1] Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA. [2] Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Abdel Abdellaoui
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Sandra Batista
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Casey Butler
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Guanhua Chen
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Ting-Huei Chen
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - David D'Ambrosio
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | - Paul Gallins
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Min Jin Ha
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Jouke Jan Hottenga
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Shunping Huang
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Mathijs Kattenberg
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Jaspreet Kochar
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | | | - Ani Qu
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | - Andrey Shabalin
- Department of Pharmacotherapy and Outcomes Science, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Jay Tischfield
- Department of Genetics, Rutgers University, New Brunswick, New Jersey, USA
| | - Laura Todd
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Jung-Ying Tzeng
- 1] Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA. [2] Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | | | - Jacqueline M Vink
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Qi Wang
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | - Wei Wang
- Department of Computer Science, University of California, Los Angeles, Los Angeles, California, USA
| | - Weibo Wang
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Gonneke Willemsen
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Johannes H Smit
- Department of Psychiatry, VU Medical Center, Amsterdam, The Netherlands
| | - Eco J de Geus
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Zhaoyu Yin
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | | | - Dorret I Boomsma
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| |
Collapse
|
207
|
Abstract
Infectious pathogens are among the strongest selective forces that shape the human genome. Migrations and cultural changes in the past 100,000 years exposed populations to dangerous new pathogens. Host genetics influences susceptibility to infectious disease. Evolutionary adaptations for resistance and symbiosis may underlie common immune-mediated diseases. Signatures of selection and methods to detect them vary with the age, geographical spread and virulence of the pathogen. A history of selection on a trait adds power to association studies by driving the emergence of common alleles of strong effect. Combining selection and association metrics can further increase power. Genome-wide association studies (GWASs) of susceptibility to pathogens that are moderately old (1,000–50,000 years ago), geographically limited in history and exerted strong positive selective pressure will have the most power if GWASs can be done in the historically affected population. An understanding of host–pathogen interactions can inform the development of new therapies for both infectious diseases and common immune-mediated diseases.
The impact of various infectious agents on human survival and reproduction over thousands of years has exerted selective pressure on numerous regions of the human genome. This Review describes how such signatures of selection can be detected and integrated with data from complementary approaches, such as genome-wide association studies, to provide biological insights into host–pathogen interactions. The ancient biological 'arms race' between microbial pathogens and humans has shaped genetic variation in modern populations, and this has important implications for the growing field of medical genomics. As humans migrated throughout the world, populations encountered distinct pathogens, and natural selection increased the prevalence of alleles that are advantageous in the new ecosystems in both host and pathogens. This ancient history now influences human infectious disease susceptibility and microbiome homeostasis, and contributes to common diseases that show geographical disparities, such as autoimmune and metabolic disorders. Using new high-throughput technologies, analytical methods and expanding public data resources, the investigation of natural selection is leading to new insights into the function and dysfunction of human biology.
Collapse
|
208
|
Skoglund P, Malmström H, Omrak A, Raghavan M, Valdiosera C, Günther T, Hall P, Tambets K, Parik J, Sjögren KG, Apel J, Willerslev E, Storå J, Götherström A, Jakobsson M. Genomic diversity and admixture differs for Stone-Age Scandinavian foragers and farmers. Science 2014; 344:747-50. [PMID: 24762536 DOI: 10.1126/science.1253448] [Citation(s) in RCA: 192] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Prehistoric population structure associated with the transition to an agricultural lifestyle in Europe remains a contentious idea. Population-genomic data from 11 Scandinavian Stone Age human remains suggest that hunter-gatherers had lower genetic diversity than that of farmers. Despite their close geographical proximity, the genetic differentiation between the two Stone Age groups was greater than that observed among extant European populations. Additionally, the Scandinavian Neolithic farmers exhibited a greater degree of hunter-gatherer-related admixture than that of the Tyrolean Iceman, who also originated from a farming context. In contrast, Scandinavian hunter-gatherers displayed no significant evidence of introgression from farmers. Our findings suggest that Stone Age foraging groups were historically in low numbers, likely owing to oscillating living conditions or restricted carrying capacity, and that they were partially incorporated into expanding farming groups.
Collapse
Affiliation(s)
- Pontus Skoglund
- Department of Evolutionary Biology, Uppsala University, Uppsala 752 36, Sweden
| | - Helena Malmström
- Department of Evolutionary Biology, Uppsala University, Uppsala 752 36, Sweden
| | - Ayça Omrak
- Department of Archaeology and Classical studies, Stockholm University, Stockholm 106 91, Sweden
| | - Maanasa Raghavan
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen 1350, Denmark
| | - Cristina Valdiosera
- Department of Archaeology, Environment and Community Planning, La Trobe University, Melbourne VIC 3086, Australia
| | - Torsten Günther
- Department of Evolutionary Biology, Uppsala University, Uppsala 752 36, Sweden
| | - Per Hall
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm 171 77, Sweden
| | - Kristiina Tambets
- Evolutionary Biology Group, Estonian Biocentre and University of Tartu, Tartu 51010, Estonia
| | - Jüri Parik
- Evolutionary Biology Group, Estonian Biocentre and University of Tartu, Tartu 51010, Estonia
| | - Karl-Göran Sjögren
- Department of Historical Studies, University of Gothenburg, Gothenburg, 405 30, Sweden
| | - Jan Apel
- Department of Archaeology and Ancient History, Lund University, Lund, 221 00, Sweden
| | - Eske Willerslev
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen 1350, Denmark
| | - Jan Storå
- Department of Archaeology and Classical studies, Stockholm University, Stockholm 106 91, Sweden
| | - Anders Götherström
- Department of Archaeology and Classical studies, Stockholm University, Stockholm 106 91, Sweden.
| | - Mattias Jakobsson
- Department of Evolutionary Biology, Uppsala University, Uppsala 752 36, Sweden. Science for Life Laboratory, Uppsala University, Uppsala 752 36, Sweden.
| |
Collapse
|
209
|
Abstract
With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease.
Collapse
|
210
|
Abstract
The past fifty years have seen the development and application of numerous statistical methods to identify genomic regions that appear to be shaped by natural selection. These methods have been used to investigate the macro- and microevolution of a broad range of organisms, including humans. Here, we provide a comprehensive outline of these methods, explaining their conceptual motivations and statistical interpretations. We highlight areas of recent and future development in evolutionary genomics methods and discuss ongoing challenges for researchers employing such tests. In particular, we emphasize the importance of functional follow-up studies to characterize putative selected alleles and the use of selection scans as hypothesis-generating tools for investigating evolutionary histories.
Collapse
Affiliation(s)
- Joseph J Vitti
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138; ,
| | | | | |
Collapse
|
211
|
Neanderthal ancestry drives evolution of lipid catabolism in contemporary Europeans. Nat Commun 2014; 5:3584. [PMID: 24690587 PMCID: PMC3988804 DOI: 10.1038/ncomms4584] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2013] [Accepted: 03/07/2014] [Indexed: 11/09/2022] Open
Abstract
Although Neanderthals are extinct, fragments of their genomes persist in contemporary humans. Here we show that while the genome-wide frequency of Neanderthal-like sites is approximately constant across all contemporary out-of-Africa populations, genes involved in lipid catabolism contain more than threefold excess of such sites in contemporary humans of European descent. Evolutionally, these genes show significant association with signatures of recent positive selection in the contemporary European, but not Asian or African populations. Functionally, the excess of Neanderthal-like sites in lipid catabolism genes can be linked with a greater divergence of lipid concentrations and enzyme expression levels within this pathway, seen in contemporary Europeans, but not in the other populations. We conclude that sequence variants that evolved in Neanderthals may have given a selective advantage to anatomically modern humans that settled in the same geographical areas. Modern human genomes contain Neanderthal sequences, but it is unclear whether these were selected. Here, Khrameeva et al. show that Neanderthal sequences associated with lipid catabolism are three times more frequent in Europe, suggesting that these sequences might have been beneficial to Europeans.
Collapse
|
212
|
Fagny M, Patin E, Enard D, Barreiro LB, Quintana-Murci L, Laval G. Exploring the occurrence of classic selective sweeps in humans using whole-genome sequencing data sets. Mol Biol Evol 2014; 31:1850-68. [PMID: 24694833 DOI: 10.1093/molbev/msu118] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Genome-wide scans for selection have identified multiple regions of the human genome as being targeted by positive selection. However, only a small proportion has been replicated across studies, and the prevalence of positive selection as a mechanism of adaptive change in humans remains controversial. Here we explore the power of two haplotype-based statistics--the integrated haplotype score (iHS) and the Derived Intraallelic Nucleotide Diversity (DIND) test--in the context of next-generation sequencing data, and evaluate their robustness to demography and other selection modes. We show that these statistics are both powerful for the detection of recent positive selection, regardless of population history, and robust to variation in coverage, with DIND being insensitive to very low coverage. We apply these statistics to whole-genome sequence data sets from the 1000 Genomes Project and Complete Genomics. We found that putative targets of selection were highly significantly enriched in genic and nonsynonymous single nucleotide polymorphisms, and that DIND was more powerful than iHS in the context of small sample sizes, low-quality genotype calling, or poor coverage. As we excluded genomic confounders and alternative selection models, such as background selection, the observed enrichment attests to the action of recent, strong positive selection. Further support to the adaptive significance of these genomic regions came from their enrichment in functional variants detected by genome-wide association studies, informing the relationship between past selection and current benign and disease-related phenotypic variation. Our results indicate that hard sweeps targeting low-frequency standing variation have played a moderate, albeit significant, role in recent human evolution.
Collapse
Affiliation(s)
- Maud Fagny
- Institut Pasteur, Human Evolutionary Genetics, Department of Genomes and Genetics, Paris, FranceCentre National de la Recherche Scientifique, URA3012, Paris, FranceUniversité Pierre et Marie Curie, Cellule Pasteur UPMC, Paris, France
| | - Etienne Patin
- Institut Pasteur, Human Evolutionary Genetics, Department of Genomes and Genetics, Paris, FranceCentre National de la Recherche Scientifique, URA3012, Paris, France
| | | | - Luis B Barreiro
- Department of Pediatrics, Sainte-Justine Hospital Research Center, University of Montreal, Montreal, Quebec, Canada
| | - Lluis Quintana-Murci
- Institut Pasteur, Human Evolutionary Genetics, Department of Genomes and Genetics, Paris, FranceCentre National de la Recherche Scientifique, URA3012, Paris, France
| | - Guillaume Laval
- Institut Pasteur, Human Evolutionary Genetics, Department of Genomes and Genetics, Paris, FranceCentre National de la Recherche Scientifique, URA3012, Paris, France
| |
Collapse
|
213
|
An evolutionary analysis of antigen processing and presentation across different timescales reveals pervasive selection. PLoS Genet 2014; 10:e1004189. [PMID: 24675550 PMCID: PMC3967941 DOI: 10.1371/journal.pgen.1004189] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 01/06/2014] [Indexed: 12/28/2022] Open
Abstract
The antigenic repertoire presented by MHC molecules is generated by the antigen processing and presentation (APP) pathway. We analyzed the evolutionary history of 45 genes involved in APP at the inter- and intra-species level. Results showed that 11 genes evolved adaptively in mammals. Several positively selected sites involve positions of fundamental importance to the protein function (e.g. the TAP1 peptide-binding domains, the sugar binding interface of langerin, and the CD1D trafficking signal region). In CYBB, all selected sites cluster in two loops protruding into the endosomal lumen; analysis of missense mutations responsible for chronic granulomatous disease (CGD) showed the action of different selective forces on the very same gene region, as most CGD substitutions involve aminoacid positions that are conserved in all mammals. As for ERAP2, different computational methods indicated that positive selection has driven the recurrent appearance of protein-destabilizing variants during mammalian evolution. Application of a population-genetics phylogenetics approach showed that purifying selection represented a major force acting on some APP components (e.g. immunoproteasome subunits and chaperones) and allowed identification of positive selection events in the human lineage. We also investigated the evolutionary history of APP genes in human populations by developing a new approach that uses several different tests to identify the selection target, and that integrates low-coverage whole-genome sequencing data with Sanger sequencing. This analysis revealed that 9 APP genes underwent local adaptation in human populations. Most positive selection targets are located within noncoding regions with regulatory function in myeloid cells or act as expression quantitative trait loci. Conversely, balancing selection targeted nonsynonymous variants in TAP1 and CD207 (langerin). Finally, we suggest that selected variants in PSMB10 and CD207 contribute to human phenotypes. Thus, we used evolutionary information to generate experimentally-testable hypotheses and to provide a list of sites to prioritize in follow-up analyses. Antigen-presenting cells digest intracellular and extracellular proteins and display the resulting antigenic repertoire on cell surface molecules for recognition by T cells. This process initiates cell-mediated immune responses and is essential to detect infections. The antigenic repertoire is generated by the antigen processing and presentation pathway. Because several pathogens evade immune recognition by hampering this process, genes involved in antigen processing and presentation may represent common natural selection targets. Thus, we analyzed the evolutionary history of these genes during mammalian evolution and in the more recent history of human populations. Evolutionary analyses in mammals indicated that positive selection targeted a very high proportion of genes (24%), and revealed that many selected sites affect positions of fundamental importance to the protein function. In humans, we found different signatures of natural selection acting both on regions that are expected to regulate gene expression levels or timing and on coding variants; two human selected polymorphisms may modulate the susceptibility to Crohn's disease and to HIV-1 infection. Therefore, we provide a comprehensive evolutionary analysis of antigen processing and we show that evolutionary studies can provide useful information concerning the location and nature of functional variants, ultimately helping to clarify phenotypic differences between and within species.
Collapse
|
214
|
Abstract
The role of positive selection in human evolution remains controversial. On the one hand, scans for positive selection have identified hundreds of candidate loci, and the genome-wide patterns of polymorphism show signatures consistent with frequent positive selection. On the other hand, recent studies have argued that many of the candidate loci are false positives and that most genome-wide signatures of adaptation are in fact due to reduction of neutral diversity by linked deleterious mutations, known as background selection. Here we analyze human polymorphism data from the 1000 Genomes Project and detect signatures of positive selection once we correct for the effects of background selection. We show that levels of neutral polymorphism are lower near amino acid substitutions, with the strongest reduction observed specifically near functionally consequential amino acid substitutions. Furthermore, amino acid substitutions are associated with signatures of recent adaptation that should not be generated by background selection, such as unusually long and frequent haplotypes and specific distortions in the site frequency spectrum. We use forward simulations to argue that the observed signatures require a high rate of strongly adaptive substitutions near amino acid changes. We further demonstrate that the observed signatures of positive selection correlate better with the presence of regulatory sequences, as predicted by the ENCODE Project Consortium, than with the positions of amino acid substitutions. Our results suggest that adaptation was frequent in human evolution and provide support for the hypothesis of King and Wilson that adaptive divergence is primarily driven by regulatory changes.
Collapse
Affiliation(s)
- David Enard
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Philipp W Messer
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
215
|
Fairfax BP, Humburg P, Makino S, Naranbhai V, Wong D, Lau E, Jostins L, Plant K, Andrews R, McGee C, Knight JC. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression. Science 2014; 343:1246949. [PMID: 24604202 PMCID: PMC4064786 DOI: 10.1126/science.1246949] [Citation(s) in RCA: 553] [Impact Index Per Article: 50.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
To systematically investigate the impact of immune stimulation upon regulatory variant activity, we exposed primary monocytes from 432 healthy Europeans to interferon-γ (IFN-γ) or differing durations of lipopolysaccharide and mapped expression quantitative trait loci (eQTLs). More than half of cis-eQTLs identified, involving hundreds of genes and associated pathways, are detected specifically in stimulated monocytes. Induced innate immune activity reveals multiple master regulatory trans-eQTLs including the major histocompatibility complex (MHC), coding variants altering enzyme and receptor function, an IFN-β cytokine network showing temporal specificity, and an interferon regulatory factor 2 (IRF2) transcription factor-modulated network. Induced eQTL are significantly enriched for genome-wide association study loci, identifying context-specific associations to putative causal genes including CARD9, ATM, and IRF8. Thus, applying pathophysiologically relevant immune stimuli assists resolution of functional genetic variants.
Collapse
Affiliation(s)
- Benjamin P. Fairfax
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Peter Humburg
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Seiko Makino
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Vivek Naranbhai
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Daniel Wong
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Evelyn Lau
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Luke Jostins
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Katharine Plant
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Robert Andrews
- Wellcome Trust Sanger Institute, University of Cambridge, Hinxton CB10 1SA, UK
| | - Chris McGee
- Wellcome Trust Sanger Institute, University of Cambridge, Hinxton CB10 1SA, UK
| | - Julian C. Knight
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| |
Collapse
|
216
|
Hudson NJ, Porto-Neto LR, Kijas J, McWilliam S, Taft RJ, Reverter A. Information compression exploits patterns of genome composition to discriminate populations and highlight regions of evolutionary interest. BMC Bioinformatics 2014; 15:66. [PMID: 24606587 PMCID: PMC4015654 DOI: 10.1186/1471-2105-15-66] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Accepted: 02/26/2014] [Indexed: 11/20/2022] Open
Abstract
Background Genomic information allows population relatedness to be inferred and selected genes to be identified. Single nucleotide polymorphism microarray (SNP-chip) data, a proxy for genome composition, contains patterns in allele order and proportion. These patterns can be quantified by compression efficiency (CE). In principle, the composition of an entire genome can be represented by a CE number quantifying allele representation and order. Results We applied a compression algorithm (DEFLATE) to genome-wide high-density SNP data from 4,155 human, 1,800 cattle, 1,222 sheep, 81 dogs and 49 mice samples. All human ethnic groups can be clustered by CE and the clusters recover phylogeography based on traditional fixation index (FST) analyses. CE analysis of other mammals results in segregation by breed or species, and is sensitive to admixture and past effective population size. This clustering is a consequence of individual patterns such as runs of homozygosity. Intriguingly, a related approach can also be used to identify genomic loci that show population-specific CE segregation. A high resolution CE ‘sliding window’ scan across the human genome, organised at the population level, revealed genes known to be under evolutionary pressure. These include SLC24A5 (European and Gujarati Indian skin pigmentation), HERC2 (European eye color), LCT (European and Maasai milk digestion) and EDAR (Asian hair thickness). We also identified a set of previously unidentified loci with high population-specific CE scores including the chromatin remodeler SCMH1 in Africans and EDA2R in Asians. Closer inspection reveals that these prioritised genomic regions do not correspond to simple runs of homozygosity but rather compositionally complex regions that are shared by many individuals of a given population. Unlike FST, CE analyses do not require ab initio population comparisons and are amenable to the hemizygous X chromosome. Conclusions We conclude with a discussion of the implications of CE for a complex systems science view of genome evolution. CE allows one to clearly visualise the evolution of individual genomes and populations through a formal, mathematically-rigorous information space. Overall, CE makes a set of biological predictions, some of which are unique and await functional validation.
Collapse
Affiliation(s)
| | | | | | | | - Ryan J Taft
- Computational and Systems Biology, CSIRO Animal, Food and Health Sciences, St, Lucia, Brisbane, QLD 4067, Australia.
| | | |
Collapse
|
217
|
Stathopoulos S, Neafsey DE, Lawniczak MKN, Muskavitch MAT, Christophides GK. Genetic dissection of Anopheles gambiae gut epithelial responses to Serratia marcescens. PLoS Pathog 2014; 10:e1003897. [PMID: 24603764 PMCID: PMC3946313 DOI: 10.1371/journal.ppat.1003897] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2013] [Accepted: 12/09/2013] [Indexed: 12/29/2022] Open
Abstract
Genetic variation in the mosquito Anopheles gambiae profoundly influences its ability to transmit malaria. Mosquito gut bacteria are shown to influence the outcome of infections with Plasmodium parasites and are also thought to exert a strong drive on genetic variation through natural selection; however, a link between antibacterial effects and genetic variation is yet to emerge. Here, we combined SNP genotyping and expression profiling with phenotypic analyses of candidate genes by RNAi-mediated silencing and 454 pyrosequencing to investigate this intricate biological system. We identified 138 An. gambiae genes to be genetically associated with the outcome of Serratia marcescens infection, including the peptidoglycan recognition receptor PGRPLC that triggers activation of the antibacterial IMD/REL2 pathway and the epidermal growth factor receptor EGFR. Silencing of three genes encoding type III fibronectin domain proteins (FN3Ds) increased the Serratia load and altered the gut microbiota composition in favor of Enterobacteriaceae. These data suggest that natural genetic variation in immune-related genes can shape the bacterial population structure of the mosquito gut with high specificity. Importantly, FN3D2 encodes a homolog of the hypervariable pattern recognition receptor Dscam, suggesting that pathogen-specific recognition may involve a broader family of immune factors. Additionally, we showed that silencing the gene encoding the gustatory receptor Gr9 that is also associated with the Serratia infection phenotype drastically increased Serratia levels. The Gr9 antibacterial activity appears to be related to mosquito feeding behavior and to mostly rely on changes of neuropeptide F expression, together suggesting a behavioral immune response following Serratia infection. Our findings reveal that the mosquito response to oral Serratia infection comprises both an epithelial and a behavioral immune component. In malaria vector mosquitoes, the presence of bacteria and malaria parasites is tightly linked. Bacteria that are part of the mosquito gut ecosystem are critical modulators of the immune response elicited during infection with malaria parasites. Furthermore, responses against oral bacterial infections can affect malaria parasites. Here, we combined mosquito gut infections with the enterobacterium Serratia marcescens with genome-wide discovery and phenotypic analysis of genes involved in antibacterial responses to characterize molecular processes that control gut bacterial infections thus possibly affecting the mosquito susceptibility to infection by malaria parasites. Our data reveal complex genetic networks controlling the gut bacterial infection load and ecosystem homeostasis. These networks appear to exhibit much higher specificity toward specific classes of bacteria than previously thought and include behavioral response circuits involved in antibacterial immunity.
Collapse
Affiliation(s)
| | | | | | | | - George K. Christophides
- Department of Life Sciences, Imperial College London, London, United Kingdom
- The Cyprus Institute, Nicosia, Cyprus
- * E-mail:
| |
Collapse
|
218
|
Chantratita N, Tandhavanant S, Myers ND, Chierakul W, Robertson JD, Mahavanakul W, Singhasivanon P, Emond MJ, Peacock SJ, West TE. Screen of whole blood responses to flagellin identifies TLR5 variation associated with outcome in melioidosis. Genes Immun 2014; 15:63-71. [PMID: 24285178 PMCID: PMC3948086 DOI: 10.1038/gene.2013.60] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 09/13/2013] [Accepted: 10/18/2013] [Indexed: 02/06/2023]
Abstract
Melioidosis is a severe infection caused by the flagellated bacterium Burkholderia pseudomallei. The nonsense polymorphism TLR51174C>T is associated with improved outcome in Thais with melioidosis. We hypothesized that other TLR5 variants may modulate the host response and determine outcome in melioidosis. We genotyped 12 TLR5 variants selected de novo from the HapMap database and examined the association of each with cytokines induced by flagellin stimulation of whole blood from healthy Thai subjects. We found a blunted cytokine response for three related markers that were in linkage disequilibrium (LD) with a non-synonymous variant, TLR51846T>C. Carriers of TLR51846T>C had broadly impaired cytokine responses induced by flagellin. TLR51846T>C was associated with protection against death in melioidosis patients (odds ratio: 0.62, 95% confidence interval: 0.42-0.93, P=0.021). We observed no impairment in TLR51846C-dependent nuclear factor κB activation, however, suggesting an alternative mechanism for the effect. We found that TLR51846T>C was in strong LD with TLR51174C>T. Many of the blunted cytokine responses observed and the association of TLR51846T>C with survival in melioidosis patients may be attributable to TLR51174C>T, but we could not exclude an independent effect of TLR51846T>C. These data identify novel associations for TLR51846T>C, enhance our understanding of TLR5 genetic architecture in Thais and highlight the role of TLR5 in melioidosis.
Collapse
Affiliation(s)
- Narisara Chantratita
- Department of Microbiology and Immunology, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Sarunporn Tandhavanant
- Department of Microbiology and Immunology, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Nicolle D. Myers
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Wirongrong Chierakul
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Department of Clinical Tropical Medicine, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Johanna D. Robertson
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Weera Mahavanakul
- Department of Medicine, Sappasithiprasong Hospital, Ubon Ratchathani, Thailand
| | - Pratap Singhasivanon
- Department of Tropical Hygiene, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Mary J. Emond
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Sharon J. Peacock
- Department of Microbiology and Immunology, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
- Department of Medicine, University of Cambridge, Addenbrooke’s Hospital, Cambridge, United Kingdom
| | - T. Eoin West
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Washington, Seattle, WA, USA
- International Respiratory and Severe Illness Center, University of Washington, Seattle, WA, USA
| |
Collapse
|
219
|
Engelken J, Carnero-Montoro E, Pybus M, Andrews GK, Lalueza-Fox C, Comas D, Sekler I, de la Rasilla M, Rosas A, Stoneking M, Valverde MA, Vicente R, Bosch E. Extreme population differences in the human zinc transporter ZIP4 (SLC39A4) are explained by positive selection in Sub-Saharan Africa. PLoS Genet 2014; 10:e1004128. [PMID: 24586184 PMCID: PMC3930504 DOI: 10.1371/journal.pgen.1004128] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2013] [Accepted: 12/05/2013] [Indexed: 12/17/2022] Open
Abstract
Extreme differences in allele frequency between West Africans and Eurasians were observed for a leucine-to-valine substitution (Leu372Val) in the human intestinal zinc uptake transporter, ZIP4, yet no further evidence was found for a selective sweep around the ZIP4 gene (SLC39A4). By interrogating allele frequencies in more than 100 diverse human populations and resequencing Neanderthal DNA, we confirmed the ancestral state of this locus and found a strong geographical gradient for the derived allele (Val372), with near fixation in West Africa. In extensive coalescent simulations, we show that the extreme differences in allele frequency, yet absence of a classical sweep signature, can be explained by the effect of a local recombination hotspot, together with directional selection favoring the Val372 allele in Sub-Saharan Africans. The possible functional effect of the Leu372Val substitution, together with two pathological mutations at the same codon (Leu372Pro and Leu372Arg) that cause acrodermatitis enteropathica (a disease phenotype characterized by extreme zinc deficiency), was investigated by transient overexpression of human ZIP4 protein in HeLa cells. Both acrodermatitis mutations cause absence of the ZIP4 transporter cell surface expression and nearly absent zinc uptake, while the Val372 variant displayed significantly reduced surface protein expression, reduced basal levels of intracellular zinc, and reduced zinc uptake in comparison with the Leu372 variant. We speculate that reduced zinc uptake by the ZIP4-derived Val372 isoform may act by starving certain pathogens of zinc, and hence may have been advantageous in Sub-Saharan Africa. Moreover, these functional results may indicate differences in zinc homeostasis among modern human populations with possible relevance for disease risk.
Collapse
Affiliation(s)
- Johannes Engelken
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain ; Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Elena Carnero-Montoro
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Marc Pybus
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Glen K Andrews
- Department of Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Carles Lalueza-Fox
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - David Comas
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Israel Sekler
- Department of Physiology, Ben-Gurion University, Beer-Sheva, Israel
| | - Marco de la Rasilla
- Área de Prehistoria, Departamento de Historia, Universidad de Oviedo, Oviedo, Spain
| | - Antonio Rosas
- Group of Paleoanthropology MNCN-CSIC, Department of Paleobiology, National Museum of Natural Sciences, CSIC, Madrid, Spain
| | - Mark Stoneking
- Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Miguel A Valverde
- Laboratory of Molecular Physiology and Channelopathies, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Rubén Vicente
- Laboratory of Molecular Physiology and Channelopathies, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Elena Bosch
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
220
|
The role of microhomology in genomic structural variation. Trends Genet 2014; 30:85-94. [PMID: 24503142 DOI: 10.1016/j.tig.2014.01.001] [Citation(s) in RCA: 118] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2013] [Revised: 01/03/2014] [Accepted: 01/05/2014] [Indexed: 02/06/2023]
Abstract
Genomic structural variation, which can be defined as differences in the copy number, orientation, or location of relatively large DNA segments, is not only crucial in evolution, but also gives rise to genomic disorders. Whereas the major mechanisms that generate structural variation have been well characterised, insights into additional mechanisms are emerging from the identification of short regions of DNA sequence homology, also known as microhomology, at chromosomal breakpoints. In addition, functional studies are elucidating the characteristics of microhomology-mediated pathways, which are mutagenic. Here, we describe the features and mechanistic models of microhomology-mediated events, discuss their physiological and pathological significance, and highlight recent advances in this rapidly evolving field of research.
Collapse
|
221
|
Karlsson EK, Harris JB, Tabrizi S, Rahman A, Shlyakhter I, Patterson N, O'Dushlaine C, Schaffner SF, Gupta S, Chowdhury F, Sheikh A, Shin OS, Ellis C, Becker CE, Stuart LM, Calderwood SB, Ryan ET, Qadri F, Sabeti PC, Larocque RC. Natural selection in a bangladeshi population from the cholera-endemic ganges river delta. Sci Transl Med 2014; 5:192ra86. [PMID: 23825302 DOI: 10.1126/scitranslmed.3006338] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
As an ancient disease with high fatality, cholera has likely exerted strong selective pressure on affected human populations. We performed a genome-wide study of natural selection in a population from the Ganges River Delta, the historic geographic epicenter of cholera. We identified 305 candidate selected regions using the composite of multiple signals (CMS) method. The regions were enriched for potassium channel genes involved in cyclic adenosine monophosphate-mediated chloride secretion and for components of the innate immune system involved in nuclear factor κB (NF-κB) signaling. We demonstrate that a number of these strongly selected genes are associated with cholera susceptibility in two separate cohorts. We further identify repeated examples of selection and association in an NF-κB/inflammasome-dependent pathway that is activated in vitro by Vibrio cholerae. Our findings shed light on the genetic basis of cholera resistance in a population from the Ganges River Delta and present a promising approach for identifying genetic factors influencing susceptibility to infectious diseases.
Collapse
Affiliation(s)
- Elinor K Karlsson
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
222
|
Wu Q, Zheng P, Hu Y, Wei F. Genome-scale analysis of demographic history and adaptive selection. Protein Cell 2014; 5:99-112. [PMID: 24474201 PMCID: PMC3956981 DOI: 10.1007/s13238-013-0004-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Accepted: 11/04/2013] [Indexed: 11/25/2022] Open
Abstract
One of the main topics in population genetics is identification of adaptive selection among populations. For this purpose, population history should be correctly inferred to evaluate the effect of random drift and exclude it in selection identification. With the rapid progress in genomics in the past decade, vast genome-scale variations are available for population genetic analysis, which however requires more sophisticated models to infer species' demographic history and robust methods to detect local adaptation. Here we aim to review what have been achieved in the fields of demographic modeling and selection detection. We summarize their rationales, implementations, and some classical applications. We also propose that some widely-used methods can be improved in both theoretical and practical aspects in near future.
Collapse
Affiliation(s)
- Qi Wu
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| | - Pingping Zheng
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| | - Yibu Hu
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| | - Fuwen Wei
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| |
Collapse
|
223
|
Derived immune and ancestral pigmentation alleles in a 7,000-year-old Mesolithic European. Nature 2014; 507:225-8. [PMID: 24463515 DOI: 10.1038/nature12960] [Citation(s) in RCA: 212] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Accepted: 12/17/2013] [Indexed: 12/19/2022]
Abstract
Ancient genomic sequences have started to reveal the origin and the demographic impact of farmers from the Neolithic period spreading into Europe. The adoption of farming, stock breeding and sedentary societies during the Neolithic may have resulted in adaptive changes in genes associated with immunity and diet. However, the limited data available from earlier hunter-gatherers preclude an understanding of the selective processes associated with this crucial transition to agriculture in recent human evolution. Here we sequence an approximately 7,000-year-old Mesolithic skeleton discovered at the La Braña-Arintero site in León, Spain, to retrieve a complete pre-agricultural European human genome. Analysis of this genome in the context of other ancient samples suggests the existence of a common ancient genomic signature across western and central Eurasia from the Upper Paleolithic to the Mesolithic. The La Braña individual carries ancestral alleles in several skin pigmentation genes, suggesting that the light skin of modern Europeans was not yet ubiquitous in Mesolithic times. Moreover, we provide evidence that a significant number of derived, putatively adaptive variants associated with pathogen resistance in modern Europeans were already present in this hunter-gatherer.
Collapse
|
224
|
Ramos PS, Shaftman SR, Ward RC, Langefeld CD. Genes associated with SLE are targets of recent positive selection. Autoimmune Dis 2014; 2014:203435. [PMID: 24587899 PMCID: PMC3920976 DOI: 10.1155/2014/203435] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2013] [Accepted: 11/12/2013] [Indexed: 01/03/2023] Open
Abstract
The reasons for the ethnic disparities in the prevalence of systemic lupus erythematosus (SLE) and the relative high frequency of SLE risk alleles in the population are not fully understood. Population genetic factors such as natural selection alter allele frequencies over generations and may help explain the persistence of such common risk variants in the population and the differential risk of SLE. In order to better understand the genetic basis of SLE that might be due to natural selection, a total of 74 genomic regions with compelling evidence for association with SLE were tested for evidence of recent positive selection in the HapMap and HGDP populations, using population differentiation, allele frequency, and haplotype-based tests. Consistent signs of positive selection across different studies and statistical methods were observed at several SLE-associated loci, including PTPN22, TNFSF4, TET3-DGUOK, TNIP1, UHRF1BP1, BLK, and ITGAM genes. This study is the first to evaluate and report that several SLE-associated regions show signs of positive natural selection. These results provide corroborating evidence in support of recent positive selection as one mechanism underlying the elevated population frequency of SLE risk loci and supports future research that integrates signals of natural selection to help identify functional SLE risk alleles.
Collapse
Affiliation(s)
- Paula S. Ramos
- Department of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
| | - Stephanie R. Shaftman
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC 29425, USA
| | - Ralph C. Ward
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC 29425, USA
| | - Carl D. Langefeld
- Department of Public Health Sciences, Wake Forest School of Medicine and Center for Public Health Genomics, Winston-Salem, NC 27157, USA
| |
Collapse
|
225
|
Weber AN, Försti A. Toll-like receptor genetic variants and colorectal cancer. Oncoimmunology 2014; 3:e27763. [PMID: 24790794 PMCID: PMC4004619 DOI: 10.4161/onci.27763] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 01/08/2014] [Indexed: 11/19/2022] Open
Abstract
Single-nucleotide polymorphisms in Toll-like receptor 5 (TLR5), encoding a sensor for flagellin, have been shown to influence cytokine responses to intestinal bacteria and to be associated with significant alterations in the survival of colorectal carcinoma (CRC) patients. These findings point to a link between TLRs and CRC that may have both therapeutic and prognostic/predictive implications.
Collapse
Affiliation(s)
- Alexander Nr Weber
- Interfaculty Institute for Cell Biology; Department of Immunology; University of Tübingen; Tübingen, Germany
| | - Asta Försti
- Division of Molecular Genetic Epidemiology; German Cancer Research Center (DKFZ); Heidelberg, Germany ; Center for Primary Health Care Research; Clinical Research Center; Lund University; Malmö, Sweden
| |
Collapse
|
226
|
Li MJ, Wang LY, Xia Z, Wong MP, Sham PC, Wang J. dbPSHP: a database of recent positive selection across human populations. Nucleic Acids Res 2014; 42:D910-6. [PMID: 24194603 PMCID: PMC3965004 DOI: 10.1093/nar/gkt1052] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2013] [Revised: 10/04/2013] [Accepted: 10/11/2013] [Indexed: 12/31/2022] Open
Abstract
The dbPSHP database (http://jjwanglab.org/dbpshp) aims to help researchers to efficiently identify, validate and visualize putative positively selected loci in human evolution and further discover the mechanism governing these natural selections. Recent evolution of human populations at the genomic level reflects the adaptations to the living environments, including climate change and availability and stability of nutrients. Many genetic regions under positive selection have been identified, which assist us to understand how natural selection has shaped population differences. Here, we manually collect recent positive selections in different human populations, consisting of 15,472 loci from 132 publications. We further compiled a database that used 15 statistical terms of different evolutionary attributes for single nucleotide variant sites from the HapMap 3 and 1000 Genomes Project to identify putative regions under positive selection. These attributes include variant allele/genotype properties, variant heterozygosity, within population diversity, long-range haplotypes, pairwise population differentiation and evolutionary conservation. We also provide interactive pages for visualization and annotation of different selective signals. The database is freely available to the public and will be frequently updated.
Collapse
Affiliation(s)
- Mulin Jun Li
- Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, Guangdong 518057, China, Department of Anaesthesiology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Pathology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, State Key Laboratory in Cognitive and Brain Sciences, The University of Hong Kong, Hong Kong SAR, China and Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Lily Yan Wang
- Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, Guangdong 518057, China, Department of Anaesthesiology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Pathology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, State Key Laboratory in Cognitive and Brain Sciences, The University of Hong Kong, Hong Kong SAR, China and Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Zhengyuan Xia
- Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, Guangdong 518057, China, Department of Anaesthesiology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Pathology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, State Key Laboratory in Cognitive and Brain Sciences, The University of Hong Kong, Hong Kong SAR, China and Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Maria P. Wong
- Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, Guangdong 518057, China, Department of Anaesthesiology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Pathology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, State Key Laboratory in Cognitive and Brain Sciences, The University of Hong Kong, Hong Kong SAR, China and Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Pak Chung Sham
- Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, Guangdong 518057, China, Department of Anaesthesiology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Pathology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, State Key Laboratory in Cognitive and Brain Sciences, The University of Hong Kong, Hong Kong SAR, China and Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Junwen Wang
- Department of Biochemistry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Shenzhen Institute of Research and Innovation, The University of Hong Kong, Shenzhen, Guangdong 518057, China, Department of Anaesthesiology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Pathology, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, Department of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China, State Key Laboratory in Cognitive and Brain Sciences, The University of Hong Kong, Hong Kong SAR, China and Centre for Genomic Sciences, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
227
|
Stäubert C, Le Duc D, Schöneberg T. Examining the Dynamic Evolution of G Protein-Coupled Receptors. METHODS IN PHARMACOLOGY AND TOXICOLOGY 2014. [DOI: 10.1007/978-1-62703-779-2_2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
|
228
|
Moore CB, Wallace JR, Wolfe DJ, Frase AT, Pendergrass SA, Weiss KM, Ritchie MD. Low frequency variants, collapsed based on biological knowledge, uncover complexity of population stratification in 1000 genomes project data. PLoS Genet 2013; 9:e1003959. [PMID: 24385916 PMCID: PMC3873241 DOI: 10.1371/journal.pgen.1003959] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2013] [Accepted: 10/01/2013] [Indexed: 12/13/2022] Open
Abstract
Analyses investigating low frequency variants have the potential for explaining additional genetic heritability of many complex human traits. However, the natural frequencies of rare variation between human populations strongly confound genetic analyses. We have applied a novel collapsing method to identify biological features with low frequency variant burden differences in thirteen populations sequenced by the 1000 Genomes Project. Our flexible collapsing tool utilizes expert biological knowledge from multiple publicly available database sources to direct feature selection. Variants were collapsed according to genetically driven features, such as evolutionary conserved regions, regulatory regions genes, and pathways. We have conducted an extensive comparison of low frequency variant burden differences (MAF<0.03) between populations from 1000 Genomes Project Phase I data. We found that on average 26.87% of gene bins, 35.47% of intergenic bins, 42.85% of pathway bins, 14.86% of ORegAnno regulatory bins, and 5.97% of evolutionary conserved regions show statistically significant differences in low frequency variant burden across populations from the 1000 Genomes Project. The proportion of bins with significant differences in low frequency burden depends on the ancestral similarity of the two populations compared and types of features tested. Even closely related populations had notable differences in low frequency burden, but fewer differences than populations from different continents. Furthermore, conserved or functionally relevant regions had fewer significant differences in low frequency burden than regions under less evolutionary constraint. This degree of low frequency variant differentiation across diverse populations and feature elements highlights the critical importance of considering population stratification in the new era of DNA sequencing and low frequency variant genomic analyses. Low frequency variants are likely to play an important role in uncovering complex trait heritability; however, they are often continent or population specific. This specificity complicates genetic analyses investigating low frequency variants for two reasons: low frequency variant signals in an association test are often difficult to generalize beyond a single population or continental group, and there is an increase in false positive results in association analyses due to underlying population stratification. In order to reveal the magnitude of low frequency population stratification, we performed pairwise population comparisons using the 1000 Genomes Project Phase I data to investigate differences in low frequency variant burden across multiple biological features. We found that low frequency variant confounding is much more prevalent than one might expect, even within continental groups. The proportion of significant differences in low frequency variant burden was also dependent on the region of interest; for example, annotated regulatory regions showed fewer low frequency burden differences between populations than intergenic regions. Knowledge of population structure and the genomic landscape in a region of interest are important factors in determining the extent of confounding due to population stratification in a low frequency genomic analysis.
Collapse
Affiliation(s)
- Carrie B. Moore
- Center for Human Genetic Research, Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
| | - John R. Wallace
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
| | - Daniel J. Wolfe
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
| | - Alex T. Frase
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
| | - Sarah A. Pendergrass
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
| | - Kenneth M. Weiss
- Department of Anthropology, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Marylyn D. Ritchie
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, Eberly College of Science, The Huck Institutes of the Life Sciences, University Park, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
229
|
Abstract
The recent availability of genomic data has spurred many genome-wide studies of human adaptation in different populations worldwide. Such studies have provided insights into novel candidate genes and pathways that are putatively involved in adaptation to different environments, diets and disease prevalence. However, much work is needed to translate these results into candidate adaptive variants that are biologically interpretable. In this Review, we discuss methods that may help to identify true biological signals of selection and studies that incorporate complementary phenotypic and functional data. We conclude with recommendations for future studies that focus on opportunities to use integrative genomics methodologies in human adaptation studies.
Collapse
|
230
|
Moura de Sousa JA, Campos PRA, Gordo I. An ABC method for estimating the rate and distribution of effects of beneficial mutations. Genome Biol Evol 2013; 5:794-806. [PMID: 23542207 PMCID: PMC3673657 DOI: 10.1093/gbe/evt045] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Determining the distribution of adaptive mutations available to natural selection is a
difficult task. These are rare events and most of them are lost by chance. Some
theoretical works propose that the distribution of newly arising beneficial mutations
should be close to exponential. Empirical data are scarce and do not always support an
exponential distribution. Analysis of the dynamics of adaptation in asexual populations of
microorganisms has revealed that these can be summarized by two effective parameters, the
effective mutation rate, Ue, and the effective selection
coefficient of a beneficial mutation, Se. Here, we show that
these effective parameters will not always reflect the rate and mean effect of beneficial
mutations, especially when the distribution of arising mutations has high variance, and
the mutation rate is high. We propose a method to estimate the distribution of arising
beneficial mutations, which is motivated by a common experimental setup. The method, which
we call One Biallelic Marker Approximate Bayesian Computation, makes use of experimental
data consisting of periodic measures of neutral marker frequencies and mean population
fitness. Using simulations, we find that this method allows the discrimination of the
shape of the distribution of arising mutations and that it provides reasonable estimates
of their rates and mean effects in ranges of the parameter space that may be of biological
relevance.
Collapse
|
231
|
Zhang G, Muglia LJ, Chakraborty R, Akey JM, Williams SM. Signatures of natural selection on genetic variants affecting complex human traits. Appl Transl Genom 2013; 2:78-94. [PMID: 27896059 PMCID: PMC5121263 DOI: 10.1016/j.atg.2013.10.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2013] [Accepted: 10/14/2013] [Indexed: 01/04/2023]
Abstract
It has recently been hypothesized that polygenic adaptation, resulting in modest allele frequency changes at many loci, could be a major mechanism behind the adaptation of complex phenotypes in human populations. Here we leverage the large number of variants that have been identified through genome-wide association (GWA) studies to comprehensively study signatures of natural selection on genetic variants associated with complex traits. Using population differentiation based methods, such as FST and phylogenetic branch length analyses, we systematically examined nearly 1300 SNPs associated with 38 complex phenotypes. Instead of detecting selection signatures at individual variants, we aimed to identify combined evidence of natural selection by aggregating signals across many trait associated SNPs. Our results have revealed some general features of polygenic selection on complex traits associated variants. First, natural selection acting on standing variants associated with complex traits is a common phenomenon. Second, characteristics of selection for different polygenic traits vary both temporarily and geographically. Third, some studied traits (e.g. height and urate level) could have been the primary targets of selection, as indicated by the significant correlation between the effect sizes and the estimated strength of selection in the trait associated variants; however, for most traits, the allele frequency changes in trait associated variants might have been driven by the selection on other correlated phenotypes. Fourth, the changes in allele frequencies as a result of selection can be highly stochastic, such that, polygenic adaptation may accelerate differentiation in allele frequencies among populations, but generally does not produce predictable directional changes. Fifth, multiple mechanisms (pleiotropy, hitchhiking, etc) may act together to govern the changes in allele frequencies of genetic variants associated with complex traits.
Collapse
Affiliation(s)
- Ge Zhang
- Human Genetics Division, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Louis J. Muglia
- Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children's Hospital Medical Center and March of Dimes Prematurity Research Center Ohio Collaborative, Cincinnati, OH, USA
| | - Ranajit Chakraborty
- Center for Computational Genomics, Institute of Applied Genetics, University of North Texas Health Science Center, Fort Worth, TX, USA
| | - Joshua M. Akey
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Scott M. Williams
- Department of Genetics and Institute for Quantitative Biomedical Sciences, Geisel School of Medicine, Dartmouth College, Hanover, NH, USA
| |
Collapse
|
232
|
Pybus M, Dall'Olio GM, Luisi P, Uzkudun M, Carreño-Torres A, Pavlidis P, Laayouni H, Bertranpetit J, Engelken J. 1000 Genomes Selection Browser 1.0: a genome browser dedicated to signatures of natural selection in modern humans. Nucleic Acids Res 2013; 42:D903-9. [PMID: 24275494 PMCID: PMC3965045 DOI: 10.1093/nar/gkt1188] [Citation(s) in RCA: 111] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Searching for Darwinian selection in natural populations has been the focus of a multitude of studies over the last decades. Here we present the 1000 Genomes Selection Browser 1.0 (http://hsb.upf.edu) as a resource for signatures of recent natural selection in modern humans. We have implemented and applied a large number of neutrality tests as well as summary statistics informative for the action of selection such as Tajima’s D, CLR, Fay and Wu’s H, Fu and Li’s F* and D*, XPEHH, ΔiHH, iHS, FST, ΔDAF and XPCLR among others to low coverage sequencing data from the 1000 genomes project (Phase 1; release April 2012). We have implemented a publicly available genome-wide browser to communicate the results from three different populations of West African, Northern European and East Asian ancestry (YRI, CEU, CHB). Information is provided in UCSC-style format to facilitate the integration with the rich UCSC browser tracks and an access page is provided with instructions and for convenient visualization. We believe that this expandable resource will facilitate the interpretation of signals of selection on different temporal, geographical and genomic scales.
Collapse
Affiliation(s)
- Marc Pybus
- Program for Population Genetics, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain, Population Genomics Node, National Institute for Bioinformatics (INB), Universitat Pompeu Fabra, 08003 Barcelona, Spain, Institute of Molecular Biology and Biotechnology-FORTH, Heraklion, Crete GR 700 13, Greece and Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
233
|
Lachance J, Tishkoff SA. Population Genomics of Human Adaptation. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2013; 44:123-143. [PMID: 25383060 DOI: 10.1146/annurev-ecolsys-110512-135833] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Recent advances in genotyping technologies have facilitated genome-wide scans for natural selection. Identification of targets of natural selection will shed light on processes of human adaptation and evolution and could be important for identifying variation that influences both normal human phenotypic variation as well as disease susceptibility. Here we focus on studies of natural selection in modern humans who originated ~200,000 years go in Africa and migrated across the globe ~50,000 - 100,000 years ago. Movement into new environments, as well as changes in culture and technology including plant and animal domestication, resulted in local adaptation to diverse environments. We summarize statistical approaches for detecting targets of natural selection and for distinguishing the effects of demographic history from natural selection. On a genome-wide scale, immune-related genes appear to be major targets of positive selection. Genes associated with reproduction and fertility also appear to be fast evolving. Additional examples of recent human adaptation include genes associated with lactase persistence, eccrine glands, and response to hypoxia. Lastly, we emphasize the need to supplement scans of selection with functional studies to demonstrate the physiologic impact of candidate loci.
Collapse
Affiliation(s)
- Joseph Lachance
- Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA 19104 USA
| | - Sarah A Tishkoff
- Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA 19104 USA
| |
Collapse
|
234
|
Hnisz D, Abraham BJ, Lee TI, Lau A, Saint-André V, Sigova AA, Hoke H, Young RA. Super-enhancers in the control of cell identity and disease. Cell 2013; 155:934-47. [PMID: 24119843 PMCID: PMC3841062 DOI: 10.1016/j.cell.2013.09.053] [Citation(s) in RCA: 2533] [Impact Index Per Article: 211.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2013] [Revised: 09/27/2013] [Accepted: 09/30/2013] [Indexed: 02/07/2023]
Abstract
Super-enhancers are large clusters of transcriptional enhancers that drive expression of genes that define cell identity. Improved understanding of the roles that super-enhancers play in biology would be afforded by knowing the constellation of factors that constitute these domains and by identifying super-enhancers across the spectrum of human cell types. We describe here the population of transcription factors, cofactors, chromatin regulators, and transcription apparatus occupying super-enhancers in embryonic stem cells and evidence that super-enhancers are highly transcribed. We produce a catalog of super-enhancers in a broad range of human cell types and find that super-enhancers associate with genes that control and define the biology of these cells. Interestingly, disease-associated variation is especially enriched in the super-enhancers of disease-relevant cell types. Furthermore, we find that cancer cells generate super-enhancers at oncogenes and other genes important in tumor pathogenesis. Thus, super-enhancers play key roles in human cell identity in health and in disease.
Collapse
Affiliation(s)
- Denes Hnisz
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
| | - Brian J. Abraham
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
| | - Tong Ihn Lee
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
| | - Ashley Lau
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139
| | - Violaine Saint-André
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
| | - Alla A. Sigova
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
| | - Heather Hoke
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139
| | - Richard A. Young
- Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139
| |
Collapse
|
235
|
Affiliation(s)
- Cees BM Oudejans
- Department of Clinical Chemistry, VU University Medical Center, Amsterdam, the Netherlands
| |
Collapse
|
236
|
Campbell MC, Ranciaro A, Zinshteyn D, Rawlings-Goss R, Hirbo J, Thompson S, Woldemeskel D, Froment A, Rucker JB, Omar SA, Bodo JM, Nyambo T, Belay G, Drayna D, Breslin PAS, Tishkoff SA. Origin and differential selection of allelic variation at TAS2R16 associated with salicin bitter taste sensitivity in Africa. Mol Biol Evol 2013; 31:288-302. [PMID: 24177185 DOI: 10.1093/molbev/mst211] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Bitter taste perception influences human nutrition and health, and the genetic variation underlying this trait may play a role in disease susceptibility. To better understand the genetic architecture and patterns of phenotypic variability of bitter taste perception, we sequenced a 996 bp region, encompassing the coding exon of TAS2R16, a bitter taste receptor gene, in 595 individuals from 74 African populations and in 94 non-Africans from 11 populations. We also performed genotype-phenotype association analyses of threshold levels of sensitivity to salicin, a bitter anti-inflammatory compound, in 296 individuals from Central and East Africa. In addition, we characterized TAS2R16 mutants in vitro to investigate the effects of polymorphic loci identified at this locus on receptor function. Here, we report striking signatures of positive selection, including significant Fay and Wu's H statistics predominantly in East Africa, indicating strong local adaptation and greater genetic structure among African populations than expected under neutrality. Furthermore, we observed a "star-like" phylogeny for haplotypes with the derived allele at polymorphic site 516 associated with increased bitter taste perception that is consistent with a model of selection for "high-sensitivity" variation. In contrast, haplotypes carrying the "low-sensitivity" ancestral allele at site 516 showed evidence of strong purifying selection. We also demonstrated, for the first time, the functional effect of nonsynonymous variation at site 516 on salicin phenotypic variance in vivo in diverse Africans and showed that most other nonsynonymous substitutions have weak or no effect on cell surface expression in vitro, suggesting that one main polymorphism at TAS2R16 influences salicin recognition. Additionally, we detected geographic differences in levels of bitter taste perception in Africa not previously reported and infer an East African origin for high salicin sensitivity in human populations.
Collapse
|
237
|
Klimosch SN, Försti A, Eckert J, Knežević J, Bevier M, von Schönfels W, Heits N, Walter J, Hinz S, Lascorz J, Hampe J, Hartl D, Frick JS, Hemminki K, Schafmayer C, Weber AN. Functional TLR5 Genetic Variants Affect Human Colorectal Cancer Survival. Cancer Res 2013; 73:7232-42. [DOI: 10.1158/0008-5472.can-13-1746] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
238
|
Abstract
When the human genome project started, the major challenge was how to sequence a 3 billion letter code in an organized and cost-effective manner. When completed, the project had laid the foundation for a huge variety of biomedical fields through the production of a complete human genome sequence, but also had driven the development of laboratory and analytical methods that could produce large amounts of sequencing data cheaply. These technological developments made possible the sequencing of many more vertebrate genomes, which have been necessary for the interpretation of the human genome. They have also enabled large-scale studies of vertebrate genome evolution, as well as comparative and human medicine. In this review, we give examples of evolutionary analysis using a wide variety of time frames—from the comparison of populations within a species to the comparison of species separated by at least 300 million years. Furthermore, we anticipate discoveries related to evolutionary mechanisms, adaptation, and disease to quickly accelerate in the coming years.
Collapse
Affiliation(s)
- Jessica Alföldi
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | | |
Collapse
|
239
|
Wolfe D, Dudek S, Ritchie MD, Pendergrass SA. Visualizing genomic information across chromosomes with PhenoGram. BioData Min 2013; 6:18. [PMID: 24131735 PMCID: PMC4015356 DOI: 10.1186/1756-0381-6-18] [Citation(s) in RCA: 144] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2013] [Accepted: 10/02/2013] [Indexed: 11/11/2022] Open
Abstract
Background With the abundance of information and analysis results being collected for genetic loci, user-friendly and flexible data visualization approaches can inform and improve the analysis and dissemination of these data. A chromosomal ideogram is an idealized graphic representation of chromosomes. Ideograms can be combined with overlaid points, lines, and/or shapes, to provide summary information from studies of various kinds, such as genome-wide association studies or phenome-wide association studies, coupled with genomic location information. To facilitate visualizing varied data in multiple ways using ideograms, we have developed a flexible software tool called PhenoGram which exists as a web-based tool and also a command-line program. Results With PhenoGram researchers can create chomosomal ideograms annotated with lines in color at specific base-pair locations, or colored base-pair to base-pair regions, with or without other annotation. PhenoGram allows for annotation of chromosomal locations and/or regions with shapes in different colors, gene identifiers, or other text. PhenoGram also allows for creation of plots showing expanded chromosomal locations, providing a way to show results for specific chromosomal regions in greater detail. We have now used PhenoGram to produce a variety of different plots, and provide these as examples herein. These plots include visualization of the genomic coverage of SNPs from a genotyping array, highlighting the chromosomal coverage of imputed SNPs, copy-number variation region coverage, as well as plots similar to the NHGRI GWA Catalog of genome-wide association results. Conclusions PhenoGram is a versatile, user-friendly software tool fostering the exploration and sharing of genomic information. Through visualization of data, researchers can both explore and share complex results, facilitating a greater understanding of these data.
Collapse
Affiliation(s)
| | | | - Marylyn D Ritchie
- Center for Systems Genomics, Department of Biochemistry and Molecular Biology, Eberly College of Science, The Huck Institutes of the Life Sciences, The Pennsylvania State University, 512 Wartik Laboratory, University Park, PA 16802, USA.
| | | |
Collapse
|
240
|
Khurana E, Fu Y, Colonna V, Mu XJ, Kang HM, Lappalainen T, Sboner A, Lochovsky L, Chen J, Harmanci A, Das J, Abyzov A, Balasubramanian S, Beal K, Chakravarty D, Challis D, Chen Y, Clarke D, Clarke L, Cunningham F, Evani US, Flicek P, Fragoza R, Garrison E, Gibbs R, Gümüş ZH, Herrero J, Kitabayashi N, Kong Y, Lage K, Liluashvili V, Lipkin SM, MacArthur DG, Marth G, Muzny D, Pers TH, Ritchie GRS, Rosenfeld JA, Sisu C, Wei X, Wilson M, Xue Y, Yu F, Dermitzakis ET, Yu H, Rubin MA, Tyler-Smith C, Gerstein M. Integrative annotation of variants from 1092 humans: application to cancer genomics. Science 2013; 342:1235587. [PMID: 24092746 PMCID: PMC3947637 DOI: 10.1126/science.1235587] [Citation(s) in RCA: 270] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Interpreting variants, especially noncoding ones, in the increasing number of personal genomes is challenging. We used patterns of polymorphisms in functionally annotated regions in 1092 humans to identify deleterious variants; then we experimentally validated candidates. We analyzed both coding and noncoding regions, with the former corroborating the latter. We found regions particularly sensitive to mutations ("ultrasensitive") and variants that are disruptive because of mechanistic effects on transcription-factor binding (that is, "motif-breakers"). We also found variants in regions with higher network centrality tend to be deleterious. Insertions and deletions followed a similar pattern to single-nucleotide variants, with some notable exceptions (e.g., certain deletions and enhancers). On the basis of these patterns, we developed a computational tool (FunSeq), whose application to ~90 cancer genomes reveals nearly a hundred candidate noncoding drivers.
Collapse
Affiliation(s)
- Ekta Khurana
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, CT 06520, USA
| | - Yao Fu
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
| | - Vincenza Colonna
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus,
Cambridge, CB10 1SA, UK
- Institute of Genetics and Biophysics, National Research Council
(CNR), 80131 Naples, Italy
| | - Xinmeng Jasmine Mu
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
| | - Hyun Min Kang
- Center for Statistical Genetics, Biostatistics, University of
Michigan, Ann Arbor, MI 48109, USA
| | - Tuuli Lappalainen
- Department of Genetic Medicine and Development, University of Geneva
Medical School, 1211 Geneva, Switzerland
- Institute for Genetics and Genomics in Geneva (iGE3), University of
Geneva, 1211 Geneva, Switzerland
- Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Andrea Sboner
- Institute for Precision Medicine and the Department of Pathology and
Laboratory Medicine, Weill Cornell Medical College and New York-Presbyterian
Hospital, New York, NY 10065, USA
- The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute
for Computational Biomedicine, Weill Cornell Medical College, New York, NY 10021,
USA
| | - Lucas Lochovsky
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
| | - Jieming Chen
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Integrated Graduate Program in Physical and Engineering Biology,
Yale University, New Haven, CT 06520, USA
| | - Arif Harmanci
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, CT 06520, USA
| | - Jishnu Das
- Department of Biological Statistics and Computational Biology,
Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University,
Ithaca, NY 14853, USA
| | - Alexej Abyzov
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, CT 06520, USA
| | - Suganthi Balasubramanian
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, CT 06520, USA
| | - Kathryn Beal
- European Molecular Biology Laboratory, European Bioinformatics
Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Dimple Chakravarty
- Institute for Precision Medicine and the Department of Pathology and
Laboratory Medicine, Weill Cornell Medical College and New York-Presbyterian
Hospital, New York, NY 10065, USA
| | - Daniel Challis
- Baylor College of Medicine, Human Genome Sequencing Center,
Houston, TX 77030, USA
| | - Yuan Chen
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus,
Cambridge, CB10 1SA, UK
| | - Declan Clarke
- Department of Chemistry, Yale University, New Haven, CT 06520, USA
| | - Laura Clarke
- European Molecular Biology Laboratory, European Bioinformatics
Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Fiona Cunningham
- European Molecular Biology Laboratory, European Bioinformatics
Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Uday S. Evani
- Baylor College of Medicine, Human Genome Sequencing Center,
Houston, TX 77030, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics
Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Robert Fragoza
- Weill Institute for Cell and Molecular Biology, Cornell University,
Ithaca, NY 14853, USA
- Department of Molecular Biology and Genetics, Cornell University,
Ithaca, NY 14853, USA
| | - Erik Garrison
- Department of Biology, Boston College, Chestnut Hill, MA 02467, USA
| | - Richard Gibbs
- Baylor College of Medicine, Human Genome Sequencing Center,
Houston, TX 77030, USA
| | - Zeynep H. Gümüş
- The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute
for Computational Biomedicine, Weill Cornell Medical College, New York, NY 10021,
USA
- Department of Physiology and Biophysics, Weill Cornell Medical
College, New York, NY, 10065, USA
| | - Javier Herrero
- European Molecular Biology Laboratory, European Bioinformatics
Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Naoki Kitabayashi
- Institute for Precision Medicine and the Department of Pathology and
Laboratory Medicine, Weill Cornell Medical College and New York-Presbyterian
Hospital, New York, NY 10065, USA
| | - Yong Kong
- Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, CT 06520, USA
- Keck Biotechnology Resource Laboratory, Yale University, New Haven,
CT 06511, USA
| | - Kasper Lage
- Pediatric Surgical Research Laboratories, MassGeneral Hospital for
Children, Massachusetts General Hospital, Boston, MA 02114, USA
- Analytical and Translational Genetics Unit, Massachusetts General
Hospital, Boston, MA 02114, USA
- Harvard Medical School, Boston, MA 02115, USA
- Center for Biological Sequence Analysis, Department of Systems
Biology, Technical University of Denmark, Lyngby, Denmark
- Center for Protein Research, University of Copenhagen, Copenhagen,
Denmark
| | - Vaja Liluashvili
- The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute
for Computational Biomedicine, Weill Cornell Medical College, New York, NY 10021,
USA
- Department of Physiology and Biophysics, Weill Cornell Medical
College, New York, NY, 10065, USA
| | - Steven M. Lipkin
- Department of Medicine, Weill Cornell Medical College, New York, NY
10065, USA
| | - Daniel G. MacArthur
- Analytical and Translational Genetics Unit, Massachusetts General
Hospital, Boston, MA 02114, USA
- Program in Medical and Population Genetics, Broad Institute of
Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA 02142,
USA
| | - Gabor Marth
- Department of Biology, Boston College, Chestnut Hill, MA 02467, USA
| | - Donna Muzny
- Baylor College of Medicine, Human Genome Sequencing Center,
Houston, TX 77030, USA
| | - Tune H. Pers
- Center for Biological Sequence Analysis, Department of Systems
Biology, Technical University of Denmark, Lyngby, Denmark
- Division of Endocrinology and Center for Basic and Translational
Obesity Research, Children’s Hospital, Boston, MA 02115, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Graham R. S. Ritchie
- European Molecular Biology Laboratory, European Bioinformatics
Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jeffrey A. Rosenfeld
- Department of Medicine, Rutgers New Jersey Medical School, Newark,
NJ 07101, USA
- IST/High Performance and Research Computing, Rutgers University
Newark, NJ 07101, USA
- Sackler Institute for Comparative Genomics, American Museum of
Natural History, New York, NY 10024, USA
| | - Cristina Sisu
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, CT 06520, USA
| | - Xiaomu Wei
- Weill Institute for Cell and Molecular Biology, Cornell University,
Ithaca, NY 14853, USA
- Department of Medicine, Weill Cornell Medical College, New York, NY
10065, USA
| | - Michael Wilson
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Child Study Center, Yale University, New Haven, CT 06520, USA
| | - Yali Xue
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus,
Cambridge, CB10 1SA, UK
| | - Fuli Yu
- Baylor College of Medicine, Human Genome Sequencing Center,
Houston, TX 77030, USA
| | | | - Emmanouil T. Dermitzakis
- Department of Genetic Medicine and Development, University of Geneva
Medical School, 1211 Geneva, Switzerland
- Institute for Genetics and Genomics in Geneva (iGE3), University of
Geneva, 1211 Geneva, Switzerland
- Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland
| | - Haiyuan Yu
- Department of Biological Statistics and Computational Biology,
Cornell University, Ithaca, NY 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University,
Ithaca, NY 14853, USA
| | - Mark A. Rubin
- Institute for Precision Medicine and the Department of Pathology and
Laboratory Medicine, Weill Cornell Medical College and New York-Presbyterian
Hospital, New York, NY 10065, USA
| | - Chris Tyler-Smith
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus,
Cambridge, CB10 1SA, UK
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale
University, New Haven, CT 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, CT 06520, USA
- Department of Computer Science, Yale University, New Haven, CT
06520, USA
| |
Collapse
|
241
|
Brown EA, Ruvolo M, Sabeti PC. Many ways to die, one way to arrive: how selection acts through pregnancy. Trends Genet 2013; 29:585-92. [DOI: 10.1016/j.tig.2013.03.001] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2012] [Revised: 02/15/2013] [Accepted: 03/08/2013] [Indexed: 01/24/2023]
|
242
|
Hudjashov G, Villems R, Kivisild T. Global patterns of diversity and selection in human tyrosinase gene. PLoS One 2013; 8:e74307. [PMID: 24040225 PMCID: PMC3770694 DOI: 10.1371/journal.pone.0074307] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2013] [Accepted: 07/31/2013] [Indexed: 01/09/2023] Open
Abstract
Global variation in skin pigmentation is one of the most striking examples of environmental adaptation in humans. More than two hundred loci have been identified as candidate genes in model organisms and a few tens of these have been found to be significantly associated with human skin pigmentation in genome-wide association studies. However, the evolutionary history of different pigmentation genes is rather complex: some loci have been subjected to strong positive selection, while others evolved under the relaxation of functional constraints in low UV environment. Here we report the results of a global study of the human tyrosinase gene, which is one of the key enzymes in melanin production, to assess the role of its variation in the evolution of skin pigmentation differences among human populations. We observe a higher rate of non-synonymous polymorphisms in the European sample consistent with the relaxation of selective constraints. A similar pattern was previously observed in the MC1R gene and concurs with UV radiation-driven model of skin color evolution by which mutations leading to lower melanin levels and decreased photoprotection are subject to purifying selection at low latitudes while being tolerated or even favored at higher latitudes because they facilitate UV-dependent vitamin D production. Our coalescent date estimates suggest that the non-synonymous variants, which are frequent in Europe and North Africa, are recent and have emerged after the separation of East and West Eurasian populations.
Collapse
Affiliation(s)
- Georgi Hudjashov
- Evolutionary Biology Group, Estonian Biocentre, Tartu, Estonia
- Department of Evolutionary Biology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- * E-mail:
| | - Richard Villems
- Evolutionary Biology Group, Estonian Biocentre, Tartu, Estonia
- Department of Evolutionary Biology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- Estonian Academy of Sciences, Tallinn, Estonia
| | - Toomas Kivisild
- Evolutionary Biology Group, Estonian Biocentre, Tartu, Estonia
- Department of Evolutionary Biology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- Division of Biological Anthropology, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
243
|
Jones JC, Freeman GJ. Costimulatory genes: hotspots of conflict between host defense and autoimmunity. Immunity 2013; 38:1083-5. [PMID: 23809156 DOI: 10.1016/j.immuni.2013.06.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
To understand the adaptations of costimulatory molecules through mammalian evolution, Forni et al. (Forni et al., 2013) studied evolutionary selection in key costimulatory genes. Their results, presented in this issue of Immunity, suggest that the risk of autoimmmunity is balanced against efficacy of the anti-pathogen immune response.
Collapse
Affiliation(s)
- Jennifer C Jones
- Vaccine Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
| | | |
Collapse
|
244
|
Quach H, Wilson D, Laval G, Patin E, Manry J, Guibert J, Barreiro LB, Nerrienet E, Verschoor E, Gessain A, Przeworski M, Quintana-Murci L. Different selective pressures shape the evolution of Toll-like receptors in human and African great ape populations. Hum Mol Genet 2013; 22:4829-40. [PMID: 23851028 DOI: 10.1093/hmg/ddt335] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The study of the genetic and selective landscape of immunity genes across primates can provide insight into the existing differences in susceptibility to infection observed between human and non-human primates. Here, we explored how selection has driven the evolution of a key family of innate immunity receptors, the Toll-like receptors (TLRs), in African great ape species. We sequenced the 10 TLRs in various populations of chimpanzees and gorillas, and analysed these data jointly with a human data set. We found that purifying selection has been more pervasive in great apes than in humans. Furthermore, in chimpanzees and gorillas, purifying selection has targeted TLRs irrespectively of whether they are endosomal or cell surface, in contrast to humans where strong selective constraints are restricted to endosomal TLRs. These observations suggest important differences in the relative importance of TLR-mediated pathogen sensing, such as that of recognition of flagellated bacteria by TLR5, between humans and great apes. Lastly, we used a population genetics-phylogenetics method that jointly analyses polymorphism and divergence data to detect fine-scale variation in selection pressures at specific codons within TLR genes. We identified different codons at different TLRs as being under positive selection in each species, highlighting that functional variation at these genes has conferred a selective advantage in immunity to infection to specific primate species. Overall, this study showed that the degree of selection driving the evolution of TLRs has largely differed between human and non-human primates, increasing our knowledge on their respective biological contribution to host defence in the natural setting.
Collapse
|
245
|
Hider JL, Gittelman RM, Shah T, Edwards M, Rosenbloom A, Akey JM, Parra EJ. Exploring signatures of positive selection in pigmentation candidate genes in populations of East Asian ancestry. BMC Evol Biol 2013; 13:150. [PMID: 23848512 PMCID: PMC3727976 DOI: 10.1186/1471-2148-13-150] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2013] [Accepted: 07/05/2013] [Indexed: 11/17/2022] Open
Abstract
Background Currently, there is very limited knowledge about the genes involved in normal pigmentation variation in East Asian populations. We carried out a genome-wide scan of signatures of positive selection using the 1000 Genomes Phase I dataset, in order to identify pigmentation genes showing putative signatures of selective sweeps in East Asia. We applied a broad range of methods to detect signatures of selection including: 1) Tests designed to identify deviations of the Site Frequency Spectrum (SFS) from neutral expectations (Tajima’s D, Fay and Wu’s H and Fu and Li’s D* and F*), 2) Tests focused on the identification of high-frequency haplotypes with extended linkage disequilibrium (iHS and Rsb) and 3) Tests based on genetic differentiation between populations (LSBL). Based on the results obtained from a genome wide analysis of 25 kb windows, we constructed an empirical distribution for each statistic across all windows, and identified pigmentation genes that are outliers in the distribution. Results Our tests identified twenty genes that are relevant for pigmentation biology. Of these, eight genes (ATRN, EDAR, KLHL7, MITF, OCA2, TH, TMEM33 and TRPM1,) were extreme outliers (top 0.1% of the empirical distribution) for at least one statistic, and twelve genes (ADAM17, BNC2, CTSD, DCT, EGFR, LYST, MC1R, MLPH, OPRM1, PDIA6, PMEL (SILV) and TYRP1) were in the top 1% of the empirical distribution for at least one statistic. Additionally, eight of these genes (BNC2, EGFR, LYST, MC1R, OCA2, OPRM1, PMEL (SILV) and TYRP1) have been associated with pigmentary traits in association studies. Conclusions We identified a number of putative pigmentation genes showing extremely unusual patterns of genetic variation in East Asia. Most of these genes are outliers for different tests and/or different populations, and have already been described in previous scans for positive selection, providing strong support to the hypothesis that recent selective sweeps left a signature in these regions. However, it will be necessary to carry out association and functional studies to demonstrate the implication of these genes in normal pigmentation variation.
Collapse
Affiliation(s)
- Jessica L Hider
- Department of Anthropology, University of Toronto at Mississauga, Mississauga, Ontario, Canada
| | | | | | | | | | | | | |
Collapse
|
246
|
Abstract
An enduring goal of evolutionary biology is to understand how natural selection has shaped patterns of polymorphism and divergence within and between species and to map the genetic basis of adaptations. The rapid maturation of next-generation sequencing technology has generated a deluge of genomics data from nonhuman primates, extinct hominins, and diverse human populations. These emerging genome data sets have simultaneously broadened our understanding of human evolution and sharply defined existing gaps in knowledge about the mechanistic basis of evolutionary change. In this review, we summarize recent insights into how natural selection has influenced the human genome across different timescales. Although the path to a more comprehensive understanding of selection and adaptation in humans remains arduous, some general insights are beginning to emerge, such as the importance of adaptive regulatory evolution, the absence of pervasive classic selective sweeps, and the potential roles that selection from standing variation and polygenic adaptation have likely played in recent human evolutionary history.
Collapse
Affiliation(s)
- Wenqing Fu
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195-5065;
| | | |
Collapse
|
247
|
Kohl JV. Nutrient-dependent/pheromone-controlled adaptive evolution: a model. SOCIOAFFECTIVE NEUROSCIENCE & PSYCHOLOGY 2013; 3:20553. [PMID: 24693353 PMCID: PMC3960065 DOI: 10.3402/snp.v3i0.20553] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2013] [Revised: 04/13/2013] [Accepted: 05/13/2013] [Indexed: 12/19/2022]
Abstract
BACKGROUND The prenatal migration of gonadotropin-releasing hormone (GnRH) neurosecretory neurons allows nutrients and human pheromones to alter GnRH pulsatility, which modulates the concurrent maturation of the neuroendocrine, reproductive, and central nervous systems, thus influencing the development of ingestive behavior, reproductive sexual behavior, and other behaviors. METHODS THIS MODEL DETAILS HOW CHEMICAL ECOLOGY DRIVES ADAPTIVE EVOLUTION VIA: (1) ecological niche construction, (2) social niche construction, (3) neurogenic niche construction, and (4) socio-cognitive niche construction. This model exemplifies the epigenetic effects of olfactory/pheromonal conditioning, which alters genetically predisposed, nutrient-dependent, hormone-driven mammalian behavior and choices for pheromones that control reproduction via their effects on luteinizing hormone (LH) and systems biology. RESULTS Nutrients are metabolized to pheromones that condition behavior in the same way that food odors condition behavior associated with food preferences. The epigenetic effects of olfactory/pheromonal input calibrate and standardize molecular mechanisms for genetically predisposed receptor-mediated changes in intracellular signaling and stochastic gene expression in GnRH neurosecretory neurons of brain tissue. For example, glucose and pheromones alter the hypothalamic secretion of GnRH and LH. A form of GnRH associated with sexual orientation in yeasts links control of the feedback loops and developmental processes required for nutrient acquisition, movement, reproduction, and the diversification of species from microbes to man. CONCLUSION An environmental drive evolved from that of nutrient ingestion in unicellular organisms to that of pheromone-controlled socialization in insects. In mammals, food odors and pheromones cause changes in hormones such as LH, which has developmental affects on pheromone-controlled sexual behavior in nutrient-dependent reproductively fit individuals across species of vertebrates.
Collapse
|
248
|
Functional significance of evolving protein sequence in dihydrofolate reductase from bacteria to humans. Proc Natl Acad Sci U S A 2013; 110:10159-64. [PMID: 23733948 DOI: 10.1073/pnas.1307130110] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
With the rapidly growing wealth of genomic data, experimental inquiries on the functional significance of important divergence sites in protein evolution are becoming more accessible. Here we trace the evolution of dihydrofolate reductase (DHFR) and identify multiple key divergence sites among 233 species between humans and bacteria. We connect these sites, experimentally and computationally, to changes in the enzyme's binding properties and catalytic efficiency. One of the identified evolutionarily important sites is the N23PP modification (∼mid-Devonian, 415-385 Mya), which alters the conformational states of the active site loop in Escherichia coli dihydrofolate reductase and negatively impacts catalysis. This enzyme activity was restored with the inclusion of an evolutionarily significant lid domain (G51PEKN in E. coli enzyme; ∼2.4 Gya). Guided by this evolutionary genomic analysis, we generated a human-like E. coli dihydrofolate reductase variant through three simple mutations despite only 26% sequence identity between native human and E. coli DHFRs. Molecular dynamics simulations indicate that the overall conformational motions of the protein within a common scaffold are retained throughout evolution, although subtle changes to the equilibrium conformational sampling altered the free energy barrier of the enzymatic reaction in some cases. The data presented here provide a glimpse into the evolutionary trajectory of functional DHFR through its protein sequence space that lead to the diverged binding and catalytic properties of the E. coli and human enzymes.
Collapse
|
249
|
Gibson IB, Jiang R, Yu F. The 1000 Genomes Project: paving the way for personalized genomic medicine. Per Med 2013; 10:321-324. [PMID: 29783422 DOI: 10.2217/pme.13.22] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Affiliation(s)
- Ian B Gibson
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA and Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Rongcai Jiang
- Department of Neurosurgery, Tianjin Neurological Institute, Tianjin Medical University General Hospital, Tianjin 300052, People's Republic of China
| | - Fuli Yu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA and Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
| |
Collapse
|
250
|
Muers M. Tracking down human adaptations. Nat Rev Genet 2013. [DOI: 10.1038/nrg3466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|