1
|
Burt CH. Polygenic Indices (a.k.a. Polygenic Scores) in Social Science: A Guide for Interpretation and Evaluation. SOCIOLOGICAL METHODOLOGY 2024; 54:300-350. [PMID: 39091537 PMCID: PMC11293310 DOI: 10.1177/00811750241236482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Polygenic indices (PGI)-the new recommended label for polygenic scores (PGS) in social science-are genetic summary scales often used to represent an individual's liability for a disease, trait, or behavior based on the additive effects of measured genetic variants. Enthusiasm for linking genetic data with social outcomes and the inclusion of premade PGIs in social science datasets have facilitated increased uptake of PGIs in social science research-a trend that will likely continue. Yet, most social scientists lack the expertise to interpret and evaluate PGIs in social science research. Here, we provide a primer on PGIs for social scientists focusing on key concepts, unique statistical genetic considerations, and best practices in calculation, estimation, reporting, and interpretation. We summarize our recommended best practices as a checklist to aid social scientists in evaluating and interpreting studies with PGIs. We conclude by discussing the similarities between PGIs and standard social science scales and unique interpretative considerations.
Collapse
|
2
|
Lei C, Liu J, Zhang R, Pan Y, Lu Y, Gao Y, Ma X, Yang Y, Guan Y, Mamatyusupu D, Xu S. Ancestral Origins and Admixture History of Kazakhs. Mol Biol Evol 2024; 41:msae144. [PMID: 38995236 PMCID: PMC11272102 DOI: 10.1093/molbev/msae144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 04/29/2024] [Accepted: 07/02/2024] [Indexed: 07/13/2024] Open
Abstract
Kazakh people, like many other populations that settled in Central Asia, demonstrate an array of mixed anthropological features of East Eurasian (EEA) and West Eurasian (WEA) populations, indicating a possible scenario of biological admixture between already differentiated EEA and WEA populations. However, their complex biological origin, genomic makeup, and genetic interaction with surrounding populations are not well understood. To decipher their genetic structure and population history, we conducted, to our knowledge, the first whole-genome sequencing study of Kazakhs residing in Xinjiang (KZK). We demonstrated that KZK derived their ancestries from 4 ancestral source populations: East Asian (∼39.7%), West Asian (∼28.6%), Siberian (∼23.6%), and South Asian (∼8.1%). The recognizable interactions of EEA and WEA ancestries in Kazakhs were dated back to the 15th century BCE. Kazakhs were genetically distinctive from the Uyghurs in terms of their overall genomic makeup, although the 2 populations were closely related in genetics, and both showed a substantial admixture of western and eastern peoples. Notably, we identified a considerable sex-biased admixture, with an excess of western males and eastern females contributing to the KZK gene pool. We further identified a set of genes that showed remarkable differentiation in KZK from the surrounding populations, including those associated with skin color (SLC24A5, OCA2), essential hypertension (HLA-DQB1), hypertension (MTHFR, SLC35F3), and neuron development (CNTNAP2). These results advance our understanding of the complex history of contacts between Western and Eastern Eurasians, especially those living or along the old Silk Road.
Collapse
Affiliation(s)
- Chang Lei
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Jiaojiao Liu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Rui Zhang
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yuwen Pan
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yan Lu
- Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai 201203, China
| | - Yang Gao
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Xixian Ma
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yajun Yang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
| | - Yaqun Guan
- Department of Biochemistry and Molecular Biology, Preclinical Medicine College, Xinjiang Medical University, Urumqi 830011, China
| | - Dolikun Mamatyusupu
- College of the Life Sciences and Technology, Xinjiang University, Urumqi 830046, China
| | - Shuhua Xu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Center for Evolutionary Biology, School of Life Sciences, Department of Liver Surgery and Transplantation, Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai 200032, China
- Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai 201203, China
| |
Collapse
|
3
|
Matheson J, Masel J. Background Selection From Unlinked Sites Causes Nonindependent Evolution of Deleterious Mutations. Genome Biol Evol 2024; 16:evae050. [PMID: 38482769 PMCID: PMC10972689 DOI: 10.1093/gbe/evae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/11/2024] [Indexed: 04/01/2024] Open
Abstract
Background selection describes the reduction in neutral diversity caused by selection against deleterious alleles at other loci. It is typically assumed that the purging of deleterious alleles affects linked neutral variants, and indeed simulations typically only treat a genomic window. However, background selection at unlinked loci also depresses neutral diversity. In agreement with previous analytical approximations, in our simulations of a human-like genome with a realistically high genome-wide deleterious mutation rate, the effects of unlinked background selection exceed those of linked background selection. Background selection reduces neutral genetic diversity by a factor that is independent of census population size. Outside of genic regions, the strength of background selection increases with the mean selection coefficient, contradicting the linked theory but in agreement with the unlinked theory. Neutral diversity within genic regions is fairly independent of the strength of selection. Deleterious genetic load among haploid individuals is underdispersed, indicating nonindependent evolution of deleterious mutations. Empirical evidence for underdispersion was previously interpreted as evidence for global epistasis, but we recover it from a non-epistatic model.
Collapse
Affiliation(s)
- Joseph Matheson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
- Department of Ecology, Behavior, and Evolution, University of California San Diego, San Diego, CA 92093, USA
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
4
|
Fournier R, Tsangalidou Z, Reich D, Palamara PF. Haplotype-based inference of recent effective population size in modern and ancient DNA samples. Nat Commun 2023; 14:7945. [PMID: 38040695 PMCID: PMC10692198 DOI: 10.1038/s41467-023-43522-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Accepted: 11/10/2023] [Indexed: 12/03/2023] Open
Abstract
Individuals sharing recent ancestors are likely to co-inherit large identical-by-descent (IBD) genomic regions. The distribution of these IBD segments in a population may be used to reconstruct past demographic events such as effective population size variation, but accurate IBD detection is difficult in ancient DNA data and in underrepresented populations with limited reference data. In this work, we introduce an accurate method for inferring effective population size variation during the past ~2000 years in both modern and ancient DNA data, called HapNe. HapNe infers recent population size fluctuations using either IBD sharing (HapNe-IBD) or linkage disequilibrium (HapNe-LD), which does not require phasing and can be computed in low coverage data, including data sets with heterogeneous sampling times. HapNe shows improved accuracy in a range of simulated demographic scenarios compared to currently available methods for IBD-based and LD-based inference of recent effective population size, while requiring fewer computational resources. We apply HapNe to several modern populations from the 1,000 Genomes Project, the UK Biobank, the Allen Ancient DNA Resource, and recently published samples from Iron Age Britain, detecting multiple instances of recent effective population size variation across these groups.
Collapse
Affiliation(s)
| | | | - David Reich
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Pier Francesco Palamara
- Department of Statistics, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
| |
Collapse
|
5
|
Astore C, Sharma S, Nagpal S, Cutler DJ, Rioux JD, Cho JH, McGovern DPB, Brant SR, Kugathasan S, Jordan IK, Gibson G. The role of admixture in the rare variant contribution to inflammatory bowel disease. Genome Med 2023; 15:97. [PMID: 37968638 PMCID: PMC10647102 DOI: 10.1186/s13073-023-01244-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 10/10/2023] [Indexed: 11/17/2023] Open
Abstract
BACKGROUND Identification of rare variants involved in complex, polygenic diseases like Crohn's disease (CD) has accelerated with the introduction of whole exome/genome sequencing association studies. Rare variants can be used in both diagnostic and therapeutic assessments; however, since they are likely to be restricted to specific ancestry groups, their contributions to risk assessment need to be evaluated outside the discovery population. Prior studies implied that the three known rare variants in NOD2 are absent in West African and Asian populations and only contribute in African Americans via admixture. METHODS Whole genome sequencing (WGS) data from 3418 African American individuals, 1774 inflammatory bowel disease (IBD) cases, and 1644 controls were used to assess odds ratios and allele frequencies (AF), as well as haplotype-specific ancestral origins of European-derived CD variants discovered in a large exome-wide association study. Local and global ancestry was performed to assess the contribution of admixture to IBD contrasting European and African American cohorts. RESULTS Twenty-five rare variants associated with CD in European discovery cohorts are typically five-fold lower frequency in African Americans. Correspondingly, where comparisons could be made, the rare variants were found to have a predicted four-fold reduced burden for IBD in African Americans, when compared to European individuals. Almost all of the rare CD European variants were found on European haplotypes in the African American cohort, implying that they contribute to disease risk in African Americans primarily due to recent admixture. In addition, proportion of European ancestry correlates the number of rare CD European variants each African American individual carry, as well as their polygenic risk of disease. Similar findings were observed for 23 mutations affecting 10 other common complex diseases for which the rare variants were discovered in European cohorts. CONCLUSIONS European-derived Crohn's disease rare variants are even more rare in African Americans and contribute to disease risk mainly due to admixture, which needs to be accounted for when performing cross-ancestry genetic assessments.
Collapse
Affiliation(s)
- Courtney Astore
- Center for Integrative Genomics and School of Biological Sciences, Georgia Institute of Technology, Krone EBB1 Building, 950 Atlantic Drive, Atlanta, GA, 30332, USA
| | - Shivam Sharma
- Center for Integrative Genomics and School of Biological Sciences, Georgia Institute of Technology, Krone EBB1 Building, 950 Atlantic Drive, Atlanta, GA, 30332, USA
| | - Sini Nagpal
- Center for Integrative Genomics and School of Biological Sciences, Georgia Institute of Technology, Krone EBB1 Building, 950 Atlantic Drive, Atlanta, GA, 30332, USA
| | - David J Cutler
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, 30322, USA
| | - John D Rioux
- Department of Medicine, Université de Montréal and the Montreal Heart Institute Research Center, Montreal, QC, H1Y3N1, Canada
| | - Judy H Cho
- Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Dermot P B McGovern
- Department of Medicine, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, 08901, USA
- Department of Genetics and the Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ, 08554, USA
- Meyerhoff Inflammatory Bowel Disease Center, Johns Hopkins University School of Medicine, Baltimore, 21287, USA
| | - Steven R Brant
- Immunology Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA
| | - Subra Kugathasan
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, 30322, USA
- Department of Pediatrics, Emory University School of Medicine, and Children's Healthcare of Atlanta, Atlanta, GA, 30322, USA
| | - I King Jordan
- Center for Integrative Genomics and School of Biological Sciences, Georgia Institute of Technology, Krone EBB1 Building, 950 Atlantic Drive, Atlanta, GA, 30332, USA
| | - Greg Gibson
- Center for Integrative Genomics and School of Biological Sciences, Georgia Institute of Technology, Krone EBB1 Building, 950 Atlantic Drive, Atlanta, GA, 30332, USA.
| |
Collapse
|
6
|
Williams AE, Esezobor CI, Lane BM, Gbadegesin RA. Hiding in plain sight: genetics of childhood steroid-resistant nephrotic syndrome in Sub-Saharan Africa. Pediatr Nephrol 2023; 38:2003-2012. [PMID: 36459247 PMCID: PMC10416081 DOI: 10.1007/s00467-022-05831-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 11/14/2022] [Accepted: 11/14/2022] [Indexed: 12/03/2022]
Abstract
Steroid-resistant nephrotic syndrome (SRNS) is the most severe form of childhood nephrotic syndrome with an increased risk of progression to chronic kidney disease stage 5. Research endeavors to date have identified more than 80 genes that are associated with SRNS. Most of these genes regulate the structure and function of the podocyte, the visceral epithelial cells of the glomerulus. Although individuals of African ancestry have the highest prevalence of SRNS, especially those from Sub-Saharan Africa (SSA), with rates as high as 30-40% of all cases of nephrotic syndrome, studies focusing on the characterization and understanding of the genetic basis of SRNS in the region are negligible compared with Europe and North America. Therefore, it remains unclear if some of the variants in SRNS genes that are deemed pathogenic for SRNS are truly disease causing, and if the leading causes of monogenic nephrotic syndrome in other populations are the same for children in SSA with SRNS. Other implications of this lack of genetic data for SRNS in the region include the exclusion of children from the region from clinical trials aimed at identifying potential novel therapeutic agents for this severe form of nephrotic syndrome. This review underlines a need for concerted efforts to advance the genetic basis of SRNS in children in SSA. Such endeavors will complement global efforts at understanding the genetic basis of nephrotic syndrome.
Collapse
Affiliation(s)
- Anna Elizabeth Williams
- Department of Pediatrics, Division of Nephrology, Duke University Medical Center, Durham, NC, 27710, USA
| | - Christopher I Esezobor
- Department of Pediatrics, Faculty of Clinical Sciences, College of Medicine, University of Lagos, Lagos, Nigeria
| | - Brandon M Lane
- Department of Pediatrics, Division of Nephrology, Duke University Medical Center, Durham, NC, 27710, USA
- Duke Molecular Physiology Institute, Duke University, Durham, NC, USA
| | - Rasheed A Gbadegesin
- Department of Pediatrics, Division of Nephrology, Duke University Medical Center, Durham, NC, 27710, USA.
- Duke Molecular Physiology Institute, Duke University, Durham, NC, USA.
| |
Collapse
|
7
|
Yang Z, Chen H, Lu Y, Gao Y, Sun H, Wang J, Jin L, Chu J, Xu S. Genetic evidence of tri-genealogy hypothesis on the origin of ethnic minorities in Yunnan. BMC Biol 2022; 20:166. [PMID: 35864541 PMCID: PMC9306206 DOI: 10.1186/s12915-022-01367-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 07/05/2022] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND Yunnan is located in Southwest China and consists of great cultural, linguistic, and genetic diversity. However, the genomic diversity of ethnic minorities in Yunnan is largely under-investigated. To gain insights into population history and local adaptation of Yunnan minorities, we analyzed 242 whole-exome sequencing data with high coverage (~ 100-150 ×) of Yunnan minorities representing Achang, Jingpo, Dai, and Deang, who were linguistically assumed to be derived from three ancient lineages (the tri-genealogy hypothesis), i.e., Di-Qiang, Bai-Yue, and Bai-Pu. RESULTS Yunnan minorities show considerable genetic differences. Di-Qiang populations likely migrated from the Tibetan area about 6700 years ago. Genetic divergence between Bai-Yue and Di-Qiang was estimated to be 7000 years, and that between Bai-Yue and Bai-Pu was estimated to be 5500 years. Bai-Pu is relatively isolated, but gene flow from surrounding Di-Qiang and Bai-Yue populations was also found. Furthermore, we identified genetic variants that are differentiated within Yunnan minorities possibly due to the living circumstances and habits. Notably, we found that adaptive variants related to malaria and glucose metabolism suggest the adaptation to thalassemia and G6PD deficiency resulting from malaria resistance in the Dai population. CONCLUSIONS We provided genetic evidence of the tri-genealogy hypothesis as well as new insights into the genetic history and local adaptation of the Yunnan minorities.
Collapse
Affiliation(s)
- Zhaoqing Yang
- Department of Medical Genetics, Institute of Medical Biology, Chinese Academy of Medical Sciences, Kunming, 650118, China
| | - Hao Chen
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Yan Lu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Yang Gao
- Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai, 201203, China
| | - Hao Sun
- Department of Medical Genetics, Institute of Medical Biology, Chinese Academy of Medical Sciences, Kunming, 650118, China
| | - Jiucun Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai, 200438, China
- Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai, 201203, China
| | - Li Jin
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai, 200438, China
- Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai, 201203, China
| | - Jiayou Chu
- Department of Medical Genetics, Institute of Medical Biology, Chinese Academy of Medical Sciences, Kunming, 650118, China.
| | - Shuhua Xu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, Center for Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai, 200438, China.
- Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and Ministry of Education Key Laboratory of Contemporary Anthropology, Fudan University, Shanghai, 201203, China.
- Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai, 200032, China.
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
| |
Collapse
|
8
|
Harwood MP, Alves I, Edgington H, Agbessi M, Bruat V, Soave D, Lamaze FC, Favé MJ, Awadalla P. Recombination affects allele-specific expression of deleterious variants in human populations. SCIENCE ADVANCES 2022; 8:eabl3819. [PMID: 35559670 PMCID: PMC9106294 DOI: 10.1126/sciadv.abl3819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 03/29/2022] [Indexed: 06/15/2023]
Abstract
How the genetic composition of a population changes through stochastic processes, such as genetic drift, in combination with deterministic processes, such as selection, is critical to understanding how phenotypes vary in space and time. Here, we show how evolutionary forces affecting selection, including recombination and effective population size, drive genomic patterns of allele-specific expression (ASE). Integrating tissue-specific genotypic and transcriptomic data from 1500 individuals from two different cohorts, we demonstrate that ASE is less often observed in regions of low recombination, and loci in high or normal recombination regions are more efficient at using ASE to underexpress harmful mutations. By tracking genetic ancestry, we discriminate between ASE variability due to past demographic effects, including subsequent bottlenecks, versus local environment. We observe that ASE is not randomly distributed along the genome and that population parameters influencing the efficacy of natural selection alter ASE levels genome wide.
Collapse
Affiliation(s)
- Michelle P. Harwood
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Isabel Alves
- Université de Nantes, CHU Nantes, CNRS, INSERM, L’Institut du thorax, F-44000 Nantes, France
| | | | | | - Vanessa Bruat
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - David Soave
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Mathematics, Wilfrid Laurier University, Waterloo, ON, Canada
| | - Fabien C. Lamaze
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Institut universitaire de cardiologie et de pneumologie de Québec, Université Laval, Québec, QC, Canada
| | | | - Philip Awadalla
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
9
|
Arciero E, Dogra SA, Malawsky DS, Mezzavilla M, Tsismentzoglou T, Huang QQ, Hunt KA, Mason D, Sharif SM, van Heel DA, Sheridan E, Wright J, Small N, Carmi S, Iles MM, Martin HC. Fine-scale population structure and demographic history of British Pakistanis. Nat Commun 2021; 12:7189. [PMID: 34893604 PMCID: PMC8664933 DOI: 10.1038/s41467-021-27394-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Accepted: 11/09/2021] [Indexed: 02/08/2023] Open
Abstract
Previous genetic and public health research in the Pakistani population has focused on the role of consanguinity in increasing recessive disease risk, but little is known about its recent population history or the effects of endogamy. Here, we investigate fine-scale population structure, history and consanguinity patterns using genotype chip data from 2,200 British Pakistanis. We reveal strong recent population structure driven by the biraderi social stratification system. We find that all subgroups have had low recent effective population sizes (Ne), with some showing a decrease 15‒20 generations ago that has resulted in extensive identity-by-descent sharing and homozygosity, increasing the risk of recessive disorders. Our results from two orthogonal methods (one using machine learning and the other coalescent-based) suggest that the detailed reporting of parental relatedness for mothers in the cohort under-represents the true levels of consanguinity. These results demonstrate the impact of cultural practices on population structure and genomic diversity in Pakistanis, and have important implications for medical genetic studies.
Collapse
Affiliation(s)
- Elena Arciero
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
| | - Sufyan A Dogra
- Bradford Institute for Health Research, Bradford Teaching Hospitals NHS Foundation Trust, Bradford, UK
| | | | | | - Theofanis Tsismentzoglou
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Leeds Institute of Medical Research, University of Leeds, Leeds, UK
| | - Qin Qin Huang
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Karen A Hunt
- Blizard Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Dan Mason
- Bradford Institute for Health Research, Bradford Teaching Hospitals NHS Foundation Trust, Bradford, UK
| | - Saghira Malik Sharif
- Yorkshire Regional Genetics Service, Leeds Teaching Hospitals NHS Trust, Leeds, UK
| | - David A van Heel
- Blizard Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Eamonn Sheridan
- Leeds Institute of Medical Research, University of Leeds, Leeds, UK
| | - John Wright
- Bradford Institute for Health Research, Bradford Teaching Hospitals NHS Foundation Trust, Bradford, UK
| | - Neil Small
- Faculty of Health Studies, University of Bradford, Richmond Road, Bradford, UK
| | - Shai Carmi
- Braun School of Public Health and Community Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Mark M Iles
- Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
- Leeds Institute of Medical Research, University of Leeds, Leeds, UK
| | - Hilary C Martin
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
| |
Collapse
|
10
|
Garcia-Pelegrin E, Wilkins C, Clayton NS. The Ape That Lived to Tell the Tale. The Evolution of the Art of Storytelling and Its Relationship to Mental Time Travel and Theory of Mind. Front Psychol 2021; 12:755783. [PMID: 34744932 PMCID: PMC8569916 DOI: 10.3389/fpsyg.2021.755783] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 09/20/2021] [Indexed: 11/13/2022] Open
Abstract
Engaging in the art of creating and telling stories is a defining behaviour of humankind. Humans have been sharing stories with each other, with and without words, since the dawn of recorded history, but the cognitive foundations of the behaviour can be traced deeper into our past. The emergence of stories can be strongly linked to Mental Time Travel (the ability to recall the past and imagine the future) and plays a key role in our ability to communicate past, present and future scenarios with other individuals, within and beyond our lifetimes. Stories are products engraved within the concept of time, constructed to elucidate the past experiences of the self, but designed with the future in mind, thus imparting lessons of such experiences to the receiver. By being privy to the experiences of others, humans can imagine themselves in a similar position to the protagonist of the story, thus mentally learning from an experience they might have never encountered other than in the mind's eye. Evolutionary Psychology investigates how the engagement in artistic endeavours by our ancestors in the Pleistocene granted them an advantage when confronted with obstacles that challenged their survival or reproductive fitness and questions whether art is an adaptation of the human mind or a spandrel of other cognitive adaptations. However, little attention has been placed on the cognitive abilities that might have been imperative for the development of art. Here, we examine the relationship between art, storytelling, Mental Time Travel and Theory of Mind (i.e., the ability to attribute mental states to others). We suggest that Mental Time Travel played a key role in the development of storytelling because through stories, humans can fundamentally transcend their present condition, by being able to imagine different times, separate realities, and place themselves and others anywhere within the time space continuum. We argue that the development of a Theory of Mind also sparked storytelling practises in humans as a method of diffusing the past experiences of the self to others whilst enabling the receiver to dissociate between the past experiences of others and their own, and to understand them as lessons for a possible future. We propose that when artistic products rely on storytelling in form and function, they ought to be considered separate from other forms of art whose appreciation capitalise on our aesthetic preferences.
Collapse
Affiliation(s)
| | - Clive Wilkins
- Department of Psychology, University of Cambridge, Cambridge, United Kingdom
| | - Nicola S Clayton
- Department of Psychology, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
11
|
The genomic landscape of Mexican Indigenous populations brings insights into the peopling of the Americas. Nat Commun 2021; 12:5942. [PMID: 34642312 PMCID: PMC8511047 DOI: 10.1038/s41467-021-26188-w] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Accepted: 09/22/2021] [Indexed: 12/30/2022] Open
Abstract
The genetic makeup of Indigenous populations inhabiting Mexico has been strongly influenced by geography and demographic history. Here, we perform a genome-wide analysis of 716 newly genotyped individuals from 60 of the 68 recognized ethnic groups in Mexico. We show that the genetic structure of these populations is strongly influenced by geography, and our demographic reconstructions suggest a decline in the population size of all tested populations in the last 15-30 generations. We find evidence that Aridoamerican and Mesoamerican populations diverged roughly 4-9.9 ka, around the time when sedentary farming started in Mesoamerica. Comparisons with ancient genomes indicate that the Upward Sun River 1 (USR1) individual is an outgroup to Mexican/South American Indigenous populations, whereas Anzick-1 was more closely related to Mesoamerican/South American populations than to those from Aridoamerica, showing an even more complex history of divergence than recognized so far.
Collapse
|
12
|
Holte SJ, Ohmann PR. Childhood in social learning models with changing environments: Implications for human evolution. Biosystems 2021; 210:104555. [PMID: 34601073 DOI: 10.1016/j.biosystems.2021.104555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 08/03/2021] [Accepted: 09/28/2021] [Indexed: 10/20/2022]
Abstract
Childhood is a time of learning, both individually and through social interactions. But the vulnerability inherent in childhood represents a fitness cost: individuals do not have progeny during childhood, and childhood is a net drain of familial or group resources. In this study we model childhood in resource acquisition scenarios where social learning through observation competes with learning through innovation across multiple generations in a variety of environments. In general, observing others allows useful knowledge to be gained more efficiently than self-exploration and may result in significantly greater resource acquisition. However, social learning needing a lengthy childhood to develop advanced cognitive ability may offset the net fitness advantage that might otherwise be gained. Through simulations we demonstrate that individuals with a substantial childhood burden acquire more lifetime resources by observing others than do individuals with negligible childhood costs using self-exploration, as long as the environment is fairly stable. This advantage decreases as the environment becomes less predictable, and reverses in rapidly changing environments where knowledge is unreliable. These results suggest that hominid evolution, with vastly growing cognitive abilities and a longer, more vulnerable childhood, may have been facilitated in similarly stable environments. On the other hand, hominid populations may have been particularly vulnerable to environmental instability. We apply this insight to the Out of Africa Homo sapiens migration roughly 50,000 years ago and show consistency with the serial founder model that best fits current archeological and genetic evidence. Our findings are important for models of social learning, especially those that describe the emergence and spread of Homo sapiens.
Collapse
Affiliation(s)
- Serena J Holte
- Department of Physics, University of Saint Thomas, St. Paul MN, 55105, USA
| | - Paul R Ohmann
- Department of Physics, University of Saint Thomas, St. Paul MN, 55105, USA.
| |
Collapse
|
13
|
Garcia JA, Lohmueller KE. Negative linkage disequilibrium between amino acid changing variants reveals interference among deleterious mutations in the human genome. PLoS Genet 2021; 17:e1009676. [PMID: 34319975 PMCID: PMC8351996 DOI: 10.1371/journal.pgen.1009676] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 08/09/2021] [Accepted: 06/22/2021] [Indexed: 11/18/2022] Open
Abstract
Evolutionary forces like Hill-Robertson interference and negative epistasis can lead to deleterious mutations being found on distinct haplotypes. However, the extent to which these forces depend on the selection and dominance coefficients of deleterious mutations and shape genome-wide patterns of linkage disequilibrium (LD) in natural populations with complex demographic histories has not been tested. In this study, we first used forward-in-time simulations to predict how negative selection impacts LD. Under models where deleterious mutations have additive effects on fitness, deleterious variants less than 10 kb apart tend to be carried on different haplotypes relative to pairs of synonymous SNPs. In contrast, for recessive mutations, there is no consistent ordering of how selection coefficients affect LD decay, due to the complex interplay of different evolutionary effects. We then examined empirical data of modern humans from the 1000 Genomes Project. LD between derived alleles at nonsynonymous SNPs is lower compared to pairs of derived synonymous variants, suggesting that nonsynonymous derived alleles tend to occur on different haplotypes more than synonymous variants. This result holds when controlling for potential confounding factors by matching SNPs for frequency in the sample (allele count), physical distance, magnitude of background selection, and genetic distance between pairs of variants. Lastly, we introduce a new statistic HR(j) which allows us to detect interference using unphased genotypes. Application of this approach to high-coverage human genome sequences confirms our finding that nonsynonymous derived alleles tend to be located on different haplotypes more often than are synonymous derived alleles. Our findings suggest that interference may play a pervasive role in shaping patterns of LD between deleterious variants in the human genome, and consequently influences genome-wide patterns of LD.
Collapse
Affiliation(s)
- Jesse A. Garcia
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, California, United States of America
| | - Kirk E. Lohmueller
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, California, United States of America
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California, United States of America
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California, United States of America
| |
Collapse
|
14
|
Sandler G, Wright SI, Agrawal AF. Patterns and Causes of Signed Linkage Disequilibria in Flies and Plants. Mol Biol Evol 2021; 38:4310-4321. [PMID: 34097067 PMCID: PMC8476167 DOI: 10.1093/molbev/msab169] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Most empirical studies of linkage disequilibrium (LD) study its magnitude, ignoring its sign. Here, we examine patterns of signed LD in two population genomic data sets, one from Capsella grandiflora and one from Drosophila melanogaster. We consider how processes such as drift, admixture, Hill–Robertson interference, and epistasis may contribute to these patterns. We report that most types of mutations exhibit positive LD, particularly, if they are predicted to be less deleterious. We show with simulations that this pattern arises easily in a model of admixture or distance-biased mating, and that genome-wide differences across site types are generally expected due to differences in the strength of purifying selection even in the absence of epistasis. We further explore how signed LD decays on a finer scale, showing that loss of function mutations exhibit particularly positive LD across short distances, a pattern consistent with intragenic antagonistic epistasis. Controlling for genomic distance, signed LD in C. grandiflora decays faster within genes, compared with between genes, likely a by-product of frequent recombination in gene promoters known to occur in plant genomes. Finally, we use information from published biological networks to explore whether there is evidence for negative synergistic epistasis between interacting radical missense mutations. In D. melanogaster networks, we find a modest but significant enrichment of negative LD, consistent with the possibility of intranetwork negative synergistic epistasis.
Collapse
Affiliation(s)
- George Sandler
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks Street, Toronto, ON M5S 3B2, Canada
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks Street, Toronto, ON M5S 3B2, Canada.,Center for Analysis of Genome Evolution and Function, University of Toronto, 25 Willcocks Street, Toronto, ON M5S 3B2, Canada
| | - Aneil F Agrawal
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks Street, Toronto, ON M5S 3B2, Canada.,Center for Analysis of Genome Evolution and Function, University of Toronto, 25 Willcocks Street, Toronto, ON M5S 3B2, Canada
| |
Collapse
|
15
|
Ma X, Yang W, Gao Y, Pan Y, Lu Y, Chen H, Lu D, Xu S. Genetic origins and sex-biased admixture of the Huis. Mol Biol Evol 2021; 38:3804-3819. [PMID: 34021754 PMCID: PMC8382924 DOI: 10.1093/molbev/msab158] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The Hui people are unique among Chinese ethnic minorities in that they speak the same language as Han Chinese (HAN) but practice Islam. However, as the second-largest minority group in China numbering well over 10 million, the Huis are under-represented in both global and regional genomic studies. Here, we present the first whole-genome sequencing effort of 234 Hui individuals (NXH) aged over 60 who have been living in Ningxia, where the Huis are mostly concentrated. NXH are genetically more similar to East Asian than to any other global populations. In particular, the genetic differentiation between NXH and HAN (FST = 0.0015) is only slightly larger than that between northern and southern HAN (FST = 0.0010), largely attributed to the western ancestry in NXH (∼10%). Highly differentiated functional variants between NXH and HAN were identified in genes associated with skin pigmentation (e.g., SLC24A5), facial morphology (e.g., EDAR), and lipid metabolism (e.g., ABCG8). The Huis are also distinct from other Muslim groups such as the Uyghurs (FST = 0.0187), especially, NXH derived much less western ancestry (∼10%) compared with the Uyghurs (∼50%). Modeling admixture history indicated that NXH experienced an episode of two-wave admixture. An ancient admixture occurred ∼1,025 years ago, reflecting the intensive west-east contacts during the late Tang Dynasty, and the Five Dynasties and Ten Kingdoms period. A recent admixture occurred ∼500 years ago, corresponding to the Ming Dynasty. Notably, we identified considerable sex-biased admixture, i.e., excess of western males and eastern females contributing to the NXH gene pool. The origins and the genomic diversity of the Hui people imply the complex history of contacts between western and eastern Eurasians.
Collapse
Affiliation(s)
- Xixian Ma
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Wenjun Yang
- Key Laboratory of Fertility Preservation and Maintenance, the General Hospital, Ningxia Medical University, Yinchuan, Ningxia 750004 China
| | - Yang Gao
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Yuwen Pan
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Yan Lu
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China.,State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Hao Chen
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Dongsheng Lu
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Shuhua Xu
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China.,State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200438, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China.,Henan Institute of Medical and Pharmaceutical Sciences, Zhengzhou University, Zhengzhou 450052, China.,Human Phenome Institute, Fudan University, Shanghai 201203, China
| |
Collapse
|
16
|
Elhaik E, Graur D. On the Unfounded Enthusiasm for Soft Selective Sweeps III: The Supervised Machine Learning Algorithm That Isn't. Genes (Basel) 2021; 12:genes12040527. [PMID: 33916341 PMCID: PMC8066263 DOI: 10.3390/genes12040527] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 03/22/2021] [Accepted: 03/29/2021] [Indexed: 12/12/2022] Open
Abstract
In the last 15 years or so, soft selective sweep mechanisms have been catapulted from a curiosity of little evolutionary importance to a ubiquitous mechanism claimed to explain most adaptive evolution and, in some cases, most evolution. This transformation was aided by a series of articles by Daniel Schrider and Andrew Kern. Within this series, a paper entitled “Soft sweeps are the dominant mode of adaptation in the human genome” (Schrider and Kern, Mol. Biol. Evolut. 2017, 34(8), 1863–1877) attracted a great deal of attention, in particular in conjunction with another paper (Kern and Hahn, Mol. Biol. Evolut. 2018, 35(6), 1366–1371), for purporting to discredit the Neutral Theory of Molecular Evolution (Kimura 1968). Here, we address an alleged novelty in Schrider and Kern’s paper, i.e., the claim that their study involved an artificial intelligence technique called supervised machine learning (SML). SML is predicated upon the existence of a training dataset in which the correspondence between the input and output is known empirically to be true. Curiously, Schrider and Kern did not possess a training dataset of genomic segments known a priori to have evolved either neutrally or through soft or hard selective sweeps. Thus, their claim of using SML is thoroughly and utterly misleading. In the absence of legitimate training datasets, Schrider and Kern used: (1) simulations that employ many manipulatable variables and (2) a system of data cherry-picking rivaling the worst excesses in the literature. These two factors, in addition to the lack of negative controls and the irreproducibility of their results due to incomplete methodological detail, lead us to conclude that all evolutionary inferences derived from so-called SML algorithms (e.g., S/HIC) should be taken with a huge shovel of salt.
Collapse
Affiliation(s)
- Eran Elhaik
- Department of Biology, Lund University, Sölvegatan 35, 22362 Lund, Sweden
- Correspondence:
| | - Dan Graur
- Department of Biology & Biochemistry, University of Houston, Science & Research Building 2, Suite #342, 3455 Cullen Bldv., Houston, TX 77204-5001, USA;
| |
Collapse
|
17
|
Tagore D, Aghakhanian F, Naidu R, Phipps ME, Basu A. Insights into the demographic history of Asia from common ancestry and admixture in the genomic landscape of present-day Austroasiatic speakers. BMC Biol 2021; 19:61. [PMID: 33781248 PMCID: PMC8008685 DOI: 10.1186/s12915-021-00981-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Accepted: 02/12/2021] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND The demographic history of South and Southeast Asia (S&SEA) is complex and contentious, with multiple waves of human migration. Some of the earliest footfalls were of the ancestors of modern Austroasiatic (AA) language speakers. Understanding the history of the AA language family, comprising of over 150 languages and their speakers distributed across broad geographical region in isolated small populations of various sizes, can help shed light on the peopling of S&SEA. Here we investigated the genetic relatedness of two AA groups, their relationship with other ethno-linguistically distinct populations, and the relationship of these groups with ancient genomes of individuals living in S&SEA at different time periods, to infer about the demographic history of this region. RESULTS We analyzed 1451 extant genomes, 189 AAs from India and Malaysia, and 43 ancient genomes from S&SEA. Population structure analysis reveals neither language nor geography appropriately correlates with genetic diversity. The inconsistency between "language and genetics" or "geography and genetics" can largely be attributed to ancient admixture with East Asian populations. We estimated a pre-Neolithic origin of AA language speakers, with shared ancestry between Indian and Malaysian populations until about 470 generations ago, contesting the existing model of Neolithic expansion of the AA culture. We observed a spatio-temporal transition in the genetic ancestry of SEA with genetic contribution from East Asia significantly increasing in the post-Neolithic period. CONCLUSION Our study shows that contrary to assumptions in many previous studies and despite having linguistic commonality, Indian AAs have a distinct genomic structure compared to Malaysian AAs. This linguistic-genetic discordance is reflective of the complex history of population migration and admixture shaping the genomic landscape of S&SEA. We postulate that pre-Neolithic ancestors of today's AAs were widespread in S&SEA, and the fragmentation and dissipation of the population have largely been a resultant of multiple migrations of East Asian farmers during the Neolithic period. It also highlights the resilience of AAs in continuing to speak their language in spite of checkered population distribution and possible dominance from other linguistic groups.
Collapse
Affiliation(s)
- Debashree Tagore
- National Institute of Biomedical Genomics, Kalyani, 741251, India
| | - Farhang Aghakhanian
- Oklahoma Medical Research Foundation, Genes and Human Disease Program, 825 NE 13th Street, Oklahoma City, OK, 73104, USA
- Genomics Facility, School of Science, Monash University Malaysia, Jalan Lagoon Selatan, 47500, Bandar Sunway, Selangor, Malaysia
| | - Rakesh Naidu
- Jeffrey Cheah School of Medicine and Health Sciences, Monash University Malaysia, Jalan Lagoon Selatan, 47500, Bandar Sunway, Selangor Darul Ehsan, Malaysia
| | - Maude E Phipps
- Jeffrey Cheah School of Medicine and Health Sciences, Monash University Malaysia, Jalan Lagoon Selatan, 47500, Bandar Sunway, Selangor Darul Ehsan, Malaysia
| | - Analabha Basu
- National Institute of Biomedical Genomics, Kalyani, 741251, India.
| |
Collapse
|
18
|
Huang X, Wang S, Jin L, He Y. Dissecting dynamics and differences of selective pressures in the evolution of human pigmentation. Biol Open 2021; 10:bio056523. [PMID: 33495209 PMCID: PMC7888712 DOI: 10.1242/bio.056523] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Accepted: 12/21/2020] [Indexed: 01/05/2023] Open
Abstract
Human pigmentation is a highly diverse and complex trait among populations and has drawn particular attention from both academic and non-academic investigators for thousands of years. Previous studies detected selection signals in several human pigmentation genes, but few studies have integrated contribution from multiple genes to the evolution of human pigmentation. Moreover, none has quantified selective pressures on human pigmentation over epochs and between populations. Here, we dissect dynamics and differences of selective pressures during different periods and between distinct populations with new approaches. We use genotype data of 19 genes associated with human pigmentation from 17 publicly available datasets and obtain data for 2346 individuals of six representative population groups from across the world. Our results quantify the strength of natural selection on light pigmentation not only in modern Europeans (0.0259/generation) but also in proto-Eurasians (0.00650/generation). Our results also suggest that several derived alleles associated with human dark pigmentation may be under positive directional selection in some African populations. Our study provides the first attempt to quantitatively investigate the dynamics of selective pressures during different time periods in the evolution of human pigmentation.This article has an associated First Person interview with the first author of the article.
Collapse
Affiliation(s)
- Xin Huang
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Sijia Wang
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Li Jin
- Chinese Academy of Sciences Key Laboratory of Computational Biology, Chinese Academy of Sciences-Max Planck Society Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Yungang He
- Key Laboratory of Medical Epigenetics and Metabolism, Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| |
Collapse
|
19
|
Vizzari MT, Benazzo A, Barbujani G, Ghirotto S. A Revised Model of Anatomically Modern Human Expansions Out of Africa through a Machine Learning Approximate Bayesian Computation Approach. Genes (Basel) 2020; 11:E1510. [PMID: 33339234 PMCID: PMC7766041 DOI: 10.3390/genes11121510] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 12/11/2020] [Accepted: 12/14/2020] [Indexed: 01/25/2023] Open
Abstract
There is a wide consensus in considering Africa as the birthplace of anatomically modern humans (AMH), but the dispersal pattern and the main routes followed by our ancestors to colonize the world are still matters of debate. It is still an open question whether AMH left Africa through a single process, dispersing almost simultaneously over Asia and Europe, or in two main waves, first through the Arab Peninsula into southern Asia and Australo-Melanesia, and later through a northern route crossing the Levant. The development of new methodologies for inferring population history and the availability of worldwide high-coverage whole-genome sequences did not resolve this debate. In this work, we test the two main out-of-Africa hypotheses through an Approximate Bayesian Computation approach, based on the Random-Forest algorithm. We evaluated the ability of the method to discriminate between the alternative models of AMH out-of-Africa, using simulated data. Once assessed that the models are distinguishable, we compared simulated data with real genomic variation, from modern and archaic populations. This analysis showed that a model of multiple dispersals is four-fold as likely as the alternative single-dispersal model. According to our estimates, the two dispersal processes may be placed, respectively, around 74,000 and around 46,000 years ago.
Collapse
Affiliation(s)
| | | | | | - Silvia Ghirotto
- Department of Life Sciences and Biotechnology, University of Ferrara, 44121 Ferrara, Italy; (M.T.V.); (A.B.); (G.B.)
| |
Collapse
|
20
|
Inferring Effective Population Size and Divergence Time in the Lithuanian Population According to High-Density Genotyping Data. Genes (Basel) 2020; 11:genes11030293. [PMID: 32164396 PMCID: PMC7140885 DOI: 10.3390/genes11030293] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 02/28/2020] [Accepted: 03/05/2020] [Indexed: 11/17/2022] Open
Abstract
The prehistory of the Lithuanian population and genetic relationship to other populations are poorly studied. Thus, the Lithuanian population, as an object of study, is interesting due to its partial isolation with genetic distinctiveness within the European context and with preserved ancient genetic composition. The main objects of this study was to infer demographic parameters, effective population size (Ne), and divergence time using high-density single nucleotide polymorphism (SNP) genotyping data generated with the Illumina HumanOmmiExpress-12v1.1 array in 295 individuals from the Lithuanian population and to compare our data with other populations from the Human Genome Cell Line Diversity Panel (HGDP-CEPH). We also aimed to reconstruct past events between the main ethnolinguistic regions—Aukštaitija and Žemaitija of Lithuania. Historically, these regions probably developed as two independent Baltic tribes. Our results of Ne in the Lithuanian population through time demonstrated a substantial reduction of Ne over the 150,000–25,000 years before present (YBP). The estimated long-term Ne of the Lithuanian population is quite low—it equals 5404, which likely is a consequence of the bottlenecks associated with the last glacial period of 25,000–12,000 YBP in Europe. The obtained divergence time estimates between the study populations are in agreement with recent studies. The reconstructed past events in Aukštaitija and Žemaitija showed significant differences between these two regions of Lithuania.
Collapse
|
21
|
Hui TYJ, Burt A. Estimating linkage disequilibrium from genotypes under Hardy-Weinberg equilibrium. BMC Genet 2020; 21:21. [PMID: 32102657 PMCID: PMC7045472 DOI: 10.1186/s12863-020-0818-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2019] [Accepted: 01/29/2020] [Indexed: 11/10/2022] Open
Abstract
Background Measures of linkage disequilibrium (LD) play a key role in a wide range of applications from disease association to demographic history estimation. The true population LD cannot be measured directly and instead can only be inferred from genetic samples, which are unavoidably subject to measurement error. Previous studies of r2 (a measure of LD), such as the bias due to finite sample size and its variance, were based on the special case that the true population-wise LD is zero. These results generally do not hold for non-zero \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ {r}_{true}^2 $$\end{document}rtrue2 values, which are more common in real genetic data. Results This work generalises the estimation of r2 to all levels of LD, and for both phased and unphased data. First, we provide new formulae for the effect of finite sample size on the observed r2 values. Second, we find a new empirical formula for the variance of the observed r2, equals to 2E[r2](1 − E[r2])/n, where n is the diploid sample size. Third, we propose a new routine, Constrained ML, a likelihood-based method to directly estimate haplotype frequencies and r2 from diploid genotypes under Hardy-Weinberg Equilibrium. While serving the same purpose as the pre-existing Expectation-Maximisation algorithm, the new routine can have better convergence and is simpler to use. A new likelihood-ratio test is also introduced to test for the absence of a particular haplotype. Extensive simulations are run to support these findings. Conclusion Most inferences on LD will benefit from our new findings, from point and interval estimation to hypothesis testing. Genetic analyses utilising r2 information will become more accurate as a result.
Collapse
Affiliation(s)
- Tin-Yu J Hui
- Department of Life Sciences, Silwood Park Campus, Imperial College London, Ascot, Berkshire, SL5 7PY, UK.
| | - Austin Burt
- Department of Life Sciences, Silwood Park Campus, Imperial College London, Ascot, Berkshire, SL5 7PY, UK
| |
Collapse
|
22
|
Pilot M, Moura AE, Okhlopkov IM, Mamaev NV, Alagaili AN, Mohammed OB, Yavruyan EG, Manaseryan NH, Hayrapetyan V, Kopaliani N, Tsingarska E, Krofel M, Skoglund P, Bogdanowicz W. Global Phylogeographic and Admixture Patterns in Grey Wolves and Genetic Legacy of An Ancient Siberian Lineage. Sci Rep 2019; 9:17328. [PMID: 31757998 PMCID: PMC6874602 DOI: 10.1038/s41598-019-53492-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 11/01/2019] [Indexed: 12/12/2022] Open
Abstract
The evolutionary relationships between extinct and extant lineages provide important insight into species' response to environmental change. The grey wolf is among the few Holarctic large carnivores that survived the Late Pleistocene megafaunal extinctions, responding to that period's profound environmental changes with loss of distinct lineages and phylogeographic shifts, and undergoing domestication. We reconstructed global genome-wide phylogeographic patterns in modern wolves, including previously underrepresented Siberian wolves, and assessed their evolutionary relationships with a previously genotyped wolf from Taimyr, Siberia, dated at 35 Kya. The inferred phylogeographic structure was affected by admixture with dogs, coyotes and golden jackals, stressing the importance of accounting for this process in phylogeographic studies. The Taimyr lineage was distinct from modern Siberian wolves and constituted a sister lineage of modern Eurasian wolves and domestic dogs, with an ambiguous position relative to North American wolves. We detected gene flow from the Taimyr lineage to Arctic dog breeds, but population clustering methods indicated closer similarity of the Taimyr wolf to modern wolves than dogs, implying complex post-divergence relationships among these lineages. Our study shows that introgression from ecologically diverse con-specific and con-generic populations was common in wolves' evolutionary history, and could have facilitated their adaptation to environmental change.
Collapse
Affiliation(s)
- Małgorzata Pilot
- School of Life Sciences, University of Lincoln, Lincoln, United Kingdom
- Museum and Institute of Zoology, Polish Academy of Sciences, Warsaw, Poland
| | - Andre E Moura
- School of Life Sciences, University of Lincoln, Lincoln, United Kingdom
- Museum and Institute of Zoology, Polish Academy of Sciences, Warsaw, Poland
| | - Innokentiy M Okhlopkov
- Institute of Biological Problems of Cryolithozone, Siberian Branch of Russian Academy of Sciences, Yakutsk, Russia
| | - Nikolay V Mamaev
- Institute of Biological Problems of Cryolithozone, Siberian Branch of Russian Academy of Sciences, Yakutsk, Russia
| | - Abdulaziz N Alagaili
- KSU Mammals Research Chair, Department of Zoology, King Saud University, Riyadh, Saudi Arabia
| | - Osama B Mohammed
- KSU Mammals Research Chair, Department of Zoology, King Saud University, Riyadh, Saudi Arabia
| | - Eduard G Yavruyan
- Scientific Center of Zoology and Hydroecology, National Academy of Sciences, Yerevan, Armenia
| | - Ninna H Manaseryan
- Scientific Center of Zoology and Hydroecology, National Academy of Sciences, Yerevan, Armenia
| | | | - Natia Kopaliani
- Institute of Ecology, Ilia State University, Tbilisi, Georgia
| | | | - Miha Krofel
- Department of Forestry, Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | | | - Wiesław Bogdanowicz
- Museum and Institute of Zoology, Polish Academy of Sciences, Warsaw, Poland.
| |
Collapse
|
23
|
Linkage disequilibrium in the estimation of genetic and demographic parameters of extensively raised chicken populations. WORLD POULTRY SCI J 2019. [DOI: 10.1017/s0043933915002202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
24
|
Genomic evidence for shared common ancestry of East African hunting-gathering populations and insights into local adaptation. Proc Natl Acad Sci U S A 2019; 116:4166-4175. [PMID: 30782801 DOI: 10.1073/pnas.1817678116] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Anatomically modern humans arose in Africa ∼300,000 years ago, but the demographic and adaptive histories of African populations are not well-characterized. Here, we have generated a genome-wide dataset from 840 Africans, residing in western, eastern, southern, and northern Africa, belonging to 50 ethnicities, and speaking languages belonging to four language families. In addition to agriculturalists and pastoralists, our study includes 16 populations that practice, or until recently have practiced, a hunting-gathering (HG) lifestyle. We observe that genetic structure in Africa is broadly correlated not only with geography, but to a lesser extent, with linguistic affiliation and subsistence strategy. Four East African HG (EHG) populations that are geographically distant from each other show evidence of common ancestry: the Hadza and Sandawe in Tanzania, who speak languages with clicks classified as Khoisan; the Dahalo in Kenya, whose language has remnant clicks; and the Sabue in Ethiopia, who speak an unclassified language. Additionally, we observed common ancestry between central African rainforest HGs and southern African San, the latter of whom speak languages with clicks classified as Khoisan. With the exception of the EHG, central African rainforest HGs, and San, other HG groups in Africa appear genetically similar to neighboring agriculturalist or pastoralist populations. We additionally demonstrate that infectious disease, immune response, and diet have played important roles in the adaptive landscape of African history. However, while the broad biological processes involved in recent human adaptation in Africa are often consistent across populations, the specific loci affected by selective pressures more often vary across populations.
Collapse
|
25
|
Jemaa SB, Thamri N, Mnara S, Rebours E, Rocha D, Boussaha M. Linkage disequilibrium and past effective population size in native Tunisian cattle. Genet Mol Biol 2019; 42:52-61. [PMID: 30776288 PMCID: PMC6428135 DOI: 10.1590/1678-4685-gmb-2017-0342] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Accepted: 05/14/2018] [Indexed: 12/30/2022] Open
Abstract
To carry out effective genome-wide association studies, information about linkage disequilibrium (LD) is essential. Here, we used medium-density SNP chips to provide estimates of LD in native Tunisian cattle. The two measures of LD that were used, mean r2 and D', decreased from 0.26 to 0.05 and from 0.73 to 0.40, respectively, when the distance between markers increased from less than 20 Kb to 200 Kb. The decay in LD over physical distance occurred at a faster rate than that reported for European and other indigenous breeds, and reached background levels at less than 500 Kb distance. This is consistent with the absence of strong selective pressure within the Tunisian population and suggests that, in order to be effective, any potential genome-wide association mapping studies will need to use chips with higher marker density. An analysis of effective population size (Ne) based on LD data showed a decline in past Ne, with a sudden drop starting about eight generations ago. This finding, combined with the high levels of recent inbreeding revealed by runs of homozygosity (ROH) analysis, indicate that this population is endangered and may be in urgent need of a conservation plan that includes a well-designed genetic management program.
Collapse
Affiliation(s)
- Slim Ben Jemaa
- National Institute of Agronomic Research of Tunisia, Laboratoire des Productions Animales et Fourragères, Ariana, Tunisia
| | - Nejia Thamri
- Livestock and Pasture Office, Tunis Belvedere, Tunisia
| | | | - Emmanuelle Rebours
- GABI, INRA, AgroParisTech, Université Paris Saclay, Jouy-en-Josas, France
| | - Dominique Rocha
- GABI, INRA, AgroParisTech, Université Paris Saclay, Jouy-en-Josas, France
| | - Mekki Boussaha
- GABI, INRA, AgroParisTech, Université Paris Saclay, Jouy-en-Josas, France
| |
Collapse
|
26
|
High-resolution deconstruction of evolution induced by chemotherapy treatments in breast cancer xenografts. Sci Rep 2018; 8:17937. [PMID: 30560892 PMCID: PMC6298990 DOI: 10.1038/s41598-018-36184-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 11/14/2018] [Indexed: 12/30/2022] Open
Abstract
The processes by which tumors evolve are essential to the efficacy of treatment, but quantitative understanding of intratumoral dynamics has been limited. Although intratumoral heterogeneity is common, quantification of evolution is difficult from clinical samples because treatment replicates cannot be performed and because matched serial samples are infrequently available. To circumvent these problems we derived and assayed large sets of human triple-negative breast cancer xenografts and cell cultures from two patients, including 86 xenografts from cyclophosphamide, doxorubicin, cisplatin, docetaxel, or vehicle treatment cohorts as well as 45 related cell cultures. We assayed these samples via exome-seq and/or high-resolution droplet digital PCR, allowing us to distinguish complex therapy-induced selection and drift processes among endogenous cancer subclones with cellularity uncertainty <3%. For one patient, we discovered two predominant subclones that were granularly intermixed in all 48 co-derived xenograft samples. These two subclones exhibited differential chemotherapy sensitivity–when xenografts were treated with cisplatin for 3 weeks, the post-treatment volume change was proportional to the post-treatment ratio of subclones on a xenograft-to-xenograft basis. A subsequent cohort in which xenografts were treated with cisplatin, allowed a drug holiday, then treated a second time continued to exhibit this proportionality. In contrast, xenografts from other treatment cohorts, spatially dissected xenograft fragments, and cell cultures evolved in diverse ways but with substantial population bottlenecks. These results show that ecosystems susceptible to successive retreatment can arise spontaneously in breast cancer in spite of a background of irregular subclonal bottlenecks, and our work provides to our knowledge the first quantification of the population genetics of such a system. Intriguingly, in such an ecosystem the ratio of common subclones is predictive of the state of treatment susceptibility, showing how measurements of subclonal heterogeneity could guide treatment for some patients.
Collapse
|
27
|
Karafet TM, Osipova LP, Savina OV, Hallmark B, Hammer MF. Siberian genetic diversity reveals complex origins of the Samoyedic-speaking populations. Am J Hum Biol 2018; 30:e23194. [PMID: 30408262 DOI: 10.1002/ajhb.23194] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2018] [Revised: 08/26/2018] [Accepted: 09/23/2018] [Indexed: 12/21/2022] Open
Abstract
OBJECTIVES We examined autosomal genome-wide SNPs and Y-chromosome data from 15 Siberian and 12 reference populations to study the affinities of Siberian populations, and to address hypotheses about the origin of the Samoyed peoples. METHODS Samples were genotyped for 567 096 autosomal SNPs and 147 Y-chromosome polymorphic sites. For several analyses, we used 281 093 SNPs from the intersection of our data with publicly available ancient Siberian samples. To examine genetic relatedness among populations, we applied PCA, FST , TreeMix, and ADMIXTURE analyses. To explore the potential effect of demography and evolutionary processes, the distribution of ROH and IBD sharing within population were studied. RESULTS Analyses of autosomal and Y-chromosome data reveal high differentiation of the Siberian groups. The Siberian populations have a large proportion of their genome in ROH and IBD segments. Several populations (ie, Nganasans, Evenks, Yukagirs, and Koryaks) do not appear to have experienced admixture with other Siberian populations (ie, producing only positive f3), while for the other tested populations the composition of mixing sources always included Nganasans or Evenks. The Nganasans from the Taymyr Peninsula demonstrate the greatest level of shared shorter ROH and IBD with nearly all other Siberian populations. CONCLUSIONS Autosomal SNP and Y-chromosome data demonstrate that Samoyedic populations differ significantly in their genetic composition. Genetic relationship is observed only between Forest and Tundra Nentsi. Selkups are affiliated with the Kets from the Yenisey River, while the Nganasans are separated from their linguistic neighbors, showing closer affinities with the Evenks and Yukagirs.
Collapse
Affiliation(s)
- Tatiana M Karafet
- ARL Division of Biotechnology, University of Arizona, Tucson, Arizona
| | - Ludmila P Osipova
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia.,Novosibirsk State University, Novosibirsk, Russia
| | - Olga V Savina
- ARL Division of Biotechnology, University of Arizona, Tucson, Arizona
| | - Brian Hallmark
- Interdisciplinary Program in Statistics, University of Arizona, Tucson, Arizona
| | - Michael F Hammer
- ARL Division of Biotechnology, University of Arizona, Tucson, Arizona.,Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona
| |
Collapse
|
28
|
Garg P, Joshi RS, Watson C, Sharp AJ. A survey of inter-individual variation in DNA methylation identifies environmentally responsive co-regulated networks of epigenetic variation in the human genome. PLoS Genet 2018; 14:e1007707. [PMID: 30273333 PMCID: PMC6181428 DOI: 10.1371/journal.pgen.1007707] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Revised: 10/11/2018] [Accepted: 09/08/2018] [Indexed: 12/22/2022] Open
Abstract
While population studies have resulted in detailed maps of genetic variation in humans, to date there are few robust maps of epigenetic variation. We identified sites containing clusters of CpGs with high inter-individual epigenetic variation, termed Variably Methylated Regions (VMRs) in five purified cell types. We observed that VMRs occur preferentially at enhancers and 3' UTRs. While the majority of VMRs have high heritability, a subset of VMRs within the genome show highly correlated variation in trans, forming co-regulated networks that have low heritability, differ between cell types and are enriched for specific transcription factor binding sites and biological pathways of functional relevance to each tissue. For example, in T cells we defined a network of 95 co-regulated VMRs enriched for genes with roles in T-cell activation; in fibroblasts a network of 34 co-regulated VMRs comprising all four HOX gene clusters enriched for control of tissue growth; and in neurons a network of 18 VMRs enriched for roles in synaptic signaling. By culturing genetically-identical fibroblasts under varying environmental conditions, we experimentally demonstrated that some VMR networks are responsive to the environment, with methylation levels at these loci changing in a coordinated fashion in trans dependent on cellular growth. Intriguingly these environmentally-responsive VMRs showed a strong enrichment for imprinted loci (p<10-80), suggesting that these are particularly sensitive to environmental conditions. Our study provides a detailed map of common epigenetic variation in the human genome, showing that both genetic and environmental causes underlie this variation.
Collapse
Affiliation(s)
- Paras Garg
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Ricky S. Joshi
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Corey Watson
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Andrew J. Sharp
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| |
Collapse
|
29
|
Wirtz J, Rauscher M, Wiehe T. Topological linkage disequilibrium calculated from coalescent genealogies. Theor Popul Biol 2018; 124:41-50. [PMID: 30243857 DOI: 10.1016/j.tpb.2018.09.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2018] [Revised: 08/03/2018] [Accepted: 09/06/2018] [Indexed: 10/28/2022]
Abstract
We revisit the classical, and introduce a novel, concept of two-locus linkage disequilibrium (LD). In contrast to defining haplotypes as allele combinations at two marker loci, we concentrate on the clustering of a sample of chromosomes induced by their coalescent genealogy. The root of a binary coalescent tree defines two clusters of chromosomes, each one of them containing the left and right descendants of the root. At two different loci this assignment may be different as a result of recombination. We show that the proportion of shared chromosomes among clusters at two different loci, measured by the squared correlation, constitutes a natural measure of LD. We call this topological LD (tLD) since it is induced by the topology of the coalescent tree. We find that it is, on average, larger than classical LD for any given distance between loci. Furthermore, tLD has a smaller coefficient of variation, which should provide an advantage, compared to the use of classical LD, for any kind of mapping purposes. We conclude with a practical application to the LCT region in human populations.
Collapse
Affiliation(s)
| | | | - Thomas Wiehe
- Institut für Genetik, Universität zu Köln, Germany.
| |
Collapse
|
30
|
Oliveira M, Ferreira J, Fernandes V, Sakuntabhai A, Pereira L. Host ancestry and dengue fever: from mapping of candidate genes to prediction of worldwide genetic risk. Future Virol 2018. [DOI: 10.2217/fvl-2018-0073] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Prevalence of many complex diseases, including dengue disease, is not even across the world. Dengue is endemic in several overpopulated urban centers in poor Asian and American countries, representing a risk to two-thirds of the human population. High-throughput genomic screenings and new algorithms of admixture mapping are allowing confirming that ancestry plays a major role in dengue disease. Allele and haplotype candidate frequencies display high heterogeneity between population groups, especially in comparison with Africans. Genetic risk evaluation testifies African ancestry protection against the more severe forms of the disease, most probably due to natural selection of protective variants. European genetic predisposition is identical to the Asian for severer forms, but lower for mild dengue fever.
Collapse
Affiliation(s)
- Marisa Oliveira
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal
- Instituto de Ciências Biomédicas Abel Salazar (ICBAS), Universidade do Porto, 4050-313 Porto, Portugal
- Institut Pasteur, Functional Genetics of Infectious Diseases Unit, 75015 Paris, France
| | - Joana Ferreira
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal
- Instituto de Ciências Biomédicas Abel Salazar (ICBAS), Universidade do Porto, 4050-313 Porto, Portugal
| | - Verónica Fernandes
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal
| | - Anavaj Sakuntabhai
- Institut Pasteur, Functional Genetics of Infectious Diseases Unit, 75015 Paris, France
- Pasteur Kyoto International Joint Research Unit for Integrative Vaccinomics, Kyoto, Japan
- CNRS, Unité de Recherche Associée 3012, 75015 Paris, France
| | - Luisa Pereira
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal
- Departamento de Patologia, Faculdade de Medicina da Universidade do Porto, 4200-319 Porto, Portugal
| |
Collapse
|
31
|
Arciero E, Kraaijenbrink T, Asan, Haber M, Mezzavilla M, Ayub Q, Wang W, Pingcuo Z, Yang H, Wang J, Jobling MA, van Driem G, Xue Y, de Knijff P, Tyler-Smith C. Demographic History and Genetic Adaptation in the Himalayan Region Inferred from Genome-Wide SNP Genotypes of 49 Populations. Mol Biol Evol 2018; 35:1916-1933. [PMID: 29796643 PMCID: PMC6063301 DOI: 10.1093/molbev/msy094] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
We genotyped 738 individuals belonging to 49 populations from Nepal, Bhutan, North India, or Tibet at over 500,000 SNPs, and analyzed the genotypes in the context of available worldwide population data in order to investigate the demographic history of the region and the genetic adaptations to the harsh environment. The Himalayan populations resembled other South and East Asians, but in addition displayed their own specific ancestral component and showed strong population structure and genetic drift. We also found evidence for multiple admixture events involving Himalayan populations and South/East Asians between 200 and 2,000 years ago. In comparisons with available ancient genomes, the Himalayans, like other East and South Asian populations, showed similar genetic affinity to Eurasian hunter-gatherers (a 24,000-year-old Upper Palaeolithic Siberian), and the related Bronze Age Yamnaya. The high-altitude Himalayan populations all shared a specific ancestral component, suggesting that genetic adaptation to life at high altitude originated only once in this region and subsequently spread. Combining four approaches to identifying specific positively selected loci, we confirmed that the strongest signals of high-altitude adaptation were located near the Endothelial PAS domain-containing protein 1 and Egl-9 Family Hypoxia Inducible Factor 1 loci, and discovered eight additional robust signals of high-altitude adaptation, five of which have strong biological functional links to such adaptation. In conclusion, the demographic history of Himalayan populations is complex, with strong local differentiation, reflecting both genetic and cultural factors; these populations also display evidence of multiple genetic adaptations to high-altitude environments.
Collapse
Affiliation(s)
- Elena Arciero
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Thirsa Kraaijenbrink
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Asan
- BGI-Shenzhen, Shenzhen, China
| | - Marc Haber
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Massimo Mezzavilla
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Division of Experimental Genetics, Sidra Medical and Research Center, Doha, Qatar
| | - Qasim Ayub
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Tropical Medicine and Biology Multidisciplinary Platform, Monash University Malaysia Genomics Facility, Selangor Darul Ehsan, Malaysia
- School of Science, Monash University Malaysia, Selangor Darul Ehsan, Malaysia
| | | | - Zhaxi Pingcuo
- The Third People’s Hospital of the Tibet Autonomous Region, Lhasa, China
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, China
- James D. Watson Institute of Genome Science, Hangzhou, China
| | - Jian Wang
- BGI-Shenzhen, Shenzhen, China
- James D. Watson Institute of Genome Science, Hangzhou, China
| | - Mark A Jobling
- Department of Genetics & Genome Biology, University of Leicester, Leicester, United Kingdom
| | | | - Yali Xue
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Peter de Knijff
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Chris Tyler-Smith
- The Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| |
Collapse
|
32
|
Jinam TA, Phipps ME, Aghakhanian F, Majumder PP, Datar F, Stoneking M, Sawai H, Nishida N, Tokunaga K, Kawamura S, Omoto K, Saitou N. Discerning the Origins of the Negritos, First Sundaland People: Deep Divergence and Archaic Admixture. Genome Biol Evol 2018; 9:2013-2022. [PMID: 28854687 PMCID: PMC5597900 DOI: 10.1093/gbe/evx118] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/09/2017] [Indexed: 12/26/2022] Open
Abstract
Human presence in Southeast Asia dates back to at least 40,000 years ago, when the current islands formed a continental shelf called Sundaland. In the Philippine Islands, Peninsular Malaysia, and Andaman Islands, there exist indigenous groups collectively called Negritos whose ancestry can be traced to the "First Sundaland People." To understand the relationship between these Negrito groups and their demographic histories, we generated genome-wide single nucleotide polymorphism data in the Philippine Negritos and compared them with existing data from other populations. Phylogenetic tree analyses show that Negritos are basal to other East and Southeast Asians, and that they diverged from West Eurasians at least 38,000 years ago. We also found relatively high traces of Denisovan admixture in the Philippine Negritos, but not in the Malaysian and Andamanese groups, suggesting independent introgression and/or parallel losses involving Denisovan introgressed regions. Shared genetic loci between all three Negrito groups could be related to skin pigmentation, height, facial morphology and malarial resistance. These results show the unique status of Negrito groups as descended from the First Sundaland People.
Collapse
Affiliation(s)
- Timothy A Jinam
- Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| | - Maude E Phipps
- Jeffrey Cheah School of Medicine and Health Sciences, Monash University Malaysia, Sunway City, Selangor, Malaysia
| | - Farhang Aghakhanian
- Jeffrey Cheah School of Medicine and Health Sciences, Monash University Malaysia, Sunway City, Selangor, Malaysia
| | - Partha P Majumder
- National Institute of Biomedical Genomics, Kalyani, West Bengal, India
| | - Francisco Datar
- Department of Anthropology, University of the Philippines, Diliman, Quezon City, The Philippines
| | - Mark Stoneking
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Hiromi Sawai
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Japan
| | - Nao Nishida
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Japan.,Department of Hepatic Disease, Research Center for Hepatitis and Immunology, National Center for Global Health and Medicine, Chiba, Japan
| | - Katsushi Tokunaga
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Japan
| | - Shoji Kawamura
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Japan
| | - Keiichi Omoto
- Department of Anthropology, Faculty of Science, The University of Tokyo, Japan
| | - Naruya Saitou
- Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| |
Collapse
|
33
|
Oliveira M, Saraiva DP, Cavadas B, Fernandes V, Pedro N, Casademont I, Koeth F, Alshamali F, Harich N, Cherni L, Sierra B, Guzman MG, Sakuntabhai A, Pereira L. Population genetics-informed meta-analysis in seven genes associated with risk to dengue fever disease. INFECTION GENETICS AND EVOLUTION 2018; 62:60-72. [PMID: 29673983 DOI: 10.1016/j.meegid.2018.04.018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Revised: 04/02/2018] [Accepted: 04/13/2018] [Indexed: 12/30/2022]
Abstract
Population genetics theory predicted that rare frequent markers would be the main contributors for heritability of complex diseases, but meta-analyses of genome-wide association studies are revealing otherwise common markers, present in all population groups, as the identified candidate genes. In this work, we applied a population-genetics informed meta-analysis to 10 markers located in seven genes said to be associated with dengue fever disease. Seven markers (in PLCE1, CD32, CD209, OAS1 and OAS3 genes) have high-frequency and the other three (in MICB and TNFA genes) have intermediate frequency. Most of these markers have high discriminatory power between population groups, but their frequencies follow the rules of genetic drift, and seem to have not been under strong selective pressure. There was a good agreement in directional consistency across trans-ethnic association signals, in East Asian and Latin American cohorts, with heterogeneity generated by randomness between studies and especially by low sample sizes. This led to confirm the following significant associations: with DF, odds ratio of 0.67 for TNFA-rs1800629-A; with DHF, 0.82 for CD32-rs1801274-G; with DSS, 0.55 for OAS3-rs2285933-G, 0.80 for PLCE1-rs2274223-G and 1.32 for MICB-rs3132468-C. The overall genetic risks confirmed sub-Saharan African populations and descendants as the best protected against the severer forms of the disease, while Southeast and Northeast Asians are the least protected ones. European and close neighbours are the best protected against dengue fever, while, again, Southeast and Northeast Asians are the least protected ones. These risk scores provide important predictive information for the largely naïve European and North American regions, as well as for Africa where misdiagnosis with other hemorrhagic diseases is of concern.
Collapse
Affiliation(s)
- Marisa Oliveira
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal; Instituto de Ciências Biomédicas Abel Salazar (ICBAS), Universidade do Porto, 4050-313 Porto, Portugal; Institut Pasteur, Functional Genetics of Infectious Diseases Unit, 75724 Paris Cedex 15, France
| | - Diana P Saraiva
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal; Institut Pasteur, Functional Genetics of Infectious Diseases Unit, 75724 Paris Cedex 15, France
| | - Bruno Cavadas
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal; Instituto de Ciências Biomédicas Abel Salazar (ICBAS), Universidade do Porto, 4050-313 Porto, Portugal
| | - Verónica Fernandes
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal
| | - Nicole Pedro
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal
| | - Isabelle Casademont
- Institut Pasteur, Functional Genetics of Infectious Diseases Unit, 75724 Paris Cedex 15, France; Pasteur Kyoto International Joint Research Unit for Integrative Vaccinomics, Kyoto, Japan
| | - Fanny Koeth
- Institut Pasteur, Functional Genetics of Infectious Diseases Unit, 75724 Paris Cedex 15, France; Pasteur Kyoto International Joint Research Unit for Integrative Vaccinomics, Kyoto, Japan
| | - Farida Alshamali
- General Department of Forensic Sciences and Criminology, Dubai Police General Headquarters, PO Box 1493, Dubai, United Arab Emirates
| | - Nourdin Harich
- Laboratoire des Sciences Anthropogénétiques et Biotechnologies, Départment de Biologie, Université Chouaïb Doukkali, El Jadida 24000, Morocco
| | - Lotfi Cherni
- Laboratory of Genetics, Immunology and Human Pathology, Faculté de Sciences de Tunis, Université de Tunis El Manar, Tunis 2092, Tunisia; Tunis and High Institute of Biotechnology, University of Monastir, 5000 Monastir, Tunisia
| | - Beatriz Sierra
- Virology Department, PAHO/WHO Collaborating Center for the Study of Dengue and its Vector, Pedro Kourí Institute of Tropical Medicine (IPK), 601 Havana, Cuba
| | - Maria G Guzman
- Virology Department, PAHO/WHO Collaborating Center for the Study of Dengue and its Vector, Pedro Kourí Institute of Tropical Medicine (IPK), 601 Havana, Cuba
| | - Anavaj Sakuntabhai
- Institut Pasteur, Functional Genetics of Infectious Diseases Unit, 75724 Paris Cedex 15, France; Pasteur Kyoto International Joint Research Unit for Integrative Vaccinomics, Kyoto, Japan; CNRS UMR2000: Génomique évolutive, modélisation et santé (GEMS), Paris, France
| | - Luisa Pereira
- i3S - Instituto de Investigação e Inovação em Saúde, Universidade do Porto, 4200-135 Porto, Portugal; Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), 4200-135 Porto, Portugal; Faculdade de Medicina da Universidade do Porto, Portugal.
| |
Collapse
|
34
|
Wang Y, Lu D, Chung YJ, Xu S. Genetic structure, divergence and admixture of Han Chinese, Japanese and Korean populations. Hereditas 2018; 155:19. [PMID: 29636655 PMCID: PMC5889524 DOI: 10.1186/s41065-018-0057-5] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Accepted: 03/23/2018] [Indexed: 12/25/2022] Open
Abstract
Background Han Chinese, Japanese and Korean, the three major ethnic groups of East Asia, share many similarities in appearance, language and culture etc., but their genetic relationships, divergence times and subsequent genetic exchanges have not been well studied. Results We conducted a genome-wide study and evaluated the population structure of 182 Han Chinese, 90 Japanese and 100 Korean individuals, together with the data of 630 individuals representing 8 populations wordwide. Our analyses revealed that Han Chinese, Japanese and Korean populations have distinct genetic makeup and can be well distinguished based on either the genome wide data or a panel of ancestry informative markers (AIMs). Their genetic structure corresponds well to their geographical distributions, indicating geographical isolation played a critical role in driving population differentiation in East Asia. The most recent common ancestor of the three populations was dated back to 3000 ~ 3600 years ago. Our analyses also revealed substantial admixture within the three populations which occurred subsequent to initial splits, and distinct gene introgression from surrounding populations, of which northern ancestral component is dominant. Conclusions These estimations and findings facilitate to understanding population history and mechanism of human genetic diversity in East Asia, and have implications for both evolutionary and medical studies.
Collapse
Affiliation(s)
- Yuchen Wang
- 1Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China.,2University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Dongsheng Lu
- 1Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China.,2University of Chinese Academy of Sciences, Beijing, 100049 China
| | - Yeun-Jun Chung
- 3Integrated Research Center for Genome Polymorphism, Department of Microbiology, The Catholic University Medical College, Seoul, Socho-gu 137-701 South Korea
| | - Shuhua Xu
- 1Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031 China.,2University of Chinese Academy of Sciences, Beijing, 100049 China.,4School of Life Science and Technology ShanghaiTech University, Shanghai, 201210 China.,5Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223 China.,Collaborative Innovation Center of Genetics and Development, Shanghai, 200438 China
| |
Collapse
|
35
|
Pugach I, Duggan AT, Merriwether DA, Friedlaender FR, Friedlaender JS, Stoneking M. The Gateway from Near into Remote Oceania: New Insights from Genome-Wide Data. Mol Biol Evol 2018; 35:871-886. [PMID: 29301001 PMCID: PMC5889034 DOI: 10.1093/molbev/msx333] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
A widely accepted two-wave scenario of human settlement of Oceania involves the first out-of-Africa migration circa 50,000 years ago (ya), and the more recent Austronesian expansion, which reached the Bismarck Archipelago by 3,450 ya. Whereas earlier genetic studies provided evidence for extensive sex-biased admixture between the incoming and the indigenous populations, some archaeological, linguistic, and genetic evidence indicates a more complicated picture of settlement. To study regional variation in Oceania in more detail, we have compiled a genome-wide data set of 823 individuals from 72 populations (including 50 populations from Oceania) and over 620,000 autosomal single nucleotide polymorphisms (SNPs). We show that the initial dispersal of people from the Bismarck Archipelago into Remote Oceania occurred in a "leapfrog" fashion, completely by-passing the main chain of the Solomon Islands, and that the colonization of the Solomon Islands proceeded in a bidirectional manner. Our results also support a divergence between western and eastern Solomons, in agreement with the sharp linguistic divide known as the Tryon-Hackman line. We also report substantial post-Austronesian gene flow across the Solomons. In particular, Santa Cruz (in Remote Oceania) exhibits extraordinarily high levels of Papuan ancestry that cannot be explained by a simple bottleneck/founder event scenario. Finally, we use simulations to show that discrepancies between different methods for dating admixture likely reflect different sensitivities of the methods to multiple admixture events from the same (or similar) sources. Overall, this study points to the importance of fine-scale sampling to understand the complexities of human population history.
Collapse
Affiliation(s)
- Irina Pugach
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Ana T Duggan
- Department of Anthropology, McMaster University, Hamilton, Canada
| | | | | | | | - Mark Stoneking
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| |
Collapse
|
36
|
Yew CW, Lu D, Deng L, Wong LP, Ong RTH, Lu Y, Wang X, Yunus Y, Aghakhanian F, Mokhtar SS, Hoque MZ, Voo CLY, Abdul Rahman T, Bhak J, Phipps ME, Xu S, Teo YY, Kumar SV, Hoh BP. Genomic structure of the native inhabitants of Peninsular Malaysia and North Borneo suggests complex human population history in Southeast Asia. Hum Genet 2018; 137:161-173. [DOI: 10.1007/s00439-018-1869-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 01/22/2018] [Indexed: 11/28/2022]
|
37
|
Prieur V, Clarke SM, Brito LF, McEwan JC, Lee MA, Brauning R, Dodds KG, Auvray B. Estimation of linkage disequilibrium and effective population size in New Zealand sheep using three different methods to create genetic maps. BMC Genet 2017; 18:68. [PMID: 28732466 PMCID: PMC5521107 DOI: 10.1186/s12863-017-0534-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2017] [Accepted: 07/11/2017] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND Investments in genetic selection have played a major role in the New Zealand sheep industry competitiveness. Selection may erode genetic diversity, which is a crucial factor for the success of breeding programs. Better understanding of linkage disequilibrium (LD) and ancestral effective population size (Ne) through quantifying this diversity and comparison between populations allows for more informed decisions with regards to selective breeding taking population genetic diversity into account. The estimation of N e can be determined via genetic markers and requires knowledge of genetic distances between these markers. Single nucleotide polymorphisms (SNP) data from a sample of 12,597 New Zealand crossbred and purebred sheep genotyped with the Illumina Ovine SNP50 BeadChip was used to perform a genome-wide scan of LD and N e . Three methods to estimate genetic distances were investigated: 1) M1: a ratio fixed across the whole genome of one Megabase per centiMorgan; 2) M2: the ratios of genetic distance (using M3, below) over physical distance fixed for each chromosome; and, 3) M3: a genetic map of inter-SNP distances estimated using CRIMAP software (v2.503). RESULTS The estimates obtained with M2 and M3 showed much less variability between autosomes than those with M1, which tended to give lower N e results and higher LD decay. The results suggest that N e has decreased since the development of sheep breeds in Europe and this reduction in Ne has been accelerated in the last three decades. The N e estimated for five generations in the past ranged from 71 to 237 for Texel and Romney breeds, respectively. A low level of genetic kinship and inbreeding was estimated in those breeds suggesting avoidance of mating close relatives. CONCLUSIONS M3 was considered the most accurate method to create genetic maps for the estimation of LD and Ne. The findings of this study highlight the history of genetic selection in New Zealand crossbred and purebred sheep and these results will be very useful to understand genetic diversity of the population with respect to genetic selection. In addition, it will help geneticists to identify genomic regions which have been preferentially selected within a variety of breeds and populations.
Collapse
Affiliation(s)
- Vincent Prieur
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053 New Zealand
- Current address: France Limousin Sélection, Pôle de Lanaud, 87220 Boisseuil, France
| | - Shannon M. Clarke
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053 New Zealand
| | - Luiz F. Brito
- Centre for Genetic Improvement of Livestock, University of Guelph, N1G2W1, Guelph, Canada
| | - John C. McEwan
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053 New Zealand
| | - Michael A. Lee
- Department of Mathematics and Statistics, University of Otago, Dunedin, 9058 New Zealand
| | - Rudiger Brauning
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053 New Zealand
| | - Ken G. Dodds
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053 New Zealand
| | - Benoît Auvray
- Department of Mathematics and Statistics, University of Otago, Dunedin, 9058 New Zealand
| |
Collapse
|
38
|
Lu J, Lou H, Fu R, Lu D, Zhang F, Wu Z, Zhang X, Li C, Fang B, Pu F, Wei J, Wei Q, Zhang C, Wang X, Lu Y, Yan S, Yang Y, Jin L, Xu S. Assessing genome-wide copy number variation in the Han Chinese population. J Med Genet 2017; 54:685-692. [PMID: 28705883 DOI: 10.1136/jmedgenet-2017-104613] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Revised: 06/08/2017] [Accepted: 06/08/2017] [Indexed: 12/22/2022]
Abstract
BACKGROUND Copy number variation (CNV) is a valuable source of genetic diversity in the human genome and a well-recognised cause of various genetic diseases. However, CNVs have been considerably under-represented in population-based studies, particularly the Han Chinese which is the largest ethnic group in the world. OBJECTIVES To build a representative CNV map for the Han Chinese population. METHODS We conducted a genome-wide CNV study involving 451 male Han Chinese samples from 11 geographical regions encompassing 28 dialect groups, representing a less-biased panel compared with the currently available data. We detected CNVs by using 4.2M NimbleGen comparative genomic hybridisation array and whole-genome deep sequencing of 51 samples to optimise the filtering conditions in CNV discovery. RESULTS A comprehensive Han Chinese CNV map was built based on a set of high-quality variants (positive predictive value >0.8, with sizes ranging from 369 bp to 4.16 Mb and a median of 5907 bp). The map consists of 4012 CNV regions (CNVRs), and more than half are novel to the 30 East Asian CNV Project and the 1000 Genomes Project Phase 3. We further identified 81 CNVRs specific to regional groups, which was indicative of the subpopulation structure within the Han Chinese population. CONCLUSIONS Our data are complementary to public data sources, and the CNV map may facilitate in the identification of pathogenic CNVs and further biomedical research studies involving the Han Chinese population.
Collapse
Affiliation(s)
- Jianqi Lu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Haiyi Lou
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China
| | - Ruiqing Fu
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Dongsheng Lu
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Feng Zhang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai, China
| | - Zhendong Wu
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Xi Zhang
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Changhua Li
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Baijun Fang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Fangfang Pu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Jingning Wei
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Qian Wei
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Chao Zhang
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Xiaoji Wang
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Yan Lu
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China
| | - Shi Yan
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Yajun Yang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Li Jin
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.,Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai, China
| | - Shuhua Xu
- Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China.,Collaborative Innovation Center of Genetics and Development, Fudan University, Shanghai, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| |
Collapse
|
39
|
Zhang C, Lu Y, Feng Q, Wang X, Lou H, Liu J, Ning Z, Yuan K, Wang Y, Zhou Y, Deng L, Liu L, Yang Y, Li S, Ma L, Zhang Z, Jin L, Su B, Kang L, Xu S. Differentiated demographic histories and local adaptations between Sherpas and Tibetans. Genome Biol 2017; 18:115. [PMID: 28619099 PMCID: PMC5472941 DOI: 10.1186/s13059-017-1242-y] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2016] [Accepted: 05/22/2017] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND The genetic relationships reported by recent studies between Sherpas and Tibetans are controversial. To gain insights into the population history and the genetic basis of high-altitude adaptation of the two groups, we analyzed genome-wide data in 111 Sherpas (Tibet and Nepal) and 177 Tibetans (Tibet and Qinghai), together with available data from present-day human populations. RESULTS Sherpas and Tibetans show considerable genetic differences and can be distinguished as two distinct groups, even though the divergence between them (~3200-11,300 years ago) is much later than that between Han Chinese and either of the two groups (~6200-16,000 years ago). Sub-population structures exist in both Sherpas and Tibetans, corresponding to geographical or linguistic groups. Differentiation of genetic variants between Sherpas and Tibetans associated with adaptation to either high-altitude or ultraviolet radiation were identified and validated by genotyping additional Sherpa and Tibetan samples. CONCLUSIONS Our analyses indicate that both Sherpas and Tibetans are admixed populations, but the findings do not support the previous hypothesis that Tibetans derive their ancestry from Sherpas and Han Chinese. Compared to Tibetans, Sherpas show higher levels of South Asian ancestry, while Tibetans show higher levels of East Asian and Central Asian/Siberian ancestry. We propose a new model to elucidate the differentiated demographic histories and local adaptations of Sherpas and Tibetans.
Collapse
Affiliation(s)
- Chao Zhang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yan Lu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China
| | - Qidi Feng
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xiaoji Wang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
| | - Haiyi Lou
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China
| | - Jiaojiao Liu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China.,School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
| | - Zhilin Ning
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Kai Yuan
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yuchen Wang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Ying Zhou
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Lian Deng
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Lijun Liu
- Key Laboratory for Molecular Genetic Mechanisms and Intervention Research on High Altitude Disease of Tibet Autonomous Region, School of Medicine, Xizang Minzu University, Xianyang, Shaanxi, 712082, China
| | - Yajun Yang
- State Key Laboratory of Genetic Engineering and Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Shilin Li
- State Key Laboratory of Genetic Engineering and Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Lifeng Ma
- Key Laboratory for Molecular Genetic Mechanisms and Intervention Research on High Altitude Disease of Tibet Autonomous Region, School of Medicine, Xizang Minzu University, Xianyang, Shaanxi, 712082, China
| | - Zhiying Zhang
- Key Laboratory for Molecular Genetic Mechanisms and Intervention Research on High Altitude Disease of Tibet Autonomous Region, School of Medicine, Xizang Minzu University, Xianyang, Shaanxi, 712082, China
| | - Li Jin
- State Key Laboratory of Genetic Engineering and Ministry of Education (MOE) Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, 200433, China
| | - Bing Su
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650223, China
| | - Longli Kang
- Key Laboratory for Molecular Genetic Mechanisms and Intervention Research on High Altitude Disease of Tibet Autonomous Region, School of Medicine, Xizang Minzu University, Xianyang, Shaanxi, 712082, China
| | - Shuhua Xu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institutes for Biological Sciences, CAS, Shanghai, 200031, China. .,University of Chinese Academy of Sciences, Beijing, 100049, China. .,School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China. .,Collaborative Innovation Center of Genetics and Development, Shanghai, 200438, China.
| |
Collapse
|
40
|
Leonardi M, Barbujani G, Manica A. An earlier revolution: genetic and genomic analyses reveal pre-existing cultural differences leading to Neolithization. Sci Rep 2017; 7:3525. [PMID: 28615641 PMCID: PMC5471218 DOI: 10.1038/s41598-017-03717-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2016] [Accepted: 05/02/2017] [Indexed: 01/28/2023] Open
Abstract
Archaeological evidence shows that, in the long run, Neolitization (the transition from foraging to food production) was associated with demographic growth. We used two methods (patterns of linkage disequilibrium from whole-genome SNPs and MSMC estimates on genomes) to reconstruct the demographic profiles for respectively 64 and 24 modern-day populations with contrasting lifestyles across the Old World (sub-Saharan Africa, south-eastern Asia, Siberia). Surprisingly, in all regions, food producers had larger effective population sizes (Ne) than foragers already 20 k years ago, well before the Neolithic revolution. As expected, this difference further increased ~12–10 k years ago, around or just before the onset of food production. Using paleoclimate reconstructions, we show that the early difference in Ne cannot be explained by food producers inhabiting more favorable regions. A number of mechanisms, including ancestral differences in census size, sedentism, exploitation of the natural resources, social stratification or connectivity between groups, might have led to the early differences in Ne detected in our analyses. Irrespective of the specific mechanisms involved, our results provide further evidence that long term cultural differences among populations of Palaeolithic hunter-gatherers are likely to have played an important role in the later Neolithization process.
Collapse
Affiliation(s)
- Michela Leonardi
- Department of Life Sciences and Biotechnology, University of Ferrara, Via Borsari 44, 44121, Ferrara, Italy. .,Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Oester Voldgade 5-7, DK-1350, Copenhagen, Denmark.
| | - Guido Barbujani
- Department of Life Sciences and Biotechnology, University of Ferrara, Via Borsari 44, 44121, Ferrara, Italy
| | - Andrea Manica
- Department of Zoology, University of Cambridge, Downing street, CB2 3EJ, Cambridge, UK
| |
Collapse
|
41
|
Feng Q, Lu Y, Ni X, Yuan K, Yang Y, Yang X, Liu C, Lou H, Ning Z, Wang Y, Lu D, Zhang C, Zhou Y, Shi M, Tian L, Wang X, Zhang X, Li J, Khan A, Guan Y, Tang K, Wang S, Xu S. Genetic History of Xinjiang’s Uyghurs Suggests Bronze Age Multiple-Way Contacts in Eurasia. Mol Biol Evol 2017; 34:2572-2582. [DOI: 10.1093/molbev/msx177] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
42
|
Abstract
Indigenous Tibetan people have lived on the Tibetan Plateau for millennia. There is a long-standing question about the genetic basis of high-altitude adaptation in Tibetans. We conduct a genome-wide study of 7.3 million genotyped and imputed SNPs of 3,008 Tibetans and 7,287 non-Tibetan individuals of Eastern Asian ancestry. Using this large dataset, we detect signals of high-altitude adaptation at nine genomic loci, of which seven are unique. The alleles under natural selection at two of these loci [methylenetetrahydrofolate reductase (MTHFR) and EPAS1] are strongly associated with blood-related phenotypes, such as hemoglobin, homocysteine, and folate in Tibetans. The folate-increasing allele of rs1801133 at the MTHFR locus has an increased frequency in Tibetans more than expected under a drift model, which is probably a consequence of adaptation to high UV radiation. These findings provide important insights into understanding the genomic consequences of high-altitude adaptation in Tibetans.
Collapse
|
43
|
McManus KF, Taravella AM, Henn BM, Bustamante CD, Sikora M, Cornejo OE. Population genetic analysis of the DARC locus (Duffy) reveals adaptation from standing variation associated with malaria resistance in humans. PLoS Genet 2017; 13:e1006560. [PMID: 28282382 PMCID: PMC5365118 DOI: 10.1371/journal.pgen.1006560] [Citation(s) in RCA: 65] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Revised: 03/24/2017] [Accepted: 12/30/2016] [Indexed: 12/22/2022] Open
Abstract
The human DARC (Duffy antigen receptor for chemokines) gene encodes a membrane-bound chemokine receptor crucial for the infection of red blood cells by Plasmodium vivax, a major causative agent of malaria. Of the three major allelic classes segregating in human populations, the FY*O allele has been shown to protect against P. vivax infection and is at near fixation in sub-Saharan Africa, while FY*B and FY*A are common in Europe and Asia, respectively. Due to the combination of strong geographic differentiation and association with malaria resistance, DARC is considered a canonical example of positive selection in humans. Despite this, details of the timing and mode of selection at DARC remain poorly understood. Here, we use sequencing data from over 1,000 individuals in twenty-one human populations, as well as ancient human genomes, to perform a fine-scale investigation of the evolutionary history of DARC. We estimate the time to most recent common ancestor (TMRCA) of the most common FY*O haplotype to be 42 kya (95% CI: 34-49 kya). We infer the FY*O null mutation swept to fixation in Africa from standing variation with very low initial frequency (0.1%) and a selection coefficient of 0.043 (95% CI:0.011-0.18), which is among the strongest estimated in the human genome. We estimate the TMRCA of the FY*A mutation in non-Africans to be 57 kya (95% CI: 48-65 kya) and infer that, prior to the sweep of FY*O, all three alleles were segregating in Africa, as highly diverged populations from Asia and ≠Khomani San hunter-gatherers share the same FY*A haplotypes. We test multiple models of admixture that may account for this observation and reject recent Asian or European admixture as the cause.
Collapse
Affiliation(s)
- Kimberly F. McManus
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Angela M. Taravella
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, United States of America
| | - Brenna M. Henn
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, United States of America
| | - Carlos D. Bustamante
- Department of Biology, Stanford University, Stanford, California, United States of America
- Department of Genetics, Stanford University, Stanford, California, United States of America
| | - Martin Sikora
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Centre for Geogenetics, Natural History Museum Denmark, Copenhagen, Denmark
| | - Omar E. Cornejo
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Department of Biological Sciences, Washington State University, Pullman, washington, United States of America
| |
Collapse
|
44
|
Genomic insights into the population structure and history of the Irish Travellers. Sci Rep 2017; 7:42187. [PMID: 28181990 PMCID: PMC5299991 DOI: 10.1038/srep42187] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Accepted: 01/04/2017] [Indexed: 01/17/2023] Open
Abstract
The Irish Travellers are a population with a history of nomadism; consanguineous unions are common and they are socially isolated from the surrounding, ‘settled’ Irish people. Low-resolution genetic analysis suggests a common Irish origin between the settled and the Traveller populations. What is not known, however, is the extent of population structure within the Irish Travellers, the time of divergence from the general Irish population, or the extent of autozygosity. Using a sample of 50 Irish Travellers, 143 European Roma, 2232 settled Irish, 2039 British and 6255 European or world-wide individuals, we demonstrate evidence for population substructure within the Irish Traveller population, and estimate a time of divergence before the Great Famine of 1845–1852. We quantify the high levels of autozygosity, which are comparable to levels previously described in Orcadian 1st/2nd cousin offspring, and finally show the Irish Traveller population has no particular genetic links to the European Roma. The levels of autozygosity and distinct Irish origins have implications for disease mapping within Ireland, while the population structure and divergence inform on social history.
Collapse
|
45
|
Pilot M, Malewski T, Moura AE, Grzybowski T, Oleński K, Ruść A, Kamiński S, Ruiz Fadel F, Mills DS, Alagaili AN, Mohammed OB, Kłys G, Okhlopkov IM, Suchecka E, Bogdanowicz W. On the origin of mongrels: evolutionary history of free-breeding dogs in Eurasia. Proc Biol Sci 2017; 282:20152189. [PMID: 26631564 DOI: 10.1098/rspb.2015.2189] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Although a large part of the global domestic dog population is free-ranging and free-breeding, knowledge of genetic diversity in these free-breeding dogs (FBDs) and their ancestry relations to pure-breed dogs is limited, and the indigenous status of FBDs in Asia is still uncertain. We analyse genome-wide SNP variability of FBDs across Eurasia, and show that they display weak genetic structure and are genetically distinct from pure-breed dogs rather than constituting an admixture of breeds. Our results suggest that modern European breeds originated locally from European FBDs. East Asian and Arctic breeds show closest affinity to East Asian FBDs, and they both represent the earliest branching lineages in the phylogeny of extant Eurasian dogs. Our biogeographic reconstruction of ancestral distributions indicates a gradual westward expansion of East Asian indigenous dogs to the Middle East and Europe through Central and West Asia, providing evidence for a major expansion that shaped the patterns of genetic differentiation in modern dogs. This expansion was probably secondary and could have led to the replacement of earlier resident populations in Western Eurasia. This could explain why earlier studies based on modern DNA suggest East Asia as the region of dog origin, while ancient DNA and archaeological data point to Western Eurasia.
Collapse
Affiliation(s)
- Małgorzata Pilot
- Museum and Institute of Zoology, Polish Academy of Sciences, Wilcza 64, Warszawa 00-679, Poland School of Life Sciences, University of Lincoln, Green Lane, Lincoln LN6 7DL, UK
| | - Tadeusz Malewski
- Museum and Institute of Zoology, Polish Academy of Sciences, Wilcza 64, Warszawa 00-679, Poland
| | - Andre E Moura
- School of Life Sciences, University of Lincoln, Green Lane, Lincoln LN6 7DL, UK
| | - Tomasz Grzybowski
- Division of Molecular and Forensic Genetics, Department of Forensic Medicine, Ludwik Rydygier Collegium Medicum, Nicolaus Copernicus University, Skłodowskiej-Curie 9, Bydgoszcz 85-094, Poland
| | - Kamil Oleński
- Department of Animal Genetics, University of Warmia and Mazury, Oczapowskiego 5, Olsztyn 10-711, Poland
| | - Anna Ruść
- Department of Animal Genetics, University of Warmia and Mazury, Oczapowskiego 5, Olsztyn 10-711, Poland
| | - Stanisław Kamiński
- Department of Animal Genetics, University of Warmia and Mazury, Oczapowskiego 5, Olsztyn 10-711, Poland
| | - Fernanda Ruiz Fadel
- School of Life Sciences, University of Lincoln, Green Lane, Lincoln LN6 7DL, UK
| | - Daniel S Mills
- School of Life Sciences, University of Lincoln, Green Lane, Lincoln LN6 7DL, UK
| | - Abdulaziz N Alagaili
- KSU Mammals Research Chair, Department of Zoology, College of Science, King Saud University, PO Box 2455, Riyadh 11451, Saudi Arabia
| | - Osama B Mohammed
- KSU Mammals Research Chair, Department of Zoology, College of Science, King Saud University, PO Box 2455, Riyadh 11451, Saudi Arabia
| | - Grzegorz Kłys
- Department of Biosystematics, University of Opole, Oleska 22, Opole 45-052, Poland
| | - Innokentiy M Okhlopkov
- Science Institute of Biological Problems Cryolithozone, Siberian Branch of Russian Academy of Sciences, Yakutsk 677980, Russia
| | - Ewa Suchecka
- Museum and Institute of Zoology, Polish Academy of Sciences, Wilcza 64, Warszawa 00-679, Poland
| | - Wiesław Bogdanowicz
- Museum and Institute of Zoology, Polish Academy of Sciences, Wilcza 64, Warszawa 00-679, Poland
| |
Collapse
|
46
|
Pagani L, Colonna V, Tyler-Smith C, Ayub Q. An Ethnolinguistic and Genetic Perspective on the Origins of the Dravidian-Speaking Brahui in Pakistan. MAN IN INDIA 2017; 97:267-278. [PMID: 28381901 PMCID: PMC5378296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Pakistan is a part of South Asia that modern humans encountered soon after they left Africa ~50 - 70,000 years ago. Approximately 9,000 years ago they began establishing cities that eventually expanded to represent the Harappan culture, rivalling the early city states of Mesopotamia. The modern state constitutes the north western land mass of the Indian sub-continent and is now the abode of almost 200 million humans representing many ethnicities and linguistic groups. Studies utilising autosomal, Y chromosomal and mitochondrial DNA markers in selected Pakistani populations revealed a mixture of Western Eurasian-, South- and East Asian-specific lineages, some of which were unequivocally associated with past migrations. Overall in Pakistan, genetic relationships are generally predicted more accurately by geographic proximity than linguistic origin. The Dravidian-speaking Brahui population are a prime example of this. They currently reside in south-western Pakistan, surrounded by Indo-Europeans speakers with whom they share a common genetic origin. In contrast, the Hazara share the highest affinity with East Asians, despite their Indo-European linguistic affiliation. In this report we reexamine the genetic origins of the Brahuis, and compare them with diverse populations from India, including several Dravidian-speaking groups, and present a genetic perspective on ethnolinguistic groups in present-day Pakistan. Given the high affinity of Brahui to the other Indo-European Pakistani populations and the absence of population admixture with any of the examined Indian Dravidian groups, we conclude that Brahui are an example of cultural (linguistic) retention following a major population replacement.
Collapse
Affiliation(s)
- Luca Pagani
- Department of Archaeology and Anthropology, University of Cambridge, United Kingdom, Department of Biological, Geological and Environmental Sciences, University of Bologna, Italy
| | - Vincenza Colonna
- National Research Council, Institute of Genetics and Biophysics, Naples, Italy
| | - Chris Tyler-Smith
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, United Kingdom
| | - Qasim Ayub
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, United Kingdom
| |
Collapse
|
47
|
Tsur E, Friger M, Menashe I. The Unique Evolutionary Signature of Genes Associated with Autism Spectrum Disorder. Behav Genet 2016; 46:754-762. [PMID: 27515661 DOI: 10.1007/s10519-016-9804-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 08/04/2016] [Indexed: 11/29/2022]
Abstract
Autism spectrum disorder (ASD) is a common heritable neurodevelopmental disorder, which is characterized by communication and social deficits that reduce the reproductive fitness of individuals with the disorder. Here, we studied the genomic characteristics of 651 ASD genes in a whole-exome sequencing dataset, to search for traces of the evolutionary forces that helped maintain ASD in the human population. We show that ASD genes are ~65 longer and ~20 % less variable than non-ASD genes. The mutational shortage in ASD genes was particularly eminent when considering only deleterious genetic variations, which is a hallmark of negative selection. We further show that these genomic characteristics are unique to ASD genes, as compared with brain-specific genes or with genes of other diseases. Our findings suggest that ASD genes have evolved under complex evolutionary forces, which have left a unique signature that can be used to identify new candidate ASD genes.
Collapse
Affiliation(s)
- Erez Tsur
- Department of Public Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beersheba, Israel.,Zlotowski Center for Neuroscience, Ben-Gurion University of the Negev, Beersheba, Israel
| | - Michael Friger
- Department of Public Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beersheba, Israel
| | - Idan Menashe
- Department of Public Health, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beersheba, Israel. .,Zlotowski Center for Neuroscience, Ben-Gurion University of the Negev, Beersheba, Israel.
| |
Collapse
|
48
|
Zhang M, Zhou L, Bawa R, Suren H, Holliday J. Recombination Rate Variation, Hitchhiking, and Demographic History Shape Deleterious Load in Poplar. Mol Biol Evol 2016; 33:2899-2910. [DOI: 10.1093/molbev/msw169] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
|
49
|
López S, van Dorp L, Hellenthal G. Human Dispersal Out of Africa: A Lasting Debate. Evol Bioinform Online 2016; 11:57-68. [PMID: 27127403 PMCID: PMC4844272 DOI: 10.4137/ebo.s33489] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Revised: 02/21/2016] [Accepted: 02/21/2016] [Indexed: 01/01/2023] Open
Abstract
Unraveling the first migrations of anatomically modern humans out of Africa has invoked great interest among researchers from a wide range of disciplines. Available fossil, archeological, and climatic data offer many hypotheses, and as such genetics, with the advent of genome-wide genotyping and sequencing techniques and an increase in the availability of ancient samples, offers another important tool for testing theories relating to our own history. In this review, we report the ongoing debates regarding how and when our ancestors left Africa, how many waves of dispersal there were and what geographical routes were taken. We explore the validity of each, using current genetic literature coupled with some of the key archeological findings.
Collapse
Affiliation(s)
- Saioa López
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Lucy van Dorp
- Department of Genetics, Evolution and Environment, University College London, London, UK
- Centre for Mathematics and Physics in the Life Sciences and Experimental Biology (CoMPLEX), University College London, London, UK
| | - Garrett Hellenthal
- Department of Genetics, Evolution and Environment, University College London, London, UK
| |
Collapse
|
50
|
Pugach I, Matveev R, Spitsyn V, Makarov S, Novgorodov I, Osakovsky V, Stoneking M, Pakendorf B. The Complex Admixture History and Recent Southern Origins of Siberian Populations. Mol Biol Evol 2016; 33:1777-95. [PMID: 26993256 PMCID: PMC4915357 DOI: 10.1093/molbev/msw055] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Although Siberia was inhabited by modern humans at an early stage, there is still debate over whether it remained habitable during the extreme cold of the Last Glacial Maximum or whether it was subsequently repopulated by peoples with recent shared ancestry. Previous studies of the genetic history of Siberian populations were hampered by the extensive admixture that appears to have taken place among these populations, because commonly used methods assume a tree-like population history and at most single admixture events. Here we analyze geogenetic maps and use other approaches to distinguish the effects of shared ancestry from prehistoric migrations and contact, and develop a new method based on the covariance of ancestry components, to investigate the potentially complex admixture history. We furthermore adapt a previously devised method of admixture dating for use with multiple events of gene flow, and apply these methods to whole-genome genotype data from over 500 individuals belonging to 20 different Siberian ethnolinguistic groups. The results of these analyses indicate that there have been multiple layers of admixture detectable in most of the Siberian populations, with considerable differences in the admixture histories of individual populations. Furthermore, most of the populations of Siberia included here, even those settled far to the north, appear to have a southern origin, with the northward expansions of different populations possibly being driven partly by the advent of pastoralism, especially reindeer domestication. These newly developed methods to analyze multiple admixture events should aid in the investigation of similarly complex population histories elsewhere.
Collapse
Affiliation(s)
- Irina Pugach
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Rostislav Matveev
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
| | - Viktor Spitsyn
- Research Centre for Medical Genetics, Federal State Budgetary Institution, Moscow, Russian Federation
| | - Sergey Makarov
- Research Centre for Medical Genetics, Federal State Budgetary Institution, Moscow, Russian Federation
| | - Innokentiy Novgorodov
- Institute of Foreign Philology and Regional Studies, North-Eastern Federal University, Yakutsk, Russian Federation
| | - Vladimir Osakovsky
- Institute of Health, North-Eastern Federal University, Yakutsk, Russian Federation
| | - Mark Stoneking
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Brigitte Pakendorf
- Laboratoire Dynamique du Langage, UMR5596, CNRS and Université Lyon Lumière 2, Lyon, France
| |
Collapse
|