51
|
Lee J, Lee J, Jeon S, Lee J, Jang I, Yang JO, Park S, Lee B, Choi J, Choi BO, Gee HY, Oh J, Jang IJ, Lee S, Baek D, Koh Y, Yoon SS, Kim YJ, Chae JH, Park WY, Bhak JH, Choi M. A database of 5305 healthy Korean individuals reveals genetic and clinical implications for an East Asian population. Exp Mol Med 2022; 54:1862-1871. [PMID: 36323850 PMCID: PMC9628380 DOI: 10.1038/s12276-022-00871-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 07/21/2022] [Accepted: 08/08/2022] [Indexed: 11/29/2022] Open
Abstract
Despite substantial advances in disease genetics, studies to date have largely focused on individuals of European descent. This limits further discoveries of novel functional genetic variants in other ethnic groups. To alleviate the paucity of East Asian population genome resources, we established the Korean Variant Archive 2 (KOVA 2), which is composed of 1896 whole-genome sequences and 3409 whole-exome sequences from healthy individuals of Korean ethnicity. This is the largest genome database from the ethnic Korean population to date, surpassing the 1909 Korean individuals deposited in gnomAD. The variants in KOVA 2 displayed all the known genetic features of those from previous genome databases, and we compiled data from Korean-specific runs of homozygosity, positively selected intervals, and structural variants. In doing so, we found loci, such as the loci of ADH1A/1B and UHRF1BP1, that are strongly selected in the Korean population relative to other East Asian populations. Our analysis of allele ages revealed a correlation between variant functionality and evolutionary age. The data can be browsed and downloaded from a public website ( https://www.kobic.re.kr/kova/ ). We anticipate that KOVA 2 will serve as a valuable resource for genetic studies involving East Asian populations.
Collapse
Affiliation(s)
- Jeongeun Lee
- grid.31501.360000 0004 0470 5905Interdisciplinary Program in Bioengineering, Graduate School, Seoul National University, Seoul, 03080 Republic of Korea
| | - Jean Lee
- grid.31501.360000 0004 0470 5905Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080 Republic of Korea
| | - Sungwon Jeon
- grid.42687.3f0000 0004 0381 814XDepartment of Biomedical Engineering, College of Information and Biotechnology, Ulsan National Institute of Science and Technology (UNIST), Ulsan, 44919 Republic of Korea
| | - Jeongha Lee
- grid.31501.360000 0004 0470 5905Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080 Republic of Korea
| | - Insu Jang
- grid.249967.70000 0004 0636 3099Korea BioInformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141 Republic of Korea
| | - Jin Ok Yang
- grid.249967.70000 0004 0636 3099Korea BioInformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141 Republic of Korea ,grid.37172.300000 0001 2292 0500Department of Bio and Brain Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Republic of Korea
| | - Soojin Park
- grid.31501.360000 0004 0470 5905Department of Pediatrics, Seoul National University College of Medicine, Seoul, 03080 Republic of Korea
| | - Byungwook Lee
- grid.249967.70000 0004 0636 3099Korea BioInformation Center (KOBIC), Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141 Republic of Korea
| | - Jinwook Choi
- grid.31501.360000 0004 0470 5905Interdisciplinary Program in Bioengineering, Graduate School, Seoul National University, Seoul, 03080 Republic of Korea ,grid.31501.360000 0004 0470 5905Department of Biomedical Engineering, Seoul National University College of Medicine, Seoul, 03080 Republic of Korea
| | - Byung-Ok Choi
- grid.264381.a0000 0001 2181 989XDepartment of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, 06351 Republic of Korea
| | - Heon Yung Gee
- grid.15444.300000 0004 0470 5454Department of Pharmacology, Brain Korea 21 PLUS Project for Medical Sciences, Yonsei University College of Medicine, Seoul, 03722 Republic of Korea
| | - Jaeseong Oh
- grid.31501.360000 0004 0470 5905Department of Clinical Pharmacology and Therapeutics, Seoul National University College of Medicine and Hospital, Seoul, 03080 Republic of Korea
| | - In-Jin Jang
- grid.31501.360000 0004 0470 5905Department of Clinical Pharmacology and Therapeutics, Seoul National University College of Medicine and Hospital, Seoul, 03080 Republic of Korea
| | - Sanghyuk Lee
- grid.255649.90000 0001 2171 7754Department of Bio-Information Science, Ewha Womans University, Seoul, 03760 Republic of Korea
| | - Daehyun Baek
- grid.31501.360000 0004 0470 5905School of Biological Sciences, Seoul National University, Seoul, 08826 Republic of Korea
| | - Youngil Koh
- grid.412484.f0000 0001 0302 820XDepartment of Internal Medicine, Seoul National University Hospital, Seoul, 03080 Republic of Korea
| | - Sung-Soo Yoon
- grid.412484.f0000 0001 0302 820XDepartment of Internal Medicine, Seoul National University Hospital, Seoul, 03080 Republic of Korea
| | - Young-Joon Kim
- grid.15444.300000 0004 0470 5454Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University, Seoul, 03722 Republic of Korea
| | - Jong-Hee Chae
- grid.31501.360000 0004 0470 5905Department of Pediatrics, Seoul National University College of Medicine, Seoul, 03080 Republic of Korea ,grid.412484.f0000 0001 0302 820XDepartment of Genomic Medicine, Seoul National University Hospital, Seoul, 03080 Republic of Korea
| | - Woong-Yang Park
- grid.414964.a0000 0001 0640 5613Samsung Genome Institute, Samsung Medical Center, Seoul, 06351 Republic of Korea
| | - Jong Hwa Bhak
- grid.42687.3f0000 0004 0381 814XDepartment of Biomedical Engineering, College of Information and Biotechnology, Ulsan National Institute of Science and Technology (UNIST), Ulsan, 44919 Republic of Korea
| | - Murim Choi
- grid.31501.360000 0004 0470 5905Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080 Republic of Korea
| |
Collapse
|
52
|
Johnson KE, Adams CJ, Voight BF. Identifying rare variants inconsistent with identity-by-descent in population-scale whole-genome sequencing data. Methods Ecol Evol 2022; 13:2429-2442. [PMID: 38938451 PMCID: PMC11210625 DOI: 10.1111/2041-210x.13991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 09/12/2022] [Indexed: 12/01/2022]
Abstract
Analyses of genetic variation typically assume that rare variants within a population are inherited from a single common ancestral event identity-by-descent (IBD). However, there are genetic and technical processes through which rare variants in population genetic data may deviate from this simple evolutionary model, including recurrent mutations, gene conversions and genotyping error. All these processes can decrease the expected length of shared background haplotype surrounding a rare variant if that variant was inherited from a single event descending from a common ancestor. No method exists to computationally infer rare variants inconsistent with this simple model-denoted here as 'IBD-inconsistent'-using unphased population sequencing data.We hypothesized that the difference in shared haplotype background length can distinguish variants consistent and inconsistent with this simple IBD transmission population sequencing data without pedigree information. We implemented a Bayesian hierarchical model and used Gibbs sampling to estimate the posterior probability of IBD state for rare variants, using simulated recurrent mutations to demonstrate that our approach accurately distinguishes rare variants consistent and inconsistent with a simple IBD inheritance model.Applying our method to whole-genome sequencing data from 3,621 human individuals in the UK10K consortium, we found that IBD-inconsistent variants correlated with higher local mutation rates and genomic features like replication timing. Using a heuristic to categorize IBD-inconsistent variants as gene conversions, we found that potential gene conversions had expected properties such as enriched local GC content.By identifying IBD-inconsistent variants, we can better understand the spectrum of recent mutations in human populations, a source of genetic variation driving evolution and a key factor in understanding recent demographic history.
Collapse
Affiliation(s)
- Kelsey E. Johnson
- Cell and Molecular Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Christopher J. Adams
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Benjamin F. Voight
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
53
|
Zhang J, Richards ZT, Adam AAS, Chan CX, Shinzato C, Gilmour J, Thomas L, Strugnell JM, Miller DJ, Cooke I. Evolutionary responses of a reef-building coral to climate change at the end of the last glacial maximum. Mol Biol Evol 2022; 39:msac201. [PMID: 36219871 PMCID: PMC9578555 DOI: 10.1093/molbev/msac201] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 09/04/2022] [Accepted: 09/13/2022] [Indexed: 11/13/2022] Open
Abstract
Climate change threatens the survival of coral reefs on a global scale, primarily through mass bleaching and mortality as a result of marine heatwaves. While these short-term effects are clear, predicting the fate of coral reefs over the coming century is a major challenge. One way to understand the longer-term effects of rapid climate change is to examine the response of coral populations to past climate shifts. Coastal and shallow-water marine ecosystems such as coral reefs have been reshaped many times by sea-level changes during the Pleistocene, yet, few studies have directly linked this with its consequences on population demographics, dispersal, and adaptation. Here we use powerful analytical techniques, afforded by haplotype phased whole-genomes, to establish such links for the reef-building coral, Acropora digitifera. We show that three genetically distinct populations are present in northwestern Australia, and that their rapid divergence since the last glacial maximum (LGM) can be explained by a combination of founder-effects and restricted gene flow. Signatures of selective sweeps, too strong to be explained by demographic history, are present in all three populations and overlap with genes that show different patterns of functional enrichment between inshore and offshore habitats. In contrast to rapid divergence in the host, we find that photosymbiont communities are largely undifferentiated between corals from all three locations, spanning almost 1000 km, indicating that selection on host genes and not acquisition of novel symbionts, has been the primary driver of adaptation for this species in northwestern Australia.
Collapse
Affiliation(s)
- Jia Zhang
- Department of Molecular and Cell Biology, James Cook University, Townsville, QLD, 4811, Australia
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, 4811, Australia
- ARC Centre of Excellence for Coral Reef Studies, James Cook University, Townsville, QLD, 4811, Australia
| | - Zoe T Richards
- Coral Conservation and Research Group, Trace and Environmental DNA Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA 6102, Australia
- Collections and Research, Western Australian Museum, 49 Kew Street Welshpool, WA 6106, Australia
| | - Arne A S Adam
- Coral Conservation and Research Group, Trace and Environmental DNA Laboratory, School of Molecular and Life Sciences, Curtin University, Bentley, WA 6102, Australia
| | - Cheong Xin Chan
- The University of Queensland, School of Chemistry and Molecular Biosciences, Australian Centre for Ecogenomics, Brisbane, QLD 4072, Australia
| | - Chuya Shinzato
- Atmosphere and Ocean Research Institute, The University of Tokyo277-8564, Chiba, Japan
| | - James Gilmour
- Australia Institute of Marine Science, Indian Oceans Marine Research Centre, Crawley, WA, 6009, Australia
| | - Luke Thomas
- Australia Institute of Marine Science, Indian Oceans Marine Research Centre, Crawley, WA, 6009, Australia
- Oceans Graduate School, The UWA Oceans Institute, The University of Western Australia, Perth, WA, 6009, Australia
| | - Jan M Strugnell
- Department of Marine Biology and Aquaculture, James Cook University, Townsville, QLD, 4811, Australia
- Centre for Sustainable Fisheries and Aquaculture, James Cook University, Townsville, QLD, 4811, Australia
| | - David J Miller
- Department of Molecular and Cell Biology, James Cook University, Townsville, QLD, 4811, Australia
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, 4811, Australia
- ARC Centre of Excellence for Coral Reef Studies, James Cook University, Townsville, QLD, 4811, Australia
- Marine Climate Change Unit, Okinawa Institute of Science and Technology, Onna-son, Okinawa, Japan 904-0495
| | - Ira Cooke
- Department of Molecular and Cell Biology, James Cook University, Townsville, QLD, 4811, Australia
- Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Townsville, QLD, 4811, Australia
| |
Collapse
|
54
|
Alagöz G, Molz B, Eising E, Schijven D, Francks C, Stein JL, Fisher SE. Using neuroimaging genomics to investigate the evolution of human brain structure. Proc Natl Acad Sci U S A 2022; 119:e2200638119. [PMID: 36161899 PMCID: PMC9546597 DOI: 10.1073/pnas.2200638119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 08/15/2022] [Indexed: 01/16/2023] Open
Abstract
Alterations in brain size and organization represent some of the most distinctive changes in the emergence of our species. Yet, there is limited understanding of how genetic factors contributed to altered neuroanatomy during human evolution. Here, we analyze neuroimaging and genetic data from up to 30,000 people in the UK Biobank and integrate with genomic annotations for different aspects of human evolution, including those based on ancient DNA and comparative genomics. We show that previously reported signals of recent polygenic selection for cortical anatomy are not replicable in a more ancestrally homogeneous sample. We then investigate relationships between evolutionary annotations and common genetic variants shaping cortical surface area and white-matter connectivity for each hemisphere. Our analyses identify single-nucleotide polymorphism heritability enrichment in human-gained regulatory elements that are active in early brain development, affecting surface areas of several parts of the cortex, including left-hemispheric speech-associated regions. We also detect heritability depletion in genomic regions with Neanderthal ancestry for connectivity of the uncinate fasciculus; this is a white-matter tract involved in memory, language, and socioemotional processing with relevance to neuropsychiatric disorders. Finally, we show that common genetic loci associated with left-hemispheric pars triangularis surface area overlap with a human-gained enhancer and affect regulation of ZIC4, a gene implicated in neurogenesis. This work demonstrates how genomic investigations of present-day neuroanatomical variation can help shed light on the complexities of our evolutionary past.
Collapse
Affiliation(s)
- Gökberk Alagöz
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, 6500 AH Nijmegen, The Netherlands
| | - Barbara Molz
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, 6500 AH Nijmegen, The Netherlands
| | - Else Eising
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, 6500 AH Nijmegen, The Netherlands
| | - Dick Schijven
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, 6500 AH Nijmegen, The Netherlands
| | - Clyde Francks
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, 6500 AH Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, 6500 HB Nijmegen, The Netherlands
- Department of Human Genetics, Radboud University Medical Center, 6500 HB Nijmegen, The Netherlands
| | - Jason L. Stein
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599
- UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599
| | - Simon E. Fisher
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, 6500 AH Nijmegen, The Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, 6500 HB Nijmegen, The Netherlands
| |
Collapse
|
55
|
Brucato N, André M, Hudjashov G, Mondal M, Cox MP, Leavesley M, Ricaut FX. Chronology of natural selection in Oceanian genomes. iScience 2022; 25:104583. [PMID: 35880026 PMCID: PMC9308150 DOI: 10.1016/j.isci.2022.104583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/11/2022] [Accepted: 06/07/2022] [Indexed: 11/30/2022] Open
Abstract
As human populations left Asia to first settle in Oceania around 50,000 years ago, they entered a territory ecologically separated from the Old World for millions of years. We analyzed genomic data of 239 modern Oceanian individuals to detect and date signals of selection specific to this region. Combining both relative and absolute dating approaches, we identified a strong selection pattern between 52,000 and 54,000 years ago in the genomes of descendants of the first settlers of Sahul. This strikingly corresponds to the dates of initial settlement as inferred from archaeological evidence. Loci under selection during this period, some showing enrichment in Denisovan ancestry, overlap genes involved in the immune response and diet, especially based on plants. Pathogens and natural resources, especially from endemic plants, therefore appear to have acted as strong selective pressures on the genomes of the first settlers of Sahul. 239 human genomes from both sides of the Wallacean ecogeographical barriers Signals of selection are dated between -54,000 to -52,000 in modern Oceanian genomes Genes related to immunity and diet were under strong selection Denisovan introgressions participated to the genetic adaptations present in Oceanians
Collapse
Affiliation(s)
- Nicolas Brucato
- Laboratoire Évolution et Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS. 118 route de Narbonne, Bat 4R1, 31062 cedex 9 Toulouse, France
| | - Mathilde André
- Laboratoire Évolution et Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS. 118 route de Narbonne, Bat 4R1, 31062 cedex 9 Toulouse, France.,Institute of Genomics, University of Tartu, Tartu, 51010 Tartumaa, Estonia
| | - Georgi Hudjashov
- Institute of Genomics, University of Tartu, Tartu, 51010 Tartumaa, Estonia
| | - Mayukh Mondal
- Institute of Genomics, University of Tartu, Tartu, 51010 Tartumaa, Estonia
| | - Murray P Cox
- School of Natural Sciences, Massey University, Palmerston North 4442, New Zealand
| | - Matthew Leavesley
- Strand of Anthropology, Sociology and Archaeology, School of Humanities and Social Sciences, University of Papua New Guinea, PO Box 320, National Capital District 134, Papua New Guinea.,College of Arts, Society and Education, James Cook University, P.O. Box 6811, Cairns, QLD 4870, Australia.,ARC Centre of Excellence for Australian Biodiversity and Heritage, University of Wollongong, Wollongong, NSW 2522, Australia
| | - François-Xavier Ricaut
- Laboratoire Évolution et Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS. 118 route de Narbonne, Bat 4R1, 31062 cedex 9 Toulouse, France
| |
Collapse
|
56
|
Peyrégne S, Kelso J, Peter BM, Pääbo S. The evolutionary history of human spindle genes includes back-and-forth gene flow with Neandertals. eLife 2022; 11:75464. [PMID: 35816093 PMCID: PMC9273211 DOI: 10.7554/elife.75464] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 06/14/2022] [Indexed: 12/13/2022] Open
Abstract
Proteins associated with the spindle apparatus, a cytoskeletal structure that ensures the proper segregation of chromosomes during cell division, experienced an unusual number of amino acid substitutions in modern humans after the split from the ancestors of Neandertals and Denisovans. Here, we analyze the history of these substitutions and show that some of the genes in which they occur may have been targets of positive selection. We also find that the two changes in the kinetochore scaffold 1 (KNL1) protein, previously believed to be specific to modern humans, were present in some Neandertals. We show that the KNL1 gene of these Neandertals shared a common ancestor with present-day Africans about 200,000 years ago due to gene flow from the ancestors (or relatives) of modern humans into Neandertals. Subsequently, some non-Africans inherited this modern human-like gene variant from Neandertals, but none inherited the ancestral gene variants. These results add to the growing evidence of early contacts between modern humans and archaic groups in Eurasia and illustrate the intricate relationships among these groups.
Collapse
Affiliation(s)
- Stéphane Peyrégne
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Janet Kelso
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Benjamin M Peter
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Svante Pääbo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| |
Collapse
|
57
|
Rubin CJ, Enbody ED, Dobreva MP, Abzhanov A, Davis BW, Lamichhaney S, Pettersson M, Sendell-Price AT, Sprehn CG, Valle CA, Vasco K, Wallerman O, Grant BR, Grant PR, Andersson L. Rapid adaptive radiation of Darwin's finches depends on ancestral genetic modules. SCIENCE ADVANCES 2022; 8:eabm5982. [PMID: 35857449 PMCID: PMC9269886 DOI: 10.1126/sciadv.abm5982] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 05/25/2022] [Indexed: 05/21/2023]
Abstract
Recent adaptive radiations are models for investigating mechanisms contributing to the evolution of biodiversity. An unresolved question is the relative importance of new mutations, ancestral variants, and introgressive hybridization for phenotypic evolution and speciation. Here, we address this issue using Darwin's finches and investigate the genomic architecture underlying their phenotypic diversity. Admixture mapping for beak and body size in the small, medium, and large ground finches revealed 28 loci showing strong genetic differentiation. These loci represent ancestral haplotype blocks with origins predating speciation events during the Darwin's finch radiation. Genes expressed in the developing beak are overrepresented in these genomic regions. Ancestral haplotypes constitute genetic modules for selection and act as key determinants of the unusual phenotypic diversity of Darwin's finches. Such ancestral haplotype blocks can be critical for how species adapt to environmental variability and change.
Collapse
Affiliation(s)
- Carl-Johan Rubin
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Institute of Marine Research, Nordnesgaten 50, 5005 Bergen, Norway
| | - Erik D. Enbody
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Mariya P. Dobreva
- Department of Life Sciences, Imperial College London, Silwood Park Campus, SL5 7PY Ascot, UK
| | - Arhat Abzhanov
- Department of Life Sciences, Imperial College London, Silwood Park Campus, SL5 7PY Ascot, UK
| | - Brian W. Davis
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, USA
| | | | - Mats Pettersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Ashley T. Sendell-Price
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Edward Grey Institute, Department of Zoology, University of Oxford, Oxford, UK
| | - C. Grace Sprehn
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Carlos A. Valle
- Colegio de Ciencias Biológicas y Ambientales, Galápagos Science Center GSC, Universidad San Francisco de Quito USFQ, Quito, Ecuador
| | - Karla Vasco
- Colegio de Ciencias Biológicas y Ambientales, Galápagos Science Center GSC, Universidad San Francisco de Quito USFQ, Quito, Ecuador
| | - Ola Wallerman
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - B. Rosemary Grant
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
| | - Peter R. Grant
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
| | - Leif Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, USA
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
- Corresponding author.
| |
Collapse
|
58
|
Toncheva D, Marinova M, Borovska P, Serbezov D. Spatio-temporal dynamics of pathogenic variants associated with monogenic disorders reconstructed with ancient DNA. PLoS One 2022; 17:e0269628. [PMID: 35749392 PMCID: PMC9231702 DOI: 10.1371/journal.pone.0269628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 05/24/2022] [Indexed: 11/18/2022] Open
Abstract
Genetic disease burden in ancient communities has barely been evaluated despite an ever expanding body of ancient genomes becoming available. In this study, we inspect 2729 publicly available ancient genomes (100 BP—52000 BP) for the presence of pathogenic variants in 32643 disease-associated loci. We base our subsequent analyses on 19 variants in seven genes—PAH, EDAR, F11, HBB, LRRK2, SLC12A6 and MAOA, associated with monogenic diseases and with well-established pathogenic impact in contemporary populations. We determine 230 homozygote genotypes of these variants in the screened 2729 ancient DNA samples. Eleven of these are in the PAH gene (126 ancient samples in total), a gene associated with the condition phenylketonuria in modern populations. The variants examined seem to show varying dynamics over the last 10000 years, some exhibiting a single upsurge in frequency and subsequently disappearing, while others maintain high frequency levels (compared to contemporary population frequencies) over long time periods. The geographic distribution and age of the ancient DNA samples with established pathogenic variants suggests multiple independent origin of these variants. Comparison of estimates of the geographic prevalence of these variants from ancient and contemporary data show discontinuity in their prevalence and supports their recurrent emergence. The oldest samples in which a variant is established might give an indication of their age and place origin, and an EDAR gene pathogenic variant was established in a sample estimated to be 33210–32480 calBCE. Knowledge about the historical prevalence of variants causing monogenic disorders provides insight on their emergence, dynamics and spread.
Collapse
Affiliation(s)
- Draga Toncheva
- Department of Medical Genetics, Medical Faculty, Medical University-Sofia, Sofia, Bulgaria
- Bulgarian Academy of Sciences, Sofia, Bulgaria
- * E-mail:
| | - Maria Marinova
- Department of Computer systems and Technologies, Faculty of Electronics and Automation, Technical University–Sofia, Branch Plovdiv, Bulgaria
| | - Plamenka Borovska
- Department of Informatics, Faculty of Applied Mathematics and Informatics, Technical University of Sofia, Sofia, Bulgaria
| | - Dimitar Serbezov
- Department of Medical Genetics, Medical Faculty, Medical University-Sofia, Sofia, Bulgaria
| |
Collapse
|
59
|
Temporal mapping of derived high-frequency gene variants supports the mosaic nature of the evolution of Homo sapiens. Sci Rep 2022; 12:9937. [PMID: 35705575 PMCID: PMC9200848 DOI: 10.1038/s41598-022-13589-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 05/25/2022] [Indexed: 11/25/2022] Open
Abstract
Large-scale estimations of the time of emergence of variants are essential to examine hypotheses concerning human evolution with precision. Using an open repository of genetic variant age estimations, we offer here a temporal evaluation of various evolutionarily relevant datasets, such as Homo sapiens-specific variants, high-frequency variants found in genetic windows under positive selection, introgressed variants from extinct human species, as well as putative regulatory variants specific to various brain regions. We find a recurrent bimodal distribution of high-frequency variants, but also evidence for specific enrichments of gene categories in distinct time windows, pointing to different periods of phenotypic changes, resulting in a mosaic. With a temporal classification of genetic mutations in hand, we then applied a machine learning tool to predict what genes have changed more in certain time windows, and which tissues these genes may have impacted more. Overall, we provide a fine-grained temporal mapping of derived variants in Homo sapiens that helps to illuminate the intricate evolutionary history of our species.
Collapse
|
60
|
Barnard-Kubow KB, Becker D, Murray CS, Porter R, Gutierrez G, Erickson P, Nunez JCB, Voss E, Suryamohan K, Ratan A, Beckerman A, Bergland AO. Genetic Variation in Reproductive Investment Across an Ephemerality Gradient in Daphnia pulex. Mol Biol Evol 2022; 39:msac121. [PMID: 35642301 PMCID: PMC9198359 DOI: 10.1093/molbev/msac121] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Species across the tree of life can switch between asexual and sexual reproduction. In facultatively sexual species, the ability to switch between reproductive modes is often environmentally dependent and subject to local adaptation. However, the ecological and evolutionary factors that influence the maintenance and turnover of polymorphism associated with facultative sex remain unclear. We studied the ecological and evolutionary dynamics of reproductive investment in the facultatively sexual model species, Daphnia pulex. We found that patterns of clonal diversity, but not genetic diversity varied among ponds consistent with the predicted relationship between ephemerality and clonal structure. Reconstruction of a multi-year pedigree demonstrated the coexistence of clones that differ in their investment into male production. Mapping of quantitative variation in male production using lab-generated and field-collected individuals identified multiple putative quantitative trait loci (QTL) underlying this trait, and we identified a plausible candidate gene. The evolutionary history of these QTL suggests that they are relatively young, and male limitation in this system is a rapidly evolving trait. Our work highlights the dynamic nature of the genetic structure and composition of facultative sex across space and time and suggests that quantitative genetic variation in reproductive strategy can undergo rapid evolutionary turnover.
Collapse
Affiliation(s)
- Karen B Barnard-Kubow
- Department of Biology, University of Virginia, Charlottesville, VA, USA
- Department of Biology, James Madison University, Harrisonburg, VA, USA
| | - Dörthe Becker
- Department of Biology, University of Virginia, Charlottesville, VA, USA
- School of Biosciences, Ecology and Evolutionary Biology, University of Sheffield, Sheffield, UK
- Department of Biology, University of Marburg, Marburg, Germany
| | - Connor S Murray
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Robert Porter
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Grace Gutierrez
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | | | - Joaquin C B Nunez
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Erin Voss
- Department of Biology, University of Virginia, Charlottesville, VA, USA
- Department of Integrative Biology, UC Berkeley, Berkeley, CA, USA
| | | | - Aakrosh Ratan
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Andrew Beckerman
- School of Biosciences, Ecology and Evolutionary Biology, University of Sheffield, Sheffield, UK
| | - Alan O Bergland
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
61
|
Zhang M, Wu S, Du S, Qian W, Chen J, Qiao L, Yang Y, Tan J, Yuan Z, Peng Q, Liu Y, Navarro N, Tang K, Ruiz-Linares A, Wang J, Claes P, Jin L, Li J, Wang S. Genetic variants underlying differences in facial morphology in East Asian and European populations. Nat Genet 2022; 54:403-411. [PMID: 35393595 DOI: 10.1038/s41588-022-01038-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 01/19/2022] [Accepted: 02/25/2022] [Indexed: 11/09/2022]
Abstract
Facial morphology-a conspicuous feature of human appearance-is highly heritable. Previous studies on the genetic basis of facial morphology were performed mainly in European-ancestry cohorts (EUR). Applying a data-driven phenotyping and multivariate genome-wide scanning protocol to a large collection of three-dimensional facial images of individuals with East Asian ancestry (EAS), we identified 244 variants in 166 loci (62 new) associated with typical-range facial variation. A newly proposed polygenic shape analysis indicates that the effects of the variants on facial shape in EAS can be generalized to EUR. Based on this, we further identified 13 variants related to differences between facial shape in EUR and EAS populations. Evolutionary analyses suggest that the difference in nose shape between EUR and EAS populations is caused by a directional selection, due mainly to a local adaptation in Europeans. Our results illustrate the underlying genetic basis for facial differences across populations.
Collapse
Affiliation(s)
- Manfei Zhang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China.,CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.,School of Computer Science, Fudan University, Shanghai, China
| | - Sijie Wu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China.,CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.,Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Siyuan Du
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Wei Qian
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China.,CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.,School of Computer Science, Fudan University, Shanghai, China
| | - Jieyi Chen
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China.,Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Lu Qiao
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yajun Yang
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Jingze Tan
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Ziyu Yuan
- Fudan-Taizhou Institute of Health Sciences, Taizhou, China
| | - Qianqian Peng
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yu Liu
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Nicolas Navarro
- Biogéosciences, UMR 6282 CNRS-EPHE, Université Bourgogne Franche-Comté, Dijon, France.,Ecole Pratique des Hautes Etudes, PSL University, Paris, France
| | - Kun Tang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Andrés Ruiz-Linares
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China.,Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China.,Aix-Marseille Université, CNRS, EFS, ADES, Marseille, France.,Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London, UK
| | - Jiucun Wang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China.,Fudan-Taizhou Institute of Health Sciences, Taizhou, China
| | - Peter Claes
- Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium.,Medical Imaging Research Center, UZ Leuven, Leuven, Belgium.,Department of Human Genetics, KU Leuven, Leuven, Belgium.,Murdoch Children's Research Institute, Melbourne, Victoria, Australia
| | - Li Jin
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China. .,CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China. .,Fudan-Taizhou Institute of Health Sciences, Taizhou, China.
| | - Jiarui Li
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China. .,Department of Electrical Engineering, ESAT/PSI, KU Leuven, Leuven, Belgium. .,Medical Imaging Research Center, UZ Leuven, Leuven, Belgium.
| | - Sijia Wang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China. .,CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.
| |
Collapse
|
62
|
Acosta-Uribe J, Aguillón D, Cochran JN, Giraldo M, Madrigal L, Killingsworth BW, Singhal R, Labib S, Alzate D, Velilla L, Moreno S, García GP, Saldarriaga A, Piedrahita F, Hincapié L, López HE, Perumal N, Morelo L, Vallejo D, Solano JM, Reiman EM, Surace EI, Itzcovich T, Allegri R, Sánchez-Valle R, Villegas-Lanau A, White CL, Matallana D, Myers RM, Browning SR, Lopera F, Kosik KS. A neurodegenerative disease landscape of rare mutations in Colombia due to founder effects. Genome Med 2022; 14:27. [PMID: 35260199 PMCID: PMC8902761 DOI: 10.1186/s13073-022-01035-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Accepted: 02/26/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Colombian population, as well as those in other Latin American regions, arose from a recent tri-continental admixture among Native Americans, Spanish invaders, and enslaved Africans, all of whom passed through a population bottleneck due to widespread infectious diseases that left small isolated local settlements. As a result, the current population reflects multiple founder effects derived from diverse ancestries. METHODS We characterized the role of admixture and founder effects on the origination of the mutational landscape that led to neurodegenerative disorders under these historical circumstances. Genomes from 900 Colombian individuals with Alzheimer's disease (AD) [n = 376], frontotemporal lobar degeneration-motor neuron disease continuum (FTLD-MND) [n = 197], early-onset dementia not otherwise specified (EOD) [n = 73], and healthy participants [n = 254] were analyzed. We examined their global and local ancestry proportions and screened this cohort for deleterious variants in disease-causing and risk-conferring genes. RESULTS We identified 21 pathogenic variants in AD-FTLD related genes, and PSEN1 harbored the majority (11 pathogenic variants). Variants were identified from all three continental ancestries. TREM2 heterozygous and homozygous variants were the most common among AD risk genes (102 carriers), a point of interest because the disease risk conferred by these variants differed according to ancestry. Several gene variants that have a known association with MND in European populations had FTLD phenotypes on a Native American haplotype. Consistent with founder effects, identity by descent among carriers of the same variant was frequent. CONCLUSIONS Colombian demography with multiple mini-bottlenecks probably enhanced the detection of founder events and left a proportionally higher frequency of rare variants derived from the ancestral populations. These findings demonstrate the role of genomically defined ancestry in phenotypic disease expression, a phenotypic range of different rare mutations in the same gene, and further emphasize the importance of inclusiveness in genetic studies.
Collapse
Affiliation(s)
- Juliana Acosta-Uribe
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - David Aguillón
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | | | - Margarita Giraldo
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
- Instituto Neurológico de Colombia (INDEC), Medellín, Colombia
| | - Lucía Madrigal
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Bradley W Killingsworth
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Rijul Singhal
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Sarah Labib
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Diana Alzate
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Lina Velilla
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Sonia Moreno
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Gloria P García
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Amanda Saldarriaga
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Francisco Piedrahita
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Liliana Hincapié
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Hugo E López
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Nithesh Perumal
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA
| | - Leonilde Morelo
- Department of Internal Medicine, School of Medicine, Universidad del Sinú, Montería, Colombia
| | - Dionis Vallejo
- Department of Neurology, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Juan Marcos Solano
- Department of Neurology, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | | | - Ezequiel I Surace
- Laboratorio de Enfermedades Neurodegenerativas (Fleni-CONICET), Buenos Aires, Argentina
| | - Tatiana Itzcovich
- Laboratorio de Enfermedades Neurodegenerativas (Fleni-CONICET), Buenos Aires, Argentina
| | - Ricardo Allegri
- Centro de Memoria y Envejecimiento (Fleni-CONICET), Buenos Aires, Argentina
| | - Raquel Sánchez-Valle
- Alzheimer's Disease and Other Cognitive Disorders Unit, Hospital Clínic de Barcelona, IDIBAPS and University of Barcelona, Barcelona, Spain
| | - Andrés Villegas-Lanau
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia
| | - Charles L White
- Neuropathology Section, Department of Pathology, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Diana Matallana
- Instituto de Envejecimiento, Department of Psychiatry, School of Medicine, Pontifical Xaverian University, Bogotá, Colombia
- Department of Mental Health, Hospital Universitario Santa Fe de Bogotá, Bogotá, Colombia
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Sharon R Browning
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Francisco Lopera
- Grupo de Neurociencias de Antioquia, School of Medicine, Universidad de Antioquia, Medellín, Colombia.
| | - Kenneth S Kosik
- Neuroscience Research Institute and Department of Molecular Cellular and Developmental Biology, University of California, Santa Barbara, CA, USA.
| |
Collapse
|
63
|
Wohns AW, Wong Y, Jeffery B, Akbari A, Mallick S, Pinhasi R, Patterson N, Reich D, Kelleher J, McVean G. A unified genealogy of modern and ancient genomes. Science 2022; 375:eabi8264. [PMID: 35201891 PMCID: PMC10027547 DOI: 10.1126/science.abi8264] [Citation(s) in RCA: 40] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The sequencing of modern and ancient genomes from around the world has revolutionized our understanding of human history and evolution. However, the problem of how best to characterize ancestral relationships from the totality of human genomic variation remains unsolved. Here, we address this challenge with nonparametric methods that enable us to infer a unified genealogy of modern and ancient humans. This compact representation of multiple datasets explores the challenges of missing and erroneous data and uses ancient samples to constrain and date relationships. We demonstrate the power of the method to recover relationships between individuals and populations as well as to identify descendants of ancient samples. Finally, we introduce a simple nonparametric estimator of the geographical location of ancestors that recapitulates key events in human history.
Collapse
Affiliation(s)
- Anthony Wilder Wohns
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Yan Wong
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Ben Jeffery
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Ali Akbari
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA 02138, USA
- Department of Genetics, Harvard Medical School; Boston, MA 02115, USA
| | - Swapan Mallick
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Howard Hughes Medical Institute; Boston, MA 02115, USA
| | - Ron Pinhasi
- Department of Evolutionary Anthropology, University of Vienna; 1090 Vienna, Austria
| | - Nick Patterson
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA 02138, USA
- Howard Hughes Medical Institute; Boston, MA 02115, USA
- Department of Genetics, Harvard Medical School; Boston, MA 02115, USA
| | - David Reich
- Broad Institute of MIT and Harvard; Cambridge, MA 02142, USA
- Department of Human Evolutionary Biology, Harvard University; Cambridge, MA 02138, USA
- Howard Hughes Medical Institute; Boston, MA 02115, USA
- Department of Genetics, Harvard Medical School; Boston, MA 02115, USA
| | - Jerome Kelleher
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
| | - Gil McVean
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford; Oxford OX3 7LF, UK
- Corresponding author.
| |
Collapse
|
64
|
Benchmarking phasing software with a whole-genome sequenced cattle pedigree. BMC Genomics 2022; 23:130. [PMID: 35164677 PMCID: PMC8845340 DOI: 10.1186/s12864-022-08354-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 01/24/2022] [Indexed: 12/30/2022] Open
Abstract
Background Accurate haplotype reconstruction is required in many applications in quantitative and population genomics. Different phasing methods are available but their accuracy must be evaluated for samples with different properties (population structure, marker density, etc.). We herein took advantage of whole-genome sequence data available for a Holstein cattle pedigree containing 264 individuals, including 98 trios, to evaluate several population-based phasing methods. This data represents a typical example of a livestock population, with low effective population size, high levels of relatedness and long-range linkage disequilibrium. Results After stringent filtering of our sequence data, we evaluated several population-based phasing programs including one or more versions of AlphaPhase, ShapeIT, Beagle, Eagle and FImpute. To that end we used 98 individuals having both parents sequenced for validation. Their haplotypes reconstructed based on Mendelian segregation rules were considered the gold standard to assess the performance of population-based methods in two scenarios. In the first one, only these 98 individuals were phased, while in the second one, all the 264 sequenced individuals were phased simultaneously, ignoring the pedigree relationships. We assessed phasing accuracy based on switch error counts (SEC) and rates (SER), lengths of correctly phased haplotypes and the probability that there is no phasing error between a pair of SNPs as a function of their distance. For most evaluated metrics or scenarios, the best software was either ShapeIT4.1 or Beagle5.2, both methods resulting in particularly high phasing accuracies. For instance, ShapeIT4.1 achieved a median SEC of 50 per individual and a mean haplotype block length of 24.1 Mb (scenario 2). These statistics are remarkable since the methods were evaluated with a map of 8,400,000 SNPs, and this corresponds to only one switch error every 40,000 phased informative markers. When more relatives were included in the data (scenario 2), FImpute3.0 reconstructed extremely long segments without errors. Conclusions We report extremely high phasing accuracies in a typical livestock sample. ShapeIT4.1 and Beagle5.2 proved to be the most accurate, particularly for phasing long segments and in the first scenario. Nevertheless, most tools achieved high accuracy at short distances and would be suitable for applications requiring only local haplotypes. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08354-6.
Collapse
|
65
|
Deng L, Pan Y, Wang Y, Chen H, Yuan K, Chen S, Lu D, Lu Y, Mokhtar SS, Rahman TA, Hoh BP, Xu S. Genetic Connections and Convergent Evolution of Tropical Indigenous Peoples in Asia. Mol Biol Evol 2022; 39:msab361. [PMID: 34940850 PMCID: PMC8826522 DOI: 10.1093/molbev/msab361] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Tropical indigenous peoples in Asia (TIA) attract much attention for their unique appearance, whereas their genetic history and adaptive evolution remain mysteries. We conducted a comprehensive study to characterize the genetic distinction and connection of broad geographical TIAs. Despite the diverse genetic makeup and large interarea genetic differentiation between the TIA groups, we identified a basal Asian ancestry (bASN) specifically shared by these populations. The bASN ancestry was relatively enriched in ancient Asian human genomes dated as early as ∼50,000 years before the present and diminished in more recent history. Notably, the bASN ancestry is unlikely to be derived from archaic hominins. Instead, we suggest it may be better modeled as a survived lineage of the initial peopling of Asia. Shared adaptations inherited from the ancient Asian ancestry were detected among the TIA groups (e.g., LIMS1 for hair morphology, and COL24A1 for bone formation), and they are enriched in neurological functions either at an identical locus (e.g., NKAIN3), or different loci in an identical gene (e.g., TENM4). The bASN ancestry could also have formed the substrate of the genetic architecture of the dark pigmentation observed in the TIA peoples. We hypothesize that phenotypic convergence of the dark pigmentation in TIAs could have resulted from parallel (e.g., DDB1/DAK) or genetic convergence driven by admixture (e.g., MTHFD1 and RAD18), new mutations (e.g., STK11), or notably purifying selection (e.g., MC1R). Our results provide new insights into the initial peopling of Asia and an advanced understanding of the phenotypic convergence of the TIA peoples.
Collapse
Affiliation(s)
- Lian Deng
- State Key Laboratory of Genetic Engineering, Center for Evolutionary Biology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Yuwen Pan
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yinan Wang
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Hao Chen
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Kai Yuan
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Sihan Chen
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
| | - Dongsheng Lu
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Yan Lu
- State Key Laboratory of Genetic Engineering, Center for Evolutionary Biology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Siti Shuhada Mokhtar
- Institute of Medical Molecular Biotechnology, Faculty of Medicine, Universiti Teknologi MARA, Sungai Buloh Campus, Sungai Buloh, Selangor, Malaysia
| | - Thuhairah Abdul Rahman
- Clinical Pathology Diagnostic Centre Research Laboratory, Faculty of Medicine, Universiti Teknologi MARA, Sungai Buloh Campus, Sungai Buloh, Selangor, Malaysia
| | - Boon-Peng Hoh
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Faculty of Medicine and Health Sciences, UCSI University, Cheras, Kuala Lumpur, Malaysia
| | - Shuhua Xu
- State Key Laboratory of Genetic Engineering, Center for Evolutionary Biology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
- Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Department of Liver Surgery and Transplantation Liver Cancer Institute, Zhongshan Hospital, Fudan University, Shanghai, China
- Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China
- Jiangsu Key Laboratory of Phylogenomics and Comparative Genomics, School of Life Sciences, Jiangsu Normal University, Xuzhou, China
- Henan Institute of Medical and Pharmaceutical Sciences, Zhengzhou University, Zhengzhou, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
66
|
Li B, Kamarck ML, Peng Q, Lim FL, Keller A, Smeets MAM, Mainland JD, Wang S. From musk to body odor: Decoding olfaction through genetic variation. PLoS Genet 2022; 18:e1009564. [PMID: 35113854 PMCID: PMC8812863 DOI: 10.1371/journal.pgen.1009564] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 12/01/2021] [Indexed: 12/30/2022] Open
Abstract
The olfactory system combines input from multiple receptor types to represent odor information, but there are few explicit examples relating olfactory receptor (OR) activity patterns to odor perception. To uncover these relationships, we performed genome-wide scans on odor-perception phenotypes for ten odors in 1000 Han Chinese and validated results for six of these odors in an ethnically diverse population (n = 364). In both populations, consistent with previous studies, we replicated three previously reported associations (β-ionone/OR5A1, androstenone/OR7D4, cis-3-hexen-1-ol/OR2J3 LD-band), but not for odors containing aldehydes, suggesting that olfactory phenotype/genotype studies are robust across populations. Two novel associations between an OR and odor perception contribute to our understanding of olfactory coding. First, we found a SNP in OR51B2 that associated with trans-3-methyl-2-hexenoic acid, a key component of human underarm odor. Second, we found two linked SNPs associated with the musk Galaxolide in a novel musk receptor, OR4D6, which is also the first human OR shown to drive specific anosmia to a musk compound. We noticed that SNPs detected for odor intensity were enriched with amino acid substitutions, implying functional changes of odor receptors. Furthermore, we also found that the derived alleles of the SNPs tend to be associated with reduced odor intensity, supporting the hypothesis that the primate olfactory gene repertoire has degenerated over time. This study provides information about coding for human body odor, and gives us insight into broader mechanisms of olfactory coding, such as how differential OR activation can converge on a similar percept.
Collapse
Affiliation(s)
- Bingjie Li
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Department of Skin and Cosmetics Research, Shanghai Skin Disease Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Marissa L. Kamarck
- Monell Chemical Senses Center, Philadelphia, Pennsylvania, United States of America
- Department of Neuroscience, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Qianqian Peng
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Fei-Ling Lim
- Unilever Research & Development, Colworth, United Kingdom
| | - Andreas Keller
- Laboratory of Neurogenetics and Behavior, The Rockefeller University, New York, New York State, United States of America
| | | | - Joel D. Mainland
- Monell Chemical Senses Center, Philadelphia, Pennsylvania, United States of America
- Department of Neuroscience, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Sijia Wang
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
67
|
Hench K, Helmkampf M, McMillan WO, Puebla O. Rapid radiation in a highly diverse marine environment. Proc Natl Acad Sci U S A 2022; 119:e2020457119. [PMID: 35042790 PMCID: PMC8794831 DOI: 10.1073/pnas.2020457119] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 11/18/2021] [Indexed: 12/16/2022] Open
Abstract
Rapid diversification is often observed when founding species invade isolated or newly formed habitats that provide ecological opportunity for adaptive radiation. However, most of the Earth's diversity arose in diverse environments where ecological opportunities appear to be more constrained. Here, we present a striking example of a rapid radiation in a highly diverse marine habitat. The hamlets, a group of reef fishes from the wider Caribbean, have radiated into a stunning diversity of color patterns but show low divergence across other ecological axes. Although the hamlet lineage is ∼26 My old, the radiation appears to have occurred within the last 10,000 generations in a burst of diversification that ranks among the fastest in fishes. As such, the hamlets provide a compelling backdrop to uncover the genomic elements associated with phenotypic diversification and an excellent opportunity to build a broader comparative framework for understanding the drivers of adaptive radiation. The analysis of 170 genomes suggests that color pattern diversity is generated by different combinations of alleles at a few large-effect loci. Such a modular genomic architecture of diversification has been documented before in Heliconius butterflies, capuchino finches, and munia finches, three other tropical radiations that took place in highly diverse and complex environments. The hamlet radiation also occurred in a context of high effective population size, which is typical of marine populations. This allows for the accumulation of new variants through mutation and the retention of ancestral genetic variation, both of which appear to be important in this radiation.
Collapse
Affiliation(s)
- Kosmas Hench
- Ecology Department, Leibniz Centre for Tropical Marine Research, 28359 Bremen, Germany;
| | - Martin Helmkampf
- Ecology Department, Leibniz Centre for Tropical Marine Research, 28359 Bremen, Germany
| | - W Owen McMillan
- Smithsonian Tropical Research Institute, Apartado Postal 0843-03092, Republic of Panama
| | - Oscar Puebla
- Ecology Department, Leibniz Centre for Tropical Marine Research, 28359 Bremen, Germany;
- Smithsonian Tropical Research Institute, Apartado Postal 0843-03092, Republic of Panama
- Institute for Chemistry and Biology of the Marine Environment, 26111 Oldenburg, Germany
- Marine Evolutionary Ecology, GEOMAR Helmholtz Centre for Ocean Research Kiel, 24105 Kiel, Germany
| |
Collapse
|
68
|
Colomer-Vilaplana A, Murga-Moreno J, Canalda-Baltrons A, Inserte C, Soto D, Coronado-Zamora M, Barbadilla A, Casillas S. PopHumanVar: an interactive application for the functional characterization and prioritization of adaptive genomic variants in humans. Nucleic Acids Res 2022; 50:D1069-D1076. [PMID: 34664660 PMCID: PMC8728255 DOI: 10.1093/nar/gkab925] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Revised: 09/17/2021] [Accepted: 09/28/2021] [Indexed: 12/22/2022] Open
Abstract
Adaptive challenges that humans faced as they expanded across the globe left specific molecular footprints that can be decoded in our today's genomes. Different sets of metrics are used to identify genomic regions that have undergone selection. However, there are fewer methods capable of pinpointing the allele ultimately responsible for this selection. Here, we present PopHumanVar, an interactive online application that is designed to facilitate the exploration and thorough analysis of candidate genomic regions by integrating both functional and population genomics data currently available. PopHumanVar generates useful summary reports of prioritized variants that are putatively causal of recent selective sweeps. It compiles data and graphically represents different layers of information, including natural selection statistics, as well as functional annotations and genealogical estimations of variant age, for biallelic single nucleotide variants (SNVs) of the 1000 Genomes Project phase 3. Specifically, PopHumanVar amasses SNV-based information from GEVA, SnpEFF, GWAS Catalog, ClinVar, RegulomeDB and DisGeNET databases, as well as accurate estimations of iHS, nSL and iSAFE statistics. Notably, PopHumanVar can successfully identify known causal variants of frequently reported candidate selection regions, including EDAR in East-Asians, ACKR1 (DARC) in Africans and LCT/MCM6 in Europeans. PopHumanVar is open and freely available at https://pophumanvar.uab.cat.
Collapse
Affiliation(s)
- Aina Colomer-Vilaplana
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Jesús Murga-Moreno
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Aleix Canalda-Baltrons
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Clara Inserte
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Daniel Soto
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Marta Coronado-Zamora
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| |
Collapse
|
69
|
Horvath R, Menon M, Stitzer M, Ross-Ibarra J. OUP accepted manuscript. Genome Biol Evol 2022; 14:6519160. [PMID: 35104327 PMCID: PMC8872973 DOI: 10.1093/gbe/evac016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/22/2022] [Indexed: 11/23/2022] Open
Abstract
Recognition of the important role of transposable elements (TEs) in eukaryotic genomes quickly led to a burgeoning literature modeling and estimating the effects of selection on TEs. Much of the empirical work on selection has focused on analyzing the site frequency spectrum (SFS) of TEs. But TE evolution differs from standard models in a number of ways that can impact the power and interpretation of the SFS. For example, rather than mutating under a clock-like model, transposition often occurs in bursts which can inflate particular frequency categories compared with expectations under a standard neutral model. If a TE burst has been recent, the excess of low-frequency polymorphisms can mimic the effect of purifying selection. Here, we investigate how transposition bursts affect the frequency distribution of TEs and the correlation between age and allele frequency. Using information on the TE age distribution, we propose an age-adjusted SFS to compare TEs and neutral polymorphisms to more effectively evaluate whether TEs are under selective constraints. We show that our approach can minimize instances of false inference of selective constraint, remains robust to simple demographic changes, and allows for a correct identification of even weak selection affecting TEs which experienced a transposition burst. The results presented here will help researchers working on TEs to more reliably identify the effects of selection on TEs without having to rely on the assumption of a constant transposition rate.
Collapse
Affiliation(s)
- Robert Horvath
- Department of Evolution and Ecology, University of California, Davis, USA
- Corresponding authors: E-mails: ;
| | - Mitra Menon
- Department of Evolution and Ecology, University of California, Davis, USA
- Center for Population Biology, University of California, Davis, USA
| | - Michelle Stitzer
- Institute for Genomic Diversity and Department of Molecular Biology and Genetics, Cornell University, USA
| | - Jeffrey Ross-Ibarra
- Department of Evolution and Ecology, University of California, Davis, USA
- Center for Population Biology, University of California, Davis, USA
- Genome Center, University of California, Davis, USA
- Corresponding authors: E-mails: ;
| |
Collapse
|
70
|
Hayakawa T, Terahara M, Fujito NT, Matsunaga T, Teshima KM, Hane M, Kitajima K, Sato C, Takahata N, Satta Y. Lower promoter activity of the ST8SIA2 gene has been favored in evolving human collective brains. PLoS One 2021; 16:e0259897. [PMID: 34914745 PMCID: PMC8675693 DOI: 10.1371/journal.pone.0259897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 10/28/2021] [Indexed: 11/18/2022] Open
Abstract
ST8SIA2 is an important molecule regulating expression of the phenotype involved in schizophrenia. Lowered promoter activity of the ST8SIA2 gene is considered to be protective against schizophrenia by conferring tolerance to psychosocial stress. Here, we examined the promoter-type composition of anatomically modern humans (AMHs) and archaic humans (AHs; Neanderthals and Denisovans), and compared the promoter activity at the population level (population promoter activity; PPA) between them. In AMHs, the TCT-type, showing the second lowest promoter activity, was most prevalent in the ancestral population of non-Africans. However, the detection of only the CGT-type from AH samples and recombination tracts in AH sequences showed that the CGT- and TGT-types, exhibiting the two highest promoter activities, were common in AH populations. Furthermore, interspecies gene flow occurred into AMHs from AHs and into Denisovans from Neanderthals, influencing promoter-type compositions independently in both AMHs and AHs. The difference of promoter-type composition makes PPA unique in each population. East and Southeast Asian populations show the lowest PPA. This results from the selective increase of the CGC-type, showing the lowest promoter activity, in these populations. Every non-African population shows significantly lower PPA than African populations, resulting from the TCT-type having the highest prevalence in the ancestral population of non-Africans. In addition, PPA reduction is also found among subpopulations within Africa via a slight increase of the TCT-type. These findings indicate a trend toward lower PPA in the spread of AMHs, interpreted as a continuous adaptation to psychosocial stress arising in migration. This trend is considered as genetic tuning for the evolution of collective brains. The inferred promoter-type composition of AHs differed markedly from that of AMHs, resulting in higher PPA in AHs than in AMHs. This suggests that the trend toward lower PPA is a unique feature in AMH spread.
Collapse
Affiliation(s)
- Toshiyuki Hayakawa
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka, Japan
- Faculty of Arts and Science, Kyushu University, Fukuoka, Japan
- * E-mail:
| | - Masahiro Terahara
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka, Japan
| | - Naoko T. Fujito
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Kanagawa, Japan
| | - Takumi Matsunaga
- Graduate School of Systems Life Sciences, Kyushu University, Fukuoka, Japan
| | | | - Masaya Hane
- Bioscience and Biotechnology Center, Nagoya University, Nagoya, Aichi, Japan
| | - Ken Kitajima
- Institute for Glyco-core Research (iGCORE), Nagoya University, Nagoya, Aichi, Japan
| | - Chihiro Sato
- Institute for Glyco-core Research (iGCORE), Nagoya University, Nagoya, Aichi, Japan
| | - Naoyuki Takahata
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Kanagawa, Japan
| | - Yoko Satta
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Kanagawa, Japan
| |
Collapse
|
71
|
Hateley S, Lopez-Izquierdo A, Jou CJ, Cho S, Schraiber JG, Song S, Maguire CT, Torres N, Riedel M, Bowles NE, Arrington CB, Kennedy BJ, Etheridge SP, Lai S, Pribble C, Meyers L, Lundahl D, Byrnes J, Granka JM, Kauffman CA, Lemmon G, Boyden S, Scott Watkins W, Karren MA, Knight S, Brent Muhlestein J, Carlquist JF, Anderson JL, Chahine KG, Shah KU, Ball CA, Benjamin IJ, Yandell M, Tristani-Firouzi M. The history and geographic distribution of a KCNQ1 atrial fibrillation risk allele. Nat Commun 2021; 12:6442. [PMID: 34750360 PMCID: PMC8575962 DOI: 10.1038/s41467-021-26741-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 10/20/2021] [Indexed: 11/08/2022] Open
Abstract
The genetic architecture of atrial fibrillation (AF) encompasses low impact, common genetic variants and high impact, rare variants. Here, we characterize a high impact AF-susceptibility allele, KCNQ1 R231H, and describe its transcontinental geographic distribution and history. Induced pluripotent stem cell-derived cardiomyocytes procured from risk allele carriers exhibit abbreviated action potential duration, consistent with a gain-of-function effect. Using identity-by-descent (IBD) networks, we estimate the broad- and fine-scale population ancestry of risk allele carriers and their relatives. Analysis of ancestral migration routes reveals ancestors who inhabited Denmark in the 1700s, migrated to the Northeastern United States in the early 1800s, and traveled across the Midwest to arrive in Utah in the late 1800s. IBD/coalescent-based allele dating analysis reveals a relatively recent origin of the AF risk allele (~5000 years). Thus, our approach broadens the scope of study for disease susceptibility alleles to the context of human migration and ancestral origins.
Collapse
Affiliation(s)
| | | | - Chuanchau J Jou
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA
- Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Scott Cho
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA
| | | | | | - Colin T Maguire
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Natalia Torres
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Michael Riedel
- Cardiovascular Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Neil E Bowles
- Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Cammon B Arrington
- Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Brett J Kennedy
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Susan P Etheridge
- Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Shuping Lai
- Cardiovascular Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Chase Pribble
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Lindsay Meyers
- Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Derek Lundahl
- Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, USA
| | | | | | - Christopher A Kauffman
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Gordon Lemmon
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Steven Boyden
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - W Scott Watkins
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Mary Anne Karren
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | | | | | | | | | | | - Khushi U Shah
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA
| | | | - Ivor J Benjamin
- Cardiovascular Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Mark Yandell
- Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Martin Tristani-Firouzi
- Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, USA.
- Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, USA.
| |
Collapse
|
72
|
Saitou M, Masuda N, Gokcumen O. Similarity-based analysis of allele frequency distribution among multiple populations identifies adaptive genomic structural variants. Mol Biol Evol 2021; 39:6413645. [PMID: 34718708 PMCID: PMC8896759 DOI: 10.1093/molbev/msab313] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Structural variants have a considerable impact on human genomic diversity. However, their evolutionary history remains mostly unexplored. Here, we developed a new method to identify potentially adaptive structural variants based on a similarity-based analysis that incorporates genotype frequency data from 26 populations simultaneously. Using this method, we analyzed 57,629 structural variants and identified 576 structural variants that show unusual population differentiation. Of these putatively adaptive structural variants, we further showed that 24 variants are multiallelic and overlap with coding sequences, and 20 variants are significantly associated with GWAS traits. Closer inspection of the haplotypic variation associated with these putatively adaptive and functional structural variants reveals deviations from neutral expectations due to: 1) population differentiation of rapidly evolving multiallelic variants, 2) incomplete sweeps, and 3) recent population-specific negative selection. Overall, our study provides new methodological insights, documents hundreds of putatively adaptive variants, and introduces evolutionary models that may better explain the complex evolution of structural variants.
Collapse
Affiliation(s)
- Marie Saitou
- Dept. of Biological Sciences, University at Buffalo, State University of New York, Buffalo, NY 14260-2900, USA.,Currently at the Faculty of Biosciences, Norwegian University of Life Sciences, Universitetstunet 3, 1430 Ås, Norway.,Dept. of Medicine, The University of Chicago. Section of Genetic Medicine, 5841 S. Maryland Ave., Chicago, IL, 60637-1447, USA
| | - Naoki Masuda
- Department of Mathematics, University at Buffalo, State University of New York, Buffalo, NY 14260-2900, USA.,Computational and Data-Enabled Science and Engineering Program, University at Buffalo, State University of New York, Buffalo, NY 14260-5030, USA
| | - Omer Gokcumen
- Dept. of Biological Sciences, University at Buffalo, State University of New York, Buffalo, NY 14260-2900, USA
| |
Collapse
|
73
|
Estimating the age of single nucleotide polymorphic sites in humans. Genes Genomics 2021; 43:1179-1188. [PMID: 34245420 DOI: 10.1007/s13258-021-01135-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Accepted: 06/28/2021] [Indexed: 10/20/2022]
Abstract
BACKGROUND Determining the ages of polymorphic sites in human genomes needs to be carried out in a careful balance between the degree of complexity of the approach and the desired accuracy. OBJECTIVE We provide evidence that a simpler approach where age determination is based upon the degree to which the alternative allele is spread among the population can be competitive with more complex methods. METHODS The information contained in the vcf files of Phase 1 of the 1000 Genomes Project combined with the genomic sequences of seven non-human primate species was analyzed. The analyses were supplemented by a computer simulation of the mutational changes in 10,000 model chromosomes with a length of 10,000 nucleotides over a period of 16 million years. The list of the birth dates of the derived alleles of homozygous and heterozygous components of the derived alleles served as a yardstick to estimate the ages of human alternative alleles. RESULTS The age of the derived alleles born in Africa before the "Out of Africa" event and not brought to other continents are estimated to be 0.17 Ma, the derived alleles present simultaneously on all continents are estimated to be 1.3 Ma old and the age of alleles arising in Europe or Asia is 0.06 Ma. CONCLUSION Our approach functions with polymorphisms that respect the "more frequent means older" principle. However, this shortcoming only leads to disagreement with the Atlas of Variant Age in about 20% of cases.
Collapse
|
74
|
Excofffier L, Marchi N, Marques DA, Matthey-Doret R, Gouy A, Sousa VC. fastsimcoal2: demographic inference under complex evolutionary scenarios. Bioinformatics 2021; 37:4882-4885. [PMID: 34164653 PMCID: PMC8665742 DOI: 10.1093/bioinformatics/btab468] [Citation(s) in RCA: 104] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 06/11/2021] [Accepted: 06/22/2021] [Indexed: 01/25/2023] Open
Abstract
Motivation fastsimcoal2 extends fastsimcoal, a continuous time coalescent-based genetic simulation program, by enabling the estimation of demographic parameters under very complex scenarios from the site frequency spectrum under a maximum-likelihood framework. Results Other improvements include multi-threading, handling of population inbreeding, extended input file syntax facilitating the description of complex demographic scenarios, and more efficient simulations of sparsely structured populations and of large chromosomes. Availability and implementation fastsimcoal2 is freely available on http://cmpg.unibe.ch/software/fastsimcoal2/. It includes console versions for Linux, Windows and MacOS, additional scripts for the analysis and visualization of simulated and estimated scenarios, as well as a detailed documentation and ready-to-use examples.
Collapse
Affiliation(s)
- Laurent Excofffier
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland.,Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Nina Marchi
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland.,Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - David Alexander Marques
- Life Science Division, Natural History Museum Basel, 4051 Basel, Switzerland.,Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland.,Department of Fish Ecology and Evolution, EAWAG swiss Federal institute of Aquatic Science and Technology, Center for Ecology, Evolution and Biogeochemistry, 6047 Kastanienbaum, Switzerland
| | - Remi Matthey-Doret
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland.,Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Alexandre Gouy
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland.,Gouy Data Consulting, 1026 Denges, Switzerland
| | - Vitor C Sousa
- Computational and Molecular Population Genetics Lab, Institute of Ecology and Evolution, University of Bern, 3012 Bern, Switzerland.,cE3c - Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências da Universidade de Lisboa, University of Lisbon, Campo Grande, 1749-016, Lisbon, Portugal
| |
Collapse
|
75
|
Bohutínská M, Vlček J, Yair S, Laenen B, Konečná V, Fracassetti M, Slotte T, Kolář F. Genomic basis of parallel adaptation varies with divergence in Arabidopsis and its relatives. Proc Natl Acad Sci U S A 2021; 118:e2022713118. [PMID: 34001609 PMCID: PMC8166048 DOI: 10.1073/pnas.2022713118] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Parallel adaptation provides valuable insight into the predictability of evolutionary change through replicated natural experiments. A steadily increasing number of studies have demonstrated genomic parallelism, yet the magnitude of this parallelism varies depending on whether populations, species, or genera are compared. This led us to hypothesize that the magnitude of genomic parallelism scales with genetic divergence between lineages, but whether this is the case and the underlying evolutionary processes remain unknown. Here, we resequenced seven parallel lineages of two Arabidopsis species, which repeatedly adapted to challenging alpine environments. By combining genome-wide divergence scans with model-based approaches, we detected a suite of 151 genes that show parallel signatures of positive selection associated with alpine colonization, involved in response to cold, high radiation, short season, herbivores, and pathogens. We complemented these parallel candidates with published gene lists from five additional alpine Brassicaceae and tested our hypothesis on a broad scale spanning ∼0.02 to 18 My of divergence. Indeed, we found quantitatively variable genomic parallelism whose extent significantly decreased with increasing divergence between the compared lineages. We further modeled parallel evolution over the Arabidopsis candidate genes and showed that a decreasing probability of repeated selection on the same standing or introgressed alleles drives the observed pattern of divergence-dependent parallelism. We therefore conclude that genetic divergence between populations, species, and genera, affecting the pool of shared variants, is an important factor in the predictability of genome evolution.
Collapse
Affiliation(s)
- Magdalena Bohutínská
- Department of Botany, Faculty of Science, Charles University, 128 01 Prague, Czech Republic;
- Institute of Botany, Czech Academy of Sciences, 252 43 Průhonice, Czech Republic
| | - Jakub Vlček
- Department of Botany, Faculty of Science, Charles University, 128 01 Prague, Czech Republic
- Biology Centre, Czech Academy of Sciences, 370 05 České Budějovice, Czech Republic
- Department of Zoology, Faculty of Science, University of South Bohemia, 370 05 České Budějovice, Czech Republic
| | - Sivan Yair
- Center for Population Biology, University of California, Davis, CA 95616
| | - Benjamin Laenen
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden
| | - Veronika Konečná
- Department of Botany, Faculty of Science, Charles University, 128 01 Prague, Czech Republic
- Institute of Botany, Czech Academy of Sciences, 252 43 Průhonice, Czech Republic
| | - Marco Fracassetti
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden
| | - Tanja Slotte
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, SE-106 91 Stockholm, Sweden
| | - Filip Kolář
- Department of Botany, Faculty of Science, Charles University, 128 01 Prague, Czech Republic;
- Institute of Botany, Czech Academy of Sciences, 252 43 Průhonice, Czech Republic
| |
Collapse
|
76
|
Trigo BB, Utsunomiya ATH, Fortunato AAAD, Milanesi M, Torrecilha RBP, Lamb H, Nguyen L, Ross EM, Hayes B, Padula RCM, Sussai TS, Zavarez LB, Cipriano RS, Caminhas MMT, Lopes FL, Pelle C, Leeb T, Bannasch D, Bickhart D, Smith TPL, Sonstegard TS, Garcia JF, Utsunomiya YT. Variants at the ASIP locus contribute to coat color darkening in Nellore cattle. Genet Sel Evol 2021; 53:40. [PMID: 33910501 PMCID: PMC8082809 DOI: 10.1186/s12711-021-00633-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 04/16/2021] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Nellore cattle (Bos indicus) are well-known for their adaptation to warm and humid environments. Hair length and coat color may impact heat tolerance. The Nellore breed has been strongly selected for white coat, but bulls generally exhibit darker hair ranging from light grey to black on the head, neck, hump, and knees. Given the potential contribution of coat color variation to the adaptation of cattle populations to tropical and sub-tropical environments, our aim was to map positional and functional candidate genetic variants associated with darkness of hair coat (DHC) in Nellore bulls. RESULTS We performed a genome-wide association study (GWAS) for DHC using data from 432 Nellore bulls that were genotyped for more than 777 k single nucleotide polymorphism (SNP) markers. A single major association signal was detected in the vicinity of the agouti signaling protein gene (ASIP). The analysis of whole-genome sequence (WGS) data from 21 bulls revealed functional variants that are associated with DHC, including a structural rearrangement involving ASIP (ASIP-SV1). We further characterized this structural variant using Oxford Nanopore sequencing data from 13 Australian Brahman heifers, which share ancestry with Nellore cattle; we found that this variant originates from a 1155-bp deletion followed by an insertion of a transposable element of more than 150 bp that may impact the recruitment of ASIP non-coding exons. CONCLUSIONS Our results indicate that the variant ASIP sequence causes darker coat pigmentation on specific parts of the body, most likely through a decreased expression of ASIP and consequently an increased production of eumelanin.
Collapse
Affiliation(s)
- Beatriz B Trigo
- School of Veterinary Medicine, Araçatuba, Department of Production and Animal Health, São Paulo State University (Unesp), Araçatuba, SP, Brazil.,International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil
| | - Adam T H Utsunomiya
- International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil.,AgroPartners Consulting, R. Floriano Peixoto, 120-Sala 43a-Centro, Araçatuba, SP, 16010-220, Brazil
| | - Alvaro A A D Fortunato
- School of Veterinary Medicine, Araçatuba, Department of Production and Animal Health, São Paulo State University (Unesp), Araçatuba, SP, Brazil.,International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil.,Personal-PEC, R. Sebastião Lima, 1336-Centro, Campo Grande, MS, 79004-600, Brazil
| | - Marco Milanesi
- School of Veterinary Medicine, Araçatuba, Department of Production and Animal Health, São Paulo State University (Unesp), Araçatuba, SP, Brazil.,International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil.,AgroPartners Consulting, R. Floriano Peixoto, 120-Sala 43a-Centro, Araçatuba, SP, 16010-220, Brazil
| | - Rafaela B P Torrecilha
- International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil.,AgroPartners Consulting, R. Floriano Peixoto, 120-Sala 43a-Centro, Araçatuba, SP, 16010-220, Brazil
| | - Harrison Lamb
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, Australia
| | - Loan Nguyen
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, Australia
| | - Elizabeth M Ross
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, Australia
| | - Ben Hayes
- Centre for Animal Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, Australia
| | | | - Thayla S Sussai
- Centro Universitário Católico Salesiano, Araçatuba, SP, Brazil
| | - Ludmilla B Zavarez
- International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil
| | | | - Maria M T Caminhas
- School of Veterinary Medicine, Araçatuba, Department of Production and Animal Health, São Paulo State University (Unesp), Araçatuba, SP, Brazil
| | - Flavia L Lopes
- School of Veterinary Medicine, Araçatuba, Department of Production and Animal Health, São Paulo State University (Unesp), Araçatuba, SP, Brazil
| | | | - Tosso Leeb
- Institute of Genetics, Vetsuisse-Faculty, University of Bern, Bremgartenstrasse 109A, 3012, Bern, Switzerland.,Dermfocus, University of Bern, Bremgartenstrasse 109A, 3012, Bern, Switzerland
| | - Danika Bannasch
- Institute of Genetics, Vetsuisse-Faculty, University of Bern, Bremgartenstrasse 109A, 3012, Bern, Switzerland.,Department of Population Health and Reproduction, School of Veterinary Medicine, University of California, Davis, CA, 95616, USA
| | - Derek Bickhart
- Dairy Forage Research Center, USDA-ARS, 1925 Linden Drive, Madison, WI, 53706, USA
| | - Timothy P L Smith
- US. Meat Animal Research Center, USDA-ARS, 844 Road 313, Clay Center, NE, 68933, USA
| | | | - José F Garcia
- School of Veterinary Medicine, Araçatuba, Department of Production and Animal Health, São Paulo State University (Unesp), Araçatuba, SP, Brazil.,International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil.,AgroPartners Consulting, R. Floriano Peixoto, 120-Sala 43a-Centro, Araçatuba, SP, 16010-220, Brazil.,School of Agriculture and Veterinarian Sciences, Jaboticabal, Department of Preventive Veterinary Medicine and Animal Reproduction, São Paulo State University (Unesp), Jaboticabal, SP, Brazil
| | - Yuri T Utsunomiya
- School of Veterinary Medicine, Araçatuba, Department of Production and Animal Health, São Paulo State University (Unesp), Araçatuba, SP, Brazil. .,International Atomic Energy Agency (IAEA) Collaborating Centre On Animal Genomics and Bioinformatics, Araçatuba, SP, Brazil. .,AgroPartners Consulting, R. Floriano Peixoto, 120-Sala 43a-Centro, Araçatuba, SP, 16010-220, Brazil.
| |
Collapse
|
77
|
Suryamohan K, Diwanji D, Stawiski EW, Gupta R, Miersch S, Liu J, Chen C, Jiang YP, Fellouse FA, Sathirapongsasuti JF, Albers PK, Deepak T, Saberianfar R, Ratan A, Washburn G, Mis M, Santhosh D, Somasekar S, Hiranjith GH, Vargas D, Mohan S, Phalke S, Kuriakose B, Antony A, Ustav M, Schuster SC, Sidhu S, Junutula JR, Jura N, Seshagiri S. Human ACE2 receptor polymorphisms and altered susceptibility to SARS-CoV-2. Commun Biol 2021; 4:475. [PMID: 33846513 PMCID: PMC8041869 DOI: 10.1038/s42003-021-02030-3] [Citation(s) in RCA: 105] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 03/23/2021] [Indexed: 01/07/2023] Open
Abstract
COVID-19 is a respiratory illness caused by a novel coronavirus called SARS-CoV-2. The viral spike (S) protein engages the human angiotensin-converting enzyme 2 (ACE2) receptor to invade host cells with ~10-15-fold higher affinity compared to SARS-CoV S-protein, making it highly infectious. Here, we assessed if ACE2 polymorphisms can alter host susceptibility to SARS-CoV-2 by affecting this interaction. We analyzed over 290,000 samples representing >400 population groups from public genomic datasets and identified multiple ACE2 protein-altering variants. Using reported structural data, we identified natural ACE2 variants that could potentially affect virus-host interaction and thereby alter host susceptibility. These include variants S19P, I21V, E23K, K26R, T27A, N64K, T92I, Q102P and H378R that were predicted to increase susceptibility, while variants K31R, N33I, H34R, E35K, E37K, D38V, Y50F, N51S, M62V, K68E, F72V, Y83H, G326E, G352V, D355N, Q388L and D509Y were predicted to be protective variants that show decreased binding to S-protein. Using biochemical assays, we confirmed that K31R and E37K had decreased affinity, and K26R and T92I variants showed increased affinity for S-protein when compared to wildtype ACE2. Consistent with this, soluble ACE2 K26R and T92I were more effective in blocking entry of S-protein pseudotyped virus suggesting that ACE2 variants can modulate susceptibility to SARS-CoV-2.
Collapse
Affiliation(s)
- Kushal Suryamohan
- Research and Development Department, MedGenome Inc, Foster City, CA, USA
| | - Devan Diwanji
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, CA, USA
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA, USA
| | - Eric W Stawiski
- Research and Development Department, MedGenome Inc, Foster City, CA, USA
| | - Ravi Gupta
- MedGenome Labs Ltd., Bangalore, Karnataka, India
| | - Shane Miersch
- Department of Molecular Genetics, and the Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | - Jiang Liu
- ModMab Therapeutics, Foster City, CA, USA
| | - Chao Chen
- Department of Molecular Genetics, and the Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | | | - Frederic A Fellouse
- ModMab Therapeutics, Accelerator for Donnelly Collaboration, University of Toronto, Toronto, ON, Canada
| | | | | | | | - Reza Saberianfar
- ModMab Therapeutics, Accelerator for Donnelly Collaboration, University of Toronto, Toronto, ON, Canada
| | - Aakrosh Ratan
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Gavin Washburn
- Research and Development Department, MedGenome Inc, Foster City, CA, USA
| | - Monika Mis
- Research and Development Department, MedGenome Inc, Foster City, CA, USA
| | | | | | - G H Hiranjith
- Research and Development Department, MedGenome Inc, Foster City, CA, USA
| | - Derek Vargas
- Research and Development Department, MedGenome Inc, Foster City, CA, USA
| | - Sangeetha Mohan
- Department of Molecular Biology, SciGenom Labs Pvt Ltd, Kochi, Kerala, India
| | - Sameer Phalke
- Department of Molecular Biology, SciGenom Labs Pvt Ltd, Kochi, Kerala, India
| | | | - Aju Antony
- Department of Molecular Biology, SciGenom Labs Pvt Ltd, Kochi, Kerala, India
| | - Mart Ustav
- Department of Molecular Genetics, and the Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | - Stephan C Schuster
- Singapore Centre for Environmental Life Sciences Engineering, Nanyang Technological University, Singapore, Singapore
| | - Sachdev Sidhu
- Department of Molecular Genetics, and the Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | | | - Natalia Jura
- Cardiovascular Research Institute, University of California San Francisco, San Francisco, CA, USA.
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, CA, USA.
| | - Somasekar Seshagiri
- ModMab Therapeutics, Foster City, CA, USA.
- SciGenom Research Foundation, Bangalore, Karnataka, India.
| |
Collapse
|
78
|
Tilot AK, Khramtsova EA, Liang D, Grasby KL, Jahanshad N, Painter J, Colodro-Conde L, Bralten J, Hibar DP, Lind PA, Liu S, Brotman SM, Thompson PM, Medland SE, Macciardi F, Stranger BE, Davis LK, Fisher SE, Stein JL. The Evolutionary History of Common Genetic Variants Influencing Human Cortical Surface Area. Cereb Cortex 2021; 31:1873-1887. [PMID: 33290510 PMCID: PMC7945014 DOI: 10.1093/cercor/bhaa327] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 10/09/2020] [Accepted: 10/09/2020] [Indexed: 12/15/2022] Open
Abstract
Structural brain changes along the lineage leading to modern Homo sapiens contributed to our distinctive cognitive and social abilities. However, the evolutionarily relevant molecular variants impacting key aspects of neuroanatomy are largely unknown. Here, we integrate evolutionary annotations of the genome at diverse timescales with common variant associations from large-scale neuroimaging genetic screens. We find that alleles with evidence of recent positive polygenic selection over the past 2000-3000 years are associated with increased surface area (SA) of the entire cortex, as well as specific regions, including those involved in spoken language and visual processing. Therefore, polygenic selective pressures impact the structure of specific cortical areas even over relatively recent timescales. Moreover, common sequence variation within human gained enhancers active in the prenatal cortex is associated with postnatal global SA. We show that such variation modulates the function of a regulatory element of the developmentally relevant transcription factor HEY2 in human neural progenitor cells and is associated with structural changes in the inferior frontal cortex. These results indicate that non-coding genomic regions active during prenatal cortical development are involved in the evolution of human brain structure and identify novel regulatory elements and genes impacting modern human brain structure.
Collapse
Affiliation(s)
- Amanda K Tilot
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, 6500 AH, Netherlands
- Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Marina del Rey, CA 90292, USA
| | - Ekaterina A Khramtsova
- Department of Medicine, Section of Genetic Medicine & Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
- Computational Sciences, Janssen Pharmaceuticals, Spring House, PA 19477, USA
| | - Dan Liang
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Katrina L Grasby
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Neda Jahanshad
- Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Marina del Rey, CA 90292, USA
| | - Jodie Painter
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Lucía Colodro-Conde
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Janita Bralten
- Radboud University Medical Center, 6525 XZ Nijmegen, Netherlands
| | | | - Penelope A Lind
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Siyao Liu
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Sarah M Brotman
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Paul M Thompson
- Mark and Mary Stevens Neuroimaging and Informatics Institute, Keck School of Medicine, University of Southern California, Marina del Rey, CA 90292, USA
| | - Sarah E Medland
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Fabio Macciardi
- Department of Psychiatry and Human Behavior, University of California, Irvine, CA 92697, USA
| | - Barbara E Stranger
- Department of Medicine, Section of Genetic Medicine & Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL 60637, USA
- Department of Pharmacology, Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Lea K Davis
- Department of Medicine, Division of Medical Genetics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN 37232, USA
- Vanderbilt University Medical Center, Vanderbilt Genetics Institute, Nashville, TN 37232, USA
| | - Simon E Fisher
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, 6500 AH, Netherlands
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, 6500 HB, Netherlands
| | - Jason L Stein
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|
79
|
Kerner G, Laval G, Patin E, Boisson-Dupuis S, Abel L, Casanova JL, Quintana-Murci L. Human ancient DNA analyses reveal the high burden of tuberculosis in Europeans over the last 2,000 years. Am J Hum Genet 2021; 108:517-524. [PMID: 33667394 PMCID: PMC8008489 DOI: 10.1016/j.ajhg.2021.02.009] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Accepted: 02/05/2021] [Indexed: 02/07/2023] Open
Abstract
Tuberculosis (TB), usually caused by Mycobacterium tuberculosis bacteria, is the first cause of death from an infectious disease at the worldwide scale, yet the mode and tempo of TB pressure on humans remain unknown. The recent discovery that homozygotes for the P1104A polymorphism of TYK2 are at higher risk to develop clinical forms of TB provided the first evidence of a common, monogenic predisposition to TB, offering a unique opportunity to inform on human co-evolution with a deadly pathogen. Here, we investigate the history of human exposure to TB by determining the evolutionary trajectory of the TYK2 P1104A variant in Europe, where TB is considered to be the deadliest documented infectious disease. Leveraging a large dataset of 1,013 ancient human genomes and using an approximate Bayesian computation approach, we find that the P1104A variant originated in the common ancestors of West Eurasians ∼30,000 years ago. Furthermore, we show that, following large-scale population movements of Anatolian Neolithic farmers and Eurasian steppe herders into Europe, P1104A has markedly fluctuated in frequency over the last 10,000 years of European history, with a dramatic decrease in frequency after the Bronze Age. Our analyses indicate that such a frequency drop is attributable to strong negative selection starting ∼2,000 years ago, with a relative fitness reduction on homozygotes of 20%, among the highest in the human genome. Together, our results provide genetic evidence that TB has imposed a heavy burden on European health over the last two millennia.
Collapse
|
80
|
Panda A, Subramanian K, Kahali B. Implementation of human whole genome sequencing data analysis: A containerized framework for sustained and enhanced throughput. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
81
|
Biddanda A, Rice DP, Novembre J. A variant-centric perspective on geographic patterns of human allele frequency variation. eLife 2020; 9:60107. [PMID: 33350384 PMCID: PMC7755386 DOI: 10.7554/elife.60107] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 11/12/2020] [Indexed: 12/14/2022] Open
Abstract
A key challenge in human genetics is to understand the geographic distribution of human genetic variation. Often genetic variation is described by showing relationships among populations or individuals, drawing inferences over many variants. Here, we introduce an alternative representation of genetic variation that reveals the relative abundance of different allele frequency patterns. This approach allows viewers to easily see several features of human genetic structure: (1) most variants are rare and geographically localized, (2) variants that are common in a single geographic region are more likely to be shared across the globe than to be private to that region, and (3) where two individuals differ, it is most often due to variants that are found globally, regardless of whether the individuals are from the same region or different regions. Our variant-centric visualization clarifies the geographic patterns of human variation and can help address misconceptions about genetic differentiation among populations.
Collapse
Affiliation(s)
- Arjun Biddanda
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - Daniel P Rice
- Department of Human Genetics, University of Chicago, Chicago, United States
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, United States
| |
Collapse
|
82
|
Walsh S, Izquierdo-Serra M, Acosta S, Edo A, Lloret M, Moret R, Bosch E, Oliva B, Bertranpetit J, Fernández-Fernández JM. Adaptive selection drives TRPP3 loss-of-function in an Ethiopian population. Sci Rep 2020; 10:20999. [PMID: 33268808 PMCID: PMC7710729 DOI: 10.1038/s41598-020-78081-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Accepted: 11/20/2020] [Indexed: 11/15/2022] Open
Abstract
TRPP3 (also called PKD2L1) is a nonselective, cation-permeable channel activated by multiple stimuli, including extracellular pH changes. TRPP3 had been considered a candidate for sour sensor in humans, due to its high expression in a subset of tongue receptor cells detecting sour, along with its membership to the TRP channel family known to function as sensory receptors. Here, we describe the functional consequences of two non-synonymous genetic variants (R278Q and R378W) found to be under strong positive selection in an Ethiopian population, the Gumuz. Electrophysiological studies and 3D modelling reveal TRPP3 loss-of-functions produced by both substitutions. R278Q impairs TRPP3 activation after alkalinisation by mislocation of H+ binding residues at the extracellular polycystin mucolipin domain. R378W dramatically reduces channel activity by altering conformation of the voltage sensor domain and hampering channel transition from closed to open state. Sour sensitivity tests in R278Q/R378W carriers argue against both any involvement of TRPP3 in sour detection and the role of such physiological process in the reported evolutionary positive selection past event.
Collapse
Affiliation(s)
- Sandra Walsh
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
| | - Mercè Izquierdo-Serra
- Laboratory of Molecular Physiology, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Sandra Acosta
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
| | - Albert Edo
- Laboratory of Molecular Physiology, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - María Lloret
- Laboratory of Molecular Physiology, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Roser Moret
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain
| | - Elena Bosch
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain.,Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), 43206, Reus, Spain
| | - Baldo Oliva
- Structural Bioinformatics Lab, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Jaume Bertranpetit
- Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, Dr. Aiguader, 88, 08003, Barcelona, Catalonia, Spain.
| | - José Manuel Fernández-Fernández
- Laboratory of Molecular Physiology, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003, Barcelona, Spain.
| |
Collapse
|
83
|
Nait Saada J, Kalantzis G, Shyr D, Cooper F, Robinson M, Gusev A, Palamara PF. Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations. Nat Commun 2020; 11:6130. [PMID: 33257650 PMCID: PMC7704644 DOI: 10.1038/s41467-020-19588-x] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 10/02/2020] [Indexed: 12/14/2022] Open
Abstract
Detection of Identical-By-Descent (IBD) segments provides a fundamental measure of genetic relatedness and plays a key role in a wide range of analyses. We develop FastSMC, an IBD detection algorithm that combines a fast heuristic search with accurate coalescent-based likelihood calculations. FastSMC enables biobank-scale detection and dating of IBD segments within several thousands of years in the past. We apply FastSMC to 487,409 UK Biobank samples and detect ~214 billion IBD segments transmitted by shared ancestors within the past 1500 years, obtaining a fine-grained picture of genetic relatedness in the UK. Sharing of common ancestors strongly correlates with geographic distance, enabling the use of genomic data to localize a sample's birth coordinates with a median error of 45 km. We seek evidence of recent positive selection by identifying loci with unusually strong shared ancestry and detect 12 genome-wide significant signals. We devise an IBD-based test for association between phenotype and ultra-rare loss-of-function variation, identifying 29 association signals in 7 blood-related traits.
Collapse
Affiliation(s)
| | | | - Derek Shyr
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Fergus Cooper
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Martin Robinson
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Alexander Gusev
- Brigham & Women's Hospital, Division of Genetics, Boston, MA, 02215, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Pier Francesco Palamara
- Department of Statistics, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
| |
Collapse
|
84
|
Alemu SW, Kadri NK, Harland C, Faux P, Charlier C, Caballero A, Druet T. An evaluation of inbreeding measures using a whole-genome sequenced cattle pedigree. Heredity (Edinb) 2020; 126:410-423. [PMID: 33159183 PMCID: PMC8027009 DOI: 10.1038/s41437-020-00383-9] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 10/23/2020] [Accepted: 10/23/2020] [Indexed: 11/13/2022] Open
Abstract
The estimation of the inbreeding coefficient (F) is essential for the study of inbreeding depression (ID) or for the management of populations under conservation. Several methods have been proposed to estimate the realized F using genetic markers, but it remains unclear which one should be used. Here we used whole-genome sequence data for 245 individuals from a Holstein cattle pedigree to empirically evaluate which estimators best capture homozygosity at variants causing ID, such as rare deleterious alleles or loci presenting heterozygote advantage and segregating at intermediate frequency. Estimators relying on the correlation between uniting gametes (FUNI) or on the genomic relationships (FGRM) presented the highest correlations with these variants. However, homozygosity at rare alleles remained poorly captured. A second group of estimators relying on excess homozygosity (FHOM), homozygous-by-descent segments (FHBD), runs-of-homozygosity (FROH) or on the known genealogy (FPED) was better at capturing whole-genome homozygosity, reflecting the consequences of inbreeding on all variants, and for young alleles with low to moderate frequencies (0.10 < . < 0.25). The results indicate that FUNI and FGRM might present a stronger association with ID. However, the situation might be different when recessive deleterious alleles reach higher frequencies, such as in populations with a small effective population size. For locus-specific inbreeding measures or at low marker density, the ranking of the methods can also change as FHBD makes better use of the information from neighboring markers. Finally, we confirmed that genomic measures are in general superior to pedigree-based estimates. In particular, FPED was uncorrelated with locus-specific homozygosity.
Collapse
Affiliation(s)
- Setegn Worku Alemu
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Naveen Kumar Kadri
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Chad Harland
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Pierre Faux
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Carole Charlier
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Armando Caballero
- Centro de Investigación Mariña, Departamento de Bioquímica, Genética e Inmunología, Edificio CC Experimentais, Universidade de Vigo, Campus de Vigo, As Lagoas, Marcosende, 36310, Vigo, Spain
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Liège, Belgium.
| |
Collapse
|
85
|
Abstract
Simulation plays a central role in population genomics studies. Recent years have seen rapid improvements in software efficiency that make it possible to simulate large genomic regions for many individuals sampled from large numbers of populations. As the complexity of the demographic models we study grows, however, there is an ever-increasing opportunity to introduce bugs in their implementation. Here, we describe two errors made in defining population genetic models using the msprime coalescent simulator that have found their way into the published record. We discuss how these errors have affected downstream analyses and give recommendations for software developers and users to reduce the risk of such errors.
Collapse
|
86
|
Zhang J, Kadri NK, Mullaart E, Spelman R, Fritz S, Boichard D, Charlier C, Georges M, Druet T. Genetic architecture of individual variation in recombination rate on the X chromosome in cattle. Heredity (Edinb) 2020; 125:304-316. [PMID: 32651548 DOI: 10.1038/s41437-020-0341-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 07/01/2020] [Accepted: 07/01/2020] [Indexed: 12/26/2022] Open
Abstract
Meiotic recombination is an essential biological process that ensures proper chromosome segregation and creates genetic diversity. Individual variation in global recombination rates has been shown to be heritable in several species, and variants significantly associated with this trait have been identified. Recombination on the sex chromosome has often been ignored in these studies although this trait may be particularly interesting as it may correspond to a biological process distinct from that on autosomes. For instance, recombination in males is restricted to the pseudo-autosomal region (PAR). We herein used a large cattle pedigree with more than 100,000 genotyped animals to improve the genetic map of the X chromosome and to study the genetic architecture of individual variation in recombination rate on the sex chromosome (XRR). The length of the genetic map was 46.4 and 121.2 cM in males and females, respectively, but the recombination rate in the PAR was six times higher in males. The heritability of CO counts on the X chromosome was comparable to that of autosomes in males (0.011) but larger than that of autosomes in females (0.024). XRR was highly correlated (0.76) with global recombination rate (GRR) in females, suggesting that both traits might be governed by shared variants. In agreement, a set of eleven previously identified variants associated with GRR had correlated effects on female XRR (0.86). In males, XRR and GRR appeared to be distinct traits, although more accurate CO counts on the PAR would be valuable to confirm these results.
Collapse
Affiliation(s)
- Junjie Zhang
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Naveen Kumar Kadri
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium.,Animal Genomics, Institute of Agricultural Science, ETH Zürich, Zürich, Switzerland
| | | | | | - Sébastien Fritz
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, Jouy-en-Josas, France.,Allice, Paris, France
| | | | - Carole Charlier
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Michel Georges
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium.
| |
Collapse
|
87
|
Cayuela H, Rougemont Q, Laporte M, Mérot C, Normandeau E, Dorant Y, Tørresen OK, Hoff SNK, Jentoft S, Sirois P, Castonguay M, Jansen T, Praebel K, Clément M, Bernatchez L. Shared ancestral polymorphisms and chromosomal rearrangements as potential drivers of local adaptation in a marine fish. Mol Ecol 2020; 29:2379-2398. [DOI: 10.1111/mec.15499] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Revised: 05/19/2020] [Accepted: 05/26/2020] [Indexed: 12/18/2022]
Affiliation(s)
- Hugo Cayuela
- Institut de Biologie Intégrative et des Systèmes (IBIS) Université Laval Quebec City QC Canada
| | - Quentin Rougemont
- Institut de Biologie Intégrative et des Systèmes (IBIS) Université Laval Quebec City QC Canada
| | - Martin Laporte
- Institut de Biologie Intégrative et des Systèmes (IBIS) Université Laval Quebec City QC Canada
| | - Claire Mérot
- Institut de Biologie Intégrative et des Systèmes (IBIS) Université Laval Quebec City QC Canada
| | - Eric Normandeau
- Institut de Biologie Intégrative et des Systèmes (IBIS) Université Laval Quebec City QC Canada
| | - Yann Dorant
- Institut de Biologie Intégrative et des Systèmes (IBIS) Université Laval Quebec City QC Canada
| | - Ole K. Tørresen
- Centre for Ecological and Evolutionary Synthesis (CEES) Department of Biosciences University of Oslo Oslo Norway
| | - Siv Nam Khang Hoff
- Centre for Ecological and Evolutionary Synthesis (CEES) Department of Biosciences University of Oslo Oslo Norway
| | - Sissel Jentoft
- Centre for Ecological and Evolutionary Synthesis (CEES) Department of Biosciences University of Oslo Oslo Norway
| | - Pascal Sirois
- Département des sciences fondamentales Université du Québec à Chicoutimi Chicoutimi QC Canada
| | - Martin Castonguay
- Fisheries and Oceans Canada Institut Maurice‐Lamontagne Mont‐Joli QC Canada
| | - Teunis Jansen
- GINR‐Greenland Institute of Natural Resources Nuuk Greenland
- DTU Aqua‐National Institute of Aquatic Resources Technical University of Denmark Charlottenlund Castle, Charlottenlund Denmark
| | - Kim Praebel
- Norwegian College of Fishery Science Faculty of Biosciences, Fisheries and Economics UiT The Arctic University of Norway Tromsø Norway
| | - Marie Clément
- Center for Fisheries Ecosystems Research Fisheries and Marine Institute of Memorial University of Newfoundland St. John's NL Canada
- Labrador Institute of Memorial University of Newfoundland Happy Valley‐Goose Bay NL Canada
| | - Louis Bernatchez
- Institut de Biologie Intégrative et des Systèmes (IBIS) Université Laval Quebec City QC Canada
| |
Collapse
|
88
|
Characterisation of a second gain of function EDAR variant, encoding EDAR380R, in East Asia. Eur J Hum Genet 2020; 28:1694-1702. [PMID: 32499598 DOI: 10.1038/s41431-020-0660-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 04/07/2020] [Accepted: 05/12/2020] [Indexed: 11/08/2022] Open
Abstract
Ectodysplasin A1 receptor (EDAR) is a TNF receptor family member with roles in the development and growth of hair, teeth and glands. A derived allele of EDAR, single-nucleotide variant rs3827760, encodes EDAR:p.(Val370Ala), a receptor with more potent signalling effects than the ancestral EDAR370Val. This allele of rs3827760 is at very high frequency in modern East Asian and Native American populations as a result of ancient positive selection and has been associated with straighter, thicker hair fibres, alteration of tooth and ear shape, reduced chin protrusion and increased fingertip sweat gland density. Here we report the characterisation of another SNV in EDAR, rs146567337, encoding EDAR:p.(Ser380Arg). The derived allele of this SNV is at its highest global frequency, of up to 5%, in populations of southern China, Vietnam, the Philippines, Malaysia and Indonesia. Using haplotype analyses, we find that the rs3827760 and rs146567337 SNVs arose on distinct haplotypes and that rs146567337 does not show the same signs of positive selection as rs3827760. From functional studies in cultured cells, we find that EDAR:p.(Ser380Arg) displays increased EDAR signalling output, at a similar level to that of EDAR:p.(Val370Ala). The existence of a second SNV with partly overlapping geographic distribution, the same in vitro functional effect and similar evolutionary age as the derived allele of rs3827760, but of independent origin and not exhibiting the same signs of strong selection, suggests a northern focus of positive selection on EDAR function in East Asia.
Collapse
|
89
|
Meyerson W, Leisman J, Navarro FCP, Gerstein M. Origins and characterization of variants shared between databases of somatic and germline human mutations. BMC Bioinformatics 2020; 21:227. [PMID: 32498674 PMCID: PMC7273669 DOI: 10.1186/s12859-020-3508-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 04/20/2020] [Indexed: 01/26/2023] Open
Abstract
Background Mutations arise in the human genome in two major settings: the germline and the soma. These settings involve different inheritance patterns, time scales, chromatin structures, and environmental exposures, all of which impact the resulting distribution of substitutions. Nonetheless, many of the same single nucleotide variants (SNVs) are shared between germline and somatic mutation databases, such as between the gnomAD database of 120,000 germline exomes and the TCGA database of 10,000 somatic exomes. Here, we sought to explain this overlap. Results After strict filtering to exclude common germline polymorphisms and sites with poor coverage or mappability, we found 336,987 variants shared between the somatic and germline databases. A uniform statistical model explains 34% of these shared variants; a model that incorporates the varying mutation rates of the basic mutation types explains another 50% of shared variants; and a model that includes extended nucleotide contexts (e.g. surrounding 3 bases on either side) explains an additional 4% of shared variants. Analysis of read depth finds mixed evidence that up to 4% of the shared variants may represent germline variants leaked into somatic call sets. 9% of the shared variants are not explained by any model. Sequencing errors and convergent evolution did not account for these. We surveyed other factors as well: Cancers driven by endogenous mutational processes share a greater fraction of variants with the germline, and recently derived germline variants were more likely to be somatically shared than were ancient germline ones. Conclusions Overall, we find that shared variants largely represent bona fide biological occurrences of the same variant in the germline and somatic setting and arise primarily because DNA has some of the same basic chemical vulnerabilities in either setting. Moreover, we find mixed evidence that somatic call-sets leak appreciable numbers of germline variants, which is relevant to genomic privacy regulations. In future studies, the similar chemical vulnerability of DNA between the somatic and germline settings might be used to help identify disease-related genes by guiding the development of background-mutation models that are informed by both somatic and germline patterns of variation.
Collapse
Affiliation(s)
- William Meyerson
- Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06511, USA. .,Yale School of Medicine, Yale University, New Haven, CT, 06510, USA.
| | - John Leisman
- Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT, 06510, USA
| | - Fabio C P Navarro
- Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06511, USA.,Molecular Biophysics & Biochemistry, Yale University, New Haven, CT, 06511, USA
| | - Mark Gerstein
- Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06511, USA. .,Yale School of Medicine, Yale University, New Haven, CT, 06510, USA. .,Molecular Biophysics & Biochemistry, Yale University, New Haven, CT, 06511, USA. .,Department of Computer Science, Yale University, New Haven, CT, 06511, USA.
| |
Collapse
|
90
|
A positively selected FBN1 missense variant reduces height in Peruvian individuals. Nature 2020; 582:234-239. [PMID: 32499652 PMCID: PMC7410362 DOI: 10.1038/s41586-020-2302-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Accepted: 03/10/2020] [Indexed: 01/21/2023]
Abstract
On average, Peruvian individuals are among the shortest in the world1. Here we show that Native American ancestry is associated with reduced height in an ethnically diverse group of Peruvian individuals, and identify a population-specific, missense variant in the FBN1 gene (E1297G) that is significantly associated with lower height. Each copy of the minor allele (frequency of 4.7%) reduces height by 2.2 cm (4.4 cm in homozygous individuals). To our knowledge, this is the largest effect size known for a common height-associated variant. FBN1 encodes the extracellular matrix protein fibrillin 1, which is a major structural component of microfibrils. We observed less densely packed fibrillin-1-rich microfibrils with irregular edges in the skin of individuals who were homozygous for G1297 compared with individuals who were homozygous for E1297. Moreover, we show that the E1297G locus is under positive selection in non-African populations, and that the E1297 variant shows subtle evidence of positive selection specifically within the Peruvian population. This variant is also significantly more frequent in coastal Peruvian populations than in populations from the Andes or the Amazon, which suggests that short stature might be the result of adaptation to factors that are associated with the coastal environment in Peru.
Collapse
|
91
|
Wakeley J. Developments in coalescent theory from single loci to chromosomes. Theor Popul Biol 2020; 133:56-64. [DOI: 10.1016/j.tpb.2020.02.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Revised: 02/19/2020] [Accepted: 02/26/2020] [Indexed: 10/24/2022]
|
92
|
Agrawal A, Chiu AM, Le M, Halperin E, Sankararaman S. Scalable probabilistic PCA for large-scale genetic variation data. PLoS Genet 2020; 16:e1008773. [PMID: 32469896 PMCID: PMC7286535 DOI: 10.1371/journal.pgen.1008773] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 06/10/2020] [Accepted: 04/09/2020] [Indexed: 01/04/2023] Open
Abstract
Principal component analysis (PCA) is a key tool for understanding population structure and controlling for population stratification in genome-wide association studies (GWAS). With the advent of large-scale datasets of genetic variation, there is a need for methods that can compute principal components (PCs) with scalable computational and memory requirements. We present ProPCA, a highly scalable method based on a probabilistic generative model, which computes the top PCs on genetic variation data efficiently. We applied ProPCA to compute the top five PCs on genotype data from the UK Biobank, consisting of 488,363 individuals and 146,671 SNPs, in about thirty minutes. To illustrate the utility of computing PCs in large samples, we leveraged the population structure inferred by ProPCA within White British individuals in the UK Biobank to identify several novel genome-wide signals of recent putative selection including missense mutations in RPGRIP1L and TLR4. Principal component analysis is a commonly used technique for understanding population structure and genetic variation. With the advent of large-scale datasets that contain the genetic information of hundreds of thousands of individuals, there is a need for methods that can compute principal components (PCs) with scalable computational and memory requirements. In this study, we present ProPCA, a highly scalable statistical method to compute genetic PCs efficiently. We systematically evaluate the accuracy and scalability of our method on large-scale simulated data and apply it to the UK Biobank. Leveraging the population structure inferred by ProPCA within the White British individuals in the UK Biobank, we identify several novel signals of putative recent selection.
Collapse
Affiliation(s)
- Aman Agrawal
- Department of Computer Science, Indian Institute of Technology, Delhi, India
| | - Alec M. Chiu
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, California, United States of America
| | - Minh Le
- Department of Computer Science, University of California, Los Angeles, California, United States of America
| | - Eran Halperin
- Department of Computer Science, University of California, Los Angeles, California, United States of America
- Department of Human Genetics, University of California, Los Angeles, California, United States of America
- Department of Anesthesiology and Perioperative Medicine, University of California, Los Angeles, California, United States of America
- Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, California, United States of America
- Institute of Precision Health, University of California, Los Angeles, California, United States of America
| | - Sriram Sankararaman
- Department of Computer Science, University of California, Los Angeles, California, United States of America
- Department of Human Genetics, University of California, Los Angeles, California, United States of America
- Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, California, United States of America
- * E-mail:
| |
Collapse
|
93
|
Inferring whole-genome histories in large population datasets. Nat Genet 2019; 51:1330-1338. [PMID: 31477934 PMCID: PMC6726478 DOI: 10.1038/s41588-019-0483-y] [Citation(s) in RCA: 121] [Impact Index Per Article: 24.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2019] [Accepted: 07/15/2019] [Indexed: 01/01/2023]
Abstract
Inferring the full genealogical history of a set of DNA sequences is a core problem in evolutionary biology, because this history encodes information about the events and forces that have influenced a species. However, current methods are limited, and the most accurate techniques are able to process no more than a hundred samples. As datasets that consist of millions of genomes are now being collected, there is a need for scalable and efficient inference methods to fully utilize these resources. Here we introduce an algorithm that is able to not only infer whole-genome histories with comparable accuracy to the state-of-the-art but also process four orders of magnitude more sequences. The approach also provides an 'evolutionary encoding' of the data, enabling efficient calculation of relevant statistics. We apply the method to human data from the 1000 Genomes Project, Simons Genome Diversity Project and UK Biobank, showing that the inferred genealogies are rich in biological signal and efficient to process.
Collapse
|