1
|
Zheng HX, Yan S, Zhang M, Gu Z, Wang J, Jin L. Mitochondrial DNA Genomes Reveal Relaxed Purifying Selection During Human Population Expansion after the Last Glacial Maximum. Mol Biol Evol 2024; 41:msae175. [PMID: 39162340 PMCID: PMC11373649 DOI: 10.1093/molbev/msae175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 07/31/2024] [Accepted: 08/06/2024] [Indexed: 08/21/2024] Open
Abstract
Modern humans have experienced explosive population growth in the past thousand years. We hypothesized that recent human populations have inhabited environments with relaxation of selective constraints, possibly due to the more abundant food supply after the Last Glacial Maximum. The ratio of nonsynonymous to synonymous mutations (N/S ratio) is a useful and common statistic for measuring selective constraints. In this study, we reconstructed a high-resolution phylogenetic tree using a total of 26,419 East Eurasian mitochondrial DNA genomes, which were further classified into expansion and nonexpansion groups on the basis of the frequencies of their founder lineages. We observed a much higher N/S ratio in the expansion group, especially for nonsynonymous mutations with moderately deleterious effects, indicating a weaker effect of purifying selection in the expanded clades. However, this observation on N/S ratio was unlikely in computer simulations where all individuals were under the same selective constraints. Thus, we argue that the expanded populations were subjected to weaker selective constraints than the nonexpanded populations were. The mildly deleterious mutations were retained during population expansion, which could have a profound impact on present-day disease patterns.
Collapse
Affiliation(s)
- Hong-Xiang Zheng
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Center for Evolutionary Biology, Fudan University, Shanghai, China
- Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai, China
| | - Shi Yan
- Ministry of Education Key Laboratory of Contemporary Anthropology, Department of Anthropology and Human Genetics, School of Life Sciences, Fudan University, Shanghai, China
- School of Ethnology and Sociology, Minzu University of China, Beijing, China
| | - Menghan Zhang
- Institute of Modern Languages and Linguistics, Fudan University, Shanghai, China
- Research Institute of Intelligent Complex Systems, Fudan University, Shanghai, China
| | - Zhenglong Gu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Center for Evolutionary Biology, Fudan University, Shanghai, China
- Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou, China
| | - Jiucun Wang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Center for Evolutionary Biology, Fudan University, Shanghai, China
- Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai, China
- Research Unit of Dissecting Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases (2019RU058), Chinese Academy of Medical Sciences, Beijing, China
| | - Li Jin
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute and Center for Evolutionary Biology, Fudan University, Shanghai, China
- Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai, China
- Research Unit of Dissecting Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases (2019RU058), Chinese Academy of Medical Sciences, Beijing, China
| |
Collapse
|
2
|
Kyriazis CC, Lohmueller KE. Constraining models of dominance for nonsynonymous mutations in the human genome. PLoS Genet 2024; 20:e1011198. [PMID: 39302992 PMCID: PMC11446423 DOI: 10.1371/journal.pgen.1011198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2024] [Revised: 10/02/2024] [Accepted: 09/04/2024] [Indexed: 09/22/2024] Open
Abstract
Dominance is a fundamental parameter in genetics, determining the dynamics of natural selection on deleterious and beneficial mutations, the patterns of genetic variation in natural populations, and the severity of inbreeding depression in a population. Despite this importance, dominance parameters remain poorly known, particularly in humans or other non-model organisms. A key reason for this lack of information about dominance is that it is extremely challenging to disentangle the selection coefficient (s) of a mutation from its dominance coefficient (h). Here, we explore dominance and selection parameters in humans by fitting models to the site frequency spectrum (SFS) for nonsynonymous mutations. When assuming a single dominance coefficient for all nonsynonymous mutations, we find that numerous h values can fit the data, so long as h is greater than ~0.15. Moreover, we also observe that theoretically-predicted models with a negative relationship between h and s can also fit the data well, including models with h = 0.05 for strongly deleterious mutations. Finally, we use our estimated dominance and selection parameters to inform simulations revisiting the question of whether the out-of-Africa bottleneck has led to differences in genetic load between African and non-African human populations. These simulations suggest that the relative burden of genetic load in non-African populations depends on the dominance model assumed, with slight increases for more weakly recessive models and slight decreases shown for more strongly recessive models. Moreover, these results also demonstrate that models of partially recessive nonsynonymous mutations can explain the observed severity of inbreeding depression in humans, bridging the gap between molecular population genetics and direct measures of fitness in humans. Our work represents a comprehensive assessment of dominance and deleterious variation in humans, with implications for parameterizing models of deleterious variation in humans and other mammalian species.
Collapse
Affiliation(s)
- Christopher C. Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California, United States of America
| | - Kirk E. Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California, United States of America
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, California, United States of America
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California, United States of America
| |
Collapse
|
3
|
Jabalameli MR, Lin JR, Zhang Q, Wang Z, Mitra J, Nguyen N, Gao T, Khusidman M, Sathyan S, Atzmon G, Milman S, Vijg J, Barzilai N, Zhang ZD. Polygenic prediction of human longevity on the supposition of pervasive pleiotropy. Sci Rep 2024; 14:19981. [PMID: 39198552 PMCID: PMC11358495 DOI: 10.1038/s41598-024-69069-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 07/31/2024] [Indexed: 09/01/2024] Open
Abstract
The highly polygenic nature of human longevity renders pleiotropy an indispensable feature of its genetic architecture. Leveraging the genetic correlation between aging-related traits (ARTs), we aimed to model the additive variance in lifespan as a function of the cumulative liability from pleiotropic segregating variants. We tracked allele frequency changes as a function of viability across different age bins and prioritized 34 variants with an immediate implication on lipid metabolism, body mass index (BMI), and cognitive performance, among other traits, revealed by PheWAS analysis in the UK Biobank. Given the highly complex and non-linear interactions between the genetic determinants of longevity, we reasoned that a composite polygenic score would approximate a substantial portion of the variance in lifespan and developed the integrated longevity genetic scores (iLGSs) for distinguishing exceptional survival. We showed that coefficients derived from our ensemble model could potentially reveal an interesting pattern of genomic pleiotropy specific to lifespan. We assessed the predictive performance of our model for distinguishing the enrichment of exceptional longevity among long-lived individuals in two replication cohorts (the Scripps Wellderly cohort and the Medical Genome Reference Bank (MRGB)) and showed that the median lifespan in the highest decile of our composite prognostic index is up to 4.8 years longer. Finally, using the proteomic correlates of iLGS, we identified protein markers associated with exceptional longevity irrespective of chronological age and prioritized drugs with repurposing potentials for gerotherapeutics. Together, our approach demonstrates a promising framework for polygenic modeling of additive liability conferred by ARTs in defining exceptional longevity and assisting the identification of individuals at a higher risk of mortality for targeted lifestyle modifications earlier in life. Furthermore, the proteomic signature associated with iLGS highlights the functional pathway upstream of the PI3K-Akt that can be effectively targeted to slow down aging and extend lifespan.
Collapse
Affiliation(s)
- M Reza Jabalameli
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Jhih-Rong Lin
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Quanwei Zhang
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Zhen Wang
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Joydeep Mitra
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Nha Nguyen
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Tina Gao
- Department of Medicine, Albert Einstein College of Medicine, New York, NY, USA
| | - Mark Khusidman
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Sanish Sathyan
- Department of Neurology, Albert Einstein College of Medicine, New York, NY, USA
| | - Gil Atzmon
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
- Department of Medicine, Albert Einstein College of Medicine, New York, NY, USA
- Faculty of Natural Sciences, University of Haifa, Haifa, Israel
| | - Sofiya Milman
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
- Department of Medicine, Albert Einstein College of Medicine, New York, NY, USA
| | - Jan Vijg
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Nir Barzilai
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
- Department of Medicine, Albert Einstein College of Medicine, New York, NY, USA
- Institute for Aging Research, Albert Einstein College of Medicine, New York, NY, USA
| | - Zhengdong D Zhang
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA.
| |
Collapse
|
4
|
Pearson NM, Novembre J. No evidence that ACE2 or TMPRSS2 drive population disparity in COVID risks. BMC Med 2024; 22:337. [PMID: 39183295 PMCID: PMC11346279 DOI: 10.1186/s12916-024-03539-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 07/22/2024] [Indexed: 08/27/2024] Open
Abstract
Early in the SARS-CoV2 pandemic, in this journal, Hou et al. (BMC Med 18:216, 2020) interpreted public genotype data, run through functional prediction tools, as suggesting that members of particular human populations carry potentially COVID-risk-increasing variants in genes ACE2 and TMPRSS2 far more often than do members of other populations. Beyond resting on predictions rather than clinical outcomes, and focusing on variants too rare to typify population members even jointly, their claim mistook a well known artifact (that large samples reveal more of a population's variants than do small samples) as if showing real and congruent population differences for the two genes, rather than lopsided population sampling in their shared source data. We explain that artifact, and contrast it with empirical findings, now ample, that other loci shape personal COVID risks far more significantly than do ACE2 and TMPRSS2-and that variation in ACE2 and TMPRSS2 per se unlikely exacerbates any net population disparity in the effects of such more risk-informative loci.
Collapse
Affiliation(s)
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
5
|
Kobren SN, Moldovan MA, Reimers R, Traviglia D, Li X, Barnum D, Veit A, Corona RI, Carvalho Neto GDV, Willett J, Berselli M, Ronchetti W, Nelson SF, Martinez-Agosto JA, Sherwood R, Krier J, Kohane IS, Sunyaev SR. Joint, multifaceted genomic analysis enables diagnosis of diverse, ultra-rare monogenic presentations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.13.580158. [PMID: 38405764 PMCID: PMC10888768 DOI: 10.1101/2024.02.13.580158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Genomics for rare disease diagnosis has advanced at a rapid pace due to our ability to perform "N-of-1" analyses on individual patients with ultra-rare diseases. The increasing sizes of ultra-rare disease cohorts internationally newly enables cohort-wide analyses for new discoveries, but well-calibrated statistical genetics approaches for jointly analyzing these patients are still under development.1,2 The Undiagnosed Diseases Network (UDN) brings multiple clinical, research and experimental centers under the same umbrella across the United States to facilitate and scale N-of-1 analyses. Here, we present the first joint analysis of whole genome sequencing data of UDN patients across the network. We introduce new, well-calibrated statistical methods for prioritizing disease genes with de novo recurrence and compound heterozygosity. We also detect pathways enriched with candidate and known diagnostic genes. Our computational analysis, coupled with a systematic clinical review, recapitulated known diagnoses and revealed new disease associations. We further release a software package, RaMeDiES, enabling automated cross-analysis of deidentified sequenced cohorts for new diagnostic and research discoveries. Gene-level findings and variant-level information across the cohort are available in a public-facing browser (https://dbmi-bgm.github.io/udn-browser/). These results show that N-of-1 efforts should be supplemented by a joint genomic analysis across cohorts.
Collapse
Affiliation(s)
| | | | | | - Daniel Traviglia
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Xinyun Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT
| | | | - Alexander Veit
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Rosario I. Corona
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - George de V. Carvalho Neto
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - Julian Willett
- Department of Pathology and Laboratory Medicine, NewYork-Presbyterian Weill Cornell Medical Center, New York, NY
| | - Michele Berselli
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - William Ronchetti
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Stanley F. Nelson
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - Julian A. Martinez-Agosto
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - Richard Sherwood
- Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| | - Joel Krier
- Department of Genetics, Atrius Health, Boston, MA
| | - Isaac S. Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | | | - Shamil R. Sunyaev
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| |
Collapse
|
6
|
He J, Kou SH, Li J, Ding X, Wang SM. Pathogenic variants in human DNA damage repair genes mostly arose after the latest human out-of-Africa migration. Front Genet 2024; 15:1408952. [PMID: 38948361 PMCID: PMC11211533 DOI: 10.3389/fgene.2024.1408952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 05/21/2024] [Indexed: 07/02/2024] Open
Abstract
Introduction The DNA damage repair (DDR) system in human genome is pivotal in maintaining genomic integrity. Pathogenic variation (PV) in DDR genes impairs their function, leading to genome instability and increased susceptibility to diseases, especially cancer. Understanding the evolution origin and arising time of DDR PV is crucial for comprehending disease susceptibility in modern humans. Methods We used big data approach to identify the PVs in DDR genes in modern humans. We mined multiple genomic databases derived from 251,214 modern humans of African and non-Africans. We compared the DDR PVs between African and non-African. We also mined the DDR PVs in the genomic data derived from 5,031 ancient humans. We used the DDR PVs from ancient humans as the intermediate to further the DDR PVs between African and non-African. Results and discussion We identified 1,060 single-base DDR PVs across 77 DDR genes in modern humans of African and non-African. Direct comparison of the DDR PVs between African and non-African showed that 82.1% of the non-African PVs were not present in African. We further identified 397 single-base DDR PVs in 56 DDR genes in the 5,031 ancient humans dated between 45,045 and 100 years before present (BP) lived in Eurasian continent therefore the descendants of the latest out-of-Africa human migrants occurred 50,000-60,000 years ago. By referring to the ancient DDR PVs, we observed that 276 of the 397 (70.3%) ancient DDR PVs were exclusive in non-African, 106 (26.7%) were shared between non-African and African, and only 15 (3.8%) were exclusive in African. We further validated the distribution pattern by testing the PVs in BRCA and TP53, two of the important genes in genome stability maintenance, in African, non-African, and Ancient humans. Our study revealed that DDR PVs in modern humans mostly emerged after the latest out-of-Africa migration. The data provides a foundation to understand the evolutionary basis of disease susceptibility, in particular cancer, in modern humans.
Collapse
Affiliation(s)
| | | | | | | | - San Ming Wang
- Department of Public Health and Medical Administration, Faculty of Health Sciences, Ministry of Education Frontiers Science Center for Precision Oncology, Cancer Centre and Institute of Translational Medicine, University of Macau, Taipa, China
| |
Collapse
|
7
|
Laurent R, Gineau L, Utge J, Lafosse S, Phoeung CL, Hegay T, Olaso R, Boland A, Deleuze JF, Toupance B, Heyer E, Leutenegger AL, Chaix R. Measuring the Efficiency of Purging by non-random Mating in Human Populations. Mol Biol Evol 2024; 41:msae094. [PMID: 38839045 PMCID: PMC11184347 DOI: 10.1093/molbev/msae094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 05/07/2024] [Accepted: 05/10/2024] [Indexed: 06/07/2024] Open
Abstract
Human populations harbor a high concentration of deleterious genetic variants. Here, we tested the hypothesis that non-random mating practices affect the distribution of these variants, through exposure in the homozygous state, leading to their purging from the population gene pool. To do so, we produced whole-genome sequencing data for two pairs of Asian populations exhibiting different alliance rules and rates of inbreeding, but with similar effective population sizes. The results show that populations with higher rates of inbred matings do not purge deleterious variants more efficiently. Purging therefore has a low efficiency in human populations, and different mating practices lead to a similar mutational load.
Collapse
Affiliation(s)
- Romain Laurent
- Eco-anthropologie (EA), Muséum National d'Histoire Naturelle, CNRS, Université Paris Cité, 75016 Paris, France
| | - Laure Gineau
- IRD, MERIT, Université Paris Cité, 75006 Paris, France
| | - José Utge
- Eco-anthropologie (EA), Muséum National d'Histoire Naturelle, CNRS, Université Paris Cité, 75016 Paris, France
| | - Sophie Lafosse
- Eco-anthropologie (EA), Muséum National d'Histoire Naturelle, CNRS, Université Paris Cité, 75016 Paris, France
| | | | - Tatyana Hegay
- Laboratory of Genome-cell technology, Institute of Immunology and Human genomics, Academy of Sciences, Tashkent, Uzbekistan
| | - Robert Olaso
- Centre National de Recherche en Génomique Humaine (CNRGH), CEA, Université Paris-Saclay, 91057, Evry, France
| | - Anne Boland
- Centre National de Recherche en Génomique Humaine (CNRGH), CEA, Université Paris-Saclay, 91057, Evry, France
| | - Jean-François Deleuze
- Centre National de Recherche en Génomique Humaine (CNRGH), CEA, Université Paris-Saclay, 91057, Evry, France
| | - Bruno Toupance
- Eco-anthropologie (EA), Muséum National d'Histoire Naturelle, CNRS, Université Paris Cité, 75016 Paris, France
- Eco-Anthropologie, Université Paris Cité, 75006 Paris, France
| | - Evelyne Heyer
- Eco-anthropologie (EA), Muséum National d'Histoire Naturelle, CNRS, Université Paris Cité, 75016 Paris, France
| | | | - Raphaëlle Chaix
- Eco-anthropologie (EA), Muséum National d'Histoire Naturelle, CNRS, Université Paris Cité, 75016 Paris, France
| |
Collapse
|
8
|
Buffalo V, Kern AD. A quantitative genetic model of background selection in humans. PLoS Genet 2024; 20:e1011144. [PMID: 38507461 PMCID: PMC10984650 DOI: 10.1371/journal.pgen.1011144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 04/01/2024] [Accepted: 01/19/2024] [Indexed: 03/22/2024] Open
Abstract
Across the human genome, there are large-scale fluctuations in genetic diversity caused by the indirect effects of selection. This "linked selection signal" reflects the impact of selection according to the physical placement of functional regions and recombination rates along chromosomes. Previous work has shown that purifying selection acting against the steady influx of new deleterious mutations at functional portions of the genome shapes patterns of genomic variation. To date, statistical efforts to estimate purifying selection parameters from linked selection models have relied on classic Background Selection theory, which is only applicable when new mutations are so deleterious that they cannot fix in the population. Here, we develop a statistical method based on a quantitative genetics view of linked selection, that models how polygenic additive fitness variance distributed along the genome increases the rate of stochastic allele frequency change. By jointly predicting the equilibrium fitness variance and substitution rate due to both strong and weakly deleterious mutations, we estimate the distribution of fitness effects (DFE) and mutation rate across three geographically distinct human samples. While our model can accommodate weaker selection, we find evidence of strong selection operating similarly across all human samples. Although our quantitative genetic model of linked selection fits better than previous models, substitution rates of the most constrained sites disagree with observed divergence levels. We find that a model incorporating selective interference better predicts observed divergence in conserved regions, but overall our results suggest uncertainty remains about the processes generating fitness variation in humans.
Collapse
Affiliation(s)
- Vince Buffalo
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| | - Andrew D. Kern
- Institute of Ecology and Evolution and Department of Biology, University of Oregon, Eugene, Oregon, United States of America
| |
Collapse
|
9
|
Kyriazis CC, Lohmueller KE. Constraining models of dominance for nonsynonymous mutations in the human genome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.25.582010. [PMID: 38463985 PMCID: PMC10925099 DOI: 10.1101/2024.02.25.582010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Dominance is a fundamental parameter in genetics, determining the dynamics of natural selection on deleterious and beneficial mutations, the patterns of genetic variation in natural populations, and the severity of inbreeding depression in a population. Despite this importance, dominance parameters remain poorly known, particularly in humans or other non-model organisms. A key reason for this lack of information about dominance is that it is extremely challenging to disentangle the selection coefficient (s) of a mutation from its dominance coefficient (h). Here, we explore dominance and selection parameters in humans by fitting models to the site frequency spectrum (SFS) for nonsynonymous mutations. When assuming a single dominance coefficient for all nonsynonymous mutations, we find that numerous h values can fit the data, so long as h is greater than ~0.15. Moreover, we also observe that theoretically-predicted models with a negative relationship between h and s can also fit the data well, including models with h=0.05 for strongly deleterious mutations. Finally, we use our estimated dominance and selection parameters to inform simulations revisiting the question of whether the out-of-Africa bottleneck has led to differences in genetic load between African and non-African human populations. These simulations suggest that the relative burden of genetic load in non-African populations depends on the dominance model assumed, with slight increases for more weakly recessive models and slight decreases shown for more strongly recessive models. Moreover, these results also demonstrate that models of partially recessive nonsynonymous mutations can explain the observed severity of inbreeding depression in humans, bridging the gap between molecular population genetics and direct measures of fitness in humans. Our work represents a comprehensive assessment of dominance and deleterious variation in humans, with implications for parameterizing models of deleterious variation in humans and other mammalian species.
Collapse
Affiliation(s)
| | - Kirk E. Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, USA
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, USA
- Department of Human Genetics, David Geffen School of Medicine, Los Angeles, USA
| |
Collapse
|
10
|
Cui R, Wu J, Yan K, Luo S, Hu Y, Feng W, Lu B, Wang J. Phased genome assemblies reveal haplotype-specific genetic load in the critically endangered Chinese Bahaba (Teleostei, Sciaenidae). Mol Ecol 2024; 33:e17250. [PMID: 38179694 DOI: 10.1111/mec.17250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 12/06/2023] [Accepted: 12/11/2023] [Indexed: 01/06/2024]
Abstract
While haplotype-specific genetic load shapes the evolutionary trajectory of natural and captive populations, mixed-haplotype assembly and genotyping hindered its characterization in diploids. Herein, we produced two phased genome assemblies of the critically endangered fish Chinese Bahaba (Bahaba taipingensis, Sciaenidae, Teleostei) and resequenced 20 whole genomes to quantify population genetic load at a haplotype level. We identified frame-shifting variants as the most deleterious type, followed by mutations in the 5'-UTR, 3'-UTR and missense mutations at conserved amino acids. Phased haplotypes revealed gene deletions and high-impact deleterious variants. We estimated ~1.12% of genes missing or interrupted per haplotype, with a significant overlap of disrupted genes (30.35%) between haplotype sets. Relative proportions of deleterious variant categories differed significantly between haplotypes. Simulations suggested that purifying selection struggled to purge slightly deleterious genetic load in captive breeding compared to genotyping interventions, and that higher inter-haplotypic variance of genetic load predicted more efficient purging by artificial selection. Combining the knowledge of haplotype-resolved genetic load with predictive modelling will be immensely useful for understanding the evolution of deleterious variants and guiding conservation planning.
Collapse
Affiliation(s)
- Rongfeng Cui
- School of Ecology & State Key Laboratory of Biocontrol, Sun Yat-sen University, Shenzhen, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Jinxian Wu
- Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring, Guangdong-Macao Joint Laboratory for Aquaculture Breeding Development and Innovation, School of Life Sciences, South China Normal University, Guangzhou, China
| | - Kuoqiu Yan
- Huangjing Marine Biotechnology Co. Ltd., Huizhou, China
| | - Sujun Luo
- Dongguan Forestry Affairs Center, Dongguan, China
| | - Yuting Hu
- Dongguan Forestry Affairs Center, Dongguan, China
| | - Wei Feng
- Dongguan Forestry Affairs Center, Dongguan, China
| | - Bingqian Lu
- Dongguan Forestry Affairs Center, Dongguan, China
| | - Junjie Wang
- Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring, Guangdong-Macao Joint Laboratory for Aquaculture Breeding Development and Innovation, School of Life Sciences, South China Normal University, Guangzhou, China
| |
Collapse
|
11
|
Lucas-Sánchez M, Abdeli A, Bekada A, Calafell F, Benhassine T, Comas D. The Impact of Recent Demography on Functional Genetic Variation in North African Human Groups. Mol Biol Evol 2024; 41:msad283. [PMID: 38152862 PMCID: PMC10783648 DOI: 10.1093/molbev/msad283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 11/22/2023] [Accepted: 12/19/2023] [Indexed: 12/29/2023] Open
Abstract
The strategic location of North Africa has made the region the core of a wide range of human demographic events, including migrations, bottlenecks, and admixture processes. This has led to a complex and heterogeneous genetic and cultural landscape, which remains poorly studied compared to other world regions. Whole-exome sequencing is particularly relevant to determine the effects of these demographic events on current-day North Africans' genomes, since it allows to focus on those parts of the genome that are more likely to have direct biomedical consequences. Whole-exome sequencing can also be used to assess the effect of recent demography in functional genetic variation and the efficacy of natural selection, a long-lasting debate. In the present work, we use newly generated whole-exome sequencing and genome-wide array genotypes to investigate the effect of demography in functional variation in 7 North African populations, considering both cultural and demographic differences and with a special focus on Amazigh (plur. Imazighen) groups. We detect genetic differences among populations related to their degree of isolation and the presence of bottlenecks in their recent history. We find differences in the functional part of the genome that suggest a relaxation of purifying selection in the more isolated groups, allowing for an increase of putatively damaging variation. Our results also show a shift in mutational load coinciding with major demographic events in the region and reveal differences within and between cultural and geographic groups.
Collapse
Affiliation(s)
- Marcel Lucas-Sánchez
- Departament de Medicina i Ciències de la Vida, Institut de Biologia Evolutiva (CSIC-UPF), Universitat Pompeu Fabra, Barcelona, Spain
| | - Amine Abdeli
- Faculté des Sciences Biologiques, Laboratoire de Biologie Cellulaire et Moléculaire, Université des Sciences et de la Technologie Houari Boumediene, Alger, Algeria
| | - Asmahan Bekada
- Département de Biotechnologie, Faculté des Sciences de la Nature et de la Vie, Université Oran 1 (Ahmad Ben Bella), Oran, Algeria
| | - Francesc Calafell
- Departament de Medicina i Ciències de la Vida, Institut de Biologia Evolutiva (CSIC-UPF), Universitat Pompeu Fabra, Barcelona, Spain
| | - Traki Benhassine
- Faculté des Sciences Biologiques, Laboratoire de Biologie Cellulaire et Moléculaire, Université des Sciences et de la Technologie Houari Boumediene, Alger, Algeria
| | - David Comas
- Departament de Medicina i Ciències de la Vida, Institut de Biologia Evolutiva (CSIC-UPF), Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
12
|
Jabalameli M, Lin JR, Zhang Q, Wang Z, Mitra J, Nguyen N, Gao T, Khusidman M, Atzmon G, Milman S, Vijg J, Barzilai N, Zhang ZD. Polygenic prediction of human longevity on the supposition of pervasive pleiotropy. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.10.23299795. [PMID: 38168353 PMCID: PMC10760260 DOI: 10.1101/2023.12.10.23299795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
The highly polygenic nature of human longevity renders cross-trait pleiotropy an indispensable feature of its genetic architecture. Leveraging the genetic correlation between the aging-related traits (ARTs), we sought to model the additive variance in lifespan as a function of cumulative liability from pleiotropic segregating variants. We tracked allele frequency changes as a function of viability across different age bins and prioritized 34 variants with an immediate implication on lipid metabolism, body mass index (BMI), and cognitive performance, among other traits, revealed by PheWAS analysis in the UK Biobank. Given the highly complex and non-linear interactions between the genetic determinants of longevity, we reasoned that a composite polygenic score would approximate a substantial portion of the variance in lifespan and developed the integrated longevity genetic scores (iLGSs) for distinguishing exceptional survival. We showed that coefficients derived from our ensemble model could potentially reveal an interesting pattern of genomic pleiotropy specific to lifespan. We assessed the predictive performance of our model for distinguishing the enrichment of exceptional longevity among long-lived individuals in two replication cohorts and showed that the median lifespan in the highest decile of our composite prognostic index is up to 4.8 years longer. Finally, using the proteomic correlates of i L G S , we identified protein markers associated with exceptional longevity irrespective of chronological age and prioritized drugs with repurposing potentials for gerotherapeutics. Together, our approach demonstrates a promising framework for polygenic modeling of additive liability conferred by ARTs in defining exceptional longevity and assisting the identification of individuals at higher risk of mortality for targeted lifestyle modifications earlier in life. Furthermore, the proteomic signature associated with i L G S highlights the functional pathway upstream of the PI3K-Akt that can be effectively targeted to slow down aging and extend lifespan.
Collapse
Affiliation(s)
- M.Reza Jabalameli
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Jhih-Rong Lin
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Quanwei Zhang
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Zhen Wang
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Joydeep Mitra
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Nha Nguyen
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Tina Gao
- Department of Medicine, Albert Einstein College of Medicine, New York, NY, USA
| | - Mark Khusidman
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Gil Atzmon
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Sofiya Milman
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
- Department of Medicine, Albert Einstein College of Medicine, New York, NY, USA
| | - Jan Vijg
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| | - Nir Barzilai
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
- Department of Medicine, Albert Einstein College of Medicine, New York, NY, USA
| | - Zhengdong D. Zhang
- Department of Genetics, Albert Einstein College of Medicine, New York, NY, USA
| |
Collapse
|
13
|
James J, Kastally C, Budde KB, González-Martínez SC, Milesi P, Pyhäjärvi T, Lascoux M. Between but Not Within-Species Variation in the Distribution of Fitness Effects. Mol Biol Evol 2023; 40:msad228. [PMID: 37832225 PMCID: PMC10630145 DOI: 10.1093/molbev/msad228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 09/04/2023] [Accepted: 09/25/2023] [Indexed: 10/15/2023] Open
Abstract
New mutations provide the raw material for evolution and adaptation. The distribution of fitness effects (DFE) describes the spectrum of effects of new mutations that can occur along a genome, and is, therefore, of vital interest in evolutionary biology. Recent work has uncovered striking similarities in the DFE between closely related species, prompting us to ask whether there is variation in the DFE among populations of the same species, or among species with different degrees of divergence, that is whether there is variation in the DFE at different levels of evolution. Using exome capture data from six tree species sampled across Europe we characterized the DFE for multiple species, and for each species, multiple populations, and investigated the factors potentially influencing the DFE, such as demography, population divergence, and genetic background. We find statistical support for the presence of variation in the DFE at the species level, even among relatively closely related species. However, we find very little difference at the population level, suggesting that differences in the DFE are primarily driven by deep features of species biology, and those evolutionarily recent events, such as demographic changes and local adaptation, have little impact.
Collapse
Affiliation(s)
- Jennifer James
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Swedish Collegium of Advanced Study, Uppsala University, Uppsala, Sweden
| | - Chedly Kastally
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Katharina B Budde
- Department of Forest Genetics and Forest Tree Breeding, Georg-August-University Goettingen, Goettingen, Germany
- Center of Biodiversity and Sustainable Land Use (CBL), University of Goettingen, Goettingen, Germany
| | - Santiago C González-Martínez
- National Research Institute for Agriculture, Food and the Environment (INRAE), University of Bordeaux, BIOGECO, Cestas, France
| | - Pascal Milesi
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, Uppsala, Sweden
| | - Tanja Pyhäjärvi
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Martin Lascoux
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| |
Collapse
|
14
|
Swinford NA, Prall SP, Gopalan S, Williams CM, Sheehama J, Scelza BA, Henn BM. Increased homozygosity due to endogamy results in fitness consequences in a human population. Proc Natl Acad Sci U S A 2023; 120:e2309552120. [PMID: 37847737 PMCID: PMC10614605 DOI: 10.1073/pnas.2309552120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 09/18/2023] [Indexed: 10/19/2023] Open
Abstract
Recessive alleles have been shown to directly affect both human Mendelian disease phenotypes and complex traits. Pedigree studies also suggest that consanguinity results in increased childhood mortality and adverse health phenotypes, presumably through penetrance of recessive mutations. Here, we test whether the accumulation of homozygous, recessive alleles decreases reproductive success in a human population. We address this question among the Namibian Himba, an endogamous agro-pastoralist population, who until very recently practiced natural fertility. Using a sample of 681 individuals, we show that Himba exhibit elevated levels of "inbreeding," calculated as the fraction of the genome in runs of homozygosity (FROH). Many individuals contain multiple long segments of ROH in their genomes, indicating that their parents had high kinship coefficients. However, we do not find evidence that this is explained by first-cousin consanguinity, despite a reported social preference for cross-cousin marriages. Rather, we show that elevated haplotype sharing in the Himba is due to a bottleneck, likely in the past 60 generations. We test whether increased recessive mutation load results in observed fitness consequences by assessing the effect of FROH on completed fertility in a cohort of postreproductive women (n = 69). We find that higher FROH is significantly associated with lower fertility. Our data suggest a multilocus genetic effect on fitness driven by the expression of deleterious recessive alleles, especially those in long ROH. However, these effects are not the result of consanguinity but rather elevated background identity by descent.
Collapse
Affiliation(s)
- N. A. Swinford
- Department of Anthropology, University of California Davis, Davis, CA95616
| | - S. P. Prall
- Department of Anthropology, University of Missouri, Columbia, MO65211
| | - S. Gopalan
- Department of Evolutionary Anthropology, Duke University, Durham, NC27708
| | - C. M. Williams
- Center for Computational Molecular Biology, Brown University, Providence, RI02912
| | - J. Sheehama
- Department of Human, Biological and Translational Medical Sciences, School of Medicine University of Namibia, Oshakati10005, Namibia
| | - B. A. Scelza
- Department of Anthropology, University of California Los Angeles, Los Angeles, CA90095
| | - B. M. Henn
- Department of Anthropology, University of California Davis, Davis, CA95616
- Center for Population Biology, University of California Davis, Davis, CA95616
- Genome Center, University of California Davis, Davis, CA95616
| |
Collapse
|
15
|
Sohail M, Palma-Martínez MJ, Chong AY, Quinto-Cortés CD, Barberena-Jonas C, Medina-Muñoz SG, Ragsdale A, Delgado-Sánchez G, Cruz-Hervert LP, Ferreyra-Reyes L, Ferreira-Guerrero E, Mongua-Rodríguez N, Canizales-Quintero S, Jimenez-Kaufmann A, Moreno-Macías H, Aguilar-Salinas CA, Auckland K, Cortés A, Acuña-Alonzo V, Gignoux CR, Wojcik GL, Ioannidis AG, Fernández-Valverde SL, Hill AVS, Tusié-Luna MT, Mentzer AJ, Novembre J, García-García L, Moreno-Estrada A. Mexican Biobank advances population and medical genomics of diverse ancestries. Nature 2023; 622:775-783. [PMID: 37821706 PMCID: PMC10600006 DOI: 10.1038/s41586-023-06560-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 08/22/2023] [Indexed: 10/13/2023]
Abstract
Latin America continues to be severely underrepresented in genomics research, and fine-scale genetic histories and complex trait architectures remain hidden owing to insufficient data1. To fill this gap, the Mexican Biobank project genotyped 6,057 individuals from 898 rural and urban localities across all 32 states in Mexico at a resolution of 1.8 million genome-wide markers with linked complex trait and disease information creating a valuable nationwide genotype-phenotype database. Here, using ancestry deconvolution and inference of identity-by-descent segments, we inferred ancestral population sizes across Mesoamerican regions over time, unravelling Indigenous, colonial and postcolonial demographic dynamics2-6. We observed variation in runs of homozygosity among genomic regions with different ancestries reflecting distinct demographic histories and, in turn, different distributions of rare deleterious variants. We conducted genome-wide association studies (GWAS) for 22 complex traits and found that several traits are better predicted using the Mexican Biobank GWAS compared to the UK Biobank GWAS7,8. We identified genetic and environmental factors associating with trait variation, such as the length of the genome in runs of homozygosity as a predictor for body mass index, triglycerides, glucose and height. This study provides insights into the genetic histories of individuals in Mexico and dissects their complex trait architectures, both crucial for making precision and preventive medicine initiatives accessible worldwide.
Collapse
Affiliation(s)
- Mashaal Sohail
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico.
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
- Centro de Ciencias Genómicas (CCG), Universidad Nacional Autónoma de México (UNAM), Cuernavaca, Mexico.
| | - María J Palma-Martínez
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico
| | - Amanda Y Chong
- The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Consuelo D Quinto-Cortés
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico
| | - Carmina Barberena-Jonas
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico
| | - Santiago G Medina-Muñoz
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico
| | - Aaron Ragsdale
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Luis Pablo Cruz-Hervert
- Instituto Nacional de Salud Pública (INSP), Cuernavaca, Mexico
- División de Estudios de Posgrado e Investigación, Facultad de Odontología, Universidad Nacional Autónoma de México (UNAM), Mexico City, Mexico
| | | | | | | | | | - Andrés Jimenez-Kaufmann
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico
| | - Hortensia Moreno-Macías
- Unidad de Biología Molecular y Medicina Genómica, Instituto de Investigaciones Biomédicas UNAM/Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
- Universidad Autónoma Metropolitana, Mexico City, Mexico
| | - Carlos A Aguilar-Salinas
- Division de Nutrición, Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Kathryn Auckland
- The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Adrián Cortés
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
| | | | - Christopher R Gignoux
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Genevieve L Wojcik
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | | | - Selene L Fernández-Valverde
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico
- School of Biotechnology and Biomolecular Sciences and the RNA Institute, The University of New South Wales, Sydney, New South Wales, Australia
| | - Adrian V S Hill
- The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- The Jenner Institute, University of Oxford, Oxford, UK
| | - María Teresa Tusié-Luna
- Unidad de Biología Molecular y Medicina Genómica, Instituto de Investigaciones Biomédicas UNAM/Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
| | - Alexander J Mentzer
- The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK.
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | | | - Andrés Moreno-Estrada
- Unidad de Genómica Avanzada (UGA-LANGEBIO), Centro de Investigación y Estudios Avanzados del IPN (Cinvestav), Irapuato, Mexico.
| |
Collapse
|
16
|
Nigenda-Morales SF, Lin M, Nuñez-Valencia PG, Kyriazis CC, Beichman AC, Robinson JA, Ragsdale AP, Urbán R J, Archer FI, Viloria-Gómora L, Pérez-Álvarez MJ, Poulin E, Lohmueller KE, Moreno-Estrada A, Wayne RK. The genomic footprint of whaling and isolation in fin whale populations. Nat Commun 2023; 14:5465. [PMID: 37699896 PMCID: PMC10497599 DOI: 10.1038/s41467-023-40052-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 07/10/2023] [Indexed: 09/14/2023] Open
Abstract
Twentieth century industrial whaling pushed several species to the brink of extinction, with fin whales being the most impacted. However, a small, resident population in the Gulf of California was not targeted by whaling. Here, we analyzed 50 whole-genomes from the Eastern North Pacific (ENP) and Gulf of California (GOC) fin whale populations to investigate their demographic history and the genomic effects of natural and human-induced bottlenecks. We show that the two populations diverged ~16,000 years ago, after which the ENP population expanded and then suffered a 99% reduction in effective size during the whaling period. In contrast, the GOC population remained small and isolated, receiving less than one migrant per generation. However, this low level of migration has been crucial for maintaining its viability. Our study exposes the severity of whaling, emphasizes the importance of migration, and demonstrates the use of genome-based analyses and simulations to inform conservation strategies.
Collapse
Affiliation(s)
- Sergio F Nigenda-Morales
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico.
- Department of Biological Sciences, California State University San Marcos, San Marcos, CA, 92096, USA.
| | - Meixi Lin
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, 94305, USA.
| | - Paulina G Nuñez-Valencia
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico
- Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México (UNAM), Cuernavaca, Morelos, México
| | - Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Annabel C Beichman
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Jacqueline A Robinson
- Institute for Human Genetics, University of California, San Francisco (UCSF), San Francisco, CA, 94143, USA
| | - Aaron P Ragsdale
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico
- Department of Integrative Biology, University of Wisconsin, Madison, WI, 53706, USA
| | - Jorge Urbán R
- Departamento de Ciencias Marinas y Costeras, Universidad Autónoma de Baja California Sur (UABCS), La Paz, Baja California Sur, Mexico
| | - Frederick I Archer
- Marine Mammal and Turtle Division, Southwest Fisheries Science Center, La Jolla, CA, 92037, USA
| | - Lorena Viloria-Gómora
- Departamento de Ciencias Marinas y Costeras, Universidad Autónoma de Baja California Sur (UABCS), La Paz, Baja California Sur, Mexico
| | - María José Pérez-Álvarez
- Escuela de Medicina Veterinaria, Facultad de Medicina y Ciencias de la Salud, Universidad Mayor, Santiago, Chile
- Millennium Institute Biodiversity of Antarctic and Subantarctic Ecosystems (BASE), Universidad de Chile, Santiago, Chile
| | - Elie Poulin
- Millennium Institute Biodiversity of Antarctic and Subantarctic Ecosystems (BASE), Universidad de Chile, Santiago, Chile
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA, 90095, USA.
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA, USA.
| | - Andrés Moreno-Estrada
- Advanced Genomics Unit, National Laboratory of Genomics for Biodiversity (Langebio), Center for Research and Advanced Studies (Cinvestav), Irapuato, Guanajuato, 36824, Mexico.
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| |
Collapse
|
17
|
Matheson J, Bertram J, Masel J. Human deleterious mutation rate implies high fitness variance, with declining mean fitness compensated by rarer beneficial mutations of larger effect. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.01.555871. [PMID: 37732183 PMCID: PMC10508744 DOI: 10.1101/2023.09.01.555871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/22/2023]
Abstract
Each new human has an expected Ud = 2 - 10 new deleterious mutations. This deluge of deleterious mutations cannot all be purged, and therefore accumulate in a declining fitness ratchet. Using a novel simulation framework designed to efficiently handle genome-wide linkage disequilibria across many segregating sites, we find that rarer, beneficial mutations of larger effect are sufficient to compensate fitness declines due to the fixation of many slightly deleterious mutations. Drift barrier theory posits a similar asymmetric pattern of fixations to explain ratcheting genome size and complexity, but in our theory, the cause is Ud > 1 rather than small population size. In our simulations, Ud ~2 - 10 generates high within-population variance in relative fitness; two individuals will typically differ in fitness by 15-40%. Ud ~2 - 10 also slows net adaptation by ~13%-39%. Surprisingly, fixation rates are more sensitive to changes in the beneficial than the deleterious mutation rate, e.g. a 10% increase in overall mutation rate leads to faster adaptation; this puts to rest dysgenic fears about increasing mutation rates due to rising paternal age.
Collapse
Affiliation(s)
- Joseph Matheson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
- Department of Ecology, Behavior, and Evolution, University of California San Diego, San Diego, CA, 92093, USA
| | - Jason Bertram
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
- Department of Mathematics, University of Western Ontario, London ON, Canada
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
| |
Collapse
|
18
|
Rougemont Q, Leroy T, Rondeau EB, Koop B, Bernatchez L. Allele surfing causes maladaptation in a Pacific salmon of conservation concern. PLoS Genet 2023; 19:e1010918. [PMID: 37683018 PMCID: PMC10545117 DOI: 10.1371/journal.pgen.1010918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 10/02/2023] [Accepted: 08/11/2023] [Indexed: 09/10/2023] Open
Abstract
How various factors, including demography, recombination or genome duplication, may impact the efficacy of natural selection and the burden of deleterious mutations, is a central question in evolutionary biology and genetics. In this study, we show that key evolutionary processes, including variations in i) effective population size (Ne) ii) recombination rates and iii) chromosome inheritance, have influenced the genetic load and efficacy of selection in Coho salmon (Oncorhynchus kisutch), a widely distributed salmonid species on the west coast of North America. Using whole genome resequencing data from 14 populations at different migratory distances from their southern glacial refugium, we found evidence supporting gene surfing, wherein reduced Ne at the postglacial recolonization front, leads to a decrease in the efficacy of selection and a surf of deleterious alleles in the northernmost populations. Furthermore, our results indicate that recombination rates play a prime role in shaping the load along the genome. Additionally, we identified variation in polyploidy as a contributing factor to within-genome variation of the load. Overall, our results align remarkably well with expectations under the nearly neutral theory of molecular evolution. We discuss the fundamental and applied implications of these findings for evolutionary and conservation genomics.
Collapse
Affiliation(s)
- Quentin Rougemont
- Centre d’Ecologie Fonctionnelle et Evolutive, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France
| | - Thibault Leroy
- GenPhySE, INRAE, INP, ENVT, Université de Toulouse, Auzeville- Tolosane, France
| | - Eric B. Rondeau
- Department of Fisheries and Ocean, Pacific Biological Station, Nanaimo, Canada
| | - Ben Koop
- Department of Biology, University of Victoria, Victoria, Canada
| | - Louis Bernatchez
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
| |
Collapse
|
19
|
Wang N, Cao S, Liu Z, Xiao H, Hu J, Xu X, Chen P, Ma Z, Ye J, Chai L, Guo W, Larkin RM, Xu Q, Morrell PL, Zhou Y, Deng X. Genomic conservation of crop wild relatives: A case study of citrus. PLoS Genet 2023; 19:e1010811. [PMID: 37339133 DOI: 10.1371/journal.pgen.1010811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 06/01/2023] [Indexed: 06/22/2023] Open
Abstract
Conservation of crop wild relatives is critical for plant breeding and food security. The lack of clarity on the genetic factors that lead to endangered status or extinction create difficulties when attempting to develop concrete recommendations for conserving a citrus wild relative: the wild relatives of crops. Here, we evaluate the conservation of wild kumquat (Fortunella hindsii) using genomic, geographical, environmental, and phenotypic data, and forward simulations. Genome resequencing data from 73 accessions from the Fortunella genus were combined to investigate population structure, demography, inbreeding, introgression, and genetic load. Population structure was correlated with reproductive type (i.e., sexual and apomictic) and with a significant differentiation within the sexually reproducing population. The effective population size for one of the sexually reproducing subpopulations has recently declined to ~1,000, resulting in high levels of inbreeding. In particular, we found that 58% of the ecological niche overlapped between wild and cultivated populations and that there was extensive introgression into wild samples from cultivated populations. Interestingly, the introgression pattern and accumulation of genetic load may be influenced by the type of reproduction. In wild apomictic samples, the introgressed regions were primarily heterozygous, and genome-wide deleterious variants were hidden in the heterozygous state. In contrast, wild sexually reproducing samples carried a higher recessive deleterious burden. Furthermore, we also found that sexually reproducing samples were self-incompatible, which prevented the reduction of genetic diversity by selfing. Our population genomic analyses provide specific recommendations for distinct reproductive types and monitoring during conservation. This study highlights the genomic landscape of a wild relative of citrus and provides recommendations for the conservation of crop wild relatives.
Collapse
Affiliation(s)
- Nan Wang
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Shuo Cao
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhongjie Liu
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Hua Xiao
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Jianbing Hu
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
| | - Xiaodong Xu
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Peng Chen
- Institute of Horticultural Research, Hunan Academy of Agricultural Sciences, Changsha, China
| | - Zhiyao Ma
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Junli Ye
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
| | - Lijun Chai
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
| | - Wenwu Guo
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Robert M Larkin
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Qiang Xu
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| | - Peter L Morrell
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota, United States of America
| | - Yongfeng Zhou
- State Key Laboratory of Tropical Crop Breeding, Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Key Laboratory of Synthetic Biology, Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
- State Key Laboratory of Tropical Crop Breeding, Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Xiuxin Deng
- National Key Laboratory for Germplasm Innovation & Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China
- Hubei Hongshan Laboratory, Wuhan, China
| |
Collapse
|
20
|
Gao H, Hamp T, Ede J, Schraiber JG, McRae J, Singer-Berk M, Yang Y, Dietrich ASD, Fiziev PP, Kuderna LFK, Sundaram L, Wu Y, Adhikari A, Field Y, Chen C, Batzoglou S, Aguet F, Lemire G, Reimers R, Balick D, Janiak MC, Kuhlwilm M, Orkin JD, Manu S, Valenzuela A, Bergman J, Rousselle M, Silva FE, Agueda L, Blanc J, Gut M, de Vries D, Goodhead I, Harris RA, Raveendran M, Jensen A, Chuma IS, Horvath JE, Hvilsom C, Juan D, Frandsen P, de Melo FR, Bertuol F, Byrne H, Sampaio I, Farias I, do Amaral JV, Messias M, da Silva MNF, Trivedi M, Rossi R, Hrbek T, Andriaholinirina N, Rabarivola CJ, Zaramody A, Jolly CJ, Phillips-Conroy J, Wilkerson G, Abee C, Simmons JH, Fernandez-Duque E, Kanthaswamy S, Shiferaw F, Wu D, Zhou L, Shao Y, Zhang G, Keyyu JD, Knauf S, Le MD, Lizano E, Merker S, Navarro A, Bataillon T, Nadler T, Khor CC, Lee J, Tan P, Lim WK, Kitchener AC, Zinner D, Gut I, Melin A, Guschanski K, Schierup MH, Beck RMD, Umapathy G, Roos C, Boubli JP, Lek M, Sunyaev S, O'Donnell-Luria A, Rehm HL, Xu J, Rogers J, Marques-Bonet T, Farh KKH. The landscape of tolerated genetic variation in humans and primates. Science 2023; 380:eabn8153. [PMID: 37262156 DOI: 10.1126/science.abn8197] [Citation(s) in RCA: 52] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Accepted: 03/22/2023] [Indexed: 06/03/2023]
Abstract
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 million common protein-altering variants with orthologs in humans. We show that these variants can be inferred to have nondeleterious effects in humans based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases.
Collapse
Affiliation(s)
- Hong Gao
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Tobias Hamp
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Jeffrey Ede
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Joshua G Schraiber
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Jeremy McRae
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Moriel Singer-Berk
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
| | - Yanshen Yang
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | | | - Petko P Fiziev
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Lukas F K Kuderna
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Laksshman Sundaram
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Yibing Wu
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Aashish Adhikari
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Yair Field
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Chen Chen
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Serafim Batzoglou
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Francois Aguet
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| | - Gabrielle Lemire
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Rebecca Reimers
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
| | - Daniel Balick
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Mareike C Janiak
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Martin Kuhlwilm
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Department of Evolutionary Anthropology, University of Vienna, Djerassiplatz 1, 1030 Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna, 1030 Vienna, Austria
| | - Joseph D Orkin
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Département d'anthropologie, Université de Montréal, 3150 Jean-Brillant, Montréal, QC H3T 1N8, Canada
| | - Shivakumara Manu
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology, Hyderabad 500007, India
| | - Alejandro Valenzuela
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Juraj Bergman
- Bioinformatics Research Centre, Aarhus University, Aarhus 8000, Denmark
- Section for Ecoinformatics & Biodiversity, Department of Biology, Aarhus University, 8000 Aarhus, Denmark
| | | | - Felipe Ennes Silva
- Research Group on Primate Biology and Conservation, Mamirauá Institute for Sustainable Development, Estrada da Bexiga 2584, Tefé, Amazonas, CEP 69553-225, Brazil
- Evolutionary Biology and Ecology (EBE), Département de Biologie des Organismes, Université libre de Bruxelles (ULB), Av. Franklin D. Roosevelt 50, CP 160/12, B-1050 Brussels, Belgium
| | - Lidia Agueda
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Julie Blanc
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
| | - Dorien de Vries
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Ian Goodhead
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - R Alan Harris
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Muthuswamy Raveendran
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Axel Jensen
- Department of Ecology and Genetics, Animal Ecology, Uppsala University, SE-75236 Uppsala, Sweden
| | | | - Julie E Horvath
- North Carolina Museum of Natural Sciences, Raleigh, NC 27601, USA
- Department of Biological and Biomedical Sciences, North Carolina Central University, Durham, NC 27707, USA
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
- Department of Evolutionary Anthropology, Duke University, Durham, NC 27708, USA
- Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | | | - David Juan
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | | | - Fabrício Bertuol
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL), Manaus, Amazonas, 69080-900, Brazil
| | - Hazel Byrne
- Department of Anthropology, University of Utah, Salt Lake City, UT 84102, USA
| | - Iracilda Sampaio
- Universidade Federal do Para, Guamá, Belém - PA, 66075-110, Brazil
| | - Izeni Farias
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL), Manaus, Amazonas, 69080-900, Brazil
| | - João Valsecchi do Amaral
- Research Group on Terrestrial Vertebrate Ecology, Mamirauá Institute for Sustainable Development, Tefé, Amazonas, 69553-225, Brazil
- Rede de Pesquisa para Estudos sobre Diversidade, Conservação e Uso da Fauna na Amazônia - RedeFauna, Manaus, Amazonas, 69080-900, Brazil
- Comunidad de Manejo de Fauna Silvestre en la Amazonía y en Latinoamérica - ComFauna, Iquitos, Loreto, 16001, Peru
| | - Mariluce Messias
- Universidade Federal de Rondonia, Porto Velho, Rondônia, 78900-000, Brazil
- PPGREN - Programa de Pós-Graduação "Conservação e Uso dos Recursos Naturais and BIONORTE - Programa de Pós-Graduação em Biodiversidade e Biotecnologia da Rede BIONORTE, Universidade Federal de Rondonia, Porto Velho, Rondônia, 78900-000, Brazil
| | - Maria N F da Silva
- Instituto Nacional de Pesquisas da Amazonia, Petrópolis, Manaus - AM, 69067-375, Brazil
| | - Mihir Trivedi
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology, Hyderabad 500007, India
| | - Rogerio Rossi
- Universidade Federal do Mato Grosso, Boa Esperança, Cuiabá - MT, 78060-900, Brazil
| | - Tomas Hrbek
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL), Manaus, Amazonas, 69080-900, Brazil
- Department of Biology, Trinity University, San Antonio, TX 78212, USA
| | - Nicole Andriaholinirina
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga, Mahajanga, 401, Madagascar
| | - Clément J Rabarivola
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga, Mahajanga, 401, Madagascar
| | - Alphonse Zaramody
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga, Mahajanga, 401, Madagascar
| | | | | | - Gregory Wilkerson
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Christian Abee
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Joe H Simmons
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Eduardo Fernandez-Duque
- Yale University, New Haven, CT 06520, USA
- Universidad Nacional de Formosa, Argentina Fundacion ECO, Formosa, Argentina
| | | | - Fekadu Shiferaw
- Guinea Worm Eradication Program, The Carter Center Ethiopia, PoB 16316, Addis Ababa 1000, Ethiopia
| | - Dongdong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Long Zhou
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou 310058, China
| | - Yong Shao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Guojie Zhang
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou 310058, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
- Liangzhu Laboratory, Zhejiang University Medical Center, 1369 West Wenyi Road, Hangzhou 311121, China
- Women's Hospital, School of Medicine, Zhejiang University, 1 Xueshi Road, Shangcheng District, Hangzhou 310006, China
| | - Julius D Keyyu
- Tanzania Wildlife Research Institute (TAWIRI), Head Office, P.O. Box 661, Arusha, Tanzania
| | - Sascha Knauf
- Institute of International Animal Health/One Health, Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health, 17493 Greifswald - Insei Riems, Germany
| | - Minh D Le
- Department of Environmental Ecology, Faculty of Environmental Sciences, University of Science and Central Institute for Natural Resources and Environmental Studies, Vietnam National University, Hanoi 100000, Vietnam
| | - Esther Lizano
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010 Barcelona, Spain
| | - Stefan Merker
- Department of Zoology, State Museum of Natural History Stuttgart, 70191 Stuttgart, Germany
| | - Arcadi Navarro
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Av. Doctor Aiguader, N88, 08003 Barcelona, Spain
- BarcelonaBeta Brain Research Center, Pasqual Maragall Foundation, C. Wellington 30, 08005 Barcelona, Spain
| | - Thomas Bataillon
- Bioinformatics Research Centre, Aarhus University, Aarhus 8000, Denmark
| | - Tilo Nadler
- Cuc Phuong Commune, Nho Quan District, Ninh Binh Province 430000, Vietnam
| | - Chiea Chuen Khor
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
| | - Jessica Lee
- Mandai Nature, 80 Mandai Lake Road, Singapore 729826, Republic of Singapore
| | - Patrick Tan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM), Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School, Singapore 168582, Republic of Singapore
| | - Weng Khong Lim
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM), Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School, Singapore 168582, Republic of Singapore
- SingHealth Duke-NUS Genomic Medicine Centre, Singapore 168582, Republic of Singapore
| | - Andrew C Kitchener
- Department of Natural Sciences, National Museums Scotland, Chambers Street, Edinburgh EH1 1JF, UK
- School of Geosciences, University of Edinburgh, Drummond Street, Edinburgh EH8 9XP, UK
| | - Dietmar Zinner
- Cognitive Ethology Laboratory, Germany Primate Center, Leibniz Institute for Primate Research, 37077 Göttingen, Germany
- Department of Primate Cognition, Georg-August-Universität Göttingen, 37077 Göttingen, Germany
- Leibniz Science Campus Primate Cognition, 37077 Göttingen, Germany
| | - Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- Universitat Pompeu Fabra, Pg. Luís Companys 23, 08010 Barcelona, Spain
| | - Amanda Melin
- Department of Anthropology & Archaeology, University of Calgary, 2500 University Dr NW, Calgary, AB T2N 1N4, Canada
- Department of Medical Genetics, 3330 Hospital Drive NW, HMRB 202, Calgary, AB T2N 4N1, Canada
- Alberta Children's Hospital Research Institute, University of Calgary, 2500 University Dr NW, Calgary, AB T2N 1N4, Canada
| | - Katerina Guschanski
- Department of Ecology and Genetics, Animal Ecology, Uppsala University, SE-75236 Uppsala, Sweden
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh EH8 9XP, UK
| | | | - Robin M D Beck
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Govindhaswamy Umapathy
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology, Hyderabad 500007, India
| | - Christian Roos
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Kellnerweg 4, 37077 Göttingen, Germany
| | - Jean P Boubli
- School of Science, Engineering & Environment, University of Salford, Salford M5 4WT, UK
| | - Monkol Lek
- Department of Genetics, Yale School of Medicine, New Haven, CT 06520, USA
| | - Shamil Sunyaev
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Anne O'Donnell-Luria
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Heidi L Rehm
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Boston, MA, 02142, USA
- Department of Genetics, Yale School of Medicine, New Haven, CT 06520, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Jinbo Xu
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
- Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri i Reixac 4, 08028 Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, 23, 08010 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Edifici ICTA-ICP, c/ Columnes s/n, 08193 Cerdanyola del Vallès, Barcelona, Spain
| | - Kyle Kai-How Farh
- Illumina Artificial Intelligence Laboratory, Illumina Inc., Foster City, CA, 94404, USA
| |
Collapse
|
21
|
Wang X, Peischl S, Heckel G. Demographic history and genomic consequences of 10,000 generations of isolation in a wild mammal. Curr Biol 2023; 33:2051-2062.e4. [PMID: 37178689 DOI: 10.1016/j.cub.2023.04.042] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 12/20/2022] [Accepted: 04/17/2023] [Indexed: 05/15/2023]
Abstract
Increased human activities caused the isolation of populations in many species-often associated with genetic depletion and negative fitness effects. The effects of isolation are predicted by theory, but long-term data from natural populations are scarce. We show, with full genome sequences, that common voles (Microtus arvalis) in the Orkney archipelago have remained genetically isolated from conspecifics in continental Europe since their introduction by humans over 5,000 years ago. Modern Orkney vole populations are genetically highly differentiated from continental conspecifics as a result of genetic drift processes. Colonization likely started on the biggest Orkney island and vole populations on smaller islands were gradually split off, without signs of secondary admixture. Despite having large modern population sizes, Orkney voles are genetically depauperate and successive introductions to smaller islands resulted in further reduction of genetic diversity. We detected high levels of fixation of predicted deleterious variation compared with continental populations, particularly on smaller islands, yet the fitness effects realized in nature are unknown. Simulations showed that predominantly mildly deleterious mutations were fixed in populations, while highly deleterious mutations were purged early in the history of the Orkney population. Relaxation of selection overall due to benign environmental conditions on the islands and the effects of soft selection may have contributed to the repeated, successful establishment of Orkney voles despite potential fitness loss. Furthermore, the specific life history of these small mammals, resulting in relatively large population sizes, has probably been important for their long-term persistence in full isolation.
Collapse
Affiliation(s)
- Xuejing Wang
- Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland
| | - Stephan Peischl
- Interfaculty Bioinformatics Unit, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland; Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland
| | - Gerald Heckel
- Institute of Ecology and Evolution, University of Bern, Baltzerstrasse 6, 3012 Bern, Switzerland; Swiss Institute of Bioinformatics, Amphipôle, Quartier UNIL-Sorge, 1015 Lausanne, Switzerland.
| |
Collapse
|
22
|
Gao H, Hamp T, Ede J, Schraiber JG, McRae J, Singer-Berk M, Yang Y, Dietrich A, Fiziev P, Kuderna L, Sundaram L, Wu Y, Adhikari A, Field Y, Chen C, Batzoglou S, Aguet F, Lemire G, Reimers R, Balick D, Janiak MC, Kuhlwilm M, Orkin JD, Manu S, Valenzuela A, Bergman J, Rouselle M, Silva FE, Agueda L, Blanc J, Gut M, de Vries D, Goodhead I, Harris RA, Raveendran M, Jensen A, Chuma IS, Horvath J, Hvilsom C, Juan D, Frandsen P, de Melo FR, Bertuol F, Byrne H, Sampaio I, Farias I, do Amaral JV, Messias M, da Silva MNF, Trivedi M, Rossi R, Hrbek T, Andriaholinirina N, Rabarivola CJ, Zaramody A, Jolly CJ, Phillips-Conroy J, Wilkerson G, Abee C, Simmons JH, Fernandez-Duque E, Kanthaswamy S, Shiferaw F, Wu D, Zhou L, Shao Y, Zhang G, Keyyu JD, Knauf S, Le MD, Lizano E, Merker S, Navarro A, Batallion T, Nadler T, Khor CC, Lee J, Tan P, Lim WK, Kitchener AC, Zinner D, Gut I, Melin A, Guschanski K, Schierup MH, Beck RMD, Umapathy G, Roos C, Boubli JP, Lek M, Sunyaev S, O’Donnell A, Rehm H, Xu J, Rogers J, Marques-Bonet T, Kai-How Farh K. The landscape of tolerated genetic variation in humans and primates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.01.538953. [PMID: 37205491 PMCID: PMC10187174 DOI: 10.1101/2023.05.01.538953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole genome sequencing data for 809 individuals from 233 primate species, and identified 4.3 million common protein-altering variants with orthologs in human. We show that these variants can be inferred to have non-deleterious effects in human based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases. One Sentence Summary Deep learning classifier trained on 4.3 million common primate missense variants predicts variant pathogenicity in humans.
Collapse
Affiliation(s)
- Hong Gao
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Tobias Hamp
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Jeffrey Ede
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Joshua G. Schraiber
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Jeremy McRae
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Moriel Singer-Berk
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
| | - Yanshen Yang
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Anastasia Dietrich
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Petko Fiziev
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Lukas Kuderna
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Laksshman Sundaram
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Yibing Wu
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Aashish Adhikari
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Yair Field
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Chen Chen
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Serafim Batzoglou
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Francois Aguet
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| | - Gabrielle Lemire
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Rebecca Reimers
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Daniel Balick
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Mareike C. Janiak
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Martin Kuhlwilm
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Department of Evolutionary Anthropology, University of Vienna; Djerassiplatz 1, 1030, Vienna, Austria
- Human Evolution and Archaeological Sciences (HEAS), University of Vienna; 1030, Vienna, Austria
| | - Joseph D. Orkin
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Département d’anthropologie, Université de Montréal; 3150 Jean-Brillant, Montréal, QC, H3T 1N8, Canada
| | - Shivakumara Manu
- Academy of Scientific and Innovative Research (AcSIR); Ghaziabad, 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology; Hyderabad, 500007, India
| | - Alejandro Valenzuela
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Juraj Bergman
- Bioinformatics Research Centre, Aarhus University; Aarhus, 8000, Denmark
- Section for Ecoinformatics & Biodiversity, Department of Biology, Aarhus University; Aarhus, 8000, Denmark
| | | | - Felipe Ennes Silva
- Research Group on Primate Biology and Conservation, Mamirauá Institute for Sustainable Development; Estrada da Bexiga 2584, Tefé, Amazonas, CEP 69553-225, Brazil
- Faculty of Sciences, Department of Organismal Biology, Unit of Evolutionary Biology and Ecology, Université Libre de Bruxelles (ULB); Avenue Franklin D. Roosevelt 50, 1050, Brussels, Belgium
| | - Lidia Agueda
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
| | - Julie Blanc
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
| | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
| | - Dorien de Vries
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Ian Goodhead
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - R. Alan Harris
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine; Houston, Texas, 77030, USA
| | - Muthuswamy Raveendran
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine; Houston, Texas, 77030, USA
| | - Axel Jensen
- Department of Ecology and Genetics, Animal Ecology, Uppsala University; SE-75236, Uppsala, Sweden
| | | | - Julie Horvath
- North Carolina Museum of Natural Sciences; Raleigh, North Carolina, 27601, USA
- Department of Biological and Biomedical Sciences, North Carolina Central University; Durham, North Carolina , 27707, USA
- Department of Biological Sciences, North Carolina State University; Raleigh, North Carolina , 27695, USA
- Department of Evolutionary Anthropology, Duke University; Durham, North Carolina , 27708, USA
- Renaissance Computing Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | | | - David Juan
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
| | | | | | - Fabricio Bertuol
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL); Manaus, Amazonas, 69080-900, Brazil
| | - Hazel Byrne
- Department of Anthropology, University of Utah; Salt Lake City, Utah, 84102, USA
| | - Iracilda Sampaio
- Universidade Federal do Para; Guamá, Belém - PA, 66075-110, Brazil
| | - Izeni Farias
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL); Manaus, Amazonas, 69080-900, Brazil
| | - João Valsecchi do Amaral
- Research Group on Terrestrial Vertebrate Ecology, Mamirauá Institute for Sustainable Development; Tefé, Amazonas, 69553-225, Brazil
- Rede de Pesquisa para Estudos sobre Diversidade, Conservação e Uso da Fauna na Amazônia – RedeFauna; Manaus, Amazonas, 69080-900, Brazil
- Comunidad de Manejo de Fauna Silvestre en la Amazonía y en Latinoamérica – ComFauna; Iquitos, Loreto, 16001, Peru
| | - Mariluce Messias
- Universidade Federal de Rondonia; Porto Velho, Rondônia, 78900-000, Brazil
- PPGREN - Programa de Pós-Graduação “Conservação e Uso dos Recursos Naturais and BIONORTE - Programa de Pós-Graduação em Biodiversidade e Biotecnologia da Rede BIONORTE, Universidade Federal de Rondonia; Porto Velho, Rondônia, 78900-000, Brazil
| | - Maria N. F. da Silva
- Instituto Nacional de Pesquisas da Amazonia; Petrópolis, Manaus - AM, 69067-375, Brazil
| | - Mihir Trivedi
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology; Hyderabad, 500007, India
| | - Rogerio Rossi
- Universidade Federal do Mato Grosso; Boa Esperança, Cuiabá - MT, 78060-900, Brazil
| | - Tomas Hrbek
- Universidade Federal do Amazonas, Departamento de Genética, Laboratório de Evolução e Genética Animal (LEGAL); Manaus, Amazonas, 69080-900, Brazil
- Department of Biology, Trinity University; San Antonio, Texas, 78212, USA
| | - Nicole Andriaholinirina
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga; Mahajanga, 401, Madagascar
| | - Clément J. Rabarivola
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga; Mahajanga, 401, Madagascar
| | - Alphonse Zaramody
- Life Sciences and Environment, Technology and Environment of Mahajanga, University of Mahajanga; Mahajanga, 401, Madagascar
| | | | | | - Gregory Wilkerson
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center; Houston, Texas, 77030, USA
| | | | - Joe H. Simmons
- Keeling Center for Comparative Medicine and Research, MD Anderson Cancer Center; Houston, Texas, 77030, USA
| | - Eduardo Fernandez-Duque
- Yale University; New Haven, Connecticut, 06520, USA
- Universidad Nacional de Formosa, Argentina Fundacion ECO, Formosa, Argentina
| | | | | | - Dongdong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences; Kunming, Yunnan, 650223, China
| | - Long Zhou
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Yong Shao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences; Kunming, Yunnan, 650223, China
| | - Guojie Zhang
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Villum Center for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen; Copenhagen, DK-2100, Denmark
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
- Liangzhu Laboratory, Zhejiang University Medical Center; 1369 West Wenyi Road, Hangzhou, 311121, China
- Women’s Hospital, School of Medicine, Zhejiang University; 1 Xueshi Road, Shangcheng District, Hangzhou, 310006, China
| | - Julius D. Keyyu
- Tanzania Wildlife Research Institute (TAWIRI), Head Office; P.O.Box 661, Arusha, Tanzania
| | - Sascha Knauf
- Institute of International Animal Health/One Health, Friedrich-Loeffler-Institut, Federal Research Institute for Animal Health; 17493 Greifswald - Isle of Riems, Germany
| | - Minh D. Le
- Department of Environmental Ecology, Faculty of Environmental Sciences, University of Science and Central Institute for Natural Resources and Environmental Studies, Vietnam National University; Hanoi, 100000, Vietnam
| | - Esther Lizano
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
| | - Stefan Merker
- Department of Zoology, State Museum of Natural History Stuttgart; 70191 Stuttgart, Germany
| | - Arcadi Navarro
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA) and Universitat Pompeu Fabra, Pg. Luís Companys 23, Barcelona, 08010, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology; Av. Doctor Aiguader, N88, Barcelona, 08003, Spain
- BarcelonaBeta Brain Research Center, Pasqual Maragall Foundation; C. Wellington 30, Barcelona, 08005, Spain
| | - Thomas Batallion
- Bioinformatics Research Centre, Aarhus University; Aarhus, 8000, Denmark
| | - Tilo Nadler
- Cuc Phuong Commune; Nho Quan District, Ninh Binh Province, 430000, Vietnam
| | - Chiea Chuen Khor
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
| | - Jessica Lee
- Mandai Nature; 80 Mandai Lake Road, Singapore 729826, Republic of Singapore
| | - Patrick Tan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Genome, Singapore 138672, Republic of Singapore
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM); Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School; Singapore 168582, Republic of Singapore
| | - Weng Khong Lim
- SingHealth Duke-NUS Institute of Precision Medicine (PRISM); Singapore 168582, Republic of Singapore
- Cancer and Stem Cell Biology Program, Duke-NUS Medical School; Singapore 168582, Republic of Singapore
- SingHealth Duke-NUS Genomic Medicine Centre; Singapore 168582, Republic of Singapore
| | - Andrew C. Kitchener
- Department of Natural Sciences, National Museums Scotland; Chambers Street, Edinburgh, EH1 1JF, UK
- School of Geosciences, University of Edinburgh; Drummond Street, Edinburgh, EH8 9XP, UK
| | - Dietmar Zinner
- Cognitive Ethology Laboratory, Germany Primate Center, Leibniz Institute for Primate Research; 37077 Göttingen, Germany
- Department of Primate Cognition, Georg-August-Universität Göttingen; 37077 Göttingen, Germany
| | - Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
- Universitat Pompeu Fabra, Pg. Luís Companys 23, Barcelona, 08010, Spain
| | - Amanda Melin
- Leibniz Science Campus Primate Cognition; 37077 Göttingen, Germany
- Department of Anthropology & Archaeology and Department of Medical Genetics
| | - Katerina Guschanski
- Department of Ecology and Genetics, Animal Ecology, Uppsala University; SE-75236, Uppsala, Sweden
- Alberta Children’s Hospital Research Institute; University of Calgary; 2500 University Dr NW T2N 1N4, Calgary, Alberta, Canada
| | | | - Robin M. D. Beck
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Govindhaswamy Umapathy
- Academy of Scientific and Innovative Research (AcSIR); Ghaziabad, 201002, India
- Laboratory for the Conservation of Endangered Species, CSIR-Centre for Cellular and Molecular Biology; Hyderabad, 500007, India
| | - Christian Roos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh; Edinburgh, EH8 9XP, UK
| | - Jean P. Boubli
- School of Science, Engineering & Environment, University of Salford; Salford, M5 4WT, United Kingdom
| | - Monkol Lek
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research; Kellnerweg 4, 37077 Göttingen, Germany
| | - Shamil Sunyaev
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
- Department of Genetics, Yale School of Medicine; New Haven, Connecticut, 06520, USA
| | - Anne O’Donnell
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School; Boston, Massachusetts, 02115, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, 02115, USA
| | - Heidi Rehm
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard; Boston, Massachusetts, 02142, USA
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital and Harvard Medical School; Boston, Massachusetts, 02115, USA
| | - Jinbo Xu
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
- Toyota Technological Institute at Chicago; Chicago, Illinois, 60637, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center and Department of Molecular and Human Genetics, Baylor College of Medicine; Houston, Texas, 77030, USA
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC); PRBB, Dr. Aiguader 88, 08003 Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST); Baldiri i Reixac 4, 08028, Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain; Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA) and Universitat Pompeu Fabra, Pg. Luís Companys 23, Barcelona, 08010, Spain
| | - Kyle Kai-How Farh
- Illumina Artificial Intelligence Laboratory, Illumina Inc.; Foster City, California, 94404, USA
| |
Collapse
|
23
|
Shi Y, Niu Y, Zhang P, Luo H, Liu S, Zhang S, Wang J, Li Y, Liu X, Song T, Xu T, He S. Characterization of genome-wide STR variation in 6487 human genomes. Nat Commun 2023; 14:2092. [PMID: 37045857 PMCID: PMC10097659 DOI: 10.1038/s41467-023-37690-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 03/27/2023] [Indexed: 04/14/2023] Open
Abstract
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3'UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
Collapse
Affiliation(s)
- Yirong Shi
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yiwei Niu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Peng Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Huaxia Luo
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Shuai Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Sijia Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jiajia Wang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yanyan Li
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Xinyue Liu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Tingrui Song
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Tao Xu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan, 250117, Shandong, China.
| | - Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
24
|
Robinson J, Kyriazis CC, Yuan SC, Lohmueller KE. Deleterious Variation in Natural Populations and Implications for Conservation Genetics. Annu Rev Anim Biosci 2023; 11:93-114. [PMID: 36332644 PMCID: PMC9933137 DOI: 10.1146/annurev-animal-080522-093311] [Citation(s) in RCA: 37] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Deleterious mutations decrease reproductive fitness and are ubiquitous in genomes. Given that many organisms face ongoing threats of extinction, there is interest in elucidating the impact of deleterious variation on extinction risk and optimizing management strategies accounting for such mutations. Quantifying deleterious variation and understanding the effects of population history on deleterious variation are complex endeavors because we do not know the strength of selection acting on each mutation. Further, the effect of demographic history on deleterious mutations depends on the strength of selection against the mutation and the degree of dominance. Here we clarify how deleterious variation can be quantified and studied in natural populations. We then discuss how different demographic factors, such as small population size, nonequilibrium population size changes, inbreeding, and gene flow, affect deleterious variation. Lastly, we provide guidance on studying deleterious variation in nonmodel populations of conservation concern.
Collapse
Affiliation(s)
- Jacqueline Robinson
- Institute for Human Genetics, University of California, San Francisco, California, USA;
| | - Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California, USA; , ,
| | - Stella C Yuan
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California, USA; , ,
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California, USA; , , .,Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California, USA
| |
Collapse
|
25
|
Kyriazis CC, Beichman AC, Brzeski KE, Hoy SR, Peterson RO, Vucetich JA, Vucetich LM, Lohmueller KE, Wayne RK. Genomic Underpinnings of Population Persistence in Isle Royale Moose. Mol Biol Evol 2023; 40:msad021. [PMID: 36729989 PMCID: PMC9927576 DOI: 10.1093/molbev/msad021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 01/20/2023] [Accepted: 01/25/2023] [Indexed: 02/03/2023] Open
Abstract
Island ecosystems provide natural laboratories to assess the impacts of isolation on population persistence. However, most studies of persistence have focused on a single species, without comparisons to other organisms they interact with in the ecosystem. The case study of moose and gray wolves on Isle Royale allows for a direct contrast of genetic variation in isolated populations that have experienced dramatically differing population trajectories over the past decade. Whereas the Isle Royale wolf population recently declined nearly to extinction due to severe inbreeding depression, the moose population has thrived and continues to persist, despite having low genetic diversity and being isolated for ∼120 years. Here, we examine the patterns of genomic variation underlying the continued persistence of the Isle Royale moose population. We document high levels of inbreeding in the population, roughly as high as the wolf population at the time of its decline. However, inbreeding in the moose population manifests in the form of intermediate-length runs of homozygosity suggestive of historical inbreeding and purging, contrasting with the long runs of homozygosity observed in the smaller wolf population. Using simulations, we confirm that substantial purging has likely occurred in the moose population. However, we also document notable increases in genetic load, which could eventually threaten population viability over the long term. Overall, our results demonstrate a complex relationship between inbreeding, genetic diversity, and population viability that highlights the use of genomic datasets and computational simulation tools for understanding the factors enabling persistence in isolated populations.
Collapse
Affiliation(s)
- Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA
| | | | - Kristin E Brzeski
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI
| | - Sarah R Hoy
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI
| | - Rolf O Peterson
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI
| | - John A Vucetich
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI
| | - Leah M Vucetich
- College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - Robert K Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA
| |
Collapse
|
26
|
Agarwal I, Fuller ZL, Myers SR, Przeworski M. Relating pathogenic loss-of-function mutations in humans to their evolutionary fitness costs. eLife 2023; 12:e83172. [PMID: 36648429 PMCID: PMC9937649 DOI: 10.7554/elife.83172] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 01/16/2023] [Indexed: 01/18/2023] Open
Abstract
Causal loss-of-function (LOF) variants for Mendelian and severe complex diseases are enriched in 'mutation intolerant' genes. We show how such observations can be interpreted in light of a model of mutation-selection balance and use the model to relate the pathogenic consequences of LOF mutations at present to their evolutionary fitness effects. To this end, we first infer posterior distributions for the fitness costs of LOF mutations in 17,318 autosomal and 679 X-linked genes from exome sequences in 56,855 individuals. Estimated fitness costs for the loss of a gene copy are typically above 1%; they tend to be largest for X-linked genes, whether or not they have a Y homolog, followed by autosomal genes and genes in the pseudoautosomal region. We compare inferred fitness effects for all possible de novo LOF mutations to those of de novo mutations identified in individuals diagnosed with one of six severe, complex diseases or developmental disorders. Probands carry an excess of mutations with estimated fitness effects above 10%; as we show by simulation, when sampled in the population, such highly deleterious mutations are typically only a couple of generations old. Moreover, the proportion of highly deleterious mutations carried by probands reflects the typical age of onset of the disease. The study design also has a discernible influence: a greater proportion of highly deleterious mutations is detected in pedigree than case-control studies, and for autism, in simplex than multiplex families and in female versus male probands. Thus, anchoring observations in human genetics to a population genetic model allows us to learn about the fitness effects of mutations identified by different mapping strategies and for different traits.
Collapse
Affiliation(s)
- Ipsita Agarwal
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Statistics, University of OxfordOxfordUnited Kingdom
| | - Zachary L Fuller
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
| | - Simon R Myers
- Department of Statistics, University of OxfordOxfordUnited Kingdom
- The Wellcome Centre for Human Genetics, University of OxfordOxfordUnited Kingdom
| | - Molly Przeworski
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Systems Biology, Columbia UniversityNew YorkUnited States
| |
Collapse
|
27
|
Moreira LR, Klicka J, Smith BT. Demography and linked selection interact to shape the genomic landscape of codistributed woodpeckers during the Ice Age. Mol Ecol 2023; 32:1739-1759. [PMID: 36617622 DOI: 10.1111/mec.16841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 12/13/2022] [Accepted: 12/20/2022] [Indexed: 01/10/2023]
Abstract
The influence of genetic drift on population dynamics during Pleistocene glacial cycles is well understood, but the role of selection in shaping patterns of genomic variation during these events is less explored. We resequenced whole genomes to investigate how demography and natural selection interact to generate the genomic landscapes of Downy and Hairy Woodpecker, species codistributed in previously glaciated North America. First, we explored the spatial and temporal patterns of genomic diversity produced by neutral evolution. Next, we tested (i) whether levels of nucleotide diversity along the genome are correlated with intrinsic genomic properties, such as recombination rate and gene density, and (ii) whether different demographic trajectories impacted the efficacy of selection. Our results revealed cycles of bottleneck and expansion, and genetic structure associated with glacial refugia. Nucleotide diversity varied widely along the genome, but this variation was highly correlated between the species, suggesting the presence of conserved genomic features. In both taxa, nucleotide diversity was positively correlated with recombination rate and negatively correlated with gene density, suggesting that linked selection played a role in reducing diversity. Despite strong fluctuations in effective population size, the maintenance of relatively large populations during glaciations may have facilitated selection. Under these conditions, we found evidence that the individual demographic trajectory of populations modulated linked selection, with purifying selection being more efficient in removing deleterious alleles in large populations. These results highlight that while genome-wide variation reflects the expected signature of demographic change during climatic perturbations, the interaction of multiple processes produces a predictable and highly heterogeneous genomic landscape.
Collapse
Affiliation(s)
- Lucas R Moreira
- Department of Ecology, Evolution, and Environmental Biology, Columbia University, New York, New York, USA.,Department of Ornithology, American Museum of Natural History, New York City, New York, USA.,Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, USA
| | - John Klicka
- Burke Museum of Natural History and Culture and Department of Biology, University of Washington, Seattle, Washington, USA
| | - Brian Tilston Smith
- Department of Ornithology, American Museum of Natural History, New York City, New York, USA
| |
Collapse
|
28
|
Beichman AC, Kalhori P, Kyriazis CC, DeVries AA, Nigenda-Morales S, Heckel G, Schramm Y, Moreno-Estrada A, Kennett DJ, Hylkema M, Bodkin J, Koepfli KP, Lohmueller KE, Wayne RK. Genomic analyses reveal range-wide devastation of sea otter populations. Mol Ecol 2023; 32:281-298. [PMID: 34967471 PMCID: PMC9875727 DOI: 10.1111/mec.16334] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 12/02/2021] [Accepted: 12/23/2021] [Indexed: 01/28/2023]
Abstract
The genetic consequences of species-wide declines are rarely quantified because the timing and extent of the decline varies across the species' range. The sea otter (Enhydra lutris) is a unique model in this regard. Their dramatic decline from thousands to fewer than 100 individuals per population occurred range-wide and nearly simultaneously due to the 18th-19th century fur trade. Consequently, each sea otter population represents an independent natural experiment of recovery after extreme population decline. We designed sequence capture probes for 50 Mb of sea otter exonic and neutral genomic regions. We sequenced 107 sea otters from five populations that span the species range to high coverage (18-76×) and three historical Californian samples from ~1500 and ~200 years ago to low coverage (1.5-3.5×). We observe distinct population structure and find that sea otters in California are the last survivors of a divergent lineage isolated for thousands of years and therefore warrant special conservation concern. We detect signals of extreme population decline in every surviving sea otter population and use this demographic history to design forward-in-time simulations of coding sequence. Our simulations indicate that this decline could lower the fitness of recovering populations for generations. However, the simulations also demonstrate how historically low effective population sizes prior to the fur trade may have mitigated the effects of population decline on genetic health. Our comprehensive approach shows how demographic inference from genomic data, coupled with simulations, allows assessment of extinction risk and different models of recovery.
Collapse
Affiliation(s)
- Annabel C. Beichman
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA
| | - Pooneh Kalhori
- Department of Biology, San Francisco State University, San Francisco, CA 94132, USA
| | - Christopher C. Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA
| | - Amber A. DeVries
- Center for Bioinformatics and Functional Genomics, Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048, USA
| | - Sergio Nigenda-Morales
- National Laboratory of Genomics for Biodiversity, Unit of Advanced Genomics (LANGEBIO), CINVESTAV, Irapuato, Guanajuato 36824, Mexico
| | - Gisela Heckel
- Centro de Investigación Científica y de Educación Superior de Ensenada (Ensenada Center for Scientific Research and Higher Education), Ensenada, Baja California 22860, Mexico
| | - Yolanda Schramm
- Universidad Autónoma de Baja California (Autonomous University of Baja California), Ensenada, Baja California 22860, Mexico
| | - Andrés Moreno-Estrada
- National Laboratory of Genomics for Biodiversity, Unit of Advanced Genomics (LANGEBIO), CINVESTAV, Irapuato, Guanajuato 36824, Mexico
| | - Douglas J. Kennett
- Department of Anthropology, University of California, Santa Barbara, CA 93106, USA
| | - Mark Hylkema
- Cultural Resources Program Manager and Tribal Liaison/Archaeologist, Santa Cruz District, California State Parks, Santa Cruz, California, USA
| | - James Bodkin
- Retired, Alaska Science Center, US Geological Survey, Anchorage Alaska, 99503, USA
| | - Klaus-Peter Koepfli
- Smithsonian-Mason School of Conservation, George Mason University, Front Royal, VA 22630, USA
- Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, D.C., 20008, USA
- ITMO University, Computer Technologies Laboratory, St. Petersburg 197101, Russia
| | - Kirk E. Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
| | - Robert K. Wayne
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
29
|
Achakkagari SR, Kyriakidou M, Gardner KM, De Koeyer D, De Jong H, Strömvik MV, Tai HH. Genome sequencing of adapted diploid potato clones. FRONTIERS IN PLANT SCIENCE 2022; 13:954933. [PMID: 36003817 PMCID: PMC9394749 DOI: 10.3389/fpls.2022.954933] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 07/20/2022] [Indexed: 06/15/2023]
Abstract
Cultivated potato is a vegetatively propagated crop, and most varieties are autotetraploid with high levels of heterozygosity. Reducing the ploidy and breeding potato at the diploid level can increase efficiency for genetic improvement including greater ease of introgression of diploid wild relatives and more efficient use of genomics and markers in selection. More recently, selfing of diploids for generation of inbred lines for F1 hybrid breeding has had a lot of attention in potato. The current study provides genomics resources for nine legacy non-inbred adapted diploid potato clones developed at Agriculture and Agri-Food Canada. De novo genome sequence assembly using 10× Genomics and Illumina sequencing technologies show the genome sizes ranged from 712 to 948 Mbp. Structural variation was identified by comparison to two references, the potato DMv6.1 genome and the phased RHv3 genome, and a k-mer based analysis of sequence reads showed the genome heterozygosity range of 1 to 9.04% between clones. A genome-wide approach was taken to scan 5 Mb bins to visualize patterns of heterozygous deleterious alleles. These were found dispersed throughout the genome including regions overlapping segregation distortions. Novel variants of the StCDF1 gene conferring earliness of tuberization were found among these clones, which all produce tubers under long days. The genomes will be useful tools for genome design for potato breeding.
Collapse
Affiliation(s)
| | - Maria Kyriakidou
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, QC, Canada
| | - Kyle M. Gardner
- Fredericton Research and Development Centre, Agriculture and Agri-Food Canada, Fredericton, NB, Canada
| | - David De Koeyer
- Fredericton Research and Development Centre, Agriculture and Agri-Food Canada, Fredericton, NB, Canada
| | - Hielke De Jong
- Fredericton Research and Development Centre, Agriculture and Agri-Food Canada, Fredericton, NB, Canada
| | - Martina V. Strömvik
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, QC, Canada
| | - Helen H. Tai
- Fredericton Research and Development Centre, Agriculture and Agri-Food Canada, Fredericton, NB, Canada
| |
Collapse
|
30
|
Milligan WR, Amster G, Sella G. The impact of genetic modifiers on variation in germline mutation rates within and among human populations. Genetics 2022; 221:iyac087. [PMID: 35666194 PMCID: PMC9339295 DOI: 10.1093/genetics/iyac087] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 05/16/2022] [Indexed: 11/14/2022] Open
Abstract
Mutation rates and spectra differ among human populations. Here, we examine whether this variation could be explained by evolution at mutation modifiers. To this end, we consider genetic modifier sites at which mutations, "mutator alleles," increase genome-wide mutation rates and model their evolution under purifying selection due to the additional deleterious mutations that they cause, genetic drift, and demographic processes. We solve the model analytically for a constant population size and characterize how evolution at modifier sites impacts variation in mutation rates within and among populations. We then use simulations to study the effects of modifier sites under a plausible demographic model for Africans and Europeans. When comparing populations that evolve independently, weakly selected modifier sites (2Nes≈1), which evolve slowly, contribute the most to variation in mutation rates. In contrast, when populations recently split from a common ancestral population, strongly selected modifier sites (2Nes≫1), which evolve rapidly, contribute the most to variation between them. Moreover, a modest number of modifier sites (e.g. 10 per mutation type in the standard classification into 96 types) subject to moderate to strong selection (2Nes>1) could account for the variation in mutation rates observed among human populations. If such modifier sites indeed underlie differences among populations, they should also cause variation in mutation rates within populations and their effects should be detectable in pedigree studies.
Collapse
Affiliation(s)
- William R Milligan
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Guy Amster
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
- Flatiron Health Inc., New York, NY 10013, USA
| | - Guy Sella
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
- Program for Mathematical Genomics, Columbia University, New York, NY 10032, USA
| |
Collapse
|
31
|
Qin Z, Huang T, Guo M, Wang SM. Distinct landscapes of deleterious variants in DNA damage repair system in ethnic human populations. Life Sci Alliance 2022; 5:5/9/e202101319. [PMID: 35595529 PMCID: PMC9122833 DOI: 10.26508/lsa.202101319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 05/11/2022] [Accepted: 05/11/2022] [Indexed: 12/03/2022] Open
Abstract
Deleterious variants in the DNA damage repair system can cause genome instability and increase cancer risk. The highly ethnic-specific DDR deleterious variation from this study suggests its potential relationship with different disease susceptibility in ethnic human populations. Deleterious variants in DNA damage repair (DDR) system can cause genome instability and increase cancer risk. In this study, we analyzed the deleterious variants in DDR system in 16 ethnic human populations. From the genetic variants in 169 DDR genes involved in nine DDR pathways collected from 158,612 individuals of different ethnic background, we identified 1,781 deleterious variants in 81 DDR genes in eight DDR pathways (https://genemutation.fhs.um.edu.mo/dbddr-global/). Our analysis showed although the quantity of deleterious variants was loaded at a similar level, the landscape of the variants differed substantially among different populations that two-third of the variants were present in single ethnic populations, and the rest was mostly shared between the populations with closer geographic and genetic relationship. The highly ethnic-specific DDR deleterious variation suggests its potential relationship with different disease susceptibility in ethnic human populations.
Collapse
Affiliation(s)
- Zixin Qin
- Cancer Centre and Institute of Translational Medicine, Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau, China
| | - Teng Huang
- Cancer Centre and Institute of Translational Medicine, Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau, China
| | - Maoni Guo
- Cancer Centre and Institute of Translational Medicine, Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau, China
| | - San Ming Wang
- Cancer Centre and Institute of Translational Medicine, Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau, China
| |
Collapse
|
32
|
Markello C, Huang C, Rodriguez A, Carroll A, Chang PC, Eizenga J, Markello T, Haussler D, Paten B. A complete pedigree-based graph workflow for rare candidate variant analysis. Genome Res 2022; 32:893-903. [PMID: 35483961 PMCID: PMC9104704 DOI: 10.1101/gr.276387.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 03/24/2022] [Indexed: 11/24/2022]
Abstract
Methods that use a linear genome reference for genome sequencing data analysis are reference-biased. In the field of clinical genetics for rare diseases, a resulting reduction in genotyping accuracy in some regions has likely prevented the resolution of some cases. Pangenome graphs embed population variation into a reference structure. Although pangenome graphs have helped to reduce reference mapping bias, further performance improvements are possible. We introduce VG-Pedigree, a pedigree-aware workflow based on the pangenome-mapping tool of Giraffe and the variant calling tool DeepTrio using a specially trained model for Giraffe-based alignments. We demonstrate mapping and variant calling improvements in both single-nucleotide variants (SNVs) and insertion and deletion (indel) variants over those produced by alignments created using BWA-MEM to a linear-reference and Giraffe mapping to a pangenome graph containing data from the 1000 Genomes Project. We have also adapted and upgraded deleterious-variant (DV) detecting methods and programs into a streamlined workflow. We used these workflows in combination to detect small lists of candidate DVs among 15 family quartets and quintets of the Undiagnosed Diseases Program (UDP). All candidate DVs that were previously diagnosed using the Mendelian models covered by the previously published methods were recapitulated by these workflows. The results of these experiments indicate that a slightly greater absolute count of DVs are detected in the proband population than in their matched unaffected siblings.
Collapse
Affiliation(s)
- Charles Markello
- UC Santa Cruz Genomics Institute, Santa Cruz, California 95060, USA
| | - Charles Huang
- Undiagnosed Diseases Program, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Alex Rodriguez
- Undiagnosed Diseases Program, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - Andrew Carroll
- Google Incorporated, Mountain View, California 94043, USA
| | - Pi-Chuan Chang
- Google Incorporated, Mountain View, California 94043, USA
| | - Jordan Eizenga
- UC Santa Cruz Genomics Institute, Santa Cruz, California 95060, USA
| | - Thomas Markello
- Undiagnosed Diseases Program, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - David Haussler
- UC Santa Cruz Genomics Institute, Santa Cruz, California 95060, USA
- Howard Hughes Medical Institute, University of California, Santa Cruz, California 95064, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, Santa Cruz, California 95060, USA
| |
Collapse
|
33
|
Abstract
SignificanceThe dynamics of deleterious variation under contrasting demographic scenarios remain poorly understood in spite of their relevance in evolutionary and conservation terms. Here we apply a genomic approach to study differences in the burden of deleterious alleles between the endangered Iberian lynx (Lynx pardinus) and the widespread Eurasian lynx (Lynx lynx). Our analysis unveils a significantly lower deleterious burden in the former species that should be ascribed to genetic purging, that is, to the increased opportunities of selection against recessive homozygotes due to the inbreeding caused by its smaller population size, as illustrated by our analytical predictions. This research provides theoretical and empirical evidence on the evolutionary relevance of genetic purging under certain demographic conditions.
Collapse
|
34
|
Mularo AJ, Bernal XE, DeWoody JA. Dominance can increase genetic variance after a population bottleneck: a synthesis of the theoretical and empirical evidence. J Hered 2022; 113:257-271. [PMID: 35143665 DOI: 10.1093/jhered/esac007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 02/07/2022] [Indexed: 11/13/2022] Open
Abstract
Drastic reductions in population size, or population bottlenecks, can lead to a reduction in additive genetic variance and adaptive potential. Genetic variance for some quantitative genetic traits, however, can increase after a population reduction. Empirical evaluations of quantitative traits following experimental bottlenecks indicate that non-additive genetic effects, including both allelic dominance at a given locus and epistatic interactions among loci, may impact the additive variance contributed by alleles that ultimately influences phenotypic expression and fitness. The dramatic effects of bottlenecks on overall genetic diversity have been well studied, but relatively little is known about how dominance and demographic events like bottlenecks can impact additive genetic variance. Herein, we critically examine how the degree of dominance among alleles affects additive genetic variance after a bottleneck. We first review and synthesize studies that document the impact of empirical bottlenecks on dominance variance. We then extend earlier work by elaborating on two theoretical models that illustrate the relationship between dominance and the potential increase in additive genetic variance immediately following a bottleneck. Furthermore, we investigate the parameters that influence the maximum level of genetic variation (associated with adaptive potential) after a bottleneck, including the number of founding individuals. Finally, we validated our methods using forward-time population genetic simulations of loci with varying dominance and selection levels. The fate of non-additive genetic variation following bottlenecks could have important implications for conservation and management efforts in a wide variety of taxa, and our work should help contextualize future studies (e.g., epistatic variance) in population genomics.
Collapse
Affiliation(s)
- Andrew J Mularo
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA
| | - Ximena E Bernal
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA.,Smithsonian Tropical Research Institute, Balboa, Republic of Panamá
| | - J Andrew DeWoody
- Department of Biological Sciences, Purdue University, West Lafayette, Indiana, USA.,Department of Forestry and Natural Resources, Purdue University, West Lafayette, IN
| |
Collapse
|
35
|
Genetic load: genomic estimates and applications in non-model animals. Nat Rev Genet 2022; 23:492-503. [PMID: 35136196 DOI: 10.1038/s41576-022-00448-x] [Citation(s) in RCA: 75] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/10/2022] [Indexed: 12/11/2022]
Abstract
Genetic variation, which is generated by mutation, recombination and gene flow, can reduce the mean fitness of a population, both now and in the future. This 'genetic load' has been estimated in a wide range of animal taxa using various approaches. Advances in genome sequencing and computational techniques now enable us to estimate the genetic load in populations and individuals without direct fitness estimates. Here, we review the classic and contemporary literature of genetic load. We describe approaches to quantify the genetic load in whole-genome sequence data based on evolutionary conservation and annotations. We show that splitting the load into its two components - the realized load (or expressed load) and the masked load (or inbreeding load) - can improve our understanding of the population genetics of deleterious mutations.
Collapse
|
36
|
Conover JL, Wendel JF. Deleterious Mutations Accumulate Faster in Allopolyploid than Diploid Cotton (Gossypium) and Unequally between Subgenomes. Mol Biol Evol 2022; 39:6517786. [PMID: 35099532 PMCID: PMC8841602 DOI: 10.1093/molbev/msac024] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Abstract
Whole genome duplication (polyploidization) is among the most dramatic mutational processes in nature, so understanding how natural selection differs in polyploids relative to diploids is an important goal. Population genetics theory predicts that recessive deleterious mutations accumulate faster in allopolyploids than diploids due to the masking effect of redundant gene copies, but this prediction is hitherto unconfirmed. Here, we use the cotton genus (Gossypium), which contains seven allopolyploids derived from a single polyploidization event 1-2 million years ago, to investigate deleterious mutation accumulation. We use two methods of identifying deleterious mutations at the nucleotide and amino acid level, along with whole-genome resequencing of 43 individuals spanning six allopolyploid species and their two diploid progenitors, to demonstrate that deleterious mutations accumulate faster in allopolyploids than in their diploid progenitors. We find that, unlike what would be expected under models of demographic changes alone, strongly deleterious mutations show the biggest difference between ploidy levels, and this effect diminishes for moderately and mildly deleterious mutations. We further show that the proportion of nonsynonymous mutations that are deleterious differs between the two co-resident subgenomes in the allopolyploids, suggesting that homoeologous masking acts unequally between subgenomes. Our results provide a genome-wide perspective on classic notions of the significance of gene duplication that likely are broadly applicable to allopolyploids, with implications for our understanding of the evolutionary fate of deleterious mutations. Finally, we note that some measures of selection (e.g. dN/dS, πN/πS) may be biased when species of different ploidy levels are compared.
Collapse
Affiliation(s)
- Justin L Conover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Jonathan F Wendel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| |
Collapse
|
37
|
Vecchyo DOD, Lohmueller KE, Novembre J. Haplotype-based inference of the distribution of fitness effects. Genetics 2022; 220:6501446. [PMID: 35100400 PMCID: PMC8982047 DOI: 10.1093/genetics/iyac002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 12/18/2021] [Indexed: 11/13/2022] Open
Abstract
Abstract
Recent genome sequencing studies with large sample sizes in humans have discovered a vast quantity of low-frequency variants, providing an important source of information to analyze how selection is acting on human genetic variation. In order to estimate the strength of natural selection acting on low-frequency variants, we have developed a likelihood-based method that uses the lengths of pairwise identity-by-state between haplotypes carrying low-frequency variants. We show that in some non-equilibrium populations (such as those that have had recent population expansions) it is possible to distinguish between positive or negative selection acting on a set of variants. With our new framework, one can infer a fixed selection intensity acting on a set of variants at a particular frequency, or a distribution of selection coefficients for standing variants and new mutations. We show an application of our method to the UK10K phased haplotype dataset of individuals.
Collapse
Affiliation(s)
- Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, 76230, México
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
| | - Kirk E Lohmueller
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, Illinois, 60637, United States of America
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, 60637, United States of America
| |
Collapse
|
38
|
Sandell L, Sharp NP. Fitness Effects of Mutations: An Assessment of PROVEAN Predictions Using Mutation Accumulation Data. Genome Biol Evol 2022; 14:evac004. [PMID: 35038732 PMCID: PMC8790079 DOI: 10.1093/gbe/evac004] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/22/2021] [Indexed: 11/14/2022] Open
Abstract
Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of one such tool, called PROVEAN. This program compares a query sequence with existing data to provide an alignment-based score for any protein variant, with scores categorized as neutral or deleterious based on a pre-set threshold. PROVEAN has been used widely in evolutionary studies, for example, to estimate mutation load in natural populations, but has not been formally tested as a predictor of aggregate mutational effects on fitness. Using three large published data sets on the genome sequences of laboratory mutation accumulation lines, we assessed how well PROVEAN predicted the actual fitness patterns observed, relative to other metrics. In most cases, we find that a simple count of the total number of mutant proteins is a better predictor of fitness than the number of proteins with variants scored as deleterious by PROVEAN. We also find that the sum of all mutant protein scores explains variation in fitness better than the number of mutant proteins in one of the data sets. We discuss the implications of these results for studies of populations in the wild.
Collapse
Affiliation(s)
- Linnea Sandell
- Department of Zoology, University of British Columbia, Vancouver, British Columbia, Canada
- Systematic Biology, Department of Organismal Biology, Uppsala University, Sweden
| | | |
Collapse
|
39
|
Ahmadi N. Genetic Bases of Complex Traits: From Quantitative Trait Loci to Prediction. Methods Mol Biol 2022; 2467:1-44. [PMID: 35451771 DOI: 10.1007/978-1-0716-2205-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Conceived as a general introduction to the book, this chapter is a reminder of the core concepts of genetic mapping and molecular marker-based prediction. It provides an overview of the principles and the evolution of methods for mapping the variation of complex traits, and methods for QTL-based prediction of human disease risk and animal and plant breeding value. The principles of linkage-based and linkage disequilibrium-based QTL mapping methods are described in the context of the simplest, single-marker, methods. Methodological evolutions are analysed in relation with their ability to account for the complexity of the genotype-phenotype relations. Main characteristics of the genetic architecture of complex traits, drawn from QTL mapping works using large populations of unrelated individuals, are presented. Methods combining marker-QTL association data into polygenic risk score that captures part of an individual's susceptibility to complex diseases are reviewed. Principles of best linear mixed model-based prediction of breeding value in animal- and plant-breeding programs using phenotypic and pedigree data, are summarized and methods for moving from BLUP to marker-QTL BLUP are presented. Factors influencing the additional genetic progress achieved by using molecular data and rules for their optimization are discussed.
Collapse
Affiliation(s)
- Nourollah Ahmadi
- CIRAD, UMR AGAP Institut, Montpellier, France.
- AGAP Institut, Univ Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France.
| |
Collapse
|
40
|
Hayward LK, Sella G. Polygenic adaptation after a sudden change in environment. eLife 2022; 11:66697. [PMID: 36155653 PMCID: PMC9683794 DOI: 10.7554/elife.66697] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Accepted: 07/18/2022] [Indexed: 11/13/2022] Open
Abstract
Polygenic adaptation is thought to be ubiquitous, yet remains poorly understood. Here, we model this process analytically, in the plausible setting of a highly polygenic, quantitative trait that experiences a sudden shift in the fitness optimum. We show how the mean phenotype changes over time, depending on the effect sizes of loci that contribute to variance in the trait, and characterize the allele dynamics at these loci. Notably, we describe the two phases of the allele dynamics: The first is a rapid phase, in which directional selection introduces small frequency differences between alleles whose effects are aligned with or opposed to the shift, ultimately leading to small differences in their probability of fixation during a second, longer phase, governed by stabilizing selection. As we discuss, key results should hold in more general settings and have important implications for efforts to identify the genetic basis of adaptation in humans and other species.
Collapse
Affiliation(s)
- Laura Katharine Hayward
- Department of Mathematics, Columbia UniversityNew YorkUnited States,Institute of Science and TechnologyMaria GuggingAustria
| | - Guy Sella
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States,Program for Mathematical Genomics, Columbia UniversityNew YorkUnited States
| |
Collapse
|
41
|
Samayoa LF, Olukolu BA, Yang CJ, Chen Q, Stetter MG, York AM, Sanchez-Gonzalez JDJ, Glaubitz JC, Bradbury PJ, Romay MC, Sun Q, Yang J, Ross-Ibarra J, Buckler ES, Doebley JF, Holland JB. Domestication reshaped the genetic basis of inbreeding depression in a maize landrace compared to its wild relative, teosinte. PLoS Genet 2021; 17:e1009797. [PMID: 34928949 PMCID: PMC8722731 DOI: 10.1371/journal.pgen.1009797] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 01/03/2022] [Accepted: 12/03/2021] [Indexed: 12/29/2022] Open
Abstract
Inbreeding depression is the reduction in fitness and vigor resulting from mating of close relatives observed in many plant and animal species. The extent to which the genetic load of mutations contributing to inbreeding depression is due to large-effect mutations versus variants with very small individual effects is unknown and may be affected by population history. We compared the effects of outcrossing and self-fertilization on 18 traits in a landrace population of maize, which underwent a population bottleneck during domestication, and a neighboring population of its wild relative teosinte. Inbreeding depression was greater in maize than teosinte for 15 of 18 traits, congruent with the greater segregating genetic load in the maize population that we predicted from sequence data. Parental breeding values were highly consistent between outcross and selfed offspring, indicating that additive effects determine most of the genetic value even in the presence of strong inbreeding depression. We developed a novel linkage scan to identify quantitative trait loci (QTL) representing large-effect rare variants carried by only a single parent, which were more important in teosinte than maize. Teosinte also carried more putative juvenile-acting lethal variants identified by segregation distortion. These results suggest a mixture of mostly polygenic, small-effect partially recessive effects in linkage disequilibrium underlying inbreeding depression, with an additional contribution from rare larger-effect variants that was more important in teosinte but depleted in maize following the domestication bottleneck. Purging associated with the maize domestication bottleneck may have selected against some large effect variants, but polygenic load is harder to purge and overall segregating mutational burden increased in maize compared to teosinte. Inbreeding depression is the reduction in fitness and vigor resulting from mating of close relatives observed in many plant and animal species. Mating of close relatives increases the probability that an individual inherits two non-functioning mutations at the same gene, resulting in lower fitness of such matings. We do not know the extent to which inbreeding depression is due to mutations with large-effects versus small-effect polygenic variants. We compared the effects of outcrossing and self-fertilization on 18 traits in a landrace population of maize, which underwent a population bottleneck during domestication, and a neighboring population of its wild relative teosinte. Inbreeding depression was greater in maize than teosinte for 15 of 18 traits and we found that this was consistent with higher predicted ‘genetic load’ in maize based solely on the evolutionary conservation of the sequence variants observed in the population. We also mapped genome positions associated with inbreeding depression, identifying more and larger-effect genetic variants in teosinte than maize. These results suggest that during domestication, some of the rare large-effect variants in teosinte were bred out, but many genetic variants of small effects on inbreeding depression increased in frequency maize.
Collapse
Affiliation(s)
- Luis Fernando Samayoa
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Bode A. Olukolu
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Tennessee, United States of America
| | - Chin Jian Yang
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Qiuyue Chen
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Markus G. Stetter
- Institute for Plant Sciences and Center of Excellence on Plant Sciences, University of Cologne, Cologne, Germany
| | - Alessandra M. York
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | | | - Jeffrey C. Glaubitz
- Institute of Biotechnology, Cornell University, Ithaca, New York, United States of America
| | - Peter J. Bradbury
- US Department of Agriculture–Agricultural Research Service, Cornell University, Ithaca, New York, United States of America
| | - Maria Cinta Romay
- Institute of Biotechnology, Cornell University, Ithaca, New York, United States of America
| | - Qi Sun
- Institute of Biotechnology, Cornell University, Ithaca, New York, United States of America
| | - Jinliang Yang
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, Nebraska, United States of America
| | - Jeffrey Ross-Ibarra
- Department of Evolution and Ecology, Center for Population Biology, and Genome Center, University of California, Davis, California, United States of America
| | - Edward S. Buckler
- US Department of Agriculture–Agricultural Research Service, Cornell University, Ithaca, New York, United States of America
| | - John F. Doebley
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - James B. Holland
- Department of Crop and Soil Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
- United States Department of Agriculture–Agriculture Research Service, Raleigh, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
42
|
Sohail M, Izarraras-Gomez A, Ortega-Del Vecchyo D. Populations, Traits, and Their Spatial Structure in Humans. Genome Biol Evol 2021; 13:evab272. [PMID: 34894236 PMCID: PMC8715524 DOI: 10.1093/gbe/evab272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/06/2021] [Indexed: 11/16/2022] Open
Abstract
The spatial distribution of genetic variants is jointly determined by geography, past demographic processes, natural selection, and its interplay with environmental variation. A fraction of these genetic variants are "causal alleles" that affect the manifestation of a complex trait. The effect exerted by these causal alleles on complex traits can be independent or dependent on the environment. Understanding the evolutionary processes that shape the spatial structure of causal alleles is key to comprehend the spatial distribution of complex traits. Natural selection, past population size changes, range expansions, consanguinity, assortative mating, archaic introgression, admixture, and the environment can alter the frequencies, effect sizes, and heterozygosities of causal alleles. This provides a genetic axis along which complex traits can vary. However, complex traits also vary along biogeographical and sociocultural axes which are often correlated with genetic axes in complex ways. The purpose of this review is to consider these genetic and environmental axes in concert and examine the ways they can help us decipher the variation in complex traits that is visible in humans today. This initiative necessarily implies a discussion of populations, traits, the ability to infer and interpret "genetic" components of complex traits, and how these have been impacted by adaptive events. In this review, we provide a history-aware discussion on these topics using both the recent and more distant past of our academic discipline and its relevant contexts.
Collapse
Affiliation(s)
- Mashaal Sohail
- Department of Human Genetics, University of Chicago, USA
- Centro de Ciencias Genómicas (CCG), Universidad Nacional Autónoma de México (UNAM), Cuernavaca, Morelos, México
| | - Alan Izarraras-Gomez
- Laboratorio Internacional de Investigación sobre el Genoma Humano (LIIGH), Universidad Nacional Autónoma de México (UNAM), Juriquilla, Querétaro, México
| | - Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano (LIIGH), Universidad Nacional Autónoma de México (UNAM), Juriquilla, Querétaro, México
| |
Collapse
|
43
|
Ma Y, Liu D, Wariss HM, Zhang R, Tao L, Milne RI, Sun W. Demographic history and identification of threats revealed by population genomic analysis provide insights into conservation for an endangered maple. Mol Ecol 2021; 31:767-779. [PMID: 34826164 DOI: 10.1111/mec.16289] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 11/17/2021] [Accepted: 11/22/2021] [Indexed: 12/01/2022]
Abstract
Recent advancements in whole genome sequencing techniques capable of covering nearly all the nucleotide variations of a genome would make it possible to set up a conservation framework for threatened plants at the genomic level. Here we applied a whole genome resequencing approach to obtain genome-wide data from 105 individuals sampled from the 10 currently known extant populations of Acer yangbiense, an endangered species with fragmented habitats and restricted distribution in Yunnan, China. To inform meaningful conservation action, we investigated what factors might have contributed to the formation of its extremely small population sizes and what threats it currently suffers at a genomic level. Our results revealed that A. yangbiense has low genetic diversity and comprises different numbers of genetic groups based on neutral (seven) and selected loci (13), with frequent gene flow between populations. Repeated bottleneck events, particularly the most recent one occurring within ~10,000 years before present, which decreased its effective population size (Ne ) < 200, and severe habitat fragmentation resulting from anthropogenic activities as well as a biased gender ratio of mature individuals in its natural habitat, might have together contributed to the currently fragmented and endangered status of A. yangbiense. The species has suffered from inbreeding and deleterious mutation load, both of which varied among populations but had similar patterns; that is, populations with higher FROH (frequency of runs of homozygosity) always carried a larger number of deleterious mutations in the homozygous state than in populations with lower FROH. In addition, based on our genetic differentiation results, and the distribution patterns of homozygous deleterious mutations in individuals, we recommend certain conservation actions regarding the genetic rescue of A. yangbiense. Overall, our study provides meaningful insights into the conservation genetics and a framework for the further conservation for the endangered A. yangbiense.
Collapse
Affiliation(s)
- Yongpeng Ma
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China.,Key Laboratory for Plant Diversity and Biogeography of East Asia, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China
| | - Detuan Liu
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China.,Key Laboratory for Plant Diversity and Biogeography of East Asia, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Hafiz Muhammad Wariss
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China.,Key Laboratory for Plant Diversity and Biogeography of East Asia, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China
| | - Rengang Zhang
- Beijing Ori-Gene Science and Technology Co. Ltd, Beijing, China
| | - Lidan Tao
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China.,Key Laboratory for Plant Diversity and Biogeography of East Asia, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China
| | - Richard I Milne
- Institute of Molecular Plant Sciences, University of Edinburgh, Edinburgh, UK
| | - Weibang Sun
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China.,Key Laboratory for Plant Diversity and Biogeography of East Asia, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China.,Kunming Botanical Garden, Chinese Academy of Sciences, Kunming Institute of Botany, Kunming, China
| |
Collapse
|
44
|
Agarwal I, Przeworski M. Mutation saturation for fitness effects at human CpG sites. eLife 2021; 10:e71513. [PMID: 34806592 PMCID: PMC8683084 DOI: 10.7554/elife.71513] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 11/21/2021] [Indexed: 01/06/2023] Open
Abstract
Whole exome sequences have now been collected for millions of humans, with the related goals of identifying pathogenic mutations in patients and establishing reference repositories of data from unaffected individuals. As a result, we are approaching an important limit, in which datasets are large enough that, in the absence of natural selection, every highly mutable site will have experienced at least one mutation in the genealogical history of the sample. Here, we focus on CpG sites that are methylated in the germline and experience mutations to T at an elevated rate of ~10-7 per site per generation; considering synonymous mutations in a sample of 390,000 individuals, ~ 99 % of such CpG sites harbor a C/T polymorphism. Methylated CpG sites provide a natural mutation saturation experiment for fitness effects: as we show, at nt sample sizes, not seeing a non-synonymous polymorphism is indicative of strong selection against that mutation. We rely on this idea in order to directly identify a subset of CpG transitions that are likely to be highly deleterious, including ~27 % of possible loss-of-function mutations, and up to 20 % of possible missense mutations, depending on the type of functional site in which they occur. Unlike methylated CpGs, most mutation types, with rates on the order of 10-8 or 10-9, remain very far from saturation. We discuss what these findings imply for interpreting the potential clinical relevance of mutations from their presence or absence in reference databases and for inferences about the fitness effects of new mutations.
Collapse
Affiliation(s)
- Ipsita Agarwal
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
| | - Molly Przeworski
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Systems Biology, Columbia UniversityNew YorkUnited States
| |
Collapse
|
45
|
Li SH, Liu Y, Yeh CF, Fu Y, Yeung CKL, Lee CC, Chiu CC, Kuo TH, Chan FT, Chen YC, Ko WY, Yao CT. Not out of the woods yet: Signatures of the prolonged negative genetic consequences of a population bottleneck in a rapidly re-expanding wader, the black-faced spoonbill Platalea minor. Mol Ecol 2021; 31:529-545. [PMID: 34726290 DOI: 10.1111/mec.16260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 09/27/2021] [Accepted: 10/20/2021] [Indexed: 11/30/2022]
Abstract
The long-term persistence of a population which has suffered a bottleneck partly depends on how historical demographic dynamics impacted its genetic diversity and the accumulation of deleterious mutations. Here we provide genomic evidence for the genetic effect of a recent population bottleneck in the endangered black-faced spoonbill (Platalea minor) after its rapid population recovery. Our data suggest that the bird's effective population size, Ne , had been relatively stable (7500-9000) since 22,000 years ago; however, a recent brief yet severe bottleneck (Ne = 20) which we here estimated to occur around the 1940s wiped out >99% of its historical Ne in roughly three generations. Despite a >15-fold population recovery since 1988, we found that black-faced spoonbill population has higher levels of inbreeding (7.4 times more runs of homozygosity) than its sister species, the royal spoonbill (P. regia), which is not thought to have undergone a marked population contraction. Although the two spoonbills have similar levels of genome-wide genetic diversity, our results suggest that selection on more genes was relaxed in the black-faced spoonbill; moreover individual black-faced spoonbills carry more putatively deleterious mutations (Grantham's score > 50), and may therefore express more deleterious phenotypic effects than royal spoonbills. Here we demonstrate the value of using genomic indices to monitor levels of genetic erosion, inbreeding and mutation load in species with conservation concerns. To mitigate the prolonged negative genetic effect of a population bottleneck, we recommend that all possible measures should be employed to maintain population growth of a threatened species.
Collapse
Affiliation(s)
- Shou-Hsien Li
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| | - Yang Liu
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-Sen University, Guangzhou, China
| | - Chia-Fen Yeh
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| | - Yuchen Fu
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| | | | - Chun-Cheng Lee
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| | - Chi-Cheng Chiu
- School of Life Science, National Taiwan Normal University, Taipei, Taiwan
| | | | - Fang-Tse Chan
- Division of Zoology, Taiwan Endemic Species Research Institute, Nantou, Taiwan
| | - Yu-Chia Chen
- Department of Life Sciences, National Yanming Medical University, Taipei, Taiwan
| | - Wen-Ya Ko
- Department of Life Sciences, National Yanming Medical University, Taipei, Taiwan
| | - Cheng-Te Yao
- High Altitude Research Station, Taiwan Endemic Species Research Institute, Nantou, Taiwan
| |
Collapse
|
46
|
Lucena-Perez M, Kleinman-Ruiz D, Marmesat E, Saveljev AP, Schmidt K, Godoy JA. Bottleneck-associated changes in the genomic landscape of genetic diversity in wild lynx populations. Evol Appl 2021; 14:2664-2679. [PMID: 34815746 PMCID: PMC8591332 DOI: 10.1111/eva.13302] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 08/17/2021] [Accepted: 09/08/2021] [Indexed: 01/06/2023] Open
Abstract
Demographic bottlenecks generally reduce genetic diversity through more intense genetic drift, but their net effect may vary along the genome due to the random nature of genetic drift and to local effects of recombination, mutation, and selection. Here, we analyzed the changes in genetic diversity following a bottleneck by comparing whole-genome diversity patterns in populations with and without severe recent documented declines of Iberian (Lynx pardinus, n = 31) and Eurasian lynx (Lynx lynx, n = 29). As expected, overall genomic diversity correlated negatively with bottleneck intensity and/or duration. Correlations of genetic diversity with divergence, chromosome size, gene or functional site content, GC content, or recombination were observed in nonbottlenecked populations, but were weaker in bottlenecked populations. Also, functional features under intense purifying selection and the X chromosome showed an increase in the observed density of variants, even resulting in higher θ W diversity than in nonbottlenecked populations. Increased diversity seems to be related to both a higher mutational input in those regions creating a large collection of low-frequency variants, a few of which increase in frequency during the bottleneck to the point they become detectable with our limited sample, and the reduced efficacy of purifying selection, which affects not only protein structure and function but also the regulation of gene expression. The results of this study alert to the possible reduction of fitness and adaptive potential associated with the genomic erosion in regulatory elements. Further, the detection of a gain of diversity in ultra-conserved elements can be used as a sensitive and easy-to-apply signature of genetic erosion in wild populations.
Collapse
Affiliation(s)
- Maria Lucena-Perez
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
| | - Daniel Kleinman-Ruiz
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
- Departamento de Genética Facultad de Biología Universidad Complutense Madrid Spain
| | - Elena Marmesat
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
| | - Alexander P Saveljev
- Department of Animal Ecology Russian Research Institute of Game Management and Fur Farming Kirov Russia
| | - Krzysztof Schmidt
- Mammal Research Institute Polish Academy of Sciences Białowieża Poland
| | - José A Godoy
- Departamento de Ecología Integrativa Estación Biológica de Doñana (CSIC) Sevilla Spain
| |
Collapse
|
47
|
Whole-exome analysis in Tunisian Imazighen and Arabs shows the impact of demography in functional variation. Sci Rep 2021; 11:21125. [PMID: 34702931 PMCID: PMC8548440 DOI: 10.1038/s41598-021-00576-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 10/14/2021] [Indexed: 11/08/2022] Open
Abstract
Human populations are genetically affected by their demographic history, which shapes the distribution of their functional genomic variation. However, the genetic impact of recent demography is debated. This issue has been studied in different populations, but never in North Africans, despite their relevant cultural and demographic diversity. In this study we address the question by analyzing new whole-exome sequences from two culturally different Tunisian populations, an isolated Amazigh population and a close non-isolated Arab-speaking population, focusing on the distribution of functional variation. Both populations present clear differences in their variant frequency distribution, in general and for putatively damaging variation. This suggests a relevant effect in the Amazigh population of genetic isolation, drift, and inbreeding, pointing to relaxed purifying selection. We also discover the enrichment in Imazighen of variation associated to specific diseases or phenotypic traits, but the scarce genetic and biomedical data in the region limits further interpretation. Our results show the genomic impact of recent demography and reveal a clear genetic differentiation probably related to culture. These findings highlight the importance of considering cultural and demographic heterogeneity within North Africa when defining population groups, and the need for more data to improve knowledge on the region's health and disease landscape.
Collapse
|
48
|
Otte KA, Nolte V, Mallard F, Schlötterer C. The genetic architecture of temperature adaptation is shaped by population ancestry and not by selection regime. Genome Biol 2021; 22:211. [PMID: 34271951 PMCID: PMC8285869 DOI: 10.1186/s13059-021-02425-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Accepted: 06/29/2021] [Indexed: 12/28/2022] Open
Abstract
Background Understanding the genetic architecture of temperature adaptation is key for characterizing and predicting the effect of climate change on natural populations. One particularly promising approach is Evolve and Resequence, which combines advantages of experimental evolution such as time series, replicate populations, and controlled environmental conditions, with whole genome sequencing. Recent analysis of replicate populations from two different Drosophila simulans founder populations, which were adapting to the same novel hot environment, uncovered very different architectures—either many selection targets with large heterogeneity among replicates or fewer selection targets with a consistent response among replicates. Results Here, we expose the founder population from Portugal to a cold temperature regime. Although almost no selection targets are shared between the hot and cold selection regime, the adaptive architecture was similar. We identify a moderate number of targets under strong selection (19 selection targets, mean selection coefficient = 0.072) and parallel responses in the cold evolved replicates. This similarity across different environments indicates that the adaptive architecture depends more on the ancestry of the founder population than the specific selection regime. Conclusions These observations will have broad implications for the correct interpretation of the genomic responses to a changing climate in natural populations. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-021-02425-9.
Collapse
Affiliation(s)
- Kathrin A Otte
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria.,Present address: Institute for Zoology, University of Cologne, Cologne, Germany
| | - Viola Nolte
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
| | - François Mallard
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria.,Present address: Institut de Biologie de l'École Normale Supérieure, CNRS UMR 8197, Inserm U1024, PSL Research University, F-75005, Paris, France
| | | |
Collapse
|
49
|
Teixeira JC, Huber CD. Authors’ Reply to Letter to the Editor: Neutral genetic diversity as a useful tool for conservation biology. CONSERV GENET 2021. [DOI: 10.1007/s10592-021-01385-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
50
|
Mathur S, DeWoody JA. Genetic load has potential in large populations but is realized in small inbred populations. Evol Appl 2021; 14:1540-1557. [PMID: 34178103 PMCID: PMC8210801 DOI: 10.1111/eva.13216] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 02/25/2021] [Accepted: 03/02/2021] [Indexed: 12/20/2022] Open
Abstract
Populations with higher genetic diversity and larger effective sizes have greater evolutionary capacity (i.e., adaptive potential) to respond to ecological stressors. We are interested in how the variation captured in protein-coding genes fluctuates relative to overall genomic diversity and whether smaller populations suffer greater costs due to their genetic load of deleterious mutations compared with larger populations. We analyzed individual whole-genome sequences (N = 74) from three different populations of Montezuma quail (Cyrtonyx montezumae), a small ground-dwelling bird that is sustainably harvested in some portions of its range but is of conservation concern elsewhere. Our historical demographic results indicate that Montezuma quail populations in the United States exhibit low levels of genomic diversity due in large part to long-term declines in effective population sizes over nearly a million years. The smaller and more isolated Texas population is significantly more inbred than the large Arizona and the intermediate-sized New Mexico populations we surveyed. The Texas gene pool has a significantly smaller proportion of strongly deleterious variants segregating in the population compared with the larger Arizona gene pool. Our results demonstrate that even in small populations, highly deleterious mutations are effectively purged and/or lost due to drift. However, we find that in small populations the realized genetic load is elevated because of inbreeding coupled with a higher frequency of slightly deleterious mutations that are manifested in homozygotes. Overall, our study illustrates how population genomics can be used to proactively assess both neutral and functional aspects of contemporary genetic diversity in a conservation framework while simultaneously considering deeper demographic histories.
Collapse
Affiliation(s)
- Samarth Mathur
- Department of Biological SciencesPurdue UniversityWest LafayetteIndianaUSA
- Present address:
Department of Evolution, Ecology and Organismal BiologyThe Ohio State UniversityColumbusOhioUSA
| | - J. Andrew DeWoody
- Department of Biological SciencesPurdue UniversityWest LafayetteIndianaUSA
- Department of Forestry and Natural ResourcesPurdue UniversityWest LafayetteIndianaUSA
| |
Collapse
|