1
|
Liu L, James J, Zhang YQ, Wang ZF, Arakaki M, Vadillo G, Zhou QJ, Lascoux M, Ge XJ. The 'queen of the Andes' (Puya raimondii) is genetically fragile and fragmented: a consequence of long generation time and semelparity? THE NEW PHYTOLOGIST 2024; 244:277-291. [PMID: 39135394 DOI: 10.1111/nph.20036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Accepted: 07/16/2024] [Indexed: 09/17/2024]
Abstract
Understanding how life history shapes genetic diversity is a fundamental issue in evolutionary biology, with important consequences for conservation. However, we still have an incomplete picture of the impact of life history on genome-wide patterns of diversity, especially in long-lived semelparous plants. Puya raimondii is a high-altitude semelparous species from the Andes that flowers at 40-100 years of age. We sequenced the whole genome and estimated the nucleotide diversity of 200 individuals sampled from nine populations. Coalescent-based approaches were then used to infer past population dynamics. Finally, these results were compared with results obtained for the iteroparous species, Puya macrura. The nine populations of P. raimondii were highly divergent, highly inbred, and carried an exceptionally high genetic load. They are genetically depauperate, although, locally in the genome, balancing selection contributed to the maintenance of genetic polymorphism. While both P. raimondii and P. macrura went through a severe bottleneck during the Pleistocene, P. raimondii did not recover from it and continuously declined, while P. macrura managed to bounce back. Our results demonstrate the importance of life history, in particular generation time and reproductive strategy, in affecting population dynamics and genomic variation, and illustrate the genetic fragility of long-lived semelparous plants.
Collapse
Affiliation(s)
- Lu Liu
- State Key Laboratory of Plant Diversity and Specialty Crops, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- Key Laboratory of National Forestry and Grassland Administration on Plant Conservation and Utilization in Southern China, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- South China National Botanical Garden, Guangzhou, 510650, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, 75236, Sweden
| | - Jennifer James
- Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, 75236, Sweden
- Swedish Collegium of Advanced Study, Uppsala University, Uppsala, 75236, Sweden
| | - Yu-Qu Zhang
- College of Pharmacy, Shaanxi University of Chinese Medicine, Xi'an, 712044, China
| | - Zheng-Feng Wang
- Key Laboratory of National Forestry and Grassland Administration on Plant Conservation and Utilization in Southern China, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- South China National Botanical Garden, Guangzhou, 510650, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Key Laboratory of Vegetation Restoration and Management of Degraded Ecosystems, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Mónica Arakaki
- Natural History Museum, Universidad Nacional Mayor de San Marcos, Lima, 15072, Peru
| | - Giovana Vadillo
- Plant Physiology Laboratory, Faculty of Biological Sciences, Universidad Nacional Mayor de San Marcos, Lima, 15081, Peru
| | - Qiu-Jie Zhou
- Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, 75236, Sweden
| | - Martin Lascoux
- Department of Ecology and Genetics, Evolutionary Biology Centre and Science for Life Laboratory, Uppsala University, Uppsala, 75236, Sweden
| | - Xue-Jun Ge
- State Key Laboratory of Plant Diversity and Specialty Crops, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- Key Laboratory of National Forestry and Grassland Administration on Plant Conservation and Utilization in Southern China, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- South China National Botanical Garden, Guangzhou, 510650, China
| |
Collapse
|
2
|
Wang X, Du Z, Duan Y, Liu S, Liu J, Li B, Ma L, Wu Y, Tian L, Song F, Cai W, Li H. Population genomics analyses reveal the role of hybridization in the rapid invasion of fall armyworm. J Adv Res 2024:S2090-1232(24)00430-2. [PMID: 39357646 DOI: 10.1016/j.jare.2024.09.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2024] [Revised: 09/25/2024] [Accepted: 09/29/2024] [Indexed: 10/04/2024] Open
Abstract
INTRODUCTION Invasive species pose a major threat to global biodiversity and agricultural productivity, yet the genomic mechanisms driving their rapid expansion into new habitats are not fully understood. The fall armyworm, Spodoptera frugiperda, originally from the Americas, has expanded its reach across the Old World, causing substantial reduction in crop yield. Although the hybridization between two genetically distinct strains has been well-documented, the role of such hybridization in enhancing the species' invasive capabilities remains largely unexplored. OBJECTIVES This study aims to investigate the contributions of hybridization and natural selection to the rapid invasion of the fall armyworm. METHODS We analyzed the whole-genome resequencing data from 432 individuals spanning its global distribution. We identified the genomic signatures of selection associated with invasion and explored their linkage with the Tpi gene indicating strain differentiation. Furthermore, we detected signatures of balancing selection in native populations for candidate genes that underwent selective sweeps during the invasion process. RESULTS Our analysis revealed pronounced genomic differentiation between native and invasive populations. Invasive populations displayed a uniform genomic structure distinctly different from that of native populations, indicating hybridization between the strains during invasion. This hybridization likely contributes to maintaining high genetic diversity in invasive regions, which is crucial for survival and adaptation. Additionally, polymorphisms on genes under selection during invasion were possibly preserved through balancing selection in their native environments. CONCLUSION Our findings reveal the genomic basis of the fall armyworm's successful invasion and rapid adaptation to new environments, highlighting the important role of hybridization in the dynamics of invasive species.
Collapse
Affiliation(s)
- Xuan Wang
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Zhenyong Du
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Yuange Duan
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Shanlin Liu
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Jie Liu
- National Agro-Tech Extension and Service Center, Beijing 100125, China
| | - Bingyan Li
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Ling Ma
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Yunfei Wu
- College of Biology and Food Engineering, Chuzhou University, Chuzhou 239000, China
| | - Li Tian
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Fan Song
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Wanzhi Cai
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China
| | - Hu Li
- Department of Entomology and MOA Key Lab of Pest Monitoring and Green Management, College of Plant Protection, China Agricultural University, Beijing 100193, China; Sanya Institute of China Agricultural University, Sanya 572025, China.
| |
Collapse
|
3
|
Strickland K, Matthews B, Jónsson ZO, Kristjánsson BK, Phillips JS, Einarsson Á, Räsänen K. Microevolutionary change in wild stickleback: Using integrative time-series data to infer responses to selection. Proc Natl Acad Sci U S A 2024; 121:e2410324121. [PMID: 39231210 PMCID: PMC11406292 DOI: 10.1073/pnas.2410324121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Accepted: 08/07/2024] [Indexed: 09/06/2024] Open
Abstract
A central goal in evolutionary biology is to understand how different evolutionary processes cause trait change in wild populations. However, quantifying evolutionary change in the wild requires linking trait change to shifts in allele frequencies at causal loci. Nevertheless, datasets that allow for such tests are extremely rare and existing theoretical approaches poorly account for the evolutionary dynamics that likely occur in ecological settings. Using a decade-long integrative phenome-to-genome time-series dataset on wild threespine stickleback (Gasterosteus aculeatus), we identified how different modes of selection (directional, episodic, and balancing) drive microevolutionary change in correlated traits over time. Most strikingly, we show that feeding traits changed by as much 25% across 10 generations which was driven by changes in the genetic architecture (i.e., in both genomic breeding values and allele frequencies at genetic loci for feeding traits). Importantly, allele frequencies at genetic loci related to feeding traits changed at a rate greater than expected under drift, suggesting that the observed change was a result of directional selection. Allele frequency dynamics of loci related to swimming traits appeared to be under fluctuating selection evident in periodic population crashes in this system. Our results show that microevolutionary change in a wild population is characterized by different modes of selection acting simultaneously on different traits, which likely has important consequences for the evolution of correlated traits. Our study provides one of the most thorough descriptions to date of how microevolutionary processes result in trait change in a natural population.
Collapse
Affiliation(s)
- Kasha Strickland
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, EdinburghEH9 3FL, United Kingdom
- Department of Aquaculture and Fish Biology, Háskólinn á Hólum, Hólum í Hjaltadal, Sauðárkrókur551, Iceland
| | - Blake Matthews
- Department of Fish Ecology and Evolution, Swiss Federal Institute of Aquatic Science and Technology, EAWAG, KastanienbaumCH-6047, Switzerland
| | - Zophonías O. Jónsson
- Institute of Life and Environmental Sciences, School of Engineering and Natural Sciences, University of Iceland, Reykjavík102, Iceland
| | - Bjarni K. Kristjánsson
- Department of Aquaculture and Fish Biology, Háskólinn á Hólum, Hólum í Hjaltadal, Sauðárkrókur551, Iceland
| | - Joseph S. Phillips
- Department of Aquaculture and Fish Biology, Háskólinn á Hólum, Hólum í Hjaltadal, Sauðárkrókur551, Iceland
- Department of Biology, Creighton University, Omaha, NE68178
| | - Árni Einarsson
- Institute of Life and Environmental Sciences, School of Engineering and Natural Sciences, University of Iceland, Reykjavík102, Iceland
| | - Katja Räsänen
- Department of Aquatic Ecology, Swiss Federal Institute of Aquatic Science and Technology, EAWAG, Duebendorf8600, Switzerland
- Department of Biological and Environmental Science, University of Jyväskylä, Jyväskylä40014, Finland
| |
Collapse
|
4
|
Banjoko AW, Ng'uni T, Naidoo N, Ramsuran V, Hyrien O, Ndhlovu ZM. High Resolution Class I HLA -A, -B, and - C Diversity in Eastern and Southern African Populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.04.611164. [PMID: 39282263 PMCID: PMC11398358 DOI: 10.1101/2024.09.04.611164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/21/2024]
Abstract
Africa remains significantly underrepresented in high-resolution Human Leukocyte Antigen (HLA) data, despite being one of the most genetically diverse regions in the world. This critical gap in genetic information poses a substantial barrier to HLA-based research on the continent. In this study, Class I HLA data from Eastern and Southern African populations were analysed to assess genetic diversity across the region. We examined allele and haplotype frequency distributions, deviations from Hardy-Weinberg Equilibrium (HWE), linkage disequilibrium (LD), and conducted neutrality tests of homozygosity across various populations. Additionally, the African HLA data were compared to those of Caucasian and African American populations using the Jaccard index and multidimensional scaling (MDS) methods. The study revealed that South African populations exhibited 50.4% more genetic diversity within the Class I HLA region compared to other African populations. Zambia showed an estimated 36.5% genetic diversity, with Kenya, Rwanda and Uganda showing 35.7%, 34.2%, and 31.1%, respectively. Furthermore, an analysis of in-country diversity among different tribes indicated an average Class I HLA diversity of 25.7% in Kenya, 17% in Rwanda, 2.8% in South Africa, 13.6% in Uganda, and 6.5% in Zambia. The study also highlighted the genetic distinctness of Caucasian and African American populations compared to African populations. Notably, the differential frequencies of disease-promoting and disease-preventing HLA alleles across these populations emphasize the urgent need to generate high-quality HLA data for all regions of Africa and its major ethnic groups. Such efforts will be crucial in enhancing healthcare outcomes across the continent.
Collapse
Affiliation(s)
- Alabi W Banjoko
- Africa Health Research Institute (AHRI), Nelson R. Mandela School of Medicine, Durban, South Africa
- Department of Statistics, University of Ilorin, Kwara state, Nigeria
| | - Tiza Ng'uni
- Africa Health Research Institute (AHRI), Nelson R. Mandela School of Medicine, Durban, South Africa
| | - Nitalia Naidoo
- Africa Health Research Institute (AHRI), Nelson R. Mandela School of Medicine, Durban, South Africa
| | - Veron Ramsuran
- School of Laboratory Medicine and Medical Sciences, College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Olivier Hyrien
- Fred Hutchinson Cancer Center, Vaccine and Infectious Disease Division, Vaccine and Immunology Statistical Centre, Seattle, USA
| | - Zaza M Ndhlovu
- Africa Health Research Institute (AHRI), Nelson R. Mandela School of Medicine, Durban, South Africa
- Ragon Institute of Massachusetts General Hospital, Massachusetts Institute of Technology, and Harvard University, Cambridge, MA, United States
- School of Laboratory Medicine and Medical Sciences, College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| |
Collapse
|
5
|
Robinson KM, Schiffthaler B, Liu H, Rydman SM, Rendón-Anaya M, Kalman TA, Kumar V, Canovi C, Bernhardsson C, Delhomme N, Jenkins J, Wang J, Mähler N, Richau KH, Stokes V, A'Hara S, Cottrell J, Coeck K, Diels T, Vandepoele K, Mannapperuma C, Park EJ, Plaisance S, Jansson S, Ingvarsson PK, Street NR. An Improved Chromosome-scale Genome Assembly and Population Genetics resource for Populus tremula. PHYSIOLOGIA PLANTARUM 2024; 176:e14511. [PMID: 39279509 DOI: 10.1111/ppl.14511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Revised: 08/16/2024] [Accepted: 08/20/2024] [Indexed: 09/18/2024]
Abstract
Aspen (Populus tremula L.) is a keystone species and a model system for forest tree genomics. We present an updated resource comprising a chromosome-scale assembly, population genetics and genomics data. Using the resource, we explore the genetic basis of natural variation in leaf size and shape, traits with complex genetic architecture. We generated the genome assembly using long-read sequencing, optical and high-density genetic maps. We conducted whole-genome resequencing of the Umeå Aspen (UmAsp) collection. Using the assembly and re-sequencing data from the UmAsp, Swedish Aspen (SwAsp) and Scottish Aspen (ScotAsp) collections we performed genome-wide association analyses (GWAS) using Single Nucleotide Polymorphisms (SNPs) for 26 leaf physiognomy phenotypes. We conducted Assay of Transposase Accessible Chromatin sequencing (ATAC-Seq), identified genomic regions of accessible chromatin, and subset SNPs to these regions, improving the GWAS detection rate. We identified candidate long non-coding RNAs in leaf samples, quantified their expression in an updated co-expression network, and used this to explore the functions of candidate genes identified from the GWAS. A GWAS found SNP associations for seven traits. The associated SNPs were in or near genes annotated with developmental functions, which represent candidates for further study. Of particular interest was a ~177-kbp region harbouring associations with several leaf phenotypes in ScotAsp. We have incorporated the assembly, population genetics, genomics, and GWAS data into the PlantGenIE.org web resource, including updating existing genomics data to the new genome version, to enable easy exploration and visualisation. We provide all raw and processed data to facilitate reuse in future studies.
Collapse
Affiliation(s)
- Kathryn M Robinson
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Bastian Schiffthaler
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Hui Liu
- National Engineering Laboratory for Tree Breeding; Key Laboratory of Genetics and Breeding in Forest Trees and Ornamental Plants, Ministry of Education; The Tree and Ornamental Plant Breeding and Biotechnology Laboratory of National Forestry and Grassland Administration, College of Biological Sciences and Technology, Beijing Forestry University, China
| | - Sara M Rydman
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Martha Rendón-Anaya
- Linnean Centre for Plant Biology, Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Science, Uppsala, Sweden
| | - Teitur Ahlgren Kalman
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Vikash Kumar
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Camilla Canovi
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Carolina Bernhardsson
- Evolutionary Biology Centre, Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - Nicolas Delhomme
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Science, Umeå, Sweden
| | - Jerry Jenkins
- Hudson-Alpha Institute for Biotechnology, Huntsville, Alabama, USA
| | - Jing Wang
- Key Laboratory for Bio-Resources and Eco-Environment, College of Life Science, Sichuan University, Chengdu, China
| | - Niklas Mähler
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Kerstin H Richau
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | | | - Stuart A'Hara
- Forest Research, Northern Research Station, Roslin, UK
| | - Joan Cottrell
- Forest Research, Northern Research Station, Roslin, UK
| | - Kizi Coeck
- Vlaams Instituut voor Biotechnologie Nucleomics Core, Leuven, Belgium
| | - Tim Diels
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Vlaams Instituut voor Biotechnologie Center for Plant Systems Biology, Ghent, Belgium
| | - Klaas Vandepoele
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Vlaams Instituut voor Biotechnologie Center for Plant Systems Biology, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Chanaka Mannapperuma
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Eung-Jun Park
- Forest Medicinal Resources Research Center, National Institute of Forest Science, Suwon, Korea
| | | | - Stefan Jansson
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Pär K Ingvarsson
- Linnean Centre for Plant Biology, Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Science, Uppsala, Sweden
| | - Nathaniel R Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
- Science for Life Laboratory, Umeå University, Umeå, Sweden
| |
Collapse
|
6
|
Braichenko S, Borges R, Kosiol C. Polymorphism-Aware Models in RevBayes: Species Trees, Disentangling Balancing Selection, and GC-Biased Gene Conversion. Mol Biol Evol 2024; 41:msae138. [PMID: 38980178 PMCID: PMC11272101 DOI: 10.1093/molbev/msae138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 04/19/2024] [Accepted: 07/06/2024] [Indexed: 07/10/2024] Open
Abstract
The role of balancing selection is a long-standing evolutionary puzzle. Balancing selection is a crucial evolutionary process that maintains genetic variation (polymorphism) over extended periods of time; however, detecting it poses a significant challenge. Building upon the Polymorphism-aware phylogenetic Models (PoMos) framework rooted in the Moran model, we introduce a PoMoBalance model. This novel approach is designed to disentangle the interplay of mutation, genetic drift, and directional selection (GC-biased gene conversion), along with the previously unexplored balancing selection pressures on ultra-long timescales comparable with species divergence times by analyzing multi-individual genomic and phylogenetic divergence data. Implemented in the open-source RevBayes Bayesian framework, PoMoBalance offers a versatile tool for inferring phylogenetic trees as well as quantifying various selective pressures. The novel aspect of our approach in studying balancing selection lies in polymorphism-aware phylogenetic models' ability to account for ancestral polymorphisms and incorporate parameters that measure frequency-dependent selection, allowing us to determine the strength of the effect and exact frequencies under selection. We implemented validation tests and assessed the model on the data simulated with SLiM and a custom Moran model simulator. Real sequence analysis of Drosophila populations reveals insights into the evolutionary dynamics of regions subject to frequency-dependent balancing selection, particularly in the context of sex-limited color dimorphism in Drosophila erecta.
Collapse
Affiliation(s)
- Svitlana Braichenko
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
- Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien 1210, Austria
| | - Carolin Kosiol
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
| |
Collapse
|
7
|
Cornetti L, Fields PD, Du Pasquier L, Ebert D. Long-term balancing selection for pathogen resistance maintains trans-species polymorphisms in a planktonic crustacean. Nat Commun 2024; 15:5333. [PMID: 38909039 PMCID: PMC11193740 DOI: 10.1038/s41467-024-49726-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 06/18/2024] [Indexed: 06/24/2024] Open
Abstract
Balancing selection is an evolutionary process that maintains genetic polymorphisms at selected loci and strongly reduces the likelihood of allele fixation. When allelic polymorphisms that predate speciation events are maintained independently in the resulting lineages, a pattern of trans-species polymorphisms may occur. Trans-species polymorphisms have been identified for loci related to mating systems and the MHC, but they are generally rare. Trans-species polymorphisms in disease loci are believed to be a consequence of long-term host-parasite coevolution by balancing selection, the so-called Red Queen dynamics. Here we scan the genomes of three crustaceans with a divergence of over 15 million years and identify 11 genes containing identical-by-descent trans-species polymorphisms with the same polymorphisms in all three species. Four of these genes display molecular footprints of balancing selection and have a function related to immunity. Three of them are located in or close to loci involved in resistance to a virulent bacterial pathogen, Pasteuria, with which the Daphnia host is known to coevolve. This provides rare evidence of trans-species polymorphisms for loci known to be functionally relevant in interactions with a widespread and highly specific parasite. These findings support the theory that specific antagonistic coevolution is able to maintain genetic diversity over millions of years.
Collapse
Affiliation(s)
- Luca Cornetti
- Department of Environmental Sciences, Zoology, University of Basel, Basel, Switzerland
- Syngenta Crop Protection AG, Stein, Switzerland
| | - Peter D Fields
- Department of Environmental Sciences, Zoology, University of Basel, Basel, Switzerland
| | - Louis Du Pasquier
- Department of Environmental Sciences, Zoology, University of Basel, Basel, Switzerland
| | - Dieter Ebert
- Department of Environmental Sciences, Zoology, University of Basel, Basel, Switzerland.
| |
Collapse
|
8
|
Soni V, Jensen JD. Temporal challenges in detecting balancing selection from population genomic data. G3 (BETHESDA, MD.) 2024; 14:jkae069. [PMID: 38551137 DOI: 10.1093/g3journal/jkae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 12/21/2023] [Accepted: 03/19/2024] [Indexed: 04/28/2024]
Abstract
The role of balancing selection in maintaining genetic variation remains an open question in population genetics. Recent years have seen numerous studies identifying candidate loci potentially experiencing balancing selection, most predominantly in human populations. There are however numerous alternative evolutionary processes that may leave similar patterns of variation, thereby potentially confounding inference, and the expected signatures of balancing selection additionally change in a temporal fashion. Here we use forward-in-time simulations to quantify expected statistical power to detect balancing selection using both site frequency spectrum- and linkage disequilibrium-based methods under a variety of evolutionarily realistic null models. We find that whilst site frequency spectrum-based methods have little power immediately after a balanced mutation begins segregating, power increases with time since the introduction of the balanced allele. Conversely, linkage disequilibrium-based methods have considerable power whilst the allele is young, and power dissipates rapidly as the time since introduction increases. Taken together, this suggests that site frequency spectrum-based methods are most effective at detecting long-term balancing selection (>25N generations since the introduction of the balanced allele) whilst linkage disequilibrium-based methods are effective over much shorter timescales (<1N generations), thereby leaving a large time frame over which current methods have little power to detect the action of balancing selection. Finally, we investigate the extent to which alternative evolutionary processes may mimic these patterns, and demonstrate the need for caution in attempting to distinguish the signatures of balancing selection from those of both neutral processes (e.g. population structure and admixture) as well as of alternative selective processes (e.g. partial selective sweeps).
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
9
|
Murray CS, Karram M, Bass DJ, Doceti M, Becker D, Nunez JCB, Ratan A, Bergland AO. Balancing selection and the functional effects of shared polymorphism in cryptic Daphnia species. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.16.589693. [PMID: 38659826 PMCID: PMC11042267 DOI: 10.1101/2024.04.16.589693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
The patterns of genetic variation within and between related taxa represent the genetic history of a species. Shared polymorphisms, loci with identical alleles across species, are of unique interest as they may represent cases of ancient selection maintaining functional variation post-speciation. In this study, we investigate the abundance of shared polymorphism in the Daphnia pulex species complex. We test whether shared mutations are consistent with the action of balancing selection or alternative hypotheses such as hybridization, incomplete lineage sorting, or convergent evolution. We analyzed over 2,000 genomes from North American and European D. pulex and several outgroup species to examine the prevalence and distribution of shared alleles between the focal species pair, North American and European D. pulex. We show that while North American and European D. pulex diverged over ten million years ago, they retained tens of thousands of shared alleles. We found that the number of shared polymorphisms between North American and European D. pulex cannot be explained by hybridization or incomplete lineage sorting alone. Instead, we show that most shared polymorphisms could be the product of convergent evolution, that a limited number appear to be old trans-specific polymorphisms, and that balancing selection is affecting young and ancient mutations alike. Finally, we provide evidence that a blue wavelength opsin gene with trans-specific polymorphisms has functional effects on behavior and fitness in the wild. Ultimately, our findings provide insights into the genetic basis of adaptation and the maintenance of genetic diversity between species.
Collapse
Affiliation(s)
- Connor S. Murray
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Madison Karram
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - David J. Bass
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Madison Doceti
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| | - Dörthe Becker
- Department of Biology, University of Virginia, Charlottesville, VA, USA
- School of Biosciences, Ecology and Evolutionary Biology, University of Sheffield, Sheffield, UK
| | | | - Aakrosh Ratan
- Center of Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Alan O. Bergland
- Department of Biology, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
10
|
Riley R, Mathieson I, Mathieson S. Interpreting generative adversarial networks to infer natural selection from genetic data. Genetics 2024; 226:iyae024. [PMID: 38386895 PMCID: PMC10990424 DOI: 10.1093/genetics/iyae024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 01/15/2024] [Accepted: 01/19/2024] [Indexed: 02/24/2024] Open
Abstract
Understanding natural selection and other forms of non-neutrality is a major focus for the use of machine learning in population genetics. Existing methods rely on computationally intensive simulated training data. Unlike efficient neutral coalescent simulations for demographic inference, realistic simulations of selection typically require slow forward simulations. Because there are many possible modes of selection, a high dimensional parameter space must be explored, with no guarantee that the simulated models are close to the real processes. Finally, it is difficult to interpret trained neural networks, leading to a lack of understanding about what features contribute to classification. Here we develop a new approach to detect selection and other local evolutionary processes that requires relatively few selection simulations during training. We build upon a generative adversarial network trained to simulate realistic neutral data. This consists of a generator (fitted demographic model), and a discriminator (convolutional neural network) that predicts whether a genomic region is real or fake. As the generator can only generate data under neutral demographic processes, regions of real data that the discriminator recognizes as having a high probability of being "real" do not fit the neutral demographic model and are therefore candidates for targets of selection. To incentivize identification of a specific mode of selection, we fine-tune the discriminator with a small number of custom non-neutral simulations. We show that this approach has high power to detect various forms of selection in simulations, and that it finds regions under positive selection identified by state-of-the-art population genetic methods in three human populations. Finally, we show how to interpret the trained networks by clustering hidden units of the discriminator based on their correlation patterns with known summary statistics.
Collapse
Affiliation(s)
- Rebecca Riley
- Department of Computer Science, Haverford College, Haverford, PA 19041, USA
| | - Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sara Mathieson
- Department of Computer Science, Haverford College, Haverford, PA 19041, USA
| |
Collapse
|
11
|
Liu S, Luo H, Zhang P, Li Y, Hao D, Zhang S, Song T, Xu T, He S. Adaptive Selection of Cis-regulatory Elements in the Han Chinese. Mol Biol Evol 2024; 41:msae034. [PMID: 38377343 PMCID: PMC10917166 DOI: 10.1093/molbev/msae034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 01/18/2024] [Accepted: 02/05/2024] [Indexed: 02/22/2024] Open
Abstract
Cis-regulatory elements have an important role in human adaptation to the living environment. However, the lag in population genomic cohort studies and epigenomic studies, hinders the research in the adaptive analysis of cis-regulatory elements in human populations. In this study, we collected 4,013 unrelated individuals and performed a comprehensive analysis of adaptive selection of genome-wide cis-regulatory elements in the Han Chinese. In total, 12.34% of genomic regions are under the influence of adaptive selection, where 1.00% of enhancers and 2.06% of promoters are under positive selection, and 0.06% of enhancers and 0.02% of promoters are under balancing selection. Gene ontology enrichment analysis of these cis-regulatory elements under adaptive selection reveals that many positive selections in the Han Chinese occur in pathways involved in cell-cell adhesion processes, and many balancing selections are related to immune processes. Two classes of adaptive cis-regulatory elements related to cell adhesion were in-depth analyzed, one is the adaptive enhancers derived from neanderthal introgression, leads to lower hyaluronidase level in skin, and brings better performance on UV-radiation resistance to the Han Chinese. Another one is the cis-regulatory elements regulating wound healing, and the results suggest the positive selection inhibits coagulation and promotes angiogenesis and wound healing in the Han Chinese. Finally, we found that many pathogenic alleles, such as risky alleles of type 2 diabetes or schizophrenia, remain in the population due to the hitchhiking effect of positive selections. Our findings will help deepen our understanding of the adaptive evolution of genome regulation in the Han Chinese.
Collapse
Affiliation(s)
- Shuai Liu
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Huaxia Luo
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Peng Zhang
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yanyan Li
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Di Hao
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Sijia Zhang
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tingrui Song
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Tao Xu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan 250117, Shandong, China
| | - Shunmin He
- Key Laboratory of Epigenetic Regulation and Intervention, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
12
|
Hayeck TJ, Li Y, Mosbruger TL, Bradfield JP, Gleason AG, Damianos G, Shaw GTW, Duke JL, Conlin LK, Turner TN, Fernández-Viña MA, Sarmady M, Monos DS. The Impact of Patterns in Linkage Disequilibrium and Sequencing Quality on the Imprint of Balancing Selection. Genome Biol Evol 2024; 16:evae009. [PMID: 38302106 PMCID: PMC10853003 DOI: 10.1093/gbe/evae009] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 01/08/2024] [Accepted: 01/12/2024] [Indexed: 02/03/2024] Open
Abstract
Regions under balancing selection are characterized by dense polymorphisms and multiple persistent haplotypes, along with other sequence complexities. Successful identification of these patterns depends on both the statistical approach and the quality of sequencing. To address this challenge, at first, a new statistical method called LD-ABF was developed, employing efficient Bayesian techniques to effectively test for balancing selection. LD-ABF demonstrated the most robust detection of selection in a variety of simulation scenarios, compared against a range of existing tests/tools (Tajima's D, HKA, Dng, BetaScan, and BalLerMix). Furthermore, the impact of the quality of sequencing on detection of balancing selection was explored, as well, using: (i) SNP genotyping and exome data, (ii) targeted high-resolution HLA genotyping (IHIW), and (iii) whole-genome long-read sequencing data (Pangenome). In the analysis of SNP genotyping and exome data, we identified known targets and 38 new selection signatures in genes not previously linked to balancing selection. To further investigate the impact of sequencing quality on detection of balancing selection, a detailed investigation of the MHC was performed with high-resolution HLA typing data. Higher quality sequencing revealed the HLA-DQ genes consistently demonstrated strong selection signatures otherwise not observed from the sparser SNP array and exome data. The HLA-DQ selection signature was also replicated in the Pangenome samples using considerably less samples but, with high-quality long-read sequence data. The improved statistical method, coupled with higher quality sequencing, leads to more consistent identification of selection and enhanced localization of variants under selection, particularly in complex regions.
Collapse
Affiliation(s)
- Tristan J Hayeck
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Yang Li
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Timothy L Mosbruger
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Adam G Gleason
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - George Damianos
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Grace Tzun-Wen Shaw
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Jamie L Duke
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Laura K Conlin
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Tychele N Turner
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Marcelo A Fernández-Viña
- Department of Pathology, Stanford University School of Medicine, Palo Alto, CA, USA
- Histocompatibility and Immunogenetics Laboratory, Stanford Blood Center, Palo Alto, CA, USA
| | - Mahdi Sarmady
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Dimitri S Monos
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
13
|
Richard-St-Hilaire A, Gamache I, Pelletier J, Grenier JC, Poujol R, Hussin JG. Signatures of Co-evolution and Co-regulation in the CYP3A and CYP4F Genes in Humans. Genome Biol Evol 2024; 16:evad236. [PMID: 38207129 PMCID: PMC10805436 DOI: 10.1093/gbe/evad236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 12/19/2023] [Accepted: 12/22/2023] [Indexed: 01/13/2024] Open
Abstract
Cytochromes P450 (CYP450) are hemoproteins generally involved in the detoxification of the body of xenobiotic molecules. They participate in the metabolism of many drugs and genetic polymorphisms in humans have been found to impact drug responses and metabolic functions. In this study, we investigate the genetic diversity of CYP450 genes. We found that two clusters, CYP3A and CYP4F, are notably differentiated across human populations with evidence for selective pressures acting on both clusters: we found signals of recent positive selection in CYP3A and CYP4F genes and signals of balancing selection in CYP4F genes. Furthermore, an extensive amount of unusual linkage disequilibrium is detected in this latter cluster, indicating co-evolution signatures among CYP4F genes. Several of the selective signals uncovered co-localize with expression quantitative trait loci (eQTL), which could suggest epistasis acting on co-regulation in these gene families. In particular, we detected a potential co-regulation event between CYP3A5 and CYP3A43, a gene whose function remains poorly characterized. We further identified a causal relationship between CYP3A5 expression and reticulocyte count through Mendelian randomization analyses, potentially involving a regulatory region displaying a selective signal specific to African populations. Our findings linking natural selection and gene expression in CYP3A and CYP4F subfamilies are of importance in understanding population differences in metabolism of nutrients and drugs.
Collapse
Affiliation(s)
- Alex Richard-St-Hilaire
- Département de biochimie et médecine moléculaire, Université de Montréal, Montreal, QC, Canada
- Sainte-Justine Hospital, Research Center, Montreal, QC, Canada
| | - Isabel Gamache
- Département de biochimie et médecine moléculaire, Université de Montréal, Montreal, QC, Canada
- Montreal Heart Institute, Research Center, Montreal, QC, Canada
| | - Justin Pelletier
- Département de biochimie et médecine moléculaire, Université de Montréal, Montreal, QC, Canada
- McGill CERC in Genomic Medicine, McGill University, Montreal, Canada
| | | | - Raphaël Poujol
- Montreal Heart Institute, Research Center, Montreal, QC, Canada
| | - Julie G Hussin
- Montreal Heart Institute, Research Center, Montreal, QC, Canada
- Département de médecine, Université de Montréal, Montreal, QC, Canada
- Mila-Quebec AI institute, Montreal, QC, Canada
| |
Collapse
|
14
|
Russo CAM, Eyre-Walker A, Katz LA, Gaut BS. Forty Years of Inferential Methods in the Journals of the Society for Molecular Biology and Evolution. Mol Biol Evol 2024; 41:msad264. [PMID: 38197288 PMCID: PMC10763999 DOI: 10.1093/molbev/msad264] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 11/27/2023] [Indexed: 01/11/2024] Open
Abstract
We are launching a series to celebrate the 40th anniversary of the first issue of Molecular Biology and Evolution. In 2024, we will publish virtual issues containing selected papers published in the Society for Molecular Biology and Evolution journals, Molecular Biology and Evolution and Genome Biology and Evolution. Each virtual issue will be accompanied by a perspective that highlights the historic and contemporary contributions of our journals to a specific topic in molecular evolution. This perspective, the first in the series, presents an account of the broad array of methods that have been published in the Society for Molecular Biology and Evolution journals, including methods to infer phylogenies, to test hypotheses in a phylogenetic framework, and to infer population genetic processes. We also mention many of the software implementations that make methods tractable for empiricists. In short, the Society for Molecular Biology and Evolution community has much to celebrate after four decades of publishing high-quality science including numerous important inferential methods.
Collapse
Affiliation(s)
- Claudia A M Russo
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | | | - Laura A Katz
- Department of Biological Sciences, Smith College, Northampton, MA, USA
| | - Brandon S Gaut
- School of Biological Sciences, University of California, Irvine, CA, USA
| |
Collapse
|
15
|
Chaumier T, Yang F, Manirakiza E, Ait-Mohamed O, Wu Y, Chandola U, Jesus B, Piganeau G, Groisillier A, Tirichine L. Genome-wide assessment of genetic diversity and transcript variations in 17 accessions of the model diatom Phaeodactylum tricornutum. ISME COMMUNICATIONS 2024; 4:ycad008. [PMID: 38304080 PMCID: PMC10833087 DOI: 10.1093/ismeco/ycad008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 12/01/2023] [Accepted: 12/13/2023] [Indexed: 02/03/2024]
Abstract
Diatoms, a prominent group of phytoplankton, have a significant impact on both the oceanic food chain and carbon sequestration, thereby playing a crucial role in regulating the climate. These highly diverse organisms show a wide geographic distribution across various latitudes. In addition to their ecological significance, diatoms represent a vital source of bioactive compounds that are widely used in biotechnology applications. In the present study, we investigated the genetic and transcriptomic diversity of 17 accessions of the model diatom Phaeodactylum tricornutum including those sampled a century ago as well as more recently collected accessions. The analysis of the data reveals a higher genetic diversity and the emergence of novel clades, indicating an increasing diversity within the P. tricornutum population structure, compared to the previous study and a persistent long-term balancing selection of genes in old and newly sampled accessions. However, the study did not establish a clear link between the year of sampling and genetic diversity, thereby, rejecting the hypothesis of loss of heterozygoty in cultured strains. Transcript analysis identified novel transcript including noncoding RNA and other categories of small RNA such as PiwiRNAs. Additionally, transcripts analysis using differential expression as well as Weighted Gene Correlation Network Analysis has provided evidence that the suppression or downregulation of genes cannot be solely attributed to loss-of-function mutations. This implies that other contributing factors, such as epigenetic modifications, may play a crucial role in regulating gene expression. Our study provides novel genetic resources, which are now accessible through the platform PhaeoEpiview (https://PhaeoEpiView.univ-nantes.fr), that offer both ease of use and advanced tools to further investigate microalgae biology and ecology, consequently enriching our current understanding of these organisms.
Collapse
Affiliation(s)
| | - Feng Yang
- Nantes Université, CNRS, US2B, UMR 6286, Nantes F-44000, France
| | - Eric Manirakiza
- Nantes Université, CNRS, US2B, UMR 6286, Nantes F-44000, France
| | - Ouardia Ait-Mohamed
- Immunity and Cancer Department, Institut Curie, PSL Research University, INSERM U932, Paris 75005, France
| | - Yue Wu
- Nantes Université, CNRS, US2B, UMR 6286, Nantes F-44000, France
| | - Udita Chandola
- Nantes Université, CNRS, US2B, UMR 6286, Nantes F-44000, France
| | - Bruno Jesus
- Institut des Substances et Organismes de la Mer, ISOMer, Nantes Université, UR 2160, Nantes F-44000, France
| | - Gwenael Piganeau
- Sorbonne Université, CNRS, Laboratoire de Biodiversité et Biotechnologies Microbiennes, LBBM, F-66650 Banyuls-sur-Mer, France
| | | | - Leila Tirichine
- Nantes Université, CNRS, US2B, UMR 6286, Nantes F-44000, France
| |
Collapse
|
16
|
Gao Z. Unveiling recent and ongoing adaptive selection in human populations. PLoS Biol 2024; 22:e3002469. [PMID: 38236800 PMCID: PMC10796035 DOI: 10.1371/journal.pbio.3002469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2024] Open
Abstract
Genome-wide scans for signals of selection have become a routine part of the analysis of population genomic variation datasets and have resulted in compelling evidence of selection during recent human evolution. This Essay spotlights methodological innovations that have enabled the detection of selection over very recent timescales, even in contemporary human populations. By harnessing large-scale genomic and phenotypic datasets, these new methods use different strategies to uncover connections between genotype, phenotype, and fitness. This Essay outlines the rationale and key findings of each strategy, discusses challenges in interpretation, and describes opportunities to improve detection and understanding of ongoing selection in human populations.
Collapse
Affiliation(s)
- Ziyue Gao
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| |
Collapse
|
17
|
Levi R, Levi L, Louzoun Y. Bw4 ligand and direct T-cell receptor binding induced selection on HLA A and B alleles. Front Immunol 2023; 14:1236080. [PMID: 38077375 PMCID: PMC10703150 DOI: 10.3389/fimmu.2023.1236080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 10/26/2023] [Indexed: 12/18/2023] Open
Abstract
Introduction The HLA region is the hallmark of balancing selection, argued to be driven by the pressure to present a wide variety of viral epitopes. As such selection on the peptide-binding positions has been proposed to drive HLA population genetics. MHC molecules also directly binds to the T-Cell Receptor and killer cell immunoglobulin-like receptors (KIR). Methods We here combine the HLA allele frequencies in over six-million Hematopoietic Stem Cells (HSC) donors with a novel machine-learning-based method to predict allele frequency. Results We show for the first time that allele frequency can be predicted from their sequences. This prediction yields a natural measure for selection. The strongest selection is affecting KIR binding regions, followed by the peptide-binding cleft. The selection from the direct interaction with the KIR and TCR is centered on positively charged residues (mainly Arginine), and some positions in the peptide-binding cleft are not associated with the allele frequency, especially Tyrosine residues. Discussion These results suggest that the balancing selection for peptide presentation is combined with a positive selection for KIR and TCR binding.
Collapse
Affiliation(s)
| | | | - Yoram Louzoun
- Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel
| |
Collapse
|
18
|
Zhao S, Chi L, Chen H. CEGA: a method for inferring natural selection by comparative population genomic analysis across species. Genome Biol 2023; 24:219. [PMID: 37789379 PMCID: PMC10548728 DOI: 10.1186/s13059-023-03068-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 09/20/2023] [Indexed: 10/05/2023] Open
Abstract
We developed maximum likelihood method for detecting positive selection or balancing selection using multilocus or genomic polymorphism and divergence data from two species. The method is especially useful for investigating natural selection in noncoding regions. Simulations demonstrate that the method outperforms existing methods in detecting both positive and balancing selection. We apply the method to population genomic data from human and chimpanzee. The list of genes identified under selection in the noncoding regions is prominently enriched in pathways related to the brain and nervous system. Therefore, our method will serve as a useful tool for comparative population genomic analysis.
Collapse
Affiliation(s)
- Shilei Zhao
- CAS Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China
- China National Center for Bioinformation, Beijing, 100101, China
- School of Future Technology, College of Life Sciences and Sino-Danish College, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Lianjiang Chi
- CAS Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China
- China National Center for Bioinformation, Beijing, 100101, China
| | - Hua Chen
- CAS Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China.
- China National Center for Bioinformation, Beijing, 100101, China.
- School of Future Technology, College of Life Sciences and Sino-Danish College, University of Chinese Academy of Sciences, Beijing, 100049, China.
- CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
| |
Collapse
|
19
|
Wang X, Ingvarsson PK. Quantifying adaptive evolution and the effects of natural selection across the Norway spruce genome. Mol Ecol 2023; 32:5288-5304. [PMID: 37622583 DOI: 10.1111/mec.17106] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 08/07/2023] [Accepted: 08/09/2023] [Indexed: 08/26/2023]
Abstract
Detecting natural selection is one of the major goals of evolutionary genomics. Here, we sequenced the whole genome of 25 Picea abies individuals and quantified the amount of selection across the genome. Using an estimate of the distribution of fitness effects, we showed that both negative selection and the rate of positively selected substitutions are very limited in coding regions. We found a positive correlation between the rate of adaptive substitutions and recombination rate and a negative correlation between the rate of adaptive substitutions and gene density, suggesting a widespread influence from Hill-Robertson interference on the efficiency of protein adaptation in P. abies. Finally, the distinct population statistics between genomic regions under either positive or balancing selection with that under neutral regions indicated the impact of natural selection on the genomic architecture of Norway spruce. Further gene ontology enrichment analysis for genes located in regions identified as undergoing either positive or long-term balancing selection also highlighted the specific molecular functions and biological processes that appear to be targets of selection in Norway spruce.
Collapse
Affiliation(s)
- Xi Wang
- Umeå Plant Science Centre, Department of Ecology and Environmental Science, Umeå University, Umeå, Sweden
| | - Pär K Ingvarsson
- Linnean Centre for Plant Biology, Department of Plant Biology, Swedish University of Agricultural Sciences, Uppsala, Sweden
| |
Collapse
|
20
|
Yong B, Zhu W, Wei S, Li B, Wang Y, Xu N, Lu J, Chen Q, He C. Parallel selection of loss-of-function alleles of Pdh1 orthologous genes in warm-season legumes for pod indehiscence and plasticity is related to precipitation. THE NEW PHYTOLOGIST 2023; 240:863-879. [PMID: 37501344 DOI: 10.1111/nph.19150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 07/04/2023] [Indexed: 07/29/2023]
Abstract
Pod dehiscence facilitates seed dispersal in wild legumes but results in yield loss in cultivated legumes. The evolutionary genetics of the legume pod dehiscence trait remain largely elusive. We characterized the pod dehiscence of chromosome segment substitution lines of Glycine max crossed with Glycine soja and found that the gene underlying the predominant quantitative trait locus (QTL) of soybean pod-shattering trait was Pod dehiscence 1 (Pdh1). A few rare loss-of-function (LoF) Pdh1 alleles were identified in G. soja, while only an allele featuring a premature stop codon was selected for pod indehiscence in cultivated soybean and spread to low-precipitation regions with increased frequency. Moreover, correlated interactions among Pdh1's haplotype, gene expression, and environmental changes for the developmental plasticity of the pod dehiscence trait were revealed in G. max. We found that orthologous Pdh1 genes specifically originated in warm-season legumes and their LoF alleles were then parallel-selected during the domestication of legume crops. Our results provide insights into the convergent evolution of pod dehiscence in warm-season legumes, facilitate an understanding of the intricate interactions between genetic robustness and environmental adaptation for developmental plasticity, and guide the breeding of new legume varieties with pod indehiscence.
Collapse
Affiliation(s)
- Bin Yong
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing, 100093, China
- China National Botanical Garden, Beijing, 100093, China
- University of Chinese Academy of Sciences, Yuquan Road 19A, Beijing, 100049, China
| | - Weiwei Zhu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing, 100093, China
- China National Botanical Garden, Beijing, 100093, China
- University of Chinese Academy of Sciences, Yuquan Road 19A, Beijing, 100049, China
| | - Siming Wei
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing, 100093, China
- China National Botanical Garden, Beijing, 100093, China
- University of Chinese Academy of Sciences, Yuquan Road 19A, Beijing, 100049, China
| | - Bingbing Li
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing, 100093, China
- China National Botanical Garden, Beijing, 100093, China
- University of Chinese Academy of Sciences, Yuquan Road 19A, Beijing, 100049, China
| | - Yan Wang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing, 100093, China
- China National Botanical Garden, Beijing, 100093, China
| | - Nan Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing, 100093, China
- China National Botanical Garden, Beijing, 100093, China
- University of Chinese Academy of Sciences, Yuquan Road 19A, Beijing, 100049, China
| | - Jiangjie Lu
- Zhejiang Provincial Key Laboratory for Genetic Improvement and Quality Control of Medicinal Plants, College of Life and Environmental Science, Hangzhou Normal University, Hangzhou, 311121, China
| | - Qingshan Chen
- College of Agriculture, Northeast Agricultural University, Harbin, 150030, China
| | - Chaoying He
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Nanxincun 20, Xiangshan, Beijing, 100093, China
- China National Botanical Garden, Beijing, 100093, China
- University of Chinese Academy of Sciences, Yuquan Road 19A, Beijing, 100049, China
- The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101, China
| |
Collapse
|
21
|
Tong X, Chen D, Hu J, Lin S, Ling Z, Ai H, Zhang Z, Huang L. Accurate haplotype construction and detection of selection signatures enabled by high quality pig genome sequences. Nat Commun 2023; 14:5126. [PMID: 37612277 PMCID: PMC10447580 DOI: 10.1038/s41467-023-40434-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 07/27/2023] [Indexed: 08/25/2023] Open
Abstract
High-quality whole-genome resequencing in large-scale pig populations with pedigree structure and multiple breeds would enable accurate construction of haplotype and robust selection-signature detection. Here, we sequence 740 pigs, combine with 149 of our previously published resequencing data, retrieve 207 resequencing datasets, and form a panel of worldwide distributed wild boars, aboriginal and highly selected pigs with pedigree structures, amounting to 1096 genomes from 43 breeds. Combining with their haplotype-informative reads and pedigree structure, we accurately construct a panel of 1874 haploid genomes with 41,964,356 genetic variants. We further demonstrate its valuable applications in GWAS by identifying five novel loci for intramuscular fat content, and in genomic selection by increasing the accuracy of estimated breeding value by 36.7%. In evolutionary selection, we detect MUC13 gene under a long-term balancing selection, as well as NPR3 gene under positive selection for pig stature. Our study provides abundant genomic variations for robust selection-signature detection and accurate haplotypes for deciphering complex traits in pigs.
Collapse
Affiliation(s)
- Xinkai Tong
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
- College of Life Sciences, Jiangxi Normal University, NanChang, Jiangxi Province, PR China
| | - Dong Chen
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Jianchao Hu
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Shiyao Lin
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Ziqi Ling
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Huashui Ai
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China
| | - Zhiyan Zhang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China.
| | - Lusheng Huang
- National Key Laboratory for Swine genetic improvement and production technology, Ministry of Science and Technology of China, Jiangxi Agricultural University, NanChang, Jiangxi Province, PR China.
| |
Collapse
|
22
|
Teterina AA, Willis JH, Lukac M, Jovelin R, Cutter AD, Phillips PC. Genomic diversity landscapes in outcrossing and selfing Caenorhabditis nematodes. PLoS Genet 2023; 19:e1010879. [PMID: 37585484 PMCID: PMC10461856 DOI: 10.1371/journal.pgen.1010879] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 08/28/2023] [Accepted: 07/21/2023] [Indexed: 08/18/2023] Open
Abstract
Caenorhabditis nematodes form an excellent model for studying how the mode of reproduction affects genetic diversity, as some species reproduce via outcrossing whereas others can self-fertilize. Currently, chromosome-level patterns of diversity and recombination are only available for self-reproducing Caenorhabditis, making the generality of genomic patterns across the genus unclear given the profound potential influence of reproductive mode. Here we present a whole-genome diversity landscape, coupled with a new genetic map, for the outcrossing nematode C. remanei. We demonstrate that the genomic distribution of recombination in C. remanei, like the model nematode C. elegans, shows high recombination rates on chromosome arms and low rates toward the central regions. Patterns of genetic variation across the genome are also similar between these species, but differ dramatically in scale, being tenfold greater for C. remanei. Historical reconstructions of variation in effective population size over the past million generations echo this difference in polymorphism. Evolutionary simulations demonstrate how selection, recombination, mutation, and selfing shape variation along the genome, and that multiple drivers can produce patterns similar to those observed in natural populations. The results illustrate how genome organization and selection play a crucial role in shaping the genomic pattern of diversity whereas demographic processes scale the level of diversity across the genome as a whole.
Collapse
Affiliation(s)
- Anastasia A. Teterina
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
- Center of Parasitology, Severtsov Institute of Ecology and Evolution RAS, Moscow, Russia
| | - John H. Willis
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
| | - Matt Lukac
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
| | - Richard Jovelin
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | - Asher D. Cutter
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | - Patrick C. Phillips
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, United States of America
| |
Collapse
|
23
|
Nandakumar M, Lundberg M, Carlsson F, Råberg L. Balancing selection on the complement system of a wild rodent. BMC Ecol Evol 2023; 23:21. [PMID: 37231383 DOI: 10.1186/s12862-023-02122-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 05/10/2023] [Indexed: 05/27/2023] Open
Abstract
BACKGROUND Selection pressure exerted by pathogens can influence patterns of genetic diversity in the host. In the immune system especially, numerous genes encode proteins involved in antagonistic interactions with pathogens, paving the way for coevolution that results in increased genetic diversity as a consequence of balancing selection. The complement system is a key component of innate immunity. Many complement proteins interact directly with pathogens, either by recognising pathogen molecules for complement activation, or by serving as targets of pathogen immune evasion mechanisms. Complement genes can therefore be expected to be important targets of pathogen-mediated balancing selection, but analyses of such selection on this part of the immune system have been limited. RESULTS Using a population sample of whole-genome resequencing data from wild bank voles (n = 31), we estimated the extent of genetic diversity and tested for signatures of balancing selection in multiple complement genes (n = 44). Complement genes showed higher values of standardised β (a statistic expected to be high under balancing selection) than the genome-wide average of protein coding genes. One complement gene, FCNA, a pattern recognition molecule that interacts directly with pathogens, was found to have a signature of balancing selection, as indicated by the Hudson-Kreitman-Aguadé test (HKA) test. Scans for localised signatures of balancing selection in this gene indicated that the target of balancing selection was found in exonic regions involved in ligand binding. CONCLUSION The present study adds to the growing evidence that balancing selection may be an important evolutionary force on components of the innate immune system. The identified target in the complement system typifies the expectation that balancing selection acts on genes encoding proteins involved in direct interactions with pathogens.
Collapse
Affiliation(s)
| | - Max Lundberg
- Department of Biology, Lund University, Lund, Sweden
| | | | - Lars Råberg
- Department of Biology, Lund University, Lund, Sweden
| |
Collapse
|
24
|
Huang G, Wu W, Chen Y, Zhi X, Zou P, Ning Z, Fan Q, Liu Y, Deng S, Zeng K, Zhou R. Balancing selection on an MYB transcription factor maintains the twig trichome color variation in Melastoma normale. BMC Biol 2023; 21:122. [PMID: 37226197 DOI: 10.1186/s12915-023-01611-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 05/03/2023] [Indexed: 05/26/2023] Open
Abstract
BACKGROUND The factors that maintain phenotypic and genetic variation within a population have received long-term attention in evolutionary biology. Here the genetic basis and evolution of the geographically widespread variation in twig trichome color (from red to white) in a shrub Melastoma normale was investigated using Pool-seq and evolutionary analyses. RESULTS The results show that the twig trichome coloration is under selection in different light environments and that a 6-kb region containing an R2R3 MYB transcription factor gene is the major region of divergence between the extreme red and white morphs. This gene has two highly divergent groups of alleles, one of which likely originated from introgression from another species in this genus and has risen to high frequency (> 0.6) within each of the three populations under investigation. In contrast, polymorphisms in other regions of the genome show no sign of differentiation between the two morphs, suggesting that genomic patterns of diversity have been shaped by homogenizing gene flow. Population genetics analysis reveals signals of balancing selection acting on this gene, and it is suggested that spatially varying selection is the most likely mechanism of balancing selection in this case. CONCLUSIONS This study demonstrate that polymorphisms on a single transcription factor gene largely confer the twig trichome color variation in M. normale, while also explaining how adaptive divergence can occur and be maintained in the face of gene flow.
Collapse
Affiliation(s)
- Guilian Huang
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Wei Wu
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Yongmei Chen
- College of Chemical Engineering, Sichuan University of Science & Engineering, Zigong, Sichuan, 643000, China
| | - Xueke Zhi
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Peishan Zou
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Zulin Ning
- Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Qiang Fan
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Ying Liu
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Shulin Deng
- Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK.
| | - Renchao Zhou
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-Sen University, Guangzhou, 510275, China.
| |
Collapse
|
25
|
Ahlquist KD, Sugden LA, Ramachandran S. Enabling interpretable machine learning for biological data with reliability scores. PLoS Comput Biol 2023; 19:e1011175. [PMID: 37235578 PMCID: PMC10249903 DOI: 10.1371/journal.pcbi.1011175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 06/08/2023] [Accepted: 05/10/2023] [Indexed: 05/28/2023] Open
Abstract
Machine learning tools have proven useful across biological disciplines, allowing researchers to draw conclusions from large datasets, and opening up new opportunities for interpreting complex and heterogeneous biological data. Alongside the rapid growth of machine learning, there have also been growing pains: some models that appear to perform well have later been revealed to rely on features of the data that are artifactual or biased; this feeds into the general criticism that machine learning models are designed to optimize model performance over the creation of new biological insights. A natural question arises: how do we develop machine learning models that are inherently interpretable or explainable? In this manuscript, we describe the SWIF(r) reliability score (SRS), a method building on the SWIF(r) generative framework that reflects the trustworthiness of the classification of a specific instance. The concept of the reliability score has the potential to generalize to other machine learning methods. We demonstrate the utility of the SRS when faced with common challenges in machine learning including: 1) an unknown class present in testing data that was not present in training data, 2) systemic mismatch between training and testing data, and 3) instances of testing data that have missing values for some attributes. We explore these applications of the SRS using a range of biological datasets, from agricultural data on seed morphology, to 22 quantitative traits in the UK Biobank, and population genetic simulations and 1000 Genomes Project data. With each of these examples, we demonstrate how the SRS can allow researchers to interrogate their data and training approach thoroughly, and to pair their domain-specific knowledge with powerful machine-learning frameworks. We also compare the SRS to related tools for outlier and novelty detection, and find that it has comparable performance, with the advantage of being able to operate when some data are missing. The SRS, and the broader discussion of interpretable scientific machine learning, will aid researchers in the biological machine learning space as they seek to harness the power of machine learning without sacrificing rigor and biological insight.
Collapse
Affiliation(s)
- K. D. Ahlquist
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, United States of America
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, Rhode Island, United States of America
| | - Lauren A. Sugden
- Department of Mathematics and Computer Science, Duquesne University, Pittsburgh, Pennsylvania, United States of America
| | - Sohini Ramachandran
- Center for Computational Molecular Biology, Brown University, Providence, Rhode Island, United States of America
- Department of Ecology, Evolution and Organismal Biology, Brown University, Providence, Rhode Island, United States of America
- Data Science Initiative, Brown University, Providence, Rhode Island, United States of America
| |
Collapse
|
26
|
Mathieson I, Day FR, Barban N, Tropf FC, Brazel DM, Vaez A, van Zuydam N, Bitarello BD, Gardner EJ, Akimova ET, Azad A, Bergmann S, Bielak LF, Boomsma DI, Bosak K, Brumat M, Buring JE, Cesarini D, Chasman DI, Chavarro JE, Cocca M, Concas MP, Davey Smith G, Davies G, Deary IJ, Esko T, Faul JD, Franco O, Ganna A, Gaskins AJ, Gelemanovic A, de Geus EJC, Gieger C, Girotto G, Gopinath B, Grabe HJ, Gunderson EP, Hayward C, He C, van Heemst D, Hill WD, Hoffmann ER, Homuth G, Hottenga JJ, Huang H, Hyppӧnen E, Ikram MA, Jansen R, Johannesson M, Kamali Z, Kardia SLR, Kavousi M, Kifley A, Kiiskinen T, Kraft P, Kühnel B, Langenberg C, Liew G, Lind PA, Luan J, Mägi R, Magnusson PKE, Mahajan A, Martin NG, Mbarek H, McCarthy MI, McMahon G, Medland SE, Meitinger T, Metspalu A, Mihailov E, Milani L, Missmer SA, Mitchell P, Møllegaard S, Mook-Kanamori DO, Morgan A, van der Most PJ, de Mutsert R, Nauck M, Nolte IM, Noordam R, Penninx BWJH, Peters A, Peyser PA, Polašek O, Power C, Pribisalic A, Redmond P, Rich-Edwards JW, Ridker PM, Rietveld CA, Ring SM, Rose LM, Rueedi R, Shukla V, Smith JA, Stankovic S, Stefánsson K, Stöckl D, Strauch K, Swertz MA, Teumer A, Thorleifsson G, Thorsteinsdottir U, Thurik AR, Timpson NJ, Turman C, Uitterlinden AG, Waldenberger M, Wareham NJ, Weir DR, Willemsen G, Zhao JH, Zhao W, Zhao Y, Snieder H, den Hoed M, Ong KK, Mills MC, Perry JRB. Genome-wide analysis identifies genetic effects on reproductive success and ongoing natural selection at the FADS locus. Nat Hum Behav 2023; 7:790-801. [PMID: 36864135 DOI: 10.1038/s41562-023-01528-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 01/12/2023] [Indexed: 03/04/2023]
Abstract
Identifying genetic determinants of reproductive success may highlight mechanisms underlying fertility and identify alleles under present-day selection. Using data in 785,604 individuals of European ancestry, we identified 43 genomic loci associated with either number of children ever born (NEB) or childlessness. These loci span diverse aspects of reproductive biology, including puberty timing, age at first birth, sex hormone regulation, endometriosis and age at menopause. Missense variants in ARHGAP27 were associated with higher NEB but shorter reproductive lifespan, suggesting a trade-off at this locus between reproductive ageing and intensity. Other genes implicated by coding variants include PIK3IP1, ZFP82 and LRP4, and our results suggest a new role for the melanocortin 1 receptor (MC1R) in reproductive biology. As NEB is one component of evolutionary fitness, our identified associations indicate loci under present-day natural selection. Integration with data from historical selection scans highlighted an allele in the FADS1/2 gene locus that has been under selection for thousands of years and remains so today. Collectively, our findings demonstrate that a broad range of biological mechanisms contribute to reproductive success.
Collapse
Affiliation(s)
- Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| | - Felix R Day
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Nicola Barban
- Alma Mater Studiorum University of Bologna, Bologna, Italy
| | - Felix C Tropf
- Nuffield College, University of Oxford, Oxford, UK
- École Nationale de la Statistique et de L'administration Économique (ENSAE), Paris, France
- Center for Research in Economics and Statistics (CREST), Paris, France
| | - David M Brazel
- Nuffield College, University of Oxford, Oxford, UK
- Leverhulme Centre for Demographic Science, University of Oxford, Oxford, UK
| | - Ahmad Vaez
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Bioinformatics, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Natalie van Zuydam
- Beijer Laboratory and Department of Immunology, Genetics and Pathology, Uppsala University and SciLifeLab, Uppsala, Sweden
| | - Bárbara D Bitarello
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Eugene J Gardner
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Evelina T Akimova
- Nuffield College, University of Oxford, Oxford, UK
- Leverhulme Centre for Demographic Science, University of Oxford, Oxford, UK
| | - Ajuna Azad
- DNRF Center for Chromosome Stability, Department of Cellular and Molecular Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Sven Bergmann
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Department of Integrative Biomedical Sciences, University of Cape Town, Cape Town, South Africa
| | - Lawrence F Bielak
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Dorret I Boomsma
- Department of Biological Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Reproduction and Development (AR&D) Research Institute, Amsterdam, the Netherlands
| | | | - Marco Brumat
- Department of Medical, Surgical and Health Sciences, University of Trieste, Trieste, Italy
| | - Julie E Buring
- Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - David Cesarini
- Department of Economics, New York University, New York, NY, USA
- Research Institute for Industrial Economics, Stockholm, Sweden
- National Bureau of Economic Research, Cambridge, MA, USA
| | - Daniel I Chasman
- Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Jorge E Chavarro
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Massimiliano Cocca
- Institute for Maternal and Child Health, IRCCS 'Burlo Garofolo', Trieste, Italy
| | - Maria Pina Concas
- Institute for Maternal and Child Health, IRCCS 'Burlo Garofolo', Trieste, Italy
| | | | - Gail Davies
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Ian J Deary
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Tõnu Esko
- Estonian Genome Center, University of Tartu, Tartu, Estonia
- Broad Institute of the Massachusetts Institute of Technology and Harvard University, Cambridge, MA, USA
| | - Jessica D Faul
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Oscar Franco
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Andrea Ganna
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Audrey J Gaskins
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | | | - Eco J C de Geus
- Department of Biological Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Christian Gieger
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Giorgia Girotto
- Department of Medical, Surgical and Health Sciences, University of Trieste, Trieste, Italy
- Institute for Maternal and Child Health, IRCCS 'Burlo Garofolo', Trieste, Italy
| | - Bamini Gopinath
- Centre for Vision Research, Westmead Institute for Medical Research and Department of Ophthalmology, University of Sydney, Sydney, New South Wales, Australia
| | - Hans Jörgen Grabe
- Department of Psychiatry and Psychotherapy, University Medicine Greifswald, Greifswald, Germany
| | - Erica P Gunderson
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Caroline Hayward
- Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK
| | - Chunyan He
- Markey Cancer Center, University of Kentucky, Lexington, KY, USA
- Department of Internal Medicine, Division of Medical Oncology, University of Kentucky College of Medicine, Lexington, KY, USA
| | - Diana van Heemst
- Department of Internal Medicine, Section of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands
| | - W David Hill
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Eva R Hoffmann
- DNRF Center for Chromosome Stability, Department of Cellular and Molecular Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Georg Homuth
- Interfaculty Institute for Genetics and Functional Genomics, University of Greifswald, Greifswald, Germany
| | - Jouke Jan Hottenga
- Department of Biological Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Hongyang Huang
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Elina Hyppӧnen
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, Adelaide, South Australia, Australia
- South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia
| | - M Arfan Ikram
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Rick Jansen
- Department of Psychiatry, Amsterdam Public Health and Amsterdam Neuroscience, Amsterdam UMC, Vrije Universiteit, Amsterdam, the Netherlands
| | - Magnus Johannesson
- Department of Economics, Stockholm School of Economics, Stockholm, Sweden
| | - Zoha Kamali
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Bioinformatics, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Sharon L R Kardia
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Maryam Kavousi
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Annette Kifley
- Centre for Vision Research, Westmead Institute for Medical Research and Department of Ophthalmology, University of Sydney, Sydney, New South Wales, Australia
| | - Tuomo Kiiskinen
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland
- National Institute for Health and Welfare, Helsinki, Finland
| | - Peter Kraft
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Brigitte Kühnel
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Claudia Langenberg
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Gerald Liew
- Centre for Vision Research, Westmead Institute for Medical Research and Department of Ophthalmology, University of Sydney, Sydney, New South Wales, Australia
| | - Penelope A Lind
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Jian'an Luan
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Reedik Mägi
- Estonian Genome Center, University of Tartu, Tartu, Estonia
| | - Patrik K E Magnusson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Anubha Mahajan
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Nicholas G Martin
- Genetic Epidemiology, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Hamdi Mbarek
- Department of Biological Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Qatar Genome Programme, Qatar Foundation Research, Development and Innovation, Qatar Foundation, Doha, Qatar
| | - Mark I McCarthy
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - George McMahon
- School of Social and Community Medicine, University of Bristol, Bristol, UK
| | - Sarah E Medland
- Psychiatric Genetics, QIMR Berghofer Medical Research Institute, Brisbane, Queensland, Australia
| | - Thomas Meitinger
- Institute of Human Genetics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Andres Metspalu
- Estonian Genome Center, University of Tartu, Tartu, Estonia
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | | | - Lili Milani
- Estonian Genome Center, University of Tartu, Tartu, Estonia
| | - Stacey A Missmer
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Division of Adolescent and Young Adult Medicine, Department of Medicine, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
- Department of Obstetrics, Gynecology, and Reproductive Biology, College of Human Medicine, Michigan State University, Grand Rapids, MI, USA
| | - Paul Mitchell
- Centre for Vision Research, Westmead Institute for Medical Research and Department of Ophthalmology, University of Sydney, Sydney, New South Wales, Australia
| | - Stine Møllegaard
- Department of Sociology, University of Copenhagen, Copenhagen, Denmark
| | - Dennis O Mook-Kanamori
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, the Netherlands
- Department of Public Health and Primary Care, Leiden University Medical Center, Leiden, the Netherlands
| | - Anna Morgan
- Institute for Maternal and Child Health, IRCCS 'Burlo Garofolo', Trieste, Italy
| | - Peter J van der Most
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Renée de Mutsert
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, the Netherlands
| | - Matthias Nauck
- Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Ilja M Nolte
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Raymond Noordam
- Department of Internal Medicine, Section of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands
| | - Brenda W J H Penninx
- Department of Psychiatry, EMGO Institute for Health and Care Research and Neuroscience Campus Amsterdam, VU University Medical Center/GGZ inGeest, Amsterdam, the Netherlands
| | - Annette Peters
- Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Patricia A Peyser
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Ozren Polašek
- University of Split School of Medicine, Split, Croatia
- Algebra University College, Zagreb, Croatia
| | - Chris Power
- Population, Policy and Practice Research and Teaching Department, UCL Great Ormond Street Institute of Child Health, London, UK
| | | | - Paul Redmond
- Lothian Birth Cohorts, Department of Psychology, University of Edinburgh, Edinburgh, UK
| | - Janet W Rich-Edwards
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Women's Health, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Paul M Ridker
- Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Cornelius A Rietveld
- Erasmus University Rotterdam Institute for Behavior and Biology, Rotterdam, the Netherlands
- Department of Applied Economics, Erasmus School of Economics, Rotterdam, the Netherlands
| | - Susan M Ring
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
| | | | - Rico Rueedi
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Vallari Shukla
- DNRF Center for Chromosome Stability, Department of Cellular and Molecular Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Jennifer A Smith
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Stasa Stankovic
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | | | - Doris Stöckl
- Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Konstantin Strauch
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center, Johannes Gutenberg University, Mainz, Germany
- Institute of Genetic Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Chair of Genetic Epidemiology, IBE, Faculty of Medicine, LMU, Munich, Germany
| | - Morris A Swertz
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Alexander Teumer
- Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | | | | | - A Roy Thurik
- Erasmus University Rotterdam Institute for Behavior and Biology, Rotterdam, the Netherlands
- Department of Applied Economics, Erasmus School of Economics, Rotterdam, the Netherlands
- Montpellier Business School, Montpellier, France
| | | | - Constance Turman
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - André G Uitterlinden
- Erasmus University Rotterdam Institute for Behavior and Biology, Rotterdam, the Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Melanie Waldenberger
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Nicholas J Wareham
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - David R Weir
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Gonneke Willemsen
- Department of Biological Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Jing Hau Zhao
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Wei Zhao
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, USA
| | - Yajie Zhao
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Harold Snieder
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Marcel den Hoed
- Beijer Laboratory and Department of Immunology, Genetics and Pathology, Uppsala University and SciLifeLab, Uppsala, Sweden
| | - Ken K Ong
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Melinda C Mills
- Nuffield College, University of Oxford, Oxford, UK.
- Leverhulme Centre for Demographic Science, University of Oxford, Oxford, UK.
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands.
- Department of Economics, Econometrics and Finance, University of Groningen, Groningen, the Netherlands.
| | - John R B Perry
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Cambridge, UK.
| |
Collapse
|
27
|
Fluctuating selection and the determinants of genetic variation. Trends Genet 2023; 39:491-504. [PMID: 36890036 DOI: 10.1016/j.tig.2023.02.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 02/01/2023] [Accepted: 02/07/2023] [Indexed: 03/08/2023]
Abstract
Recent studies of cosmopolitan Drosophila populations have found hundreds to thousands of genetic loci with seasonally fluctuating allele frequencies, bringing temporally fluctuating selection to the forefront of the historical debate surrounding the maintenance of genetic variation in natural populations. Numerous mechanisms have been explored in this longstanding area of research, but these exciting empirical findings have prompted several recent theoretical and experimental studies that seek to better understand the drivers, dynamics, and genome-wide influence of fluctuating selection. In this review, we evaluate the latest evidence for multilocus fluctuating selection in Drosophila and other taxa, highlighting the role of potential genetic and ecological mechanisms in maintaining these loci and their impacts on neutral genetic variation.
Collapse
|
28
|
Aqil A, Speidel L, Pavlidis P, Gokcumen O. Balancing selection on genomic deletion polymorphisms in humans. eLife 2023; 12:79111. [PMID: 36625544 PMCID: PMC9943071 DOI: 10.7554/elife.79111] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 01/05/2023] [Indexed: 01/11/2023] Open
Abstract
A key question in biology is why genomic variation persists in a population for extended periods. Recent studies have identified examples of genomic deletions that have remained polymorphic in the human lineage for hundreds of millennia, ostensibly owing to balancing selection. Nevertheless, genome-wide investigation of ancient and possibly adaptive deletions remains an imperative exercise. Here, we demonstrate an excess of polymorphisms in present-day humans that predate the modern human-Neanderthal split (ancient polymorphisms), which cannot be explained solely by selectively neutral scenarios. We analyze the adaptive mechanisms that underlie this excess in deletion polymorphisms. Using a previously published measure of balancing selection, we show that this excess of ancient deletions is largely owing to balancing selection. Based on the absence of signatures of overdominance, we conclude that it is a rare mode of balancing selection among ancient deletions. Instead, more complex scenarios involving spatially and temporally variable selective pressures are likely more common mechanisms. Our results suggest that balancing selection resulted in ancient deletions harboring disproportionately more exonic variants with GWAS (genome-wide association studies) associations. We further found that ancient deletions are significantly enriched for traits related to metabolism and immunity. As a by-product of our analysis, we show that deletions are, on average, more deleterious than single nucleotide variants. We can now argue that not only is a vast majority of common variants shared among human populations, but a considerable portion of biologically relevant variants has been segregating among our ancestors for hundreds of thousands, if not millions, of years.
Collapse
Affiliation(s)
- Alber Aqil
- Department of Biological Sciences, University at BuffaloBuffaloUnited States
| | - Leo Speidel
- University College London, Genetics InstituteLondonUnited Kingdom
- The Francis Crick InstituteLondonUnited Kingdom
| | - Pavlos Pavlidis
- Institute of Computer Science (ICS), Foundation of Research and Technology-HellasHeraklionGreece
| | - Omer Gokcumen
- Department of Biological Sciences, University at BuffaloBuffaloUnited States
| |
Collapse
|
29
|
Cascalho M, Platt JL. TNFRSF13B in B cell responses to organ transplantation. Hum Immunol 2023; 84:27-33. [PMID: 36333165 PMCID: PMC10429825 DOI: 10.1016/j.humimm.2022.09.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 09/14/2022] [Accepted: 09/27/2022] [Indexed: 11/07/2022]
Abstract
Antibodies directed against organ transplants are thought to pose the most vexing hurdle to enduring function and survival of the transplants, particularly organ xenotransplants, and accordingly basic and clinical investigation has focused on elucidating the specificity and pathogenicity of graft-specific antibodies. While much has been learned about these matters, far less is known about the B cells producing graft-specific antibodies and why these antibodies appear to injure some grafts but not others. With the goal of addressing those questions, we have investigated the properties of tumor necrosis factor receptor super family-13B (TNFRSF13B), which regulates various aspects of B cell responses. A full understanding of the functions of TNFRSF13B however is hindered by extreme polymorphism and by diversity of interactions of the protein. Nevertheless, TNFRSF13B variants have been found to exert distinct impact on natural and elicited antibody responses and host defense and mutations of TNFRSF13B have been found to influence the propensity for development of antibody-mediated rejection of organ transplants. Because B cell responses potentially limit application of xenotransplantation, understanding how TNFRSF13B diversity and TNFRSF13B variants govern immunity in xenotransplantation could inspire development of novel therapeutics that could in turn accelerate clinical implementation of xenotransplantation.
Collapse
Affiliation(s)
- Marilia Cascalho
- Department of Surgery and Department of Microbiology & Immunology, University of Michigan, Ann Arbor, MI, United States.
| | - Jeffrey L Platt
- Department of Surgery and Department of Microbiology & Immunology, University of Michigan, Ann Arbor, MI, United States.
| |
Collapse
|
30
|
Prioritizing autoimmunity risk variants for functional analyses by fine-mapping mutations under natural selection. Nat Commun 2022; 13:7069. [PMID: 36400766 PMCID: PMC9674589 DOI: 10.1038/s41467-022-34461-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 10/25/2022] [Indexed: 11/19/2022] Open
Abstract
Pathogen-driven selection shaped adaptive mutations in immunity genes, including those contributing to inflammatory disorders. Functional characterization of such adaptive variants can shed light on disease biology and past adaptations. This popular idea, however, was difficult to test due to challenges in pinpointing adaptive mutations in selection footprints. In this study, using a local-tree-based approach, we show that 28% of risk loci (153/535) in 21 inflammatory disorders bear footprints of moderate and weak selection, and part of them are population specific. Weak selection footprints allow partial fine-mapping, and we show that in 19% (29/153) of the risk loci under selection, candidate disease variants are hitchhikers, and only in 39% of cases they are likely selection targets. We predict function for a subset of these selected SNPs and highlight examples of antagonistic pleiotropy. We conclude by offering disease variants under selection that can be tested functionally using infectious agents and other stressors to decipher the poorly understood link between environmental stressors and genetic risk in inflammatory conditions.
Collapse
|
31
|
Abraham A, LaBella AL, Capra JA, Rokas A. Mosaic patterns of selection in genomic regions associated with diverse human traits. PLoS Genet 2022; 18:e1010494. [PMID: 36342969 PMCID: PMC9671423 DOI: 10.1371/journal.pgen.1010494] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 11/17/2022] [Accepted: 10/21/2022] [Indexed: 11/09/2022] Open
Abstract
Natural selection shapes the genetic architecture of many human traits. However, the prevalence of different modes of selection on genomic regions associated with variation in traits remains poorly understood. To address this, we developed an efficient computational framework to calculate positive and negative enrichment of different evolutionary measures among regions associated with complex traits. We applied the framework to summary statistics from >900 genome-wide association studies (GWASs) and 11 evolutionary measures of sequence constraint, population differentiation, and allele age while accounting for linkage disequilibrium, allele frequency, and other potential confounders. We demonstrate that this framework yields consistent results across GWASs with variable sample sizes, numbers of trait-associated SNPs, and analytical approaches. The resulting evolutionary atlas maps diverse signatures of selection on genomic regions associated with complex human traits on an unprecedented scale. We detected positive enrichment for sequence conservation among trait-associated regions for the majority of traits (>77% of 290 high power GWASs), which included reproductive traits. Many traits also exhibited substantial positive enrichment for population differentiation, especially among hair, skin, and pigmentation traits. In contrast, we detected widespread negative enrichment for signatures of balancing selection (51% of GWASs) and absence of enrichment for evolutionary signals in regions associated with late-onset Alzheimer's disease. These results support a pervasive role for negative selection on regions of the human genome that contribute to variation in complex traits, but also demonstrate that diverse modes of evolution are likely to have shaped trait-associated loci. This atlas of evolutionary signatures across the diversity of available GWASs will enable exploration of the relationship between the genetic architecture and evolutionary processes in the human genome.
Collapse
Affiliation(s)
- Abin Abraham
- Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Abigail L. LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, North Carolina, United States of America
- North Carolina Research Center, Kannapolis, North Carolina, United States of America
| | - John A. Capra
- Bakar Computational Health Sciences Institute, University of California, San Francisco, California, United States of America
- Department of Epidemiology and Biostatistics, University of California, San Francisco, California, United States of America
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Evolutionary Studies Initiative, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
32
|
Nikolakis ZL, Adams RH, Wade KJ, Lund AJ, Carlton EJ, Castoe TA, Pollock DD. Prospects for genomic surveillance for selection in schistosome parasites. FRONTIERS IN EPIDEMIOLOGY 2022; 2:932021. [PMID: 38455290 PMCID: PMC10910990 DOI: 10.3389/fepid.2022.932021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 09/12/2022] [Indexed: 03/09/2024]
Abstract
Schistosomiasis is a neglected tropical disease caused by multiple parasitic Schistosoma species, and which impacts over 200 million people globally, mainly in low- and middle-income countries. Genomic surveillance to detect evidence for natural selection in schistosome populations represents an emerging and promising approach to identify and interpret schistosome responses to ongoing control efforts or other environmental factors. Here we review how genomic variation is used to detect selection, how these approaches have been applied to schistosomes, and how future studies to detect selection may be improved. We discuss the theory of genomic analyses to detect selection, identify experimental designs for such analyses, and review studies that have applied these approaches to schistosomes. We then consider the biological characteristics of schistosomes that are expected to respond to selection, particularly those that may be impacted by control programs. Examples include drug resistance, host specificity, and life history traits, and we review our current understanding of specific genes that underlie them in schistosomes. We also discuss how inherent features of schistosome reproduction and demography pose substantial challenges for effective identification of these traits and their genomic bases. We conclude by discussing how genomic surveillance for selection should be designed to improve understanding of schistosome biology, and how the parasite changes in response to selection.
Collapse
Affiliation(s)
- Zachary L. Nikolakis
- Department of Biology, University of Texas at Arlington, Arlington, TX, United States
| | - Richard H. Adams
- Department of Biological and Environmental Sciences, Georgia College and State University, Milledgeville, GA, United States
| | - Kristen J. Wade
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO, United States
| | - Andrea J. Lund
- Department of Environmental and Occupational Health, Colorado School of Public Health, University of Colorado, Anschutz, Aurora, CO, United States
| | - Elizabeth J. Carlton
- Department of Environmental and Occupational Health, Colorado School of Public Health, University of Colorado, Anschutz, Aurora, CO, United States
| | - Todd A. Castoe
- Department of Biology, University of Texas at Arlington, Arlington, TX, United States
| | - David D. Pollock
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO, United States
| |
Collapse
|
33
|
Gagalova KK, Warren RL, Coombe L, Wong J, Nip KM, Yuen MMS, Whitehill JGA, Celedon JM, Ritland C, Taylor GA, Cheng D, Plettner P, Hammond SA, Mohamadi H, Zhao Y, Moore RA, Mungall AJ, Boyle B, Laroche J, Cottrell J, Mackay JJ, Lamothe M, Gérardi S, Isabel N, Pavy N, Jones SJM, Bohlmann J, Bousquet J, Birol I. Spruce giga-genomes: structurally similar yet distinctive with differentially expanding gene families and rapidly evolving genes. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 111:1469-1485. [PMID: 35789009 DOI: 10.1111/tpj.15889] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 06/22/2022] [Accepted: 06/27/2022] [Indexed: 06/15/2023]
Abstract
Spruces (Picea spp.) are coniferous trees widespread in boreal and mountainous forests of the northern hemisphere, with large economic significance and enormous contributions to global carbon sequestration. Spruces harbor very large genomes with high repetitiveness, hampering their comparative analysis. Here, we present and compare the genomes of four different North American spruces: the genome assemblies for Engelmann spruce (Picea engelmannii) and Sitka spruce (Picea sitchensis) together with improved and more contiguous genome assemblies for white spruce (Picea glauca) and for a naturally occurring introgress of these three species known as interior spruce (P. engelmannii × glauca × sitchensis). The genomes were structurally similar, and a large part of scaffolds could be anchored to a genetic map. The composition of the interior spruce genome indicated asymmetric contributions from the three ancestral genomes. Phylogenetic analysis of the nuclear and organelle genomes revealed a topology indicative of ancient reticulation. Different patterns of expansion of gene families among genomes were observed and related with presumed diversifying ecological adaptations. We identified rapidly evolving genes that harbored high rates of non-synonymous polymorphisms relative to synonymous ones, indicative of positive selection and its hitchhiking effects. These gene sets were mostly distinct between the genomes of ecologically contrasted species, and signatures of convergent balancing selection were detected. Stress and stimulus response was identified as the most frequent function assigned to expanding gene families and rapidly evolving genes. These two aspects of genomic evolution were complementary in their contribution to divergent evolution of presumed adaptive nature. These more contiguous spruce giga-genome sequences should strengthen our understanding of conifer genome structure and evolution, as their comparison offers clues into the genetic basis of adaptation and ecology of conifers at the genomic level. They will also provide tools to better monitor natural genetic diversity and improve the management of conifer forests. The genomes of four closely related North American spruces indicate that their high similarity at the morphological level is paralleled by the high conservation of their physical genome structure. Yet, the evidence of divergent evolution is apparent in their rapidly evolving genomes, supported by differential expansion of key gene families and large sets of genes under positive selection, largely in relation to stimulus and environmental stress response.
Collapse
Affiliation(s)
- Kristina K Gagalova
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - René L Warren
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Lauren Coombe
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Johnathan Wong
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Ka Ming Nip
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Macaire Man Saint Yuen
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Justin G A Whitehill
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Jose M Celedon
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Carol Ritland
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Greg A Taylor
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Dean Cheng
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Patrick Plettner
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - S Austin Hammond
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
- Next-Generation Sequencing Facility, University of Saskatchewan, Saskatoon, SK, S7N 5E5, Canada
| | - Hamid Mohamadi
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Yongjun Zhao
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Richard A Moore
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Andrew J Mungall
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Brian Boyle
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada
| | - Jérôme Laroche
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada
| | - Joan Cottrell
- Forest Research, U.K. Forestry Commission, Northern Research Station, Roslin, EH25 9SY, Midlothian, UK
| | - John J Mackay
- Department of Plant Sciences, University of Oxford, Oxford, OX1 3RB, UK
| | - Manuel Lamothe
- Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Centre, Québec, QC, G1V 4C7, Canada
| | - Sébastien Gérardi
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Nathalie Isabel
- Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Centre, Québec, QC, G1V 4C7, Canada
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Nathalie Pavy
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Steven J M Jones
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| | - Joerg Bohlmann
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Jean Bousquet
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC, GIV 0A6, Canada
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC, G1V 0A6, Canada
| | - Inanc Birol
- Canada's Michael Smith Genome Sciences Centre, Vancouver, BC, V5Z 4S6, Canada
| |
Collapse
|
34
|
Schield DR, Perry BW, Adams RH, Holding ML, Nikolakis ZL, Gopalan SS, Smith CF, Parker JM, Meik JM, DeGiorgio M, Mackessy SP, Castoe TA. The roles of balancing selection and recombination in the evolution of rattlesnake venom. Nat Ecol Evol 2022; 6:1367-1380. [PMID: 35851850 PMCID: PMC9888523 DOI: 10.1038/s41559-022-01829-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 06/15/2022] [Indexed: 02/02/2023]
Abstract
The origin of snake venom involved duplication and recruitment of non-venom genes into venom systems. Several studies have predicted that directional positive selection has governed this process. Venom composition varies substantially across snake species and venom phenotypes are locally adapted to prey, leading to coevolutionary interactions between predator and prey. Venom origins and contemporary snake venom evolution may therefore be driven by fundamentally different selection regimes, yet investigations of population-level patterns of selection have been limited. Here, we use whole-genome data from 68 rattlesnakes to test hypotheses about the factors that drive genomic diversity and differentiation in major venom gene regions. We show that selection has resulted in long-term maintenance of genetic diversity within and between species in multiple venom gene families. Our findings are inconsistent with a dominant role of directional positive selection and instead support a role of long-term balancing selection in shaping venom evolution. We also detect rapid decay of linkage disequilibrium due to high recombination rates in venom regions, suggesting that venom genes have reduced selective interference with nearby loci, including other venom paralogues. Our results provide an example of long-term balancing selection that drives trans-species polymorphism and help to explain how snake venom keeps pace with prey resistance.
Collapse
Affiliation(s)
- Drew R Schield
- Department of Biology, University of Texas at Arlington, Arlington, TX, USA.
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO, USA.
| | - Blair W Perry
- Department of Biology, University of Texas at Arlington, Arlington, TX, USA
- School of Biological Sciences, Washington State University, Pullman, WA, USA
| | - Richard H Adams
- Department of Biological and Environmental Sciences, Georgia College and State University, Milledgeville, GA, USA
| | | | | | | | - Cara F Smith
- School of Biological Sciences, University of Northern Colorado, Greeley, CO, USA
| | - Joshua M Parker
- Life Science Department, Fresno City College, Fresno, CA, USA
| | - Jesse M Meik
- Department of Biological Sciences, Tarleton State University, Stephenville, TX, USA
| | - Michael DeGiorgio
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
| | - Stephen P Mackessy
- School of Biological Sciences, University of Northern Colorado, Greeley, CO, USA
| | - Todd A Castoe
- Department of Biology, University of Texas at Arlington, Arlington, TX, USA.
| |
Collapse
|
35
|
Markowitz RHG, LaBella AL, Shi M, Rokas A, Capra JA, Ferguson JF, Mosley JD, Bordenstein SR. Microbiome-associated human genetic variants impact phenome-wide disease risk. Proc Natl Acad Sci U S A 2022; 119:e2200551119. [PMID: 35749358 PMCID: PMC9245617 DOI: 10.1073/pnas.2200551119] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 04/29/2022] [Indexed: 12/26/2022] Open
Abstract
Human genetic variation associates with the composition of the gut microbiome, yet its influence on clinical traits remains largely unknown. We analyzed the consequences of nearly a thousand gut microbiome-associated variants (MAVs) on phenotypes reported in electronic health records from tens of thousands of individuals. We discovered and replicated associations of MAVs with neurological, metabolic, digestive, and circulatory diseases. Five significant MAVs in these categories correlate with the relative abundance of microbes down to the strain level. We also demonstrate that these relationships are independently observed and concordant with microbe by disease associations reported in case-control studies. Moreover, a selective sweep and population differentiation impacted some disease-linked MAVs. Combined, these findings establish triad relationships among the human genome, microbiome, and disease. Consequently, human genetic influences may offer opportunities for precision diagnostics of microbiome-associated diseases but also highlight the relevance of genetic background for microbiome modulation and therapeutics.
Collapse
Affiliation(s)
- Robert H. George Markowitz
- Vanderbilt Microbiome Innovation Center, Vanderbilt University, Nashville, TN 37232
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232
| | | | - Mingjian Shi
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232
| | - John A. Capra
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94143
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA 94143
| | - Jane F. Ferguson
- Vanderbilt Microbiome Innovation Center, Vanderbilt University, Nashville, TN 37232
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232
| | - Jonathan D. Mosley
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232
| | - Seth R. Bordenstein
- Vanderbilt Microbiome Innovation Center, Vanderbilt University, Nashville, TN 37232
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37232
- Vanderbilt Institute for Infection, Immunology and Inflammation, Vanderbilt University Medical Center, Nashville, TN 37232
- Department of Pathology, Microbiology, and Immunology, School of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232
| |
Collapse
|
36
|
Soni V, Vos M, Eyre-Walker A. A new test suggests hundreds of amino acid polymorphisms in humans are subject to balancing selection. PLoS Biol 2022; 20:e3001645. [PMID: 35653351 PMCID: PMC9162324 DOI: 10.1371/journal.pbio.3001645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 04/25/2022] [Indexed: 11/18/2022] Open
Abstract
The role that balancing selection plays in the maintenance of genetic diversity remains unresolved. Here, we introduce a new test, based on the McDonald–Kreitman test, in which the number of polymorphisms that are shared between populations is contrasted to those that are private at selected and neutral sites. We show that this simple test is robust to a variety of demographic changes, and that it can also give a direct estimate of the number of shared polymorphisms that are directly maintained by balancing selection. We apply our method to population genomic data from humans and provide some evidence that hundreds of nonsynonymous polymorphisms are subject to balancing selection.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Michiel Vos
- European Centre for Environment and Human Health, University of Exeter Medical School, Environment and Sustainability Institute, Penryn, United Kingdom
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- * E-mail:
| |
Collapse
|
37
|
Velazquez-Arcelay K, Benton ML, Capra JA. Diverse functions associate with non-coding polymorphisms shared between humans and chimpanzees. BMC Ecol Evol 2022; 22:68. [PMID: 35606693 PMCID: PMC9125839 DOI: 10.1186/s12862-022-02020-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 05/09/2022] [Indexed: 11/24/2022] Open
Abstract
Background Long-term balancing selection (LTBS) can maintain allelic variation at a locus over millions of years and through speciation events. Variants shared between species in the state of identity-by-descent, hereafter “trans-species polymorphisms”, can result from LTBS, often due to host–pathogen interactions. For instance, the major histocompatibility complex (MHC) locus contains TSPs present across primates. Several hundred candidate LTBS regions have been identified in humans and chimpanzees; however, because many are in non-protein-coding regions of the genome, the functions and potential adaptive roles for most remain unknown. Results We integrated diverse genomic annotations to explore the functions of 60 previously identified regions with multiple shared polymorphisms (SPs) between humans and chimpanzees, including 19 with strong evidence of LTBS. We analyzed genome-wide functional assays, expression quantitative trait loci (eQTL), genome-wide association studies (GWAS), and phenome-wide association studies (PheWAS) for all the regions. We identify functional annotations for 59 regions, including 58 with evidence of gene regulatory function from GTEx or functional genomics data and 19 with evidence of trait association from GWAS or PheWAS. As expected, the SPs associate in humans with many immune system phenotypes, including response to pathogens, but we also find associations with a range of other phenotypes, including body size, alcohol intake, cognitive performance, risk-taking behavior, and urate levels. Conclusions The diversity of traits associated with non-coding regions with multiple SPs support previous hypotheses that functions beyond the immune system are likely subject to LTBS. Furthermore, several of these trait associations provide support and candidate genetic loci for previous hypothesis about behavioral diversity in human and chimpanzee populations, such as the importance of variation in risk sensitivity. Supplementary Information The online version contains supplementary material available at 10.1186/s12862-022-02020-x.
Collapse
|
38
|
Alharbi AF, Sheng N, Nicol K, Strömberg N, Hollox EJ. Balancing selection at the human salivary agglutinin gene (DMBT1) driven by host-microbe interactions. iScience 2022; 25:104189. [PMID: 35494225 PMCID: PMC9038570 DOI: 10.1016/j.isci.2022.104189] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 02/07/2022] [Accepted: 03/30/2022] [Indexed: 11/19/2022] Open
Abstract
Discovering loci under balancing selection in humans can identify loci with alleles that affect response to the environment and disease. Genome variation data have identified the 5′ region of the DMBT1 gene as undergoing balancing selection in humans. DMBT1 encodes the pattern-recognition glycoprotein DMBT1, also known as SALSA, gp340, or salivary agglutinin. DMBT1 binds to a variety of pathogens through a tandemly arranged scavenger receptor cysteine-rich (SRCR) domain, with the number of domains polymorphic in humans. We show that the signal of balancing selection is driven by one haplotype usually carrying a shorter SRCR repeat and another usually carrying a longer SRCR repeat. DMBT1 encoded by a shorter SRCR repeat allele does not bind a cariogenic and invasive Streptococcus mutans strain, in contrast to the long SRCR allele that shows binding. Our results suggest that balancing selection at DMBT1 is due to host-microbe interactions of encoded SRCR tandem repeat alleles. Clear evidence from many analyses show balancing selection at DMBT1 Scavenger-receptor cysteine-rich domain array associated with balancing selection Genetic variation, not alternative splicing, responsible for protein isoforms Long, but not short, DMBT1 isoforms bind a cariogenic strain of Streptococcus mutans
Collapse
Affiliation(s)
- Adel F. Alharbi
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
- Medina Regional Laboratory, General Directorate of Health Affairs, Ministry of Health, Medina, Saudi Arabia
| | - Nongfei Sheng
- Department of Odontology, Umeå University, Umeå, Sweden
| | - Katie Nicol
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
| | | | - Edward J. Hollox
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
- Corresponding author
| |
Collapse
|
39
|
Stauber L, Croll D, Prospero S. Temporal changes in pathogen diversity in a perennial plant-pathogen-hyperparasite system. Mol Ecol 2022; 31:2073-2088. [PMID: 35122694 PMCID: PMC9540319 DOI: 10.1111/mec.16386] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 01/26/2022] [Accepted: 01/31/2022] [Indexed: 11/27/2022]
Abstract
Hyperparasites can affect the evolution of pathosystems by influencing the stability of both pathogen and host populations. However, how pathogens of perennial hosts evolve in the presence of a hyperparasite has rarely been studied. Here, we investigated temporal changes in genetic diversity of the invasive chestnut blight pathogen Cryphonectria parasitica in the presence of its parasitic mycovirus Cryphonectria hypovirus 1 (CHV1). The virus reduces fungal virulence and represents an effective natural biocontrol agent against chestnut blight in Europe. We analysed genome-wide diversity and CHV1 prevalence in C. parasitica populations in southern Switzerland that were sampled twice at an interval of about 30 years. Overall, we found that both pathogen population structure and CHV1 prevalence were retained over time. The results suggest that recent bottlenecks have influenced the structure of C. parasitica populations in southern Switzerland. Strong balancing selection signals were found at a single vegetative incompatibility (vic) locus, consistent with negative frequency-dependent selection imposed by the vegetative incompatibility system. High levels of mating among related individuals (i.e., inbreeding) and genetic drift are probably at the origin of imbalanced allele ratios at vic loci and subsequently low vc type diversity. Virus infection rates were stable at ~30% over the study period and we found no significant impact of the virus on fungal population diversity. Consequently, the efficacy of CHV1-mediated biocontrol was probably retained.
Collapse
Affiliation(s)
- Lea Stauber
- Swiss Federal Institute for Forest, Snow and Landscape Research (WSL)BirmensdorfSwitzerland
- Laboratory of Evolutionary GeneticsInstitute of BiologyUniversity of NeuchâtelNeuchâtelSwitzerland
- Department of Environmental SciencesUniversity of BaselBaselSwitzerland
| | - Daniel Croll
- Laboratory of Evolutionary GeneticsInstitute of BiologyUniversity of NeuchâtelNeuchâtelSwitzerland
| | - Simone Prospero
- Swiss Federal Institute for Forest, Snow and Landscape Research (WSL)BirmensdorfSwitzerland
| |
Collapse
|
40
|
Liu Y. Conservation prioritization based on past cascading climatic effects on genetic diversity and population size dynamics: Insights from a temperate tree species. DIVERS DISTRIB 2022. [DOI: 10.1111/ddi.13490] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Affiliation(s)
- Yang Liu
- Department of Forest and Conservation Sciences University of British Columbia Vancouver British Columbia Canada
- Department of Archaeology University of Cambridge Cambridge UK
| |
Collapse
|
41
|
Myers AN, Lawhon SD, Diesel AB, Bradley CW, Rodrigues Hoffmann A, Murphy WJ. An ancient haplotype containing antimicrobial peptide gene variants is associated with severe fungal skin disease in Persian cats. PLoS Genet 2022; 18:e1010062. [PMID: 35157719 PMCID: PMC8880935 DOI: 10.1371/journal.pgen.1010062] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 02/25/2022] [Accepted: 01/28/2022] [Indexed: 11/19/2022] Open
Abstract
Dermatophytosis, also known as ringworm, is a contagious fungal skin disease affecting humans and animals worldwide. Persian cats exhibit severe forms of the disease more commonly than other breeds of cat, including other long-haired breeds. Certain types of severe dermatophytosis in humans are reportedly caused by monogenic inborn errors of immunity. The goal of this study was to identify genetic variants in Persian cats contributing to the phenotype of severe dermatophytosis. Whole-genome sequencing of case and control Persian cats followed by a genome-wide association study identified a highly divergent, disease-associated haplotype on chromosome F1 containing the S100 family of genes. S100 calcium binding protein A9 (S100A9), which encodes a subunit of the antimicrobial heterodimer known as calprotectin, contained 13 nonsynonymous variants between cases and controls. Evolutionary analysis of S100A9 haplotypes comparing cases, controls, and wild felids suggested the divergent disease-associated haplotype was likely introgressed into the domestic cat lineage and maintained via balancing selection. We demonstrated marked upregulation of calprotectin expression in the feline epidermis during dermatophytosis, suggesting involvement in disease pathogenesis. Given this divergent allele has been maintained in domestic cat and wildcat populations, this haplotype may have beneficial effects against other pathogens. The pathogen specificity of this altered protein should be investigated before attempting to reduce the allele frequency in the Persian cat breed. Further work is needed to clarify if severe Persian dermatophytosis is a monogenic disease or if hidden disease-susceptibility loci remain to be discovered. Consideration should be given to engineering antimicrobial peptides such as calprotectin for topical treatment of dermatophytosis in humans and animals. Fungal skin infections known as ringworm or dermatophytosis affect billions of humans and animals worldwide. Normally the disease is self-limiting in affected individuals. The Persian cat breed is a popular breed known for its long hair coat and short nose as well as its propensity to develop severe, chronic dermatophytosis. By examining the genomes of Persian cats, we discovered that a specific region of DNA is highly altered between cats with and without severe dermatophytosis. The DNA sequence in this region is particularly divergent within a cluster of genes involved in immune defense against pathogens. Notably, alterations to the DNA sequence cause several changes in the antimicrobial protein known as calprotectin, which defends against pathogens in the skin of cats. Persian cats with severe dermatophytosis have a version of calprotectin similar to a version maintained by certain desert-dwelling wild felids such as sand cats and Asiatic wildcats. Therefore, we think this version of the protein is beneficial in some environments or against certain pathogens but not against the fungus that causes ringworm in cats. Our findings suggest changes to calprotectin may affect pathogen specificity and engineered calprotectin could be considered as a novel therapy for dermatophytosis in humans and animals.
Collapse
Affiliation(s)
- Alexandra N. Myers
- Department of Veterinary Pathobiology, College of Veterinary Medicine & Biomedical Sciences, Texas A&M University, College Station, Texas, Unites States of America
- * E-mail: (ANM); (WJM)
| | - Sara D. Lawhon
- Department of Veterinary Pathobiology, College of Veterinary Medicine & Biomedical Sciences, Texas A&M University, College Station, Texas, Unites States of America
| | - Alison B. Diesel
- Department of Small Animal Clinical Sciences, College of Veterinary Medicine & Biomedical Sciences, Texas A&M University, College Station, Texas, Unites States of America
| | - Charles W. Bradley
- Department of Pathobiology, School of Veterinary Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, Unites States of America
| | - Aline Rodrigues Hoffmann
- Department of Veterinary Pathobiology, College of Veterinary Medicine & Biomedical Sciences, Texas A&M University, College Station, Texas, Unites States of America
| | - William J. Murphy
- Department of Veterinary Integrative Biosciences, College of Veterinary Medicine & Biomedical Sciences, Texas A&M University, College Station, Texas, Unites States of America
- * E-mail: (ANM); (WJM)
| | | |
Collapse
|
42
|
Derks MFL, Steensma M. Review: Balancing Selection for Deleterious Alleles in Livestock. Front Genet 2021; 12:761728. [PMID: 34925454 PMCID: PMC8678120 DOI: 10.3389/fgene.2021.761728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 11/19/2021] [Indexed: 01/04/2023] Open
Abstract
Harmful alleles can be under balancing selection due to an interplay of artificial selection for the variant in heterozygotes and purifying selection against the variant in homozygotes. These pleiotropic variants can remain at moderate to high frequency expressing an advantage for favorable traits in heterozygotes, while harmful in homozygotes. The impact on the population and selection strength depends on the consequence of the variant both in heterozygotes and homozygotes. The deleterious phenotype expressed in homozygotes can range from early lethality to a slightly lower fitness in the population. In this review, we explore a range of causative variants under balancing selection including loss-of-function variation (i.e., frameshift, stop-gained variants) and regulatory variation (affecting gene expression). We report that harmful alleles often affect orthologous genes in different species, often influencing analogous traits. The recent discoveries are mainly driven by the increasing genomic and phenotypic resources in livestock populations. However, the low frequency and sometimes subtle effects in homozygotes prevent accurate mapping of such pleiotropic variants, which requires novel strategies to discover. After discovery, the selection strategy for deleterious variants under balancing selection is under debate, as variants can contribute to the heterosis effect in crossbred animals in various livestock species, compensating for the loss in purebred animals. Nevertheless, gene-assisted selection is a useful tool to decrease the frequency of the harmful allele in the population, if desired. Together, this review marks various deleterious variants under balancing selection and describing the functional consequences at the molecular, phenotypic, and population level, providing a resource for further study.
Collapse
Affiliation(s)
- Martijn F L Derks
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, Netherlands.,Topigs Norsvin Research Center, Beuningen, Netherlands
| | - Marije Steensma
- Animal Breeding and Genomics, Wageningen University and Research, Wageningen, Netherlands
| |
Collapse
|
43
|
Grace CA, Forrester S, Silva VC, Carvalho KSS, Kilford H, Chew YP, James S, Costa DL, Mottram JC, Costa CCHN, Jeffares DC. Candidates for Balancing Selection in Leishmania donovani Complex Parasites. Genome Biol Evol 2021; 13:6448231. [PMID: 34865011 PMCID: PMC8717319 DOI: 10.1093/gbe/evab265] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2021] [Indexed: 12/19/2022] Open
Abstract
The Leishmania donovani species complex is the causative agent of visceral leishmaniasis, which cause 20–40,000 fatalities a year. Here, we conduct a screen for balancing selection in this species complex. We used 384 publicly available L. donovani and L. infantum genomes, and sequence 93 isolates of L. infantum from Brazil to describe the global diversity of this species complex. We identify five genetically distinct populations that are sufficiently represented by genomic data to search for signatures of selection. We find that signals of balancing selection are generally not shared between populations, consistent with transient adaptive events, rather than long-term balancing selection. We then apply multiple diversity metrics to identify candidate genes with robust signatures of balancing selection, identifying a curated set of 24 genes with robust signatures. These include zeta toxin, nodulin-like, and flagellum attachment proteins. This study highlights the extent of genetic divergence between L. donovani complex parasites and provides genes for further study.
Collapse
Affiliation(s)
- Cooper Alastair Grace
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Sarah Forrester
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Vladimir Costa Silva
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Kátia Silene Sousa Carvalho
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Hannah Kilford
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Yen Peng Chew
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom.,Institute of Molecular Plant Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Sally James
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Dorcas L Costa
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Jeremy C Mottram
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| | - Carlos C H N Costa
- Instituto de Doenças do Sertão, Instituto de Doenças Tropicais Natan Portella, Centro de Ciências da Saúde da Universidade Federal do Piauí, Teresina-PI, Brazil
| | - Daniel C Jeffares
- Department of Biology, York Biomedical Research Institute, University of York, York, United Kingdom
| |
Collapse
|
44
|
Cheng X, DeGiorgio M. BalLeRMix +: mixture model approaches for robust joint identification of both positive selection and long-term balancing selection. Bioinformatics 2021; 38:861-863. [PMID: 34664624 PMCID: PMC8756184 DOI: 10.1093/bioinformatics/btab720] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 09/13/2021] [Accepted: 10/13/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY The growing availability of genomewide polymorphism data has fueled interest in detecting diverse selective processes affecting population diversity. However, no model-based approaches exist to jointly detect and distinguish the two complementary processes of balancing and positive selection. We extend the BalLeRMix B-statistic framework described in Cheng and DeGiorgio (2020) for detecting balancing selection and present BalLeRMix+, which implements five B statistic extensions based on mixture models to robustly identify both types of selection. BalLeRMix+ is implemented in Python and computes the composite likelihood ratios and associated model parameters for each genomic test position. AVAILABILITY AND IMPLEMENTATION BalLeRMix+ is freely available at https://github.com/bioXiaoheng/BallerMixPlus. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
45
|
Rendón-Anaya M, Wilson J, Sveinsson S, Fedorkov A, Cottrell J, Bailey MES, Ruņģis D, Lexer C, Jansson S, Robinson KM, Street NR, Ingvarsson PK. Adaptive introgression facilitate adaptation to high latitudes in European aspen (Populus tremula L.). Mol Biol Evol 2021; 38:5034-5050. [PMID: 34329481 PMCID: PMC8557470 DOI: 10.1093/molbev/msab229] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Understanding local adaptation has become a key research area given the ongoing climate challenge and the concomitant requirement to conserve genetic resources. Perennial plants, such as forest trees, are good models to study local adaptation given their wide geographic distribution, largely outcrossing mating systems, and demographic histories. We evaluated signatures of local adaptation in European aspen (Populus tremula) across Europe by means of whole-genome resequencing of a collection of 411 individual trees. We dissected admixture patterns between aspen lineages and observed a strong genomic mosaicism in Scandinavian trees, evidencing different colonization trajectories into the peninsula from Russia, Central and Western Europe. As a consequence of the secondary contacts between populations after the last glacial maximum, we detected an adaptive introgression event in a genome region of ∼500 kb in chromosome 10, harboring a large-effect locus that has previously been shown to contribute to adaptation to the short growing seasons characteristic of Northern Scandinavia. Demographic simulations and ancestry inference suggest an Eastern origin—probably Russian—of the adaptive Nordic allele which nowadays is present in a homozygous state at the north of Scandinavia. The strength of introgression and positive selection signatures in this region is a unique feature in the genome. Furthermore, we detected signals of balancing selection, shared across regional populations, that highlight the importance of standing variation as a primary source of alleles that facilitate local adaptation. Our results, therefore, emphasize the importance of migration–selection balance underlying the genetic architecture of key adaptive quantitative traits.
Collapse
Affiliation(s)
- Martha Rendón-Anaya
- Linnean Centre for Plant Biology, Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Science, Uppsala, Sweden
| | - Jonathan Wilson
- Linnean Centre for Plant Biology, Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Science, Uppsala, Sweden
| | | | - Aleksey Fedorkov
- Institute of Biology, Komi Science Center, Russian Academy of Sciences, Syktyvkar, Russia
| | - Joan Cottrell
- Forest Research, Northern Research Station, Roslin, UK
| | - Mark E S Bailey
- School of Life Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK
| | - Dainis Ruņģis
- Genetic Resource Centre, Latvian State Forest Research Institute "Silava", LV2169 Salaspils, Latvia
| | - Christian Lexer
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Stefan Jansson
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Kathryn M Robinson
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Nathaniel R Street
- Umeå Plant Science Centre, Department of Plant Physiology, Umeå University, Umeå, Sweden
| | - Pär K Ingvarsson
- Linnean Centre for Plant Biology, Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Science, Uppsala, Sweden
| |
Collapse
|
46
|
Bourgeois Y, Fields P, Bento G, Ebert D. Balancing selection for pathogen resistance reveals an intercontinental signature of Red Queen coevolution. Mol Biol Evol 2021; 38:4918-4933. [PMID: 34289047 PMCID: PMC8557431 DOI: 10.1093/molbev/msab217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The link between long-term host–parasite coevolution and genetic diversity is key to understanding genetic epidemiology and the evolution of resistance. The model of Red Queen host–parasite coevolution posits that high genetic diversity is maintained when rare host resistance variants have a selective advantage, which is believed to be the mechanistic basis for the extraordinarily high levels of diversity at disease-related genes such as the major histocompatibility complex in jawed vertebrates and R-genes in plants. The parasites that drive long-term coevolution are, however, often elusive. Here we present evidence for long-term balancing selection at the phenotypic (variation in resistance) and genomic (resistance locus) level in a particular host–parasite system: the planktonic crustacean Daphnia magna and the bacterium Pasteuria ramosa. The host shows widespread polymorphisms for pathogen resistance regardless of geographic distance, even though there is a clear genome-wide pattern of isolation by distance at other sites. In the genomic region of a previously identified resistance supergene, we observed consistent molecular signals of balancing selection, including higher genetic diversity, older coalescence times, and lower differentiation between populations, which set this region apart from the rest of the genome. We propose that specific long-term coevolution by negative-frequency-dependent selection drives this elevated diversity at the host's resistance loci on an intercontinental scale and provide an example of a direct link between the host’s resistance to a virulent pathogen and the large-scale diversity of its underlying genes.
Collapse
Affiliation(s)
- Yann Bourgeois
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Peter Fields
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Gilberto Bento
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Dieter Ebert
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| |
Collapse
|
47
|
de Mattos Barbosa MG, Lefferts AR, Huynh D, Liu H, Zhang Y, Fu B, Barnes J, Samaniego M, Bram RJ, Geha R, Shikanov A, Luning Prak ET, Farkash EA, Platt JL, Cascalho M. TNFRSF13B genotypes control immune-mediated pathology by regulating the functions of innate B cells. JCI Insight 2021; 6:e150483. [PMID: 34283811 PMCID: PMC8492324 DOI: 10.1172/jci.insight.150483] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/14/2021] [Indexed: 11/20/2022] Open
Abstract
Host genes define the severity of inflammation and immunity but specific loci doing so are unknown. Here we show that TNF receptor superfamily member 13B (TNFRSF13B) variants, which enhance defense against certain pathogens, also control immune-mediated injury of transplants, by regulating innate B cells’ functions. Analysis of TNFRSF13B in human kidney transplant recipients revealed that 33% of those with antibody-mediated rejection (AMR) but fewer than 6% of those with stable graft function had TNFRSF13B missense mutations. To explore mechanisms underlying aggressive immune responses, we investigated alloimmunity and rejection in mice. Cardiac allografts in Tnfrsf13b-mutant mice underwent early and severe AMR. The dominance and precocity of AMR in Tnfrsf13b-deficient mice were not caused by increased alloantibodies. Rather, Tnfrsf13b mutations decreased “natural” IgM and compromised complement regulation, leading to complement deposition in allografted hearts and autogenous kidneys. Thus, WT TNFRSF13B and Tnfrsf13b support innate B cell functions that limit complement-associated inflammation; in contrast, common variants of these genes intensify inflammatory responses that help clear microbial infections but allow inadvertent tissue injury to ensue. The wide variation in inflammatory reactions associated with TNFRSF13B diversity suggests polymorphisms could underlie variation in host defense and explosive inflammatory responses that sometimes enhance morbidity associated with immune responses.
Collapse
Affiliation(s)
| | - Adam R Lefferts
- Department of Surgery, University of Michigan, Ann Arbor, United States of America
| | - Daniel Huynh
- Department of Surgery, University of Michigan, Ann Arbor, United States of America
| | - Hui Liu
- Department of Surgery, University of Michigan, Ann Arbor, United States of America
| | - Yu Zhang
- Department of Surgery, University of Michigan, Ann Arbor, United States of America
| | - Beverly Fu
- Department of Surgery, University of Michigan, Ann Arbor, United States of America
| | - Jenna Barnes
- Department of Pathology, University of Michigan, Ann Arbor, United States of America
| | - Milagros Samaniego
- Department of Medicine, University of Michigan, Ann Arbor, United States of America
| | - Richard J Bram
- Department of Pediatric and Adolescent Medicine, Mayo Clinic/Foundation, Rochester, United States of America
| | - Raif Geha
- Division of Immunology, Department of Pediatrics, Harvard Medical School, Boston, United States of America
| | - Ariella Shikanov
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, United States of America
| | - Eline T Luning Prak
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, United States of America
| | - Evan A Farkash
- Department of Pathology, University of Michigan, Ann Arbor, United States of America
| | - Jeffrey L Platt
- Transplantation Biology, University of Michigan, Ann Arbor, United States of America
| | - Marilia Cascalho
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, United States of America
| |
Collapse
|
48
|
Hollox EJ, Zuccherato LW, Tucci S. Genome structural variation in human evolution. Trends Genet 2021; 38:45-58. [PMID: 34284881 DOI: 10.1016/j.tig.2021.06.015] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 06/21/2021] [Accepted: 06/22/2021] [Indexed: 01/01/2023]
Abstract
Structural variation (SV) is a large difference (typically >100 bp) in the genomic structure of two genomes and includes both copy number variation and variation that does not change copy number of a genomic region, such as an inversion. Improved reference genomes, combined with widespread genome sequencing using short-read sequencing technology, and increasingly using long-read sequencing, have reignited interest in SV. Recent large-scale studies and functional focused analyses have highlighted the role of SV in human evolution. In this review, we highlight human-specific SVs involved in changes in the brain, population-specific SVs that affect response to the environment, including adaptation to diet and infectious diseases, and summarise the contribution of archaic hominin admixture to present-day human SV.
Collapse
Affiliation(s)
- Edward J Hollox
- Department of Genetics and Genome Biology, University of Leicester, UK.
| | - Luciana W Zuccherato
- Núcleo de Ensino e Pesquisa, Instituto Mário Penna, Belo Horizonte, Brazil; Departmento de Bioquímica e Imunologia, Universidade de Minas Gerais, Belo Horizonte, Brazil
| | - Serena Tucci
- Department of Anthropology, Yale University, New Haven, CT, USA
| |
Collapse
|
49
|
Liu Q, Mishra M, Saxena AS, Wu H, Qiu Y, Zhang X, You X, Ding S, Miyamoto MM. Balancing selection maintains ancient polymorphisms at conserved enhancers for the olfactory receptor genes of a Chinese marine fish. Mol Ecol 2021; 30:4023-4038. [PMID: 34107131 DOI: 10.1111/mec.16016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 05/10/2021] [Accepted: 06/01/2021] [Indexed: 12/22/2022]
Abstract
The study of balancing selection, as a selective force maintaining adaptive genetic variation in gene pools longer than expected by drift, is currently experiencing renewed interest due to the increased availability of new data, methods of analysis, and case studies. In this investigation, evidence of balancing selection operating on conserved enhancers of the olfactory receptor (OR) genes is presented for the Chinese sleeper (Bostrychus sinensis), a coastal marine fish that is emerging as a model species for evolutionary studies in the Northwest Pacific marginal seas. Coupled with tests for Gene Ontology enrichment and transcription factor binding, population genomic data allow for the identification of an OR cluster in the sleeper with a downstream flanking region containing three enhancers that are conserved with human and other fish species. Phylogenetic and population genetic analyses indicate that the enhancers are under balancing selection as evidenced by their translineage polymorphisms, excess common alleles, and increased within-group diversities. Age comparisons between the translineage polymorphisms and most recent common ancestors of neutral genealogies substantiate that the former are old, and thus, due to ancient balancing selection. The survival and reproduction of vertebrates depend on their sense of smell, and thereby, on their ORs. In addition to locus duplication and allelic variation of structural genes, this study highlights a third mechanism by which receptor diversity can be achieved for detecting and responding to the huge variety of environmental odorants (i.e., by balancing selection acting on OR gene expression through their enhancer variability).
Collapse
Affiliation(s)
- Qiaohong Liu
- Xiamen Key Laboratory of Urban Sea Ecological Conservation and Restoration, College of Ocean and Earth Sciences, Xiamen University, Xiamen, China.,Function Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Mrinal Mishra
- Department of Biology, University of Florida, Gainesville, FL, USA
| | - Ayush S Saxena
- Department of Biology, University of Florida, Gainesville, FL, USA
| | - Haohao Wu
- Xiamen Key Laboratory of Urban Sea Ecological Conservation and Restoration, College of Ocean and Earth Sciences, Xiamen University, Xiamen, China.,Function Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Ying Qiu
- Shenzhen Key Laboratory of Marine Genomics, Guangdong Provincial Key Laboratory of Molecular Breeding in Marine Economic Animals, BGI Academy of Sciences, BGI Marine, Shenzhen, China
| | - Xinhui Zhang
- Shenzhen Key Laboratory of Marine Genomics, Guangdong Provincial Key Laboratory of Molecular Breeding in Marine Economic Animals, BGI Academy of Sciences, BGI Marine, Shenzhen, China
| | - Xinxin You
- Shenzhen Key Laboratory of Marine Genomics, Guangdong Provincial Key Laboratory of Molecular Breeding in Marine Economic Animals, BGI Academy of Sciences, BGI Marine, Shenzhen, China
| | - Shaoxiong Ding
- Xiamen Key Laboratory of Urban Sea Ecological Conservation and Restoration, College of Ocean and Earth Sciences, Xiamen University, Xiamen, China.,Function Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | | |
Collapse
|
50
|
Lee D, Zdraljevic S, Stevens L, Wang Y, Tanny RE, Crombie TA, Cook DE, Webster AK, Chirakar R, Baugh LR, Sterken MG, Braendle C, Félix MA, Rockman MV, Andersen EC. Balancing selection maintains hyper-divergent haplotypes in Caenorhabditis elegans. Nat Ecol Evol 2021; 5:794-807. [PMID: 33820969 PMCID: PMC8202730 DOI: 10.1038/s41559-021-01435-x] [Citation(s) in RCA: 61] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 02/26/2021] [Indexed: 12/16/2022]
Abstract
Across diverse taxa, selfing species have evolved independently from outcrossing species thousands of times. The transition from outcrossing to selfing decreases the effective population size, effective recombination rate and heterozygosity within a species. These changes lead to a reduction in genetic diversity, and therefore adaptive potential, by intensifying the effects of random genetic drift and linked selection. Within the nematode genus Caenorhabditis, selfing has evolved at least three times, and all three species, including the model organism Caenorhabditis elegans, show substantially reduced genetic diversity relative to outcrossing species. Selfing and outcrossing Caenorhabditis species are often found in the same niches, but we still do not know how selfing species with limited genetic diversity can adapt to these environments. Here, we examine the whole-genome sequences from 609 wild C. elegans strains isolated worldwide and show that genetic variation is concentrated in punctuated hyper-divergent regions that cover 20% of the C. elegans reference genome. These regions are enriched in environmental response genes that mediate sensory perception, pathogen response and xenobiotic stress response. Population genomic evidence suggests that genetic diversity in these regions has been maintained by long-term balancing selection. Using long-read genome assemblies for 15 wild strains, we show that hyper-divergent haplotypes contain unique sets of genes and show levels of divergence comparable to levels found between Caenorhabditis species that diverged millions of years ago. These results provide an example of how species can avoid the evolutionary dead end associated with selfing.
Collapse
Affiliation(s)
- Daehan Lee
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Stefan Zdraljevic
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
- Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, IL, USA
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Howard Hughes Medical Institute, University of California, Los Angeles, CA, USA
| | - Lewis Stevens
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Ye Wang
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
- Sichuan Key Laboratory of Conservation Biology on Endangered Wildlife, Chengdu Research Base of Giant Panda Breeding, Chengdu, People's Republic of China
| | - Robyn E Tanny
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Timothy A Crombie
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Daniel E Cook
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA
| | - Amy K Webster
- Department of Biology, Duke University, Durham, NC, USA
- University Program in Genetics and Genomics, Duke University, Durham, NC, USA
| | | | - L Ryan Baugh
- Department of Biology, Duke University, Durham, NC, USA
- Center for Genomic and Computational Biology, Duke University, Durham, NC, USA
| | - Mark G Sterken
- Laboratory of Nematology, Wageningen University and Research, Wageningen, the Netherlands
| | | | - Marie-Anne Félix
- Institut de Biologie de l'Ecole Normale Supérieure, Centre National de la Recherche Scientifique, INSERM, École Normale Supérieure, Paris Sciences et Lettres, Paris, France
| | - Matthew V Rockman
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Erik C Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, USA.
| |
Collapse
|