1
|
Tagore D, Akey JM. Archaic hominin admixture and its consequences for modern humans. Curr Opin Genet Dev 2025; 90:102280. [PMID: 39577372 PMCID: PMC11770379 DOI: 10.1016/j.gde.2024.102280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 10/18/2024] [Accepted: 10/22/2024] [Indexed: 11/24/2024]
Abstract
As anatomically modern humans dispersed out of Africa, they encountered and mated with now extinct hominins, including Neanderthals and Denisovans. It is now well established that all non-African individuals derive approximately 2% of their genome from Neanderthal ancestors and individuals of Melanesian and Australian aboriginal ancestry inherited an additional 2%-5% of their genomes from Denisovan ancestors. Attention has started to shift from documenting amounts of archaic admixture and identifying introgressed segments to understanding their molecular, phenotypic, and evolutionary consequences and refining models of human history. Here, we review recent insights into admixture between modern and archaic humans, emphasizing methodological innovations and the functional and phenotypic effects Neanderthal and Denisovan sequences have in contemporary individuals.
Collapse
Affiliation(s)
- Debashree Tagore
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton 08540, USA. https://twitter.com/@TagoreDebashree
| | - Joshua M Akey
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton 08540, USA.
| |
Collapse
|
2
|
Betschart RO, Riccio C, Aguilera-Garcia D, Blankenberg S, Guo L, Moch H, Seidl D, Solleder H, Thalén F, Thiéry A, Twerenbold R, Zeller T, Zoche M, Ziegler A. Biostatistical Aspects of Whole Genome Sequencing Studies: Preprocessing and Quality Control. Biom J 2024; 66:e202300278. [PMID: 38988195 DOI: 10.1002/bimj.202300278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 03/21/2024] [Accepted: 05/14/2024] [Indexed: 07/12/2024]
Abstract
Rapid advances in high-throughput DNA sequencing technologies have enabled large-scale whole genome sequencing (WGS) studies. Before performing association analysis between phenotypes and genotypes, preprocessing and quality control (QC) of the raw sequence data need to be performed. Because many biostatisticians have not been working with WGS data so far, we first sketch Illumina's short-read sequencing technology. Second, we explain the general preprocessing pipeline for WGS studies. Third, we provide an overview of important QC metrics, which are applied to WGS data: on the raw data, after mapping and alignment, after variant calling, and after multisample variant calling. Fourth, we illustrate the QC with the data from the GENEtic SequencIng Study Hamburg-Davos (GENESIS-HD), a study involving more than 9000 human whole genomes. All samples were sequenced on an Illumina NovaSeq 6000 with an average coverage of 35× using a PCR-free protocol. For QC, one genome in a bottle (GIAB) trio was sequenced in four replicates, and one GIAB sample was successfully sequenced 70 times in different runs. Fifth, we provide empirical data on the compression of raw data using the DRAGEN original read archive (ORA). The most important quality metrics in the application were genetic similarity, sample cross-contamination, deviations from the expected Het/Hom ratio, relatedness, and coverage. The compression ratio of the raw files using DRAGEN ORA was 5.6:1, and compression time was linear by genome coverage. In summary, the preprocessing, joint calling, and QC of large WGS studies are feasible within a reasonable time, and efficient QC procedures are readily available.
Collapse
Affiliation(s)
| | | | - Domingo Aguilera-Garcia
- Institute of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland
| | - Stefan Blankenberg
- Cardio-CARE, Medizincampus Davos, Davos, Switzerland
- Department of Cardiology, University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- Center for Population Health Innovation (POINT), University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Linlin Guo
- Department of Cardiology, University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Holger Moch
- Institute of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland
| | - Dagmar Seidl
- Institute of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland
| | - Hugo Solleder
- Cardio-CARE, Medizincampus Davos, Davos, Switzerland
| | - Felix Thalén
- Cardio-CARE, Medizincampus Davos, Davos, Switzerland
| | | | - Raphael Twerenbold
- Department of Cardiology, University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- Center for Population Health Innovation (POINT), University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- German Center for Cardiovascular Research (DZHK), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
| | - Tanja Zeller
- Department of Cardiology, University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- Center for Population Health Innovation (POINT), University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- German Center for Cardiovascular Research (DZHK), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
| | - Martin Zoche
- Institute of Pathology and Molecular Pathology, University Hospital Zurich, Zurich, Switzerland
| | - Andreas Ziegler
- Cardio-CARE, Medizincampus Davos, Davos, Switzerland
- Department of Cardiology, University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- Center for Population Health Innovation (POINT), University Heart and Vascular Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
- School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa
| |
Collapse
|
3
|
Shpak M, Lawrence KN, Pool JE. The Precision and Power of Population Branch Statistics in Identifying the Genomic Signatures of Local Adaptation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.14.594139. [PMID: 38798330 PMCID: PMC11118325 DOI: 10.1101/2024.05.14.594139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Population branch statistics, which estimate the branch lengths of focal populations with respect to two outgroups, have been used as an alternative to FST-based genome-wide scans for identifying loci associated with local selective sweeps. In addition to the original population branch statistic (PBS), there are subsequently proposed branch rescalings: normalized population branch statistic (PBSn1), which adjusts focal branch length with respect to outgroup branch lengths at the same locus, and population branch excess (PBE), which also incorporates median branch lengths at other loci. PBSn1 and PBE have been proposed to be less sensitive to allele frequency divergence generated by background selection or geographically ubiquitous positive selection rather than local selective sweeps. However, the accuracy and statistical power of branch statistics have not been systematically assessed. To do so, we simulate genomes in representative large and small populations with varying proportions of sites evolving under genetic drift or background selection (approximated using variable Ne), local selective sweeps, and geographically parallel selective sweeps. We then assess the probability that local selective sweep loci are correctly identified as outliers by FST and by each of the branch statistics. We find that branch statistics consistently outperform FST at identifying local sweeps. When background selection and/or parallel sweeps are introduced, PBSn1 and especially PBE correctly identify local sweeps among their top outliers at a higher frequency than PBS. These results validate the greater specificity of rescaled branch statistics such as PBE to detect population-specific positive selection, supporting their use in genomic studies focused on local adaptation.
Collapse
Affiliation(s)
- Max Shpak
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, WI, USA
| | - Kadee N. Lawrence
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, WI, USA
| | - John E. Pool
- Laboratory of Genetics, University of Wisconsin–Madison, Madison, WI, USA
| |
Collapse
|
4
|
Riley R, Mathieson I, Mathieson S. Interpreting generative adversarial networks to infer natural selection from genetic data. Genetics 2024; 226:iyae024. [PMID: 38386895 PMCID: PMC10990424 DOI: 10.1093/genetics/iyae024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 01/15/2024] [Accepted: 01/19/2024] [Indexed: 02/24/2024] Open
Abstract
Understanding natural selection and other forms of non-neutrality is a major focus for the use of machine learning in population genetics. Existing methods rely on computationally intensive simulated training data. Unlike efficient neutral coalescent simulations for demographic inference, realistic simulations of selection typically require slow forward simulations. Because there are many possible modes of selection, a high dimensional parameter space must be explored, with no guarantee that the simulated models are close to the real processes. Finally, it is difficult to interpret trained neural networks, leading to a lack of understanding about what features contribute to classification. Here we develop a new approach to detect selection and other local evolutionary processes that requires relatively few selection simulations during training. We build upon a generative adversarial network trained to simulate realistic neutral data. This consists of a generator (fitted demographic model), and a discriminator (convolutional neural network) that predicts whether a genomic region is real or fake. As the generator can only generate data under neutral demographic processes, regions of real data that the discriminator recognizes as having a high probability of being "real" do not fit the neutral demographic model and are therefore candidates for targets of selection. To incentivize identification of a specific mode of selection, we fine-tune the discriminator with a small number of custom non-neutral simulations. We show that this approach has high power to detect various forms of selection in simulations, and that it finds regions under positive selection identified by state-of-the-art population genetic methods in three human populations. Finally, we show how to interpret the trained networks by clustering hidden units of the discriminator based on their correlation patterns with known summary statistics.
Collapse
Affiliation(s)
- Rebecca Riley
- Department of Computer Science, Haverford College, Haverford, PA 19041, USA
| | - Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sara Mathieson
- Department of Computer Science, Haverford College, Haverford, PA 19041, USA
| |
Collapse
|
5
|
Alipanah M, Mazloom SM, Gharari F. Detection of selective sweep in European wild sheep breeds. 3 Biotech 2024; 14:122. [PMID: 38560387 PMCID: PMC10978567 DOI: 10.1007/s13205-024-03964-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 02/22/2024] [Indexed: 04/04/2024] Open
Abstract
In wild animal populations, there is a differentiation between populations due to natural selection. The direction and pressure of natural selection in the wild sheep are different in the various geographic areas. Linkage disequilibrium studies showed that regions of the genome in whole wild sheep are under natural selection and that natural selection can affect immune or reproductive or metabolic traits. The study aimed to identify genomic regions under natural selection in wild sheep. For this purpose, the genetic information of 24 European wild sheep and 24 Sardinian wild sheep was used. The genotypes were determined using Illumina 50 K SNPChip arrays based on Oar_4.0 version of the sheep genome. After quality control steps, finally, 31,560 SNP markers were analyzed. The value of LD was calculated by calculating the r2 statistic between all pairs of locations through PLINK software. To identify signs of selection based on linkage disequilibrium methods, an extended haplotype homozygosity test of XP-EHH crossing population and iHS intrapopulation was used. The results of iHS studies showed that in European and Sardinian wild sheep, the highest iHS coefficient under natural selection was observed on 3 and 2 chromosome numbers, respectively. Also, the results of XP-EHH studies showed that the largest XP-EHH coefficients under natural selection in European wild sheep compared to Sardinian and vice versa in Sardinian wild sheep compared to European wild sheep were observed on 3 and 16 chromosome numbers, respectively. In addition, the results of gene cycle studies showed that COPB1, SEC24D, ZDHHC17, BBS4, RFX3, SLC26A8, CAMK2D, GRIA1, GRM1, GRID2, PPP2R1A, CPEB4, PLEKHA5 and KIF13A, VPS39, VPS53, DTNBP1, DYNC1I1, FAM91A genes are under natural selection in Sardinian and European wild sheeps, respectively. The direction and selection pressure of natural selection in the two breeds of wild sheep is different due to different geographic conditions.
Collapse
Affiliation(s)
- Masoud Alipanah
- Department of Plant Production, University of Torbat Heydarieh, Torbat Heydarieh, 9516168595 Iran
| | - Seyed Mostafa Mazloom
- Department of Animal Science, Ferdowsi University of Mashhad, Mashhad, 9177948974 Iran
| | - Faezeh Gharari
- Department of Plant Production, University of Torbat Heydarieh, Torbat Heydarieh, 9516168595 Iran
- Department of Animal Science, Ferdowsi University of Mashhad, Mashhad, 9177948974 Iran
| |
Collapse
|
6
|
Lamkey CM, Lorenz AJ. A genomic analysis of the University of Nebraska Replicated Recurrent Selection program. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:243. [PMID: 37950832 DOI: 10.1007/s00122-023-04475-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 09/27/2023] [Indexed: 11/13/2023]
Abstract
The inbred-hybrid system of maize breeding closely resembles a reciprocal full-sib (RFS) selection program. Studying changes in genetic variation as a result of RFS selection can help illuminate long-standing questions regarding the relative roles of selection and genetic drift and help understand the nature of adaptation occurring in selection programs. The University of Nebraska-Lincoln Replicated Recurrent Selection (UNL-RpRS) program underwent eight cycles of replicated RFS and S1-progeny selection, making it a powerful system to study genomic changes accompanying selection for inter-population performance. The objectives of this study were to identify regions of the genome under selection after eight cycles of selection and evaluate the effect eight cycles of selection for inter-population full-sib performance had in expanding genome-wide and localized population structure. We address these questions with a large set of individuals sampled from the UNL-RpRS program with dense genotyping-by-sequence data. We found evidence of parallel selection signatures in the UNL-RpRS program, with a region on chromosome 7 being implicated in three of the four selection systems studied. Regions that displayed selection signatures across independently run selection programs represent regions likely to be capitalizing on standing genetic variation and support a soft sweep model of adaptation. We did not find selection to be a strong force in diverging populations undergoing RFS. This could be due to the nature of adaptation occurring in these populations, underlying gene action, or a result of unstable genetic topographies.
Collapse
Affiliation(s)
- Collin M Lamkey
- Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE, 68583, USA
- Corteva Agriscience, 19456 Hwy 22, Mankato, MN, 56001, USA
| | - Aaron J Lorenz
- Department of Agronomy and Horticulture, University of Nebraska, Lincoln, NE, 68583, USA.
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA.
| |
Collapse
|
7
|
Sun KY, Bai X, Chen S, Bao S, Kapoor M, Zhang C, Backman J, Joseph T, Maxwell E, Mitra G, Gorovits A, Mansfield A, Boutkov B, Gokhale S, Habegger L, Marcketta A, Locke A, Kessler MD, Sharma D, Staples J, Bovijn J, Gelfman S, Gioia AD, Rajagopal V, Lopez A, Varela JR, Alegre J, Berumen J, Tapia-Conyer R, Kuri-Morales P, Torres J, Emberson J, Collins R, Cantor M, Thornton T, Kang HM, Overton J, Shuldiner AR, Cremona ML, Nafde M, Baras A, Abecasis G, Marchini J, Reid JG, Salerno W, Balasubramanian S. A deep catalog of protein-coding variation in 985,830 individuals. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.09.539329. [PMID: 37214792 PMCID: PMC10197621 DOI: 10.1101/2023.05.09.539329] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Coding variants that have significant impact on function can provide insights into the biology of a gene but are typically rare in the population. Identifying and ascertaining the frequency of such rare variants requires very large sample sizes. Here, we present the largest catalog of human protein-coding variation to date, derived from exome sequencing of 985,830 individuals of diverse ancestry to serve as a rich resource for studying rare coding variants. Individuals of African, Admixed American, East Asian, Middle Eastern, and South Asian ancestry account for 20% of this Exome dataset. Our catalog of variants includes approximately 10.5 million missense (54% novel) and 1.1 million predicted loss-of-function (pLOF) variants (65% novel, 53% observed only once). We identified individuals with rare homozygous pLOF variants in 4,874 genes, and for 1,838 of these this work is the first to document at least one pLOF homozygote. Additional insights from the RGC-ME dataset include 1) improved estimates of selection against heterozygous loss-of-function and identification of 3,459 genes intolerant to loss-of-function, 83 of which were previously assessed as tolerant to loss-of-function and 1,241 that lack disease annotations; 2) identification of regions depleted of missense variation in 457 genes that are tolerant to loss-of-function; 3) functional interpretation for 10,708 variants of unknown or conflicting significance reported in ClinVar as cryptic splice sites using splicing score thresholds based on empirical variant deleteriousness scores derived from RGC-ME; and 4) an observation that approximately 3% of sequenced individuals carry a clinically actionable genetic variant in the ACMG SF 3.1 list of genes. We make this important resource of coding variation available to the public through a variant allele frequency browser. We anticipate that this report and the RGC-ME dataset will serve as a valuable reference for understanding rare coding variation and help advance precision medicine efforts.
Collapse
Affiliation(s)
| | | | - Siying Chen
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Suying Bao
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | - Adam Locke
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | | | | | | | | | | - Jesus Alegre
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Jaime Berumen
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Roberto Tapia-Conyer
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Pablo Kuri-Morales
- Experimental Research Unit from the Faculty of Medicine (UIME), National Autonomous University of Mexico (UNAM)
| | - Jason Torres
- Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Jonathan Emberson
- Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- MRC Population Health Research Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Rory Collins
- Clinical Trial Service Unit & Epidemiological Studies Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | | | | | | | | | | | | | | | | | - Mona Nafde
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Aris Baras
- Regeneron Genetics Center, Tarrytown, NY, USA
| | | | | | | | | | | |
Collapse
|
8
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. Evolution 2023; 77:2113-2127. [PMID: 37395482 PMCID: PMC10547124 DOI: 10.1093/evolut/qpad120] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 06/15/2023] [Accepted: 06/30/2023] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modeled by a realistic mutation rate and as part of a realistic distribution of fitness effects, as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modeled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false-positive rates are in excess of true-positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| |
Collapse
|
9
|
Roca-Umbert A, Garcia-Calleja J, Vogel-González M, Fierro-Villegas A, Ill-Raga G, Herrera-Fernández V, Bosnjak A, Muntané G, Gutiérrez E, Campelo F, Vicente R, Bosch E. Human genetic adaptation related to cellular zinc homeostasis. PLoS Genet 2023; 19:e1010950. [PMID: 37747921 PMCID: PMC10553801 DOI: 10.1371/journal.pgen.1010950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 10/05/2023] [Accepted: 08/31/2023] [Indexed: 09/27/2023] Open
Abstract
SLC30A9 encodes a ubiquitously zinc transporter (ZnT9) and has been consistently suggested as a candidate for positive selection in humans. However, no direct adaptive molecular phenotype has been demonstrated. Our results provide evidence for directional selection operating in two major complementary haplotypes in Africa and East Asia. These haplotypes are associated with differential gene expression but also differ in the Met50Val substitution (rs1047626) in ZnT9, which we show is found in homozygosis in the Denisovan genome and displays accompanying signatures suggestive of archaic introgression. Although we found no significant differences in systemic zinc content between individuals with different rs1047626 genotypes, we demonstrate that the expression of the derived isoform (ZnT9 50Val) in HEK293 cells shows a gain of function when compared with the ancestral (ZnT9 50Met) variant. Notably, the ZnT9 50Val variant was found associated with differences in zinc handling by the mitochondria and endoplasmic reticulum, with an impact on mitochondrial metabolism. Given the essential role of the mitochondria in skeletal muscle and since the derived allele at rs1047626 is known to be associated with greater susceptibility to several neuropsychiatric traits, we propose that adaptation to cold may have driven this selection event, while also impacting predisposition to neuropsychiatric disorders in modern humans.
Collapse
Affiliation(s)
- Ana Roca-Umbert
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
| | - Jorge Garcia-Calleja
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
| | - Marina Vogel-González
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Alejandro Fierro-Villegas
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Gerard Ill-Raga
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Víctor Herrera-Fernández
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Anja Bosnjak
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Gerard Muntané
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
- Hospital Universitari Institut Pere Mata, IISPV, Universitat Rovira i Virgili, Reus, Spain
- Centro de Investigación Biomédica en Red de Salud Mental, Instituto de Salud Carlos III, Madrid, Spain
| | - Esteban Gutiérrez
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Felix Campelo
- ICFO-Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Rubén Vicente
- Laboratory of Molecular Physiology, Department of Medicine and Life Sciences (MELIS), Universitat Pompeu Fabra, Barcelona, Spain
| | - Elena Bosch
- Institut de Biologia Evolutiva (UPF-CSIC), Departament de Medicina i Ciències de la Vida, Universitat Pompeu Fabra, Parc de Recerca Biomèdica de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Salud Mental, Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
10
|
Riley R, Mathieson I, Mathieson S. INTERPRETING GENERATIVE ADVERSARIAL NETWORKS TO INFER NATURAL SELECTION FROM GENETIC DATA. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.07.531546. [PMID: 36945387 PMCID: PMC10028936 DOI: 10.1101/2023.03.07.531546] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/10/2023]
Abstract
Understanding natural selection in humans and other species is a major focus for the use of machine learning in population genetics. Existing methods rely on computationally intensive simulated training data. Unlike efficient neutral coalescent simulations for demographic inference, realistic simulations of selection typically requires slow forward simulations. Because there are many possible modes of selection, a high dimensional parameter space must be explored, with no guarantee that the simulated models are close to the real processes. Mismatches between simulated training data and real test data can lead to incorrect inference. Finally, it is difficult to interpret trained neural networks, leading to a lack of understanding about what features contribute to classification. Here we develop a new approach to detect selection that requires relatively few selection simulations during training. We use a Generative Adversarial Network (GAN) trained to simulate realistic neutral data. The resulting GAN consists of a generator (fitted demographic model) and a discriminator (convolutional neural network). For a genomic region, the discriminator predicts whether it is "real" or "fake" in the sense that it could have been simulated by the generator. As the "real" training data includes regions that experienced selection and the generator cannot produce such regions, regions with a high probability of being real are likely to have experienced selection. To further incentivize this behavior, we "fine-tune" the discriminator with a small number of selection simulations. We show that this approach has high power to detect selection in simulations, and that it finds regions under selection identified by state-of-the art population genetic methods in three human populations. Finally, we show how to interpret the trained networks by clustering hidden units of the discriminator based on their correlation patterns with known summary statistics. In summary, our approach is a novel, efficient, and powerful way to use machine learning to detect natural selection.
Collapse
Affiliation(s)
- Rebecca Riley
- Department of Computer Science, Haverford College, Haverford PA, 19041 USA
| | - Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia PA, 19104 USA
| | - Sara Mathieson
- Department of Computer Science, Haverford College, Haverford PA, 19041 USA
| |
Collapse
|
11
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.15.545166. [PMID: 37398347 PMCID: PMC10312679 DOI: 10.1101/2023.06.15.545166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modelled by a realistic mutation rate and as part of a realistic distribution of fitness effects (DFE), as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modelled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false positive rates are in excess of true positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong. Teaser Text Outlier-based genomic scans have proven a popular approach for identifying loci that have potentially experienced recent positive selection. However, it has previously been shown that an evolutionarily appropriate baseline model that incorporates non-equilibrium population histories, purifying and background selection, and variation in mutation and recombination rates is necessary to reduce often extreme false positive rates when performing genomic scans. Here we evaluate the power to detect recurrent selective sweeps using common SFS-based and haplotype-based methods under these increasingly realistic models. We find that while these appropriate evolutionary baselines are essential to reduce false positive rates, the power to accurately detect recurrent selective sweeps is generally low across much of the biologically relevant parameter space.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Present address: Department of Biology, Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | | |
Collapse
|
12
|
Garcia OA, Arslanian K, Whorf D, Thariath S, Shriver M, Li JZ, Bigham AW. The Legacy of Infectious Disease Exposure on the Genomic Diversity of Indigenous Southern Mexicans. Genome Biol Evol 2023; 15:7023365. [PMID: 36726304 PMCID: PMC10016042 DOI: 10.1093/gbe/evad015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Revised: 12/19/2022] [Accepted: 01/25/2023] [Indexed: 02/03/2023] Open
Abstract
To characterize host risk factors for infectious disease in Mesoamerican populations, we interrogated 857,481 SNPs assayed using the Affymetrix 6.0 genotyping array for signatures of natural selection in immune response genes. We applied three statistical tests to identify signatures of natural selection: locus-specific branch length (LSBL), the cross-population extended haplotype homozygosity (XP-EHH), and the integrated haplotype score (iHS). Each of the haplotype tests (XP-EHH and iHS) were paired with LSBL and significance was determined at the 1% level. For the paired analyses, we identified 95 statistically significant windows for XP-EHH/LSBL and 63 statistically significant windows for iHS/LSBL. Among our top immune response loci, we found evidence of recent directional selection associated with the major histocompatibility complex (MHC) and the peroxisome proliferator-activated receptor gamma (PPAR-γ) signaling pathway. These findings illustrate that Mesoamerican populations' immunity has been shaped by exposure to infectious disease. As targets of selection, these variants are likely to encode phenotypes that manifest themselves physiologically and therefore may contribute to population-level variation in immune response. Our results shed light on past selective events influencing the host response to modern diseases, both pathogenic infection as well as autoimmune disorders.
Collapse
Affiliation(s)
- Obed A Garcia
- Department of Anthropology, University of Michigan, Ann Arbor, Michigan.,Department of Biomedical Data Science, Stanford University, Stanford, California
| | | | - Daniel Whorf
- College of Medicine, University of Illinois, Peoria, Illinois
| | - Serena Thariath
- Department of Anthropology, University of Tennessee, Knoxville, Tennessee
| | - Mark Shriver
- Department of Anthropology, Penn State University, State College, Pennsylvania
| | - Jun Z Li
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan
| | - Abigail W Bigham
- Department of Anthropology, University of California, Los Angeles, California
| |
Collapse
|
13
|
Zhong L, Zhu Y, Olsen KM. Wild progenitors provide a sound baseline model for evolutionary analysis of domesticated crop species. Heredity (Edinb) 2023; 130:111-113. [PMID: 36829043 PMCID: PMC9981727 DOI: 10.1038/s41437-023-00605-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/26/2023] Open
Affiliation(s)
- Limei Zhong
- Key Laboratory of Molecular Biology and Gene Engineering in Jiangxi, School of Life Sciences, Nanchang University, Nanchang, China.
| | - Youlin Zhu
- Key Laboratory of Molecular Biology and Gene Engineering in Jiangxi, School of Life Sciences, Nanchang University, Nanchang, China.
| | - Kenneth M Olsen
- Department of Biology, Washington University in St. Louis, St. Louis, MO, USA.
| |
Collapse
|
14
|
Genome-wide signatures of the geographic expansion and breeding of soybean. SCIENCE CHINA. LIFE SCIENCES 2023; 66:350-365. [PMID: 35997916 DOI: 10.1007/s11427-022-2158-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 06/30/2022] [Indexed: 10/15/2022]
Abstract
Soybean is a leguminous crop that provides oil and protein. Exploring the genomic signatures of soybean evolution is crucial for breeding varieties with improved adaptability to environmental extremes. We analyzed the genome sequences of 2,214 soybeans and proposed a soybean evolutionary route, i.e., the expansion of annual wild soybean (Glycine soja Sieb. & Zucc.) from southern China and its domestication in central China, followed by the expansion and local breeding selection of its landraces (G. max (L.) Merr.). We observed that the genetic introgression in soybean landraces was mostly derived from sympatric rather than allopatric wild populations during the geographic expansion. Soybean expansion and breeding were accompanied by the positive selection of flowering time genes, including GmSPA3c. Our study sheds light on the evolutionary history of soybean and provides valuable genetic resources for its future breeding.
Collapse
|
15
|
Lu J, Zhen S, Zhang J, Xie Y, He C, Wang X, Wang Z, Zhang S, Li Y, Cui Y, Wang G, Wang J, Liu J, Li L, Gu R, Zheng X, Fu J. Combined population transcriptomic and genomic analysis reveals cis-regulatory differentiation of non-coding RNAs in maize. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2023; 136:16. [PMID: 36662257 DOI: 10.1007/s00122-023-04293-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 11/10/2022] [Indexed: 06/17/2023]
Abstract
Long intergenic non-coding RNA (lincRNA), cis-acting expression quantitative trait locus (cis-eQTL), maize, regulatory evolution. The law of genetic variation during domestication explains the evolutionary mechanism and provides a theoretical basis for improving existing varieties of maize. Previous studies focused on exploiting regulatory variations controlling the expression of protein-coding genes rather than of non-protein-coding genes. Here, we examined the genetic and evolutionary features of long non-coding RNAs from intergenic regions (long intergenic non-coding RNAs, lincRNAs) using population-scale transcriptome data and identified 1168 lincRNAs with cis-acting expression quantitative trait loci (cis-eQTLs). We found that lincRNAs are more likely to be regulated by cis-eQTLs, which exert stronger effects than the protein-coding genes. During maize domestication and improvement, upregulated alleles of lincRNAs, which originated from both standing variation and new mutation, accumulate more frequently and show larger effect sizes than the coding genes. A stronger signature of genetic differentiation was observed in their regulatory regions compared to those of randomly sampled lincRNAs. In addition, we found that cis-regulatory differentiation of lincRNAs is related to the sequence conservation of lincRNA transcripts. Non-conserved lincRNAs more tend to gain upregulated alleles and show a stronger relationship with selected traits than conserved lincRNAs between maize and its wild relatives. Our findings in maize improve the understanding of cis-regulatory variation in lincRNA genes during domestication and improvement and provide an effective approach for prioritizing candidates for further investigation.
Collapse
Affiliation(s)
- Jiawen Lu
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Sihan Zhen
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Jie Zhang
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yuxin Xie
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Cheng He
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Xiaoli Wang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Zheyuan Wang
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Song Zhang
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yongxiang Li
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yu Cui
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Guoying Wang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, 572024, China
| | - Jianhua Wang
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
| | - Jun Liu
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Lin Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Riliang Gu
- Center of Seed Science and Technology, Beijing Innovation Center for Seed Technology, Ministry of Agriculture and Rural Affairs, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China.
| | - Xiaoming Zheng
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, 572024, China.
- International Rice Research Institute, DAPO Box 7777, Metro Manila, Philippines.
| | - Junjie Fu
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| |
Collapse
|
16
|
Mendes M, Jonnalagadda M, Ozarkar S, Lima Torres FC, Borda Pua V, Kendall C, Tarazona-Santos E, Parra EJ. Identifying signatures of natural selection in Indian populations. PLoS One 2022; 17:e0271767. [PMID: 35925921 PMCID: PMC9352006 DOI: 10.1371/journal.pone.0271767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 07/06/2022] [Indexed: 11/18/2022] Open
Abstract
In this study, we present the results of a genome-wide scan for signatures of positive selection using data from four tribal groups (Kokana, Warli, Bhil, and Pawara) and two caste groups (Deshastha Brahmin and Kunbi Maratha) from West of the Maharashtra State In India, as well as two samples of South Asian ancestry from the 1KG project (Gujarati Indian from Houston, Texas and Indian Telugu from UK). We used an outlier approach based on different statistics, including PBS, xpEHH, iHS, CLR, Tajima's D, as well as two recently developed methods: Graph-aware Retrieval of Selective Sweeps (GRoSS) and Ascertained Sequentially Markovian Coalescent (ASMC). In order to minimize the risk of false positives, we selected regions that are outliers in all the samples included in the study using more than one method. We identified putative selection signals in 107 regions encompassing 434 genes. Many of the regions overlap with only one gene. The signals observed using microarray-based data are very consistent with our analyses using high-coverage sequencing data, as well as those identified with a novel coalescence-based method (ASMC). Importantly, at least 24 of these genomic regions have been identified in previous selection scans in South Asian populations or in other population groups. Our study highlights genomic regions that may have played a role in the adaptation of anatomically modern humans to novel environmental conditions after the out of Africa migration.
Collapse
Affiliation(s)
- Marla Mendes
- Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
- Department of Anthropology, University of Toronto—Mississauga Campus, Mississauga, ON, Canada
| | - Manjari Jonnalagadda
- Symbiosis School for Liberal Arts (SSLA), Symbiosis International University (SIU), Pune, India
| | - Shantanu Ozarkar
- Department of Anthropology, Savitribai Phule Pune University, Pune, India
| | - Flávia Carolina Lima Torres
- Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Victor Borda Pua
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States of America
| | - Christopher Kendall
- Department of Anthropology, University of Toronto—Mississauga Campus, Mississauga, ON, Canada
| | - Eduardo Tarazona-Santos
- Departamento de Genética, Ecologia e Evolução, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Esteban J. Parra
- Department of Anthropology, University of Toronto—Mississauga Campus, Mississauga, ON, Canada
| |
Collapse
|
17
|
Padró J, De Panis DN, Luisi P, Dopazo H, Szajnman S, Hasson E, Soto IM. Ortholog genes from cactophilic Drosophila provide insight into human adaptation to hallucinogenic cacti. Sci Rep 2022; 12:13180. [PMID: 35915153 PMCID: PMC9343604 DOI: 10.1038/s41598-022-17118-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 07/20/2022] [Indexed: 11/18/2022] Open
Abstract
Cultural transformations of lifestyles and dietary practices have been key drivers of human evolution. However, while most of the evidence of genomic adaptations is related to the hunter-gatherer transition to agricultural societies, little is known on the influence of other major cultural manifestations. Shamanism is considered the oldest religion that predominated throughout most of human prehistory and still prevails in many indigenous populations. Several lines of evidence from ethno-archeological studies have demonstrated the continuity and importance of psychoactive plants in South American cultures. However, despite the well-known importance of secondary metabolites in human health, little is known about its role in the evolution of ethnic differences. Herein, we identified candidate genes of adaptation to hallucinogenic cactus in Native Andean populations with a long history of shamanic practices. We used genome-wide expression data from the cactophilic fly Drosophila buzzatii exposed to a hallucinogenic columnar cactus, also consumed by humans, to identify ortholog genes exhibiting adaptive footprints of alkaloid tolerance. Genomic analyses in human populations revealed a suite of ortholog genes evolving under recent positive selection in indigenous populations of the Central Andes. Our results provide evidence of selection in genetic variants related to alkaloids toxicity, xenobiotic metabolism, and neuronal plasticity in Aymara and Quechua populations, suggesting a possible process of gene-culture coevolution driven by religious practices.
Collapse
Affiliation(s)
- Julian Padró
- INIBIOMA-CONICET, Universidad Nacional del Comahue, Quintral 1250, R8400FRF, San Carlos de Bariloche, Argentina.
| | - Diego N De Panis
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Pierre Luisi
- Facultad de Filosofía y Humanidades, Universidad Nacional de Córdoba (FFyH-UNC), Córdoba, Argentina.,Microbial Paleogenomics Unit, Institut Pasteur, 25-28 Rue du Dr Roux, 75015, Paris, France
| | - Hernan Dopazo
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Sergio Szajnman
- Departamento de Química Orgánica and UMYMFOR (CONICET-FCEyN), Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Esteban Hasson
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| | - Ignacio M Soto
- IEGEBA-CONICET, Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Güiraldes 2160, C1428EHA, Buenos Aires, Argentina
| |
Collapse
|
18
|
Hitchhiking Mapping of Candidate Regions Associated with Fat Deposition in Iranian Thin and Fat Tail Sheep Breeds Suggests New Insights into Molecular Aspects of Fat Tail Selection. Animals (Basel) 2022; 12:ani12111423. [PMID: 35681887 PMCID: PMC9179914 DOI: 10.3390/ani12111423] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 05/07/2022] [Accepted: 05/12/2022] [Indexed: 12/20/2022] Open
Abstract
Simple Summary Fatness-related traits are economically very important in sheep production and are associated with serious diseases in humans. Using a denser set of SNP markers and a variety of statistical approaches, our results were able to refine the regions associated with fat deposition and to suggest new insights into molecular aspects of fat tail selection. These results may provide a strong foundation for studying the regulation of fat deposition in sheep and do offer hope that the causal mutations and the mode of inheritance of this trait will soon be discovered by further investigation. Abstract The fat tail is a phenotype that divides indigenous Iranian sheep genetic resources into two major groups. The objective of the present study is to refine the map location of candidate regions associated with fat deposition, obtained via two separate whole genome scans contrasting thin and fat tail breeds, and to determine the nature of the selection occurring in these regions using a hitchhiking approach. Zel (thin tail) and Lori-Bakhtiari (fat tail) breed samples that had previously been run on the Illumina Ovine 50 k BeadChip, were genotyped with a denser set of SNPs in the three candidate regions using a Sequenom Mass ARRAY platform. Statistical tests were then performed using different and complementary methods based on either site frequency (FST and Median homozygosity) or haplotype (iHS and XP-EHH). The results from candidate regions on chromosome 5 and X revealed clear evidence of selection with the derived haplotypes that was consistent with selection to near fixation for the haplotypes affecting fat tail size in the fat tail breed. An analysis of the candidate region on chromosome 7 indicated that selection differentiated the beneficial alleles between breeds and homozygosity has increased in the thin tail breed which also had the ancestral haplotype. These results enabled us to confirm the signature of selection in these regions and refine the critical intervals from 113 kb, 201 kb, and 2831 kb to 28 kb, 142 kb, and 1006 kb on chromosome 5, 7, and X respectively. These regions contain several genes associated with fat metabolism or developmental processes consisting of TCF7 and PPP2CA (OAR5), PTGDR and NID2 (OAR7), AR, EBP, CACNA1F, HSD17B10,SLC35A2, BMP15, WDR13, and RBM3 (OAR X), and each of which could potentially be the actual target of selection. The study of core haplotypes alleles in our regions of interest also supported the hypothesis that the first domesticated sheep were thin tailed, and that fat tail animals were developed later. Overall, our results provide a comprehensive assessment of how and where selection has affected the patterns of variation in candidate regions associated with fat deposition in thin and fat tail sheep breeds.
Collapse
|
19
|
The challenge of detecting recent natural selection in human populations. Proc Natl Acad Sci U S A 2022; 119:e2203237119. [PMID: 35353603 PMCID: PMC9169803 DOI: 10.1073/pnas.2203237119] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
|
20
|
Colomer-Vilaplana A, Murga-Moreno J, Canalda-Baltrons A, Inserte C, Soto D, Coronado-Zamora M, Barbadilla A, Casillas S. PopHumanVar: an interactive application for the functional characterization and prioritization of adaptive genomic variants in humans. Nucleic Acids Res 2022; 50:D1069-D1076. [PMID: 34664660 PMCID: PMC8728255 DOI: 10.1093/nar/gkab925] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Revised: 09/17/2021] [Accepted: 09/28/2021] [Indexed: 12/22/2022] Open
Abstract
Adaptive challenges that humans faced as they expanded across the globe left specific molecular footprints that can be decoded in our today's genomes. Different sets of metrics are used to identify genomic regions that have undergone selection. However, there are fewer methods capable of pinpointing the allele ultimately responsible for this selection. Here, we present PopHumanVar, an interactive online application that is designed to facilitate the exploration and thorough analysis of candidate genomic regions by integrating both functional and population genomics data currently available. PopHumanVar generates useful summary reports of prioritized variants that are putatively causal of recent selective sweeps. It compiles data and graphically represents different layers of information, including natural selection statistics, as well as functional annotations and genealogical estimations of variant age, for biallelic single nucleotide variants (SNVs) of the 1000 Genomes Project phase 3. Specifically, PopHumanVar amasses SNV-based information from GEVA, SnpEFF, GWAS Catalog, ClinVar, RegulomeDB and DisGeNET databases, as well as accurate estimations of iHS, nSL and iSAFE statistics. Notably, PopHumanVar can successfully identify known causal variants of frequently reported candidate selection regions, including EDAR in East-Asians, ACKR1 (DARC) in Africans and LCT/MCM6 in Europeans. PopHumanVar is open and freely available at https://pophumanvar.uab.cat.
Collapse
Affiliation(s)
- Aina Colomer-Vilaplana
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Jesús Murga-Moreno
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Aleix Canalda-Baltrons
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Clara Inserte
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Daniel Soto
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Marta Coronado-Zamora
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| |
Collapse
|
21
|
Natri HM, Hudjashov G, Jacobs G, Kusuma P, Saag L, Darusallam CC, Metspalu M, Sudoyo H, Cox MP, Gallego Romero I, Banovich NE. Genetic architecture of gene regulation in Indonesian populations identifies QTLs associated with global and local ancestries. Am J Hum Genet 2022; 109:50-65. [PMID: 34919805 PMCID: PMC8764200 DOI: 10.1016/j.ajhg.2021.11.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Accepted: 11/16/2021] [Indexed: 02/07/2023] Open
Abstract
Lack of diversity in human genomics limits our understanding of the genetic underpinnings of complex traits, hinders precision medicine, and contributes to health disparities. To map genetic effects on gene regulation in the underrepresented Indonesian population, we have integrated genotype, gene expression, and CpG methylation data from 115 participants across three island populations that capture the major sources of genomic diversity in the region. In a comparison with European datasets, we identify eQTLs shared between Indonesia and Europe as well as population-specific eQTLs that exhibit differences in allele frequencies and/or overall expression levels between populations. By combining local ancestry and archaic introgression inference with eQTLs and methylQTLs, we identify regulatory loci driven by modern Papuan ancestry as well as introgressed Denisovan and Neanderthal variation. GWAS colocalization connects QTLs detected here to hematological traits, and further comparison with European datasets reflects the poor overall transferability of GWAS statistics across diverse populations. Our findings illustrate how population-specific genetic architecture, local ancestry, and archaic introgression drive variation in gene regulation across genetically distinct and in admixed populations and highlight the need for performing association studies on non-European populations.
Collapse
Affiliation(s)
- Heini M Natri
- Center for Evolution and Medicine, School of Life Sciences, Arizona State University, Tempe, AZ 85281, USA; The Translational Genomics Research Institute, Phoenix, AZ 85004, USA
| | - Georgi Hudjashov
- Statistics and Bioinformatics Group, School of Fundamental Sciences, Massey University, Palmerston North 4410, New Zealand; Centre for Genomics, Evolution and Medicine, Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Guy Jacobs
- Leverhulme Centre for Human Evolutionary Studies, Department of Archaeology, University of Cambridge, Cambridge CB2 1QH, UK; Complexity Institute, Nanyang Technological University, Singapore, 637460
| | - Pradiptajati Kusuma
- Complexity Institute, Nanyang Technological University, Singapore, 637460; Laboratory of Genome Diversity and Disease, Eijkman Institute for Molecular Biology, Jakarta 10430, Indonesia
| | - Lauri Saag
- Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Chelzie Crenna Darusallam
- Laboratory of Genome Diversity and Disease, Eijkman Institute for Molecular Biology, Jakarta 10430, Indonesia
| | - Mait Metspalu
- Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Herawati Sudoyo
- Laboratory of Genome Diversity and Disease, Eijkman Institute for Molecular Biology, Jakarta 10430, Indonesia
| | - Murray P Cox
- Statistics and Bioinformatics Group, School of Fundamental Sciences, Massey University, Palmerston North 4410, New Zealand
| | - Irene Gallego Romero
- Centre for Genomics, Evolution and Medicine, Institute of Genomics, University of Tartu, Tartu 51010, Estonia; Melbourne Integrative Genomics, University of Melbourne, Parkville, VIC 3010, Australia; School of BioSciences, University of Melbourne, Parkville, VIC 3010, Australia; Centre for Stem Cell Systems, University of Melbourne, Parkville, VIC 3010, Australia
| | | |
Collapse
|
22
|
Kun Á. Is there still evolution in the human population? Biol Futur 2022; 73:359-374. [PMID: 36592324 PMCID: PMC9806833 DOI: 10.1007/s42977-022-00146-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 12/08/2022] [Indexed: 01/03/2023]
Abstract
It is often claimed that humanity has stopped evolving because modern medicine erased all selection on survival. Even if that would be true, and it is not, there would be other mechanisms of evolution which could still led to changes in allelic frequencies. Here I show, by applying basic evolutionary genetics knowledge, that we expect humanity to evolve. The results from genome sequencing projects have repeatedly affirmed that there are still recent signs of selection in our genomes. I give some examples of such adaptation. Then I briefly discuss what our evolutionary future has in store for us.
Collapse
Affiliation(s)
- Ádám Kun
- grid.5591.80000 0001 2294 6276Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary ,Parmenides Center for the Conceptual Foundations of Science, Pöcking, Germany ,grid.481817.3Institute of Evolution, Centre for Ecological Research, Budapest, Hungary ,grid.5018.c0000 0001 2149 4407MTA-ELTE Theoretical Biology and Evolutionary Ecology Research Group, Budapest, Hungary ,grid.5018.c0000 0001 2149 4407MTA-ELTE-MTM Ecology Research Group, Budapest, Hungary
| |
Collapse
|
23
|
Laval G, Patin E, Boutillier P, Quintana-Murci L. Sporadic occurrence of recent selective sweeps from standing variation in humans as revealed by an approximate Bayesian computation approach. Genetics 2021; 219:6377789. [PMID: 34849862 DOI: 10.1093/genetics/iyab161] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Accepted: 09/01/2021] [Indexed: 12/14/2022] Open
Abstract
During their dispersals over the last 100,000 years, modern humans have been exposed to a large variety of environments, resulting in genetic adaptation. While genome-wide scans for the footprints of positive Darwinian selection have increased knowledge of genes and functions potentially involved in human local adaptation, they have globally produced evidence of a limited contribution of selective sweeps in humans. Conversely, studies based on machine learning algorithms suggest that recent sweeps from standing variation are widespread in humans, an observation that has been recently questioned. Here, we sought to formally quantify the number of recent selective sweeps in humans, by leveraging approximate Bayesian computation and whole-genome sequence data. Our computer simulations revealed suitable ABC estimations, regardless of the frequency of the selected alleles at the onset of selection and the completion of sweeps. Under a model of recent selection from standing variation, we inferred that an average of 68 (from 56 to 79) and 140 (from 94 to 198) sweeps occurred over the last 100,000 years of human history, in African and Eurasian populations, respectively. The former estimation is compatible with human adaptation rates estimated since divergence with chimps, and reveals numbers of sweeps per generation per site in the range of values estimated in Drosophila. Our results confirm the rarity of selective sweeps in humans and show a low contribution of sweeps from standing variation to recent human adaptation.
Collapse
Affiliation(s)
- Guillaume Laval
- Human Evolutionary Genetics Unit, Institut Pasteur, UMR 2000, CNRS, Paris 75015, France
| | - Etienne Patin
- Human Evolutionary Genetics Unit, Institut Pasteur, UMR 2000, CNRS, Paris 75015, France
| | - Pierre Boutillier
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
| | - Lluis Quintana-Murci
- Human Evolutionary Genetics Unit, Institut Pasteur, UMR 2000, CNRS, Paris 75015, France.,Human Genomics and Evolution, Collège de France, 75005 Paris, France
| |
Collapse
|
24
|
Trigila AP, Pisciottano F, Franchini LF. Hearing loss genes reveal patterns of adaptive evolution at the coding and non-coding levels in mammals. BMC Biol 2021; 19:244. [PMID: 34784928 PMCID: PMC8594068 DOI: 10.1186/s12915-021-01170-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 10/21/2021] [Indexed: 11/26/2022] Open
Abstract
Background Mammals possess unique hearing capacities that differ significantly from those of the rest of the amniotes. In order to gain insights into the evolution of the mammalian inner ear, we aim to identify the set of genetic changes and the evolutionary forces that underlie this process. We hypothesize that genes that impair hearing when mutated in humans or in mice (hearing loss (HL) genes) must play important roles in the development and physiology of the inner ear and may have been targets of selective forces across the evolution of mammals. Additionally, we investigated if these HL genes underwent a human-specific evolutionary process that could underlie the evolution of phenotypic traits that characterize human hearing. Results We compiled a dataset of HL genes including non-syndromic deafness genes identified by genetic screenings in humans and mice. We found that many genes including those required for the normal function of the inner ear such as LOXHD1, TMC1, OTOF, CDH23, and PCDH15 show strong signatures of positive selection. We also found numerous noncoding accelerated regions in HL genes, and among them, we identified active transcriptional enhancers through functional enhancer assays in transgenic zebrafish. Conclusions Our results indicate that the key inner ear genes and regulatory regions underwent adaptive evolution in the basal branch of mammals and along the human-specific branch, suggesting that they could have played an important role in the functional remodeling of the cochlea. Altogether, our data suggest that morphological and functional evolution could be attained through molecular changes affecting both coding and noncoding regulatory regions. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-021-01170-6.
Collapse
Affiliation(s)
- Anabella P Trigila
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina
| | - Francisco Pisciottano
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina.,Current address: Instituto de Biología y Medicina Experimental (IBYME), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina
| | - Lucía F Franchini
- Instituto de Investigaciones en Ingeniería Genética y Biología Molecular (INGEBI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), C1428, Buenos Aires, Argentina.
| |
Collapse
|
25
|
Kumar R, Gyawali A, Morrison GD, Saski CA, Robertson DJ, Cook DD, Tharayil N, Schaefer RJ, Beissinger TM, Sekhon RS. Genetic Architecture of Maize Rind Strength Revealed by the Analysis of Divergently Selected Populations. PLANT & CELL PHYSIOLOGY 2021; 62:1199-1214. [PMID: 34015110 DOI: 10.1093/pcp/pcab059] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 05/04/2021] [Accepted: 05/19/2021] [Indexed: 06/12/2023]
Abstract
The strength of the stalk rind, measured as rind penetrometer resistance (RPR), is an important contributor to stalk lodging resistance. To enhance the genetic architecture of RPR, we combined selection mapping on populations developed by 15 cycles of divergent selection for high and low RPR with time-course transcriptomic and metabolic analyses of the stalks. Divergent selection significantly altered allele frequencies of 3,656 and 3,412 single- nucleotide polymorphisms (SNPs) in the high and low RPR populations, respectively. Surprisingly, only 110 (1.56%) SNPs under selection were common in both populations, while the majority (98.4%) were unique to each population. This result indicated that high and low RPR phenotypes are produced by biologically distinct mechanisms. Remarkably, regions harboring lignin and polysaccharide genes were preferentially selected in high and low RPR populations, respectively. The preferential selection was manifested as higher lignification and increased saccharification of the high and low RPR stalks, respectively. The evolution of distinct gene classes according to the direction of selection was unexpected in the context of parallel evolution and demonstrated that selection for a trait, albeit in different directions, does not necessarily act on the same genes. Tricin, a grass-specific monolignol that initiates the incorporation of lignin in the cell walls, emerged as a key determinant of RPR. Integration of selection mapping and transcriptomic analyses with published genetic studies of RPR identified several candidate genes including ZmMYB31, ZmNAC25, ZmMADS1, ZmEXPA2, ZmIAA41 and hk5. These findings provide a foundation for an enhanced understanding of RPR and the improvement of stalk lodging resistance.
Collapse
Affiliation(s)
- Rohit Kumar
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC 29634, USA
| | - Abiskar Gyawali
- Division of Biological Sciences, University of Missouri, 105 Tucker Hall, Columbia, MO 65211, USA
| | - Ginnie D Morrison
- Division of Biological Sciences, University of Missouri, 105 Tucker Hall, Columbia, MO 65211, USA
| | - Christopher A Saski
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC 29634, USA
| | - Daniel J Robertson
- Department of Mechanical Engineering, University of Idaho, Moscow, ID, USA
| | - Douglas D Cook
- Department of Mechanical Engineering, Brigham Young University, Provo, UT, USA
| | - Nishanth Tharayil
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC 29634, USA
| | | | - Timothy M Beissinger
- Department of Plant Breeding Methodology, University of Göttingen, Göttingen 37075, Germany
- Center for Integrated Breeding Research, University of Göttingen, Göttingen 37075, Germany
| | - Rajandeep S Sekhon
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC 29634, USA
| |
Collapse
|
26
|
Giannini HM, Meyer NJ. Genetics of Acute Respiratory Distress Syndrome: Pathways to Precision. Crit Care Clin 2021; 37:817-834. [PMID: 34548135 DOI: 10.1016/j.ccc.2021.05.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
Clinical risk factors alone fail to fully explain acute respiratory distress syndrome (ARDS) risk or ARDS death, suggesting that individual risk factors contribute. The goals of genomic ARDS studies include better mechanistic understanding, identifying dysregulated pathways that may be amenable to pharmacologic targeting, using genomic causal inference techniques to find measurable traits with meaning, and deconvoluting ARDS heterogeneity by proving reproducible subpopulations that may share a unique biology. This article discusses the latest advances in ARDS genomics, provides historical perspective, and highlights some of the ways that the coronavirus disease 2019 (COVID-19) pandemic is accelerating genomic ARDS research.
Collapse
Affiliation(s)
- Heather M Giannini
- University of Pennsylvania Perelman School of Medicine, 3400 Spruce Street, 5038 Gates Building, Philadelphia, PA 19104, USA
| | - Nuala J Meyer
- University of Pennsylvania Perelman School of Medicine, 3400 Spruce Street, 5038 Gates Building, Philadelphia, PA 19104, USA.
| |
Collapse
|
27
|
Evidence for opposing selective forces operating on human-specific duplicated TCAF genes in Neanderthals and humans. Nat Commun 2021; 12:5118. [PMID: 34433829 PMCID: PMC8387397 DOI: 10.1038/s41467-021-25435-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 08/04/2021] [Indexed: 11/30/2022] Open
Abstract
TRP channel-associated factor 1/2 (TCAF1/TCAF2) proteins antagonistically regulate the cold-sensor protein TRPM8 in multiple human tissues. Understanding their significance has been complicated given the locus spans a gap-ridden region with complex segmental duplications in GRCh38. Using long-read sequencing, we sequence-resolve the locus, annotate full-length TCAF models in primate genomes, and show substantial human-specific TCAF copy number variation. We identify two human super haplogroups, H4 and H5, and establish that TCAF duplications originated ~1.7 million years ago but diversified only in Homo sapiens by recurrent structural mutations. Conversely, in all archaic-hominin samples the fixation for a specific H4 haplotype without duplication is likely due to positive selection. Here, our results of TCAF copy number expansion, selection signals in hominins, and differential TCAF2 expression between haplogroups and high TCAF2 and TRPM8 expression in liver and prostate in modern-day humans imply TCAF diversification among hominins potentially in response to cold or dietary adaptations. Duplications of gene segments can allow novel physiological adaptations to evolve. A detailed analysis of the TCAF gene family in primates and archaic humans suggest rapid duplication and diversification in this gene family is associated with cold or dietary adaptations.
Collapse
|
28
|
Nunes K, Maia MHT, Dos Santos EJM, Dos Santos SEB, Guerreiro JF, Petzl-Erler ML, Bedoya G, Gallo C, Poletti G, Llop E, Tsuneto L, Bortolini MC, Rothhammer F, Single R, Ruiz-Linares A, Rocha J, Meyer D. How natural selection shapes genetic differentiation in the MHC region: A case study with Native Americans. Hum Immunol 2021; 82:523-531. [PMID: 33812704 PMCID: PMC8217218 DOI: 10.1016/j.humimm.2021.03.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Revised: 02/15/2021] [Accepted: 03/09/2021] [Indexed: 12/19/2022]
Abstract
The Human Leukocyte Antigen (HLA) loci are extremely well documented targets of balancing selection, yet few studies have explored how selection affects population differentiation at these loci. In the present study we investigate genetic differentiation at HLA genes by comparing differentiation at microsatellites distributed genomewide to those in the MHC region. Our study uses a sample of 494 individuals from 30 human populations, 28 of which are Native Americans, all of whom were typed for genomewide and MHC region microsatellites. We find greater differentiation in the MHC than in the remainder of the genome (FST-MHC = 0.130 and FST-Genomic = 0.087), and use a permutation approach to show that this difference is statistically significant, and not accounted for by confounding factors. This finding lies in the opposite direction to the expectation that balancing selection reduces population differentiation. We interpret our findings as evidence that selection favors different sets of alleles in distinct localities, leading to increased differentiation. Thus, balancing selection at HLA genes simultaneously increases intra-population polymorphism and inter-population differentiation in Native Americans.
Collapse
Affiliation(s)
- Kelly Nunes
- Departamento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, Brazil.
| | | | | | | | | | | | - Gabriel Bedoya
- Instituto de Biología, Universidad de Antioquia, Medellín, Colombia
| | - Carla Gallo
- Laboratorios de Investigación y Desarrollo, Universidad Peruana Cayetano Heredia, Lima, Peru
| | - Giovanni Poletti
- Facultad de Medicina, Universidad Peruana Cayetano Heredia, Lima, Peru
| | - Elena Llop
- Instituto de Ciencias Biomédicas, Faculdad de Medicina, Universidade de Chile, Santiago, Chile
| | - Luiza Tsuneto
- Departamento de Ciências Básicas da Saúde, Universidade Estadual de Maringá, Maringá, Brazil
| | - Maria Cátira Bortolini
- Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | | | - Richard Single
- Department of Mathematics and Statistics, University of Vermont, Burlington, VT, USA
| | - Andrés Ruiz-Linares
- Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai 200433, China; D Aix-Marseille University, CNRS, EFS, ADES, Marseille 13007, France
| | - Jorge Rocha
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Porto, Portugal; CIBIO - Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Porto, Portugal.
| | - Diogo Meyer
- Departamento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, Brazil.
| |
Collapse
|
29
|
Contribution of Evolutionary Selected Immune Gene Polymorphism to Immune-Related Disorders: The Case of Lymphocyte Scavenger Receptors CD5 and CD6. Int J Mol Sci 2021; 22:ijms22105315. [PMID: 34070159 PMCID: PMC8158487 DOI: 10.3390/ijms22105315] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Revised: 05/07/2021] [Accepted: 05/11/2021] [Indexed: 01/23/2023] Open
Abstract
Pathogens are one of the main selective pressures that ancestral humans had to adapt to. Components of the immune response system have been preferential targets of natural selection in response to such pathogen-driven pressure. In turn, there is compelling evidence showing that positively selected immune gene variants conferring increased resistance to past or present infectious agents are today associated with increased risk for autoimmune or inflammatory disorders but decreased risk of cancer, the other side of the same coin. CD5 and CD6 are lymphocytic scavenger receptors at the interphase of the innate and adaptive immune responses since they are involved in both: (i) microbial-associated pattern recognition; and (ii) modulation of intracellular signals mediated by the clonotypic antigen-specific receptor present in T and B cells (TCR and BCR, respectively). Here, we review available information on CD5 and CD6 as targets of natural selection as well as on the role of CD5 and CD6 variation in autoimmunity and cancer.
Collapse
|
30
|
Feng Y, McQuillan MA, Tishkoff SA. Evolutionary genetics of skin pigmentation in African populations. Hum Mol Genet 2021; 30:R88-R97. [PMID: 33438000 PMCID: PMC8117430 DOI: 10.1093/hmg/ddab007] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 01/07/2021] [Accepted: 01/07/2021] [Indexed: 12/14/2022] Open
Abstract
Skin color is a highly heritable human trait, and global variation in skin pigmentation has been shaped by natural selection, migration and admixture. Ethnically diverse African populations harbor extremely high levels of genetic and phenotypic diversity, and skin pigmentation varies widely across Africa. Recent genome-wide genetic studies of skin pigmentation in African populations have advanced our understanding of pigmentation biology and human evolutionary history. For example, novel roles in skin pigmentation for loci near MFSD12 and DDB1 have recently been identified in African populations. However, due to an underrepresentation of Africans in human genetic studies, there is still much to learn about the evolutionary genetics of skin pigmentation. Here, we summarize recent progress in skin pigmentation genetics in Africans and discuss the importance of including more ethnically diverse African populations in future genetic studies. In addition, we discuss methods for functional validation of adaptive variants related to skin pigmentation.
Collapse
Affiliation(s)
- Yuanqing Feng
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Michael A McQuillan
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Sarah A Tishkoff
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
31
|
Tennessen JA, Duraisingh MT. Three Signatures of Adaptive Polymorphism Exemplified by Malaria-Associated Genes. Mol Biol Evol 2021; 38:1356-1371. [PMID: 33185667 PMCID: PMC8042748 DOI: 10.1093/molbev/msaa294] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Malaria has been one of the strongest selective pressures on our species. Many of the best-characterized cases of adaptive evolution in humans are in genes tied to malaria resistance. However, the complex evolutionary patterns at these genes are poorly captured by standard scans for nonneutral evolution. Here, we present three new statistical tests for selection based on population genetic patterns that are observed more than once among key malaria resistance loci. We assess these tests using forward-time evolutionary simulations and apply them to global whole-genome sequencing data from humans, and thus we show that they are effective at distinguishing selection from neutrality. Each test captures a distinct evolutionary pattern, here called Divergent Haplotypes, Repeated Shifts, and Arrested Sweeps, associated with a particular period of human prehistory. We clarify the selective signatures at known malaria-relevant genes and identify additional genes showing similar adaptive evolutionary patterns. Among our top outliers, we see a particular enrichment for genes involved in erythropoiesis and for genes previously associated with malaria resistance, consistent with a major role for malaria in shaping these patterns of genetic diversity. Polymorphisms at these genes are likely to impact resistance to malaria infection and contribute to ongoing host-parasite coevolutionary dynamics.
Collapse
|
32
|
Tian S, Zhou X, Phuntsok T, Zhao N, Zhang D, Ning C, Li D, Zhao H. Genomic Analyses Reveal Genetic Adaptations to Tropical Climates in Chickens. iScience 2020; 23:101644. [PMID: 33103083 PMCID: PMC7578744 DOI: 10.1016/j.isci.2020.101644] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 08/19/2020] [Accepted: 09/30/2020] [Indexed: 12/05/2022] Open
Abstract
The genetic footprints of adaptations to naturally occurring tropical stress along with domestication are poorly reported in chickens. Here, by conducting population genomic analyses of 67 chickens inhabiting distinct climates, we found signals of gene flow from Tibetan chickens to Sri Lankan and Saudi Arabian breeds and identified 12 positively selected genes that are likely involved in genetic adaptations to both tropical desert and tropical monsoon island climates. Notably, in tropical desert climate, advantageous alleles of TLR7 and ZC3HAV1, which could inhibit replication of viruses in cells, suggest immune adaptation to the defense against zoonotic diseases in chickens. Furthermore, comparative genomic analysis showed that four genes (OC90, PLA2G12B, GPR17 and TNFRSF11A) involved in arachidonic acid metabolism have undergone convergent adaptation to tropical desert climate between birds and mammals. Our study offers insights into the genetic mechanisms of adaptations to tropical climates in birds and other animals and provides practical value for breeding design and medical research on avian viruses.
Collapse
Affiliation(s)
- Shilin Tian
- Department of Ecology, Tibetan Centre for Ecology and Conservation at WHU-TU, Hubei Key Laboratory of Cell Homeostasis, College of Life Sciences, Wuhan University, 299 Bayi Road, Wuhan, Hubei 430072, China
| | - Xuming Zhou
- CAS Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Beijing 100101, China
| | - Tashi Phuntsok
- Laboratory of Extreme Environmental Biological Resources and Adaptive Evolution, Research Center for Ecology, College of Science, Tibet University, Lhasa 850000, China
| | - Ning Zhao
- Laboratory of Extreme Environmental Biological Resources and Adaptive Evolution, Research Center for Ecology, College of Science, Tibet University, Lhasa 850000, China
| | - Dejing Zhang
- Novogene Bioinformatics Institute, Beijing 100015, China
| | - Chunyou Ning
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu 611130, China
| | - Diyan Li
- Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu 611130, China
| | - Huabin Zhao
- Department of Ecology, Tibetan Centre for Ecology and Conservation at WHU-TU, Hubei Key Laboratory of Cell Homeostasis, College of Life Sciences, Wuhan University, 299 Bayi Road, Wuhan, Hubei 430072, China
- Laboratory of Extreme Environmental Biological Resources and Adaptive Evolution, Research Center for Ecology, College of Science, Tibet University, Lhasa 850000, China
| |
Collapse
|
33
|
Single RM, Meyer D, Nunes K, Francisco RS, Hünemeier T, Maiers M, Hurley CK, Bedoya G, Gallo C, Hurtado AM, Llop E, Petzl-Erler ML, Poletti G, Rothhammer F, Tsuneto L, Klitz W, Ruiz-Linares A. Demographic history and selection at HLA loci in Native Americans. PLoS One 2020; 15:e0241282. [PMID: 33147239 PMCID: PMC7641399 DOI: 10.1371/journal.pone.0241282] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 10/12/2020] [Indexed: 12/22/2022] Open
Abstract
The American continent was the last to be occupied by modern humans, and native populations bear the marks of recent expansions, bottlenecks, natural selection, and population substructure. Here we investigate how this demographic history has shaped genetic variation at the strongly selected HLA loci. In order to disentangle the relative contributions of selection and demography process, we assembled a dataset with genome-wide microsatellites and HLA-A, -B, -C, and -DRB1 typing data for a set of 424 Native American individuals. We find that demographic history explains a sizeable fraction of HLA variation, both within and among populations. A striking feature of HLA variation in the Americas is the existence of alleles which are present in the continent but either absent or very rare elsewhere in the world. We show that this feature is consistent with demographic history (i.e., the combination of changes in population size associated with bottlenecks and subsequent population expansions). However, signatures of selection at HLA loci are still visible, with significant evidence selection at deeper timescales for most loci and populations, as well as population differentiation at HLA loci exceeding that seen at neutral markers.
Collapse
Affiliation(s)
- Richard M. Single
- Department of Mathematics and Statistics, University of Vermont, Burlington, Vermont, United States of America
- * E-mail:
| | - Diogo Meyer
- Departmento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, Brazil
| | - Kelly Nunes
- Departmento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, Brazil
| | | | - Tábita Hünemeier
- Departmento de Genética e Biologia Evolutiva, Universidade de São Paulo, São Paulo, Brazil
| | - Martin Maiers
- Center for International Blood and Marrow Transplant Research, Minneapolis, Minnesota, United States of America
| | - Carolyn K. Hurley
- CW Bill Young Marrow Donor Recruitment and Research Program, Georgetown University, Washington, DC, United States of America
| | - Gabriel Bedoya
- Instituto de Biología, Universidad de Antioquia Medellín, Medellín, Colombia
| | - Carla Gallo
- Laboratorios de Investigación y Desarrollo, Universidad Peruana Cayetano Heredia, Lima, Peru
| | - Ana Magdalena Hurtado
- School of Human Evolution and Social Change, Arizona State University, Tempe, Arizona, United States of America
| | - Elena Llop
- Programa de Genética Humana, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
| | | | - Giovanni Poletti
- Facultad de Medicina, Universidad Peruana Cayetano Heredia, Lima, Peru
| | - Francisco Rothhammer
- Programa de Genética Humana, Instituto de Ciencias Biomédicas, Facultad de Medicina, Universidad de Chile, Santiago, Chile
- Instituto de Alta Investigación, Tarapacá University, Arica, Chile
| | - Luiza Tsuneto
- Department of Basic Health Sciences, Universidade Estadual de Maringá, Maringá, Paraná, Brazil
| | - William Klitz
- Department of Integrative Biology, University of California, Berkeley, California, United States of America
| | - Andrés Ruiz-Linares
- Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Shanghai, China
- CNRS, EFS, ADES, D Aix-Marseille University, Marseille, France
| |
Collapse
|
34
|
DeSilva R, Dodd RS. Association of genetic and climatic variability in giant sequoia, Sequoiadendron giganteum, reveals signatures of local adaptation along moisture-related gradients. Ecol Evol 2020; 10:10619-10632. [PMID: 33072284 PMCID: PMC7548164 DOI: 10.1002/ece3.6716] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 07/28/2020] [Accepted: 08/05/2020] [Indexed: 11/12/2022] Open
Abstract
Uncovering the genetic basis of local adaptation is a major goal of evolutionary biology and conservation science alike. In an era of climate change, an understanding of how environmental factors shape adaptive diversity is crucial to predicting species response and directing management. Here, we investigate patterns of genomic variation in giant sequoia, an iconic and ecologically important tree species, using 1,364 bi-allelic single nucleotide polymorphisms (SNPs). We use an F ST outlier test and two genotype-environment association methods, latent factor mixed models (LFMMs) and redundancy analysis (RDA), to detect complex signatures of local adaptation. Results indicate 79 genomic regions of potential adaptive importance, with limited overlap between the detection methods. Of the 58 loci detected by LFMM, 51 showed strong correlations to a precipitation-driven composite variable and seven to a temperature-related variable. RDA revealed 24 outlier loci with association to climate variables, all of which showed strongest relationship to summer precipitation. Nine candidate loci were indicated by two methods. After correcting for geographic distance, RDA models using climate predictors accounted for 49% of the explained variance and showed significant correlations between SNPs and climatic factors. Here, we present evidence of local adaptation in giant sequoia along gradients of precipitation and provide a first step toward identifying genomic regions of adaptive significance. The results of this study will provide information to guide management strategies that seek to maximize adaptive potential in the face of climate change.
Collapse
Affiliation(s)
- Rainbow DeSilva
- Department of Environmental Science, Policy, and Management University of California at Berkeley Berkeley California USA
| | - Richard S Dodd
- Department of Environmental Science, Policy, and Management University of California at Berkeley Berkeley California USA
| |
Collapse
|
35
|
LaBella AL, Abraham A, Pichkar Y, Fong SL, Zhang G, Muglia LJ, Abbot P, Rokas A, Capra JA. Accounting for diverse evolutionary forces reveals mosaic patterns of selection on human preterm birth loci. Nat Commun 2020; 11:3731. [PMID: 32709900 PMCID: PMC7382462 DOI: 10.1038/s41467-020-17258-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 06/19/2020] [Indexed: 02/02/2023] Open
Abstract
Currently, there is no comprehensive framework to evaluate the evolutionary forces acting on genomic regions associated with human complex traits and contextualize the relationship between evolution and molecular function. Here, we develop an approach to test for signatures of diverse evolutionary forces on trait-associated genomic regions. We apply our method to regions associated with spontaneous preterm birth (sPTB), a complex disorder of global health concern. We find that sPTB-associated regions harbor diverse evolutionary signatures including conservation, excess population differentiation, accelerated evolution, and balanced polymorphism. Furthermore, we integrate evolutionary context with molecular evidence to hypothesize how these regions contribute to sPTB risk. Finally, we observe enrichment in signatures of diverse evolutionary forces in sPTB-associated regions compared to genomic background. By quantifying multiple evolutionary forces acting on sPTB-associated regions, our approach improves understanding of both functional roles and the mosaic of evolutionary forces acting on loci. Our work provides a blueprint for investigating evolutionary pressures on complex traits.
Collapse
Affiliation(s)
- Abigail L LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Abin Abraham
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA
- Vanderbilt University Medical Center, Vanderbilt University, Nashville, TN, 37232, USA
| | - Yakov Pichkar
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Sarah L Fong
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA
| | - Ge Zhang
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- The Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- March of Dimes Prematurity Research Center Ohio Collaborative, Cincinnati, OH, 45267, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, 45267, USA
| | - Louis J Muglia
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- The Center for Prevention of Preterm Birth, Perinatal Institute, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45229, USA
- March of Dimes Prematurity Research Center Ohio Collaborative, Cincinnati, OH, 45267, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, 45267, USA
| | - Patrick Abbot
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA.
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, 37235, USA.
| | - John A Capra
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA.
- Departments of Biomedical Informatics and Computer Science, Vanderbilt Genetics Institute, Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA.
| |
Collapse
|
36
|
Mathieson I. Human adaptation over the past 40,000 years. Curr Opin Genet Dev 2020; 62:97-104. [PMID: 32745952 PMCID: PMC7484260 DOI: 10.1016/j.gde.2020.06.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Revised: 05/10/2020] [Accepted: 06/01/2020] [Indexed: 02/07/2023]
Abstract
Over the past few years several methodological and data-driven advances have greatly improved our ability to robustly detect genomic signatures of selection in humans. New methods applied to large samples of present-day genomes provide increased power, while ancient DNA allows precise estimation of timing and tempo. However, despite these advances, we are still limited in our ability to translate these signatures into understanding about which traits were actually under selection, and why. Combining information from different populations and timescales may allow interpretation of selective sweeps. Other modes of selection have proved more difficult to detect. In particular, despite strong evidence of the polygenicity of most human traits, evidence for polygenic selection is weak, and its importance in recent human evolution remains unclear. Balancing selection and archaic introgression seem important for the maintenance of potentially adaptive immune diversity, but perhaps less so for other traits.
Collapse
Affiliation(s)
- Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, United States.
| |
Collapse
|
37
|
Murga-Moreno J, Coronado-Zamora M, Bodelón A, Barbadilla A, Casillas S. PopHumanScan: the online catalog of human genome adaptation. Nucleic Acids Res 2020; 47:D1080-D1089. [PMID: 30335169 PMCID: PMC6323894 DOI: 10.1093/nar/gky959] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 10/04/2018] [Indexed: 12/20/2022] Open
Abstract
Since the migrations that led humans to colonize Earth, our species has faced frequent adaptive challenges that have left signatures in the landscape of genetic variation and that we can identify in our today’s genomes. Here, we (i) perform an outlier approach on eight different population genetic statistics for 22 non-admixed human populations of the Phase III of the 1000 Genomes Project to detect selective sweeps at different historical ages, as well as events of recurrent positive selection in the human lineage; and (ii) create PopHumanScan, an online catalog that compiles and annotates all candidate regions under selection to facilitate their validation and thoroughly analysis. Well-known examples of human genetic adaptation published elsewhere are included in the catalog, as well as hundreds of other attractive candidates that will require further investigation. Designed as a collaborative database, PopHumanScan aims to become a central repository to share information, guide future studies and help advance our understanding of how selection has modeled our genomes as a response to changes in the environment or lifestyle of human populations. PopHumanScan is open and freely available at https://pophumanscan.uab.cat.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Marta Coronado-Zamora
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Alejandra Bodelón
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Antonio Barbadilla
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Sònia Casillas
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| |
Collapse
|
38
|
The exhaustive genomic scan approach, with an application to rare-variant association analysis. Eur J Hum Genet 2020; 28:1283-1291. [PMID: 32415273 PMCID: PMC7608423 DOI: 10.1038/s41431-020-0639-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 02/28/2020] [Accepted: 04/07/2020] [Indexed: 12/12/2022] Open
Abstract
Region-based genome-wide scans are usually performed by use of a priori chosen analysis regions. Such an approach will likely miss the region comprising the strongest signal and, thus, may result in increased type II error rates and decreased power. Here, we propose a genomic exhaustive scan approach that analyzes all possible subsequences and does not rely on a prior definition of the analysis regions. As a prime instance, we present a computationally ultraefficient implementation using the rare-variant collapsing test for phenotypic association, the genomic exhaustive collapsing scan (GECS). Our implementation allows for the identification of regions comprising the strongest signals in large, genome-wide rare-variant association studies while controlling the family-wise error rate via permutation. Application of GECS to two genomic data sets revealed several novel significantly associated regions for age-related macular degeneration and for schizophrenia. Our approach also offers a high potential to improve genome-wide scans for selection, methylation, and other analyses.
Collapse
|
39
|
Shao Y, Tian HY, Zhang JJ, Kharrati-Koopaee H, Guo X, Zhuang XL, Li ML, Nanaie HA, Dehghani Tafti E, Shojaei B, Reza Namavar M, Sotoudeh N, Oluwakemi Ayoola A, Li JL, Liang B, Esmailizadeh A, Wang S, Wu DD. Genomic and Phenotypic Analyses Reveal Mechanisms Underlying Homing Ability in Pigeon. Mol Biol Evol 2020; 37:134-148. [PMID: 31501895 DOI: 10.1093/molbev/msz208] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The homing pigeon was selectively bred from the domestic pigeon for a homing ability over long distances, a very fascinating but complex behavioral trait. Here, we generate a total of 95 whole genomes from diverse pigeon breeds. Comparing the genomes from the homing pigeon population with those from other breeds identifies candidate positively selected genes, including many genes involved in the central nervous system, particularly spatial learning and memory such as LRP8. Expression profiling reveals many neuronal genes displaying differential expression in the hippocampus, which is the key organ for memory and navigation and exhibits significantly larger size in the homing pigeon. In addition, we uncover a candidate gene GSR (encoding glutathione-disulfide reductase) experiencing positive selection in the homing pigeon. Expression profiling finds that GSR is highly expressed in the wattle and visual pigment cell layer, and displays increased expression levels in the homing pigeon. In vitro, a magnetic field stimulates increases in calcium ion concentration in cells expressing pigeon GSR. These findings support the importance of the hippocampus (functioning in spatial memory and navigation) for homing ability, and the potential involvement of GSR in pigeon magnetoreception.
Collapse
Affiliation(s)
- Yong Shao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - Hang-Yu Tian
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming, China
| | - Jing-Jing Zhang
- Key Laboratory of Animal Models and Human Disease Mechanisms, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Department of Hepatobiliary Surgery, Affiliated Hospital of Guangdong Medical University, Zhanjiang, China
| | - Hamed Kharrati-Koopaee
- Department of Animal Science, Faculty of Agriculture, Shahid Bahonar University of Kerman, Kerman, Iran.,Institute of Biotechnology, School of Agriculture, Shiraz University, Shiraz, Iran
| | - Xing Guo
- College of Animal Science and Technology, Anhui Agricultural University, Hefei, China
| | - Xiao-Lin Zhuang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming, China
| | - Ming-Li Li
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming, China
| | | | - Elahe Dehghani Tafti
- Department of Basic Sciences, Faculty of Veterinary Medicine, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Bahador Shojaei
- Department of Basic Sciences, Faculty of Veterinary Medicine, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Mohammad Reza Namavar
- Clinical Neurology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran.,Histomorphometry and Stereology Research Center, Shiraz University of Medical Science, Shiraz, Iran
| | - Narges Sotoudeh
- Clinical Neurology Research Center, Shiraz University of Medical Sciences, Shiraz, Iran.,Anatomy Department, School of Medicine, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Adeola Oluwakemi Ayoola
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Kunming College of Life Science, University of the Chinese Academy of Sciences, Kunming, China
| | - Jia-Li Li
- Key Laboratory of Animal Models and Human Disease Mechanisms, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Bin Liang
- Key Laboratory of Animal Models and Human Disease Mechanisms, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Ali Esmailizadeh
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Department of Animal Science, Faculty of Agriculture, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Shu Wang
- School of Basic Medical Sciences, The Second Affiliated Hospital, Guangzhou Medical University, Guangzhou, China
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
40
|
Bhati M, Kadri NK, Crysnanto D, Pausch H. Assessing genomic diversity and signatures of selection in Original Braunvieh cattle using whole-genome sequencing data. BMC Genomics 2020; 21:27. [PMID: 31914939 PMCID: PMC6950892 DOI: 10.1186/s12864-020-6446-y] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2019] [Accepted: 12/31/2019] [Indexed: 02/07/2023] Open
Abstract
Background Autochthonous cattle breeds are an important source of genetic variation because they might carry alleles that enable them to adapt to local environment and food conditions. Original Braunvieh (OB) is a local cattle breed of Switzerland used for beef and milk production in alpine areas. Using whole-genome sequencing (WGS) data of 49 key ancestors, we characterize genomic diversity, genomic inbreeding, and signatures of selection in Swiss OB cattle at nucleotide resolution. Results We annotated 15,722,811 SNPs and 1,580,878 Indels including 10,738 and 2763 missense deleterious and high impact variants, respectively, that were discovered in 49 OB key ancestors. Six Mendelian trait-associated variants that were previously detected in breeds other than OB, segregated in the sequenced key ancestors including variants causal for recessive xanthinuria and albinism. The average nucleotide diversity (1.6 × 10− 3) was higher in OB than many mainstream European cattle breeds. Accordingly, the average genomic inbreeding derived from runs of homozygosity (ROH) was relatively low (FROH = 0.14) in the 49 OB key ancestor animals. However, genomic inbreeding was higher in OB cattle of more recent generations (FROH = 0.16) due to a higher number of long (> 1 Mb) runs of homozygosity. Using two complementary approaches, composite likelihood ratio test and integrated haplotype score, we identified 95 and 162 genomic regions encompassing 136 and 157 protein-coding genes, respectively, that showed evidence (P < 0.005) of past and ongoing selection. These selection signals were enriched for quantitative trait loci related to beef traits including meat quality, feed efficiency and body weight and pathways related to blood coagulation, nervous and sensory stimulus. Conclusions We provide a comprehensive overview of sequence variation in Swiss OB cattle genomes. With WGS data, we observe higher genomic diversity and less inbreeding in OB than many European mainstream cattle breeds. Footprints of selection were detected in genomic regions that are possibly relevant for meat quality and adaptation to local environmental conditions. Considering that the population size is low and genomic inbreeding increased in the past generations, the implementation of optimal mating strategies seems warranted to maintain genetic diversity in the Swiss OB cattle population.
Collapse
Affiliation(s)
- Meenu Bhati
- Animal Genomics, ETH Zürich, Zürich, Switzerland.
| | | | | | | |
Collapse
|
41
|
Satta Y, Zheng W, Nishiyama KV, Iwasaki RL, Hayakawa T, Fujito NT, Takahata N. Two-dimensional site frequency spectrum for detecting, classifying and dating incomplete selective sweeps. Genes Genet Syst 2019; 94:283-300. [DOI: 10.1266/ggs.19-00012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Affiliation(s)
- Yoko Satta
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Wanjing Zheng
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Kumiko V. Nishiyama
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Risa L. Iwasaki
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Toshiyuki Hayakawa
- Graduate School of Systems Life Sciences and Faculty of Arts and Science, Kyushu University
| | - Naoko T. Fujito
- Institute for Human Genetics and Department of Epidemiology and Biostatistics, University of California
| | - Naoyuki Takahata
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| |
Collapse
|
42
|
Lynn H, Sun X, Casanova N, Gonzales-Garay M, Bime C, Garcia JGN. Genomic and Genetic Approaches to Deciphering Acute Respiratory Distress Syndrome Risk and Mortality. Antioxid Redox Signal 2019; 31:1027-1052. [PMID: 31016989 PMCID: PMC6939590 DOI: 10.1089/ars.2018.7701] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Significance: Acute respiratory distress syndrome (ARDS) is a severe, highly heterogeneous critical illness with staggering mortality that is influenced by environmental factors, such as mechanical ventilation, and genetic factors. Significant unmet needs in ARDS are addressing the paucity of validated predictive biomarkers for ARDS risk and susceptibility that hamper the conduct of successful clinical trials in ARDS and the complete absence of novel disease-modifying therapeutic strategies. Recent Advances: The current ARDS definition relies on clinical characteristics that fail to capture the diversity of disease pathology, severity, and mortality risk. We undertook a comprehensive survey of the available ARDS literature to identify genes and genetic variants (candidate gene and limited genome-wide association study approaches) implicated in susceptibility to developing ARDS in hopes of uncovering novel biomarkers for ARDS risk and mortality and potentially novel therapeutic targets in ARDS. We further attempted to address the well-known health disparities that exist in susceptibility to and mortality from ARDS. Critical Issues: Bioinformatic analyses identified 201 ARDS candidate genes with pathway analysis indicating a strong predominance in key evolutionarily conserved inflammatory pathways, including reactive oxygen species, innate immunity-related inflammation, and endothelial vascular signaling pathways. Future Directions: Future studies employing a system biology approach that combines clinical characteristics, genomics, transcriptomics, and proteomics may allow for a better definition of biologically relevant pathways and genotype-phenotype connections and result in improved strategies for the sub-phenotyping of diverse ARDS patients via molecular signatures. These efforts should facilitate the potential for successful clinical trials in ARDS and yield a better fundamental understanding of ARDS pathobiology.
Collapse
Affiliation(s)
- Heather Lynn
- Department of Physiological Sciences and University of Arizona, Tucson, Arizona.,Department of Health Sciences, University of Arizona, Tucson, Arizona
| | - Xiaoguang Sun
- Department of Health Sciences, University of Arizona, Tucson, Arizona
| | - Nancy Casanova
- Department of Health Sciences, University of Arizona, Tucson, Arizona
| | | | - Christian Bime
- Department of Health Sciences, University of Arizona, Tucson, Arizona
| | - Joe G N Garcia
- Department of Health Sciences, University of Arizona, Tucson, Arizona
| |
Collapse
|
43
|
Zhang C, Gao Y, Ning Z, Lu Y, Zhang X, Liu J, Xie B, Xue Z, Wang X, Yuan K, Ge X, Pan Y, Liu C, Tian L, Wang Y, Lu D, Hoh BP, Xu S. PGG.SNV: understanding the evolutionary and medical implications of human single nucleotide variations in diverse populations. Genome Biol 2019; 20:215. [PMID: 31640808 PMCID: PMC6805450 DOI: 10.1186/s13059-019-1838-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Accepted: 09/26/2019] [Indexed: 12/23/2022] Open
Abstract
Despite the tremendous growth of the DNA sequencing data in the last decade, our understanding of the human genome is still in its infancy. To understand the implications of genetic variants in the light of population genetics and molecular evolution, we developed a database, PGG.SNV ( https://www.pggsnv.org ), which gives much higher weight to previously under-investigated indigenous populations in Asia. PGG.SNV archives 265 million SNVs across 220,147 present-day genomes and 1018 ancient genomes, including 1009 newly sequenced genomes, representing 977 global populations. Moreover, estimation of population genetic diversity and evolutionary parameters is available in PGG.SNV, a unique feature compared with other databases.
Collapse
Affiliation(s)
- Chao Zhang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
- Present Address: Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Yang Gao
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
| | - Zhilin Ning
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Yan Lu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Xiaoxi Zhang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
| | - Jiaojiao Liu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China
| | - Bo Xie
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Zhe Xue
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Xiaoji Wang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Kai Yuan
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Xueling Ge
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Yuwen Pan
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Chang Liu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Lei Tian
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Yuchen Wang
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Dongsheng Lu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
| | - Boon-Peng Hoh
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China
- Faculty of Medicine and Health Sciences, UCSI University, Jalan Menara Gading, Taman Connaught, Cheras, 56000, Kuala Lumpur, Malaysia
| | - Shuhua Xu
- Chinese Academy of Sciences (CAS) Key Laboratory of Computational Biology, Max Planck Independent Research Group on Population Genomics, CAS-MPG Partner Institute for Computational Biology (PICB), Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, CAS, Shanghai, 200031, China.
- School of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, China.
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
- Collaborative Innovation Center of Genetics and Development, Shanghai, 200438, China.
| |
Collapse
|
44
|
Casillas S, Mulet R, Villegas-Mirón P, Hervas S, Sanz E, Velasco D, Bertranpetit J, Laayouni H, Barbadilla A. PopHuman: the human population genomics browser. Nucleic Acids Res 2019; 46:D1003-D1010. [PMID: 29059408 PMCID: PMC5753332 DOI: 10.1093/nar/gkx943] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Accepted: 10/04/2017] [Indexed: 12/17/2022] Open
Abstract
The 1000 Genomes Project (1000GP) represents the most comprehensive world-wide nucleotide variation data set so far in humans, providing the sequencing and analysis of 2504 genomes from 26 populations and reporting >84 million variants. The availability of this sequence data provides the human lineage with an invaluable resource for population genomics studies, allowing the testing of molecular population genetics hypotheses and eventually the understanding of the evolutionary dynamics of genetic variation in human populations. Here we present PopHuman, a new population genomics-oriented genome browser based on JBrowse that allows the interactive visualization and retrieval of an extensive inventory of population genetics metrics. Efficient and reliable parameter estimates have been computed using a novel pipeline that faces the unique features and limitations of the 1000GP data, and include a battery of nucleotide variation measures, divergence and linkage disequilibrium parameters, as well as different tests of neutrality, estimated in non-overlapping windows along the chromosomes and in annotated genes for all 26 populations of the 1000GP. PopHuman is open and freely available at http://pophuman.uab.cat.
Collapse
Affiliation(s)
- Sònia Casillas
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
- To whom correspondence should be addressed. Sònia Casillas. Tel: +34 93 5868958; Fax: +34 93 5812011; . Correspondence may also be addressed to Antonio Barbadilla.
| | - Roger Mulet
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Pablo Villegas-Mirón
- Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88 (PRBB), 08003 Barcelona, Catalonia, Spain
| | - Sergi Hervas
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Esteve Sanz
- Servei de Genòmica i Bioinformàtica, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Daniel Velasco
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Jaume Bertranpetit
- Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88 (PRBB), 08003 Barcelona, Catalonia, Spain
| | - Hafid Laayouni
- Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, Doctor Aiguader 88 (PRBB), 08003 Barcelona, Catalonia, Spain
- Bioinformatics Studies, ESCI-UPF, Pg. Pujades 1, 08003 Barcelona, Spain
| | - Antonio Barbadilla
- Institut de Biotecnologia i de Biomedicina and Department de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
- Servei de Genòmica i Bioinformàtica, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
- To whom correspondence should be addressed. Sònia Casillas. Tel: +34 93 5868958; Fax: +34 93 5812011; . Correspondence may also be addressed to Antonio Barbadilla.
| |
Collapse
|
45
|
Genomic signatures of seed mass adaptation to global precipitation gradients in sorghum. Heredity (Edinb) 2019; 124:108-121. [PMID: 31316156 PMCID: PMC6906510 DOI: 10.1038/s41437-019-0249-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Revised: 06/07/2019] [Accepted: 06/21/2019] [Indexed: 12/14/2022] Open
Abstract
Seed mass is a key component of adaptation in plants and a determinant of yield in crops. The climatic drivers and genomic basis of seed mass variation remain poorly understood. In the cereal crop Sorghum bicolor, globally-distributed landraces harbor abundant variation in seed mass, which is associated with precipitation in their agroclimatic zones of origin. This study aimed to test the hypothesis that diversifying selection across precipitation gradients, acting on ancestral cereal grain size regulators, underlies seed mass variation in global sorghum germplasm. We tested this hypothesis in a set of 1901 georeferenced and genotyped sorghum landraces, 100-seed mass from common gardens, and bioclimatic precipitation variables. As predicted, 100-seed mass in global germplasm varies significantly among botanical races and is correlated to proxies of the precipitation gradients. With general and mixed linear model genome-wide associations, we identified 29 and 56 of 100 a priori candidate seed size genes with polymorphisms in the top 1% of seed mass association, respectively. Eleven of these genes harbor polymorphisms associated with the precipitation gradient, including orthologs of genes that regulate seed size in other cereals. With FarmCPU, 13 significant SNPs were identified, including one at an a priori candidate gene. Finally, we identified eleven colocalized outlier SNPs associated with seed mass and precipitation that also carry signatures of selection based on FST scans and PCAdapt, which represents a significant enrichment. Our findings suggest that seed mass in sorghum was shaped by diversifying selection on drought stress, and can inform genomics-enabled breeding for climate-resilient cereals.
Collapse
|
46
|
Szpak M, Xue Y, Ayub Q, Tyler‐Smith C. How well do we understand the basis of classic selective sweeps in humans? FEBS Lett 2019; 593:1431-1448. [DOI: 10.1002/1873-3468.13447] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 04/29/2019] [Accepted: 05/17/2019] [Indexed: 12/14/2022]
Affiliation(s)
| | - Yali Xue
- The Wellcome Sanger Institute Hinxton UK
| | - Qasim Ayub
- School of Science Monash University Malaysia Bandar Sunway Malaysia
- Tropical Medicine and Biology Multidisciplinary Platform Monash University Malaysia Genomics Facility Bandar Sunway Malaysia
| | | |
Collapse
|
47
|
Reilly JP, Calfee CS, Christie JD. Acute Respiratory Distress Syndrome Phenotypes. Semin Respir Crit Care Med 2019; 40:19-30. [PMID: 31060085 DOI: 10.1055/s-0039-1684049] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The acute respiratory distress syndrome (ARDS) phenotype was first described over 50 years ago and since that time significant progress has been made in understanding the biologic processes underlying the syndrome. Despite this improved understanding, no pharmacologic therapies aimed at the underlying biology have been proven effective in ARDS. Increasingly, ARDS has been recognized as a heterogeneous syndrome characterized by subphenotypes with distinct clinical, radiographic, and biologic differences, distinct outcomes, and potentially distinct responses to therapy. The Berlin Definition of ARDS specifies three severity classifications: mild, moderate, and severe based on the PaO2 to FiO2 ratio. Two randomized controlled trials have demonstrated a potential benefit to prone positioning and neuromuscular blockade in moderate to severe phenotypes of ARDS only. Precipitating risk factor, direct versus indirect lung injury, and timing of ARDS onset can determine other clinical phenotypes of ARDS after admission. Radiographic phenotypes of ARDS have been described based on a diffuse versus focal pattern of infiltrates on chest imaging. Finally and most promisingly, biologic subphenotypes or endotypes have increasingly been identified using plasma biomarkers, genetics, and unbiased approaches such as latent class analysis. The potential of precision medicine lies in identifying novel therapeutics aimed at ARDS biology and the subpopulation within ARDS most likely to respond. In this review, we discuss the challenges and approaches to subphenotype ARDS into clinical, radiologic, severity, and biologic phenotypes with an eye toward the future of precision medicine in critical care.
Collapse
Affiliation(s)
- John P Reilly
- Division of Pulmonary, Allergy, and Critical Care, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Carolyn S Calfee
- Department of Medicine and Anesthesia, University of California, San Francisco, San Francisco, California
| | - Jason D Christie
- Division of Pulmonary, Allergy, and Critical Care, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| |
Collapse
|
48
|
Johnsson M. Integrating Selection Mapping With Genetic Mapping and Functional Genomics. Front Genet 2018; 9:603. [PMID: 30619447 PMCID: PMC6295561 DOI: 10.3389/fgene.2018.00603] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2018] [Accepted: 11/19/2018] [Indexed: 01/23/2023] Open
Abstract
Genomic scans for signatures of selection allow us to, in principle, detect variants and genes that underlie recent adaptations. By combining selection mapping with genetic mapping of traits known to be relevant to adaptation, we can simultaneously investigate whether genes and variants show signals of recent selection and whether they impact traits that have likely been selected. There are three ways to integrate selection mapping with genetic mapping or functional genomics: (1) To use genetic mapping data from other populations as a form of genome annotation. (2) To perform experimental evolution or artificial selection to be able to study selected variants when they segregate, either by performing genetic mapping before selection or by crossing the selected individuals to some reference population. (3) To perform a comparative study of related populations facing different selection regimes. This short review discusses these different ways of integrating selection mapping with genetic mapping and functional genomics, with examples of how each has been done.
Collapse
Affiliation(s)
- Martin Johnsson
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Edinburgh, United Kingdom.,Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
| |
Collapse
|
49
|
Fabian DK, Garschall K, Klepsatel P, Santos‐Matos G, Sucena É, Kapun M, Lemaitre B, Schlötterer C, Arking R, Flatt T. Evolution of longevity improves immunity in Drosophila. Evol Lett 2018; 2:567-579. [PMID: 30564440 PMCID: PMC6292704 DOI: 10.1002/evl3.89] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Accepted: 10/24/2018] [Indexed: 12/20/2022] Open
Abstract
Much has been learned about the genetics of aging from studies in model organisms, but still little is known about naturally occurring alleles that contribute to variation in longevity. For example, analysis of mutants and transgenes has identified insulin signaling as a major regulator of longevity, yet whether standing variation in this pathway underlies microevolutionary changes in lifespan and correlated fitness traits remains largely unclear. Here, we have analyzed the genomes of a set of Drosophila melanogaster lines that have been maintained under direct selection for postponed reproduction and indirect selection for longevity, relative to unselected control lines, for over 35 years. We identified many candidate loci shaped by selection for longevity and late-life fertility, but - contrary to expectation - we did not find overrepresentation of canonical longevity genes. Instead, we found an enrichment of immunity genes, particularly in the Toll pathway, suggesting that evolutionary changes in immune function might underpin - in part - the evolution of late-life fertility and longevity. To test whether this genomic signature is causative, we performed functional experiments. In contrast to control flies, long-lived flies tended to downregulate the expression of antimicrobial peptides upon infection with age yet survived fungal, bacterial, and viral infections significantly better, consistent with alleviated immunosenescence. To examine whether genes of the Toll pathway directly affect longevity, we employed conditional knockdown using in vivo RNAi. In adults, RNAi against the Toll receptor extended lifespan, whereas silencing the pathway antagonist cactus--causing immune hyperactivation - dramatically shortened lifespan. Together, our results suggest that genetic changes in the age-dependent regulation of immune homeostasis might contribute to the evolution of longer life.
Collapse
Affiliation(s)
- Daniel K. Fabian
- Centre for Pathogen Evolution, Department of ZoologyUniversity of CambridgeCambridgeUnited Kingdom
- Institut für PopulationsgenetikVetmeduni ViennaViennaAustria
- Vienna Graduate School of Population GeneticsViennaAustria
| | - Kathrin Garschall
- Department of Ecology and EvolutionUniversity of LausanneLausanneSwitzerland
| | - Peter Klepsatel
- Institut für PopulationsgenetikVetmeduni ViennaViennaAustria
- Institute of ZoologySlovak Academy of Sciences845 06 BratislavaSlovakia
| | | | - Élio Sucena
- Instituto Gulbenkian de CiênciaOeirasPortugal
- Departamento de Biologia AnimalFaculdade de Ciências da Universidade de LisboaLisboaPortugal
| | - Martin Kapun
- Department of Ecology and EvolutionUniversity of LausanneLausanneSwitzerland
| | - Bruno Lemaitre
- Global Health InstituteSchool of Life Sciences, EPFLLausanneSwitzerland
| | | | - Robert Arking
- Department of Biological SciencesWayne State UniversityDetroitMichigan
| | - Thomas Flatt
- Institut für PopulationsgenetikVetmeduni ViennaViennaAustria
- Department of Ecology and EvolutionUniversity of LausanneLausanneSwitzerland
- Department of BiologyUniversity of FribourgFribourgSwitzerland
| |
Collapse
|
50
|
Architecture of polymorphisms in the human genome reveals functionally important and positively selected variants in immune response and drug transporter genes. Hum Genomics 2018; 12:43. [PMID: 30219098 PMCID: PMC6139121 DOI: 10.1186/s40246-018-0175-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2018] [Accepted: 08/29/2018] [Indexed: 02/07/2023] Open
Abstract
Background Genetic polymorphisms can contribute to phenotypic differences amongst individuals, including disease risk and drug response. Characterization of genetic polymorphisms that modulate gene expression and/or protein function may facilitate the identification of the causal variants. Here, we present the architecture of genetic polymorphisms in the human genome focusing on those predicted to be potentially functional/under natural selection and the pathways that they reside. Results In the human genome, polymorphisms that directly affect protein sequences and potentially affect function are the most constrained variants with the lowest single-nucleotide variant (SNV) density, least population differentiation and most significant enrichment of rare alleles. SNVs which potentially alter various regulatory sites, e.g. splicing regulatory elements, are also generally under negative selection. Interestingly, genes that regulate the expression of transcription/splicing factors and histones are conserved as a higher proportion of these genes is non-polymorphic, contain ultra-conserved elements (UCEs) and/or has no non-synonymous SNVs (nsSNVs)/coding INDELs. On the other hand, major histocompatibility complex (MHC) genes are the most polymorphic with SNVs potentially affecting the binding of transcription/splicing factors and microRNAs (miRNA) exhibiting recent positive selection (RPS). The drug transporter genes carry the most number of potentially deleterious nsSNVs and exhibit signatures of RPS and/or population differentiation. These observations suggest that genes that interact with the environment are highly polymorphic and targeted by RPS. Conclusions In conclusion, selective constraints are observed in coding regions, master regulator genes, and potentially functional SNVs. In contrast, genes that modulate response to the environment are highly polymorphic and under positive selection. Electronic supplementary material The online version of this article (10.1186/s40246-018-0175-1) contains supplementary material, which is available to authorized users.
Collapse
|