Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Nelson D, Kelleher J, Ragsdale AP, Moreau C, McVean G, Gravel S. Accounting for long-range correlations in genome-wide simulations of large cohorts. PLoS Genet 2020;16:e1008619. [PMID: 32369493 PMCID: PMC7266353 DOI: 10.1371/journal.pgen.1008619] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Revised: 06/02/2020] [Accepted: 01/21/2020] [Indexed: 11/20/2022] Open

For:	Nelson D, Kelleher J, Ragsdale AP, Moreau C, McVean G, Gravel S. Accounting for long-range correlations in genome-wide simulations of large cohorts. PLoS Genet 2020;16:e1008619. [PMID: 32369493 PMCID: PMC7266353 DOI: 10.1371/journal.pgen.1008619] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Revised: 06/02/2020] [Accepted: 01/21/2020] [Indexed: 11/20/2022] Open

Number

Cited by Other Article(s)

Belman S, Pesonen H, Croucher NJ, Bentley SD, Corander J. Estimating between-country migration in pneumococcal populations. G3 (BETHESDA, MD.) 2024;14:jkae058. [PMID: 38507601 PMCID: PMC11152062 DOI: 10.1093/g3journal/jkae058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 02/29/2024] [Accepted: 03/11/2024] [Indexed: 03/22/2024]

Wong Y, Ignatieva A, Koskela J, Gorjanc G, Wohns AW, Kelleher J. A general and efficient representation of ancestral recombination graphs. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.03.565466. [PMID: 37961279 PMCID: PMC10635123 DOI: 10.1101/2023.11.03.565466] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]

Williams MP, Flegontov P, Maier R, Huber CD. Testing Times: Challenges in Disentangling Admixture Histories in Recent and Complex Demographies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.13.566841. [PMID: 38014190 PMCID: PMC10680674 DOI: 10.1101/2023.11.13.566841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]

Abstract

Paleogenomics has expanded our knowledge of human evolutionary history. Since the 2020s, the study of ancient DNA has increased its focus on reconstructing the recent past. However, the accuracy of paleogenomic methods in answering questions of historical and archaeological importance amidst the increased demographic complexity and decreased genetic differentiation within the historical period remains an open question. We used two simulation approaches to evaluate the limitations and behavior of commonly used methods, qpAdm and the f 3 -statistic, on admixture inference. The first is based on branch-length data simulated from four simple demographic models of varying complexities and configurations. The second, an analysis of Eurasian history composed of 59 populations using whole-genome data modified with ancient DNA conditions such as SNP ascertainment, data missingness, and pseudo-haploidization. We show that under conditions resembling historical populations, qpAdm can identify a small candidate set of true sources and populations closely related to them. However, in typical ancient DNA conditions, qpAdm is unable to further distinguish between them, limiting its utility for resolving fine-scaled hypotheses. Notably, we find that complex gene-flow histories generally lead to improvements in the performance of qpAdm and observe no bias in the estimation of admixture weights. We offer a heuristic for admixture inference that incorporates admixture weight estimate and P -values of qpAdm models, and f 3 -statistics to enhance the power to distinguish between multiple plausible candidates. Finally, we highlight the future potential of qpAdm through whole-genome branch-length f 2 -statistics, demonstrating the improved demographic inference that could be achieved with advancements in f -statistic estimations.

Collapse

Yüncü E, Işıldak U, Williams MP, Huber CD, Flegontova O, Vyazov LA, Changmai P, Flegontov P. False discovery rates of qpAdm-based screens for genetic admixture. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.25.538339. [PMID: 37904998 PMCID: PMC10614728 DOI: 10.1101/2023.04.25.538339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]

Abstract

Although a broad range of methods exists for reconstructing population history from genome-wide single nucleotide polymorphism data, just a few methods gained popularity in archaeogenetics: principal component analysis (PCA); ADMIXTURE, an algorithm that models individuals as mixtures of multiple ancestral sources represented by actual or inferred populations; formal tests for admixture such as f3-statistics and D/f4-statistics; and qpAdm, a tool for fitting two-component and more complex admixture models to groups or individuals. Despite their popularity in archaeogenetics, which is explained by modest computational requirements and ability to analyze data of various types and qualities, protocols relying on qpAdm that screen numerous alternative models of varying complexity and find "fitting" models (often considering both estimated admixture proportions and p-values as a composite criterion of model fit) remain untested on complex simulated population histories in the form of admixture graphs of random topology. We analyzed genotype data extracted from such simulations and tested various types of high-throughput qpAdm protocols ("rotating" and "non-rotating", with or without temporal stratification of target groups and proxy ancestry sources, and with or without a "model competition" step). We caution that high-throughput qpAdm protocols may be inappropriate for exploratory analyses in poorly studied regions/periods since their false discovery rates varied between 12% and 68% depending on the details of the protocol and on the amount and quality of simulated data (i.e., >12% of fitting two-way admixture models imply gene flows that were not simulated). We demonstrate that for reducing false discovery rates of qpAdm protocols to nearly 0% it is advisable to use large SNP sets with low missing data rates, the rotating qpAdm protocol with a strictly enforced rule that target groups do not pre-date their proxy sources, and an unsupervised ADMIXTURE analysis as a way to verify feasible qpAdm models. Our study has a number of limitations: for instance, these recommendations depend on the assumption that the underlying genetic history is a complex admixture graph and not a stepping-stone model.

Collapse

Medina-Muñoz SG, Ortega-Del Vecchyo D, Cruz-Hervert LP, Ferreyra-Reyes L, García-García L, Moreno-Estrada A, Ragsdale AP. Demographic modeling of admixed Latin American populations from whole genomes. Am J Hum Genet 2023;110:1804-1816. [PMID: 37725976 PMCID: PMC10577084 DOI: 10.1016/j.ajhg.2023.08.015] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 08/17/2023] [Accepted: 08/23/2023] [Indexed: 09/21/2023] Open

Abstract

Demographic models of Latin American populations often fail to fully capture their complex evolutionary history, which has been shaped by both recent admixture and deeper-in-time demographic events. To address this gap, we used high-coverage whole-genome data from Indigenous American ancestries in present-day Mexico and existing genomes from across Latin America to infer multiple demographic models that capture the impact of different timescales on genetic diversity. Our approach, which combines analyses of allele frequencies and ancestry tract length distributions, represents a significant improvement over current models in predicting patterns of genetic variation in admixed Latin American populations. We jointly modeled the contribution of European, African, East Asian, and Indigenous American ancestries into present-day Latin American populations. We infer that the ancestors of Indigenous Americans and East Asians diverged ∼30 thousand years ago, and we characterize genetic contributions of recent migrations from East and Southeast Asia to Peru and Mexico. Our inferred demographic histories are consistent across different genomic regions and annotations, suggesting that our inferences are robust to the potential effects of linked selection. In conjunction with published distributions of fitness effects for new nonsynonymous mutations in humans, we show in large-scale simulations that our models recover important features of both neutral and deleterious variation. By providing a more realistic framework for understanding the evolutionary history of Latin American populations, our models can help address the historical under-representation of admixed groups in genomics research and can be a valuable resource for future studies of populations with complex admixture and demographic histories.

Collapse

Lauterbur ME, Cavassim MIA, Gladstein AL, Gower G, Pope NS, Tsambos G, Adrion J, Belsare S, Biddanda A, Caudill V, Cury J, Echevarria I, Haller BC, Hasan AR, Huang X, Iasi LNM, Noskova E, Obsteter J, Pavinato VAC, Pearson A, Peede D, Perez MF, Rodrigues MF, Smith CCR, Spence JP, Teterina A, Tittes S, Unneberg P, Vazquez JM, Waples RK, Wohns AW, Wong Y, Baumdicker F, Cartwright RA, Gorjanc G, Gutenkunst RN, Kelleher J, Kern AD, Ragsdale AP, Ralph PL, Schrider DR, Gronau I. Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations. eLife 2023;12:RP84874. [PMID: 37342968 DOI: 10.7554/elife.84874] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2023] Open

Abstract

Simulation is a key tool in population genetics for both methods development and empirical research, but producing simulations that recapitulate the main features of genomic datasets remains a major obstacle. Today, more realistic simulations are possible thanks to large increases in the quantity and quality of available genetic data, and the sophistication of inference and simulation software. However, implementing these simulations still requires substantial time and specialized knowledge. These challenges are especially pronounced for simulating genomes for species that are not well-studied, since it is not always clear what information is required to produce simulations with a level of realism sufficient to confidently answer a given question. The community-developed framework stdpopsim seeks to lower this barrier by facilitating the simulation of complex population genetic models using up-to-date information. The initial version of stdpopsim focused on establishing this framework using six well-characterized model species (Adrion et al., 2020). Here, we report on major improvements made in the new release of stdpopsim (version 0.2), which includes a significant expansion of the species catalog and substantial additions to simulation capabilities. Features added to improve the realism of the simulated genomes include non-crossover recombination and provision of species-specific genomic annotations. Through community-driven efforts, we expanded the number of species in the catalog more than threefold and broadened coverage across the tree of life. During the process of expanding the catalog, we have identified common sticking points and developed the best practices for setting up genome-scale simulations. We describe the input data required for generating a realistic simulation, suggest good practices for obtaining the relevant information from the literature, and discuss common pitfalls and major considerations. These improvements to stdpopsim aim to further promote the use of realistic whole-genome population genetic simulations, especially in non-model organisms, making them available, transparent, and accessible to everyone.

Collapse

Affiliation(s)

M Elise Lauterbur Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, United States
Maria Izabel A Cavassim Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, United States
Ariella L Gladstein Embark Veterinary, Inc, Boston, United States
Graham Gower Section for Molecular Ecology and Evolution, Globe Institute, University of Copenhagen, Copenhagen, Denmark
Nathaniel S Pope Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Georgia Tsambos School of Mathematics and Statistics, University of Melbourne, Melbourne, Australia
Jeffrey Adrion Institute of Ecology and Evolution, University of Oregon, Eugene, United States Ancestry DNA, San Francisco, United States
Saurabh Belsare Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Arjun Biddanda 54Gene, Inc, Washington, United States
Victoria Caudill Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Jean Cury Universite Paris-Saclay, CNRS, INRIA, Laboratoire Interdisciplinaire des Sciences du Numerique, Orsay, France
Ignacio Echevarria School of Life Sciences, University of Glasgow, Glasgow, United Kingdom
Benjamin C Haller Department of Computational Biology, Cornell University, Ithaca, United States
Ahmed R Hasan Department of Cell and Systems Biology, University of Toronto, Toronto, Canada Department of Biology, University of Toronto Mississauga, Mississauga, Canada
Xin Huang Department of Evolutionary Anthropology, University of Vienna, Vienna, Austria Human Evolution and Archaeological Sciences (HEAS), University of Vienna, Vienna, Austria
Leonardo Nicola Martin Iasi Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Ekaterina Noskova Computer Technologies Laboratory, ITMO University, St Petersburg, Russian Federation
Jana Obsteter Agricultural Institute of Slovenia, Department of Animal Science, Ljubljana, Slovenia
Vitor Antonio Correa Pavinato Entomology Department, The Ohio State University, Wooster, United States
Alice Pearson Department of Genetics, University of Cambridge, Cambridge, United Kingdom Department of Zoology, University of Cambridge, Cambridge, United Kingdom
David Peede Department of Ecology, Evolution, and Organismal Biology, Brown University, Providence, United States Center for Computational Molecular Biology, Brown University, Providence, United States
Manolo F Perez Department of Genetics and Evolution, Federal University of Sao Carlos, Sao Carlos, Brazil
Murillo F Rodrigues Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Chris C R Smith Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Jeffrey P Spence Department of Genetics, Stanford University School of Medicine, Stanford, United States
Anastasia Teterina Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Silas Tittes Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Per Unneberg Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
Juan Manuel Vazquez Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
Ryan K Waples Department of Biostatistics, University of Washington, Seattle, United States
Anthony Wilder Wohns Broad Institute of MIT and Harvard, Cambridge, United States
Yan Wong Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
Franz Baumdicker Cluster of Excellence - Controlling Microbes to Fight Infections, Eberhard Karls Universit¨at Tubingen, Tubingen, Germany
Reed A Cartwright School of Life Sciences and The Biodesign Institute, Arizona State University, Tempe, United States
Gregor Gorjanc The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Edinburgh, United Kingdom
Ryan N Gutenkunst Department of Molecular and Cellular Biology, University of Arizona, Tucson, United States
Jerome Kelleher Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
Andrew D Kern Institute of Ecology and Evolution, University of Oregon, Eugene, United States
Aaron P Ragsdale Department of Integrative Biology, University of Wisconsin-Madison, Madison, United States
Peter L Ralph Institute of Ecology and Evolution, University of Oregon, Eugene, United States Department of Mathematics, University of Oregon, Eugene, United States
Daniel R Schrider Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, United States
Ilan Gronau Efi Arazi School of Computer Science, Reichman University, Herzliya, Israel

Collapse

Wei Y, Naseri A, Zhi D, Zhang S. RaPID-Query for fast identity by descent search and genealogical analysis. Bioinformatics 2023;39:btad312. [PMID: 37166451 PMCID: PMC10244210 DOI: 10.1093/bioinformatics/btad312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 04/26/2023] [Accepted: 05/09/2023] [Indexed: 05/12/2023] Open

Anderson-Trocmé L, Nelson D, Zabad S, Diaz-Papkovich A, Kryukov I, Baya N, Touvier M, Jeffery B, Dina C, Vézina H, Kelleher J, Gravel S. On the genes, genealogies, and geographies of Quebec. Science 2023;380:849-855. [PMID: 37228217 DOI: 10.1126/science.add5300] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 04/24/2023] [Indexed: 05/27/2023]

Nickchi P, Karunarathna C, Graham J. An exploration of linkage fine-mapping on sequences from case-control studies. Genet Epidemiol 2023;47:78-94. [PMID: 36047334 PMCID: PMC10087369 DOI: 10.1002/gepi.22502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 05/30/2022] [Accepted: 08/09/2022] [Indexed: 02/01/2023]

Flegontov P, Işıldak U, Maier R, Yüncü E, Changmai P, Reich D. Modeling of African population history using f -statistics can be highly biased and is not addressed by previously suggested SNP ascertainment schemes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.22.525077. [PMID: 36711923 PMCID: PMC9882349 DOI: 10.1101/2023.01.22.525077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]

Abstract

f -statistics have emerged as a first line of analysis for making inferences about demographic history from genome-wide data. These statistics can provide strong evidence for either admixture or cladality, which can be robust to substantial rates of errors or missing data. f -statistics are guaranteed to be unbiased under "SNP ascertainment" (analyzing non-randomly chosen subsets of single nucleotide polymorphisms) only if it relies on a population that is an outgroup for all groups analyzed. However, ascertainment on a true outgroup that is not co-analyzed with other populations is often impractical and uncommon in the literature. In this study focused on practical rather than theoretical aspects of SNP ascertainment, we show that many non-outgroup ascertainment schemes lead to false rejection of true demographic histories, as well as to failure to reject incorrect models. But the bias introduced by common ascertainments such as the 1240K panel is mostly limited to situations when more than one sub-Saharan African and/or archaic human groups (Neanderthals and Denisovans) or non-human outgroups are co-modelled, for example, f 4 -statistics involving one non-African group, two African groups, and one archaic group. Analyzing panels of SNPs polymorphic in archaic humans, which has been suggested as a solution for the ascertainment problem, cannot fix all these problems since for some classes of f -statistics it is not a clean outgroup ascertainment, and in other cases it demonstrates relatively low power to reject incorrect demographic models since it provides a relatively small number of variants common in anatomically modern humans. And due to the paucity of high-coverage archaic genomes, archaic individuals used for ascertainment often act as sole representatives of the respective groups in an analysis, and we show that this approach is highly problematic. By carrying out large numbers of simulations of diverse demographic histories, we find that bias in inferences based on f -statistics introduced by non-outgroup ascertainment can be minimized if the derived allele frequency spectrum in the population used for ascertainment approaches the spectrum that existed at the root of all groups being co-analyzed. Ascertaining on sites with variants common in a diverse group of African individuals provides a good approximation to such a set of SNPs, addressing the great majority of biases and also retaining high statistical power for studying population history. Such a "pan-African" ascertainment, although not completely problem-free, allows unbiased exploration of demographic models for the widest set of archaic and modern human populations, as compared to the other ascertainment schemes we explored.

Collapse

Korunes KL, Soares-Souza GB, Bobrek K, Tang H, Araújo II, Goldberg A, Beleza S. Sex-biased admixture and assortative mating shape genetic variation and influence demographic inference in admixed Cabo Verdeans. G3 GENES|GENOMES|GENETICS 2022;12:6647844. [PMID: 35861404 PMCID: PMC9526050 DOI: 10.1093/g3journal/jkac183] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 06/21/2022] [Indexed: 11/22/2022]

Avadhanam S, Williams AL. Simultaneous inference of parental admixture proportions and admixture times from unphased local ancestry calls. Am J Hum Genet 2022;109:1405-1420. [PMID: 35908549 PMCID: PMC9388397 DOI: 10.1016/j.ajhg.2022.06.016] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 06/24/2022] [Indexed: 02/06/2023] Open

Gopalan S, Berl REW, Myrick JW, Garfield ZH, Reynolds AW, Bafens BK, Belbin G, Mastoras M, Williams C, Daya M, Negash AN, Feldman MW, Hewlett BS, Henn BM. Hunter-gatherer genomes reveal diverse demographic trajectories during the rise of farming in Eastern Africa. Curr Biol 2022;32:1852-1860.e5. [PMID: 35271793 PMCID: PMC9050894 DOI: 10.1016/j.cub.2022.02.050] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2019] [Revised: 05/12/2021] [Accepted: 02/16/2022] [Indexed: 12/31/2022]

Affiliation(s)

Shyamalika Gopalan Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA; Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
Richard E W Berl School of Biological Sciences, Washington State University, Pullman, WA 99164, USA; Department of Human Dimensions of Natural Resources, Colorado State University, Fort Collins, CO 80523, USA
Justin W Myrick Department of Anthropology, University of California, Davis, Davis, CA 95616, USA; UC Davis Genome Center, University of California, Davis, Davis, CA 95616, USA
Zachary H Garfield Department of Anthropology, Washington State University, Vancouver, WA 98686, USA; Institute for Advanced Study in Toulouse, Université Toulouse, Toulouse 31080, France
Austin W Reynolds Department of Anthropology, University of California, Davis, Davis, CA 95616, USA; Department of Anthropology, Baylor University, Waco, TX 76798, USA
Barnabas K Bafens Diaspora and Protocol Affairs Office, Bench Sheko Zone Administration, Mizan, Ethiopia
Gillian Belbin Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
Mira Mastoras UC Davis Genome Center, University of California, Davis, Davis, CA 95616, USA
Cole Williams Department of Medicine, University of Colorado, Anschutz Medical Campus, Aurora, CO 80045, USA
Michelle Daya Department of Medicine, University of Colorado, Anschutz Medical Campus, Aurora, CO 80045, USA
Akmel N Negash Department of Anthropology, Hawassa University, Hawassa, SNNPR, Ethiopia
Marcus W Feldman Department of Biology, Stanford University, Stanford, CA 94305, USA
Barry S Hewlett Department of Anthropology, Washington State University, Vancouver, WA 98686, USA.
Brenna M Henn Department of Anthropology, University of California, Davis, Davis, CA 95616, USA; UC Davis Genome Center, University of California, Davis, Davis, CA 95616, USA.

Collapse

Charney E. The "Golden Age" of Behavior Genetics? PERSPECTIVES ON PSYCHOLOGICAL SCIENCE 2022;17:1188-1210. [PMID: 35180032 DOI: 10.1177/17456916211041602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Patel A, García-Closas M, Olshan AF, Perou CM, Troester MA, Love MI, Bhattacharya A. Gene-Level Germline Contributions to Clinical Risk of Recurrence Scores in Black and White Patients with Breast Cancer. Cancer Res 2022;82:25-35. [PMID: 34711612 PMCID: PMC8732329 DOI: 10.1158/0008-5472.can-21-1207] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 09/30/2021] [Accepted: 10/25/2021] [Indexed: 01/09/2023]

Abstract

Continuous risk of recurrence scores (CRS) based on tumor gene expression are vital prognostic tools for breast cancer. Studies have shown that Black women (BW) have higher CRS than White women (WW). Although systemic injustices contribute substantially to breast cancer disparities, evidence of biological and germline contributions is emerging. In this study, we investigated germline genetic associations with CRS and CRS disparity using approaches modeled after transcriptome-wide association studies (TWAS). In the Carolina Breast Cancer Study, using race-specific predictive models of tumor expression from germline genetics, we performed race-stratified (N = 1,043 WW, 1,083 BW) linear regressions of three CRS (ROR-S: PAM50 subtype score; proliferation score; ROR-P: ROR-S plus proliferation score) on imputed tumor genetically regulated tumor expression (GReX). Bayesian multivariate regression and adaptive shrinkage tested GReX-prioritized genes for associations with tumor PAM50 expression and subtype to elucidate patterns of germline regulation underlying GReX-CRS associations. At FDR-adjusted P < 0.10, 7 and 1 GReX prioritized genes among WW and BW, respectively. Among WW, CRS were positively associated with MCM10, FAM64A, CCNB2, and MMP1 GReX and negatively associated with VAV3, PCSK6, and GNG11 GReX. Among BW, higher MMP1 GReX predicted lower proliferation score and ROR-P. GReX-prioritized gene and PAM50 tumor expression associations highlighted potential mechanisms for GReX-prioritized gene to CRS associations. Among patients with breast cancer, differential germline associations with CRS were found by race, underscoring the need for larger, diverse datasets in molecular studies of breast cancer. These findings also suggest possible germline trans-regulation of PAM50 tumor expression, with potential implications for CRS interpretation in clinical settings. SIGNIFICANCE: This study identifies race-specific genetic associations with breast cancer risk of recurrence scores and suggests mediation of these associations by PAM50 subtype and expression, with implications for clinical interpretation of these scores.

Collapse

Baumdicker F, Bisschop G, Goldstein D, Gower G, Ragsdale AP, Tsambos G, Zhu S, Eldon B, Ellerman EC, Galloway JG, Gladstein AL, Gorjanc G, Guo B, Jeffery B, Kretzschmar WW, Lohse K, Matschiner M, Nelson D, Pope NS, Quinto-Cortés CD, Rodrigues MF, Saunack K, Sellinger T, Thornton K, van Kemenade H, Wohns AW, Wong Y, Gravel S, Kern AD, Koskela J, Ralph PL, Kelleher J. Efficient ancestry and mutation simulation with msprime 1.0. Genetics 2021;220:6460344. [PMID: 34897427 PMCID: PMC9176297 DOI: 10.1093/genetics/iyab229] [Citation(s) in RCA: 91] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 12/03/2021] [Indexed: 11/13/2022] Open

Affiliation(s)

Franz Baumdicker Cluster of Excellence "Controlling Microbes to Fight Infections", Mathematical and Computational Population Genetics, University of Tübingen, 72076 Tübingen, Germany
Gertjan Bisschop Institute of Evolutionary Biology,The University of Edinburgh, EH9 3FL, UK
Daniel Goldstein Khoury College of Computer Sciences, Northeastern University, MA 02115, USA.,No affiliation
Graham Gower Lundbeck GeoGenetics Centre, Globe Institute, University of Copenhagen, 1350 Copenhagen K, Denmark
Aaron P Ragsdale Department of Integrative Biology, University of Wisconsin-Madison, WI 53706, USA
Georgia Tsambos Melbourne Integrative Genomics, School of Mathematics and Statistics, University of Melbourne, Victoria, 3010, Australia
Sha Zhu Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, OX3 7LF, UK
Bjarki Eldon Leibniz Institute for Evolution and Biodiversity Science,Museum für Naturkunde Berlin, 10115, Germany
E Castedo Ellerman Fresh Pond Research Institute, Cambridge, MA 02140, USA
Jared G Galloway Institute of Ecology and Evolution, Department of Biology, University of Oregon, OR 97403-5289, USA.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98102, USA
Ariella L Gladstein Department of Genetics, University of North Carolina at Chapel Hill, NC 27599-7264, USA.,Embark Veterinary, Inc., Boston, MA 02111, USA
Gregor Gorjanc The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, EH25 9RG, UK
Bing Guo Institute for Genome Sciences,University of Maryland School of Medicine, Baltimore, MD, 21201, USA
Ben Jeffery Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, OX3 7LF, UK
Warren W Kretzschmar Center for Hematology and Regenerative Medicine, Karolinska Institute, 141 83 Huddinge, Sweden
Konrad Lohse Institute of Evolutionary Biology,The University of Edinburgh, EH9 3FL, UK
Michael Matschiner Natural History Museum, University of Oslo, Blindern 0318 Oslo, Norway
Dominic Nelson Department of Human Genetics, McGill University, Montréal, QC H3A 0C7, Canada
Nathaniel S Pope Department of Entomology, Pennsylvania State University, PA 16802, USA
Consuelo D Quinto-Cortés National Laboratory of Genomics for Biodiversity (LANGEBIO), Unit of Advanced Genomics, CINVESTAV, Irapuato, Mexico
Murillo F Rodrigues Institute of Ecology and Evolution, Department of Biology, University of Oregon, OR 97403-5289, USA
Kumar Saunack IIT Bombay, Powai, Mumbai 400 076, Maharashtra, India
Thibaut Sellinger Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, 85354 Freising, Germany
Kevin Thornton Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA
Hugo van Kemenade No affiliation
Anthony W Wohns Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, OX3 7LF, UK.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Yan Wong Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, OX3 7LF, UK
Simon Gravel Department of Human Genetics, McGill University, Montréal, QC H3A 0C7, Canada
Andrew D Kern Institute of Ecology and Evolution, Department of Biology, University of Oregon, OR 97403-5289, USA
Jere Koskela Department of Statistics, University of Warwick, CV4 7AL, UK
Peter L Ralph Institute of Ecology and Evolution, Department of Biology, University of Oregon, OR 97403-5289, USA.,Department of Mathematics, University of Oregon, OR 97403-5289 USA
Jerome Kelleher Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, OX3 7LF, UK

Collapse

Virgoulay T, Rousset F, Leblois R. GSpace: an exact coalescence simulator of recombining genomes under isolation by distance. Bioinformatics 2021;37:3673-3675. [PMID: 33964130 DOI: 10.1093/bioinformatics/btab261] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 04/16/2021] [Accepted: 04/27/2021] [Indexed: 11/12/2022] Open

Waples RS, Waples RK, Ward EJ. Pseudoreplication in genomics-scale datasets. Mol Ecol Resour 2021;22:503-518. [PMID: 34351073 DOI: 10.1111/1755-0998.13482] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 06/14/2021] [Accepted: 07/23/2021] [Indexed: 11/30/2022]

Mukhopadhyay A, Chakraborty S. Replicator equations induced by microscopic processes in nonoverlapping population playing bimatrix games. CHAOS (WOODBURY, N.Y.) 2021;31:023123. [PMID: 33653037 DOI: 10.1063/5.0032311] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 01/27/2021] [Indexed: 06/12/2023]

Cavazos TB, Witte JS. Inclusion of variants discovered from diverse populations improves polygenic risk score transferability. HGG ADVANCES 2020;2. [PMID: 33564748 PMCID: PMC7869832 DOI: 10.1016/j.xhgg.2020.100017] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open

Lessons Learned from Bugs in Models of Human History. Am J Hum Genet 2020;107:583-588. [PMID: 33007197 DOI: 10.1016/j.ajhg.2020.08.017] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open