Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Chen GK, Marjoram P, Wall JD. Fast and flexible simulation of DNA sequence data. Genome Res 2009;19:136-42. [PMID: 19029539 DOI: 10.1101/gr.083634.108] [Citation(s) in RCA: 254] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Number

Cited by Other Article(s)

151

Layer RM, Kindlon N, Karczewski KJ, Quinlan AR. Efficient genotype compression and analysis of large genetic-variation data sets. Nat Methods 2016;13:63-5. [PMID: 26550772 PMCID: PMC4697868 DOI: 10.1038/nmeth.3654] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 10/07/2015] [Indexed: 11/08/2022]

152

Jacquin L, Cao TV, Grenier C, Ahmadi N. DHOEM: a statistical simulation software for simulating new markers in real SNP marker data. BMC Bioinformatics 2015;16:404. [PMID: 26634451 PMCID: PMC4669601 DOI: 10.1186/s12859-015-0830-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2015] [Accepted: 11/16/2015] [Indexed: 11/10/2022] Open

153

Cui R, Schumer M, Rosenthal GG. Admix’em: a flexible framework for forward-time simulations of hybrid populations with selection and mate choice. Bioinformatics 2015;32:1103-5. [DOI: 10.1093/bioinformatics/btv700] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Accepted: 11/25/2015] [Indexed: 11/13/2022] Open

154

Singhal S, Leffler EM, Sannareddy K, Turner I, Venn O, Hooper DM, Strand AI, Li Q, Raney B, Balakrishnan CN, Griffith SC, McVean G, Przeworski M. Stable recombination hotspots in birds. Science 2015;350:928-32. [PMID: 26586757 PMCID: PMC4864528 DOI: 10.1126/science.aad0843] [Citation(s) in RCA: 198] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

155

Stram AH, Marjoram P, Chen GK. al3c: high-performance software for parameter inference using Approximate Bayesian Computation. Bioinformatics 2015;31:3549-51. [PMID: 26142186 PMCID: PMC4626746 DOI: 10.1093/bioinformatics/btv393] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2015] [Revised: 05/21/2015] [Accepted: 06/24/2015] [Indexed: 11/14/2022] Open

156

Browning SR, Browning BL. Accurate Non-parametric Estimation of Recent Effective Population Size from Segments of Identity by Descent. Am J Hum Genet 2015;97:404-18. [PMID: 26299365 PMCID: PMC4564943 DOI: 10.1016/j.ajhg.2015.07.012] [Citation(s) in RCA: 182] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2015] [Accepted: 07/28/2015] [Indexed: 10/23/2022] Open

157

Frantz LAF, Schraiber JG, Madsen O, Megens HJ, Cagan A, Bosse M, Paudel Y, Crooijmans RPMA, Larson G, Groenen MAM. Evidence of long-term gene flow and selection during domestication from analyses of Eurasian wild and domestic pig genomes. Nat Genet 2015;47:1141-8. [PMID: 26323058 DOI: 10.1038/ng.3394] [Citation(s) in RCA: 173] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 08/10/2015] [Indexed: 12/18/2022]

158

van Dorp L, Balding D, Myers S, Pagani L, Tyler-Smith C, Bekele E, Tarekegn A, Thomas MG, Bradman N, Hellenthal G. Evidence for a Common Origin of Blacksmiths and Cultivators in the Ethiopian Ari within the Last 4500 Years: Lessons for Clustering-Based Inference. PLoS Genet 2015;11:e1005397. [PMID: 26291793 PMCID: PMC4546361 DOI: 10.1371/journal.pgen.1005397] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Accepted: 06/26/2015] [Indexed: 01/02/2023] Open

Abstract

The Ari peoples of Ethiopia are comprised of different occupational groups that can be distinguished genetically, with Ari Cultivators and the socially marginalised Ari Blacksmiths recently shown to have a similar level of genetic differentiation between them (F_ST ≈ 0.023 − 0.04) as that observed among multiple ethnic groups sampled throughout Ethiopia. Anthropologists have proposed two competing theories to explain the origins of the Ari Blacksmiths as (i) remnants of a population that inhabited Ethiopia prior to the arrival of agriculturists (e.g. Cultivators), or (ii) relatively recently related to the Cultivators but presently marginalized in the community due to their trade. Two recent studies by different groups analysed genome-wide DNA from samples of Ari Blacksmiths and Cultivators and suggested that genetic patterns between the two groups were more consistent with model (i) and subsequent assimilation of the indigenous peoples into the expanding agriculturalist community. We analysed the same samples using approaches designed to attenuate signals of genetic differentiation that are attributable to allelic drift within a population. By doing so, we provide evidence that the genetic differences between Ari Blacksmiths and Cultivators can be entirely explained by bottleneck effects consistent with hypothesis (ii). This finding serves as both a cautionary tale about interpreting results from unsupervised clustering algorithms, and suggests that social constructions are contributing directly to genetic differentiation over a relatively short time period among previously genetically similar groups.

While it is widely recognized that DNA patterns vary across world-wide human populations, the primary features that drive these differences are less well understood. As an example, the Ari peoples of Ethiopia are presently socially divided according to occupation, with Ari Blacksmiths marginalised relative to Ari Cultivators. Two competing theories proposed by anthropologists to explain the existence of these occupational groupings suggest very different histories: (i) the Cultivators reflect migrants who moved into the region occupied by ancestors of the Blacksmiths perhaps many thousands of years ago, versus (ii) the Blacksmiths and Cultivators comprised the same ancestral group before the former was marginalised due solely to their trade. Recent genetic studies showed that Blacksmiths and Cultivators are distinguishable by their DNA, and suggested that overall DNA patterns among the two groups were consistent with (i). However, we demonstrate here that interpreting the results of currently popular algorithms that compare DNA is not always straight-forward. Instead we use a variety of analyses to show that (ii) seems a more likely explanation, perhaps illustrating how social marginalisation can lead to groups becoming genetically distinguishable over a relatively short time period.

Collapse

159

Gorjanc G, Bijma P, Hickey JM. Reliability of pedigree-based and genomic evaluations in selected populations. Genet Sel Evol 2015;47:65. [PMID: 26271246 PMCID: PMC4536753 DOI: 10.1186/s12711-015-0145-1] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2014] [Accepted: 07/29/2015] [Indexed: 11/14/2022] Open

Abstract

Background

Reliability is an important parameter in breeding. It measures the precision of estimated breeding values (EBV) and, thus, potential response to selection on those EBV. The precision of EBV is commonly measured by relating the prediction error variance (PEV) of EBV to the base population additive genetic variance (base PEV reliability), while the potential for response to selection is commonly measured by the squared correlation between the EBV and breeding values (BV) on selection candidates (reliability of selection). While these two measures are equivalent for unselected populations, they are not equivalent for selected populations. The aim of this study was to quantify the effect of selection on these two measures of reliability and to show how this affects comparison of breeding programs using pedigree-based or genomic evaluations.

Methods

Two scenarios with random and best linear unbiased prediction (BLUP) selection were simulated, where the EBV of selection candidates were estimated using only pedigree, pedigree and phenotype, genome-wide marker genotypes and phenotype, or only genome-wide marker genotypes. The base PEV reliabilities of these EBV were compared to the corresponding reliabilities of selection. Realized genetic selection intensity was evaluated to quantify the potential of selection on the different types of EBV and, thus, to validate differences in reliabilities. Finally, the contribution of different underlying processes to changes in additive genetic variance and reliabilities was quantified.

Results

The simulations showed that, for selected populations, the base PEV reliability substantially overestimates the reliability of selection of EBV that are mainly based on old information from the parental generation, as is the case with pedigree-based prediction. Selection on such EBV gave very low realized genetic selection intensities, confirming the overestimation and importance of genotyping both male and female selection candidates. The two measures of reliability matched when the reductions in additive genetic variance due to the Bulmer effect, selection, and inbreeding were taken into account.

Conclusions

For populations under selection, EBV based on genome-wide information are more valuable than suggested by the comparison of the base PEV reliabilities between the different types of EBV. This implies that genome-wide marker information is undervalued for selected populations and that genotyping un-phenotyped female selection candidates should be reconsidered.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-015-0145-1) contains supplementary material, which is available to authorized users.

Collapse

160

Bayesian Nonparametric Inference of Population Size Changes from Sequential Genealogies. Genetics 2015. [PMID: 26224734 PMCID: PMC4566269 DOI: 10.1534/genetics.115.177980] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

161

Schumer M, Cui R, Rosenthal GG, Andolfatto P. simMSG: an experimental design tool for high-throughput genotyping of hybrids. Mol Ecol Resour 2015;16:183-92. [PMID: 26032857 DOI: 10.1111/1755-0998.12434] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2015] [Revised: 05/19/2015] [Accepted: 05/22/2015] [Indexed: 11/30/2022]

162

Jenko J, Gorjanc G, Cleveland MA, Varshney RK, Whitelaw CBA, Woolliams JA, Hickey JM. Potential of promotion of alleles by genome editing to improve quantitative traits in livestock breeding programs. Genet Sel Evol 2015;47:55. [PMID: 26133579 PMCID: PMC4487592 DOI: 10.1186/s12711-015-0135-3] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2014] [Accepted: 06/15/2015] [Indexed: 12/29/2022] Open

Abstract

Background

Genome editing (GE) is a method that enables specific nucleotides in the genome of an individual to be changed. To date, use of GE in livestock has focussed on simple traits that are controlled by a few quantitative trait nucleotides (QTN) with large effects. The aim of this study was to evaluate the potential of GE to improve quantitative traits that are controlled by many QTN, referred to here as promotion of alleles by genome editing (PAGE).

Methods

Multiple scenarios were simulated to test alternative PAGE strategies for a quantitative trait. They differed in (i) the number of edits per sire (0 to 100), (ii) the number of edits per generation (0 to 500), and (iii) the extent of use of PAGE (i.e. editing all sires or only a proportion of them). The base line scenario involved selecting individuals on true breeding values (i.e., genomic selection only (GS only)-genomic selection with perfect accuracy) for several generations. Alternative scenarios complemented this base line scenario with PAGE (GS + PAGE). The effect of different PAGE strategies was quantified by comparing response to selection, changes in allele frequencies, the number of distinct QTN edited, the sum of absolute effects of the edited QTN per generation, and inbreeding.

Results

Response to selection after 20 generations was between 1.08 and 4.12 times higher with GS + PAGE than with GS only. Increases in response to selection were larger with more edits per sire and more sires edited. When the total resources for PAGE were limited, editing a few sires for many QTN resulted in greater response to selection and inbreeding compared to editing many sires for a few QTN. Between the scenarios GS only and GS + PAGE, there was little difference in the average change in QTN allele frequencies, but there was a major difference for the QTN with the largest effects. The sum of the effects of the edited QTN decreased across generations.

Conclusions

This study showed that PAGE has great potential for application in livestock breeding programs, but inbreeding needs to be managed.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-015-0135-3) contains supplementary material, which is available to authorized users.

Collapse

163

Pérez-Enciso M, Rincón JC, Legarra A. Sequence- vs. chip-assisted genomic selection: accurate biological information is advised. Genet Sel Evol 2015;47:43. [PMID: 25956961 PMCID: PMC4424891 DOI: 10.1186/s12711-015-0117-5] [Citation(s) in RCA: 88] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 03/31/2015] [Indexed: 12/29/2022] Open

Abstract

Background

The development of next-generation sequencing technologies (NGS) has made the use of whole-genome sequence data for routine genetic evaluations possible, which has triggered a considerable interest in animal and plant breeding fields. Here, we investigated whether complete or partial sequence data can improve upon existing SNP (single nucleotide polymorphism) array-based selection strategies by simulation using a mixed coalescence - gene-dropping approach.

Results

We simulated 20 or 100 causal mutations (quantitative trait nucleotides, QTN) within 65 predefined ‘gene’ regions, each 10 kb long, within a genome composed of ten 3-Mb chromosomes. We compared prediction accuracy by cross-validation using a medium-density chip (7.5 k SNPs), a high-density (HD, 17 k) and sequence data (335 k). Genetic evaluation was based on a GBLUP method. The simulations showed: (1) a law of diminishing returns with increasing number of SNPs; (2) a modest effect of SNP ascertainment bias in arrays; (3) a small advantage of using whole-genome sequence data vs. HD arrays i.e. ~4%; (4) a minor effect of NGS errors except when imputation error rates are high (≥20%); and (5) if QTN were known, prediction accuracy approached 1. Since this is obviously unrealistic, we explored milder assumptions. We showed that, if all SNPs within causal genes were included in the prediction model, accuracy could also dramatically increase by ~40%. However, this criterion was highly sensitive to either misspecification (including wrong genes) or to the use of an incomplete gene list; in these cases, accuracy fell rapidly towards that reached when all SNPs from sequence data were blindly included in the model.

Conclusions

Our study shows that, unless an accurate prior estimate on the functionality of SNPs can be included in the predictor, there is a law of diminishing returns with increasing SNP density. As a result, use of whole-genome sequence data may not result in a highly increased selection response over high-density genotyping.

Collapse

164

Bianco E, Soto HW, Vargas L, Pérez-Enciso M. The chimerical genome of Isla del Coco feral pigs (Costa Rica), an isolated population since 1793 but with remarkable levels of diversity. Mol Ecol 2015;24:2364-78. [DOI: 10.1111/mec.13182] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2015] [Revised: 03/18/2015] [Accepted: 03/24/2015] [Indexed: 01/27/2023]

165

Yunusbayev B, Metspalu M, Metspalu E, Valeev A, Litvinov S, Valiev R, Akhmetova V, Balanovska E, Balanovsky O, Turdikulova S, Dalimova D, Nymadawa P, Bahmanimehr A, Sahakyan H, Tambets K, Fedorova S, Barashkov N, Khidiyatova I, Mihailov E, Khusainova R, Damba L, Derenko M, Malyarchuk B, Osipova L, Voevoda M, Yepiskoposyan L, Kivisild T, Khusnutdinova E, Villems R. The genetic legacy of the expansion of Turkic-speaking nomads across Eurasia. PLoS Genet 2015;11:e1005068. [PMID: 25898006 PMCID: PMC4405460 DOI: 10.1371/journal.pgen.1005068] [Citation(s) in RCA: 104] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2013] [Accepted: 02/11/2015] [Indexed: 12/28/2022] Open

Abstract

The Turkic peoples represent a diverse collection of ethnic groups defined by the Turkic languages. These groups have dispersed across a vast area, including Siberia, Northwest China, Central Asia, East Europe, the Caucasus, Anatolia, the Middle East, and Afghanistan. The origin and early dispersal history of the Turkic peoples is disputed, with candidates for their ancient homeland ranging from the Transcaspian steppe to Manchuria in Northeast Asia. Previous genetic studies have not identified a clear-cut unifying genetic signal for the Turkic peoples, which lends support for language replacement rather than demic diffusion as the model for the Turkic language’s expansion. We addressed the genetic origin of 373 individuals from 22 Turkic-speaking populations, representing their current geographic range, by analyzing genome-wide high-density genotype data. In agreement with the elite dominance model of language expansion most of the Turkic peoples studied genetically resemble their geographic neighbors. However, western Turkic peoples sampled across West Eurasia shared an excess of long chromosomal tracts that are identical by descent (IBD) with populations from present-day South Siberia and Mongolia (SSM), an area where historians center a series of early Turkic and non-Turkic steppe polities. While SSM matching IBD tracts (> 1cM) are also observed in non-Turkic populations, Turkic peoples demonstrate a higher percentage of such tracts (p-values ≤ 0.01) compared to their non-Turkic neighbors. Finally, we used the ALDER method and inferred admixture dates (~9th–17th centuries) that overlap with the Turkic migrations of the 5th–16th centuries. Thus, our results indicate historical admixture among Turkic peoples, and the recent shared ancestry with modern populations in SSM supports one of the hypothesized homelands for their nomadic Turkic and related Mongolic ancestors.

Centuries of nomadic migrations have ultimately resulted in the distribution of Turkic languages over a large area ranging from Siberia, across Central Asia to Eastern Europe and the Middle East. Despite the profound cultural impact left by these nomadic peoples, little is known about their prehistoric origins. Moreover, because contemporary Turkic speakers tend to genetically resemble their geographic neighbors, it is not clear whether their nomadic ancestors left an identifiable genetic trace. In this study, we show that Turkic-speaking peoples sampled across the Middle East, Caucasus, East Europe, and Central Asia share varying proportions of Asian ancestry that originate in a single area, southern Siberia and Mongolia. Mongolic- and Turkic-speaking populations from this area bear an unusually high number of long chromosomal tracts that are identical by descent with Turkic peoples from across west Eurasia. Admixture induced linkage disequilibrium decay across chromosomes in these populations indicates that admixture occurred during the 9th–17th centuries, in agreement with the historically recorded Turkic nomadic migrations and later Mongol expansion. Thus, our findings reveal genetic traces of recent large-scale nomadic migrations and map their source to a previously hypothesized area of Mongolia and southern Siberia.

Collapse

Affiliation(s)

Bayazit Yunusbayev Evolutionary Biology group, Estonian Biocentre, Tartu, Estonia Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia * E-mail: ,
Mait Metspalu Evolutionary Biology group, Estonian Biocentre, Tartu, Estonia Department of Evolutionary Biology, University of Tartu, Tartu, Estonia Department of Integrative Biology, University of California Berkeley, Berkeley, California, United States of America
Ene Metspalu Department of Evolutionary Biology, University of Tartu, Tartu, Estonia
Albert Valeev Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia
Sergei Litvinov Evolutionary Biology group, Estonian Biocentre, Tartu, Estonia Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia
Ruslan Valiev Department of Genetics and Fundamental Medicine, Bashkir State University, Ufa, Bashkortostan, Russia
Vita Akhmetova Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia
Elena Balanovska Research Centre for Medical Genetics, RAMS, Moscow, Russia
Oleg Balanovsky Research Centre for Medical Genetics, RAMS, Moscow, Russia Vavilov Institute for General Genetics, RAS, Moscow, Russia
Shahlo Turdikulova Laboratory of Genomics, Institute of Bioorganic Chemistry, Academy of Sciences Republic of Uzbekistan, Tashkent, Uzbekistan
Dilbar Dalimova Laboratory of Genomics, Institute of Bioorganic Chemistry, Academy of Sciences Republic of Uzbekistan, Tashkent, Uzbekistan
Pagbajabyn Nymadawa Mongolian Academy of Medical Sciences, Ulaanbaatar, Mongolia
Ardeshir Bahmanimehr Department of Medical Genetics, Shiraz University of Medical Sciences, Shiraz, Iran
Hovhannes Sahakyan Evolutionary Biology group, Estonian Biocentre, Tartu, Estonia Laboratory of Ethnogenomics, Institute of Molecular Biology, Academy of Sciences of Armenia, Yerevan, Armenia
Kristiina Tambets Evolutionary Biology group, Estonian Biocentre, Tartu, Estonia
Sardana Fedorova Laboratory of Molecular Genetics, Yakut Research Center of Complex Medical Problems, Yakutsk, Sakha Republic, Russia Laboratory of Molecular Biology, North-Eastern Federal University, Yakutsk, Sakha Republic, Russia
Nikolay Barashkov Laboratory of Molecular Genetics, Yakut Research Center of Complex Medical Problems, Yakutsk, Sakha Republic, Russia Laboratory of Molecular Biology, North-Eastern Federal University, Yakutsk, Sakha Republic, Russia
Irina Khidiyatova Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia Department of Genetics and Fundamental Medicine, Bashkir State University, Ufa, Bashkortostan, Russia
Evelin Mihailov Estonian Genome Center, University of Tartu, Tartu, Estonia Gene Technology Workgroup, Estonian Biocentre, Tartu, Estonia
Rita Khusainova Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia Department of Genetics and Fundamental Medicine, Bashkir State University, Ufa, Bashkortostan, Russia
Larisa Damba Institute of Internal Medicine, SB RAMS, Novosibirsk, Russia
Miroslava Derenko Institute of Biological Problems of the North, Magadan, Russia
Boris Malyarchuk Institute of Biological Problems of the North, Magadan, Russia
Ludmila Osipova Institute of Cytology and Genetics, SB RAS, Novosibirsk, Russia
Mikhail Voevoda Institute of Internal Medicine, SB RAMS, Novosibirsk, Russia Institute of Cytology and Genetics, SB RAS, Novosibirsk, Russia
Levon Yepiskoposyan Laboratory of Ethnogenomics, Institute of Molecular Biology, Academy of Sciences of Armenia, Yerevan, Armenia
Toomas Kivisild Division of Biological Anthropology, University of Cambridge, Cambridge, United Kingdom
Elza Khusnutdinova Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia Department of Genetics and Fundamental Medicine, Bashkir State University, Ufa, Bashkortostan, Russia
Richard Villems Evolutionary Biology group, Estonian Biocentre, Tartu, Estonia Department of Evolutionary Biology, University of Tartu, Tartu, Estonia Estonian Academy of Sciences, Tallinn, Estonia

Collapse

166

Reconstructing Past Admixture Processes from Local Genomic Ancestry Using Wavelet Transformation. Genetics 2015;200:469-81. [PMID: 25852078 PMCID: PMC4492373 DOI: 10.1534/genetics.115.176842] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2014] [Accepted: 04/03/2015] [Indexed: 11/18/2022] Open

167

Exploring population size changes using SNP frequency spectra. Nat Genet 2015;47:555-9. [PMID: 25848749 PMCID: PMC4414822 DOI: 10.1038/ng.3254] [Citation(s) in RCA: 246] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2014] [Accepted: 02/26/2015] [Indexed: 02/05/2023]

168

Cheng JY, Mailund T. Ancestral population genomics using coalescence hidden Markov models and heuristic optimisation algorithms. Comput Biol Chem 2015;57:80-92. [PMID: 25819138 DOI: 10.1016/j.compbiolchem.2015.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Accepted: 02/02/2015] [Indexed: 10/23/2022]

169

Bianco E, Nevado B, Ramos-Onsins SE, Pérez-Enciso M. A deep catalog of autosomal single nucleotide variation in the pig. PLoS One 2015;10:e0118867. [PMID: 25789620 PMCID: PMC4366260 DOI: 10.1371/journal.pone.0118867] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Accepted: 12/27/2014] [Indexed: 12/31/2022] Open

Abstract

A comprehensive catalog of variability in a given species is useful for many important purposes, e.g., designing high density arrays or pinpointing potential mutations of economic or physiological interest. Here we provide a genomewide, worldwide catalog of single nucleotide variants by simultaneously analyzing the shotgun sequence of 128 pigs and five suid outgroups. Despite the high SNP missing rate of some individuals (up to 88%), we retrieved over 48 million high quality variants. Of them, we were able to assess the ancestral allele of more than 39M biallelic SNPs. We found SNPs in 21,455 out of the 25,322 annotated genes in pig assembly 10.2. The annotation showed that more than 40% of the variants were novel variants, not present in dbSNP. Surprisingly, we found a large variability in transition / transversion rate along the genome, which is very well explained (R²=0.79) primarily by genome differences in in CpG content and recombination rate. The number of SNPs per window also varied but was less dependent of known factors such as gene density, missing rate or recombination (R²=0.48). When we divided the samples in four groups, Asian wild boar (ASWB), Asian domestics (ASDM), European wild boar (EUWB) and European domestics (EUDM), we found a marked correlation in allele frequencies between domestics and wild boars within Asia and within Europe, but not across continents, due to the large evolutive distance between pigs of both continents (~1.2 MYA). In general, the porcine species showed a small percentage of SNPs exclusive of each population group. EUWB and EUDM were predicted to harbor a larger fraction of potentially deleterious mutations, according to the SIFT algorithm, than Asian samples, perhaps a result of background selection being less effective due to a lower effective population size in Europe.

Collapse

170

The SMC' is a highly accurate approximation to the ancestral recombination graph. Genetics 2015;200:343-55. [PMID: 25786855 DOI: 10.1534/genetics.114.173898] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 03/12/2015] [Indexed: 11/18/2022] Open

171

Gorjanc G, Cleveland MA, Houston RD, Hickey JM. Potential of genotyping-by-sequencing for genomic selection in livestock populations. Genet Sel Evol 2015;47:12. [PMID: 25887531 PMCID: PMC4344748 DOI: 10.1186/s12711-015-0102-z] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2014] [Accepted: 01/29/2015] [Indexed: 12/12/2022] Open

Abstract

Background

Next-generation sequencing techniques, such as genotyping-by-sequencing (GBS), provide alternatives to single nucleotide polymorphism (SNP) arrays. The aim of this work was to evaluate the potential of GBS compared to SNP array genotyping for genomic selection in livestock populations.

Methods

The value of GBS was quantified by simulation analyses in which three parameters were varied: (i) genome-wide sequence read depth (x) per individual from 0.01x to 20x or using SNP array genotyping; (ii) number of genotyped markers from 3000 to 300 000; and (iii) size of training and prediction sets from 500 to 50 000 individuals. The latter was achieved by distributing the total available x of 1000x, 5000x, or 10 000x per genotyped locus among the varying number of individuals. With SNP arrays, genotypes were called from sequence data directly. With GBS, genotypes were called from sequence reads that varied between loci and individuals according to a Poisson distribution with mean equal to x. Simulated data were analyzed with ridge regression and the accuracy and bias of genomic predictions and response to selection were quantified under the different scenarios.

Results

Accuracies of genomic predictions using GBS data or SNP array data were comparable when large numbers of markers were used and x per individual was ~1x or higher. The bias of genomic predictions was very high at a very low x. When the total available x was distributed among the training individuals, the accuracy of prediction was maximized when a large number of individuals was used that had GBS data with low x for a large number of markers. Similarly, response to selection was maximized under the same conditions due to increasing both accuracy and selection intensity.

Conclusions

GBS offers great potential for developing genomic selection in livestock populations because it makes it possible to cover large fractions of the genome and to vary the sequence read depth per individual. Thus, the accuracy of predictions is improved by increasing the size of training populations and the intensity of selection is increased by genotyping a larger number of selection candidates.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-015-0102-z) contains supplementary material, which is available to authorized users.

Collapse

172

Staab PR, Zhu S, Metzler D, Lunter G. scrm: efficiently simulating long sequences using the approximated coalescent with recombination. ACTA ACUST UNITED AC 2015;31:1680-2. [PMID: 25596205 PMCID: PMC4426833 DOI: 10.1093/bioinformatics/btu861] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2014] [Accepted: 12/23/2014] [Indexed: 11/13/2022]

173

Depperschmidt A, Pardoux É, Pfaffelhuber P. A mixing tree-valued process arising under neutral evolution with recombination. ELECTRON J PROBAB 2015. [DOI: 10.1214/ejp.v20-4286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

174

Peng B, Chen HS, Mechanic LE, Racine B, Clarke J, Gillanders E, Feuer EJ. Genetic data simulators and their applications: an overview. Genet Epidemiol 2014;39:2-10. [PMID: 25504286 DOI: 10.1002/gepi.21876] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2014] [Revised: 09/14/2014] [Accepted: 10/31/2014] [Indexed: 11/10/2022]

175

Hobolth A, Jensen JL. Markovian approximation to the finite loci coalescent with recombination along multiple sequences. Theor Popul Biol 2014;98:48-58. [DOI: 10.1016/j.tpb.2014.01.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Revised: 10/23/2013] [Accepted: 01/18/2014] [Indexed: 10/25/2022]

176

Li P, Guo M, Wang C, Liu X, Zou Q. An overview of SNP interactions in genome-wide association studies. Brief Funct Genomics 2014;14:143-55. [PMID: 25241224 DOI: 10.1093/bfgp/elu036] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open

177

Mailund T, Munch K, Schierup MH. Lineage sorting in apes. Annu Rev Genet 2014;48:519-35. [PMID: 25251849 DOI: 10.1146/annurev-genet-120213-092532] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

178

Sequencing an Ashkenazi reference panel supports population-targeted personal genomics and illuminates Jewish and European origins. Nat Commun 2014;5:4835. [PMID: 25203624 PMCID: PMC4164776 DOI: 10.1038/ncomms5835] [Citation(s) in RCA: 112] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Accepted: 07/28/2014] [Indexed: 12/17/2022] Open

179

Wang Y, Zhou Y, Li L, Chen X, Liu Y, Ma ZM, Xu S. A new method for modeling coalescent processes with recombination. BMC Bioinformatics 2014;15:273. [PMID: 25113665 PMCID: PMC4137079 DOI: 10.1186/1471-2105-15-273] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Accepted: 07/17/2014] [Indexed: 11/10/2022] Open

Abstract

Background

Recombination plays an important role in the maintenance of genetic diversity in many types of organisms, especially diploid eukaryotes. Recombination can be studied and used to map diseases. However, recombination adds a great deal of complexity to the genetic information. This renders estimation of evolutionary parameters more difficult. After the coalescent process was formulated, models capable of describing recombination using graphs, such as ancestral recombination graphs (ARG) were also developed. There are two typical models based on which to simulate ARG: back-in-time model such as ms and spatial model including Wiuf&Hein’s, SMC, SMC’, and MaCS.

Results

In this study, a new method of modeling coalescence with recombination, Spatial Coalescent simulator (SC), was developed, which considerably improved the algorithm described by Wiuf and Hein. The present algorithm constructs ARG spatially along the sequence, but it does not produce any redundant branches which are inevitable in Wiuf and Hein’s algorithm. Interestingly, the distribution of ARG generated by the present new algorithm is identical to that generated by a typical back-in-time model adopted by ms, an algorithm commonly used to model coalescence. It is here demonstrated that the existing approximate methods such as the sequentially Markov coalescent (SMC), a related method called SMC′, and Markovian coalescent simulator (MaCS) can be viewed as special cases of the present method. Using simulation analysis, the time to the most common ancestor (TMRCA) in the local trees of ARGs generated by the present algorithm was found to be closer to that produced by ms than time produced by MaCS. Sample-consistent ARGs can be generated using the present method. This may significantly reduce the computational burden.

Conclusion

In summary, the present method and algorithm may facilitate the estimation and description of recombination in population genomics and evolutionary biology.

Electronic supplementary material

The online version of this article (doi:10.1186/1471-2105-15-273) contains supplementary material, which is available to authorized users.

Collapse

180

Mathieson I, McVean G. Demography and the age of rare variants. PLoS Genet 2014;10:e1004528. [PMID: 25101869 PMCID: PMC4125085 DOI: 10.1371/journal.pgen.1004528] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Accepted: 06/06/2014] [Indexed: 12/17/2022] Open

181

Chen X, Ma ZM, Wang Y. Markov jump processes in modeling coalescent with recombination. Ann Stat 2014. [DOI: 10.1214/14-aos1227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

182

Colonna V, Ayub Q, Chen Y, Pagani L, Luisi P, Pybus M, Garrison E, Xue Y, Tyler-Smith C, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA. Human genomic regions with exceptionally high levels of population differentiation identified from 911 whole-genome sequences. Genome Biol 2014;15:R88. [PMID: 24980144 PMCID: PMC4197830 DOI: 10.1186/gb-2014-15-6-r88] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2014] [Accepted: 06/30/2014] [Indexed: 01/10/2023] Open

183

Nevado B, Perez-Enciso M. Pipeliner: software to evaluate the performance of bioinformatics pipelines for next-generation resequencing. Mol Ecol Resour 2014;15:99-106. [PMID: 24890372 DOI: 10.1111/1755-0998.12286] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Revised: 05/19/2014] [Accepted: 05/23/2014] [Indexed: 12/30/2022]

184

Schiffels S, Durbin R. Inferring human population size and separation history from multiple genome sequences. Nat Genet 2014;46:919-25. [PMID: 24952747 PMCID: PMC4116295 DOI: 10.1038/ng.3015] [Citation(s) in RCA: 591] [Impact Index Per Article: 59.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Accepted: 05/30/2014] [Indexed: 01/07/2023]

185

A C++ template library for efficient forward-time population genetic simulation of large populations. Genetics 2014;198:157-66. [PMID: 24950894 DOI: 10.1534/genetics.114.165019] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

186

Moreno-Estrada A, Gignoux CR, Fernández-López JC, Zakharia F, Sikora M, Contreras AV, Acuña-Alonzo V, Sandoval K, Eng C, Romero-Hidalgo S, Ortiz-Tello P, Robles V, Kenny EE, Nuño-Arana I, Barquera-Lozano R, Macín-Pérez G, Granados-Arriola J, Huntsman S, Galanter JM, Via M, Ford JG, Chapela R, Rodriguez-Cintron W, Rodríguez-Santana JR, Romieu I, Sienra-Monge JJ, del Rio Navarro B, London SJ, Ruiz-Linares A, Garcia-Herrera R, Estrada K, Hidalgo-Miranda A, Jimenez-Sanchez G, Carnevale A, Soberón X, Canizales-Quinteros S, Rangel-Villalobos H, Silva-Zolezzi I, Burchard EG, Bustamante CD. Human genetics. The genetics of Mexico recapitulates Native American substructure and affects biomedical traits. Science 2014;344:1280-5. [PMID: 24926019 PMCID: PMC4156478 DOI: 10.1126/science.1251688] [Citation(s) in RCA: 331] [Impact Index Per Article: 33.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Affiliation(s)

Andrés Moreno-Estrada Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA.
Christopher R Gignoux Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA.
Juan Carlos Fernández-López Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Fouad Zakharia Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
Martin Sikora Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
Alejandra V Contreras Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Victor Acuña-Alonzo Escuela Nacional de Antropología e Historia (ENAH), Mexico City, Mexico. Department of Genetics, Evolution and Environment, University College London, London, UK
Karla Sandoval Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
Celeste Eng Department of Medicine, University of California, San Francisco, CA, USA
Sandra Romero-Hidalgo Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Patricia Ortiz-Tello Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
Victoria Robles Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
Eimear E Kenny Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
Ismael Nuño-Arana Instituto de Investigación en Genética Molecular, Universidad de Guadalajara, Ocotlán, Mexico
Rodrigo Barquera-Lozano Escuela Nacional de Antropología e Historia (ENAH), Mexico City, Mexico
Gastón Macín-Pérez Escuela Nacional de Antropología e Historia (ENAH), Mexico City, Mexico
Julio Granados-Arriola Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán, Mexico City, Mexico
Scott Huntsman Department of Medicine, University of California, San Francisco, CA, USA
Joshua M Galanter Department of Medicine, University of California, San Francisco, CA, USA. Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
Marc Via Department of Medicine, University of California, San Francisco, CA, USA
Jean G Ford The Brooklyn Hospital Center, Brooklyn, NY, USA
Rocío Chapela Instituto Nacional de Enfermedades Respiratorias (INER), Mexico City, Mexico
William Rodriguez-Cintron Veterans Caribbean Health Care System, San Juan, Puerto Rico
Jose R Rodríguez-Santana Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA. Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Isabelle Romieu International Agency for Research on Cancer, Lyon, France
Juan José Sienra-Monge Hospital Infantil de México Federico Gomez, Mexico City, Mexico
Blanca del Rio Navarro Hospital Infantil de México Federico Gomez, Mexico City, Mexico
Stephanie J London National Institute of Environmental Health Sciences, National Institutes of Health, Department of Health and Human Services, Research Triangle Park, NC, USA
Andrés Ruiz-Linares Department of Genetics, Evolution and Environment, University College London, London, UK
Rodrigo Garcia-Herrera Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Karol Estrada Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Alfredo Hidalgo-Miranda Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Gerardo Jimenez-Sanchez Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Alessandra Carnevale Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Xavier Soberón Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Samuel Canizales-Quinteros Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico. Facultad de Química, Universidad Nacional Autónoma de México, Mexico City, Mexico
Héctor Rangel-Villalobos Instituto de Investigación en Genética Molecular, Universidad de Guadalajara, Ocotlán, Mexico
Irma Silva-Zolezzi Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
Esteban Gonzalez Burchard Department of Medicine, University of California, San Francisco, CA, USA. Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA.
Carlos D Bustamante Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA.

Collapse

187

How population growth affects linkage disequilibrium. Genetics 2014;197:1329-41. [PMID: 24907258 DOI: 10.1534/genetics.114.166454] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

188

Kelleher J, Etheridge AM, Barton NH. Coalescent simulation in continuous space: algorithms for large neighbourhood size. Theor Popul Biol 2014;95:13-23. [PMID: 24910324 DOI: 10.1016/j.tpb.2014.05.001] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2013] [Revised: 05/20/2014] [Accepted: 05/22/2014] [Indexed: 11/15/2022]

189

Hickey JM, Gorjanc G, Hearne S, Huang BE. AlphaMPSim: flexible simulation of multi-parent crosses. Bioinformatics 2014;30:2686-8. [DOI: 10.1093/bioinformatics/btu206] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

190

Nossa CW, Havlak P, Yue JX, Lv J, Vincent KY, Brockmann HJ, Putnam NH. Joint assembly and genetic mapping of the Atlantic horseshoe crab genome reveals ancient whole genome duplication. Gigascience 2014;3:9. [PMID: 24987520 PMCID: PMC4066314 DOI: 10.1186/2047-217x-3-9] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2013] [Accepted: 04/23/2014] [Indexed: 11/11/2022] Open

Abstract

Background

Horseshoe crabs are marine arthropods with a fossil record extending back approximately 450 million years. They exhibit remarkable morphological stability over their long evolutionary history, retaining a number of ancestral arthropod traits, and are often cited as examples of “living fossils.” As arthropods, they belong to the Ecdysozoa, an ancient super-phylum whose sequenced genomes (including insects and nematodes) have thus far shown more divergence from the ancestral pattern of eumetazoan genome organization than cnidarians, deuterostomes and lophotrochozoans. However, much of ecdysozoan diversity remains unrepresented in comparative genomic analyses.

Results

Here we apply a new strategy of combined de novo assembly and genetic mapping to examine the chromosome-scale genome organization of the Atlantic horseshoe crab, Limulus polyphemus. We constructed a genetic linkage map of this 2.7 Gbp genome by sequencing the nuclear DNA of 34 wild-collected, full-sibling embryos and their parents at a mean redundancy of 1.1x per sample. The map includes 84,307 sequence markers grouped into 1,876 distinct genetic intervals and 5,775 candidate conserved protein coding genes.

Conclusions

Comparison with other metazoan genomes shows that the L. polyphemus genome preserves ancestral bilaterian linkage groups, and that a common ancestor of modern horseshoe crabs underwent one or more ancient whole genome duplications 300 million years ago, followed by extensive chromosome fusion. These results provide a counter-example to the often noted correlation between whole genome duplication and evolutionary radiations. The new, low-cost genetic mapping method for obtaining a chromosome-scale view of non-model organism genomes that we demonstrate here does not require laboratory culture, and is potentially applicable to a broad range of other species.

Collapse

191

DAIRRy-BLUP: a high-performance computing approach to genomic prediction. Genetics 2014;197:813-22. [PMID: 24736932 DOI: 10.1534/genetics.114.163683] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

192

Hellenthal G, Busby GB, Band G, Wilson JF, Capelli C, Falush D, Myers S. A genetic atlas of human admixture history. Science 2014;343:747-751. [PMID: 24531965 PMCID: PMC4209567 DOI: 10.1126/science.1243518] [Citation(s) in RCA: 477] [Impact Index Per Article: 47.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

193

Bouwman AC, Hickey JM, Calus MPL, Veerkamp RF. Imputation of non-genotyped individuals based on genotyped relatives: assessing the imputation accuracy of a real case scenario in dairy cattle. Genet Sel Evol 2014;46:6. [PMID: 24490796 PMCID: PMC3929150 DOI: 10.1186/1297-9686-46-6] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2013] [Accepted: 01/07/2014] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Imputation of genotypes for ungenotyped individuals could enable the use of valuable phenotypes created before the genomic era in analyses that require genotypes. The objective of this study was to investigate the accuracy of imputation of non-genotyped individuals using genotype information from relatives.

METHODS

Genotypes were simulated for all individuals in the pedigree of a real (historical) dataset of phenotyped dairy cows and with part of the pedigree genotyped. The software AlphaImpute was used for imputation in its standard settings but also without phasing, i.e. using basic inheritance rules and segregation analysis only. Different scenarios were evaluated i.e.: (1) the real data scenario, (2) addition of genotypes of sires and maternal grandsires of the ungenotyped individuals, and (3) addition of one, two, or four genotyped offspring of the ungenotyped individuals to the reference population.

RESULTS

The imputation accuracy using AlphaImpute in its standard settings was lower than without phasing. Including genotypes of sires and maternal grandsires in the reference population improved imputation accuracy, i.e. the correlation of the true genotypes with the imputed genotype dosages, corrected for mean gene content, across all animals increased from 0.47 (real situation) to 0.60. Including one, two and four genotyped offspring increased the accuracy of imputation across all animals from 0.57 (no offspring) to 0.73, 0.82, and 0.92, respectively.

CONCLUSIONS

At present, the use of basic inheritance rules and segregation analysis appears to be the best imputation method for ungenotyped individuals. Comparison of our empirical animal-specific imputation accuracies to predictions based on selection index theory suggested that not correcting for mean gene content considerably overestimates the true accuracy. Imputation of ungenotyped individuals can help to include valuable phenotypes for genome-wide association studies or for genomic prediction, especially when the ungenotyped individuals have genotyped offspring.

Collapse

194

Baldwin-Brown JG, Long AD, Thornton KR. The power to detect quantitative trait loci using resequenced, experimentally evolved populations of diploid, sexual organisms. Mol Biol Evol 2014;31:1040-55. [PMID: 24441104 PMCID: PMC3969567 DOI: 10.1093/molbev/msu048] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open

195

Durbin R. Efficient haplotype matching and storage using the positional Burrows-Wheeler transform (PBWT). Bioinformatics 2014;30:1266-72. [PMID: 24413527 PMCID: PMC3998136 DOI: 10.1093/bioinformatics/btu014] [Citation(s) in RCA: 241] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

196

Pérez-Enciso M. Genomic relationships computed from either next-generation sequence or array SNP data. J Anim Breed Genet 2014;131:85-96. [PMID: 24397314 DOI: 10.1111/jbg.12074] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2013] [Accepted: 12/02/2013] [Indexed: 01/18/2023]

197

Yang T, Deng HW, Niu T. Critical assessment of coalescent simulators in modeling recombination hotspots in genomic sequences. BMC Bioinformatics 2014;15:3. [PMID: 24387001 PMCID: PMC3890628 DOI: 10.1186/1471-2105-15-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2013] [Accepted: 12/30/2013] [Indexed: 12/04/2022] Open

Abstract

Background

Coalescent simulation is pivotal for understanding population evolutionary models and demographic histories, as well as for developing novel analytical methods for genetic association studies for DNA sequence data. A plethora of coalescent simulators are developed, but selecting the most appropriate program remains challenging.

Results

We extensively compared performances of five widely used coalescent simulators – Hudson’s ms, msHOT, MaCS, Simcoal2, and fastsimcoal, to provide a practical guide considering three crucial factors, 1) speed, 2) scalability and 3) recombination hotspot position and intensity accuracy. Although ms represents a popular standard coalescent simulator, it lacks the ability to simulate sequences with recombination hotspots. An extended program msHOT has compensated for the deficiency of ms by incorporating recombination hotspots and gene conversion events at arbitrarily chosen locations and intensities, but remains limited in simulating long stretches of DNA sequences. Simcoal2, based on a discrete generation-by-generation approach, could simulate more complex demographic scenarios, but runs comparatively slow. MaCS and fastsimcoal, both built on fast, modified sequential Markov coalescent algorithms to approximate standard coalescent, are much more efficient whilst keeping salient features of msHOT and Simcoal2, respectively. Our simulations demonstrate that they are more advantageous over other programs for a spectrum of evolutionary models. To validate recombination hotspots, LDhat 2.2 rhomap package, sequenceLDhot and Haploview were compared for hotspot detection, and sequenceLDhot exhibited the best performance based on both real and simulated data.

Conclusions

While ms remains an excellent choice for general coalescent simulations of DNA sequences, MaCS and fastsimcoal are much more scalable and flexible in simulating a variety of demographic events under different recombination hotspot models. Furthermore, sequenceLDhot appears to give the most optimal performance in detecting and validating cross-over hotspots.

Collapse

198

Qian Y, Browning BL, Browning SR. Efficient clustering of identity-by-descent between multiple individuals. ACTA ACUST UNITED AC 2013;30:915-22. [PMID: 24363374 DOI: 10.1093/bioinformatics/btt734] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

199

Koch E, Ristroph M, Kirkpatrick M. Long range linkage disequilibrium across the human genome. PLoS One 2013;8:e80754. [PMID: 24349013 PMCID: PMC3861250 DOI: 10.1371/journal.pone.0080754] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Accepted: 10/17/2013] [Indexed: 11/19/2022] Open

200

Kessner D, Novembre J. forqs: forward-in-time simulation of recombination, quantitative traits and selection. ACTA ACUST UNITED AC 2013;30:576-7. [PMID: 24336146 PMCID: PMC3928523 DOI: 10.1093/bioinformatics/btt712] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]