Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Browning SR, Weir BS. Population structure with localized haplotype clusters. Genetics 2010;185:1337-44. [PMID: 20457877 DOI: 10.1534/genetics.110.116681] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

For:	Browning SR, Weir BS. Population structure with localized haplotype clusters. Genetics 2010;185:1337-44. [PMID: 20457877 DOI: 10.1534/genetics.110.116681] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Number

Cited by Other Article(s)

Lee D, Kim Y, Chung Y, Lee D, Seo D, Choi TJ, Lim D, Yoon D, Lee SH. Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle. JOURNAL OF ANIMAL SCIENCE AND TECHNOLOGY 2021;63:1232-1246. [PMID: 34957440 PMCID: PMC8672260 DOI: 10.5187/jast.2021.e117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 11/20/2022]

Comparison of Selection Signatures between Korean Native and Commercial Chickens Using 600K SNP Array Data. Genes (Basel) 2021;12:genes12060824. [PMID: 34072132 PMCID: PMC8230197 DOI: 10.3390/genes12060824] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 05/18/2021] [Accepted: 05/24/2021] [Indexed: 12/14/2022] Open

Eydivandi S, Roudbar MA, Karimi MO, Sahana G. Genomic scans for selective sweeps through haplotype homozygosity and allelic fixation in 14 indigenous sheep breeds from Middle East and South Asia. Sci Rep 2021;11:2834. [PMID: 33531649 PMCID: PMC7854752 DOI: 10.1038/s41598-021-82625-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 01/22/2021] [Indexed: 01/30/2023] Open

Abstract

The performance and productivity of livestock have consistently improved by natural and artificial selection over the centuries. Both these selections are expected to leave patterns on the genome and lead to changes in allele frequencies, but natural selection has played the major role among indigenous populations. Detecting selective sweeps in livestock may assist in understanding the processes involved in domestication, genome evolution and discovery of genomic regions associated with economically important traits. We investigated population genetic diversity and selection signals in this study using SNP genotype data of 14 indigenous sheep breeds from Middle East and South Asia, including six breeds from Iran, namely Iranian Balochi, Afshari, Moghani, Qezel, Zel, and Lori-Bakhtiari, three breeds from Afghanistan, namely Afghan Balochi, Arabi, and Gadik, three breeds from India, namely Indian Garole, Changthangi, and Deccani, and two breeds from Bangladesh, namely Bangladeshi Garole and Bangladesh East. The SNP genotype data were generated by the Illumina OvineSNP50 Genotyping BeadChip array. To detect genetic diversity and population structure, we used principal component analysis (PCA), admixture, phylogenetic analyses, and Runs of homozygosity. We applied four complementary statistical tests, FST (fixation index), xp-EHH (cross-population extended haplotype homozygosity), Rsb (extended haplotype homozygosity between-populations), and FLK (the extension of the Lewontin and Krakauer) to detect selective sweeps. Our results not only confirm the previous studies but also provide a suite of novel candidate genes involved in different traits in sheep. On average, FST, xp-EHH, Rsb, and FLK detected 128, 207, 222, and 252 genomic regions as candidates for selective sweeps, respectively. Furthermore, nine overlapping candidate genes were detected by these four tests, especially TNIK, DOCK1, USH2A, and TYW1B which associate with resistance to diseases and climate adaptation. Knowledge of candidate genomic regions in sheep populations may facilitate the identification and potential exploitation of the underlying genes in sheep breeding.

Collapse

Eydivandi S, Roudbar MA, Ardestani SS, Momen M, Sahana G. A selection signatures study among Middle Eastern and European sheep breeds. J Anim Breed Genet 2021;138:574-588. [PMID: 33453096 DOI: 10.1111/jbg.12536] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 11/25/2020] [Accepted: 12/26/2020] [Indexed: 01/26/2023]

Abstract

Selection, both natural and artificial, leaves patterns on the genome during domestication of animals and leads to changes in allele frequencies among populations. Detecting genomic regions influenced by selection in livestock may assist in understanding the processes involved in genome evolution and discovering genomic regions related to traits of economic and ecological interests. In the current study, genetic diversity analyses were conducted on 34,206 quality-filtered SNP positions from 450 individuals in 15 sheep breeds, including six indigenous breeds from the Middle East, namely Iranian Balouchi, Afshari, Moghani, Qezel, Karakas and Norduz, and nine breeds from Europe, namely East Friesian Sheep, Ile de France, Mourerous, Romane, Swiss Mirror, Spaelsau, Suffolk, Comisana and Engadine Red Sheep. The SNP genotype data generated by the Illumina OvineSNP50 Genotyping BeadChip array were used in this analysis. We applied two complementary statistical analyses, F_ST (fixation index) and xp-EHH (cross-population extended haplotype homozygosity), to detect selection signatures in Middle Eastern and European sheep populations. F_ST and xp-EHH detected 629 and 256 genes indicating signatures of selection, respectively. Genomic regions identified using F_ST and xp-EHH contained the CIDEA, HHATL, MGST1, FADS1, RTL1 and DGKG genes, which were reported earlier to influence a number of economic traits. Both F_ST and xp-EHH approaches identified 60 shared genes as the signatures of selection, including four candidate genes (NT5E, ADA2, C8A and C8B) that were enriched for two significant Gene Ontology (GO) terms associated with the adenosine metabolic procedure. Knowledge about the candidate genomic regions under selective pressure in sheep breeds may facilitate identification of the underlying genes and enhance our understanding on these genes role in local adaptation.

Collapse

Khvorykh GV, Khrunin AV. imputeqc: an R package for assessing imputation quality of genotypes and optimizing imputation parameters. BMC Bioinformatics 2020;21:304. [PMID: 32703240 PMCID: PMC7379353 DOI: 10.1186/s12859-020-03589-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Accepted: 06/08/2020] [Indexed: 11/26/2022] Open

Theodoridis S, Randin C, Szövényi P, Boucher FC, Patsiou TS, Conti E. How Do Cold-Adapted Plants Respond to Climatic Cycles? Interglacial Expansion Explains Current Distribution and Genomic Diversity in Primula farinosa L. Syst Biol 2018;66:715-736. [PMID: 28334079 DOI: 10.1093/sysbio/syw114] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Accepted: 12/14/2016] [Indexed: 12/16/2022] Open

Louzoun Y, Alter I, Gragert L, Albrecht M, Maiers M. Modeling coverage gaps in haplotype frequencies via Bayesian inference to improve stem cell donor selection. Immunogenetics 2017;70:279-292. [DOI: 10.1007/s00251-017-1040-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Accepted: 10/23/2017] [Indexed: 11/24/2022]

Weir BS, Goudet J. A Unified Characterization of Population Structure and Relatedness. Genetics 2017;206:2085-2103. [PMID: 28550018 PMCID: PMC5560808 DOI: 10.1534/genetics.116.198424] [Citation(s) in RCA: 83] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2016] [Accepted: 05/17/2017] [Indexed: 11/18/2022] Open

Abstract

Many population genetic activities, ranging from evolutionary studies to association mapping, to forensic identification, rely on appropriate estimates of population structure or relatedness. All applications require recognition that quantities with an underlying meaning of allelic dependence are not defined in an absolute sense, but instead are made "relative to" some set of alleles other than the target set. The 1984 Weir and Cockerham [Formula: see text] estimate made explicit that the reference set of alleles was across populations, whereas standard kinship estimates do not make the reference explicit. Weir and Cockerham stated that their [Formula: see text] estimates were for independent populations, and standard kinship estimates have an implicit assumption that pairs of individuals in a study sample, other than the target pair, are unrelated or are not inbred. However, populations lose independence when there is migration between them, and dependencies between pairs of individuals in a population exist for more than one target pair. We have therefore recast our treatments of population structure, relatedness, and inbreeding to make explicit that the parameters of interest involve the differences in degrees of allelic dependence between the target and the reference sets of alleles, and so can be negative. We take the reference set to be the population from which study individuals have been sampled. We provide simple moment estimates of these parameters, phrased in terms of allelic matching within and between individuals for relatedness and inbreeding, or within and between populations for population structure. A multi-level hierarchy of alleles within individuals, alleles between individuals within populations, and alleles between populations, allows a unified treatment of relatedness and population structure. We expect our new measures to have a wide range of applications, but we note that their estimates are sensitive to rare or private variants: some population-characterization applications suggest exploiting those sensitivities, whereas estimation of relatedness may best use all genetic markers without filtering on minor allele frequency.

Collapse

Roshyara NR, Scholz M. Impact of genetic similarity on imputation accuracy. BMC Genet 2015;16:90. [PMID: 26193934 PMCID: PMC4509609 DOI: 10.1186/s12863-015-0248-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2015] [Accepted: 07/07/2015] [Indexed: 01/06/2023] Open

Abstract

Background

Genotype imputation is a common technique in genetic research. Genetic similarity between target population and reference dataset is crucial for high-quality results. Although several reference panels are available, it is often not clear which is the most optimal for a particular target dataset to be imputed. Maximizing genetic similarity between study sample and intended reference panels may be the straight forward method for selecting the genetically best-matched reference. However, the impact of genetic similarity on imputation accuracy has not yet been studied in detail.

Results

We performed a simulation study in 20 ethnic groups obtained from POPRES. High-quality SNPs were masked and re-imputed with MaCH, MaCH-minimac and IMPUTE2 using four different HapMap reference panels (CEU, CHB-JPT, MEX and YRI). Imputation accuracy was assessed by different statistics. Genetic similarity between ethnic groups and reference populations were measured by F -statistics (F_ST) originally proposed by Wright and G -statistics (G_ST) introduced by Nei and others. To assess the predictive power of these measures regarding imputation accuracy, we analysed relations between them and corresponding imputation accuracy scores. We found that population genetic distances between homogeneous reference and target populations were strongly linearly correlated with resulting imputation accuracies irrespective of considered distance measure, imputation accuracy measure, missingness and imputation software used. Possible exception was African population.

Conclusion

Usage of G_ST or F_ST-related measures for predicting the optimal reference panel for imputation frameworks relying on a specific reference is highly recommended. A cut-off of G_ST < 0.01 is recommended to achieve good imputation results for high-frequency variants and small data sets. The linear relationship is less pronounced for low-frequency variants for which we also observed a dependence of imputation accuracy on the number of polymorphic sites in the reference. We also show that the software specific measures MaCH-Rsq and IMPUTE-info must be interpreted with caution if the genetic distance of target and reference population is high.

Electronic supplementary material

The online version of this article (doi:10.1186/s12863-015-0248-2) contains supplementary material, which is available to authorized users.

Collapse

Gholami M, Reimer C, Erbe M, Preisinger R, Weigend A, Weigend S, Servin B, Simianer H. Genome Scan for Selection in Structured Layer Chicken Populations Exploiting Linkage Disequilibrium Information. PLoS One 2015;10:e0130497. [PMID: 26151449 PMCID: PMC4494984 DOI: 10.1371/journal.pone.0130497] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Accepted: 05/20/2015] [Indexed: 01/02/2023] Open

Mapping signatures of positive selection in the genome of livestock. Livest Sci 2014. [DOI: 10.1016/j.livsci.2014.05.003] [Citation(s) in RCA: 96] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Randhawa IAS, Khatkar MS, Thomson PC, Raadsma HW. Composite selection signals can localize the trait specific genomic regions in multi-breed populations of cattle and sheep. BMC Genet 2014;15:34. [PMID: 24636660 PMCID: PMC4101850 DOI: 10.1186/1471-2156-15-34] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2013] [Accepted: 03/10/2014] [Indexed: 12/22/2022] Open

Abstract

Background

Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse constituent tests. The extreme CSS scores capture highly differentiated loci and underlying common variants hauling excess haplotype homozygosity in the samples of a target population.

Results

The data on high-density genotypes were analyzed for evidence of an association with either polledness or double muscling in various cohorts of cattle and sheep. In cattle, extreme CSS scores were found in the candidate regions on autosome BTA-1 and BTA-2, flanking the POLL locus and MSTN gene, for polledness and double muscling, respectively. In sheep, the regions with extreme scores were localized on autosome OAR-2 harbouring the MSTN gene for double muscling and on OAR-10 harbouring the RXFP2 gene for polledness. In comparison to the constituent tests, there was a partial agreement between the signals at the four candidate loci; however, they consistently identified additional genomic regions harbouring no known genes. Persuasively, our list of all the additional significant CSS regions contains genes that have been successfully implicated to secondary phenotypic diversity among several subpopulations in our data. For example, the method identified a strong selection signature for stature in cattle capturing selective sweeps harbouring UQCC-GDF5 and PLAG1-CHCHD7 gene regions on BTA-13 and BTA-14, respectively. Both gene pairs have been previously associated with height in humans, while PLAG1-CHCHD7 has also been reported for stature in cattle. In the additional analysis, CSS identified significant regions harbouring multiple genes for various traits under selection in European cattle including polledness, adaptation, metabolism, growth rate, stature, immunity, reproduction traits and some other candidate genes for dairy and beef production.

Conclusions

CSS successfully localized the candidate regions in validation datasets as well as identified previously known and novel regions for various traits experiencing selection pressure. Together, the results demonstrate the utility of CSS by its improved power, reduced false positives and high-resolution of selection signals as compared to individual constituent tests.

Collapse

Detecting and measuring selection from gene frequency data. Genetics 2013;196:799-817. [PMID: 24361938 DOI: 10.1534/genetics.113.152991] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Abstract

The recent advent of high-throughput sequencing and genotyping technologies makes it possible to produce, easily and cost effectively, large amounts of detailed data on the genotype composition of populations. Detecting locus-specific effects may help identify those genes that have been, or are currently, targeted by natural selection. How best to identify these selected regions, loci, or single nucleotides remains a challenging issue. Here, we introduce a new model-based method, called SelEstim, to distinguish putative selected polymorphisms from the background of neutral (or nearly neutral) ones and to estimate the intensity of selection at the former. The underlying population genetic model is a diffusion approximation for the distribution of allele frequency in a population subdivided into a number of demes that exchange migrants. We use a Markov chain Monte Carlo algorithm for sampling from the joint posterior distribution of the model parameters, in a hierarchical Bayesian framework. We present evidence from stochastic simulations, which demonstrates the good power of SelEstim to identify loci targeted by selection and to estimate the strength of selection acting on these loci, within each deme. We also reanalyze a subset of SNP data from the Stanford HGDP-CEPH Human Genome Diversity Cell Line Panel to illustrate the performance of SelEstim on real data. In agreement with previous studies, our analyses point to a very strong signal of positive selection upstream of the LCT gene, which encodes for the enzyme lactase-phlorizin hydrolase and is associated with adult-type hypolactasia. The geographical distribution of the strength of positive selection across the Old World matches the interpolated map of lactase persistence phenotype frequencies, with the strongest selection coefficients in Europe and in the Indus Valley.

Collapse

Detecting signatures of selection through haplotype differentiation among hierarchically structured populations. Genetics 2013;193:929-41. [PMID: 23307896 DOI: 10.1534/genetics.112.147231] [Citation(s) in RCA: 208] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Schlebusch CM, Soodyall H. Extensive Population Structure in San, Khoe, and Mixed Ancestry Populations from Southern Africa Revealed by 44 Short 5-SNP Haplotypes. Hum Biol 2012;84:695-724. [DOI: 10.3378/027.084.0603] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/15/2013] [Indexed: 11/05/2022]

Weir BS. Estimating F-statistics: A historical view. PHILOSOPHY OF SCIENCE 2012;79:637-643. [PMID: 26405363 PMCID: PMC4578636 DOI: 10.1086/667904] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Salm MPA, Horswell SD, Hutchison CE, Speedy HE, Yang X, Liang L, Schadt EE, Cookson WO, Wierzbicki AS, Naoumova RP, Shoulders CC. The origin, global distribution, and functional impact of the human 8p23 inversion polymorphism. Genome Res 2012;22:1144-53. [PMID: 22399572 PMCID: PMC3371712 DOI: 10.1101/gr.126037.111] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]

Lenstra JA, Groeneveld LF, Eding H, Kantanen J, Williams JL, Taberlet P, Nicolazzi EL, Sölkner J, Simianer H, Ciani E, Garcia JF, Bruford MW, Ajmone-Marsan P, Weigend S. Molecular tools and analytical approaches for the characterization of farm animal genetic diversity. Anim Genet 2012;43:483-502. [DOI: 10.1111/j.1365-2052.2011.02309.x] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/15/2011] [Indexed: 12/30/2022]

Lawson DJ, Hellenthal G, Myers S, Falush D. Inference of population structure using dense haplotype data. PLoS Genet 2012;8:e1002453. [PMID: 22291602 PMCID: PMC3266881 DOI: 10.1371/journal.pgen.1002453] [Citation(s) in RCA: 703] [Impact Index Per Article: 58.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2011] [Accepted: 11/21/2011] [Indexed: 12/12/2022] Open

Abstract

The advent of genome-wide dense variation data provides an opportunity to investigate ancestry in unprecedented detail, but presents new statistical challenges. We propose a novel inference framework that aims to efficiently capture information on population structure provided by patterns of haplotype similarity. Each individual in a sample is considered in turn as a recipient, whose chromosomes are reconstructed using chunks of DNA donated by the other individuals. Results of this "chromosome painting" can be summarized as a "coancestry matrix," which directly reveals key information about ancestral relationships among individuals. If markers are viewed as independent, we show that this matrix almost completely captures the information used by both standard Principal Components Analysis (PCA) and model-based approaches such as STRUCTURE in a unified manner. Furthermore, when markers are in linkage disequilibrium, the matrix combines information across successive markers to increase the ability to discern fine-scale population structure using PCA. In parallel, we have developed an efficient model-based approach to identify discrete populations using this matrix, which offers advantages over PCA in terms of interpretability and over existing clustering algorithms in terms of speed, number of separable populations, and sensitivity to subtle population structure. We analyse Human Genome Diversity Panel data for 938 individuals and 641,000 markers, and we identify 226 populations reflecting differences on continental, regional, local, and family scales. We present multiple lines of evidence that, while many methods capture similar information among strongly differentiated groups, more subtle population structure in human populations is consistently present at a much finer level than currently available geographic labels and is only captured by the haplotype-based approach. The software used for this article, ChromoPainter and fineSTRUCTURE, is available from http://www.paintmychromosomes.com/.

Collapse

San Lucas FA, Rosenberg NA, Scheet P. Haploscope: a tool for the graphical display of haplotype structure in populations. Genet Epidemiol 2011;36:17-21. [PMID: 22147662 DOI: 10.1002/gepi.20640] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Revised: 09/14/2011] [Accepted: 09/21/2011] [Indexed: 11/11/2022]

Allendorf FW, Hohenlohe PA, Luikart G. Genomics and the future of conservation genetics. Nat Rev Genet 2010;11:697-709. [DOI: 10.1038/nrg2844] [Citation(s) in RCA: 939] [Impact Index Per Article: 67.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]