1
|
Nisa FU, Kaul H, Asif M, Amin I, Mrode R, Mansoor S, Mukhtar Z. Genetic insights into crossbred dairy cattle of Pakistan: exploring allele frequency, linkage disequilibrium, and effective population size at a genome-wide scale. Mamm Genome 2023; 34:602-614. [PMID: 37804434 DOI: 10.1007/s00335-023-10019-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 09/13/2023] [Indexed: 10/09/2023]
Abstract
Linkage disequilibrium (LD) affects genomic studies accuracy. High-density genotyping platforms identify SNPs across animal genomes, increasing LD evaluation resolution for accurate analysis. This study aimed to evaluate the decay and magnitude of LD in a cohort of 81 crossbred dairy cattle using the GGP_HDv3_C Bead Chip. After quality control, 116,710 Single Nucleotide Polymorphisms (SNPs) across 2520.241 Mb of autosomes were retained. LD extent was assessed between autosomal SNPs within a 10 Mb range using the r2 statistics. LD value declined as inter-marker distance increased. The average r2 value was 0.24 for SNP pairs < 10 kb apart, decreasing to 0.13 for 50-100 kb distances. Minor allele frequency (MAF) and sample size significantly impact LD. Lower MAF thresholds result in smaller r2 values, while higher thresholds show increased r2 values. Additionally, smaller sample sizes exhibit higher average r2 values, especially for larger physical distance intervals (> 50 kb) between SNP pairs. Effective population size and inbreeding coefficient were 150 and 0.028 for the present generation, indicating a decrease in genetic diversity over time. These findings imply that the utilization of high-density SNP panels and customized/breed-specific SNP panels represent a highly favorable approach for conducting genome-wide association studies (GWAS) and implementing genomic selection (GS) in the Bos indicus cattle breeds, whose genomes are still largely unexplored. Furthermore, it is imperative to devise a meticulous breeding strategy tailored to each herd, aiming to enhance desired traits while simultaneously preserving genetic diversity.
Collapse
Affiliation(s)
- Fakhar Un Nisa
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- Department of Animal Breeding and Genetics, University of Veterinary and Animal Sciences, Lahore, Pakistan
| | - Haiba Kaul
- Department of Animal Breeding and Genetics, University of Veterinary and Animal Sciences, Lahore, Pakistan
| | - Muhammad Asif
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Imran Amin
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Raphael Mrode
- Animal Biosciences, International Livestock Research Institute, Nairobi, Kenya
- Animal and Veterinary Sciences, Scotland's Rural College, Edinburgh, UK
| | - Shahid Mansoor
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- International Centre for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan
| | - Zahid Mukhtar
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering College (NIBGE-C), Faisalabad, Pakistan.
- Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan.
| |
Collapse
|
2
|
Barani S, Nejati-Javaremi A, Moradi MH, Moradi-Sharbabak M, Gholizadeh M, Esfandyari H. Genome-wide study of linkage disequilibrium, population structure, and inbreeding in Iranian indigenous sheep breeds. PLoS One 2023; 18:e0286463. [PMID: 37267244 DOI: 10.1371/journal.pone.0286463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 05/16/2023] [Indexed: 06/04/2023] Open
Abstract
Knowledge of linkage disequilibrium (LD), genetic structure and genetic diversity are some key parameters to study the breeding history of indigenous small ruminants. In this study, the OvineSNP50 Bead Chip array was used to estimate and compare LD, genetic diversity, effective population size (Ne) and genomic inbreeding in 186 individuals, from three Iranian indigenous sheep breeds consisting of Baluchi (n = 96), Lori-Bakhtiari (n = 47) and Zel (n = 47). The results of principal component analysis (PCA) revealed that all animals were allocated to the groups that they sampled and the admixture analysis revealed that the structure within the populations is best explained when separated into three groups (K = 3). The average r2 values estimated between adjacent single nucleotide polymorphisms (SNPs) at distances up to 10Kb, were 0.388±0.324, 0.353±0.311, and 0.333±0.309 for Baluchi, Lori-Bakhtiari and Zel, respectively. Estimation of genetic diversity and effective population size (Ne) showed that the Zel breed had the highest heterozygosity and Ne, whereas the lowest value was found in Baluchi breed. Estimation of genomic inbreeding using FROH (based on the long stretches of consecutive homozygous genotypes) showed the highest inbreeding coefficient in Baluchi and the lowest in Zel breed that could be due to higher pressure of artificial selection on Baluchi breed. The results of genomic inbreeding and Ne showed an increase in sharing haplotypes in Baluchi, leading to the enlargement of LD and the consequences of linkage disequilibrium and haplotype blocks confirmed this point. Also, the persistence of the LD phase between Zel and Lori-Bakhtiari was highest indicating that these two breeds would be combined in a multi-breed training population in genomic selection studies.
Collapse
Affiliation(s)
- S Barani
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - A Nejati-Javaremi
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - M H Moradi
- Department of Animal Science, Faculty of Agriculture and Natural Resources, Arak University, Arak, Iran
| | - M Moradi-Sharbabak
- Department of Animal Science, University College of Agriculture and Natural Resources, University of Tehran, Karaj, Iran
| | - M Gholizadeh
- Department of Animal Science, Sari Agricultural Sciences and Natural Resources University, Sari, Mazandaran, Iran
| | | |
Collapse
|
3
|
Xu J, Zhang W, Zhang P, Sun W, Han Y, Li L. A comprehensive analysis of copy number variations in diverse apple populations. BMC Genomics 2023; 24:256. [PMID: 37170226 PMCID: PMC10176694 DOI: 10.1186/s12864-023-09347-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 08/16/2022] [Indexed: 05/13/2023] Open
Abstract
BACKGROUND As an important source of genetic variation, copy number variation (CNV) can alter the dosage of DNA segments, which in turn may affect gene expression level and phenotype. However, our knowledge of CNV in apple is still limited. Here, we obtained high-confidence CNVs and investigated their functional impact based on genome resequencing data of two apple populations, cultivars and wild relatives. RESULTS In this study, we identified 914,610 CNVs comprising 14,839 CNV regions (CNVRs) from 346 apple accessions, including 289 cultivars and 57 wild relatives. CNVRs summed to 71.19 Mb, accounting for 10.03% of the apple genome. Under the low linkage disequilibrium (LD) with nearby SNPs, they could also accurately reflect the population structure of apple independent of SNPs. Furthermore, A total of 3,621 genes were covered by CNVRs and functionally involved in biological processes such as defense response, reproduction and metabolic processes. In addition, the population differentiation index ([Formula: see text]) analysis between cultivars and wild relatives revealed 127 CN-differentiated genes, which may contribute to trait differences in these two populations. CONCLUSIONS This study was based on identification of CNVs from 346 diverse apple accessions, which to our knowledge was the largest dataset for CNV analysis in apple. Our work presented the first comprehensive CNV map and provided valuable resources for understanding genomic variations in apple.
Collapse
Affiliation(s)
- Jinsheng Xu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Weihan Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Ping Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Weicheng Sun
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China
| | - Yuepeng Han
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, 430074, China.
- Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China.
| | - Li Li
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, China.
- Hubei Hongshan Laboratory, Huazhong Agricultural University, Wuhan, 430070, China.
| |
Collapse
|
4
|
Zimmermann E, Distl O. SNP-Based Heritability of Osteochondrosis Dissecans in Hanoverian Warmblood Horses. Animals (Basel) 2023; 13:ani13091462. [PMID: 37174498 PMCID: PMC10177438 DOI: 10.3390/ani13091462] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 03/24/2023] [Accepted: 04/21/2023] [Indexed: 05/15/2023] Open
Abstract
Before the genomics era, heritability estimates were performed using pedigree data. Data collection for pedigree analysis is time consuming and holds the risk of incorrect or incomplete data. With the availability of SNP-based arrays, heritability can now be estimated based on genotyping data. We used SNP array and 1.6 million imputed genotype data with different minor allele frequency restrictions to estimate heritabilities for osteochondrosis dissecans in the fetlock, hock and stifle joints of 446 Hanoverian warmblood horses. SNP-based heritabilities were estimated using a genomic restricted maximum likelihood (GREML) method and accounting for patterns of regional linkage disequilibrium in the equine genome. In addition, we employed GREML for family data to account for different degrees of relatedness in the study population. Our results indicate that we were able to capture a larger proportion of additive genetic variance compared to pedigree-based estimates in the same population of Hanoverian horses. Heritability estimates on the linear scale for fetlock-, hock- and stifle-osteochondrosis dissecans were 0.41-0.43, 0.62-0.63, and 0.23-0.25, respectively, with standard errors of 0.11-0.14. Accounting for linkage disequilibrium patterns had an upward effect on the imputed data and a downward impact on the SNP array genotype data. GREML for family data resulted in higher heritability estimates for fetlock-osteochondrosis dissecans and slightly higher estimates for hock-osteochondrosis dissecans, but had no effect on stifle-osteochondrosis dissecans. The largest and most consistent heritability estimates were obtained when we employed GREML for family data with genomic relationship matrices weighted through patterns of regional linkage disequilibrium. Estimation of SNP-based heritability should be recommended for traits that can only be phenotyped in smaller samples or are cost-effective.
Collapse
Affiliation(s)
- Elisa Zimmermann
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover (Foundation), 30559 Hannover, Germany
| | - Ottmar Distl
- Institute for Animal Breeding and Genetics, University of Veterinary Medicine Hannover (Foundation), 30559 Hannover, Germany
| |
Collapse
|
5
|
Balog K, Mizeranschi AE, Wanjala G, Sipos B, Kusza S, Bagi Z. Application potential of chicken DNA chip in domestic pigeon species - Preliminary results. Saudi J Biol Sci 2023; 30:103594. [PMID: 36874200 PMCID: PMC9975693 DOI: 10.1016/j.sjbs.2023.103594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 01/12/2023] [Accepted: 02/06/2023] [Indexed: 02/12/2023] Open
Abstract
Introducing the SNP technology to pigeon breeding will enhance the competitiveness of a sector that produces one of the healthiest and best quality meats. The present study aimed to test the applicability of the Illumina Chicken_50K_CobbCons array on 24 domestic pigeon individuals from the Mirthys hybrids and Racing pigeon breeds. A total of 53,313 SNPs were genotyped. Principal component analysis shows a significant overlap between the two groups. The chip performed poorly in this data set, with a call rate per sample of 0.474 (49%). The low call rate was likely due to an increase in the evolutionary distance. A total of 356 SNPs were retained after a relatively strict quality control. We have demonstrated that it is technically feasible to use a chicken microarray chip on pigeon samples. Presumably, with a larger sample size and by assigning phenotypic data, efficiency would be improved, allowing more thorough analyses, such as genome-wide association studies.
Collapse
Affiliation(s)
- Katalin Balog
- University of Debrecen, Doctoral School of Animal Science, Böszörményi út 138, 4032, Debrecen, Hungary.,Centre for Agricultural Genomics and Biotechnology, Faculty of Agricultural and Food Sciences and Environmental Management, University of Debrecen, 4002 Debrecen, Hungary
| | | | - George Wanjala
- University of Debrecen, Doctoral School of Animal Science, Böszörményi út 138, 4032, Debrecen, Hungary.,Centre for Agricultural Genomics and Biotechnology, Faculty of Agricultural and Food Sciences and Environmental Management, University of Debrecen, 4002 Debrecen, Hungary
| | - Bíborka Sipos
- University of Debrecen, Faculty of Agricultural and Food Sciences and Environmental Management, Böszörményi út 138, 4032, Debrecen, Hungary
| | - Szilvia Kusza
- Centre for Agricultural Genomics and Biotechnology, Faculty of Agricultural and Food Sciences and Environmental Management, University of Debrecen, 4002 Debrecen, Hungary
| | - Zoltán Bagi
- Centre for Agricultural Genomics and Biotechnology, Faculty of Agricultural and Food Sciences and Environmental Management, University of Debrecen, 4002 Debrecen, Hungary
| |
Collapse
|
6
|
Imputation of SNPs associated with presbycusis through linkage disequilibrium analysis in the ILDR1 gene. J Genet 2023. [DOI: 10.1007/s12041-022-01416-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
|
7
|
Gomez-Raya L, Rauw WM. Failing the four-gamete test enables exact phasing: the Corners’ Algorithm. GENETICS SELECTION EVOLUTION 2022; 54:74. [PMCID: PMC9661815 DOI: 10.1186/s12711-022-00763-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 10/24/2022] [Indexed: 11/16/2022]
Abstract
Abstract
Background
Failing the four-gamete test for two polymorphic DNA markers is an indication that two or three rather than four haplotypes segregate in the population. The objective of this paper is to show that when just three haplotypes are segregating, all three haplotypes can be fully and unambiguously phase-resolved.
Theory and methods
The Corners’ Algorithm tests the four corners in a 3 × 3 table of two-locus genotypes. If one of the four corners is filled with zeroes, then the missing haplotype is identified and the phases of all three haplotypes can be unambiguously resolved for all individuals. Three applications of this method are proposed when the four-gamete test fails: (1) direct estimation of linkage disequilibrium (LD), (2) haplotype-based genome-wide association studies (GWAS) of three haplotypes (single-marker GWAS tests for two out of three haplotypes only), and (3) haplotyping of chromosomal regions that are comprised of pairs of single nucleotide polymorphisms (SNPs) that consist of just three haplotypes. An example based on 435 sows with performance records for total number of piglets born is used to illustrate the methods.
Results
Of 20,339 SNPs, approximately 50% of the pairs of flanking SNPs failed the four-gamete test. For those, the expectation maximization (EM) algorithm gave the same results. The average of the absolute value of the difference in r2 between flanking SNPs across the genome between the two methods was 0.00082. Single-marker GWAS (using two of three haplotypes) detected significant associations for total number of piglets born on chromosomes 1, 2, 6, 9, 10, 12, 13, 14, 15, and 18. Haplotype-based GWAS using the third haplotype resolved with the Corners’ Algorithm detected additional significant associations for total number of piglets born on chromosomes 2, 5, 10, 13, 14, 15, and 18. Estimated substitution effects ranged from 0.40 to 1.35 piglets. Haplotyping of chromosomal regions that failed the four-gamete test for any pair of SNPs covered 961 Mb out of the 2249 Mb by the SNP array.
Conclusions
The Corner’s Algorithm allows to fully phase haplotypes when the four-gamete test fails. Longer haplotypes in chromosomal regions in which the four-gamete test fails for any pair of SNPs can be used as a multi-allelic marker with increased polymorphism information content.
Collapse
|
8
|
Wang Y, Tsuo K, Kanai M, Neale BM, Martin AR. Challenges and Opportunities for Developing More Generalizable Polygenic Risk Scores. Annu Rev Biomed Data Sci 2022; 5:293-320. [PMID: 35576555 PMCID: PMC9828290 DOI: 10.1146/annurev-biodatasci-111721-074830] [Citation(s) in RCA: 53] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Polygenic risk scores (PRS) estimate an individual's genetic likelihood of complex traits and diseases by aggregating information across multiple genetic variants identified from genome-wide association studies. PRS can predict a broad spectrum of diseases and have therefore been widely used in research settings. Some work has investigated their potential applications as biomarkers in preventative medicine, but significant work is still needed to definitively establish and communicate absolute risk to patients for genetic and modifiable risk factors across demographic groups. However, the biggest limitation of PRS currently is that they show poor generalizability across diverse ancestries and cohorts. Major efforts are underway through methodological development and data generation initiatives to improve their generalizability. This review aims to comprehensively discuss current progress on the development of PRS, the factors that affect their generalizability, and promising areas for improving their accuracy, portability, and implementation.
Collapse
Affiliation(s)
- Ying Wang
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA;
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
| | - Kristin Tsuo
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA;
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
- Biological and Biomedical Sciences, Harvard Medical School, Boston, Massachusetts, USA
| | - Masahiro Kanai
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA;
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
| | - Benjamin M Neale
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA;
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
| | - Alicia R Martin
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA;
- Stanley Center for Psychiatric Research and Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, USA
| |
Collapse
|
9
|
Zafar A, Shafiq M, Ali B, Sadee W, Shakoori AR, Shakoori FR. Association of IRGM promoter region polymorphisms and haplotype with pulmonary tuberculosis in Pakistani (Punjab) population. Tuberculosis (Edinb) 2022; 136:102233. [DOI: 10.1016/j.tube.2022.102233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Revised: 06/03/2022] [Accepted: 07/13/2022] [Indexed: 10/17/2022]
|
10
|
Biddanda A, Steinrücken M, Novembre J. Properties of Two-Locus Genealogies and Linkage Disequilibrium in Temporally Structured Samples. Genetics 2022; 221:6549526. [PMID: 35294015 PMCID: PMC9245597 DOI: 10.1093/genetics/iyac038] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 02/06/2022] [Indexed: 11/13/2022] Open
Abstract
Archaeogenetics has been revolutionary, revealing insights into demographic history and recent positive selection. However, most studies to date have ignored the non-random association of genetic variants at different loci (i.e., linkage disequilibrium, LD). This may be in part because basic properties of LD in samples from different times are still not well understood. Here, we derive several results for summary statistics of haplotypic variation under a model with time-stratified sampling: 1) The correlation between the number of pairwise differences observed between time-staggered samples (πΔt) in models with and without strict population continuity; 2) The product of the LD coefficient, D, between ancient and modern samples, which is a measure of haplotypic similarity between modern and ancient samples; and 3) The expected switch rate in the Li and Stephens haplotype copying model. The latter has implications for genotype imputation and phasing in ancient samples with modern reference panels. Overall, these results provide a characterization of how haplotype patterns are affected by sample age, recombination rates, and population sizes. We expect these results will help guide the interpretation and analysis of haplotype data from ancient and modern samples.
Collapse
Affiliation(s)
- Arjun Biddanda
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | - Matthias Steinrücken
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.,Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.,Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
11
|
Wang QS, Huang H. Methods for statistical fine-mapping and their applications to auto-immune diseases. Semin Immunopathol 2022; 44:101-113. [PMID: 35041074 PMCID: PMC8837575 DOI: 10.1007/s00281-021-00902-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Accepted: 10/22/2021] [Indexed: 01/07/2023]
Abstract
Although genome-wide association studies (GWAS) have identified thousands of loci in the human genome that are associated with different traits, understanding the biological mechanisms underlying the association signals identified in GWAS remains challenging. Statistical fine-mapping is a method aiming to refine GWAS signals by evaluating which variant(s) are truly causal to the phenotype. Here, we review the types of statistical fine-mapping methods that have been widely used to date, with a focus on recently developed functionally informed fine-mapping (FIFM) methods that utilize functional annotations. We then systematically review the applications of statistical fine-mapping in autoimmune disease studies to highlight the value of statistical fine-mapping in biological contexts.
Collapse
Affiliation(s)
- Qingbo S Wang
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Osaka, Japan.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Hailiang Huang
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
12
|
Estimation of linkage disequilibrium levels and allele frequency distribution in crossbred Vrindavani cattle using 50K SNP data. PLoS One 2021; 16:e0259572. [PMID: 34762692 PMCID: PMC8584695 DOI: 10.1371/journal.pone.0259572] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 10/22/2021] [Indexed: 11/29/2022] Open
Abstract
The objective of this study was to calculate the extent and decay of linkage disequilibrium (LD) in 96 crossbred Vrindavani cattle genotyped with Bovine SNP50K Bead Chip. After filtering, 43,821 SNPs were retained for final analysis, across 2500.3 Mb of autosome. A significant percentage of SNPs was having minor allele frequency of less than 0.20. The extent of LD between autosomal SNPs up to 10 Mb apart across the genome was measured using r2 statistic. The mean r2 value was 0.43, if pairwise distance of marker was less than10 kb and it decreased further to 0.21 for 25–50 kb markers distance. Further, the effect of minor allele frequency and sample size on LD estimate was investigated. The LD value decreased with the increase in inter-marker distance, and increased with the increase of minor allelic frequency. The estimated inbreeding coefficient and effective population size were 0.04, and 46 for present generation, which indicated small and unstable population of Vrindavani cattle. These findings suggested that a denser or breed specific SNP panel would be required to cover all genome of Vrindavani cattle for genome wide association studies (GWAS).
Collapse
|
13
|
Chen J, Ali MW, Yan L, Dighe SG, Dai JY, Vaughan TL, Casey G, Buas MF. Prioritization and functional analysis of GWAS risk loci for Barrett's esophagus and esophageal adenocarcinoma. Hum Mol Genet 2021; 31:410-422. [PMID: 34505128 DOI: 10.1093/hmg/ddab259] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 08/17/2021] [Accepted: 08/30/2021] [Indexed: 01/03/2023] Open
Abstract
Genome-wide association studies (GWAS) have identified ~ 20 genetic susceptibility loci for esophageal adenocarcinoma (EAC), and its precursor, Barrett's esophagus (BE). Despite such advances, functional/causal variants and gene targets at these loci remain undefined, hindering clinical translation. A key challenge is that most causal variants map to non-coding regulatory regions such as enhancers, and typically, numerous potential candidate variants at GWAS loci require testing. We developed a systematic informatics pipeline for prioritizing candidate functional variants via integrative functional potential scores consolidated from multi-omics annotations, and used this pipeline to identify two high-scoring variants for experimental interrogation: chr9q22.32/rs11789015 and chr19p13.11/rs10423674. Minimal candidate enhancer regions spanning these variants were evaluated using luciferase reporter assays in two EAC cell lines. One of the two variants tested (rs10423674) exhibited allele-specific enhancer activity. CRISPR-mediated deletion of the putative enhancer region in EAC cell lines correlated with reduced expression of two genes-CREB-regulated transcription coactivator 1 (CRTC1) and Cartilage oligomeric matrix protein (COMP); expression of five other genes remained unchanged (CRLF1, KLHL26, TMEM59L, UBA52, RFXANK). Expression quantitative trait locus (eQTL) mapping indicated that rs10423674 genotype correlated with CRTC1 and COMP expression in normal esophagus. This study represents the first experimental effort to bridge GWAS associations to biology in BE/EAC, and supports the utility of functional potential scores to guide variant prioritization. Our findings reveal a functional variant and candidate risk enhancer at chr19p13.11, and implicate CRTC1 and COMP as putative gene targets, suggesting that altered expression of these genes may underlie the BE/EAC risk association.
Collapse
Affiliation(s)
- Jianhong Chen
- Department of Cancer Prevention and Control, Roswell Park Comprehensive Cancer Center, Buffalo, NY, 14263 USA
| | - Mourad Wagdy Ali
- Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22903 USA
| | - Li Yan
- Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, 14263 USA
| | - Shruti G Dighe
- Department of Cancer Prevention and Control, Roswell Park Comprehensive Cancer Center, Buffalo, NY, 14263 USA
| | - James Y Dai
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109 USA
| | - Thomas L Vaughan
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, 98109 USA.,Department of Epidemiology, University of Washington, School of Public Health, Seattle, Washington, 98195 USA
| | - Graham Casey
- Center for Public Health Genomics, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22903 USA
| | - Matthew F Buas
- Department of Cancer Prevention and Control, Roswell Park Comprehensive Cancer Center, Buffalo, NY, 14263 USA
| |
Collapse
|
14
|
Genome-Wide Analysis Revealed Homozygosity and Demographic History of Five Chinese Sheep Breeds Adapted to Different Environments. Genes (Basel) 2020; 11:genes11121480. [PMID: 33317115 PMCID: PMC7764688 DOI: 10.3390/genes11121480] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Revised: 12/06/2020] [Accepted: 12/07/2020] [Indexed: 01/18/2023] Open
Abstract
Homozygosity of long sequence genotypes are a result of parents transmitting identical haplotypes, which can be used to estimate their auto-zygosity. Therefore, we used high-density SNP Chip data to characterize the auto-zygosity of each breed according to the occurrence and distribution of runs of homozygosity (ROH). Subsequently, we identified the genomic regions with high runs of homozygosity frequencies within individuals of each breed. We selected 96 sheep samples from five local Chinese sheep breeds belonging to different geographical locations. We identified 3046 ROHs within the study breed individuals, among which the longer segments (>1–5 Mb) were dominant. On average, ROH segments covered about 12% of the genomes; the coverage rate of OAR20 was the lowest and that of OAR2 was the highest. The distribution analysis of runs of homozygosity showed that the detected ROH mainly distributed between >26 and 28 Mb. The Hetian and Hu sheep showed the lowest ROH distribution. The estimation of homozygosity level reflects the history of modern and ancient inbreeding, which may affect the genomes of Chinese indigenous sheep breeds and indicate that some animals have experienced recent self-pollination events (Yabuyi, Karakul and Wadi). In these sheep breeds, the genomic regions were assumed to be under selection signatures frequently in line with long ROH. These regions included candidate genes associated with disease resistance traits (5S_rRNA), the innate and adaptive immune response (HERC2 and CYFIP1), digestion and metabolism (CENPJ), growth (SPP1), body size and developments (GJB2 and GJA3). This study highlighted new insights into the ROH patterns and provides a basis for future breeding and conservation strategies of Chinese sheep breeds.
Collapse
|
15
|
Awany D, Chimusa ER. Heritability jointly explained by host genotype and microbiome: will improve traits prediction? Brief Bioinform 2020; 22:5893981. [PMID: 32810866 DOI: 10.1093/bib/bbaa175] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 07/09/2020] [Accepted: 07/10/2020] [Indexed: 11/14/2022] Open
Abstract
As we observe the $70$th anniversary of the publication by Robertson that formalized the notion of 'heritability', geneticists remain puzzled by the problem of missing/hidden heritability, where heritability estimates from genome-wide association studies (GWASs) fall short of that from twin-based studies. Many possible explanations have been offered for this discrepancy, including existence of genetic variants poorly captured by existing arrays, dominance, epistasis and unaccounted-for environmental factors; albeit these remain controversial. We believe a substantial part of this problem could be solved or better understood by incorporating the host's microbiota information in the GWAS model for heritability estimation and may also increase human traits prediction for clinical utility. This is because, despite empirical observations such as (i) the intimate role of the microbiome in many complex human phenotypes, (ii) the overlap between genetic variants associated with both microbiome attributes and complex diseases and (iii) the existence of heritable bacterial taxa, current GWAS models for heritability estimate do not take into account the contributory role of the microbiome. Furthermore, heritability estimate from twin-based studies does not discern microbiome component of the observed total phenotypic variance. Here, we summarize the concept of heritability in GWAS and microbiome-wide association studies, focusing on its estimation, from a statistical genetics perspective. We then discuss a possible statistical method to incorporate the microbiome in the estimation of heritability in host GWAS.
Collapse
Affiliation(s)
- Denis Awany
- Division of Human Genetics, Department of Pathology, University of Cape Town, Cape Town, South Africa
| | - Emile R Chimusa
- Computational Biology Division, Department of Integrative Biomedical Sciences, Institute of Infectious Disease and Molecular Medicine, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa
| |
Collapse
|
16
|
Pinto AR, Silva J, Pinto R, Medeiros R. Aggressive prostate cancer phenotype and genome-wide association studies: where are we now? Pharmacogenomics 2020; 21:487-503. [PMID: 32343194 DOI: 10.2217/pgs-2019-0123] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The majority of prostate cancer (PCa) is indolent, however, a percentage of patients are initially diagnosed with metastatic disease, for which there is a worse prognosis. There is a lack of biomarkers to identify men at greater risk for developing aggressive PCa. Genome-wide association studies (GWAS) scan the genome to search associations of SNPs with specific traits, like cancer. To date, eight GWAS have resulted in the reporting of 16 SNPs associated with aggressive PCa (p < 5.00 × 10-2). Still, validation studies need to be conducted to confirm the obtained results as GWAS can generate false-positive results. Furthermore, post-GWAS studies provide a better understanding of the functional consequences.
Collapse
Affiliation(s)
- Ana R Pinto
- Molecular Oncology & Viral Pathology Group, IPO-Porto Research Center, (CI-IPOP) Portuguese Oncology Institute of Porto (IPO-Porto), Rua Dr. António Bernardino de Almeida, 4200-4072 Porto, Portugal.,ICBAS, Abel Salazar Institute for the Biomedical Sciences, Rua de Jorge Viterbo Ferreira, 228, 4050-313 Porto, Portugal
| | - Jani Silva
- Molecular Oncology & Viral Pathology Group, IPO-Porto Research Center, (CI-IPOP) Portuguese Oncology Institute of Porto (IPO-Porto), Rua Dr. António Bernardino de Almeida, 4200-4072 Porto, Portugal
| | - Ricardo Pinto
- Molecular Oncology & Viral Pathology Group, IPO-Porto Research Center, (CI-IPOP) Portuguese Oncology Institute of Porto (IPO-Porto), Rua Dr. António Bernardino de Almeida, 4200-4072 Porto, Portugal
| | - Rui Medeiros
- Molecular Oncology & Viral Pathology Group, IPO-Porto Research Center, (CI-IPOP) Portuguese Oncology Institute of Porto (IPO-Porto), Rua Dr. António Bernardino de Almeida, 4200-4072 Porto, Portugal.,Research Department, Portuguese League Against Cancer (NRNorte), Estrada Interior da Circunvalação, 6657, 4200-172 Porto, Portugal.,CEBIMED, Faculty of Health Sciences, Fernando Pessoa University, Praça 9 de Abril, 349, 4249-004 Porto, Portugal
| |
Collapse
|
17
|
The nonlinear structure of linkage disequilibrium. Theor Popul Biol 2020; 134:160-170. [PMID: 32222435 DOI: 10.1016/j.tpb.2020.02.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 02/15/2020] [Accepted: 02/27/2020] [Indexed: 11/23/2022]
Abstract
The allele frequency dependence of the ranges of all measures of linkage disequilibrium is well-known. The maximum values of commonly used parameters such as r2 and D vary depending on the allele frequencies at each locus. However, though this phenomenon is recognized and accounted for in many studies, the comprehensive mathematical framework underlying the limits of linkage disequilibrium measures at various frequency combinations is often heuristic or empirical. Here, it is demonstrated that underlying this behavior is the fundamental shift between linear and nonlinear dependence in the linkage disequilibrium structure between loci. The proportion of linear and nonlinear dependence can be estimated and it demonstrates how even the same values of r2 can have different implications for the nature of the overall dependence. One result of this is the value of D', when defined as only a positive number, has a minimum value of |r|. Understanding this dependence is crucial to making correct inferences about the relationships between two loci in linkage disequilibrium.
Collapse
|
18
|
Tharabenjasin P, Pabalan N, Jarjanazi H, Poachanukoon O. Influence of Polymorphisms in the Interleukin-18 Gene on Allergic Rhinitis: A Meta-Analysis. Int Arch Allergy Immunol 2020; 181:375-384. [PMID: 32106113 DOI: 10.1159/000506010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 01/17/2020] [Indexed: 11/19/2022] Open
Abstract
PURPOSE Reported associations of interleukin-18 (IL-18) single-nucleotide polymorphisms (SNPs) with allergic rhinitis (AR) have been inconsistent, prompting a meta-analysis to obtain more precise estimates. METHODS We synthesized data from 8 articles and examined 3 IL-18 SNPs. Two SNPs (rs360721 and rs187238), in linkage disequilibrium, were combined and termed RS1. The rs1946518 SNP was analyzed separately (termed RS2). The recessive, dominant, and codominant (multiplicative) genetic models were used to estimate ORs and 95% CIs. Subgroup analysis was ethnicity-based. Sources of heterogeneity were investigated with outlier treatment. Sensitivity analysis was used to assess robustness of the associative effects. Multiple comparisons were Holm-Bonferroni corrected. RESULTS All significant (pa < 0.05) outcomes indicating increased risks were found in the dominant/codominant models in RS1 and RS2. Five aspects of differences marked the significant African (RS1) and overall (RS2) outcomes: (i) magnitude of effect (ORs): greater (3.01-5.15) versus less (1.20-1.47); (ii) precision of -effects (95% CIs): less (1.07-21.52) versus more (1.01-1.89); (iii) outlier treated: no versus yes; (iv) sensitivity outcomes: nonrobust versus robust (dominant model only); and (v) greater evidential strength for RS2 (pa = 0.002) compared to RS1 (pa = 0.02) rendered RS2 our core finding. These levels of statistical significance for RS1/RS2 enabled both to survive the Holm-Bonferroni correction. CONCLUSIONS The core outcome indicating a 1.5-fold increased risk could render the IL-18 polymorphisms useful in the clinical genetics of AR. Future studies that could focus on other IL-18 SNPs may find deeper associations with AR than what we found here.
Collapse
Affiliation(s)
- Phuntila Tharabenjasin
- Chulabhorn International College of Medicine, Thammasat University, Pathum Thani, Thailand,
| | - Noel Pabalan
- Chulabhorn International College of Medicine, Thammasat University, Pathum Thani, Thailand
| | - Hamdi Jarjanazi
- Environmental Monitoring and Reporting Branch, Ontario Ministry of the Environment, Conservation and Parks, Toronto, Ontario, Canada
| | - Orapan Poachanukoon
- Center of Excellence for Allergy, Asthma and Pulmonary Diseases, Faculty of Medicine, Thammasat University, Pathum Thani, Thailand
| |
Collapse
|
19
|
Kang JTL, Rosenberg NA. Mathematical Properties of Linkage Disequilibrium Statistics Defined by Normalization of the Coefficient D = pAB - pApB. Hum Hered 2020; 84:127-143. [PMID: 32045910 DOI: 10.1159/000504171] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 10/10/2019] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Many statistics for measuring linkage disequilibrium (LD) take the form of a normalization of the LD coefficient D. Different normalizations produce statistics with different ranges, interpretations, and arguments favoring their use. METHODS Here, to compare the mathematical properties of these normalizations, we consider 5 of these normalized statistics, describing their upper bounds, the mean values of their maxima over the set of possible allele frequency pairs, and the size of the allele frequency regions accessible given specified values of the statistics. RESULTS We produce detailed characterizations of these properties for the statistics d and ρ, analogous to computations previously performed for r2. We examine the relationships among the statistics, uncovering conditions under which some of them have close connections. CONCLUSION The results contribute insight into LD measurement, particularly the understanding of differences in the features of different LD measures when computed on the same data.
Collapse
Affiliation(s)
- Jonathan T L Kang
- Department of Biology, Stanford University, Stanford, California, USA,
| | - Noah A Rosenberg
- Department of Biology, Stanford University, Stanford, California, USA
| |
Collapse
|
20
|
Lee YL, Bosse M, Mullaart E, Groenen MAM, Veerkamp RF, Bouwman AC. Functional and population genetic features of copy number variations in two dairy cattle populations. BMC Genomics 2020; 21:89. [PMID: 31992181 PMCID: PMC6988284 DOI: 10.1186/s12864-020-6496-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Accepted: 01/14/2020] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Copy Number Variations (CNVs) are gain or loss of DNA segments that are known to play a role in shaping a wide range of phenotypes. In this study, we used two dairy cattle populations, Holstein Friesian and Jersey, to discover CNVs using the Illumina BovineHD Genotyping BeadChip aligned to the ARS-UCD1.2 assembly. The discovered CNVs were investigated for their functional impact and their population genetics features. RESULTS We discovered 14,272 autosomal CNVs, which were aggregated into 1755 CNV regions (CNVR) from 451 animals. These CNVRs together cover 2.8% of the bovine autosomes. The assessment of the functional impact of CNVRs showed that rare CNVRs (MAF < 0.01) are more likely to overlap with genes, than common CNVRs (MAF ≥ 0.05). The Population differentiation index (Fst) based on CNVRs revealed multiple highly diverged CNVRs between the two breeds. Some of these CNVRs overlapped with candidate genes such as MGAM and ADAMTS17 genes, which are related to starch digestion and body size, respectively. Lastly, linkage disequilibrium (LD) between CNVRs and BovineHD BeadChip SNPs was generally low, close to 0, although common deletions (MAF ≥ 0.05) showed slightly higher LD (r2 = ~ 0.1 at 10 kb distance) than the rest. Nevertheless, this LD is still lower than SNP-SNP LD (r2 = ~ 0.5 at 10 kb distance). CONCLUSIONS Our analyses showed that CNVRs detected using BovineHD BeadChip arrays are likely to be functional. This finding indicates that CNVs can potentially disrupt the function of genes and thus might alter phenotypes. Also, the population differentiation index revealed two candidate genes, MGAM and ADAMTS17, which hint at adaptive evolution between the two populations. Lastly, low CNVR-SNP LD implies that genetic variation from CNVs might not be fully captured in routine animal genetic evaluation, which relies solely on SNP markers.
Collapse
Affiliation(s)
- Young-Lim Lee
- Wageningen University & Research, Animal Breeding and Genomics, P.O. Box 338, Wageningen, AH, 6700, the Netherlands.
| | - Mirte Bosse
- Wageningen University & Research, Animal Breeding and Genomics, P.O. Box 338, Wageningen, AH, 6700, the Netherlands
| | | | - Martien A M Groenen
- Wageningen University & Research, Animal Breeding and Genomics, P.O. Box 338, Wageningen, AH, 6700, the Netherlands
| | - Roel F Veerkamp
- Wageningen University & Research, Animal Breeding and Genomics, P.O. Box 338, Wageningen, AH, 6700, the Netherlands
| | - Aniek C Bouwman
- Wageningen University & Research, Animal Breeding and Genomics, P.O. Box 338, Wageningen, AH, 6700, the Netherlands
| |
Collapse
|
21
|
da Silva VH, Laine VN, Bosse M, Spurgin LG, Derks MFL, van Oers K, Dibbits B, Slate J, Crooijmans RPMA, Visser ME, Groenen MAM. The Genomic Complexity of a Large Inversion in Great Tits. Genome Biol Evol 2020; 11:1870-1881. [PMID: 31114855 PMCID: PMC6609730 DOI: 10.1093/gbe/evz106] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/16/2019] [Indexed: 12/11/2022] Open
Abstract
Chromosome inversions have clear effects on genome evolution and have been associated with speciation, adaptation, and the evolution of the sex chromosomes. In birds, these inversions may play an important role in hybridization of species and disassortative mating. We identified a large (≈64 Mb) inversion polymorphism in the great tit (Parus major) that encompasses almost 1,000 genes and more than 90% of Chromosome 1A. The inversion occurs at a low frequency in a set of over 2,300 genotyped great tits in the Netherlands with only 5% of the birds being heterozygous for the inversion. In an additional analysis of 29 resequenced birds from across Europe, we found two heterozygotes. The likely inversion breakpoints show considerable genomic complexity, including multiple copy number variable segments. We identified different haplotypes for the inversion, which differ in the degree of recombination in the center of the chromosome. Overall, this remarkable genetic variant is widespread among distinct great tit populations and future studies of the inversion haplotype, including how it affects the fitness of carriers, may help to understand the mechanisms that maintain it.
Collapse
Affiliation(s)
- Vinicius H da Silva
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands.,Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands.,Swedish University of Agricultural Sciences (SLU), Uppsala, Sweden
| | - Veronika N Laine
- Department of Molecular and Cellular Biology, Harvard University
| | - Mirte Bosse
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Lewis G Spurgin
- School of Biological Sciences, University of East Anglia, Norwich Research Park University of East Anglia, Norwich, United Kingdom
| | - Martijn F L Derks
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Kees van Oers
- Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands
| | - Bert Dibbits
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| | - Jon Slate
- Department of Animal and Plant Sciences, The University of Sheffield, Sheffield, United Kingdom
| | | | - Marcel E Visser
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands.,Department of Animal Ecology, Netherlands Institute of Ecology (NIOO-KNAW), Wageningen, The Netherlands
| | - Martien A M Groenen
- Animal Breeding and Genomics, Wageningen University & Research, Wageningen, The Netherlands
| |
Collapse
|
22
|
Ahsan T, Urmi NJ, Sajib AA. Heterogeneity in the distribution of 159 drug-response related SNPs in world populations and their genetic relatedness. PLoS One 2020; 15:e0228000. [PMID: 31971968 PMCID: PMC6977754 DOI: 10.1371/journal.pone.0228000] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 01/03/2020] [Indexed: 12/25/2022] Open
Abstract
Interethnic variability in drug response arises from genetic differences associated with drug metabolism, action and transport. These genetic variations can affect drug efficacy as well as cause adverse drug reactions (ADRs). We retrieved drug-response related single nucleotide polymorphism (SNP) associated data from databases and analyzed to elucidate population specific distribution of 159 drug-response related SNPs in twenty six populations belonging to five super-populations (African, Admixed Americans, East Asian, European and South Asian). Significant interpopulation differences exist in the minor (variant) allele frequencies (MAFs), linkage disequilibrium (LD) and haplotype distributions among these populations. 65 of the drug-response related alleles, which are considered as minor (variant) in global population, are present as the major alleles (frequency ≥0.5) in at least one or more populations. Populations that belong to the same super-population have similar distribution pattern for majority of the variant alleles. These drug response related variant allele frequencies and their pairwise LD measure (r2) can clearly distinguish the populations in a way that correspond to the known evolutionary history of human and current geographic distributions, while D' cannot. The data presented here may aid in identifying drugs that are more appropriate and/or require pharmacogenetic testing in these populations. Our findings emphasize on the importance of distinct, ethnicity-specific clinical guidelines, especially for the African populations, to avoid ADRs and ensure effective drug treatment.
Collapse
Affiliation(s)
- Tamim Ahsan
- Department of Genetic Engineering & Biotechnology, Bangabandhu Sheikh Mujibur Rahman Maritime University, Dhaka, Bangladesh
| | | | - Abu Ashfaqur Sajib
- Department of Genetic Engineering & Biotechnology, University of Dhaka, Dhaka, Bangladesh
| |
Collapse
|
23
|
Wang H, Zhang F, Zeng J, Wu Y, Kemper KE, Xue A, Zhang M, Powell JE, Goddard ME, Wray NR, Visscher PM, McRae AF, Yang J. Genotype-by-environment interactions inferred from genetic effects on phenotypic variability in the UK Biobank. SCIENCE ADVANCES 2019; 5:eaaw3538. [PMID: 31453325 PMCID: PMC6693916 DOI: 10.1126/sciadv.aaw3538] [Citation(s) in RCA: 102] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 07/11/2019] [Indexed: 05/17/2023]
Abstract
Genotype-by-environment interaction (GEI) is a fundamental component in understanding complex trait variation. However, it remains challenging to identify genetic variants with GEI effects in humans largely because of the small effect sizes and the difficulty of monitoring environmental fluctuations. Here, we demonstrate that GEI can be inferred from genetic variants associated with phenotypic variability in a large sample without the need of measuring environmental factors. We performed a genome-wide variance quantitative trait locus (vQTL) analysis of ~5.6 million variants on 348,501 unrelated individuals of European ancestry for 13 quantitative traits in the UK Biobank and identified 75 significant vQTLs with P < 2.0 × 10-9 for 9 traits, especially for those related to obesity. Direct GEI analysis with five environmental factors showed that the vQTLs were strongly enriched with GEI effects. Our results indicate pervasive GEI effects for obesity-related traits and demonstrate the detection of GEI without environmental data.
Collapse
Affiliation(s)
- Huanwei Wang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Futao Zhang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Jian Zeng
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Yang Wu
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Kathryn E. Kemper
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Angli Xue
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Min Zhang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Joseph E. Powell
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
- Garvan-Weizmann Centre for Cellular Genomics, Garvan Institute for Medical Research, Sydney, New South Wales 2010, Australia
- Faculty of Medicine, University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Michael E. Goddard
- Faculty of Veterinary and Agricultural Science, University of Melbourne, Parkville, Victoria, Australia
- Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, Victoria, Australia
| | - Naomi R. Wray
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
- Queensland Brain Institute, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Peter M. Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
- Queensland Brain Institute, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Allan F. McRae
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Jian Yang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland 4072, Australia
- Institute for Advanced Research, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
| |
Collapse
|
24
|
Berihulay H, Islam R, Jiang L, Ma Y. Genome-Wide Linkage Disequilibrium and the Extent of Effective Population Sizes in Six Chinese Goat Populations Using a 50K Single Nucleotide Polymorphism Panel. Animals (Basel) 2019; 9:ani9060350. [PMID: 31200540 PMCID: PMC6617254 DOI: 10.3390/ani9060350] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2019] [Revised: 05/20/2019] [Accepted: 05/30/2019] [Indexed: 12/25/2022] Open
Abstract
Simple Summary Information on linkage disequilibrium (LD) and the extent of effective population size (Ne) has important implications for exploring the degree of biological diversity, for predicting underlying selection pressure, and for designing animal breeding programs. In this study, we assessed LD, Ne, and the distribution of minor allele frequency in six goat populations. Accordingly, the results of LD and Ne using a single nucleotide polymorphism (SNP) panel (Caprine SNP 50K BeadChip, Lincoln, NE, USA) are helpful for the sustainable conservation, proper management, and utilization of Chinese goat populations. Abstract Genome-wide linkage disequilibrium is a useful parameter to study quantitative trait locus (QTL) mapping and genetic selection. In many genomic methodologies, effective population size is an important genetic parameter because of its relationship to the loss of genetic variation, increases in inbreeding, the accumulation of mutations, and the effectiveness of selection. In this study, a total of 193 individuals were genotyped to assess the extent of LD and Ne in six Chinese goat populations using the SNP 50K BeadChip. Across the determined autosomal chromosomes, we found an average of 0.02 and 0.23 for r2 and D’ values, respectively. The average r2 between all the populations varied little and ranged from 0.055 r2 for the Jining Grey to 0.128 r2 for the Guangfeng, with an overall mean of 0.083. Across the 29 autosomal chromosomes, minor allele frequency (MAF) was highest on chromosome 1 (0.321) and lowest on chromosome 25 (0.309), with an average MAF of 0.317, and showing the lowest (25.5% for Louping) and highest (28.8% for Qingeda) SNP proportions at MAF values > 0.3. The inbreeding coefficient ranged from 0.064 to 0.085, with a mean of 0.075 for all the autosomes. The Jining Grey and Qingeda populations showed higher Ne estimates, highlighting that these animals could have been influenced by artificial selection. Furthermore, a declining recent Ne was distinguished for the Arbas Cashmere and Guangfeng populations, and their estimated values were closer to 64 and 95, respectively, 13 generations ago, which indicates that these breeds were exposed to strong selection. This study provides an insight into valuable genetic information and will open up the opportunity for further genomic selection analysis of Chinese goat populations.
Collapse
Affiliation(s)
- Haile Berihulay
- Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100193, China.
| | - Rabiul Islam
- Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100193, China.
| | - Lin Jiang
- Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100193, China.
| | - Yuehui Ma
- Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100193, China.
| |
Collapse
|
25
|
Coleman JRI, Bryois J, Gaspar HA, Jansen PR, Savage JE, Skene N, Plomin R, Muñoz-Manchado AB, Linnarsson S, Crawford G, Hjerling-Leffler J, Sullivan PF, Posthuma D, Breen G. Biological annotation of genetic loci associated with intelligence in a meta-analysis of 87,740 individuals. Mol Psychiatry 2019; 24:182-197. [PMID: 29520040 PMCID: PMC6330082 DOI: 10.1038/s41380-018-0040-6] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Revised: 11/13/2017] [Accepted: 01/02/2018] [Indexed: 11/08/2022]
Abstract
Variance in IQ is associated with a wide range of health outcomes, and 1% of the population are affected by intellectual disability. Despite a century of research, the fundamental neural underpinnings of intelligence remain unclear. We integrate results from genome-wide association studies (GWAS) of intelligence with brain tissue and single cell gene expression data to identify tissues and cell types associated with intelligence. GWAS data for IQ (N = 78,308) were meta-analyzed with a study comparing 1247 individuals with mean IQ ~170 to 8185 controls. Genes associated with intelligence implicate pyramidal neurons of the somatosensory cortex and CA1 region of the hippocampus, and midbrain embryonic GABAergic neurons. Tissue-specific analyses find the most significant enrichment for frontal cortex brain expressed genes. These results suggest specific neuronal cell types and genes may be involved in intelligence and provide new hypotheses for neuroscience experiments using model systems.
Collapse
Affiliation(s)
- Jonathan R I Coleman
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, SE5 8AF, UK
- NIHR Biomedical Research Centre for Mental Health, South London and Maudsley NHS Trust, London, SE5 8AF, UK
| | - Julien Bryois
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, SE-17177, Sweden
| | - Héléna A Gaspar
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, SE5 8AF, UK
| | - Philip R Jansen
- Department of Complex Trait Genetics, VU University, Center for Neurogenomics and Cognitive Research, Amsterdam, 1081 HV, The Netherlands
- Department of Child and Adolescent Psychiatry, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Jeanne E Savage
- Department of Complex Trait Genetics, VU University, Center for Neurogenomics and Cognitive Research, Amsterdam, 1081 HV, The Netherlands
| | - Nathan Skene
- Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17177, Sweden
| | - Robert Plomin
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, SE5 8AF, UK
| | - Ana B Muñoz-Manchado
- Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17177, Sweden
| | - Sten Linnarsson
- Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17177, Sweden
| | - Greg Crawford
- Center for Genomic and Computational Biology, Duke University, Durham, NC, 27708, USA
- Department of Pediatrics, Medical Genetics Division, Duke University, Durham, NC, 27708, USA
| | - Jens Hjerling-Leffler
- Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, SE-17177, Sweden
| | - Patrick F Sullivan
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, SE-17177, Sweden
- Departments of Genetics, University of North Carolina, Chapel Hill, NC, 27599-7264, USA
| | - Danielle Posthuma
- Department of Complex Trait Genetics, VU University, Center for Neurogenomics and Cognitive Research, Amsterdam, 1081 HV, The Netherlands.
- Department of Clinical Genetics, VU University Medical Center (VUMC), Neuroscience Campus Amsterdam, Amsterdam, 1081 HV, The Netherlands.
| | - Gerome Breen
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, SE5 8AF, UK.
- NIHR Biomedical Research Centre for Mental Health, South London and Maudsley NHS Trust, London, SE5 8AF, UK.
| |
Collapse
|
26
|
Liu Y, Lusk CM, Cho MH, Silverman EK, Qiao D, Zhang R, Scheurer ME, Kheradmand F, Wheeler DA, Tsavachidis S, Armstrong G, Zhu D, Wistuba II, Chow CWB, Behrens C, Pikielny CW, Neslund-Dudas C, Pinney SM, Anderson M, Kupert E, Bailey-Wilson J, Gaba C, Mandal D, You M, de Andrade M, Yang P, Field JK, Liloglou T, Davies M, Lissowska J, Swiatkowska B, Zaridze D, Mukeriya A, Janout V, Holcatova I, Mates D, Milosavljevic S, Scelo G, Brennan P, McKay J, Liu G, Hung RJ, Christiani DC, Schwartz AG, Amos CI, Spitz MR. Rare Variants in Known Susceptibility Loci and Their Contribution to Risk of Lung Cancer. J Thorac Oncol 2018; 13:1483-1495. [PMID: 29981437 PMCID: PMC6366341 DOI: 10.1016/j.jtho.2018.06.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Revised: 06/06/2018] [Accepted: 06/17/2018] [Indexed: 10/28/2022]
Abstract
BACKGROUND Genome-wide association studies are widely used to map genomic regions contributing to lung cancer (LC) susceptibility, but they typically do not identify the precise disease-causing genes/variants. To unveil the inherited genetic variants that cause LC, we performed focused exome-sequencing analyses on genes located in 121 genome-wide association study-identified loci previously implicated in the risk of LC, chronic obstructive pulmonary disease, pulmonary function level, and smoking behavior. METHODS Germline DNA from 260 case patients with LC and 318 controls were sequenced by utilizing VCRome 2.1 exome capture. Filtering was based on enrichment of rare and potential deleterious variants in cases (risk alleles) or controls (protective alleles). Allelic association analyses of single-variant and gene-based burden tests of multiple variants were performed. Promising candidates were tested in two independent validation studies with a total of 1773 case patients and 1123 controls. RESULTS We identified 48 rare variants with deleterious effects in the discovery analysis and validated 12 of the 43 candidates that were covered in the validation platforms. The top validated candidates included one well-established truncating variant, namely, BRCA2, DNA repair associated gene (BRCA2) K3326X (OR = 2.36, 95% confidence interval [CI]: 1.38-3.99), and three newly identified variations, namely, lymphotoxin beta gene (LTB) p.Leu87Phe (OR = 7.52, 95% CI: 1.01-16.56), prolyl 3-hydroxylase 2 gene (P3H2) p.Gln185His (OR = 5.39, 95% CI: 0.75-15.43), and dishevelled associated activator of morphogenesis 2 gene (DAAM2) p.Asp762Gly (OR = 0.25, 95% CI: 0.10-0.79). Burden tests revealed strong associations between zinc finger protein 93 gene (ZNF93), DAAM2, bromodomain containing 9 gene (BRD9), and the gene LTB and LC susceptibility. CONCLUSION Our results extend the catalogue of regions associated with LC and highlight the importance of germline rare coding variants in LC susceptibility.
Collapse
Affiliation(s)
- Yanhong Liu
- Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
| | - Christine M. Lusk
- Karmanos Cancer Institute, Wayne State University, Detroit, MI 48201, USA
| | - Michael H. Cho
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Edwin K. Silverman
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Dandi Qiao
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Ruyang Zhang
- Harvard University School of Public Health, Boston, MA 02115, USA
| | - Michael E. Scheurer
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Farrah Kheradmand
- Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
- Michael E. DeBakey Veterans Affairs Medical Center; Houston, TX 77030, USA
| | - David A. Wheeler
- Department of Molecular and Human Genetics, Human Genome Sequence Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Spiridon Tsavachidis
- Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
| | - Georgina Armstrong
- Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
| | - Dakai Zhu
- Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
- Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX 77030, USA
| | - Ignacio I. Wistuba
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Chi-Wan B. Chow
- Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Carmen Behrens
- Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Claudio W. Pikielny
- Department of Biomedical Data Science, Geisel School of Medicine, Dartmouth College, Lebanon, NH 03755, USA
| | | | - Susan M. Pinney
- University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Marshall Anderson
- University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | - Elena Kupert
- University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
| | | | - Colette Gaba
- The University of Toledo College of Medicine, Toledo, OH 43614, USA
| | - Diptasri Mandal
- Louisiana State University Health Sciences Center, New Orleans, LA 70112, USA
| | - Ming You
- Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | | | - Ping Yang
- Mayo Clinic College of Medicine, Rochester, MN 55905, USA
| | - John K. Field
- Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Liverpool, UK
| | - Triantafillos Liloglou
- Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Liverpool, UK
| | - Michael Davies
- Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Liverpool, UK
| | - Jolanta Lissowska
- The M. Sklodowska-Curie Institute of Oncology Center, Warsaw 02781, Poland
| | - Beata Swiatkowska
- Nofer Institute of Occupational Medicine, Department of Environmental Epidemiology, Lodz 91348, Poland
| | - David Zaridze
- Russian N.N. Blokhin Cancer Research Centre, Moscow 115478, Russian Federation
| | - Anush Mukeriya
- Russian N.N. Blokhin Cancer Research Centre, Moscow 115478, Russian Federation
| | - Vladimir Janout
- Faculty of Health Sciences, Palacky University, Olomouc 77515, Czech Republic
| | - Ivana Holcatova
- Institute of Public Health and Preventive Medicine, Charles University, 2nd Faculty of Medicine, Prague 12800, Czech Republic
| | - Dana Mates
- National Institute of Public Health, Bucharest 050463, Romania
| | - Sasa Milosavljevic
- International Organization for Cancer Prevention and Research (IOCPR), Belgrade, Serbia
| | | | - Paul Brennan
- International Agency for Research on Cancer, Lyon, France
| | - James McKay
- International Agency for Research on Cancer, Lyon, France
| | - Geoffrey Liu
- Princess Margaret Cancer Center, Toronto, ON, M5G 2M9, Canada
| | - Rayjean J. Hung
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, M5G 1X5 Canada
| | | | | | - Ann G. Schwartz
- Karmanos Cancer Institute, Wayne State University, Detroit, MI 48201, USA
| | - Christopher I Amos
- Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
- Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX 77030, USA
| | - Margaret R. Spitz
- Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
27
|
Alvarenga AB, Rovadoscki GA, Petrini J, Coutinho LL, Morota G, Spangler ML, Pinto LFB, Carvalho GGP, Mourão GB. Linkage disequilibrium in Brazilian Santa Inês breed, Ovis aries. Sci Rep 2018; 8:8851. [PMID: 29892085 PMCID: PMC5995818 DOI: 10.1038/s41598-018-27259-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 05/25/2018] [Indexed: 11/13/2022] Open
Abstract
For genomic selection to be successful, there must be sufficient linkage disequilibrium between the markers and the causal mutations. The objectives of this study were to evaluate the extent of LD in ovine using the Santa Inês breed and to infer the minimum number of markers required to reach reasonable prediction accuracy. In total, 38,168 SNPs and 395 samples were used. The mean LD between adjacent marker pairs measured by r2 and |D′| were 0.166 and 0.617, respectively. LD values between adjacent marker pairs ranged from 0.135 to 0.194 and from 0.568 to 0.650 for r2 for |D′| across all chromosomes. The average r2 between all pairwise SNPs on each chromosome was 0.018. SNPs separated by between 0.10 to 0.20 Mb had an estimated average r2 equal to 0.1033. The identified haplotype blocks consisted of 2 to 21 markers. Moreover, estimates of average coefficients of inbreeding and effective population size were 0.04 and 96, respectively. LD estimated in this study was lower than that reported in other species and was characterized by short haplotype blocks. Our results suggest that the use of a higher density SNP panel is recommended for the implementation of genomic selection in the Santa Inês breed.
Collapse
Affiliation(s)
- Amanda Botelho Alvarenga
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, SP, Brazil
| | - Gregori Alberto Rovadoscki
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, SP, Brazil
| | - Juliana Petrini
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, SP, Brazil
| | - Luiz Lehmann Coutinho
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, SP, Brazil
| | - Gota Morota
- Department of Animal Science, University of Nebraska, Lincoln, NE, USA
| | | | | | | | - Gerson Barreto Mourão
- Department of Animal Science, University of São Paulo (USP)/Luiz de Queiroz College of Agriculture (ESALQ), Piracicaba, SP, Brazil.
| |
Collapse
|
28
|
Davoudi P, Abdollahi-Arpanahi R, Nejati-Javaremi A. The impact of QTL allele frequency distribution on the accuracy of genomic prediction. Arch Anim Breed 2018. [DOI: 10.5194/aab-61-207-2018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract. The accuracy of genomic prediction of quantitative traits
based on single nucleotide polymorphism (SNP) markers depends among other factors on the allele frequency
distribution of quantitative trait loci (QTL). Therefore, the aim of this
study was to investigate different QTL allele frequency distributions and
their effect on the accuracy of genomic estimated breeding values (GEBVs)
using best linear unbiased genomic prediction (GBLUP) in simulated data. A
population of 1000 individuals composed of 500 males and 500 females as well
as a genome of 1000 cM consisting of 10 chromosomes and with a mutation
rate of 2.5 × 10−5 per locus was simulated. QTL frequencies
were derived from five distributions of allele frequency including
constant, uniform, U-shaped, L-shaped and minor allele frequency (MAF) less
than 0.01 (lowMAF). QTL effects were generated from a standard normal
distribution. The number of QTL was assumed to be 500, and the simulation was
done in 10 replications. The genomic prediction accuracy in the first-validation generation in constant, and the uniform allele frequency distribution
was 0.59 and 0.57, respectively. Results showed that the highest accuracy of
GEBVs was obtained with constant and uniform distributions followed by L-shaped, U-shaped
and lowMAF QTL allele frequency distribution. The regression of true
breeding values on predicted breeding values in the first-validation generation
was 0.94, 0.92, 0.88, 0.85 and 0.75 for constant, uniform, L-shaped,
U-shaped and lowMAF distributions, respectively. Depite different values
of regression coefficients, in all scenarios GEBVs are biased downward.
Overall, results showed that when QTL had a lower MAF relative to SNP
markers, a low linkage disequilibrium (LD) was observed, which had a negative
effect on the accuracy of GEBVs. Hence, the effect of the QTL allele frequency
distribution on prediction accuracy can be alleviated through using a genomic
relationship weighted by MAF or an LD-adjusted relationship matrix.
Collapse
|
29
|
Selection for long and short sleep duration in Drosophila melanogaster reveals the complex genetic network underlying natural variation in sleep. PLoS Genet 2017; 13:e1007098. [PMID: 29240764 PMCID: PMC5730107 DOI: 10.1371/journal.pgen.1007098] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 11/01/2017] [Indexed: 12/16/2022] Open
Abstract
Why do some individuals need more sleep than others? Forward mutagenesis screens in flies using engineered mutations have established a clear genetic component to sleep duration, revealing mutants that convey very long or short sleep. Whether such extreme long or short sleep could exist in natural populations was unknown. We applied artificial selection for high and low night sleep duration to an outbred population of Drosophila melanogaster for 13 generations. At the end of the selection procedure, night sleep duration diverged by 9.97 hours in the long and short sleeper populations, and 24-hour sleep was reduced to 3.3 hours in the short sleepers. Neither long nor short sleeper lifespan differed appreciably from controls, suggesting little physiological consequences to being an extreme long or short sleeper. Whole genome sequence data from seven generations of selection revealed several hundred thousand changes in allele frequencies at polymorphic loci across the genome. Combining the data from long and short sleeper populations across generations in a logistic regression implicated 126 polymorphisms in 80 candidate genes, and we confirmed three of these genes and a larger genomic region with mutant and chromosomal deficiency tests, respectively. Many of these genes could be connected in a single network based on previously known physical and genetic interactions. Candidate genes have known roles in several classic, highly conserved developmental and signaling pathways—EGFR, Wnt, Hippo, and MAPK. The involvement of highly pleiotropic pathway genes suggests that sleep duration in natural populations can be influenced by a wide variety of biological processes, which may be why the purpose of sleep has been so elusive. One of the biggest mysteries in biology is the need to sleep. Sleep duration has an underlying genetic basis, suggesting that very long and short sleep times could be bred for experimentally. How far can sleep duration be driven up or down? Here we achieved extremely long and short night sleep duration by subjecting a wild-derived population of Drosophila melanogaster to an experimental breeding program. At the end of the breeding program, long sleepers averaged 9.97 hours more nightly sleep than short sleepers. We analyzed whole-genome sequences from seven generations of the experimental breeding to identify allele frequencies that diverged between long and short sleepers, and verified genes and genomic regions with mutation and deficiency testing. These alleles map to classic developmental and signaling pathways, implicating many diverse processes that potentially affect sleep duration.
Collapse
|
30
|
Broeckx BJG, Derrien T, Mottier S, Wucher V, Cadieu E, Hédan B, Le Béguec C, Botherel N, Lindblad-Toh K, Saunders JH, Deforce D, André C, Peelman L, Hitte C. An exome sequencing based approach for genome-wide association studies in the dog. Sci Rep 2017; 7:15680. [PMID: 29142306 PMCID: PMC5688105 DOI: 10.1038/s41598-017-15947-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 11/04/2017] [Indexed: 12/12/2022] Open
Abstract
Genome-wide association studies (GWAS) are widely used to identify loci associated with phenotypic traits in the domestic dog that has emerged as a model for Mendelian and complex traits. However, a disadvantage of GWAS is that it always requires subsequent fine-mapping or sequencing to pinpoint causal mutations. Here, we performed whole exome sequencing (WES) and canine high-density (cHD) SNP genotyping of 28 dogs from 3 breeds to compare the SNP and linkage disequilibrium characteristics together with the power and mapping precision of exome-guided GWAS (EG-GWAS) versus cHD-based GWAS. Using simulated phenotypes, we showed that EG-GWAS has a higher power than cHD to detect associations within target regions and less power outside target regions, with power being influenced further by sample size and SNP density. We analyzed two real phenotypes (hair length and furnishing), that are fixed in certain breeds to characterize mapping precision of the known causal mutations. EG-GWAS identified the associated exonic and 3'UTR variants within the FGF5 and RSPO2 genes, respectively, with only a few samples per breed. In conclusion, we demonstrated that EG-GWAS can identify loci associated with Mendelian phenotypes both within and across breeds.
Collapse
Affiliation(s)
- Bart J G Broeckx
- Laboratory of Animal Genetics, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium.
| | - Thomas Derrien
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Stéphanie Mottier
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Valentin Wucher
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Edouard Cadieu
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Benoît Hédan
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Céline Le Béguec
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Nadine Botherel
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Kerstin Lindblad-Toh
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Jimmy H Saunders
- Department of Medical Imaging and Orthopedics, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| | - Dieter Deforce
- Laboratory of Pharmaceutical Biotechnology, Faculty of Pharmaceutical Sciences, Ghent University, Ghent, Belgium
| | - Catherine André
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France
| | - Luc Peelman
- Laboratory of Animal Genetics, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| | - Christophe Hitte
- Institut de Génétique et Développement de Rennes, CNRS-URM6290, Université Rennes1, Rennes, France.
| |
Collapse
|
31
|
Oliveira Júnior GA, Chud TCS, Ventura RV, Garrick DJ, Cole JB, Munari DP, Ferraz JBS, Mullart E, DeNise S, Smith S, da Silva MVGB. Genotype imputation in a tropical crossbred dairy cattle population. J Dairy Sci 2017; 100:9623-9634. [PMID: 28987572 DOI: 10.3168/jds.2017-12732] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Accepted: 08/16/2017] [Indexed: 11/19/2022]
Abstract
The objective of this study was to investigate different strategies for genotype imputation in a population of crossbred Girolando (Gyr × Holstein) dairy cattle. The data set consisted of 478 Girolando, 583 Gyr, and 1,198 Holstein sires genotyped at high density with the Illumina BovineHD (Illumina, San Diego, CA) panel, which includes ∼777K markers. The accuracy of imputation from low (20K) and medium densities (50K and 70K) to the HD panel density and from low to 50K density were investigated. Seven scenarios using different reference populations (RPop) considering Girolando, Gyr, and Holstein breeds separately or combinations of animals of these breeds were tested for imputing genotypes of 166 randomly chosen Girolando animals. The population genotype imputation were performed using FImpute. Imputation accuracy was measured as the correlation between observed and imputed genotypes (CORR) and also as the proportion of genotypes that were imputed correctly (CR). This is the first paper on imputation accuracy in a Girolando population. The sample-specific imputation accuracies ranged from 0.38 to 0.97 (CORR) and from 0.49 to 0.96 (CR) imputing from low and medium densities to HD, and 0.41 to 0.95 (CORR) and from 0.50 to 0.94 (CR) for imputation from 20K to 50K. The CORRanim exceeded 0.96 (for 50K and 70K panels) when only Girolando animals were included in RPop (S1). We found smaller CORRanim when Gyr (S2) was used instead of Holstein (S3) as RPop. The same behavior was observed between S4 (Gyr + Girolando) and S5 (Holstein + Girolando) because the target animals were more related to the Holstein population than to the Gyr population. The highest imputation accuracies were observed for scenarios including Girolando animals in the reference population, whereas using only Gyr animals resulted in low imputation accuracies, suggesting that the haplotypes segregating in the Girolando population had a greater effect on accuracy than the purebred haplotypes. All chromosomes had similar imputation accuracies (CORRsnp) within each scenario. Crossbred animals (Girolando) must be included in the reference population to provide the best imputation accuracies.
Collapse
Affiliation(s)
- Gerson A Oliveira Júnior
- Departamento de Medicina Veterinária, Universidade de São Paulo (USP), Faculdade de Zootecnia e Engenharia de Alimentos, Pirassununga, SP, 13635-900, Brazil
| | - Tatiane C S Chud
- Departamento de Ciências Exatas, Universidade Estadual Paulista (Unesp), Faculdade de Ciências Agrárias e Veterinárias, Jaboticabal, SP, 14884-900, Brazil
| | - Ricardo V Ventura
- Beef Improvement Opportunities, Guelph, ON N1K1E5, Canada; Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, ON N1G2W1, Canada
| | - Dorian J Garrick
- Department of Animal Science, Iowa State University, Ames 50011-3150
| | - John B Cole
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD, 20705-2350
| | - Danísio P Munari
- Departamento de Ciências Exatas, Universidade Estadual Paulista (Unesp), Faculdade de Ciências Agrárias e Veterinárias, Jaboticabal, SP, 14884-900, Brazil
| | - José B S Ferraz
- Departamento de Medicina Veterinária, Universidade de São Paulo (USP), Faculdade de Zootecnia e Engenharia de Alimentos, Pirassununga, SP, 13635-900, Brazil
| | | | | | | | | |
Collapse
|
32
|
Yang J, Zeng J, Goddard ME, Wray NR, Visscher PM. Concepts, estimation and interpretation of SNP-based heritability. Nat Genet 2017; 49:1304-1310. [PMID: 28854176 DOI: 10.1038/ng.3941] [Citation(s) in RCA: 240] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 07/31/2017] [Indexed: 12/17/2022]
Abstract
Narrow-sense heritability (h2) is an important genetic parameter that quantifies the proportion of phenotypic variance in a trait attributable to the additive genetic variation generated by all causal variants. Estimation of h2 previously relied on closely related individuals, but recent developments allow estimation of the variance explained by all SNPs used in a genome-wide association study (GWAS) in conventionally unrelated individuals, that is, the SNP-based heritability (). In this Perspective, we discuss recently developed methods to estimate for a complex trait (and genetic correlation between traits) using individual-level or summary GWAS data. We discuss issues that could influence the accuracy of , definitions, assumptions and interpretations of the models, and pitfalls of misusing the methods and misinterpreting the models and results.
Collapse
Affiliation(s)
- Jian Yang
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia.,Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia
| | - Jian Zeng
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia
| | - Michael E Goddard
- Faculty of Veterinary and Agricultural Science, University of Melbourne, Parkville, Victoria, Australia.,Biosciences Research Division, Department of Economic Development, Jobs, Transport and Resources, Bundoora, Victoria, Australia
| | - Naomi R Wray
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia.,Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, Australia.,Queensland Brain Institute, The University of Queensland, Brisbane, Queensland, Australia
| |
Collapse
|
33
|
Visscher PM, Wray NR, Zhang Q, Sklar P, McCarthy MI, Brown MA, Yang J. 10 Years of GWAS Discovery: Biology, Function, and Translation. Am J Hum Genet 2017; 101:5-22. [PMID: 28686856 DOI: 10.1016/j.ajhg.2017.06.005] [Citation(s) in RCA: 1924] [Impact Index Per Article: 274.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Application of the experimental design of genome-wide association studies (GWASs) is now 10 years old (young), and here we review the remarkable range of discoveries it has facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics. We predict the likely discoveries in the next 10 years, when GWASs will be based on millions of samples with array data imputed to a large fully sequenced reference panel and on hundreds of thousands of samples with whole-genome sequencing data.
Collapse
|
34
|
Brito LF, McEwan JC, Miller SP, Pickering NK, Bain WE, Dodds KG, Schenkel FS, Clarke SM. Genetic diversity of a New Zealand multi-breed sheep population and composite breeds' history revealed by a high-density SNP chip. BMC Genet 2017; 18:25. [PMID: 28288558 PMCID: PMC5348757 DOI: 10.1186/s12863-017-0492-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2016] [Accepted: 03/07/2017] [Indexed: 12/22/2022] Open
Abstract
Background Knowledge about the genetic diversity of a population is a crucial parameter for the implementation of successful genomic selection and conservation of genetic resources. The aim of this research was to establish the scientific basis for the implementation of genomic selection in a composite Terminal sheep breeding scheme by providing consolidated linkage disequilibrium (LD) measures across SNP markers, estimating consistency of gametic phase between breed-groups, and assessing genetic diversity measures, such as effective population size (Ne), and population structure parameters, using a large number of animals (n = 14,845) genotyped with a high density SNP chip (606,006 markers). Information generated in this research will be useful for optimizing molecular breeding values predictions and managing the available genetic resources. Results Overall, as expected, levels of pairwise LD decreased with increasing distance between SNP pairs. The mean LD r2 between adjacent SNP was 0.26 ± 0.10. The most recent effective population size for all animals (687) and separately per breed-groups: Primera (974), Lamb Supreme (380), Texel (227) and Dual-Purpose (125) was quite variable. The genotyped animals were outbred or had an average low level of inbreeding. Consistency of gametic phase was higher than 0.94 for all breed pairs at the average distance between SNP on the chip (~4.74 kb). Moreover, there was not a clear separation between the breed-groups based on principal component analysis, suggesting that a mixed-breed training population for calculation of molecular breeding values would be beneficial. Conclusions This study reports, for the first time, estimates of linkage disequilibrium, genetic diversity and population structure parameters from a genome-wide perspective in New Zealand Terminal Sire composite sheep breeds. The levels of linkage disequilibrium indicate that genomic selection could be implemented with the high density SNP panel. The moderate to high consistency of gametic phase between breed-groups and overlapping population structure support the pooling of the animals in a mixed training population for genomic predictions. In addition, the moderate to high Ne highlights the need to genotype and phenotype a large training population in order to capture most of the haplotype diversity and increase accuracies of genomic predictions. The results reported herein are a first step toward understanding the genomic architecture of a Terminal Sire composite sheep population and for the optimal implementation of genomic selection and genome-wide association studies in this sheep population. Electronic supplementary material The online version of this article (doi:10.1186/s12863-017-0492-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Luiz F Brito
- Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, N1G 2W1, Canada. .,AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand.
| | - John C McEwan
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Stephen P Miller
- Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, N1G 2W1, Canada.,AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | | | - Wendy E Bain
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Ken G Dodds
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| | - Flávio S Schenkel
- Centre for Genetic Improvement of Livestock, University of Guelph, Guelph, N1G 2W1, Canada
| | - Shannon M Clarke
- AgResearch, Invermay Agricultural Centre, Private Bag 50034, Mosgiel, 9053, New Zealand
| |
Collapse
|
35
|
Abstract
Gametic phase disequilibrium is the nonrandom association of alleles within gametes. Linkage disequilibrium (LD) describes the special case of deviation from independence between alleles at two linked genetic loci. Estimation of allelic LD requires knowledge of haplotypes. Genotype-based LD measures dispense with the haplotype estimation step and avoid bias in LD estimation. In this chapter, the most important measures for allelic and genotypic LD are introduced. The use of software packages for LD estimation is illustrated.
Collapse
Affiliation(s)
- Maren Vens
- Institut für Medizinische Biometrie und Epidemiologie, Universitätsklinikum Hamburg-Eppendorf, Martinistraße 52, Hamburg, 20246, Germany.
| | - Andreas Ziegler
- Institut für Medizinische Biometrie und Statistik & Zentrum für klinische Studien, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Ratzeburger Allee 160, Lübeck, 23562, Germany
| |
Collapse
|
36
|
Escott-Price V, Shoai M, Pither R, Williams J, Hardy J. Polygenic score prediction captures nearly all common genetic risk for Alzheimer's disease. Neurobiol Aging 2017; 49:214.e7-214.e11. [DOI: 10.1016/j.neurobiolaging.2016.07.018] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Revised: 07/14/2016] [Accepted: 07/15/2016] [Indexed: 01/25/2023]
|
37
|
Effects of corticotropin-releasing hormone receptor 1 SNPs on major depressive disorder are influenced by sex and smoking status. J Affect Disord 2016; 205:282-288. [PMID: 27544317 DOI: 10.1016/j.jad.2016.08.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/26/2016] [Revised: 07/10/2016] [Accepted: 08/11/2016] [Indexed: 01/24/2023]
Abstract
BACKGROUND The corticotropin-releasing hormone receptor 1 (CRHR1) gene has been repeatedly implicated in Major Depressive Disorder (MDD) in humans and animal models; however, the findings are not absolutely convergent. Since recent evidence from genome-wide association studies suggests that narrowing the phenotypic heterogeneity may be crucial in genetic studies of MDD, the aim of this study was to evaluate the effects of CRHR1 polymorphisms on MDD while addressing the influence of sex and smoking status. METHODS The association of the CRHR1 SNPs rs12944712, rs110402, and rs878886 with MDD was evaluated in 629 Brazilian adults of European descent recruited from the general population [180 (28.6%) with lifetime MDD]. The sample was subdivided according to sex and smoking status RESULTS: Among nonsmokers, there were nominal associations between MDD and all tested SNPs (rs12944712, P=0.042; rs110402, P=0.031, and rs878886, P=0.040), regardless of sex. In addition, there were significant effects of rs110402 in women (Pcorr=0.034) and rs878886 in men (Pcorr=0.013). Among lifetime smokers, there were no significant associations between CRHR1 SNPs and MDD LIMITATIONS: The lack of a depression rating scale; scarcity of information on the functionality of the CRHR1 SNPs; and relatively small sample sizes in some subgroups. CONCLUSIONS Our results strengthen the evidence for the role of CRHR1 SNPs in MDD susceptibility and suggest that their effects may be modulated by sex and smoking status. These findings suggest the perspective that reducing phenotypic heterogeneity is warranted in genetic studies of MDD.
Collapse
|
38
|
Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle. PLoS One 2015; 10:e0143945. [PMID: 26642058 PMCID: PMC4671594 DOI: 10.1371/journal.pone.0143945] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 11/11/2015] [Indexed: 11/19/2022] Open
Abstract
The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of “missing heritability”–the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF>0.05), 102,549 uncommon variants (0.01<MAF<0.05), and 83,856 rare variants (MAF<0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits, however very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits.
Collapse
|
39
|
Berger S, Schlather M, de los Campos G, Weigend S, Preisinger R, Erbe M, Simianer H. A Scale-Corrected Comparison of Linkage Disequilibrium Levels between Genic and Non-Genic Regions. PLoS One 2015; 10:e0141216. [PMID: 26517830 PMCID: PMC4627745 DOI: 10.1371/journal.pone.0141216] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 10/06/2015] [Indexed: 12/27/2022] Open
Abstract
The understanding of non-random association between loci, termed linkage disequilibrium (LD), plays a central role in genomic research. Since causal mutations are generally not included in genomic marker data, LD between those and available markers is essential for capturing the effects of causal loci on localizing genes responsible for traits. Thus, the interpretation of association studies requires a detailed knowledge of LD patterns. It is well known that most LD measures depend on minor allele frequencies (MAF) of the considered loci and the magnitude of LD is influenced by the physical distances between loci. In the present study, a procedure to compare the LD structure between genomic regions comprising several markers each is suggested. The approach accounts for different scaling factors, namely the distribution of MAF, the distribution of pair-wise differences in MAF, and the physical extent of compared regions, reflected by the distribution of pair-wise physical distances. In the first step, genomic regions are matched based on similarity in these scaling factors. In the second step, chromosome- and genome-wide significance tests for differences in medians of LD measures in each pair are performed. The proposed framework was applied to test the hypothesis that the average LD is different in genic and non-genic regions. This was tested with a genome-wide approach with data sets for humans (Homo sapiens), a highly selected chicken line (Gallus gallus domesticus) and the model plant Arabidopsis thaliana. In all three data sets we found a significantly higher level of LD in genic regions compared to non-genic regions. About 31% more LD was detected genome-wide in genic compared to non-genic regions in Arabidopsis thaliana, followed by 13.6% in human and 6% chicken. Chromosome-wide comparison discovered significant differences on all 5 chromosomes in Arabidopsis thaliana and on one third of the human and of the chicken chromosomes.
Collapse
Affiliation(s)
- Swetlana Berger
- Animal Breeding and Genetics Group, Department of Animal Sciences, Georg-August-University, Goettingen, Germany
| | - Martin Schlather
- School of Business Informatics and Mathematics, University of Mannheim, Mannheim, Germany
| | - Gustavo de los Campos
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, United States of America
| | - Steffen Weigend
- Institut of Farm Animal Genetics, Friedrich-Loeffler Institut, Neustadt-Mariensee, Germany
| | | | - Malena Erbe
- Animal Breeding and Genetics Group, Department of Animal Sciences, Georg-August-University, Goettingen, Germany
| | - Henner Simianer
- Animal Breeding and Genetics Group, Department of Animal Sciences, Georg-August-University, Goettingen, Germany
| |
Collapse
|
40
|
Bolormaa S, Gore K, van der Werf JHJ, Hayes BJ, Daetwyler HD. Design of a low-density SNP chip for the main Australian sheep breeds and its effect on imputation and genomic prediction accuracy. Anim Genet 2015; 46:544-56. [DOI: 10.1111/age.12340] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/03/2015] [Indexed: 12/20/2022]
Affiliation(s)
- S. Bolormaa
- AgriBio; Centre for AgriBioscience; DEDJTR; Bundoora VIC 3083 Australia
- Cooperative Research Centre for Sheep Industry Innovation; Armidale NSW 2351 Australia
| | - K. Gore
- School of Environmental and Rural Science; University of New England; Armidale NSW 2351 Australia
| | - J. H. J. van der Werf
- Cooperative Research Centre for Sheep Industry Innovation; Armidale NSW 2351 Australia
- School of Environmental and Rural Science; University of New England; Armidale NSW 2351 Australia
| | - B. J. Hayes
- AgriBio; Centre for AgriBioscience; DEDJTR; Bundoora VIC 3083 Australia
- Cooperative Research Centre for Sheep Industry Innovation; Armidale NSW 2351 Australia
- School of Applied Systems Biology; La Trobe University; Bundoora VIC 3086 Australia
| | - H. D. Daetwyler
- AgriBio; Centre for AgriBioscience; DEDJTR; Bundoora VIC 3083 Australia
- Cooperative Research Centre for Sheep Industry Innovation; Armidale NSW 2351 Australia
- School of Applied Systems Biology; La Trobe University; Bundoora VIC 3086 Australia
| |
Collapse
|
41
|
Sieradzka D, Power RA, Freeman D, Cardno AG, Dudbridge F, Ronald A. Heritability of Individual Psychotic Experiences Captured by Common Genetic Variants in a Community Sample of Adolescents. Behav Genet 2015; 45:493-502. [PMID: 26049723 PMCID: PMC4561057 DOI: 10.1007/s10519-015-9727-5] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 05/28/2015] [Indexed: 11/28/2022]
Abstract
Occurrence of psychotic experiences is common amongst adolescents in the general population. Twin studies suggest that a third to a half of variance in adolescent psychotic experiences is explained by genetic influences. Here we test the extent to which common genetic variants account for some of the twin-based heritability. Psychotic experiences were assessed with the Specific Psychotic Experiences Questionnaire in a community sample of 2152 16-year-olds. Self-reported measures of Paranoia, Hallucinations, Cognitive Disorganization, Grandiosity, Anhedonia, and Parent-rated Negative Symptoms were obtained. Estimates of SNP heritability were derived and compared to the twin heritability estimates from the same sample. Three approaches to genome-wide restricted maximum likelihood (GREML) analyses were compared: (1) standard GREML performed on full genome-wide data; (2) GREML stratified by minor allele frequency (MAF); and (3) GREML performed on pruned data. The standard GREML revealed a significant SNP heritability of 20 % for Anhedonia (SE = 0.12; p < 0.046) and an estimate of 19 % for Cognitive Disorganization, which was close to significant (SE = 0.13; p < 0.059). Grandiosity and Paranoia showed modest SNP heritability estimates (17 %; SE = 0.13 and 14 %; SE = 0.13, respectively, both n.s.), and zero estimates were found for Hallucinations and Negative Symptoms. The estimates for Anhedonia, Cognitive Disorganization and Grandiosity accounted for approximately half the previously reported twin heritability. SNP heritability estimates from the MAF-stratified approach were mostly consistent with the standard estimates and offered additional information about the distribution of heritability across the MAF range of the SNPs. In contrast, the estimates derived from the pruned data were for the most part not consistent with the other two approaches. It is likely that the difference seen in the pruned estimates was driven by the loss of tagged causal variants, an issue fundamental to this approach. The current results suggest that common genetic variants play a role in the etiology of some adolescent psychotic experiences, however further research on larger samples is desired and the use of MAF-stratified approach recommended.
Collapse
Affiliation(s)
- Dominika Sieradzka
- Centre for Brain and Cognitive Development, Birkbeck, University of London, 32 Torrington Square, London, WC1E 7HX, UK,
| | | | | | | | | | | |
Collapse
|
42
|
Yang G, Zhou H, Wang R, Hickford J. Variation in the ovine PRKAG3 gene. Gene 2015; 567:251-4. [PMID: 25967386 DOI: 10.1016/j.gene.2015.05.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Revised: 04/08/2015] [Accepted: 05/02/2015] [Indexed: 10/23/2022]
Abstract
The 5'AMP-activated protein kinase (AMPK) is a heterotrimeric enzyme that controls cellular energy homeostasis in response to environmental or nutritional stress. The PRKAG3 gene (PRKAG3) encodes the γ3 subunit of the AMPK. Variation in this gene has been found to be associated with meat quality traits in pigs. In this study, we used polymerase chain reaction-single stranded conformational polymorphism (PCR-SSCP) to investigate variation in exon 3 and exons 4-6 of ovine PRKAG3. In 160 New Zealand Suffolk sheep, two variant sequences (named a and b) were identified in the exon 3 region of the gene and three variant sequences (named A, B and C) were identified in the exon 4-6 region of the gene, respectively. A total of three nucleotide substitutions were revealed and these were located in intron 4, exon 4 and intron 5, respectively. The nucleotide substitution identified in the exon 4 (g.2656 C>T) could nominally lead to an amino acid substitution of tryptophan to arginine at position 230 (R230W) in ovine PRKAG3. In comparison with the PRKAG3 amino acid sequences in other species, this R230W substitution appeared to occur only in sheep. This is the first report of genetic variation in ovine PRKAG3, and the variation found in this study could be functionally important for AMPK activity, which in turn may affect meat quality traits in sheep.
Collapse
Affiliation(s)
- Guo Yang
- Cold and Arid Regions Environmental and Engineering Research Institute, Chinese Academy of Sciences, Lanzhou 730000, China; Gene-Marker Laboratory, Faculty of Agriculture and Life Sciences, P.O. Box 84, Lincoln University, Lincoln 7647, New Zealand
| | - Huitong Zhou
- Gene-Marker Laboratory, Faculty of Agriculture and Life Sciences, P.O. Box 84, Lincoln University, Lincoln 7647, New Zealand
| | - Ruoyu Wang
- Cold and Arid Regions Environmental and Engineering Research Institute, Chinese Academy of Sciences, Lanzhou 730000, China.
| | - Jon Hickford
- Gene-Marker Laboratory, Faculty of Agriculture and Life Sciences, P.O. Box 84, Lincoln University, Lincoln 7647, New Zealand.
| |
Collapse
|
43
|
A replication study for association of 53 single nucleotide polymorphisms in ScoliScore test with adolescent idiopathic scoliosis in French-Canadian population. Spine (Phila Pa 1976) 2015; 40:537-43. [PMID: 25646748 DOI: 10.1097/brs.0000000000000807] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
STUDY DESIGN A replication association study that used genomic data generated from French-Canadian case and control cohorts. OBJECTIVES To determine whether the 53 single nucleotide polymorphisms (SNPs) that were previously associated with spinal deformity progression in an American Caucasian cohort are similarly associated in French-Canadian population. SUMMARY OF BACKGROUND DATA It is widely accepted that genetic factors contribute to adolescent idiopathic scoliosis. The identification of genetic variants associated with the predisposition or progression of curvature could facilitate diagnostic/prognostic tool development. Although 53 SNPs have been associated with spinal curve progression in Caucasian cohorts in the United States, these associations were not replicated in a large Japanese population study, arguing that such a discrepancy could be explained by ethnicity, thus raising the importance of a replication study in an independent Caucasian population of European descent. METHODS Genomic data were collected from the French-Canadian population, using the Illumina HumanOmni 2.5M BeadChip. Fifty-two SNPs, tested in ScoliScore or in high linkage disequilibrium with SNPs in the test, were selected to assess their association with scoliosis generally, and with spinal curve progression. One SNP in ScoliScore, rs16909285, could not be evaluated in our Genome-Wide association study. RESULTS None of the SNPs used in ScoliScore were associated with adolescent idiopathic scoliosis curve progression or curve occurrence in French-Canadian population. We evaluated 52 SNPs in severe patients by comparing risk allele frequencies with those in nonsevere patients and with those in control individuals. There was no significant difference between the severe group and the nonsevere group or between the severe group and the control group. CONCLUSION Although the 52 SNPs studied here were previously associated with curve progression in an American population of European descent, we found no association in French-Canadian patients with adolescent idiopathic scoliosis. This second replication cohort suggests that the lack of association of these SNPs in a Japanese cohort is not due to ethnicity. LEVEL OF EVIDENCE 4.
Collapse
|
44
|
Abstract
We describe the scientific enterprise at the intersection of evolutionary psychology and behavioral genetics-a field that could be termed Evolutionary Behavioral Genetics-and how modern genetic data is revolutionizing our ability to test questions in this field. We first explain how genetically informative data and designs can be used to investigate questions about the evolution of human behavior, and describe some of the findings arising from these approaches. Second, we explain how evolutionary theory can be applied to the investigation of behavioral genetic variation. We give examples of how new data and methods provide insight into the genetic architecture of behavioral variation and what this tells us about the evolutionary processes that acted on the underlying causal genetic variants.
Collapse
Affiliation(s)
- Brendan P Zietsch
- School of Psychology, University of Queensland, Brisbane, Queensland, Australia ; Genetic Epidemiology Laboratory, QIMR Berghofer, Brisbane, Queensland, Australia
| | - Teresa R de Candia
- Department of Psychology and Neuroscience, University of Colorado Boulder, Boulder, Colorado, United States of America ; Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, Colorado, United States of America
| | - Matthew C Keller
- Department of Psychology and Neuroscience, University of Colorado Boulder, Boulder, Colorado, United States of America ; Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, Colorado, United States of America
| |
Collapse
|
45
|
Johnson EO, Hancock DB, Gaddis NC, Levy JL, Page G, Novak SP, Glasheen C, Saccone NL, Rice JP, Moreau MP, Doheny KF, Romm JM, Brooks AI, Aouizerat BE, Bierut LJ, Kral AH. Novel genetic locus implicated for HIV-1 acquisition with putative regulatory links to HIV replication and infectivity: a genome-wide association study. PLoS One 2015; 10:e0118149. [PMID: 25786224 PMCID: PMC4364715 DOI: 10.1371/journal.pone.0118149] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 01/05/2015] [Indexed: 11/18/2022] Open
Abstract
Fifty percent of variability in HIV-1 susceptibility is attributable to host genetics. Thus identifying genetic associations is essential to understanding pathogenesis of HIV-1 and important for targeting drug development. To date, however, CCR5 remains the only gene conclusively associated with HIV acquisition. To identify novel host genetic determinants of HIV-1 acquisition, we conducted a genome-wide association study among a high-risk sample of 3,136 injection drug users (IDUs) from the Urban Health Study (UHS). In addition to being IDUs, HIV-controls were frequency-matched to cases on environmental exposures to enhance detection of genetic effects. We tested independent replication in the Women's Interagency HIV Study (N=2,533). We also examined publicly available gene expression data to link SNPs associated with HIV acquisition to known mechanisms affecting HIV replication/infectivity. Analysis of the UHS nominated eight genetic regions for replication testing. SNP rs4878712 in FRMPD1 met multiple testing correction for independent replication (P=1.38x10(-4)), although the UHS-WIHS meta-analysis p-value did not reach genome-wide significance (P=4.47x10(-7) vs. P<5.0x10(-8)) Gene expression analyses provided promising biological support for the protective G allele at rs4878712 lowering risk of HIV: (1) the G allele was associated with reduced expression of FBXO10 (r=-0.49, P=6.9x10(-5)); (2) FBXO10 is a component of the Skp1-Cul1-F-box protein E3 ubiquitin ligase complex that targets Bcl-2 protein for degradation; (3) lower FBXO10 expression was associated with higher BCL2 expression (r=-0.49, P=8x10(-5)); (4) higher basal levels of Bcl-2 are known to reduce HIV replication and infectivity in human and animal in vitro studies. These results suggest new potential biological pathways by which host genetics affect susceptibility to HIV upon exposure for follow-up in subsequent studies.
Collapse
Affiliation(s)
- Eric O. Johnson
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| | - Dana B. Hancock
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| | - Nathan C. Gaddis
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| | - Joshua L. Levy
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| | - Grier Page
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| | - Scott P. Novak
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| | - Cristie Glasheen
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| | - Nancy L. Saccone
- Washington University School of Medicine, St. Louis, MO, United States of America
| | - John P. Rice
- Washington University School of Medicine, St. Louis, MO, United States of America
| | - Michael P. Moreau
- Rutgers University Cell and DNA Repository (RUCDR), Piscataway, NJ, United States of America
| | - Kimberly F. Doheny
- Center for Inherited Disease Research (CIDR), Johns Hopkins University, Baltimore, MD, United States of America
| | - Jane M. Romm
- Center for Inherited Disease Research (CIDR), Johns Hopkins University, Baltimore, MD, United States of America
| | - Andrew I. Brooks
- Rutgers University Cell and DNA Repository (RUCDR), Piscataway, NJ, United States of America
| | - Bradley E. Aouizerat
- School of Nursing, University of California San Francisco, San Francisco, CA, United States of America
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, United States of America
| | - Laura J. Bierut
- Washington University School of Medicine, St. Louis, MO, United States of America
| | - Alex H. Kral
- RTI International, Research Triangle Park, NC, Atlanta, GA, San Francisco, CA, United States of America
| |
Collapse
|
46
|
van der Spek D, van Arendonk JAM, Bovenhuis H. Genome-wide association study for claw disorders and trimming status in dairy cattle. J Dairy Sci 2014; 98:1286-95. [PMID: 25497826 DOI: 10.3168/jds.2014-8302] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Accepted: 10/28/2014] [Indexed: 01/09/2023]
Abstract
Performing a genome-wide association study (GWAS) might add to a better understanding of the development of claw disorders and the need for trimming. Therefore, the aim of the current study was to perform a GWAS on claw disorders and trimming status and to validate the results for claw disorders based on an independent data set. Data consisted of 20,474 cows with phenotypes for claw disorders and 50,238 cows with phenotypes for trimming status. Recorded claw disorders used in the current study were double sole (DS), interdigital hyperplasia (IH), sole hemorrhage (SH), sole ulcer (SU), white line separation (WLS), a combination of infectious claw disorders consisting of (inter-)digital dermatitis and heel erosion, and a combination of laminitis-related claw disorders (DS, SH, SU, and WLS). Of the cows with phenotypes for claw disorders, 1,771 cows were genotyped and these cow data were used for the GWAS on claw disorders. A SNP was considered significant when the false discovery rate≤0.05 and suggestive when the false discovery rate≤0.20. An independent data set of 185 genotyped bulls having at least 5 daughters with phenotypes (6,824 daughters in total) for claw disorders was used to validate significant and suggestive SNP detected based on the cow data. To analyze the trait "trimming status" (i.e., the need for claw trimming), a data set with 327 genotyped bulls having at least 5 daughters with phenotypes (18,525 daughters in total) was used. Based on the cow data, in total 10 significant and 45 suggestive SNP were detected for claw disorders. The 10 significant SNP were associated with SU, and mainly located on BTA8. The suggestive SNP were associated with DS, IH, SU, and laminitis-related claw disorders. Three of the suggestive SNP were validated in the data set of 185 bulls, and were located on BTA13, BTA14, and BTA17. For infectious claw disorders, SH, and WLS, no significant or suggestive SNP associations were detected. For trimming status, 1 significant and 1 suggestive SNP were detected, both located close to each other on BTA15. Some significant and suggestive SNP were located close to SNP detected in studies on feet and leg conformation traits. Genes with major effects could not be detected and SNP associations were spread across the genome, indicating that many SNP, each explaining a small proportion of the genetic variance, influence claw disorders. Therefore, to reduce the incidence of claw disorders by breeding, genomic selection is a promising approach.
Collapse
Affiliation(s)
- D van der Spek
- Animal Breeding and Genomics Centre, Wageningen University, PO 338, 6700 AH, Wageningen, the Netherlands.
| | - J A M van Arendonk
- Animal Breeding and Genomics Centre, Wageningen University, PO 338, 6700 AH, Wageningen, the Netherlands
| | - H Bovenhuis
- Animal Breeding and Genomics Centre, Wageningen University, PO 338, 6700 AH, Wageningen, the Netherlands
| |
Collapse
|
47
|
Wray NR, Lee SH, Mehta D, Vinkhuyzen AAE, Dudbridge F, Middeldorp CM. Research review: Polygenic methods and their application to psychiatric traits. J Child Psychol Psychiatry 2014; 55:1068-87. [PMID: 25132410 DOI: 10.1111/jcpp.12295] [Citation(s) in RCA: 456] [Impact Index Per Article: 45.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/13/2014] [Indexed: 12/18/2022]
Abstract
BACKGROUND Despite evidence from twin and family studies for an important contribution of genetic factors to both childhood and adult onset psychiatric disorders, identifying robustly associated specific DNA variants has proved challenging. In the pregenomics era the genetic architecture (number, frequency and effect size of risk variants) of complex genetic disorders was unknown. Empirical evidence for the genetic architecture of psychiatric disorders is emerging from the genetic studies of the last 5 years. METHODS AND SCOPE We review the methods investigating the polygenic nature of complex disorders. We provide mini-guides to genomic profile (or polygenic) risk scoring and to estimation of variance (or heritability) from common SNPs; a glossary of key terms is also provided. We review results of applications of the methods to psychiatric disorders and related traits and consider how these methods inform on missing heritability, hidden heritability and still-missing heritability. FINDINGS Genome-wide genotyping and sequencing studies are providing evidence that psychiatric disorders are truly polygenic, that is they have a genetic architecture of many genetic variants, including risk variants that are both common and rare in the population. Sample sizes published to date are mostly underpowered to detect effect sizes of the magnitude presented by nature, and these effect sizes may be constrained by the biological validity of the diagnostic constructs. CONCLUSIONS Increasing the sample size for genome wide association studies of psychiatric disorders will lead to the identification of more associated genetic variants, as already found for schizophrenia. These loci provide the starting point of functional analyses that might eventually lead to new prevention and treatment options and to improved biological validity of diagnostic constructs. Polygenic analyses will contribute further to our understanding of complex genetic traits as sample sizes increase and as sample resources become richer in phenotypic descriptors, both in terms of clinical symptoms and of nongenetic risk factors.
Collapse
Affiliation(s)
- Naomi R Wray
- Queensland Brain Institute, The University of Queensland, St Lucia, Qld, Australia
| | | | | | | | | | | |
Collapse
|
48
|
The effects of demography and long-term selection on the accuracy of genomic prediction with sequence data. Genetics 2014; 198:1671-84. [PMID: 25233989 DOI: 10.1534/genetics.114.168344] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The use of dense SNPs to predict the genetic value of an individual for a complex trait is often referred to as "genomic selection" in livestock and crops, but is also relevant to human genetics to predict, for example, complex genetic disease risk. The accuracy of prediction depends on the strength of linkage disequilibrium (LD) between SNPs and causal mutations. If sequence data were used instead of dense SNPs, accuracy should increase because causal mutations are present, but demographic history and long-term negative selection also influence accuracy. We therefore evaluated genomic prediction, using simulated sequence in two contrasting populations: one reducing from an ancestrally large effective population size (Ne) to a small one, with high LD common in domestic livestock, while the second had a large constant-sized Ne with low LD similar to that in some human or outbred plant populations. There were two scenarios in each population; causal variants were either neutral or under long-term negative selection. For large Ne, sequence data led to a 22% increase in accuracy relative to ∼600K SNP chip data with a Bayesian analysis and a more modest advantage with a BLUP analysis. This advantage increased when causal variants were influenced by negative selection, and accuracy persisted when 10 generations separated reference and validation populations. However, in the reducing Ne population, there was little advantage for sequence even with negative selection. This study demonstrates the joint influence of demography and selection on accuracy of prediction and improves our understanding of how best to exploit sequence for genomic prediction.
Collapse
|
49
|
Kijas JW, Porto-Neto L, Dominik S, Reverter A, Bunch R, McCulloch R, Hayes BJ, Brauning R, McEwan J. Linkage disequilibrium over short physical distances measured in sheep using a high-density SNP chip. Anim Genet 2014; 45:754-7. [PMID: 25040320 DOI: 10.1111/age.12197] [Citation(s) in RCA: 72] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/14/2014] [Indexed: 01/08/2023]
Abstract
The extent of linkage disequilibrium (LD) between genetic loci has implications for both association studies and the accuracy of genomic prediction. To characterise the persistence of LD in diverse sheep breeds, two SNP genotyping platforms were used. First, existing SNP genotypes from 63 breeds obtained using the ovine SNP50 BeadChip (49,034 loci) were used to estimate LD decay in populations with contrasting levels of genetic diversity. Given the paucity of marker pairs separated by short physical distances on the SNP50 BeadChip, genotyping was subsequently performed for four breeds using the recently developed ovine HD BeadChip that assays approximately 600,000 SNPs with an average genomic spacing of 5 kb. This facilitated a highly accurate estimate of LD over short genomic distances (<30 kb) and revealed LD varies considerably between sheep breeds. Further, sheep appear to contain generally lower levels of LD than do other domestic species, likely a reflection of aspects of their past population history.
Collapse
Affiliation(s)
- James W Kijas
- CSIRO Agriculture Flagship, St Lucia, Qld 4067, Australia
| | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Cao Y, Wei P, Bailey M, Kauwe JSK, Maxwell TJ. A versatile omnibus test for detecting mean and variance heterogeneity. Genet Epidemiol 2014; 38:51-59. [PMID: 24482837 DOI: 10.1002/gepi.21778] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Recent research has revealed loci that display variance heterogeneity through various means such as biological disruption, linkage disequilibrium (LD), gene-by-gene (G × G), or gene-by-environment interaction. We propose a versatile likelihood ratio test that allows joint testing for mean and variance heterogeneity (LRT(MV)) or either effect alone (LRT(M) or LRT(V)) in the presence of covariates. Using extensive simulations for our method and others, we found that all parametric tests were sensitive to nonnormality regardless of any trait transformations. Coupling our test with the parametric bootstrap solves this issue. Using simulations and empirical data from a known mean-only functional variant, we demonstrate how LD can produce variance-heterogeneity loci (vQTL) in a predictable fashion based on differential allele frequencies, high D', and relatively low r² values. We propose that a joint test for mean and variance heterogeneity is more powerful than a variance-only test for detecting vQTL. This takes advantage of loci that also have mean effects without sacrificing much power to detect variance only effects. We discuss using vQTL as an approach to detect G × G interactions and also how vQTL are related to relationship loci, and how both can create prior hypothesis for each other and reveal the relationships between traits and possibly between components of a composite trait.
Collapse
Affiliation(s)
- Ying Cao
- Human Genetics Center, UT School of Public Health, Houston, TX 77030, USA.,Division of Biostatistics, UT School of Public Health, Houston, TX 77030, USA
| | - Peng Wei
- Human Genetics Center, UT School of Public Health, Houston, TX 77030, USA.,Division of Biostatistics, UT School of Public Health, Houston, TX 77030, USA
| | - Matthew Bailey
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - John S K Kauwe
- Department of Biology, Brigham Young University, Provo, UT 84602, USA
| | - Taylor J Maxwell
- Human Genetics Center, UT School of Public Health, Houston, TX 77030, USA
| |
Collapse
|