Reference Citation Analysis

For: López de Maturana E, Ibáñez-Escriche N, González-Recio Ó, Marenne G, Mehrban H, Chanock SJ, Goddard ME, Malats N. Next generation modeling in GWAS: comparing different genetic architectures. Hum Genet 2014;133:1235-53. [DOI: 10.1007/s00439-014-1461-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Received: 03/11/2014] [Accepted: 06/05/2014] [Indexed: 12/14/2022]

Cited by Other Article(s)

Sandoval-Castillo J, Beheregaray LB, Wellenreuther M. Genomic prediction of growth in a commercially, recreationally, and culturally important marine resource, the Australian snapper (Chrysophrys auratus). G3 (Bethesda) 2022;12:jkac015. [PMID: 35100370 PMCID: PMC8896003 DOI: 10.1093/g3journal/jkac015] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 11/05/2021] [Accepted: 01/07/2022] [Indexed: 06/14/2023]

Abstract

Growth is one of the most important traits of an organism. For exploited species, this trait has ecological and evolutionary consequences as well as economical and conservation significance. Rapid changes in growth rate associated with anthropogenic stressors have been reported for several marine fishes, but little is known about the genetic basis of growth traits in teleosts. We used reduced genome representation data and genome-wide association approaches to identify growth-related genetic variation in the commercially, recreationally, and culturally important Australian snapper (Chrysophrys auratus, Sparidae). Based on 17,490 high-quality single-nucleotide polymorphisms and 363 individuals representing extreme growth phenotypes from 15,000 fish of the same age and reared under identical conditions in a sea pen, we identified 100 unique candidates that were annotated to 51 proteins. We documented a complex polygenic nature of growth in the species that included several loci with small effects and a few loci with larger effects. Overall heritability was high (75.7%), reflected in the high accuracy of the genomic prediction for the phenotype (small vs large). Although the single-nucleotide polymorphisms were distributed across the genome, most candidates (60%) clustered on chromosome 16, which also explains the largest proportion of heritability (16.4%). This study demonstrates that reduced genome representation single-nucleotide polymorphisms and the right bioinformatic tools provide a cost-efficient approach to identify growth-related loci and to describe genomic architectures of complex quantitative traits. Our results help to inform captive aquaculture breeding programs and are of relevance to monitor growth-related evolutionary shifts in wild populations in response to anthropogenic pressures.

Casto-Rebollo C, Argente MJ, García ML, Pena R, Ibáñez-Escriche N. Identification of functional mutations associated with environmental variance of litter size in rabbits. Genet Sel Evol 2020;52:22. [PMID: 32375645 PMCID: PMC7203823 DOI: 10.1186/s12711-020-00542-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 10/18/2019] [Accepted: 04/27/2020] [Indexed: 12/18/2022]

Abstract

Background

Environmental variance (V_E) is partly under genetic control and has recently been proposed as a measure of resilience. Unravelling the genetic background of the V_E of complex traits could help to improve resilience of livestock and stabilize their production across farming systems. The objective of this study was to identify genes and functional mutations associated with variation in V_E of litter size (LS) in rabbits. To achieve this, we combined the results of a genome-wide association study (GWAS) and a whole-genome sequencing (WGS) analysis using data from two divergently selected rabbit lines for high and low V_E of LS. These lines differ in terms of biomarkers of immune response and mortality. Moreover, rabbits with a lower V_E of LS were found to be more resilient to infections than animals with a higher V_E of LS.

Results

By using two GWAS approaches (single-marker regression and Bayesian multiple-marker regression), we identified four genomic regions associated with V_E of LS, on chromosomes 3, 7, 10, and 14. We detected 38 genes in the associated genomic regions and, using WGS, we identified 129 variants in the splicing, UTR, and coding (missense and frameshift effects) regions of 16 of these 38 genes. These genes were related to the immune system, the development of sensory structures, and stress responses. All of these variants (except one) segregated in one of the rabbit lines and were absent (n = 91) or fixed in the other one (n = 37). The fixed variants were in the HDAC9, ITGB8, MIS18A, ENSOCUG00000021276 and URB1 genes. We also identified a 1-bp deletion in the 3′UTR region of the HUNK gene that was fixed in the low V_E line and absent in the high V_E line.

Conclusions

This is the first study that combines GWAS and WGS analyses to study the genetic basis of V_E. The new candidate genes and functional mutations identified in this study suggest that the V_E of LS is under the control of functions related to the immune system, stress response, and the nervous system. These findings could also explain differences in resilience between rabbits with homogeneous and heterogeneous V_E of litter size.

Sosa-Madrid BS, Hernández P, Blasco A, Haley CS, Fontanesi L, Santacreu MA, Pena RN, Navarro P, Ibáñez-Escriche N. Genomic regions influencing intramuscular fat in divergently selected rabbit lines. Anim Genet 2020;51:58-69. [PMID: 31696970 PMCID: PMC7004202 DOI: 10.1111/age.12873] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Accepted: 10/04/2019] [Indexed: 12/12/2022]

Sosa-Madrid BS, Santacreu MA, Blasco A, Fontanesi L, Pena RN, Ibáñez-Escriche N. A genomewide association study in divergently selected lines in rabbits reveals novel genomic regions associated with litter size traits. J Anim Breed Genet 2019;137:123-138. [PMID: 31657065 DOI: 10.1111/jbg.12451] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 06/11/2019] [Revised: 10/02/2019] [Accepted: 10/03/2019] [Indexed: 12/28/2022]

Liu Y, Lusk CM, Cho MH, Silverman EK, Qiao D, Zhang R, Scheurer ME, Kheradmand F, Wheeler DA, Tsavachidis S, Armstrong G, Zhu D, Wistuba II, Chow CWB, Behrens C, Pikielny CW, Neslund-Dudas C, Pinney SM, Anderson M, Kupert E, Bailey-Wilson J, Gaba C, Mandal D, You M, de Andrade M, Yang P, Field JK, Liloglou T, Davies M, Lissowska J, Swiatkowska B, Zaridze D, Mukeriya A, Janout V, Holcatova I, Mates D, Milosavljevic S, Scelo G, Brennan P, McKay J, Liu G, Hung RJ, Christiani DC, Schwartz AG, Amos CI, Spitz MR. Rare Variants in Known Susceptibility Loci and Their Contribution to Risk of Lung Cancer. J Thorac Oncol 2018;13:1483-1495. [PMID: 29981437 PMCID: PMC6366341 DOI: 10.1016/j.jtho.2018.06.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 04/12/2018] [Revised: 06/06/2018] [Accepted: 06/17/2018] [Indexed: 10/28/2022]

Abstract

BACKGROUND

Genome-wide association studies are widely used to map genomic regions contributing to lung cancer (LC) susceptibility, but they typically do not identify the precise disease-causing genes/variants. To unveil the inherited genetic variants that cause LC, we performed focused exome-sequencing analyses on genes located in 121 genome-wide association study-identified loci previously implicated in the risk of LC, chronic obstructive pulmonary disease, pulmonary function level, and smoking behavior.

METHODS

Germline DNA from 260 case patients with LC and 318 controls was sequenced using VCRome 2.1 exome capture. Filtering was based on enrichment of rare and potentially deleterious variants in cases (risk alleles) or controls (protective alleles). Allelic association analyses of single variants and gene-based burden tests of multiple variants were performed. Promising candidates were tested in two independent validation studies with a total of 1773 case patients and 1123 controls.

RESULTS

We identified 48 rare variants with deleterious effects in the discovery analysis and validated 12 of the 43 candidates that were covered in the validation platforms. The top validated candidates included one well-established truncating variant, namely, BRCA2, DNA repair associated gene (BRCA2) K3326X (OR = 2.36, 95% confidence interval [CI]: 1.38-3.99), and three newly identified variations, namely, lymphotoxin beta gene (LTB) p.Leu87Phe (OR = 7.52, 95% CI: 1.01-16.56), prolyl 3-hydroxylase 2 gene (P3H2) p.Gln185His (OR = 5.39, 95% CI: 0.75-15.43), and dishevelled associated activator of morphogenesis 2 gene (DAAM2) p.Asp762Gly (OR = 0.25, 95% CI: 0.10-0.79). Burden tests revealed strong associations between zinc finger protein 93 gene (ZNF93), DAAM2, bromodomain containing 9 gene (BRD9), and the gene LTB and LC susceptibility.

CONCLUSION

Our results extend the catalogue of regions associated with LC and highlight the importance of germline rare coding variants in LC susceptibility.

Affiliation(s)

Yanhong Liu: Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
Christine M. Lusk: Karmanos Cancer Institute, Wayne State University, Detroit, MI 48201, USA
Michael H. Cho: Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
Edwin K. Silverman: Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
Dandi Qiao: Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
Ruyang Zhang: Harvard University School of Public Health, Boston, MA 02115, USA
Michael E. Scheurer: Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
Farrah Kheradmand: Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA; Michael E. DeBakey Veterans Affairs Medical Center, Houston, TX 77030, USA
David A. Wheeler: Department of Molecular and Human Genetics, Human Genome Sequence Center, Baylor College of Medicine, Houston, TX 77030, USA
Spiridon Tsavachidis: Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
Georgina Armstrong: Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA
Dakai Zhu: Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA; Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX 77030, USA
Ignacio I. Wistuba: Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
Chi-Wan B. Chow: Department of Translational Molecular Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
Carmen Behrens: Department of Thoracic/Head and Neck Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
Claudio W. Pikielny: Department of Biomedical Data Science, Geisel School of Medicine, Dartmouth College, Lebanon, NH 03755, USA
Christine Neslund-Dudas: Department of Public Health Sciences, Henry Ford Health System, Detroit, MI 48202, USA
Susan M. Pinney: University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
Marshall Anderson: University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
Elena Kupert: University of Cincinnati College of Medicine, Cincinnati, OH 45267, USA
Joan Bailey-Wilson: National Human Genome Research Institute, Bethesda, MD 20892, USA
Colette Gaba: The University of Toledo College of Medicine, Toledo, OH 43614, USA
Diptasri Mandal: Louisiana State University Health Sciences Center, New Orleans, LA 70112, USA
Ming You: Medical College of Wisconsin, Milwaukee, WI 53226, USA
Mariza de Andrade: Mayo Clinic College of Medicine, Rochester, MN 55905, USA
Ping Yang: Mayo Clinic College of Medicine, Rochester, MN 55905, USA
John K. Field: Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Liverpool, UK
Triantafillos Liloglou: Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Liverpool, UK
Michael Davies: Roy Castle Lung Cancer Research Programme, The University of Liverpool, Department of Molecular and Clinical Cancer Medicine, Liverpool, UK
Jolanta Lissowska: The M. Sklodowska-Curie Institute of Oncology Center, Warsaw 02781, Poland
Beata Swiatkowska: Nofer Institute of Occupational Medicine, Department of Environmental Epidemiology, Lodz 91348, Poland
David Zaridze: Russian N.N. Blokhin Cancer Research Centre, Moscow 115478, Russian Federation
Anush Mukeriya: Russian N.N. Blokhin Cancer Research Centre, Moscow 115478, Russian Federation
Vladimir Janout: Faculty of Health Sciences, Palacky University, Olomouc 77515, Czech Republic
Ivana Holcatova: Institute of Public Health and Preventive Medicine, Charles University, 2nd Faculty of Medicine, Prague 12800, Czech Republic
Dana Mates: National Institute of Public Health, Bucharest 050463, Romania
Sasa Milosavljevic: International Organization for Cancer Prevention and Research (IOCPR), Belgrade, Serbia
Ghislaine Scelo: International Agency for Research on Cancer, Lyon, France
Paul Brennan: International Agency for Research on Cancer, Lyon, France
James McKay: International Agency for Research on Cancer, Lyon, France
Geoffrey Liu: Princess Margaret Cancer Center, Toronto, ON, M5G 2M9, Canada
Rayjean J. Hung: Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, M5G 1X5, Canada
The COPDGene Investigators
David C. Christiani: Harvard University School of Public Health, Boston, MA 02115, USA
Ann G. Schwartz: Karmanos Cancer Institute, Wayne State University, Detroit, MI 48201, USA
Christopher I. Amos: Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA; Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX 77030, USA
Margaret R. Spitz: Dan L. Duncan Comprehensive Cancer Center, Department of Medicine, Baylor College of Medicine, Houston, TX 77030, USA

Kirpich A, Ainsworth EA, Wedow JM, Newman JRB, Michailidis G, McIntyre LM. Variable selection in omics data: A practical evaluation of small sample sizes. PLoS One 2018;13:e0197910. [PMID: 29927942 PMCID: PMC6013185 DOI: 10.1371/journal.pone.0197910] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Received: 09/30/2017] [Accepted: 05/10/2018] [Indexed: 01/04/2023]

Genetics of body fat mass and related traits in a pig population selected for leanness. Sci Rep 2017;7:9118. [PMID: 28831160 PMCID: PMC5567295 DOI: 10.1038/s41598-017-08961-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 01/09/2017] [Accepted: 07/17/2017] [Indexed: 12/21/2022]

López de Maturana E, Pineda S, Brand A, Van Steen K, Malats N. Toward the integration of Omics data in epidemiological studies: still a "long and winding road". Genet Epidemiol 2016;40:558-569. [PMID: 27432111 DOI: 10.1002/gepi.21992] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 08/15/2015] [Revised: 05/22/2016] [Accepted: 06/05/2016] [Indexed: 12/23/2022]

Masson-Lecomte A, López de Maturana E, Goddard ME, Picornell A, Rava M, González-Neira A, Márquez M, Carrato A, Tardon A, Lloreta J, Garcia-Closas M, Silverman D, Rothman N, Kogevinas M, Allory Y, Chanock SJ, Real FX, Malats N. Inflammatory-Related Genetic Variants in Non-Muscle-Invasive Bladder Cancer Prognosis: A Multimarker Bayesian Assessment. Cancer Epidemiol Biomarkers Prev 2016;25:1144-50. [PMID: 27197286 DOI: 10.1158/1055-9965.epi-15-0894] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Received: 09/21/2015] [Accepted: 04/22/2016] [Indexed: 11/16/2022]

Abstract

BACKGROUND

Increasing evidence points to the role of tumor immunologic environment on urothelial bladder cancer prognosis. This effect might be partly dependent on the host genetic context. We evaluated the association of SNPs in inflammation-related genes with non-muscle-invasive bladder cancer (NMIBC) risk-of-recurrence and risk-of-progression.

METHODS

We considered 822 NMIBC cases included in the SBC/EPICURO Study and followed up for >10 years. We selected 1,679 SNPs belonging to 251 inflammatory genes. The association of SNPs with risk-of-recurrence and risk-of-progression was assessed using Cox regression single-marker (SMM) and multimarker methods (MMM), namely Bayes A and Bayesian LASSO. Discriminative abilities of the models were calculated using the c index and validated with bootstrap cross-validation procedures.

RESULTS

While no SNP was found to be associated with risk-of-recurrence using SMM, three SNPs in TNIP1, CD5, and JAK3 showed very strong association with posterior probabilities >90% using MMM. Regarding risk-of-progression, one SNP in CD3G was significantly associated using SMM (HR, 2.69; P = 1.55 × 10⁻⁵) and two SNPs in MASP1 and AIRE showed a posterior probability ≥80% with MMM. Validated discriminative abilities of the models without and with the SNPs were 58.4% versus 60.5% and 72.1% versus 72.8% for risk-of-recurrence and risk-of-progression, respectively.

CONCLUSIONS

Using innovative analytic approaches, we demonstrated that SNPs in inflammatory-related genes were associated with NMIBC prognosis and that they improve the discriminative ability of prognostic clinical models for NMIBC.

IMPACT

This study provides proof of concept for the joint effect of genetic variants in improving the discriminative ability of clinical prognostic models. The approach may be extended to other diseases.

Affiliation(s)

Alexandra Masson-Lecomte: Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), Madrid, Spain; Urology Department, Henri Mondor Academic Hospital, Paris Est Créteil University, Créteil, France
Evangelina López de Maturana: Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Michael E Goddard: Biosciences Research Division, Department of Environment and Primary Industries, Agribio, Bundoora, Victoria, Australia; Department of Food and Agricultural Systems, University of Melbourne, Melbourne, Australia
Antoni Picornell: Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Marta Rava: Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Anna González-Neira: Human Genotyping-CEGEN Unit, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Mirari Márquez: Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Alfredo Carrato: Servicio de Oncología, Hospital Universitario Ramon y Cajal, Madrid, and Servicio de Oncología, Hospital Universitario de Elche, Elche, Spain
Adonina Tardon: Department of Preventive Medicine, Universidad de Oviedo, Oviedo, Spain
Josep Lloreta: Institut Municipal d'Investigació Mèdica - Hospital del Mar and Departament de Patologia, Hospital del Mar - IMAS, Barcelona, Spain
Montserrat Garcia-Closas: Division of Genetics and Epidemiology, Institute of Cancer Research, London, United Kingdom
Debra Silverman: Division of Cancer Epidemiology and Genetics, National Cancer Institute, Department of Health and Human Services, Bethesda, Maryland
Nathaniel Rothman: Division of Cancer Epidemiology and Genetics, National Cancer Institute, Department of Health and Human Services, Bethesda, Maryland
Manolis Kogevinas: Centre for Research in Environmental Epidemiology (CREAL) and Institut Municipal d'Investigació Mèdica - Hospital del Mar, Barcelona, Spain
Yves Allory: Pathology Department, Henri Mondor Academic Hospital, Paris Est Créteil University, INSERM, Créteil, France
Stephen J Chanock: Division of Cancer Epidemiology and Genetics, National Cancer Institute, Department of Health and Human Services, Bethesda, Maryland
Francisco X Real: Epithelial Carcinogenesis Group, Spanish National Cancer Research Centre (CNIO), Madrid, Spain; Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Barcelona, Spain
Núria Malats: Genetic and Molecular Epidemiology Group, Spanish National Cancer Research Centre (CNIO), Madrid, Spain

Rare Variants in Transcript and Potential Regulatory Regions Explain a Small Percentage of the Missing Heritability of Complex Traits in Cattle. PLoS One 2015;10:e0143945. [PMID: 26642058 PMCID: PMC4671594 DOI: 10.1371/journal.pone.0143945] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 08/26/2015] [Accepted: 11/11/2015] [Indexed: 11/19/2022]

Abstract

The proportion of genetic variation in complex traits explained by rare variants is a key question for genomic prediction, and for identifying the basis of “missing heritability”, the proportion of additive genetic variation not captured by common variants on SNP arrays. Sequence variants in transcript and regulatory regions from 429 sequenced animals were used to impute high density SNP genotypes of 3311 Holstein sires to sequence. There were 675,062 common variants (MAF > 0.05), 102,549 uncommon variants (0.01 < MAF < 0.05), and 83,856 rare variants (MAF < 0.01). We describe a novel method for estimating the proportion of the rare variants that are sequencing errors using parent-progeny duos. We then used mixed model methodology to estimate the proportion of variance captured by these different classes of variants for fat, milk and protein yields, as well as for fertility. Common sequence variants captured 83%, 77%, 76% and 84% of the total genetic variance for fat, milk, and protein yields and fertility, respectively. This was between 2 and 5% more variance than that captured from 600k SNPs on a high density chip, although the difference was not significant. Rare variants captured 3%, 0%, 1% and 14% of the genetic variance for fat, milk and protein yields, and fertility, respectively, whereas pedigree explained the remaining amount of genetic variance (none for fertility). The proportion of variation explained by rare variants is likely to be under-estimated due to reduced accuracies of imputation for this class of variants. Using common sequence variants slightly improved accuracy of genomic predictions for fat and milk yield, compared to high density SNP array genotypes. However, including rare variants from transcript regions did not increase the accuracy of genomic predictions. These results suggest that rare variants recover a small percentage of the missing heritability for complex traits; however, very large reference sets will be required to exploit this to improve the accuracy of genomic predictions. Our results do suggest the contribution of rare variants to genetic variation may be greater for fitness traits.
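The frequency classes used in this abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names `classify_by_maf` and `count_maf_classes` are hypothetical, and because the abstract uses strict inequalities (leaving MAF values of exactly 0.01 and 0.05 unassigned), the sketch assumes that a boundary value falls into the higher-frequency class.

```python
from typing import Dict, Iterable


def classify_by_maf(maf: float) -> str:
    """Assign a variant to a frequency class using the abstract's cut-offs.

    Assumption (not stated in the abstract): values exactly on a boundary
    (0.01 or 0.05) are placed in the higher-frequency class.
    """
    if not 0.0 <= maf <= 0.5:
        raise ValueError("a minor allele frequency must lie in [0, 0.5]")
    if maf < 0.01:
        return "rare"
    if maf < 0.05:
        return "uncommon"
    return "common"


def count_maf_classes(mafs: Iterable[float]) -> Dict[str, int]:
    """Count how many variants fall into each frequency class."""
    counts = {"rare": 0, "uncommon": 0, "common": 0}
    for maf in mafs:
        counts[classify_by_maf(maf)] += 1
    return counts
```

For example, `count_maf_classes([0.005, 0.03, 0.2, 0.4])` counts one rare, one uncommon, and two common variants.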

The genetics of feed conversion efficiency traits in a commercial broiler line. Sci Rep 2015;5:16387. [PMID: 26552583 PMCID: PMC4639841 DOI: 10.1038/srep16387] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Received: 06/19/2015] [Accepted: 10/14/2015] [Indexed: 11/26/2022]

Dehman A, Ambroise C, Neuvial P. Performance of a blockwise approach in variable selection using linkage disequilibrium information. BMC Bioinformatics 2015;16:148. [PMID: 25951947 PMCID: PMC4430909 DOI: 10.1186/s12859-015-0556-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Received: 07/30/2014] [Accepted: 03/30/2015] [Indexed: 12/03/2022]

Abstract

Background

Genome-wide association studies (GWAS) aim at finding genetic markers that are significantly associated with a phenotype of interest. Single nucleotide polymorphism (SNP) data from the entire genome are collected for many thousands of SNP markers, leading to high-dimensional regression problems where the number of predictors greatly exceeds the number of observations. Moreover, these predictors are statistically dependent, in particular due to linkage disequilibrium (LD).

We propose a three-step approach that explicitly takes advantage of the grouping structure induced by LD in order to identify common variants which may have been missed by single marker analyses (SMA). In the first step, we perform a hierarchical clustering of SNPs with an adjacency constraint using LD as a similarity measure. In the second step, we apply a model selection approach to the obtained hierarchy in order to define LD blocks. Finally, we perform Group Lasso regression on the inferred LD blocks. We investigate the efficiency of this approach compared to state-of-the-art regression methods: haplotype association tests, SMA, and Lasso and Elastic-Net regressions.
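The first step described above (hierarchical clustering in which only neighbouring SNPs may merge) can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the BALD implementation: the function names are hypothetical, squared Pearson correlation between genotype vectors stands in for the LD measure, average linkage stands in for whatever merging criterion the authors use, and a fixed target number of blocks (`n_blocks`) replaces the model selection of step two. The Group Lasso step is not shown.

```python
from typing import List, Sequence


def r2(x: Sequence[float], y: Sequence[float]) -> float:
    """Squared Pearson correlation between two genotype vectors (LD proxy)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:  # monomorphic SNP: correlation undefined
        return 0.0
    return sxy * sxy / (sxx * syy)


def adjacent_ld_blocks(genotypes: List[List[float]], n_blocks: int) -> List[List[int]]:
    """Greedy agglomerative clustering where only neighbouring blocks merge.

    genotypes[j] is the genotype vector of SNP j, with SNPs in genomic order.
    Returns a list of blocks, each a list of contiguous SNP indices.
    """
    if n_blocks < 1:
        raise ValueError("n_blocks must be at least 1")
    blocks = [[j] for j in range(len(genotypes))]

    def avg_ld(a: List[int], b: List[int]) -> float:
        # Average pairwise LD between two blocks (average linkage, an assumption).
        vals = [r2(genotypes[i], genotypes[j]) for i in a for j in b]
        return sum(vals) / len(vals)

    while len(blocks) > n_blocks:
        # Adjacency constraint: only the pairs (k, k + 1) are merge candidates.
        best = max(range(len(blocks) - 1),
                   key=lambda k: avg_ld(blocks[k], blocks[k + 1]))
        blocks[best:best + 2] = [blocks[best] + blocks[best + 1]]
    return blocks
```

Restricting merges to neighbouring blocks keeps every block contiguous along the genome, which is what makes the result interpretable as LD blocks.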

Results

Our results on simulated data show that the proposed method performs better than state-of-the-art approaches as soon as the number of causal SNPs within an LD block exceeds 2. Our results on semi-simulated data and a previously published HIV data set illustrate the relevance of the proposed method and its robustness to a real LD structure. The method is implemented in the R package BALD (Blockwise Approach using Linkage Disequilibrium), available from http://www.math-evry.cnrs.fr/publications/logiciels.

Conclusions

Our results show that the proposed method is efficient not only at the level of LD blocks by inferring well the underlying block structure but also at the level of individual SNPs. Thus, this study demonstrates the importance of tailored integration of biological knowledge in high-dimensional genomic studies such as GWAS.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0556-6) contains supplementary material, which is available to authorized users.
