Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hoffman GE. Correcting for population structure and kinship using the linear mixed model: theory and extensions. PLoS One 2013;8:e75707. [PMID: 24204578 PMCID: PMC3810480 DOI: 10.1371/journal.pone.0075707] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Received: 02/11/2013] [Accepted: 08/20/2013] [Indexed: 01/20/2023] Open

Cited by Other Article(s)

Song S, Li Y, Qiu M, Xu N, Li B, Zhang L, Li L, Chen W, Li J, Wang T, Qiu Y, Gong M, Yu D, Dong H, Xia S, Pan Y, Yuan D, Li L. Structural variations of a new fertility restorer gene, Rf20, underlie the restoration of wild abortive-type cytoplasmic male sterility in rice. Molecular Plant 2024;17:1272-1288. [PMID: 38956872 DOI: 10.1016/j.molp.2024.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/08/2024] [Revised: 06/25/2024] [Accepted: 07/01/2024] [Indexed: 07/04/2024]

Affiliation(s)

Shufeng Song State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China
Yixing Li State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China
Mudan Qiu State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China; College of Agronomy, Hunan Agricultural University, Changsha 410128, China
Na Xu State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China; College of Plant Protection, Hunan Agricultural University, Changsha 410128, China
Bin Li State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China
Longhui Zhang College of Tropical Crops, Hainan University, Haikou 570228, China
Lei Li Longping Branch, College of Biology, Hunan University, Changsha 410125, China
Weijun Chen State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China
Jinglei Li Longping Branch, College of Biology, Hunan University, Changsha 410125, China
Tiankang Wang State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China
Yingxin Qiu Longping Branch, College of Biology, Hunan University, Changsha 410125, China
Mengmeng Gong College of Agronomy, Hunan Agricultural University, Changsha 410128, China
Dong Yu State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China
Hao Dong Longping Branch, College of Biology, Hunan University, Changsha 410125, China
Siqi Xia College of Agronomy, Hunan Agricultural University, Changsha 410128, China
Yi Pan State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China
Dingyang Yuan State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China; Longping Branch, College of Biology, Hunan University, Changsha 410125, China
Li Li State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Hunan Academy of Agricultural Sciences, Changsha 410125, China; Longping Branch, College of Biology, Hunan University, Changsha 410125, China.

Lavanchy E, Weir BS, Goudet J. Detecting inbreeding depression in structured populations. Proc Natl Acad Sci U S A 2024;121:e2315780121. [PMID: 38687793 PMCID: PMC11087799 DOI: 10.1073/pnas.2315780121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Accepted: 03/19/2024] [Indexed: 05/02/2024] Open

Morris KM, Sutton K, Girma M, Sánchez-Molano E, Solomon B, Esatu W, Dessie T, Vervelde L, Psifidi A, Hanotte O, Banos G. Phenotypic and genomic characterisation of performance of tropically adapted chickens raised in smallholder farm conditions in Ethiopia. Front Genet 2024;15:1383609. [PMID: 38706792 PMCID: PMC11066160 DOI: 10.3389/fgene.2024.1383609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/07/2024] [Accepted: 04/01/2024] [Indexed: 05/07/2024] Open

Abstract

Background

In sub-Saharan Africa, 80% of poultry production is on smallholder village farms, where chickens are typically reared outdoors in free-ranging conditions. There is limited knowledge on chickens' phenotypic characteristics and genetics under these conditions.

Objective

The present large-scale study set out to phenotypically characterise the performance of tropically adapted commercial chickens in typical smallholder farm conditions, and to examine the genetic profile of chicken phenotypes associated with growth, meat production, immunity, and survival.

Methods

A total of 2,573 T451A dual-purpose Sasso chickens kept outdoors in emulated free-ranging conditions at the poultry facility of the International Livestock Research Institute in Addis Ababa, Ethiopia, were included in the study. The chickens were raised in five equally sized batches and were individually monitored and phenotyped from the age of 56 days for 8 weeks. Individual chicken data collected included weekly body weight, growth rate, body and breast meat weight at slaughter, Newcastle Disease Virus (NDV) titres and intestinal Immunoglobulin A (IgA) levels recorded at the beginning and the end of the period of study, and survival rate during the same period. Genotyping by sequencing was performed on all chickens using a low-coverage and imputation approach. Chicken phenotypes and genotypes were combined in genomic association analyses.

Results

We discovered that the chickens were phenotypically diverse, with extensive variance levels observed in all traits. Batch number and sex of the chicken significantly affected the studied phenotypes. Following quality assurance, genotypes consisted of 2.9 million Single Nucleotide Polymorphism markers that were used in the genomic analyses. Results revealed a largely polygenic mode of genetic control of all phenotypic traits. Nevertheless, 15 distinct markers were identified that were significantly associated with growth, carcass traits, NDV titres, IgA levels, and chicken survival. These markers were located in regions harbouring relevant annotated genes.

Conclusion

Results suggest that performance of chickens raised under smallholder farm conditions is amenable to genetic improvement and may inform selective breeding programmes for enhanced chicken productivity in sub-Saharan Africa.

Liu Z, Turkmen AS, Lin S. Bayesian LASSO for population stratification correction in rare haplotype association studies. Stat Appl Genet Mol Biol 2024;23:sagmb-2022-0034. [PMID: 38235525 PMCID: PMC10794901 DOI: 10.1515/sagmb-2022-0034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/19/2022] [Accepted: 12/19/2023] [Indexed: 01/19/2024]

Liu Z, Turkmen AS, Lin S. Population stratification correction using Bayesian shrinkage priors for genetic association studies. Ann Hum Genet 2023;87:302-315. [PMID: 37771252 DOI: 10.1111/ahg.12527] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 05/18/2022] [Revised: 08/20/2023] [Accepted: 08/24/2023] [Indexed: 09/30/2023]

Devogel N, Auer PL, Manansala R, Wang T. On asymptotic distributions of several test statistics for familial relatedness in linear mixed models. Stat Med 2023;42:2962-2981. [PMID: 37345498 DOI: 10.1002/sim.9762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2021] [Revised: 03/16/2023] [Accepted: 04/26/2023] [Indexed: 06/23/2023]

Li Q, Chen J, Faux P, Delgado ME, Bonfante B, Fuentes-Guajardo M, Mendoza-Revilla J, Chacón-Duque JC, Hurtado M, Villegas V, Granja V, Jaramillo C, Arias W, Barquera R, Everardo-Martínez P, Sánchez-Quinto M, Gómez-Valdés J, Villamil-Ramírez H, Silva de Cerqueira CC, Hünemeier T, Ramallo V, Wu S, Du S, Giardina A, Paria SS, Khokan MR, Gonzalez-José R, Schüler-Faccini L, Bortolini MC, Acuña-Alonzo V, Canizales-Quinteros S, Gallo C, Poletti G, Rojas W, Rothhammer F, Navarro N, Wang S, Adhikari K, Ruiz-Linares A. Automatic landmarking identifies new loci associated with face morphology and implicates Neanderthal introgression in human nasal shape. Commun Biol 2023;6:481. [PMID: 37156940 PMCID: PMC10167347 DOI: 10.1038/s42003-023-04838-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/22/2022] [Accepted: 04/12/2023] [Indexed: 05/10/2023] Open

Affiliation(s)

Qing Li Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Yangpu District, Shanghai, 200438, China
Jieyi Chen Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Yangpu District, Shanghai, 200438, China CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
Pierre Faux Aix-Marseille Université, CNRS, EFS, ADES, Marseille, 13005, France
Miguel Eduardo Delgado Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Yangpu District, Shanghai, 200438, China División Antropología, Facultad de Ciencias Naturales y Museo, Universidad Nacional de La Plata, La Plata, República Argentina Consejo Nacional de Investigaciones Científicas y Técnicas, CONICET, Buenos Aires, República Argentina
Betty Bonfante Aix-Marseille Université, CNRS, EFS, ADES, Marseille, 13005, France
Macarena Fuentes-Guajardo Departamento de Tecnología Médica, Facultad de Ciencias de la Salud, Universidad de Tarapacá, Arica, 1000000, Chile
Javier Mendoza-Revilla Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú Unit of Human Evolutionary Genetics, Institut Pasteur, Paris, 75015, France
J Camilo Chacón-Duque Division of Vertebrates and Anthropology, Department of Earth Sciences, Natural History Museum, London, SW7 5BD, UK
Malena Hurtado Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
Valeria Villegas Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
Vanessa Granja Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
Claudia Jaramillo GENMOL (Genética Molecular), Universidad de Antioquia, Medellín, 5001000, Colombia
William Arias GENMOL (Genética Molecular), Universidad de Antioquia, Medellín, 5001000, Colombia
Rodrigo Barquera Molecular Genetics Laboratory, National School of Anthropology and History, Mexico City, 14050, Mexico, 6600, Mexico Department of Archaeogenetics, Max Planck Institute for the Science of Human History (MPI-SHH), Jena, 07745, Germany
Paola Everardo-Martínez Molecular Genetics Laboratory, National School of Anthropology and History, Mexico City, 14050, Mexico, 6600, Mexico
Mirsha Sánchez-Quinto Forensic Science, Faculty of Medicine, UNAM (Universidad Nacional Autónoma de México), Mexico City, 06320, Mexico
Jorge Gómez-Valdés Molecular Genetics Laboratory, National School of Anthropology and History, Mexico City, 14050, Mexico, 6600, Mexico
Hugo Villamil-Ramírez Unidad de Genomica de Poblaciones Aplicada a la Salud, Facultad de Química, UNAM-Instituto Nacional de Medicina Genómica, Mexico City, 4510, Mexico
Caio C Silva de Cerqueira Scientific Police of São Paulo State, Ourinhos, SP, 19900-109, Brazil
Tábita Hünemeier Departamento de Genética e Biologia Evolutiva, Instituto de Biociências, Universidade de São Paulo, São Paulo, SP, 05508-090, Brazil
Virginia Ramallo Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, 90040-060, Brazil Instituto Patagónico de Ciencias Sociales y Humanas, Centro Nacional Patagónico, CONICET, Puerto Madryn, U9129ACD, Argentina
Sijie Wu Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Yangpu District, Shanghai, 200438, China CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
Siyuan Du CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
Andrea Giardina School of Mathematics and Statistics, Faculty of Science, Technology, Engineering and Mathematics, The Open University, Milton Keynes, MK7 6AA, United Kingdom
Soumya Subhra Paria School of Mathematics and Statistics, Faculty of Science, Technology, Engineering and Mathematics, The Open University, Milton Keynes, MK7 6AA, United Kingdom
Mahfuzur Rahman Khokan School of Mathematics and Statistics, Faculty of Science, Technology, Engineering and Mathematics, The Open University, Milton Keynes, MK7 6AA, United Kingdom
Rolando Gonzalez-José Instituto Patagónico de Ciencias Sociales y Humanas, Centro Nacional Patagónico, CONICET, Puerto Madryn, U9129ACD, Argentina
Lavinia Schüler-Faccini Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, 90040-060, Brazil
Maria-Cátira Bortolini Departamento de Genética, Universidade Federal do Rio Grande do Sul, Porto Alegre, 90040-060, Brazil
Victor Acuña-Alonzo Molecular Genetics Laboratory, National School of Anthropology and History, Mexico City, 14050, Mexico, 6600, Mexico
Samuel Canizales-Quinteros Unidad de Genomica de Poblaciones Aplicada a la Salud, Facultad de Química, UNAM-Instituto Nacional de Medicina Genómica, Mexico City, 4510, Mexico
Carla Gallo Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
Giovanni Poletti Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, 31, Perú
Winston Rojas GENMOL (Genética Molecular), Universidad de Antioquia, Medellín, 5001000, Colombia
Francisco Rothhammer Instituto de Alta Investigación, Universidad de Tarapacá, Arica, Arica, 1000000, Chile
Nicolas Navarro Biogéosciences, UMR 6282 CNRS, Université de Bourgogne, Dijon, 21000, France EPHE, PSL University, Paris, 75014, France
Sijia Wang Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Yangpu District, Shanghai, 200438, China CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
Kaustubh Adhikari School of Mathematics and Statistics, Faculty of Science, Technology, Engineering and Mathematics, The Open University, Milton Keynes, MK7 6AA, United Kingdom. Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London, WC1E 6BT, UK.
Andrés Ruiz-Linares Ministry of Education Key Laboratory of Contemporary Anthropology and Collaborative Innovation Center of Genetics and Development, School of Life Sciences and Human Phenome Institute, Fudan University, Yangpu District, Shanghai, 200438, China. Aix-Marseille Université, CNRS, EFS, ADES, Marseille, 13005, France. Department of Genetics, Evolution and Environment, and UCL Genetics Institute, University College London, London, WC1E 6BT, UK.

Hou Z, Ochoa A. Genetic association models are robust to common population kinship estimation biases. Genetics 2023;224:iyad030. [PMID: 36843304 PMCID: PMC10474929 DOI: 10.1093/genetics/iyad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2022] [Revised: 11/08/2022] [Accepted: 02/17/2023] [Indexed: 02/28/2023] Open

Farooq M, van Dijk AD, Nijveen H, Mansoor S, de Ridder D. Genomic prediction in plants: opportunities for ensemble machine learning based approaches. F1000Res 2023;11:802. [PMID: 37035464 PMCID: PMC10080209 DOI: 10.12688/f1000research.122437.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/04/2023] [Indexed: 01/12/2023] Open

Jiang W, Zhang X, Li S, Song S, Zhao H. An unbiased kinship estimation method for genetic data analysis. BMC Bioinformatics 2022;23:525. [PMID: 36474154 PMCID: PMC9727941 DOI: 10.1186/s12859-022-05082-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/15/2022] [Accepted: 11/25/2022] [Indexed: 12/13/2022] Open

Abstract

Accurate estimate of relatedness is important for genetic data analyses, such as heritability estimation and association mapping based on data collected from genome-wide association studies. Inaccurate relatedness estimates may lead to biased heritability estimations and spurious associations. Individual-level genotype data are often used to estimate kinship coefficient between individuals. The commonly used sample correlation-based genomic relationship matrix (scGRM) method estimates kinship coefficient by calculating the average sample correlation coefficient among all single nucleotide polymorphisms (SNPs), where the observed allele frequencies are used to calculate both the expectations and variances of genotypes. Although this method is widely used, a substantial proportion of estimated kinship coefficients are negative, which are difficult to interpret. In this paper, through mathematical derivation, we show that there indeed exists bias in the estimated kinship coefficient using the scGRM method when the observed allele frequencies are regarded as true frequencies. This leads to negative bias for the average estimate of kinship among all individuals, which explains the estimated negative kinship coefficients. Based on this observation, we propose an unbiased estimation method, UKin, which can reduce kinship estimation bias. We justify our improved method with rigorous mathematical proof. We have conducted simulations as well as two real data analyses to compare UKin with scGRM and three other kinship estimating methods: rGRM, tsGRM, and KING. Our results demonstrate that both bias and root mean square error in kinship coefficient estimation could be reduced by using UKin. We further investigated the performance of UKin, KING, and three GRM-based methods in calculating the SNP-based heritability, and show that UKin can improve estimation accuracy for heritability regardless of the scale of SNP panel.
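
The scGRM calculation described in this abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' UKin code: the function names `sc_grm` and `simulate_unrelated`, the simulated genotypes, and the choice to report genomic-relationship entries without rescaling them to kinship coefficients are all assumptions made here. Because each SNP is centered on its observed allele frequency, the entries of the resulting matrix sum to zero, so with positive diagonals the average off-diagonal entry must be negative, which is consistent with the negative estimates the abstract mentions.

```python
import random


def sc_grm(genotypes):
    """Sample correlation-based relationship matrix (hypothetical helper).

    genotypes: list of n individuals, each a list of m allele counts (0, 1, 2).
    Observed allele frequencies supply both the mean (2p) and the
    variance (2p(1-p)) of each SNP, as the abstract describes.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    grm = [[0.0] * n for _ in range(n)]
    used = 0
    for s in range(m):
        column = [genotypes[i][s] for i in range(n)]
        p = sum(column) / (2.0 * n)      # observed allele frequency
        if p <= 0.0 or p >= 1.0:         # monomorphic SNP carries no signal
            continue
        var = 2.0 * p * (1.0 - p)        # binomial genotype variance
        centered = [x - 2.0 * p for x in column]
        for i in range(n):
            for j in range(n):
                grm[i][j] += centered[i] * centered[j] / var
        used += 1
    if used == 0:
        raise ValueError("no polymorphic SNPs to average over")
    # Average the per-SNP correlations over the SNPs that were used.
    return [[value / used for value in row] for row in grm]


def simulate_unrelated(n, m, seed=0):
    """Simulate n unrelated individuals at m independent SNPs (toy data)."""
    rng = random.Random(seed)
    freqs = [rng.uniform(0.1, 0.9) for _ in range(m)]
    # Each genotype is the sum of two independent allele draws.
    return [[(rng.random() < f) + (rng.random() < f) for f in freqs]
            for _ in range(n)]


genotypes = simulate_unrelated(20, 200)
grm = sc_grm(genotypes)
n = len(grm)
off_diagonal = [grm[i][j] for i in range(n) for j in range(n) if i != j]
print("mean off-diagonal entry:", sum(off_diagonal) / len(off_diagonal))
```

Even though the simulated individuals are unrelated, the printed mean is below zero, because centering on the sample frequencies forces every per-SNP contribution to sum to zero across all pairs.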

Farooq M, van Dijk AD, Nijveen H, Mansoor S, de Ridder D. Genomic prediction in plants: opportunities for ensemble machine learning based approaches. F1000Res 2022;11:802. [PMID: 37035464 PMCID: PMC10080209 DOI: 10.12688/f1000research.122437.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Accepted: 07/08/2022] [Indexed: 12/15/2022] Open

Eizenga GC, Kim H, Jung JKH, Greenberg AJ, Edwards JD, Naredo MEB, Banaticla-Hilario MCN, Harrington SE, Shi Y, Kimball JA, Harper LA, McNally KL, McCouch SR. Phenotypic Variation and the Impact of Admixture in the Oryza rufipogon Species Complex (ORSC). Frontiers in Plant Science 2022;13:787703. [PMID: 35769295 PMCID: PMC9235872 DOI: 10.3389/fpls.2022.787703] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 10/01/2021] [Accepted: 04/13/2022] [Indexed: 06/15/2023]

Abstract

Crop wild relatives represent valuable reservoirs of variation for breeding, but their populations are threatened in natural habitats, are sparsely represented in genebanks, and most are poorly characterized. The focus of this study is the Oryza rufipogon species complex (ORSC), wild progenitor of Asian rice (Oryza sativa L.). The ORSC comprises perennial, annual and intermediate forms which were historically designated as O. rufipogon, O. nivara, and O. sativa f. spontanea (or Oryza spp., an annual form of mixed O. rufipogon/O. nivara and O. sativa ancestry), respectively, based on non-standardized morphological, geographical, and/or ecologically-based species definitions and boundaries. Here, a collection of 240 diverse ORSC accessions, characterized by genotyping-by-sequencing (113,739 SNPs), was phenotyped for 44 traits associated with plant, panicle, and seed morphology in the screenhouse at the International Rice Research Institute, Philippines. These traits included heritable phenotypes often recorded as characterization data by genebanks. Over 100 of these ORSC accessions were also phenotyped in the greenhouse for 18 traits in Stuttgart, Arkansas, and 16 traits in Ithaca, New York, United States. We implemented a Bayesian Gaussian mixture model to infer accession groups from a subset of these phenotypic data and ascertained three phenotype-based group assignments. We used concordance between the genotypic subpopulations and these phenotype-based groups to identify a suite of phenotypic traits that could reliably differentiate the ORSC populations, whether measured in tropical or temperate regions. The traits provide insight into plant morphology, life history (perenniality versus annuality) and mating habit (self- versus cross-pollinated), and are largely consistent with genebank species designations. One phenotypic group contains predominantly O. rufipogon accessions characterized as perennial and largely out-crossing and one contains predominantly O. nivara accessions characterized as annual and largely inbreeding. From these groups, 42 "core" O. rufipogon and 25 "core" O. nivara accessions were identified for domestication studies. The third group, comprising 20% of our collection, has the most accessions identified as Oryza spp. (51.2%) and levels of O. sativa admixture accounting for more than 50% of the genome. This third group is potentially useful as a "pre-breeding" pool for breeders attempting to incorporate novel variation into elite breeding lines.

Seal S, Vu T, Ghosh T, Wrobel J, Ghosh D. DenVar: density-based variation analysis of multiplex imaging data. Bioinformatics Advances 2022;2:vbac039. [PMID: 36699398 PMCID: PMC9710661 DOI: 10.1093/bioadv/vbac039] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 03/04/2022] [Revised: 04/17/2022] [Accepted: 05/18/2022] [Indexed: 02/01/2023]

Wang H, Aragam B, Xing EP. Trade-offs of Linear Mixed Models in Genome-Wide Association Studies. J Comput Biol 2022;29:233-242. [PMID: 35230156 PMCID: PMC8968846 DOI: 10.1089/cmb.2021.0157] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/13/2022] Open

Yang JJ, Luo X, Trucco EM, Buu A. Polygenic risk prediction based on singular value decomposition with applications to alcohol use disorder. BMC Bioinformatics 2022;23:28. [PMID: 35012447 PMCID: PMC8744290 DOI: 10.1186/s12859-022-04566-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/04/2021] [Accepted: 01/05/2022] [Indexed: 12/24/2022] Open

Semagn K, Iqbal M, Alachiotis N, N'Diaye A, Pozniak C, Spaner D. Genetic diversity and selective sweeps in historical and modern Canadian spring wheat cultivars using the 90K SNP array. Sci Rep 2021;11:23773. [PMID: 34893626 PMCID: PMC8664822 DOI: 10.1038/s41598-021-02666-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 06/22/2021] [Accepted: 11/22/2021] [Indexed: 12/14/2022] Open

Khan N, Essemine J, Hamdani S, Qu M, Lyu MJA, Perveen S, Stirbet A, Govindjee G, Zhu XG. Natural variation in the fast phase of chlorophyll a fluorescence induction curve (OJIP) in a global rice minicore panel. Photosynthesis Research 2021;150:137-158. [PMID: 33159615 DOI: 10.1007/s11120-020-00794-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 05/11/2020] [Accepted: 10/26/2020] [Indexed: 06/11/2023]

Abstract

Photosynthesis can be probed through Chlorophyll a fluorescence induction (FI), which provides detailed insight into the electron transfer process in Photosystem II, and beyond. Here, we have systematically studied the natural variation of the fast phase of the FI, i.e. the OJIP phase, in rice. The OJIP phase of the Chl a fluorescence induction curve is referred to as "fast transient" lasting for less than a second; it is obtained after a dark-adapted sample is exposed to saturating light. In the OJIP curve, "O" stands for "origin" (minimal fluorescence), "P" for "peak" (maximum fluorescence), and J and I for inflection points between the O and P levels. Further, F_o is the fluorescence intensity at the "O" level, whereas F_m is the intensity at the P level, and F_v (= F_m - F_o) is the variable fluorescence. We surveyed a set of quantitative parameters derived from the FI curves of 199 rice accessions, grown under both field condition (FC) and growth room condition (GC). Our results show a significant variation between Japonica (JAP) and Indica (IND) subgroups, under both the growth conditions, in almost all the parameters derived from the OJIP curves. The ratio of the variable to the maximum (F_v/F_m) and of the variable to the minimum (F_v/F_o) fluorescence, the performance index (PI_abs), as well as the amplitude of the I-P phase (A_I-P) show higher values in JAP compared to that in the IND subpopulation. In contrast, the amplitude of the O-J phase (A_O-J) and the normalized area above the OJIP curve (S_m) show an opposite trend. The performed genetic analysis shows that plants grown under GC appear much more affected by environmental factors than those grown in the field. We further conducted a genome-wide association study (GWAS) using 11 parameters derived from plants grown in the field. In total, 596 non-unique significant loci based on these parameters were identified by GWAS. 
Several photosynthesis-related proteins were identified to be associated with different OJIP parameters. We found that traits with high correlation are usually associated with similar genomic regions. Specifically, the thermal phase of FI, which includes the amplitudes of the J-I and I-P subphases (A_J-I and A_I-P) of the OJIP curve, is, in turn, associated with certain common genomic regions. Our study is the first one dealing with the natural variations in rice, with the aim to characterize potential candidate genes controlling the magnitude and half-time of each of the phases in the OJIP FI curve.

Fu L, Wang Y, Li T, Hu YQ. A Novel Approach Integrating Hierarchical Clustering and Weighted Combination for Association Study of Multiple Phenotypes and a Genetic Variant. Front Genet 2021;12:654804. [PMID: 34220938 PMCID: PMC8249926 DOI: 10.3389/fgene.2021.654804] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 02/16/2021] [Accepted: 04/20/2021] [Indexed: 11/26/2022] Open

Reisetter AC, Breheny P. Penalized linear mixed models for structured genetic data. Genet Epidemiol 2021;45:427-444. [PMID: 33998038 DOI: 10.1002/gepi.22384] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 12/18/2020] [Revised: 03/19/2021] [Accepted: 03/29/2021] [Indexed: 11/12/2022]

Hoffman GE, Roussos P. Dream: powerful differential expression analysis for repeated measures designs. Bioinformatics 2021;37:192-201. [PMID: 32730587 DOI: 10.1093/bioinformatics/btaa687] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Received: 01/15/2020] [Revised: 07/13/2020] [Accepted: 07/23/2020] [Indexed: 01/08/2023] Open

Depardieu C, Gérardi S, Nadeau S, Parent GJ, Mackay J, Lenz P, Lamothe M, Girardin MP, Bousquet J, Isabel N. Connecting tree-ring phenotypes, genetic associations and transcriptomics to decipher the genomic architecture of drought adaptation in a widespread conifer. Mol Ecol 2021;30:3898-3917. [PMID: 33586257 PMCID: PMC8451828 DOI: 10.1111/mec.15846] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 07/01/2020] [Revised: 01/15/2021] [Accepted: 01/27/2021] [Indexed: 01/02/2023]

Affiliation(s)

Claire Depardieu Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada; Centre for Forest Research, Département des sciences du bois et de la forêt, Université Laval, Québec, QC, Canada; Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Center, Québec, QC, Canada
Sébastien Gérardi Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada; Centre for Forest Research, Département des sciences du bois et de la forêt, Université Laval, Québec, QC, Canada
Simon Nadeau Natural Resources Canada, Canadian Forest Service, Canadian Wood Fibre Center, Québec, QC, Canada
Geneviève J. Parent Laboratory of Genomics, Maurice‐Lamontagne Institute, Fisheries and Oceans Canada, Mont‐Joli, QC, Canada
John Mackay Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada; Department of Plant Sciences, University of Oxford, Oxford, UK
Patrick Lenz Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada; Natural Resources Canada, Canadian Forest Service, Canadian Wood Fibre Center, Québec, QC, Canada
Manuel Lamothe Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada; Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Center, Québec, QC, Canada
Martin P. Girardin Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Center, Québec, QC, Canada; Centre for Forest Research, Université du Québec à Montréal, Montréal, QC, Canada
Jean Bousquet Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada; Centre for Forest Research, Département des sciences du bois et de la forêt, Université Laval, Québec, QC, Canada
Nathalie Isabel Canada Research Chair in Forest Genomics, Institute for Systems and Integrative Biology, Université Laval, Québec, QC, Canada; Centre for Forest Research, Département des sciences du bois et de la forêt, Université Laval, Québec, QC, Canada; Natural Resources Canada, Canadian Forest Service, Laurentian Forestry Center, Québec, QC, Canada

Abegaz F, Van Lishout F, Mahachie John JM, Chiachoompu K, Bhardwaj A, Duroux D, Gusareva ES, Wei Z, Hakonarson H, Van Steen K. Performance of model-based multifactor dimensionality reduction methods for epistasis detection by controlling population structure. BioData Min 2021;14:16. [PMID: 33608043 PMCID: PMC7893746 DOI: 10.1186/s13040-021-00247-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 06/29/2020] [Accepted: 02/07/2021] [Indexed: 12/15/2022] Open

Novel directions in data pre-processing and genome-wide association study (GWAS) methodologies to overcome ongoing challenges. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022] Open

DeVogel N, Auer PL, Manansala R, Rau A, Wang T. A unified linear mixed model for familial relatedness and population structure in genetic association studies. Genet Epidemiol 2020;45:305-315. [PMID: 33175443 DOI: 10.1002/gepi.22371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/03/2020] [Revised: 09/14/2020] [Accepted: 10/20/2020] [Indexed: 11/10/2022]

Wan Y, Wick RR, Zobel J, Ingle DJ, Inouye M, Holt KE. GeneMates: an R package for detecting horizontal gene co-transfer between bacteria using gene-gene associations controlled for population structure. BMC Genomics 2020;21:658. [PMID: 32972363 PMCID: PMC7513276 DOI: 10.1186/s12864-020-07019-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 03/09/2020] [Accepted: 08/20/2020] [Indexed: 12/15/2022] Open

Abstract

Background

Horizontal gene transfer contributes to bacterial evolution through mobilising genes across various taxonomical boundaries. It is frequently mediated by mobile genetic elements (MGEs), which may capture, maintain, and rearrange mobile genes and co-mobilise them between bacteria, causing horizontal gene co-transfer (HGcoT). This physical linkage between mobile genes poses a great threat to public health as it facilitates dissemination and co-selection of clinically important genes amongst bacteria. Although rapid accumulation of bacterial whole-genome sequencing data since the 2000s enables study of HGcoT at the population level, results based on genetic co-occurrence counts and simple association tests are usually confounded by bacterial population structure when sampled bacteria belong to the same species, leading to spurious conclusions.

Results

We have developed a network approach to explore WGS data for evidence of intraspecies HGcoT and have implemented it in R package GeneMates (github.com/wanyuac/GeneMates). The package takes as input an allelic presence-absence matrix of interested genes and a matrix of core-genome single-nucleotide polymorphisms, performs association tests with linear mixed models controlled for population structure, produces a network of significantly associated alleles, and identifies clusters within the network as plausible co-transferred alleles. GeneMates users may choose to score consistency of allelic physical distances measured in genome assemblies using a novel approach we have developed and overlay scores to the network for further evidence of HGcoT. Validation studies of GeneMates on known acquired antimicrobial resistance genes in Escherichia coli and Salmonella Typhimurium show advantages of our network approach over simple association analysis: (1) distinguishing between allelic co-occurrence driven by HGcoT and that driven by clonal reproduction, (2) evaluating effects of population structure on allelic co-occurrence, and (3) direct links between allele clusters in the network and MGEs when physical distances are incorporated.

Conclusion

GeneMates offers an effective approach to detection of intraspecies HGcoT using WGS data.

Abegaz F, Chaichoompu K, Génin E, Fardo DW, König IR, Mahachie John JM, Van Steen K. Principals about principal components in statistical genetics. Brief Bioinform 2020;20:2200-2216. [PMID: 30219892 DOI: 10.1093/bib/bby081] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 01/18/2018] [Revised: 07/21/2018] [Accepted: 08/12/2018] [Indexed: 12/13/2022] Open

Hines O, Diaz-Ordaz K, Vansteelandt S, Jamshidi Y. Causal graphs for the analysis of genetic cohort data. Physiol Genomics 2020;52:369-378. [PMID: 32687429 DOI: 10.1152/physiolgenomics.00115.2019] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 11/22/2022] Open

Low Additive Genetic Variation in a Trait Under Selection in Domesticated Rice. G3-GENES GENOMES GENETICS 2020;10:2435-2443. [PMID: 32439738 PMCID: PMC7341149 DOI: 10.1534/g3.120.401194] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 11/18/2022]

Abstract

Quantitative traits are important targets of both natural and artificial selection. The genetic architecture of these traits and its change during the adaptive process is thus of fundamental interest. The fate of the additive effects of variants underlying a trait receives particular attention because they constitute the genetic variation component that is transferred from parents to offspring and thus governs the response to selection. While estimation of this component of phenotypic variation is challenging, the increasing availability of dense molecular markers puts it within reach. Inbred plant species offer an additional advantage because phenotypes of genetically identical individuals can be measured in replicate. This makes it possible to estimate marker effects separately from the contribution of the genetic background not captured by genotyped loci. We focused on root growth in domesticated rice, Oryza sativa, under normal and aluminum (Al) stress conditions, a trait under recent selection because it correlates with survival under drought. A dense single nucleotide polymorphism (SNP) map is available for all accessions studied. Taking advantage of this map and a set of Bayesian models, we assessed additive marker effects. While total genetic variation accounted for a large proportion of phenotypic variance, marker effects contributed little information, particularly in the Al-tolerant tropical japonica population of rice. We were unable to identify any loci associated with root growth in this population. Models estimating the aggregate effects of all measured genotypes likewise produced low estimates of marker heritability and were unable to predict total genetic values accurately. Our results support the long-standing conjecture that additive genetic variation is depleted in traits under selection. We further provide evidence that this depletion is due to the prevalence of low-frequency alleles that underlie the trait.

Whole-genome genotyping and resequencing reveal the association of a deletion in the complex interferon alpha gene cluster with hypothyroidism in dogs. BMC Genomics 2020;21:307. [PMID: 32299354 PMCID: PMC7160888 DOI: 10.1186/s12864-020-6700-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 11/29/2019] [Accepted: 03/24/2020] [Indexed: 12/30/2022] Open

Hussain W, Campbell MT, Jarquin D, Walia H, Morota G. Variance heterogeneity genome-wide mapping for cadmium in bread wheat reveals novel genomic loci and epistatic interactions. THE PLANT GENOME 2020;13:e20011. [PMID: 33016629 DOI: 10.1002/tpg2.20011] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 07/15/2019] [Accepted: 01/22/2020] [Indexed: 06/11/2023]

Lawson DJ, Davies NM, Haworth S, Ashraf B, Howe L, Crawford A, Hemani G, Davey Smith G, Timpson NJ. Is population structure in the genetic biobank era irrelevant, a challenge, or an opportunity? Hum Genet 2020;139:23-41. [PMID: 31030318 PMCID: PMC6942007 DOI: 10.1007/s00439-019-02014-8] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Received: 11/24/2018] [Accepted: 04/12/2019] [Indexed: 12/11/2022]

Genome-wide association mapping for adult resistance to powdery mildew in common wheat. Mol Biol Rep 2019;47:1241-1256. [PMID: 31813131 DOI: 10.1007/s11033-019-05225-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 09/11/2019] [Accepted: 12/04/2019] [Indexed: 12/23/2022]

Zhang W, Dai X, Xu S, Zhao PX. GPU empowered pipelines for calculating genome-wide kinship matrices with ultra-high dimensional genetic variants and facilitating 1D and 2D GWAS. NAR Genom Bioinform 2019;2:lqz009. [PMID: 33575561 PMCID: PMC7671369 DOI: 10.1093/nargab/lqz009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 05/31/2019] [Revised: 08/22/2019] [Accepted: 09/25/2019] [Indexed: 12/13/2022] Open

Bustos‐Korts D, Dawson IK, Russell J, Tondelli A, Guerra D, Ferrandi C, Strozzi F, Nicolazzi EL, Molnar‐Lang M, Ozkan H, Megyeri M, Miko P, Çakır E, Yakışır E, Trabanco N, Delbono S, Kyriakidis S, Booth A, Cammarano D, Mascher M, Werner P, Cattivelli L, Rossini L, Stein N, Kilian B, Waugh R, van Eeuwijk FA. Exome sequences and multi-environment field trials elucidate the genetic basis of adaptation in barley. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019;99:1172-1191. [PMID: 31108005 PMCID: PMC6851764 DOI: 10.1111/tpj.14414] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 01/15/2019] [Revised: 04/30/2019] [Accepted: 05/13/2019] [Indexed: 05/25/2023]

Affiliation(s)

Daniela Bustos‐Korts Biometris, Wageningen University and Research Centre, PO Box 16, 6700 AC Wageningen, The Netherlands
Ian K. Dawson Cell and Molecular Sciences, James Hutton Institute, Invergowrie, Dundee, UK
Joanne Russell Cell and Molecular Sciences, James Hutton Institute, Invergowrie, Dundee, UK
Alessandro Tondelli CREA – Research Centre for Genomics and Bioinformatics, Via S. Protaso 302, 29017 Fiorenzuola d'Arda, Italy
Davide Guerra CREA – Research Centre for Genomics and Bioinformatics, Via S. Protaso 302, 29017 Fiorenzuola d'Arda, Italy
Chiara Ferrandi PTP Science Park, Via Einstein, Loc. Cascina Codazza, 26900 Lodi, Italy
Francesco Strozzi PTP Science Park, Via Einstein, Loc. Cascina Codazza, 26900 Lodi, Italy
Ezequiel L. Nicolazzi PTP Science Park, Via Einstein, Loc. Cascina Codazza, 26900 Lodi, Italy
Marta Molnar‐Lang Agricultural Institute, Centre for Agricultural Research, Hungarian Academy of Sciences, 2462 Martonvásár, Hungary
Hakan Ozkan University of Çukurova, Faculty of Agriculture, Department of Field Crops, 01330 Adana, Turkey
Maria Megyeri Agricultural Institute, Centre for Agricultural Research, Hungarian Academy of Sciences, 2462 Martonvásár, Hungary
Peter Miko Agricultural Institute, Centre for Agricultural Research, Hungarian Academy of Sciences, 2462 Martonvásár, Hungary
Esra Çakır University of Çukurova, Faculty of Agriculture, Department of Field Crops, 01330 Adana, Turkey
Enes Yakışır Bahri Dagdas International Agricultural Research Institute, Konya, Turkey
Noemi Trabanco Università degli Studi di Milano – DiSAA, Via Celoria 2, 20133 Milano, Italy
Stefano Delbono CREA – Research Centre for Genomics and Bioinformatics, Via S. Protaso 302, 29017 Fiorenzuola d'Arda, Italy
Stylianos Kyriakidis Cell and Molecular Sciences, James Hutton Institute, Invergowrie, Dundee, UK
Allan Booth Cell and Molecular Sciences, James Hutton Institute, Invergowrie, Dundee, UK
Davide Cammarano Cell and Molecular Sciences, James Hutton Institute, Invergowrie, Dundee, UK
Martin Mascher Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, Germany
Peter Werner KWS UK Ltd, 56 Church Street, Thriplow, Royston, SG8 7RE, UK
Luigi Cattivelli CREA – Research Centre for Genomics and Bioinformatics, Via S. Protaso 302, 29017 Fiorenzuola d'Arda, Italy
Laura Rossini Università degli Studi di Milano – DiSAA, Via Celoria 2, 20133 Milano, Italy
Nils Stein Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, Germany
Benjamin Kilian Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466 Seeland, Germany; Present address: Global Crop Diversity Trust, Platz der Vereinten Nationen 7, 53113 Bonn, Germany
Robbie Waugh Cell and Molecular Sciences, James Hutton Institute, Invergowrie, Dundee, UK; Division of Plant Sciences, School of Life Sciences, University of Dundee, Dow Street, Dundee, DD1 5EH, UK
Fred A. van Eeuwijk Biometris, Wageningen University and Research Centre, PO Box 16, 6700 AC Wageningen, The Netherlands

Guo Y, Wu C, Guo M, Zou Q, Liu X, Keinan A. Combining Sparse Group Lasso and Linear Mixed Model Improves Power to Detect Genetic Variants Underlying Quantitative Traits. Front Genet 2019;10:271. [PMID: 31024614 PMCID: PMC6469383 DOI: 10.3389/fgene.2019.00271] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 12/17/2018] [Accepted: 03/12/2019] [Indexed: 11/13/2022] Open

Gianola D, Fernando RL, Garrick DJ. A certain invariance property of BLUE in a whole-genome regression context. J Anim Breed Genet 2019;136:113-117. [PMID: 30614572 PMCID: PMC6850311 DOI: 10.1111/jbg.12378] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 10/22/2018] [Revised: 12/03/2018] [Accepted: 12/06/2018] [Indexed: 11/30/2022]

Wang H, Aragam B, Xing EP. Variable selection in heterogeneous datasets: A truncated-rank sparse linear mixed model with applications to genome-wide association studies. Methods 2018;145:2-9. [PMID: 29705212 DOI: 10.1016/j.ymeth.2018.04.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 01/30/2018] [Revised: 04/14/2018] [Accepted: 04/23/2018] [Indexed: 10/17/2022] Open

Zhu H, Zhang S, Sha Q. A novel method to test associations between a weighted combination of phenotypes and genetic variants. PLoS One 2018;13:e0190788. [PMID: 29329304 PMCID: PMC5766098 DOI: 10.1371/journal.pone.0190788] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Received: 06/18/2017] [Accepted: 12/20/2017] [Indexed: 11/18/2022] Open

Fonseca PAS, Leal TP, Santos FC, Gouveia MH, Id-Lahoucine S, Rosse IC, Ventura RV, Bruneli FAT, Machado MA, Peixoto MGCD, Tarazona-Santos E, Carvalho MRS. Reducing cryptic relatedness in genomic data sets via a central node exclusion algorithm. Mol Ecol Resour 2017;18:435-447. [PMID: 29271609 DOI: 10.1111/1755-0998.12746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/20/2017] [Revised: 12/04/2017] [Accepted: 12/14/2017] [Indexed: 11/30/2022]

Wang H, Aragam B, Xing EP. Variable Selection in Heterogeneous Datasets: A Truncated-rank Sparse Linear Mixed Model with Applications to Genome-wide Association Studies. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2017;2017:431-438. [PMID: 29629235 DOI: 10.1109/bibm.2017.8217687] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Indexed: 01/11/2023]

Meyers-Wallen VN, Boyko AR, Danko CG, Grenier JK, Mezey JG, Hayward JJ, Shannon LM, Gao C, Shafquat A, Rice EJ, Pujar S, Eggers S, Ohnesorg T, Sinclair AH. XX Disorder of Sex Development is associated with an insertion on chromosome 9 and downregulation of RSPO1 in dogs (Canis lupus familiaris). PLoS One 2017;12:e0186331. [PMID: 29053721 PMCID: PMC5650465 DOI: 10.1371/journal.pone.0186331] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 01/05/2017] [Accepted: 09/28/2017] [Indexed: 12/15/2022] Open

Abstract

Remarkable progress has been achieved in understanding the mechanisms controlling sex determination, yet the cause for many Disorders of Sex Development (DSD) remains unknown. Of particular interest is a rare XX DSD subtype in which individuals are negative for SRY, the testis determining factor on the Y chromosome, yet develop testes or ovotestes, and both of these phenotypes occur in the same family. This is a naturally occurring disorder in humans (Homo sapiens) and dogs (C. familiaris). Phenotypes in the canine XX DSD model are strikingly similar to those of the human XX DSD subtype. The purposes of this study were to identify 1) a variant associated with XX DSD in the canine model and 2) gene expression alterations in canine embryonic gonads that could be informative to causation. Using a genome wide association study (GWAS) and whole genome sequencing (WGS), we identified a variant on C. familiaris autosome 9 (CFA9) that is associated with XX DSD in the canine model and in affected purebred dogs. This is the first marker identified for inherited canine XX DSD. It lies upstream of SOX9 within the canine ortholog for the human disorder, which resides on 17q24. Inheritance of this variant indicates that XX DSD is a complex trait in which breed genetic background affects penetrance. Furthermore, the homozygous variant genotype is associated with embryonic lethality in at least one breed. Our analysis of gene expression studies (RNA-seq and PRO-seq) in embryonic gonads at risk of XX DSD from the canine model identified significant RSPO1 downregulation in comparison to XX controls, without significant upregulation of SOX9 or other known testis pathway genes. Based on these data, a novel mechanism is proposed in which molecular lesions acting upstream of RSPO1 induce epigenomic gonadal mosaicism.

Affiliation(s)

Vicki N. Meyers-Wallen Baker Institute for Animal Health, Cornell University, Ithaca, NY, United States of America Department of Biomedical Sciences, Cornell University, Ithaca, NY, United States of America
Adam R. Boyko Department of Biomedical Sciences, Cornell University, Ithaca, NY, United States of America
Charles G. Danko Baker Institute for Animal Health, Cornell University, Ithaca, NY, United States of America Department of Biomedical Sciences, Cornell University, Ithaca, NY, United States of America
Jennifer K. Grenier Department of Biomedical Sciences, Cornell University, Ithaca, NY, United States of America
Jason G. Mezey Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, United States of America Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, United States of America
Jessica J. Hayward Department of Biomedical Sciences, Cornell University, Ithaca, NY, United States of America
Laura M. Shannon Department of Biomedical Sciences, Cornell University, Ithaca, NY, United States of America
Chuan Gao Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, United States of America
Afrah Shafquat Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, NY, United States of America
Edward J. Rice Baker Institute for Animal Health, Cornell University, Ithaca, NY, United States of America
Shashikant Pujar Baker Institute for Animal Health, Cornell University, Ithaca, NY, United States of America
Stefanie Eggers Murdoch Children’s Research Institute, Royal Children's Hospital, Melbourne, VIC, Australia
Thomas Ohnesorg Murdoch Children’s Research Institute, Royal Children's Hospital, Melbourne, VIC, Australia
Andrew H. Sinclair Murdoch Children’s Research Institute, Royal Children's Hospital, Melbourne, VIC, Australia Department of Paediatrics, University of Melbourne, Melbourne, VIC, Australia

Ju JH, Shenoy SA, Crystal RG, Mezey JG. An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci. PLoS Comput Biol 2017;13:e1005537. [PMID: 28505156 PMCID: PMC5448815 DOI: 10.1371/journal.pcbi.1005537] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Received: 11/22/2016] [Revised: 05/30/2017] [Accepted: 04/28/2017] [Indexed: 11/19/2022] Open

Abstract

Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly valuable for network modeling and disease analysis. To enable the identification of such broad impact eQTL, we introduce CONFETI: Confounding Factor Estimation Through Independent component analysis. CONFETI is designed to address two conflicting issues when searching for broad impact eQTL: the need to account for non-genetic confounding factors that can lower the power of the analysis or produce broad impact eQTL false positives, and the tendency of methods that account for confounding factors to model broad impact eQTL as non-genetic variation. The key advance of the CONFETI framework is the use of Independent Component Analysis (ICA) to identify variation likely caused by broad impact eQTL when constructing the sample covariance matrix used for the random effect in a mixed model. We show that CONFETI has better performance than other mixed model confounding factor methods when considering broad impact eQTL recovery from synthetic data. We also used the CONFETI framework and these same confounding factor methods to identify eQTL that replicate between matched twin pair datasets in the Multiple Tissue Human Expression Resource (MuTHER), the Depression Genes Networks study (DGN), the Netherlands Study of Depression and Anxiety (NESDA), and multiple tissue types in the Genotype-Tissue Expression (GTEx) consortium. These analyses identified both cis-eQTL and trans-eQTL impacting individual genes, and CONFETI had better or comparable performance to other mixed model confounding factor analysis methods when identifying such eQTL. In these analyses, we were able to identify and replicate a few broad impact eQTL although the overall number was small even when applying CONFETI. 
In light of these results, we discuss the broad impact eQTL that have been previously reported from the analysis of human data and suggest that considerable caution should be exercised when making biological inferences based on these reported eQTL.

N’Diaye A, Haile JK, Cory AT, Clarke FR, Clarke JM, Knox RE, Pozniak CJ. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map. PLoS One 2017;12:e0170941. [PMID: 28135299 PMCID: PMC5279799 DOI: 10.1371/journal.pone.0170941] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Received: 11/03/2016] [Accepted: 01/12/2017] [Indexed: 12/30/2022] Open

Abstract

Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. 
These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat.

Dandine-Roulland C, Perdry H. The Use of the Linear Mixed Model in Human Genetics. Hum Hered 2016;80:196-206. [PMID: 27576760 DOI: 10.1159/000447634] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Indexed: 11/19/2022] Open

dos Santos JPR, Pires LPM, de Castro Vasconcellos RC, Pereira GS, Von Pinho RG, Balestre M. Genomic selection to resistance to Stenocarpella maydis in maize lines using DArTseq markers. BMC Genet 2016;17:86. [PMID: 27316946 PMCID: PMC4912722 DOI: 10.1186/s12863-016-0392-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Received: 02/22/2016] [Accepted: 06/07/2016] [Indexed: 12/30/2022] Open

Phelan J, Coll F, McNerney R, Ascher DB, Pires DEV, Furnham N, Coeck N, Hill-Cawthorne GA, Nair MB, Mallard K, Ramsay A, Campino S, Hibberd ML, Pain A, Rigouts L, Clark TG. Mycobacterium tuberculosis whole genome sequencing and protein structure modelling provides insights into anti-tuberculosis drug resistance. BMC Med 2016;14:31. [PMID: 27005572 PMCID: PMC4804620 DOI: 10.1186/s12916-016-0575-9] [Citation(s) in RCA: 87] [Impact Index Per Article: 10.9] [Received: 12/07/2015] [Accepted: 02/02/2016] [Indexed: 12/21/2022] Open

Abstract

BACKGROUND

Combating the spread of drug resistant tuberculosis is a global health priority. Whole genome association studies are being applied to identify genetic determinants of resistance to anti-tuberculosis drugs. Protein structure and interaction modelling are used to understand the functional effects of putative mutations and provide insight into the molecular mechanisms leading to resistance.

METHODS

To investigate the potential utility of these approaches, we analysed the genomes of 144 Mycobacterium tuberculosis clinical isolates from The Special Programme for Research and Training in Tropical Diseases (TDR) collection sourced from 20 countries in four continents. A genome-wide approach was applied to 127 isolates to identify polymorphisms associated with minimum inhibitory concentrations for first-line anti-tuberculosis drugs. In addition, the effect of identified candidate mutations on protein stability and interactions was assessed quantitatively with well-established computational methods.

RESULTS

The analysis revealed that mutations in the genes rpoB (rifampicin), katG (isoniazid), inhA-promoter (isoniazid), rpsL (streptomycin) and embB (ethambutol) were responsible for the majority of resistance observed. A subset of the mutations identified in rpoB and katG were predicted to affect protein stability. Further, a strong direct correlation was observed between the minimum inhibitory concentration values and the distance of the mutated residues in the three-dimensional structures of rpoB and katG to their respective drugs binding sites.

CONCLUSIONS

Using the TDR resource, we demonstrate the usefulness of whole genome association and convergent evolution approaches to detect known and potentially novel mutations associated with drug resistance. Further, protein structural modelling could provide a means of predicting the impact of polymorphisms on drug efficacy in the absence of phenotypic data. These approaches could ultimately lead to novel resistance mutations to improve the design of tuberculosis control measures, such as diagnostics, and inform patient management.

Affiliation(s)

Jody Phelan Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
Francesc Coll Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
Ruth McNerney Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK; University of Cape Town Lung Institute, Lung Infection & Immunity Unit, Old Main Building, Groote Schuur Hospital, Observatory, Cape Town, 7925, South Africa
David B Ascher Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK
Douglas E V Pires Centro de Pesquisas René Rachou, Fundação Oswaldo Cruz, Avenida Augusto de Lima 1715, Belo Horizonte, 30190-002, Brazil
Nick Furnham Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
Nele Coeck Mycobacteriology Unit, Institute of Tropical Medicine, Antwerp, Belgium
Grant A Hill-Cawthorne Pathogen Genomics Laboratory, BESE Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia; Sydney Emerging Infections and Biosecurity Institute and School of Public Health, Sydney Medical School, University of Sydney, Sydney, NSW, 2006, Australia
Mridul B Nair Pathogen Genomics Laboratory, BESE Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Kim Mallard Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
Andrew Ramsay Special Programme for Research and Training in Tropical Diseases (TDR), World Health Organisation, Geneva, Switzerland
Susana Campino Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
Martin L Hibberd Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
Arnab Pain Pathogen Genomics Laboratory, BESE Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
Leen Rigouts Mycobacteriology Unit, Institute of Tropical Medicine, Antwerp, Belgium; Department of Biomedical Sciences, Antwerp University, Antwerp, Belgium
Taane G Clark Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK; Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, UK; Department of Pathogen Molecular Biology, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, Keppel Street, London, UK

Bianchi M, Dahlgren S, Massey J, Dietschi E, Kierczak M, Lund-Ziener M, Sundberg K, Thoresen SI, Kämpe O, Andersson G, Ollier WER, Hedhammar Å, Leeb T, Lindblad-Toh K, Kennedy LJ, Lingaas F, Rosengren Pielberg G. A Multi-Breed Genome-Wide Association Analysis for Canine Hypothyroidism Identifies a Shared Major Risk Locus on CFA12. PLoS One 2015;10:e0134720. [PMID: 26261983 PMCID: PMC4532498 DOI: 10.1371/journal.pone.0134720] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Received: 03/30/2015] [Accepted: 07/13/2015] [Indexed: 01/12/2023] Open

Affiliation(s)

Matteo Bianchi Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
Stina Dahlgren Department of Basic Sciences and Aquatic Medicine, Norwegian University of Life Sciences, Oslo, Norway
Jonathan Massey Centre for Integrated Genomic Medical Research, The University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
Elisabeth Dietschi Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland
Marcin Kierczak Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
Martine Lund-Ziener Department of Basic Sciences and Aquatic Medicine, Norwegian University of Life Sciences, Oslo, Norway
Katarina Sundberg Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
Stein Istre Thoresen Department of Basic Sciences and Aquatic Medicine, Norwegian University of Life Sciences, Oslo, Norway
Olle Kämpe Department of Medicine (Solna), Karolinska Institutet, Stockholm, Sweden
Göran Andersson Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Uppsala, Sweden
William E. R. Ollier Centre for Integrated Genomic Medical Research, The University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
Åke Hedhammar Department of Clinical Sciences, Swedish University of Agricultural Sciences, Uppsala, Sweden
Tosso Leeb Institute of Genetics, Vetsuisse Faculty, University of Bern, Bern, Switzerland
Kerstin Lindblad-Toh Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden; Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
Lorna J. Kennedy Centre for Integrated Genomic Medical Research, The University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
Frode Lingaas Department of Basic Sciences and Aquatic Medicine, Norwegian University of Life Sciences, Oslo, Norway
Gerli Rosengren Pielberg Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden

Zhang Y, Pan W. Principal component regression and linear mixed model in association analysis of structured samples: competitors or complements? Genet Epidemiol 2014;39:149-55. [PMID: 25536929 DOI: 10.1002/gepi.21879] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Received: 09/16/2014] [Revised: 11/11/2014] [Accepted: 11/11/2014] [Indexed: 11/10/2022]

Abstract

Genome-wide association studies (GWAS) have been established as a major tool to identify genetic variants associated with complex traits, such as common diseases. However, GWAS may suffer from false positives and false negatives due to confounding population structures, including known or unknown relatedness. Another important issue is unmeasured environmental risk factors. Among many methods for adjusting for population structures, two approaches stand out: one is principal component regression (PCR) based on principal component analysis, which is perhaps the most popular due to its early appearance, simplicity, and general effectiveness; the other is based on a linear mixed model (LMM) that has emerged recently as perhaps the most flexible and effective, especially for samples with complex structures as in model organisms. As shown previously, the PCR approach can be regarded as an approximation to an LMM; such an approximation depends on the number of the top principal components (PCs) used, the choice of which is often difficult in practice. Hence, in the presence of population structure, the LMM appears to outperform the PCR method. However, due to the different treatments of fixed vs. random effects in the two approaches, we show an advantage of PCR over LMM: in the presence of an unknown but spatially confined environmental confounder (e.g., environmental pollution or lifestyle), the PCs may be able to implicitly and effectively adjust for the confounder whereas the LMM cannot. Accordingly, to adjust for both population structures and nongenetic confounders, we propose a hybrid method combining the use and, thus, strengths of PCR and LMM. We use real genotype data and simulated phenotypes to confirm the above points, and establish the superior performance of the hybrid method across all scenarios.

Widmer C, Lippert C, Weissbrod O, Fusi N, Kadie C, Davidson R, Listgarten J, Heckerman D. Further improvements to linear mixed models for genome-wide association studies. Sci Rep 2014;4:6874. [PMID: 25387525 PMCID: PMC4230738 DOI: 10.1038/srep06874] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Received: 08/09/2013] [Accepted: 10/14/2014] [Indexed: 11/09/2022] Open

Hoffman GE, Mezey JG, Schadt EE. lrgpr: interactive linear mixed model analysis of genome-wide association studies with composite hypothesis testing and regression diagnostics in R. Bioinformatics 2014;30:3134-5. [PMID: 25035399 DOI: 10.1093/bioinformatics/btu435] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Indexed: 01/10/2023]

Affiliation(s)

Gabriel E Hoffman Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA
Jason G Mezey Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA
Eric E Schadt Department of Genetics and Genomic Sciences, Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA, Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, USA and Department of Genetic Medicine, Weill Cornell Medical College, New York, NY, USA
