1
|
Rönneburg T, Zan Y, Honaker CF, Siegel PB, Carlborg Ö. Low-coverage sequencing in a deep intercross of the Virginia body weight lines provides insight to the polygenic genetic architecture of growth: novel loci revealed by increased power and improved genome-coverage. Poult Sci 2022; 102:102203. [PMID: 36907123 PMCID: PMC10024170 DOI: 10.1016/j.psj.2022.102203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 07/05/2022] [Accepted: 08/24/2022] [Indexed: 11/29/2022] Open
Abstract
Genetic dissection of highly polygenic traits is a challenge, in part due to the power necessary to confidently identify loci with minor effects. Experimental crosses are valuable resources for mapping such traits. Traditionally, genome-wide analyses of experimental crosses have targeted major loci using data from a single generation (often the F2) with individuals from later generations being generated for replication and fine-mapping. Here, we aim to confidently identify minor-effect loci contributing to the highly polygenic basis of the long-term, bi-directional selection responses for 56-d body weight in the Virginia body weight chicken lines. To achieve this, a strategy was developed to make use of data from all generations (F2-F18) of the advanced intercross line, developed by crossing the low and high selected lines after 40 generations of selection. A cost-efficient low-coverage sequencing based approach was used to obtain high-confidence genotypes in 1Mb bins across 99.3% of the chicken genome for >3,300 intercross individuals. In total, 12 genome-wide significant, and 30 additional suggestive QTL reaching a 10% FDR threshold, were mapped for 56-d body weight. Only 2 of these QTL reached genome-wide significance in earlier analyses of the F2 generation. The minor-effect QTL mapped here were generally due to an overall increase in power by integrating data across generations, with contributions from increased genome-coverage and improved marker information content. The 12 significant QTL explain >37% of the difference between the parental lines, three times more than 2 previously reported significant QTL. The 42 significant and suggestive QTL together explain >80%. Making integrated use of all available samples from multiple generations in experimental crosses are economically feasible using the low-cost, sequencing-based genotyping strategies outlined here. Our empirical results illustrate the value of this strategy for mapping novel minor-effect loci contributing to complex traits to provide a more confident, comprehensive view of the individual loci that form the genetic basis of the highly polygenic, long-term selection responses for 56-d body weight in the Virginia body weight chicken lines.
Collapse
Affiliation(s)
- T Rönneburg
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Y Zan
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - C F Honaker
- Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State University, Blacksburg VA, USA
| | - P B Siegel
- Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State University, Blacksburg VA, USA
| | - Ö Carlborg
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.
| |
Collapse
|
2
|
Radcliffe RA, Dowell R, Odell AT, Richmond PA, Bennett B, Larson C, Kechris K, Saba LM, Rudra P, Wen S. Systems genetics analysis of the LXS recombinant inbred mouse strains:Genetic and molecular insights into acute ethanol tolerance. PLoS One 2020; 15:e0240253. [PMID: 33095786 PMCID: PMC7584226 DOI: 10.1371/journal.pone.0240253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 09/22/2020] [Indexed: 11/18/2022] Open
Abstract
We have been using the Inbred Long- and Short-Sleep mouse strains (ILS, ISS) and a recombinant inbred panel derived from them, the LXS, to investigate the genetic underpinnings of acute ethanol tolerance which is considered to be a risk factor for alcohol use disorders (AUDs). Here, we have used RNA-seq to examine the transcriptome of whole brain in 40 of the LXS strains 8 hours after a saline or ethanol "pretreatment" as in previous behavioral studies. Approximately 1/3 of the 14,184 expressed genes were significantly heritable and many were unique to the pretreatment. Several thousand cis- and trans-eQTLs were mapped; a portion of these also were unique to pretreatment. Ethanol pretreatment caused differential expression (DE) of 1,230 genes. Gene Ontology (GO) enrichment analysis suggested involvement in numerous biological processes including astrocyte differentiation, histone acetylation, mRNA splicing, and neuron projection development. Genetic correlation analysis identified hundreds of genes that were correlated to the behaviors. GO analysis indicated that these genes are involved in gene expression, chromosome organization, and protein transport, among others. The expression profiles of the DE genes and genes correlated to AFT in the ethanol pretreatment group (AFT-Et) were found to be similar to profiles of HDAC inhibitors. Hdac1, a cis-regulated gene that is located at the peak of a previously mapped QTL for AFT-Et, was correlated to 437 genes, most of which were also correlated to AFT-Et. GO analysis of these genes identified several enriched biological process terms including neuron-neuron synaptic transmission and potassium transport. In summary, the results suggest widespread genetic effects on gene expression, including effects that are pretreatment-specific. A number of candidate genes and biological functions were identified that could be mediating the behavioral responses. The most prominent of these was Hdac1 which may be regulating genes associated with glutamatergic signaling and potassium conductance.
Collapse
Affiliation(s)
- Richard A. Radcliffe
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder CO, United States of America
| | - Robin Dowell
- BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, United States of America
- Department of Molecular, Cellular, and Developmental Biology, University of Colorado Boulder, Boulder, CO, United States of America
- Department of Computer Science, University of Colorado Boulder, Boulder, CO, United States of America
| | - Aaron T. Odell
- BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, United States of America
| | - Phillip A. Richmond
- BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, United States of America
| | - Beth Bennett
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
| | - Colin Larson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
| | - Katerina Kechris
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
| | - Laura M. Saba
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
| | - Pratyaydipta Rudra
- Department of Statistics, Oklahoma State University, Stillwater, OK, United States of America
| | - Shi Wen
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
| |
Collapse
|
3
|
Abstract
Octoploid strawberry (Fragaria ×ananassa) is a valuable specialty crop, but profitable production and availability are threatened by many pathogens. Efforts to identify and introgress useful disease resistance genes (R-genes) in breeding programs are complicated by strawberry’s complex octoploid genome. Recently-developed resources in strawberry, including a complete octoploid reference genome and high-resolution octoploid genotyping, enable new analyses in strawberry disease resistance genetics. This study characterizes the complete R-gene collection in the genomes of commercial octoploid strawberry and two diploid ancestral relatives, and introduces several new technological and data resources for strawberry disease resistance research. These include octoploid R-gene transcription profiling, dN/dS analysis, expression quantitative trait loci (eQTL) analysis and RenSeq analysis in cultivars. Octoploid fruit eQTL were identified for 76 putative R-genes. R-genes from the ancestral diploids Fragaria vesca and Fragaria iinumae were compared, revealing differential inheritance and retention of various octoploid R-gene subtypes. The mode and magnitude of natural selection of individual F. ×ananassa R-genes was also determined via dN/dS analysis. R-gene sequencing using enriched libraries (RenSeq) has been used recently for R-gene discovery in many crops, however this technique somewhat relies upon a priori knowledge of desired sequences. An octoploid strawberry capture-probe panel, derived from the results of this study, is validated in a RenSeq experiment and is presented for community use. These results give unprecedented insight into crop disease resistance genetics, and represent an advance toward exploiting variation for strawberry cultivar improvement.
Collapse
|
4
|
Ashbrook DG, Mulligan MK, Williams RW. Post-genomic behavioral genetics: From revolution to routine. GENES, BRAIN, AND BEHAVIOR 2018; 17:e12441. [PMID: 29193773 PMCID: PMC5876106 DOI: 10.1111/gbb.12441] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Revised: 11/02/2017] [Accepted: 11/20/2017] [Indexed: 12/16/2022]
Abstract
What was once expensive and revolutionary-full-genome sequence-is now affordable and routine. Costs will continue to drop, opening up new frontiers in behavioral genetics. This shift in costs from the genome to the phenome is most notable in large clinical studies of behavior and associated diseases in cohorts that exceed hundreds of thousands of subjects. Examples include the Women's Health Initiative (www.whi.org), the Million Veterans Program (www. RESEARCH va.gov/MVP), the 100 000 Genomes Project (genomicsengland.co.uk) and commercial efforts such as those by deCode (www.decode.com) and 23andme (www.23andme.com). The same transition is happening in experimental neuro- and behavioral genetics, and sample sizes of many hundreds of cases are becoming routine (www.genenetwork.org, www.mousephenotyping.org). There are two major consequences of this new affordability of massive omics datasets: (1) it is now far more practical to explore genetic modulation of behavioral differences and the key role of gene-by-environment interactions. Researchers are already doing the hard part-the quantitative analysis of behavior. Adding the omics component can provide powerful links to molecules, cells, circuits and even better treatment. (2) There is an acute need to highlight and train behavioral scientists in how best to exploit new omics approaches. This review addresses this second issue and highlights several new trends and opportunities that will be of interest to experts in animal and human behaviors.
Collapse
Affiliation(s)
- D G Ashbrook
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Sciences Center, College of Medicine, Memphis, Tennessee
| | - M K Mulligan
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Sciences Center, College of Medicine, Memphis, Tennessee
| | - R W Williams
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Sciences Center, College of Medicine, Memphis, Tennessee
| |
Collapse
|
5
|
Zych K, Snoek BL, Elvin M, Rodriguez M, Van der Velde KJ, Arends D, Westra HJ, Swertz MA, Poulin G, Kammenga JE, Breitling R, Jansen RC, Li Y. reGenotyper: Detecting mislabeled samples in genetic data. PLoS One 2017; 12:e0171324. [PMID: 28192439 PMCID: PMC5305221 DOI: 10.1371/journal.pone.0171324] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2016] [Accepted: 01/19/2017] [Indexed: 12/11/2022] Open
Abstract
In high-throughput molecular profiling studies, genotype labels can be wrongly assigned at various experimental steps; the resulting mislabeled samples seriously reduce the power to detect the genetic basis of phenotypic variation. We have developed an approach to detect potential mislabeling, recover the “ideal” genotype and identify “best-matched” labels for mislabeled samples. On average, we identified 4% of samples as mislabeled in eight published datasets, highlighting the necessity of applying a “data cleaning” step before standard data analysis.
Collapse
Affiliation(s)
- Konrad Zych
- Groningen Bioinformatics Centre, University of Groningen, Groningen, The Netherlands
| | - Basten L. Snoek
- Laboratory of Nematology, Wageningen University, Wageningen, The Netherlands
| | - Mark Elvin
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Miriam Rodriguez
- Laboratory of Nematology, Wageningen University, Wageningen, The Netherlands
| | - K. Joeri Van der Velde
- Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Danny Arends
- Groningen Bioinformatics Centre, University of Groningen, Groningen, The Netherlands
| | - Harm-Jan Westra
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
- Partners Center for Personalized Genetic Medicine, Boston, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - Morris A. Swertz
- Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Gino Poulin
- Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Jan E. Kammenga
- Laboratory of Nematology, Wageningen University, Wageningen, The Netherlands
| | - Rainer Breitling
- Manchester Institute of Biotechnology, Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
| | - Ritsert C. Jansen
- Groningen Bioinformatics Centre, University of Groningen, Groningen, The Netherlands
| | - Yang Li
- Groningen Bioinformatics Centre, University of Groningen, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Genetics, Groningen, The Netherlands
- * E-mail:
| |
Collapse
|
6
|
Chintalapudi SR, Jablonski MM. Systems Genetics Analysis to Identify the Genetic Modulation of a Glaucoma-Associated Gene. Methods Mol Biol 2017; 1488:391-417. [PMID: 27933535 DOI: 10.1007/978-1-4939-6427-7_18] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Loss of retinal ganglion cells (RGCs) is one of the hallmarks of retinal neurodegenerative diseases, glaucoma being one of the most common. Recently, γ-synuclein (SNCG) was shown to be highly expressed in the somas and axons of RGCs. In various mouse models of glaucoma, downregulation of Sncg gene expression correlates with RGC loss. To investigate the regulation of Sncg in RGCs, we used a systems genetics approach to identify a gene that modulates the expression of Sncg, followed by confirmatory studies in both healthy and diseased retinas. We found that chromosome 1 harbors an eQTL that modulates the expression of Sncg in the mouse retina and identified Pfdn2 as the candidate upstream modulator of Sncg expression. Downregulation of Pfdn2 in enriched RGCs causes a concomitant reduction in Sncg. In this chapter, we describe our strategy and methods for identifying and confirming a genetic modulation of a glaucoma-associated gene. A similar method can be applied to other genes expressed in other tissues.
Collapse
Affiliation(s)
- Sumana R Chintalapudi
- Department of Anatomy and Neurobiology, Hamilton Eye Institute, The University of Tennessee Health Science Center, 930 Madison Ave., Suite 710, Memphis, TN, 38163, USA
| | - Monica M Jablonski
- Department of Ophthalmology, Hamilton Eye Institute, The University of Tennessee Health Science Center, 930 Madison Ave., Suite 710, Memphis, TN, 38163, USA. .,Department of Anatomy and Neurobiology, Hamilton Eye Institute, The University of Tennessee Health Science Center, 930 Madison Ave., Suite 710, Memphis, TN, 38163, USA. .,Department of Pharmaceutical Sciences, Hamilton Eye Institute, The University of Tennessee Health Science Center, 930 Madison Ave., Suite 710, Memphis, TN, 38163, USA.
| |
Collapse
|
7
|
Dowell R, Odell A, Richmond P, Malmer D, Halper-Stromberg E, Bennett B, Larson C, Leach S, Radcliffe RA. Genome characterization of the selected long- and short-sleep mouse lines. Mamm Genome 2016; 27:574-586. [PMID: 27651241 PMCID: PMC5110614 DOI: 10.1007/s00335-016-9663-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Accepted: 08/22/2016] [Indexed: 01/29/2023]
Abstract
The Inbred Long- and Short-Sleep (ILS, ISS) mouse lines were selected for differences in acute ethanol sensitivity using the loss of righting response (LORR) as the selection trait. The lines show an over tenfold difference in LORR and, along with a recombinant inbred panel derived from them (the LXS), have been widely used to dissect the genetic underpinnings of acute ethanol sensitivity. Here we have sequenced the genomes of the ILS and ISS to investigate the DNA variants that contribute to their sensitivity difference. We identified ~2.7 million high-confidence SNPs and small indels and ~7000 structural variants between the lines; variants were found to occur in 6382 annotated genes. Using a hidden Markov model, we were able to reconstruct the genome-wide ancestry patterns of the eight inbred progenitor strains from which the ILS and ISS were derived, and found that quantitative trait loci that have been mapped for LORR were slightly enriched for DNA variants. Finally, by mapping and quantifying RNA-seq reads from the ILS and ISS to their strain-specific genomes rather than to the reference genome, we found a substantial improvement in a differential expression analysis between the lines. This work will help in identifying and characterizing the DNA sequence variants that contribute to the difference in ethanol sensitivity between the ILS and ISS and will also aid in accurate quantification of RNA-seq data generated from the LXS RIs.
Collapse
Affiliation(s)
- Robin Dowell
- BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, 80309, USA. .,Department of Molecular, Cellular, and Developmental Biology, University of Colorado Boulder, Boulder, CO, 80309, USA. .,Department of Computer Science, University of Colorado Boulder, Boulder, CO, 80309, USA.
| | - Aaron Odell
- BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, 80309, USA
| | - Phillip Richmond
- BioFrontiers Institute, University of Colorado Boulder, Boulder, CO, 80309, USA.,Department of Molecular, Cellular, and Developmental Biology, University of Colorado Boulder, Boulder, CO, 80309, USA
| | - Daniel Malmer
- Department of Computer Science, University of Colorado Boulder, Boulder, CO, 80309, USA
| | - Eitan Halper-Stromberg
- Center for Genes, Environment and Health, National Jewish Health, Denver, CO, 80206, USA
| | - Beth Bennett
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado, Aurora, CO, 80045, USA
| | - Colin Larson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado, Aurora, CO, 80045, USA
| | - Sonia Leach
- Center for Genes, Environment and Health, National Jewish Health, Denver, CO, 80206, USA
| | - Richard A Radcliffe
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado, Aurora, CO, 80045, USA.
| |
Collapse
|
8
|
Joo JWJ, Sul JH, Han B, Ye C, Eskin E. Effectively identifying regulatory hotspots while capturing expression heterogeneity in gene expression studies. Genome Biol 2014; 15:r61. [PMID: 24708878 PMCID: PMC4053820 DOI: 10.1186/gb-2014-15-4-r61] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2013] [Accepted: 04/07/2014] [Indexed: 02/03/2023] Open
Abstract
Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods.
Collapse
|
9
|
Abstract
Transcriptome studies have revealed a surprisingly high level of variation among individuals in expression of key genes in the CNS under both normal and experimental conditions. Ten-fold variation is common, yet the specific causes and consequences of this variation are largely unknown. By combining classic gene mapping methods-family linkage studies and genomewide association-with high-throughput genomics, it is now possible to define quantitative trait loci (QTLs), single-gene variants, and even single SNPs and indels that control gene expression in different brain regions and cells. This review considers some of the major technical and conceptual challenges in analyzing variation in expression in the CNS with a focus on mRNAs, rather than noncoding RNAs or proteins. At one level of analysis, this work has been highly successful, and we finally have techniques that can be used to track down small numbers of loci that control expression in the CNS. But at a higher level of analysis, we still do not understand the genetic architecture of gene expression in brain, the consequences of expression QTLs on protein levels or on cell function, or the combined impact of expression differences on behavior and disease risk. These important gaps are likely to be bridged over the next several decades using (1) much larger sample sizes, (2) more powerful RNA sequencing and proteomic methods, and (3) novel statistical and computational models to predict genome-to-phenome relations.
Collapse
Affiliation(s)
- Ashutosh K Pandey
- Department of Genetics, Genomics and Informatics, Center for Integrative and Translational Genomics, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| | - Robert W Williams
- Department of Genetics, Genomics and Informatics, Center for Integrative and Translational Genomics, University of Tennessee Health Science Center, Memphis, Tennessee, USA.
| |
Collapse
|
10
|
Hitzemann R, Darakjian P, Walter N, Iancu OD, Searles R, McWeeney S. Introduction to sequencing the brain transcriptome. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2014; 116:1-19. [PMID: 25172469 DOI: 10.1016/b978-0-12-801105-8.00001-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
High-throughput next-generation sequencing is now entering its second decade. However, it was not until 2008 that the first report of sequencing the brain transcriptome appeared (Mortazavi, Williams, Mccue, Schaeffer, & Wold, 2008). These authors compared short-read RNA-Seq data for mouse whole brain with microarray results for the same sample and noted both the advantages and disadvantages of the RNA-Seq approach. While RNA-Seq provided exon level resolution, the majority of the reads were provided by a small proportion of highly expressed genes and the data analysis was exceedingly complex. Over the past 6 years, there have been substantial improvements in both RNA-Seq technology and data analysis. This volume contains 11 chapters that detail various aspects of sequencing the brain transcriptome. Some of the chapters are very methods driven, while others focus on the use of RNA-Seq to study such diverse areas as development, schizophrenia, and drug abuse. This chapter briefly reviews the transition from microarrays to RNA-Seq as the preferred method for analyzing the brain transcriptome. Compared with microarrays, RNA-Seq has a greater dynamic range, detects both coding and noncoding RNAs, is superior for gene network construction, detects alternative spliced transcripts, and can be used to extract genotype information, e.g., nonsynonymous coding single nucleotide polymorphisms. RNA-Seq embraces the complexity of the brain transcriptome and provides a mechanism to understand the underlying regulatory code; the potential to inform the brain-behavior-disease relationships is substantial.
Collapse
Affiliation(s)
- Robert Hitzemann
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon, USA; Research Service, Veterans Affairs Medical Center, Portland, Oregon, USA.
| | - Priscila Darakjian
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon, USA
| | - Nikki Walter
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon, USA; Research Service, Veterans Affairs Medical Center, Portland, Oregon, USA
| | - Ovidiu Dan Iancu
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, Oregon, USA
| | - Robert Searles
- Integrative Genomics Laboratory, Oregon Health & Science University, Portland, Oregon, USA
| | - Shannon McWeeney
- Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, Oregon, USA; Division of Biostatistics, Public Health & Preventative Medicine, Oregon Health & Science University, Portland, Oregon, USA
| |
Collapse
|
11
|
Hitzemann R, Bottomly D, Iancu O, Buck K, Wilmot B, Mooney M, Searles R, Zheng C, Belknap J, Crabbe J, McWeeney S. The genetics of gene expression in complex mouse crosses as a tool to study the molecular underpinnings of behavior traits. Mamm Genome 2013; 25:12-22. [PMID: 24374554 PMCID: PMC3916704 DOI: 10.1007/s00335-013-9495-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2013] [Accepted: 11/25/2013] [Indexed: 02/06/2023]
Abstract
Complex Mus musculus crosses provide increased resolution to examine the relationships between gene expression and behavior. While the advantages are clear, there are numerous analytical and technological concerns that arise from the increased genetic complexity that must be considered. Each of these issues is discussed, providing an initial framework for complex cross study design and planning.
Collapse
Affiliation(s)
- Robert Hitzemann
- Portland Alcohol Research Center, Veterans Affairs Medical Center, Portland, 97239, OR, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Hitzemann R, Bottomly D, Darakjian P, Walter N, Iancu O, Searles R, Wilmot B, McWeeney S. Genes, behavior and next-generation RNA sequencing. GENES, BRAIN, AND BEHAVIOR 2013; 12:1-12. [PMID: 23194347 PMCID: PMC6050050 DOI: 10.1111/gbb.12007] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2012] [Revised: 10/31/2012] [Accepted: 11/21/2012] [Indexed: 12/30/2022]
Abstract
Advances in next-generation sequencing suggest that RNA-Seq is poised to supplant microarray-based approaches for transcriptome analysis. This article briefly reviews the use of microarrays in the brain-behavior context and then illustrates why RNA-Seq is a superior strategy. Compared with microarrays, RNA-Seq has a greater dynamic range, detects both coding and noncoding RNAs, is superior for gene network construction, detects alternative spliced transcripts, detects allele specific expression and can be used to extract genotype information, e.g. nonsynonymous coding single nucleotide polymorphisms. Examples of where RNA-Seq has been used to assess brain gene expression are provided. Despite the advantages of RNA-Seq, some disadvantages remain. These include the high cost of RNA-Seq and the computational complexities associated with data analysis. RNA-Seq embraces the complexity of the transcriptome and provides a mechanism to understand the underlying regulatory code; the potential to inform the brain-behavior relationship is substantial.
Collapse
Affiliation(s)
- R Hitzemann
- Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR 97239-3098, USA.
| | | | | | | | | | | | | | | |
Collapse
|
13
|
van Eijk KR, de Jong S, Boks MPM, Langeveld T, Colas F, Veldink JH, de Kovel CGF, Janson E, Strengman E, Langfelder P, Kahn RS, van den Berg LH, Horvath S, Ophoff RA. Genetic analysis of DNA methylation and gene expression levels in whole blood of healthy human subjects. BMC Genomics 2012; 13:636. [PMID: 23157493 PMCID: PMC3583143 DOI: 10.1186/1471-2164-13-636] [Citation(s) in RCA: 161] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2012] [Accepted: 10/30/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The predominant model for regulation of gene expression through DNA methylation is an inverse association in which increased methylation results in decreased gene expression levels. However, recent studies suggest that the relationship between genetic variation, DNA methylation and expression is more complex. RESULTS Systems genetic approaches for examining relationships between gene expression and methylation array data were used to find both negative and positive associations between these levels. A weighted correlation network analysis revealed that i) both transcriptome and methylome are organized in modules, ii) co-expression modules are generally not preserved in the methylation data and vice-versa, and iii) highly significant correlations exist between co-expression and co-methylation modules, suggesting the existence of factors that affect expression and methylation of different modules (i.e., trans effects at the level of modules). We observed that methylation probes associated with expression in cis were more likely to be located outside CpG islands, whereas specificity for CpG island shores was present when methylation, associated with expression, was under local genetic control. A structural equation model based analysis found strong support in particular for a traditional causal model in which gene expression is regulated by genetic variation via DNA methylation instead of gene expression affecting DNA methylation levels. CONCLUSIONS Our results provide new insights into the complex mechanisms between genetic markers, epigenetic mechanisms and gene expression. We find strong support for the classical model of genetic variants regulating methylation, which in turn regulates gene expression. Moreover we show that, although the methylation and expression modules differ, they are highly correlated.
Collapse
Affiliation(s)
- Kristel R van Eijk
- Department of Medical Genetics, University Medical Center Utrecht, Utrecht 3584, CG, The Netherlands
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Murine gut microbiota is defined by host genetics and modulates variation of metabolic traits. PLoS One 2012; 7:e39191. [PMID: 22723961 PMCID: PMC3377628 DOI: 10.1371/journal.pone.0039191] [Citation(s) in RCA: 168] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2011] [Accepted: 05/16/2012] [Indexed: 12/16/2022] Open
Abstract
The gastrointestinal tract harbors a complex and diverse microbiota that has an important role in host metabolism. Microbial diversity is influenced by a combination of environmental and host genetic factors and is associated with several polygenic diseases. In this study we combined next-generation sequencing, genetic mapping, and a set of physiological traits of the BXD mouse population to explore genetic factors that explain differences in gut microbiota and its impact on metabolic traits. Molecular profiling of the gut microbiota revealed important quantitative differences in microbial composition among BXD strains. These differences in gut microbial composition are influenced by host-genetics, which is complex and involves many loci. Linkage analysis defined Quantitative Trait Loci (QTLs) restricted to a particular taxon, branch or that influenced the variation of taxa across phyla. Gene expression within the gastrointestinal tract and sequence analysis of the parental genomes in the QTL regions uncovered candidate genes with potential to alter gut immunological profiles and impact the balance between gut microbial communities. A QTL region on Chr 4 that overlaps several interferon genes modulates the population of Bacteroides, and potentially Bacteroidetes and Firmicutes–the predominant BXD gut phyla. Irak4, a signaling molecule in the Toll-like receptor pathways is a candidate for the QTL on Chr15 that modulates Rikenellaceae, whereas Tgfb3, a cytokine modulating the barrier function of the intestine and tolerance to commensal bacteria, overlaps a QTL on Chr 12 that influence Prevotellaceae. Relationships between gut microflora, morphological and metabolic traits were uncovered, some potentially a result of common genetic sources of variation.
Collapse
|
15
|
Mozhui K, Lu L, Armstrong WE, Williams RW. Sex-specific modulation of gene expression networks in murine hypothalamus. Front Neurosci 2012; 6:63. [PMID: 22593731 PMCID: PMC3350311 DOI: 10.3389/fnins.2012.00063] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Accepted: 04/10/2012] [Indexed: 11/30/2022] Open
Abstract
The hypothalamus contains nuclei and cell populations that are critical in reproduction and that differ significantly between the sexes in structure and function. To examine the molecular and genetic basis for these differences, we quantified gene expression in the hypothalamus of 39 pairs of adult male and female mice belonging to the BXD strains. This experimental design enabled us to define hypothalamic gene coexpression networks and provided robust estimates of absolute expression differences. As expected, sex has the strongest effect on the expression of genes on the X and Y chromosomes (e.g., Uty, Xist, Kdm6a). Transcripts associated with the endocrine system and neuropeptide signaling also differ significantly. Sex-differentiated transcripts often have well delimited expression within specific hypothalamic nuclei that have roles in reproduction. For instance, the estrogen receptor (Esr1) and neurokinin B (Tac2) genes have intense expression in the medial preoptic and arcuate nuclei and comparatively high expression in females. Despite the strong effect of sex on single transcripts, the global pattern of covariance among transcripts is well preserved, and consequently, males and females have well matched coexpression modules. However, there are sex-specific hub genes in functionally equivalent modules. For example, only in males is the Y-linked gene, Uty, a highly connected transcript in a network that regulates chromatin modification and gene transcription. In females, the X chromosome paralog, Kdm6a, takes the place of Uty in the same network. We also find significant effect of sex on genetic regulation and the same network in males and females can be associated with markedly different regulatory loci. With the exception of a few sex-specific modules, our analysis reveals a system in which sets of functionally related transcripts are organized into stable sex-independent networks that are controlled at a higher level by sex-specific modulators.
Collapse
Affiliation(s)
- Khyobeni Mozhui
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center Memphis, TN, USA
| | | | | | | |
Collapse
|
16
|
Wang X, Mozhui K, Li Z, Mulligan MK, Ingels JF, Zhou X, Hori RT, Chen H, Cook MN, Williams RW, Lu L. A promoter polymorphism in the Per3 gene is associated with alcohol and stress response. Transl Psychiatry 2012; 2:e73. [PMID: 22832735 PMCID: PMC3309544 DOI: 10.1038/tp.2011.71] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
The period homolog genes Per1, Per2 and Per3 are important components of the circadian clock system. In addition to their role in maintaining circadian rhythm, these genes have been linked to mood disorders, stress response and vulnerability to addiction and alcoholism. In this study, we combined high-resolution sequence analysis and quantitative trait locus (QTL) mapping of gene expression and behavioral traits to identify Per3 as a compelling candidate for the interaction between circadian rhythm, alcohol and stress response. In the BXD family of mouse strains, sequence variants in Per3 have marked effects on steady-state mRNA and protein levels. As a result, the transcript maps as a cis-acting expression QTL (eQTL). We found that an insertion/deletion (indel) variant in a putative stress response element in the promoter region of Per3 causes local control of transcript abundance. This indel results in differences in protein binding affinities between the two alleles through the Nrf2 transcriptional activator. Variation in Per3 is also associated with downstream differences in the expression of genes involved in circadian rhythm, alcohol, stress response and schizophrenia. We found that the Per3 locus is linked to stress/anxiety traits, and that the basal expression of Per3 is also correlated with several anxiety and addiction-related phenotypes. Treatment with alcohol results in increased expression of Per3 in the hippocampus, and this effect interacts with acute restraint stress. Our data provide strong evidence that variation in the Per3 transcript is causally associated with and also responsive to stress and alcohol.
Collapse
Affiliation(s)
- X Wang
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - K Mozhui
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Z Li
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - M K Mulligan
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - J F Ingels
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - X Zhou
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - R T Hori
- Department of Microbiology, Immunology and Biochemistry, University of Tennessee Health Science Center, Memphis, TN, USA
| | - H Chen
- Department of Pharmacology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - M N Cook
- Department of Psychology, University of Memphis, Memphis, TN, USA
| | - R W Williams
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - L Lu
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN, USA,Jiangsu Ley Laboratory of Neuroregeneration, Nantong University, Nantong, China,Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, 855 Monroe Avenue, Memphis, TN 38163, USA. E-mail:
| |
Collapse
|
17
|
Fehrmann RSN, Jansen RC, Veldink JH, Westra HJ, Arends D, Bonder MJ, Fu J, Deelen P, Groen HJM, Smolonska A, Weersma RK, Hofstra RMW, Buurman WA, Rensen S, Wolfs MGM, Platteel M, Zhernakova A, Elbers CC, Festen EM, Trynka G, Hofker MH, Saris CGJ, Ophoff RA, van den Berg LH, van Heel DA, Wijmenga C, te Meerman GJ, Franke L. Trans-eQTLs reveal that independent genetic variants associated with a complex phenotype converge on intermediate genes, with a major role for the HLA. PLoS Genet 2011; 7:e1002197. [PMID: 21829388 PMCID: PMC3150446 DOI: 10.1371/journal.pgen.1002197] [Citation(s) in RCA: 268] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2011] [Accepted: 06/06/2011] [Indexed: 12/19/2022] Open
Abstract
For many complex traits, genetic variants have been found associated. However, it is still mostly unclear through which downstream mechanism these variants cause these phenotypes. Knowledge of these intermediate steps is crucial to understand pathogenesis, while also providing leads for potential pharmacological intervention. Here we relied upon natural human genetic variation to identify effects of these variants on trans-gene expression (expression quantitative trait locus mapping, eQTL) in whole peripheral blood from 1,469 unrelated individuals. We looked at 1,167 published trait- or disease-associated SNPs and observed trans-eQTL effects on 113 different genes, of which we replicated 46 in monocytes of 1,490 different individuals and 18 in a smaller dataset that comprised subcutaneous adipose, visceral adipose, liver tissue, and muscle tissue. HLA single-nucleotide polymorphisms (SNPs) were 10-fold enriched for trans-eQTLs: 48% of the trans-acting SNPs map within the HLA, including ulcerative colitis susceptibility variants that affect plausible candidate genes AOAH and TRBV18 in trans. We identified 18 pairs of unlinked SNPs associated with the same phenotype and affecting expression of the same trans-gene (21 times more than expected, P<10−16). This was particularly pronounced for mean platelet volume (MPV): Two independent SNPs significantly affect the well-known blood coagulation genes GP9 and F13A1 but also C19orf33, SAMD14, VCL, and GNG11. Several of these SNPs have a substantially higher effect on the downstream trans-genes than on the eventual phenotypes, supporting the concept that the effects of these SNPs on expression seems to be much less multifactorial. Therefore, these trans-eQTLs could well represent some of the intermediate genes that connect genetic variants with their eventual complex phenotypic outcomes. Many genetic variants have been found associated with diseases. However, for many of these genetic variants, it remains unclear how they exert their effect on the eventual phenotype. We investigated genetic variants that are known to be associated with diseases and complex phenotypes and assessed whether these variants were also associated with gene expression levels in a set of 1,469 unrelated whole blood samples. For several diseases, such as type 1 diabetes and ulcerative colitis, we observed that genetic variants affect the expression of genes, not implicated before. For complex traits, such as mean platelet volume and mean corpuscular volume, we observed that independent genetic variants on different chromosomes influence the expression of exactly the same genes. For mean platelet volume, these genes include well-known blood coagulation genes but also genes with still unknown functions. These results indicate that, by systematically correlating genetic variation with gene expression levels, it is possible to identify downstream genes, which provide important avenues for further research.
Collapse
Affiliation(s)
- Rudolf S. N. Fehrmann
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Ritsert C. Jansen
- Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
| | - Jan H. Veldink
- Department of Neurology, Rudolf Magnus Institute of Neuroscience, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Harm-Jan Westra
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Danny Arends
- Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
| | - Marc Jan Bonder
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Jingyuan Fu
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Patrick Deelen
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Harry J. M. Groen
- Department of Pulmonology, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Asia Smolonska
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Rinse K. Weersma
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
- Department of Gastroenterology and Hepatology, University Medical Centre Groningen and University of Groningen, Groningen, The Netherlands
| | - Robert M. W. Hofstra
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Wim A. Buurman
- NUTRIM School for Nutrition, Toxicology, and Metabolism, Department of General Surgery, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Sander Rensen
- NUTRIM School for Nutrition, Toxicology, and Metabolism, Department of General Surgery, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Marcel G. M. Wolfs
- Department of Pathology and Medical Biology, Medical Biology Section, Molecular Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Mathieu Platteel
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Alexandra Zhernakova
- Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands
| | - Clara C. Elbers
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Eleanora M. Festen
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Gosia Trynka
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Marten H. Hofker
- Department of Pathology and Medical Biology, Medical Biology Section, Molecular Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Christiaan G. J. Saris
- Department of Neurology, Rudolf Magnus Institute of Neuroscience, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Roel A. Ophoff
- Department of Neurology, Rudolf Magnus Institute of Neuroscience, University Medical Centre Utrecht, Utrecht, The Netherlands
- Department of Medical Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
- Center for Neurobehavioral Genetics, University of California Los Angeles, Los Angeles, California, United States of America
| | - Leonard H. van den Berg
- Department of Neurology, Rudolf Magnus Institute of Neuroscience, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - David A. van Heel
- Blizard Institute of Cell and Molecular Science, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
| | - Cisca Wijmenga
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Gerard J. te Meerman
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
| | - Lude Franke
- Department of Genetics, University Medical Center Groningen and University of Groningen, Groningen, The Netherlands
- Blizard Institute of Cell and Molecular Science, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
- * E-mail:
| |
Collapse
|
18
|
Genetic regulation of Nrxn1 [corrected] expression: an integrative cross-species analysis of schizophrenia candidate genes. Transl Psychiatry 2011; 1:e25. [PMID: 22832527 PMCID: PMC3309521 DOI: 10.1038/tp.2011.24] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Neurexin 1 (NRXN1) is a large presynaptic transmembrane protein that has complex and variable patterns of expression in the brain. Sequence variants in NRXN1 are associated with differences in cognition, and with schizophrenia and autism. The murine Nrxn1 gene is also highly polymorphic and is associated with significant variation in expression that is under strong genetic control. Here, we use co-expression analysis, high coverage genomic sequence, and expression quantitative trait locus (eQTL) mapping to study the regulation of this gene in the brain. We profiled a family of 72 isogenic progeny strains of a cross between C57BL/6J and DBA/2J (the BXD family) using exon arrays and massively parallel RNA sequencing. Expression of most Nrxn1 exons have high genetic correlation (r>0.6) because of the segregation of a common trans eQTL on chromosome (Chr) 8 and a common cis eQTL on Chr 17. These two loci are also linked to murine phenotypes relevant to schizophrenia and to a novel human schizophrenia candidate gene with high neuronal expression (Pleckstrin and Sec7 domain containing 3). In both human and mice, NRXN1 is co-expressed with numerous synaptic and cell signaling genes, and known schizophrenia candidates. Cross-species co-expression and protein interaction network analyses identified glycogen synthase kinase 3 beta (GSK3B) as one of the most consistent and conserved covariates of NRXN1. By using the Molecular Genetics of Schizophrenia data set, we were able to test and confirm that markers in NRXN1 and GSK3B have epistatic interactions in human populations that can jointly modulate risk of schizophrenia.
Collapse
|
19
|
Innocenti F, Cooper GM, Stanaway IB, Gamazon ER, Smith JD, Mirkov S, Ramirez J, Liu W, Lin YS, Moloney C, Aldred SF, Trinklein ND, Schuetz E, Nickerson DA, Thummel KE, Rieder MJ, Rettie AE, Ratain MJ, Cox NJ, Brown CD. Identification, replication, and functional fine-mapping of expression quantitative trait loci in primary human liver tissue. PLoS Genet 2011; 7:e1002078. [PMID: 21637794 PMCID: PMC3102751 DOI: 10.1371/journal.pgen.1002078] [Citation(s) in RCA: 173] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2010] [Accepted: 03/28/2011] [Indexed: 01/10/2023] Open
Abstract
The discovery of expression quantitative trait loci ("eQTLs") can help to unravel genetic contributions to complex traits. We identified genetic determinants of human liver gene expression variation using two independent collections of primary tissue profiled with Agilent (n = 206) and Illumina (n = 60) expression arrays and Illumina SNP genotyping (550K), and we also incorporated data from a published study (n = 266). We found that ∼30% of SNP-expression correlations in one study failed to replicate in either of the others, even at thresholds yielding high reproducibility in simulations, and we quantified numerous factors affecting reproducibility. Our data suggest that drug exposure, clinical descriptors, and unknown factors associated with tissue ascertainment and analysis have substantial effects on gene expression and that controlling for hidden confounding variables significantly increases replication rate. Furthermore, we found that reproducible eQTL SNPs were heavily enriched near gene starts and ends, and subsequently resequenced the promoters and 3'UTRs for 14 genes and tested the identified haplotypes using luciferase assays. For three genes, significant haplotype-specific in vitro functional differences correlated directly with expression levels, suggesting that many bona fide eQTLs result from functional variants that can be mechanistically isolated in a high-throughput fashion. Finally, given our study design, we were able to discover and validate hundreds of liver eQTLs. Many of these relate directly to complex traits for which liver-specific analyses are likely to be relevant, and we identified dozens of potential connections with disease-associated loci. These included previously characterized eQTL contributors to diabetes, drug response, and lipid levels, and they suggest novel candidates such as a role for NOD2 expression in leprosy risk and C2orf43 in prostate cancer. In general, the work presented here will be valuable for future efforts to precisely identify and functionally characterize genetic contributions to a variety of complex traits.
Collapse
Affiliation(s)
- Federico Innocenti
- Cancer Research Center, Committee on Clinical
Pharmacology and Pharmacogenomics, Department of Medicine, The University of
Chicago, Chicago, Illinois, United States of America
| | - Gregory M. Cooper
- Department of Genome Sciences, University of
Washington, Seattle, Washington, United States of America
| | - Ian B. Stanaway
- Department of Genome Sciences, University of
Washington, Seattle, Washington, United States of America
| | - Eric R. Gamazon
- Section of Genetic Medicine, Department of
Medicine, The University of Chicago, Chicago, Illinois, United States of
America
| | - Joshua D. Smith
- Department of Genome Sciences, University of
Washington, Seattle, Washington, United States of America
| | - Snezana Mirkov
- Section of Hematology/Oncology, Department of
Medicine, The University of Chicago, Chicago, Illinois, United States of
America
| | - Jacqueline Ramirez
- Section of Hematology/Oncology, Department of
Medicine, The University of Chicago, Chicago, Illinois, United States of
America
| | - Wanqing Liu
- Section of Hematology/Oncology, Department of
Medicine, The University of Chicago, Chicago, Illinois, United States of
America
| | - Yvonne S. Lin
- Department of Medicinal Chemistry, School of
Pharmacy, University of Washington, Seattle, Washington, United States of
America
- Department of Pharmaceutics, University of
Washington, Seattle, Washington, United States of America
| | - Cliona Moloney
- Merck Research Laboratories, Boston,
Massachusetts, United States of America
| | | | | | - Erin Schuetz
- Department of Pharmaceutical Sciences, St.
Jude Children's Research Hospital, Memphis, Tennessee, United States of
America
| | - Deborah A. Nickerson
- Department of Genome Sciences, University of
Washington, Seattle, Washington, United States of America
| | - Ken E. Thummel
- Department of Medicinal Chemistry, School of
Pharmacy, University of Washington, Seattle, Washington, United States of
America
- Department of Pharmaceutics, University of
Washington, Seattle, Washington, United States of America
| | - Mark J. Rieder
- Department of Genome Sciences, University of
Washington, Seattle, Washington, United States of America
| | - Allan E. Rettie
- Department of Medicinal Chemistry, School of
Pharmacy, University of Washington, Seattle, Washington, United States of
America
| | - Mark J. Ratain
- Cancer Research Center, Committee on Clinical
Pharmacology and Pharmacogenomics, Department of Medicine, The University of
Chicago, Chicago, Illinois, United States of America
| | - Nancy J. Cox
- Section of Genetic Medicine, Department of
Medicine, The University of Chicago, Chicago, Illinois, United States of
America
| | - Christopher D. Brown
- Institute for Genomics and Systems Biology,
The University of Chicago and Argonne National Laboratory, Chicago, Illinois,
United States of America
- Departments of Human Genetics and Ecology
and Evolution, The University of Chicago, Chicago, Illinois, United States of
America
| |
Collapse
|
20
|
Suwanwela J, Farber CR, Haung BL, Song B, Pan C, Lyons KM, Lusis AJ. Systems genetics analysis of mouse chondrocyte differentiation. J Bone Miner Res 2011; 26:747-60. [PMID: 20954177 PMCID: PMC3179327 DOI: 10.1002/jbmr.271] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
One of the goals of systems genetics is the reconstruction of gene networks that underlie key processes in development and disease. To identify cartilage gene networks that play an important role in bone development, we used a systems genetics approach that integrated microarray gene expression profiles from cartilage and bone phenotypic data from two sets of recombinant inbred strains. Microarray profiles generated from isolated chondrocytes were used to generate weighted gene coexpression networks. This analysis resulted in the identification of subnetworks (modules) of coexpressed genes that then were examined for relationships with bone geometry and density. One module exhibited significant correlation with femur length (r = 0.416), anteroposterior diameter (r = 0.418), mediolateral diameter (r = 0.576), and bone mineral density (r = 0.475). Highly connected genes (n = 28) from this and other modules were tested in vitro using prechondrocyte ATDC5 cells and RNA interference. Five of the 28 genes were found to play a role in chondrocyte differentiation. Two of these, Hspd1 and Cdkn1a, were known previously to function in chondrocyte development, whereas the other three, Bhlhb9, Cugbp1, and Spcs3, are novel genes. Our integrative analysis provided a systems-level view of cartilage development and identified genes that may be involved in bone development.
Collapse
Affiliation(s)
- Jaijam Suwanwela
- Department of Oral Biology, School of Dentistry, UCLA, Los Angeles, CA 90095, USA.
| | | | | | | | | | | | | |
Collapse
|
21
|
Bottomly D, Walter NAR, Hunter JE, Darakjian P, Kawane S, Buck KJ, Searles RP, Mooney M, McWeeney SK, Hitzemann R. Evaluating gene expression in C57BL/6J and DBA/2J mouse striatum using RNA-Seq and microarrays. PLoS One 2011; 6:e17820. [PMID: 21455293 PMCID: PMC3063777 DOI: 10.1371/journal.pone.0017820] [Citation(s) in RCA: 177] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2010] [Accepted: 02/10/2011] [Indexed: 12/14/2022] Open
Abstract
C57BL/6J (B6) and DBA/2J (D2) are two of the most commonly used inbred mouse strains in neuroscience research. However, the only currently available mouse genome is based entirely on the B6 strain sequence. Subsequently, oligonucleotide microarray probes are based solely on this B6 reference sequence, making their application for gene expression profiling comparisons across mouse strains dubious due to their allelic sequence differences, including single nucleotide polymorphisms (SNPs). The emergence of next-generation sequencing (NGS) and the RNA-Seq application provides a clear alternative to oligonucleotide arrays for detecting differential gene expression without the problems inherent to hybridization-based technologies. Using RNA-Seq, an average of 22 million short sequencing reads were generated per sample for 21 samples (10 B6 and 11 D2), and these reads were aligned to the mouse reference genome, allowing 16,183 Ensembl genes to be queried in striatum for both strains. To determine differential expression, ‘digital mRNA counting’ is applied based on reads that map to exons. The current study compares RNA-Seq (Illumina GA IIx) with two microarray platforms (Illumina MouseRef-8 v2.0 and Affymetrix MOE 430 2.0) to detect differential striatal gene expression between the B6 and D2 inbred mouse strains. We show that by using stringent data processing requirements differential expression as determined by RNA-Seq is concordant with both the Affymetrix and Illumina platforms in more instances than it is concordant with only a single platform, and that instances of discordance with respect to direction of fold change were rare. Finally, we show that additional information is gained from RNA-Seq compared to hybridization-based techniques as RNA-Seq detects more genes than either microarray platform. The majority of genes differentially expressed in RNA-Seq were only detected as present in RNA-Seq, which is important for studies with smaller effect sizes where the sensitivity of hybridization-based techniques could bias interpretation.
Collapse
Affiliation(s)
- Daniel Bottomly
- Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, Oregon, United States of America.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Genomic loci and candidate genes underlying inflammatory nociception. Pain 2010; 152:599-606. [PMID: 21195549 DOI: 10.1016/j.pain.2010.11.029] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2010] [Revised: 11/16/2010] [Accepted: 11/29/2010] [Indexed: 11/20/2022]
Abstract
Heritable genetic factors contribute significantly to inflammatory nociception. To determine candidate genes underlying inflammatory nociception, the current study used a mouse model of abdominal inflammatory pain. BXD recombinant inbred (RI) mouse strains were administered the intraperitoneal acetic acid test, and genome-wide quantitative trait locus (QTL) mapping was performed on the mean number of abdominal contraction and extension movements in 3 distinct groups of BXD RI mouse strains in 2 separate experiments. Combined mapping results detected 2 QTLs on chromosomes (Chr) 3 and 10 across experiments and groups of mice; an additional sex-specific QTL was detected on Chr 16. The results replicate previous findings of a significant QTL, Nociq2, on distal Chr 10 for formalin-induced inflammatory nociception and will aid in identification of the underlying candidate genes. Comparisons of sensitivity to intraperitoneal acetic acid in BXD RI mouse strains with microarray mRNA transcript expression profiles in specific brain areas detected covarying expression of candidate genes that are also found in the detected QTL confidence intervals. The results indicate that common and distinct genetic mechanisms underlie heritable sensitivity to diverse inflammatory insults, and provide a discrete set of high-priority candidate genes to investigate further in rodents and human association studies. Novel genomic regions linked to inflammatory nociception were detected, a previously reported locus was confirmed, and high-priority candidate genes for inflammatory nociception and pain were identified.
Collapse
|
23
|
Loguercio S, Overall RW, Michaelson JJ, Wiltshire T, Pletcher MT, Miller BH, Walker JR, Kempermann G, Su AI, Beyer A. Integrative analysis of low- and high-resolution eQTL. PLoS One 2010; 5:e13920. [PMID: 21085707 PMCID: PMC2978079 DOI: 10.1371/journal.pone.0013920] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2010] [Accepted: 10/17/2010] [Indexed: 11/18/2022] Open
Abstract
The study of expression quantitative trait loci (eQTL) is a powerful way of detecting transcriptional regulators at a genomic scale and for elucidating how natural genetic variation impacts gene expression. Power and genetic resolution are heavily affected by the study population: whereas recombinant inbred (RI) strains yield greater statistical power with low genetic resolution, using diverse inbred or outbred strains improves genetic resolution at the cost of lower power. In order to overcome the limitations of both individual approaches, we combine data from RI strains with genetically more diverse strains and analyze hippocampus eQTL data obtained from mouse RI strains (BXD) and from a panel of diverse inbred strains (Mouse Diversity Panel, MDP). We perform a systematic analysis of the consistency of eQTL independently obtained from these two populations and demonstrate that a significant fraction of eQTL can be replicated. Based on existing knowledge from pathway databases we assess different approaches for using the high-resolution MDP data for fine mapping BXD eQTL. Finally, we apply this framework to an eQTL hotspot on chromosome 1 (Qrr1), which has been implicated in a range of neurological traits. Here we present the first systematic examination of the consistency between eQTL obtained independently from the BXD and MDP populations. Our analysis of fine-mapping approaches is based on 'real life' data as opposed to simulated data and it allows us to propose a strategy for using MDP data to fine map BXD eQTL. Application of this framework to Qrr1 reveals that this eQTL hotspot is not caused by just one (or few) 'master regulators', but actually by a set of polymorphic genes specific to the central nervous system.
Collapse
Affiliation(s)
| | - Rupert W. Overall
- Center for Regenerative Therapies Dresden (CRTD), Technische Universität Dresden, Dresden, Germany
| | | | - Tim Wiltshire
- Division of Pharmacotherapy and Experimental Therapeutics, University of North Carolina School of Pharmacy, Chapel Hill, North Carolina, United States of America
| | - Mathew T. Pletcher
- Compound Safety Prediction, Pfizer Global Research and Development, Groton, Connecticut, United States of America
| | - Brooke H. Miller
- Department of Neuroscience, The Scripps Research Institute, Scripps Florida, Jupiter, Florida, United States of America
| | - John R. Walker
- Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
| | - Gerd Kempermann
- Center for Regenerative Therapies Dresden (CRTD), Technische Universität Dresden, Dresden, Germany
| | - Andrew I. Su
- Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
| | - Andreas Beyer
- Biotechnology Center, Technische Universität Dresden, Dresden, Germany
- Center for Regenerative Therapies Dresden (CRTD), Technische Universität Dresden, Dresden, Germany
- * E-mail:
| |
Collapse
|
24
|
Gatti DM, Zhao N, Chesler EJ, Bradford BU, Shabalin AA, Yordanova R, Lu L, Rusyn I. Sex-specific gene expression in the BXD mouse liver. Physiol Genomics 2010; 42:456-68. [PMID: 20551147 PMCID: PMC2929887 DOI: 10.1152/physiolgenomics.00110.2009] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Accepted: 06/08/2010] [Indexed: 11/22/2022] Open
Abstract
Differences in clinical phenotypes between the sexes are well documented and have their roots in differential gene expression. While sex has a major effect on gene expression, transcription is also influenced by complex interactions between individual genetic variation and environmental stimuli. In this study, we sought to understand how genetic variation affects sex-related differences in liver gene expression by performing genetic mapping of genomewide liver mRNA expression data in a genetically defined population of naive male and female mice from C57BL/6J, DBA/2J, B6D2F1, and 37 C57BL/6J x DBA/2J (BXD) recombinant inbred strains. As expected, we found that many genes important to xenobiotic metabolism and other important pathways exhibit sexually dimorphic expression. We also performed gene expression quantitative trait locus mapping in this panel and report that the most significant loci that appear to regulate a larger number of genes than expected by chance are largely sex independent. Importantly, we found that the degree of correlation within gene expression networks differs substantially between the sexes. Finally, we compare our results to a recently released human liver gene expression data set and report on important similarities in sexually dimorphic liver gene expression between mouse and human. This study enhances our understanding of sex differences at the genome level and between species, as well as increasing our knowledge of the molecular underpinnings of sex differences in responses to xenobiotics.
Collapse
Affiliation(s)
- Daniel M Gatti
- Department of Environmental Sciences and Engineering, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Abstract
By treating the transcript abundance as a quantitative trait, gene expression can be mapped to local or distant genomic regions relative to the gene encoding the transcript. Local expression quantitative trait loci (eQTL) generally act in cis (that is, control the expression of only the contiguous structural gene), whereas distal eQTL act in trans. Distal eQTL are more difficult to identify with certainty due to the fact that significant thresholds are very high since all regions of the genome must be tested, and confounding factors such as batch effects can produce false positives. Here, we compare findings from two large genetic crosses between mouse strains C3H/HeJ and C57BL/6J to evaluate the reliability of distal eQTL detection, including "hotspots" influencing the expression of multiple genes in trans. We found that >63% of local eQTL and >18% of distal eQTL were replicable at a threshold of LOD > 4.3 between crosses and 76% of local and >24% of distal eQTL at a threshold of LOD > 6. Additionally, at LOD > 4.3 four tissues studied (adipose, brain, liver, and muscle) exhibited >50% preservation of local eQTL and >17% preservation of distal eQTL. We observed replicated distal eQTL hotspots between the crosses on chromosomes 9 and 17. Finally, >69% of local eQTL and >10% of distal eQTL were preserved in most tissues between sexes. We conclude that most local eQTL are highly replicable between mouse crosses, tissues, and sex as compared to distal eQTL, which exhibited modest replicability.
Collapse
|
26
|
Petretto E, Bottolo L, Langley SR, Heinig M, McDermott-Roe C, Sarwar R, Pravenec M, Hübner N, Aitman TJ, Cook SA, Richardson S. New insights into the genetic control of gene expression using a Bayesian multi-tissue approach. PLoS Comput Biol 2010; 6:e1000737. [PMID: 20386736 PMCID: PMC2851562 DOI: 10.1371/journal.pcbi.1000737] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2009] [Accepted: 03/03/2010] [Indexed: 01/29/2023] Open
Abstract
The majority of expression quantitative trait locus (eQTL) studies have been carried out in single tissues or cell types, using methods that ignore information shared across tissues. Although global analysis of RNA expression in multiple tissues is now feasible, few integrated statistical frameworks for joint analysis of gene expression across tissues combined with simultaneous analysis of multiple genetic variants have been developed to date. Here, we propose Sparse Bayesian Regression models for mapping eQTLs within individual tissues and simultaneously across tissues. Testing these on a set of 2,000 genes in four tissues, we demonstrate that our methods are more powerful than traditional approaches in revealing the true complexity of the eQTL landscape at the systems-level. Highlighting the power of our method, we identified a two-eQTL model (cis/trans) for the Hopx gene that was experimentally validated and was not detected by conventional approaches. We showed common genetic regulation of gene expression across four tissues for ∼27% of transcripts, providing >5 fold increase in eQTLs detection when compared with single tissue analyses at 5% FDR level. These findings provide a new opportunity to uncover complex genetic regulatory mechanisms controlling global gene expression while the generality of our modelling approach makes it adaptable to other model systems and humans, with broad application to analysis of multiple intermediate and whole-body phenotypes. Integrated analysis of genome-wide genetic polymorphisms and gene expression profiles from different tissues or cell types has been highly successful in identifying genes modulating complex phenotypes in animal models and humans. However, an important limitation of the current approaches consists in their sole application to individual tissues, thus ignoring information shared across different tissues. To uncover complex genetic regulatory mechanisms controlling gene expression at the whole organism's level, it is essential to develop appropriate analytical methods for the analysis of genome-wide genetic polymorphisms and gene expression profiles simultaneously in multiple tissues. This paper presents a novel, fully integrated Bayesian approach for mapping the genetic components of gene expression within and across multiple tissues. In addition to increased power and enhanced mapping resolution when compared with traditional approaches, our model directly provides information on potential systemic effects on transcriptional profiles and co-existing local (cis) and distant (trans) genetic control of gene expression. We also discuss the possibility to extend our approach for the analysis of different phenotypes, and other study designs, thus providing an integrated computational tool to explore the genetic control underlying transcriptional regulation at the systems-level, beyond the single tissue resolution.
Collapse
Affiliation(s)
- Enrico Petretto
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College London, London, United Kingdom
- Department of Epidemiology and Biostatistics, Faculty of Medicine, Imperial College, London, United Kingdom
| | - Leonardo Bottolo
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College London, London, United Kingdom
- Department of Epidemiology and Biostatistics, Faculty of Medicine, Imperial College, London, United Kingdom
| | - Sarah R. Langley
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College London, London, United Kingdom
| | | | - Chris McDermott-Roe
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Rizwan Sarwar
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College London, London, United Kingdom
| | - Michal Pravenec
- Institute of Physiology, Czech Academy of Sciences and Centre for Applied Genomics, Prague, Czech Republic
- Charles University in Prague, Institute of Biology and Medical Genetics of the First Faculty of Medicine and General Teaching Hospital, Prague, Czech Republic
| | - Norbert Hübner
- Max-Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Timothy J. Aitman
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College London, London, United Kingdom
- Section of Molecular Genetics and Rheumatology, Division and Faculty of Medicine, Imperial College, London, United Kingdom
| | - Stuart A. Cook
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College London, London, United Kingdom
- National Heart and Lung Institute, Imperial College, London, United Kingdom
| | - Sylvia Richardson
- Department of Epidemiology and Biostatistics, Faculty of Medicine, Imperial College, London, United Kingdom
- * E-mail:
| |
Collapse
|
27
|
Philip VM, Duvvuru S, Gomero B, Ansah TA, Blaha CD, Cook MN, Hamre KM, Lariviere WR, Matthews DB, Mittleman G, Goldowitz D, Chesler EJ. High-throughput behavioral phenotyping in the expanded panel of BXD recombinant inbred strains. GENES, BRAIN, AND BEHAVIOR 2010; 9:129-59. [PMID: 19958391 PMCID: PMC2855868 DOI: 10.1111/j.1601-183x.2009.00540.x] [Citation(s) in RCA: 143] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/02/2008] [Revised: 08/14/2009] [Accepted: 09/09/2009] [Indexed: 01/10/2023]
Abstract
Genetic reference populations, particularly the BXD recombinant inbred (BXD RI) strains derived from C57BL/6J and DBA/2J mice, are a valuable resource for the discovery of the bio-molecular substrates and genetic drivers responsible for trait variation and covariation. This approach can be profitably applied in the analysis of susceptibility and mechanisms of drug and alcohol use disorders for which many predisposing behaviors may predict the occurrence and manifestation of increased preference for these substances. Many of these traits are modeled by common mouse behavioral assays, facilitating the detection of patterns and sources of genetic coregulation of predisposing phenotypes and substance consumption. Members of the Tennessee Mouse Genome Consortium (TMGC) have obtained phenotype data from over 250 measures related to multiple behavioral assays across several batteries: response to, and withdrawal from cocaine, 3,4-methylenedioxymethamphetamine; "ecstasy" (MDMA), morphine and alcohol; novelty seeking; behavioral despair and related neurological phenomena; pain sensitivity; stress sensitivity; anxiety; hyperactivity and sleep/wake cycles. All traits have been measured in both sexes in approximately 70 strains of the recently expanded panel of BXD RI strains. Sex differences and heritability estimates were obtained for each trait, and a comparison of early (N = 32) and recent (N = 37) BXD RI lines was performed. Primary data are publicly available for heritability, sex difference and genetic analyses using the MouseTrack database, and are also available in GeneNetwork.org for quantitative trait locus (QTL) detection and genetic analysis of gene expression. Together with the results of related studies, these data form a public resource for integrative systems genetic analysis of neurobehavioral traits.
Collapse
Affiliation(s)
- V M Philip
- Systems Genetics Group, Biosciences Division, Oak Ridge National LaboratoryOak Ridge TN
| | - S Duvvuru
- Systems Genetics Group, Biosciences Division, Oak Ridge National LaboratoryOak Ridge TN
| | - B Gomero
- Systems Genetics Group, Biosciences Division, Oak Ridge National LaboratoryOak Ridge TN
| | - T A Ansah
- Department of Neurobiology and Neurotoxicology, Meharry Medical CollegeNashville, TN
| | - C D Blaha
- Department of Psychology, The University of MemphisMemphis, TN
| | - M N Cook
- Department of Psychology, The University of MemphisMemphis, TN
| | - K M Hamre
- Departments of Anatomy and Neurobiology, University of Tennessee Health Science CenterMemphis, TN
| | - W R Lariviere
- Departments of Anesthesiology and Neurobiology, University of Pittsburgh School of MedicinePittsburgh, PA
| | - D B Matthews
- Departments of Psychology and Neuroscience, Baylor UniversityWaco, TX, USA
- Present address: Department of Psychology, Nanyang Technological UniversitySingapore
| | - G Mittleman
- Department of Psychology, The University of MemphisMemphis, TN
| | - D Goldowitz
- Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British ColumbiaVancouver, BC, Canada
| | - E J Chesler
- Systems Genetics Group, Biosciences Division, Oak Ridge National LaboratoryOak Ridge TN
| |
Collapse
|
28
|
Abstract
Common sequence variants within a gene often generate important differences in expression of corresponding mRNAs. This high level of local (allelic) control-or cis modulation-rivals that produced by gene targeting, but expression is titrated finely over a range of levels. We are interested in exploiting this allelic variation to study gene function and downstream consequences of differences in expression dosage. We have used several bioinformatics and molecular approaches to estimate error rates in the discovery of cis modulation and to analyze some of the biological and technical confounds that contribute to the variation in gene expression profiling. Our analysis of SNPs and alternative transcripts, combined with eQTL maps and selective gene resequencing, revealed that between 17 and 25% of apparent cis modulation is caused by SNPs that overlap probes rather than by genuine quantitative differences in mRNA levels. This estimate climbs to 40-50% when qualitative differences between isoform variants are included. We have developed an analytical approach to filter differences in expression and improve the yield of genuine cis-modulated transcripts to approximately 80%. This improvement is important because the resulting variation can be successfully used to study downstream consequences of altered expression on higher-order phenotypes. Using a systems genetics approach we show that two validated cis-modulated genes, Stk25 and Rasd2, are likely to control expression of downstream targets and affect disease susceptibility.
Collapse
|
29
|
Abstract
Variation in gene expression constitutes an important source of biological variability within and between populations that is likely to contribute significantly to phenotypic diversity. Recent conceptual, technical, and methodological advances have enabled the genome-scale dissection of transcriptional variation. Here, we outline common approaches for detecting gene expression quantitative trait loci, and summarize the insights gleaned from these studies regarding the genetic architecture of transcriptional variation and the nature of regulatory alleles. Particular emphasis is placed on human studies, and we discuss experimental designs that ensure that increasingly large and complex studies continue to advance our understanding of gene expression variation. We conclude by discussing the evolution of gene expression levels, and we explore prospects for leveraging new technological developments to investigate inherited variation in gene expression in even greater depth.
Collapse
Affiliation(s)
- Daniel A Skelly
- Department of Genome Sciences, University of Washington, Seattle, Washington, 98195, USA.
| | | | | |
Collapse
|
30
|
Gatti DM, Harrill AH, Wright FA, Threadgill DW, Rusyn I. Replication and narrowing of gene expression quantitative trait loci using inbred mice. Mamm Genome 2009; 20:437-46. [PMID: 19609828 DOI: 10.1007/s00335-009-9199-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2009] [Accepted: 06/02/2009] [Indexed: 10/20/2022]
Abstract
Gene expression quantitative trait locus (eQTL) mapping has become a powerful tool in systems biology. While many authors have made important discoveries using this approach, one persistent challenge in eQTL studies is the selection of loci and genes that should receive further biological investigation. In this study we compared eQTL generated from gene expression profiling in the livers of two panels of mouse strains: 41 BXD recombinant inbred and 36 Mouse Diversity Panel (MDP) strains. Cis-eQTL, loci in which the transcript and its maximum QTL are colocated, have been shown to be more reproducible than trans-eQTL, which are not colocated with the transcript. We observed that between 9.9 and 12.1% of cis-eQTL and between 2.0 and 12.6% of trans-eQTL replicated between the two panels depending on the degree of statistical stringency. Notably, a significant eQTL hotspot on distal chromosome 12 observed in the BXD panel was reproduced in the MDP. Furthermore, the shorter linkage disequilibrium in the MDP strains allowed us to considerably narrow the locus and limit the number of candidate genes to a cluster of Serpin genes, which code for extracellular proteases. We conclude that this strategy has some utility in increasing confidence and resolution in eQTL mapping studies; however, due to the high false-positive rate in the MDP, eQTL mapping in inbred strains is best carried out in combination with an eQTL linkage study.
Collapse
Affiliation(s)
- Daniel M Gatti
- Department of Environmental Sciences & Engineering, University of North Carolina, CB 7431, Chapel Hill, NC 27599, USA
| | | | | | | | | |
Collapse
|
31
|
Gatti DM, Shabalin AA, Lam TC, Wright FA, Rusyn I, Nobel AB. FastMap: fast eQTL mapping in homozygous populations. ACTA ACUST UNITED AC 2008; 25:482-9. [PMID: 19091771 PMCID: PMC2642639 DOI: 10.1093/bioinformatics/btn648] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Motivation: Gene expression Quantitative Trait Locus (eQTL) mapping measures the association between transcript expression and genotype in order to find genomic locations likely to regulate transcript expression. The availability of both gene expression and high-density genotype data has improved our ability to perform eQTL mapping in inbred mouse and other homozygous populations. However, existing eQTL mapping software does not scale well when the number of transcripts and markers are on the order of 105 and 105–106, respectively. Results: We propose a new method, FastMap, for fast and efficient eQTL mapping in homozygous inbred populations with binary allele calls. FastMap exploits the discrete nature and structure of the measured single nucleotide polymorphisms (SNPs). In particular, SNPs are organized into a Hamming distance-based tree that minimizes the number of arithmetic operations required to calculate the association of a SNP by making use of the association of its parent SNP in the tree. FastMap's tree can be used to perform both single marker mapping and haplotype association mapping over an m-SNP window. These performance enhancements also permit permutation-based significance testing. Availability: The FastMap program and source code are available at the website: http://cebc.unc.edu/fastmap86.html Contact:iir@unc.edu; nobel@email.unc.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Daniel M Gatti
- Department of Environmental Sciences and Engineering, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | | | | | | | | | | |
Collapse
|
32
|
Crabbe JC. Review. Neurogenetic studies of alcohol addiction. Philos Trans R Soc Lond B Biol Sci 2008; 363:3201-11. [PMID: 18640917 DOI: 10.1098/rstb.2008.0101] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Neurogenetic studies of alcohol dependence have relied substantially on genetic animal models, particularly rodents. Studies of inbred strains, selectively bred lines and mutants bearing genes whose function has been targeted for over or under expression are reviewed. Studies focused on gene expression changes are the most recent contributors to this literature, and some genetic effects may work through epigenetic mechanisms. In a few instances, interesting parallels have been revealed between genetic risk in humans and studies in non-human animal models. Future approaches are likely to be increasingly complex.
Collapse
Affiliation(s)
- John C Crabbe
- Department of Behavioral Neuroscience, Portland Alcohol Research Center, Oregon Health & Science University, VA Medical Center R&D 12, 3710 Southwest US Veterans Hospital Road, Portland, OR 97239, USA.
| |
Collapse
|
33
|
Mozhui K, Ciobanu DC, Schikorski T, Wang X, Lu L, Williams RW. Dissection of a QTL hotspot on mouse distal chromosome 1 that modulates neurobehavioral phenotypes and gene expression. PLoS Genet 2008; 4:e1000260. [PMID: 19008955 PMCID: PMC2577893 DOI: 10.1371/journal.pgen.1000260] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2008] [Accepted: 10/14/2008] [Indexed: 11/18/2022] Open
Abstract
A remarkably diverse set of traits maps to a region on mouse distal chromosome 1 (Chr 1) that corresponds to human Chr 1q21-q23. This region is highly enriched in quantitative trait loci (QTLs) that control neural and behavioral phenotypes, including motor behavior, escape latency, emotionality, seizure susceptibility (Szs1), and responses to ethanol, caffeine, pentobarbital, and haloperidol. This region also controls the expression of a remarkably large number of genes, including genes that are associated with some of the classical traits that map to distal Chr 1 (e.g., seizure susceptibility). Here, we ask whether this QTL-rich region on Chr 1 (Qrr1) consists of a single master locus or a mixture of linked, but functionally unrelated, QTLs. To answer this question and to evaluate candidate genes, we generated and analyzed several gene expression, haplotype, and sequence datasets. We exploited six complementary mouse crosses, and combed through 18 expression datasets to determine class membership of genes modulated by Qrr1. Qrr1 can be broadly divided into a proximal part (Qrr1p) and a distal part (Qrr1d), each associated with the expression of distinct subsets of genes. Qrr1d controls RNA metabolism and protein synthesis, including the expression of approximately 20 aminoacyl-tRNA synthetases. Qrr1d contains a tRNA cluster, and this is a functionally pertinent candidate for the tRNA synthetases. Rgs7 and Fmn2 are other strong candidates in Qrr1d. FMN2 protein has pronounced expression in neurons, including in the dendrites, and deletion of Fmn2 had a strong effect on the expression of few genes modulated by Qrr1d. Our analysis revealed a highly complex gene expression regulatory interval in Qrr1, composed of multiple loci modulating the expression of functionally cognate sets of genes.
Collapse
Affiliation(s)
- Khyobeni Mozhui
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Daniel C. Ciobanu
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Thomas Schikorski
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Xusheng Wang
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Lu Lu
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Robert W. Williams
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
34
|
Accurate discovery of expression quantitative trait loci under confounding from spurious and genuine regulatory hotspots. Genetics 2008; 180:1909-25. [PMID: 18791227 DOI: 10.1534/genetics.108.094201] [Citation(s) in RCA: 101] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In genomewide mapping of expression quantitative trait loci (eQTL), it is widely believed that thousands of genes are trans-regulated by a small number of genomic regions called "regulatory hotspots," resulting in "trans-regulatory bands" in an eQTL map. As several recent studies have demonstrated, technical confounding factors such as batch effects can complicate eQTL analysis by causing many spurious associations including spurious regulatory hotspots. Yet little is understood about how these technical confounding factors affect eQTL analyses and how to correct for these factors. Our analysis of data sets with biological replicates suggests that it is this intersample correlation structure inherent in expression data that leads to spurious associations between genetic loci and a large number of transcripts inducing spurious regulatory hotspots. We propose a statistical method that corrects for the spurious associations caused by complex intersample correlation of expression measurements in eQTL mapping. Applying our intersample correlation emended (ICE) eQTL mapping method to mouse, yeast, and human identifies many more cis associations while eliminating most of the spurious trans associations. The concordances of cis and trans associations have consistently increased between different replicates, tissues, and populations, demonstrating the higher accuracy of our method to identify real genetic effects.
Collapse
|
35
|
Heimel JA, Hermans JM, Sommeijer JP, Levelt CN. Genetic control of experience-dependent plasticity in the visual cortex. GENES BRAIN AND BEHAVIOR 2008; 7:915-23. [PMID: 18700840 DOI: 10.1111/j.1601-183x.2008.00431.x] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Depriving one eye of visual experience during a sensitive period of development results in a shift in ocular dominance (OD) in the primary visual cortex (V1). To assess the heritability of this form of cortical plasticity and identify the responsible gene loci, we studied the influence of monocular deprivation on OD in a large number of recombinant inbred mouse strains derived from mixed C57BL/6J and DBA/2J backgrounds (BXD). The strength of imaged intrinsic signal responses in V1 to visual stimuli was strongly heritable as were various elements of OD plasticity. This has important implications for the use of mice of mixed genetic backgrounds for studying OD plasticity. C57BL/6J showed the most significant shift in OD, while some BXD strains did not show any shift at all. Interestingly, the increase in undeprived ipsilateral eye responses was not correlated to the decrease in deprived contralateral eye responses, suggesting that the size of these components of OD plasticity are not genetically controlled by only a single mechanism. We identified a quantitative trait locus regulating the change in response to the deprived eye. The locus encompasses 13 genes, two of which--Stch and Nrip1--contain missense polymorphisms. The expression levels of Stch and to a lesser extent Nrip1 in whole brain correlate with the trait identifying them as novel candidate plasticity genes.
Collapse
Affiliation(s)
- J A Heimel
- Netherlands Institute for Neuroscience, Royal Netherlands Academy of Arts and Sciences, Meibergdreef 47, 1105 BA, Amsterdam, The Netherlands.
| | | | | | | | | |
Collapse
|
36
|
Gilad Y, Rifkin SA, Pritchard JK. Revealing the architecture of gene regulation: the promise of eQTL studies. Trends Genet 2008; 24:408-15. [PMID: 18597885 DOI: 10.1016/j.tig.2008.06.001] [Citation(s) in RCA: 357] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2008] [Revised: 06/09/2008] [Accepted: 06/09/2008] [Indexed: 10/21/2022]
Abstract
Expression quantitative trait loci (eQTL) mapping studies have become a widely used tool for identifying genetic variants that affect gene regulation. In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining studies of variation in gene expression patterns with genome-wide genotyping. Results from recent eQTL mapping studies have revealed substantial heritable variation in gene expression within and between populations. In many cases, genetic factors that influence gene expression levels can be mapped to proximal (putatively cis) eQTLs and, less often, to distal (putatively trans) eQTLs. Beyond providing great insight into the biology of gene regulation, a combination of eQTL studies with results from traditional linkage or association studies of human disease may help predict a specific regulatory role for polymorphic sites previously associated with disease.
Collapse
Affiliation(s)
- Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| | | | | |
Collapse
|
37
|
Bhasin JM, Chakrabarti E, Peng DQ, Kulkarni A, Chen X, Smith JD. Sex specific gene regulation and expression QTLs in mouse macrophages from a strain intercross. PLoS One 2008; 3:e1435. [PMID: 18197246 PMCID: PMC2174529 DOI: 10.1371/journal.pone.0001435] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2007] [Accepted: 12/04/2007] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND A powerful way to identify genes for complex traits it to combine genetic and genomic methods. Many trait quantitative trait loci (QTLs) for complex traits are sex specific, but the reason for this is not well understood. METHODOLOGY/PRINCIPAL FINDINGS RNA was prepared from bone marrow derived macrophages of 93 female and 114 male F(2) mice derived from a strain intercross between apoE-deficient mice on the AKR and DBA/2 genetic backgrounds, and was subjected to transcriptome profiling using microarrays. A high density genome scan was performed using a mouse SNP chip, and expression QTLs (eQTLs) were located for expressed transcripts. Using suggestive and significant LOD score cutoffs of 3.0 and 4.3, respectively, thousands of eQTLs in the female and male cohorts were identified. At the suggestive LOD threshold the majority of the eQTLs were trans eQTLs, mapping unlinked to the position of the gene. Cis eQTLs, which mapped to the location of the gene, had much higher LOD scores than trans eQTLs, indicating their more direct effect on gene expression. The majority of cis eQTLs were common to both males and females, but only approximately 1% of the trans eQTLs were shared by both sexes. At the significant LOD threshold, the majority of eQTLs were cis eQTLs, which were mostly sex-shared, while the trans eQTLs were overwhelmingly sex-specific. Pooling the male and female data, 31% of expressed transcripts were expressed at different levels in males vs. females after correction for multiple testing. CONCLUSIONS/SIGNIFICANCE These studies demonstrate a large sex effect on gene expression and trans regulation, under conditions where male and female derived cells were cultured ex vivo and thus without the influence of endogenous sex steroids. These data suggest that eQTL data from male and female mice should be analyzed separately, as many effects, such as trans regulation are sex specific.
Collapse
Affiliation(s)
- Jeffrey M. Bhasin
- Department of Cell Biology, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Enakshi Chakrabarti
- Department of Cell Biology, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Dao-Quan Peng
- Department of Cell Biology, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Aneesh Kulkarni
- Department of Cell Biology, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Xi Chen
- Department of Quantitative Health Sciences, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Jonathan D. Smith
- Department of Cell Biology, Cleveland Clinic, Cleveland, Ohio, United States of America
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine of Case Western Reserve University, Cleveland, Ohio, United States of America
| |
Collapse
|
38
|
Hofstetter JR, Svihla-Jones DA, Mayeda AR. A QTL on mouse chromosome 12 for the genetic variance in free-running circadian period between inbred strains of mice. J Circadian Rhythms 2007; 5:7. [PMID: 17974007 PMCID: PMC2174920 DOI: 10.1186/1740-3391-5-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2007] [Accepted: 10/31/2007] [Indexed: 11/21/2022] Open
Abstract
Background Many genes control circadian period in mice. Prior studies suggested a quantitative trait locus (QTL) on proximal mouse chromosome 12 for interstrain differences in circadian period. Since the B6.D2NAhrd/J strain has DBA/2 alleles for a portion of proximal chromosome 12 introgressed onto its C57BL/6J background, we hypothesized that these mice would have a shorter circadian period than C57BL/6J mice. Methods We compared circadian phenotypes of B6.D2NAhrd/J and C57BL/6 mice: period of general locomotor activity in constant dark and rest/activity pattern in alternating light and dark. We genotyped the B6.D2NAhrd/J mice to characterize the size of the genomic insert. To aid in identifying candidate quantitative trait genes we queried databases about the resident SNPs, whole brain gene expression in C57BL/6J versus DBA/2J mice, and circadian patterns of gene expression. Results The B6.D2NAhrd/J inbred mice have a shorter circadian period of locomotor activity than the C57BL/6J strain. Furthermore, the genomic insert is associated with another phenotype: the mean phase of activity minimum in the dark part of a light-dark lighting cycle. It was one hour later than in the background strain. The B6.D2NAhrd/J mice have a DBA/2J genomic insert spanning 35.4 to 41.0 megabase pairs on Chromosome 12. The insert contains 15 genes and 12 predicted genes. In this region Ahr (arylhydrocarbon receptor) and Zfp277 (zinc finger protein 277) both contain non-synonymous SNPs. Zfp277 also showed differential expression in whole brain and was cis-regulated. Three genes and one predicted gene showed a circadian pattern of expression in liver, including Zfp277. Conclusion We not only fine-mapped the QTL for circadian period on chromosome 12 but found a new QTL there as well: an association with the timing of the nocturnal activity-minimum. Candidate quantitative trait genes in this QTL are zinc finger protein 277 and arylhydrocarbon receptor. Arylhydrocarbon receptor is structurally related to Bmal1, a canonical clock gene.
Collapse
Affiliation(s)
- John R Hofstetter
- Department of Psychiatry, Richard L, Roudebush Veterans Administration Medical Center (VAMC), Indianapolis, IN 46202, USA.
| | | | | |
Collapse
|
39
|
Hofstetter JR, Hitzemann RJ, Belknap JK, Walter NAR, McWeeney SK, Mayeda AR. Characterization of the quantitative trait locus for haloperidol-induced catalepsy on distal mouse chromosome 1. GENES BRAIN AND BEHAVIOR 2007; 7:214-23. [PMID: 17696997 DOI: 10.1111/j.1601-183x.2007.00340.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We report here the confirmation of the quantitative trait locus for haloperidol-induced catalepsy on distal chromosome (Chr) 1. We determined that this quantitative trait locus was captured in the B6.D2-Mtv7a/Ty congenic mouse strain, whose introgressed genomic interval extends from approximately 169.1 to 191.3 Mb. We then constructed a group of overlapping interval-specific congenic strains to further break up the interval and remapped the locus between 177.5 and 183.4 Mb. We next queried single nucleotide polymorphism (SNP) data sets and identified three genes with nonsynonymous coding SNPs in the quantitative trait locus. We also queried two brain gene expression data sets and found five known genes in this 5.9-Mb interval that are differentially expressed in both whole brain and striatum. Three of the candidate quantitative trait genes were differentially expressed using quantitative real-time polymerase chain reaction analyses. Overall, the current study illustrates how multiple approaches, including congenic fine mapping, SNP analysis and microarray gene expression screens, can be integrated both to reduce the quantitative trait locus interval significantly and to detect promising candidate quantitative trait genes.
Collapse
Affiliation(s)
- J R Hofstetter
- Department of Veterans Affairs, Richard L. Roudebush Medical Center, Indianapolis, IN 46202, USA.
| | | | | | | | | | | |
Collapse
|
40
|
Gatti D, Maki A, Chesler EJ, Kirova R, Kosyk O, Lu L, Manly KF, Qu Y, Williams RW, Perkins A, Langston MA, Threadgill DW, Rusyn I. Genome-level analysis of genetic regulation of liver gene expression networks. Hepatology 2007; 46:548-57. [PMID: 17542012 PMCID: PMC3518845 DOI: 10.1002/hep.21682] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
UNLABELLED The liver is the primary site for the metabolism of nutrients, drugs, and chemical agents. Although metabolic pathways are complex and tightly regulated, genetic variation among individuals, reflected in variations in gene expression levels, introduces complexity into research on liver disease. This study dissected genetic networks that control liver gene expression through the combination of large-scale quantitative mRNA expression analysis with genetic mapping in a reference population of BXD recombinant inbred mouse strains for which extensive single-nucleotide polymorphism, haplotype, and phenotypic data are publicly available. We profiled gene expression in livers of naive mice of both sexes from C57BL/6J, DBA/2J, B6D2F1, and 37 BXD strains using Agilent oligonucleotide microarrays. These data were used to map quantitative trait loci (QTLs) responsible for variations in the expression of about 19,000 transcripts. We identified polymorphic local and distant QTLs, including several loci that control the expression of large numbers of genes in liver, by comparing the physical transcript position with the location of the controlling QTL. CONCLUSION The data are available through a public web-based resource (www.genenetwork.org) that allows custom data mining, identification of coregulated transcripts and correlated phenotypes, cross-tissue, and cross-species comparisons, as well as testing of a broad array of hypotheses.
Collapse
Affiliation(s)
- Daniel Gatti
- Department of Environmental Sciences and Engineering, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599
| | - Akira Maki
- Department of Environmental Sciences and Engineering, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599
| | - Elissa J. Chesler
- Life Sciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831
| | - Roumyana Kirova
- Life Sciences Division, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831
| | - Oksana Kosyk
- Department of Environmental Sciences and Engineering, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599
| | - Lu Lu
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN 38163
| | | | - Yanhua Qu
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN 38163
| | - Robert W. Williams
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN 38163
| | - Andy Perkins
- Department of Computer Science, University of Tennessee, Knoxville, TN 37996
| | - Michael A. Langston
- Department of Computer Science, University of Tennessee, Knoxville, TN 37996
| | - David W. Threadgill
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599
| | - Ivan Rusyn
- Department of Environmental Sciences and Engineering, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599
- Corresponding Author: Ivan Rusyn, M.D., Ph.D., Department of Environmental Sciences and Engineering, School of Public Health, University of North Carolina at Chapel Hill, CB #7431, Chapel Hill, NC 27599; Phone/Fax: 919-843-2596,
| |
Collapse
|
41
|
Alberts R, Terpstra P, Li Y, Breitling R, Nap JP, Jansen RC. Sequence polymorphisms cause many false cis eQTLs. PLoS One 2007; 2:e622. [PMID: 17637838 PMCID: PMC1906859 DOI: 10.1371/journal.pone.0000622] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2007] [Accepted: 05/29/2007] [Indexed: 11/23/2022] Open
Abstract
Many investigations have reported the successful mapping of quantitative trait loci (QTLs) for gene expression phenotypes (eQTLs). Local eQTLs, where expression phenotypes map to the genes themselves, are of especially great interest, because they are direct candidates for previously mapped physiological QTLs. Here we show that many mapped local eQTLs in genetical genomics experiments do not reflect actual expression differences caused by sequence polymorphisms in cis-acting factors changing mRNA levels. Instead they indicate hybridization differences caused by sequence polymorphisms in the mRNA region that is targeted by the microarray probes. Many such polymorphisms can be detected by a sensitive and novel statistical approach that takes the individual probe signals into account. Applying this approach to recent mouse and human eQTL data, we demonstrate that indeed many local eQTLs are falsely reported as “cis-acting” or “cis” and can be successfully detected and eliminated with this approach.
Collapse
Affiliation(s)
- Rudi Alberts
- Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
| | - Peter Terpstra
- Groningen Bioinformatics Centre, University Medical Centre Groningen, University of Groningen, Groningen, The Netherlands
| | - Yang Li
- Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
| | - Rainer Breitling
- Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
| | - Jan-Peter Nap
- Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
- Bioinformatics Expertise Center, Institute for Life Science and Technology, Hanze University Groningen, University for Applied Sciences, Groningen, The Netherlands
| | - Ritsert C. Jansen
- Groningen Bioinformatics Centre, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Haren, The Netherlands
- Groningen Bioinformatics Centre, University Medical Centre Groningen, University of Groningen, Groningen, The Netherlands
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
42
|
Greenhall JA, Zapala MA, Cáceres M, Libiger O, Barlow C, Schork NJ, Lockhart DJ. Detecting genetic variation in microarray expression data. Genome Res 2007; 17:1228-35. [PMID: 17609390 PMCID: PMC1933513 DOI: 10.1101/gr.6307307] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The use of high-density oligonucleotide arrays to measure the expression levels of thousands of genes in parallel has become commonplace. To take further advantage of the growing body of data, we developed a method, termed "GeSNP," to mine the detailed hybridization patterns in oligonucleotide array expression data for evidence of genetic variation. To demonstrate the performance of the algorithm, the hybridization patterns in data obtained previously from SAMP8/Ta, SAMP10/Ta, and SAMR1/Ta inbred mice and from humans and chimpanzees were analyzed. Genes with consistent strain-specific and species-specific hybridization pattern differences were identified, and approximately 90% of the candidate genes were independently confirmed to harbor sequence differences. Importantly, the quality of gene expression data was also improved by masking the probes of regions with putative sequence differences between species and strains. To illustrate the application to human disease groups, data from an inflammatory bowel disease study were analyzed. GeSNP identified sequence differences in candidate genes previously discovered in independent association and linkage studies and uncovered many promising new candidates. This approach enables the opportunistic extraction of genetic variation information from new or pre-existing gene expression data obtained with high-density oligonucleotide arrays.
Collapse
Affiliation(s)
- Jennifer A. Greenhall
- The Salk Institute for Biological Studies, La Jolla, California 92037, USA
- Neurosciences Graduate Program, School of Medicine, University of California, San Diego, California 92093, USA
| | - Matthew A. Zapala
- Biomedical Sciences Graduate Program, School of Medicine, University of California, San Diego, California 92093, USA
- Polymorphism Research Laboratory, Department of Psychiatry, University of California, San Diego, California 92093, USA
| | - Mario Cáceres
- The Salk Institute for Biological Studies, La Jolla, California 92037, USA
- Genes and Disease Program, Center for Genomic Regulation (CRG-UPF), Barcelona 08003, Spain
| | - Ondrej Libiger
- Polymorphism Research Laboratory, Department of Psychiatry, University of California, San Diego, California 92093, USA
| | - Carrolee Barlow
- The Salk Institute for Biological Studies, La Jolla, California 92037, USA
- Brain Cells, Inc., San Diego, California 92121, USA
| | - Nicholas J. Schork
- Biomedical Sciences Graduate Program, School of Medicine, University of California, San Diego, California 92093, USA
- Polymorphism Research Laboratory, Department of Psychiatry, University of California, San Diego, California 92093, USA
| | - David J. Lockhart
- The Salk Institute for Biological Studies, La Jolla, California 92037, USA
- Amicus Therapeutics, Cranbury, New Jersey 08512, USA
- Corresponding author.E-mail ; fax (609) 662-2001
| |
Collapse
|
43
|
Reply to “Normalization procedures and detection of linkage signal in genetical-genomics experiments”. Nat Genet 2006. [DOI: 10.1038/ng0806-856] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
44
|
Williams RW. Expression genetics and the phenotype revolution. Mamm Genome 2006; 17:496-502. [PMID: 16783631 DOI: 10.1007/s00335-006-0006-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2006] [Accepted: 02/06/2006] [Indexed: 01/22/2023]
Abstract
Genetic analysis of variation demands large numbers of individuals and even larger numbers of genotypes. The identification of alleles associated with Mendelian disorders has involved sample sizes of a thousand or more. Pervasive and common diseases that afflict human populations--cancer, heart disease, diabetes, neurodegeneration, addiction--are all polygenic and are even more demanding of large numbers. DeCode Genetics (http://www.decode.com) has harnessed the human resources of Iceland to unravel genetic and molecular causes of complex disease. The UK BioBank project (http://www.ukbiobank.ac.uk/) will incorporate 500,000 adult volunteers. The murine Collaborative Cross is the experimental equivalent of these human populations and will consist of a panel of approximately 1000 recombinant strains, expandable by intercrossing to much larger numbers of isogenic but heterozygous F(1)s. Massive projects of these types require efficient technologies. We have made enormous progress on the genotyping front, and it is now important to focus energy on devising ultrahigh-throughput methods to phenotype. Molecular phenotyping of the transcriptome has matured, and it is now possible to acquire hundreds of thousands of mRNA phenotypes at a cost matching those of SNPs. Proteomic and cell-based assays are also maturing rapidly. The acquisition of a personal genome along with a personal molecular phenome will provide an effective foundation for personalized medicine. Rodent models will be essential to test our ability to predict susceptibility and disease outcome using SNP data, molecular phenomes, and environmental exposures. These models will also be essential to test new treatments in a robust systems context that accounts for genetic variation.
Collapse
Affiliation(s)
- Robert W Williams
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, 855 Monroe Avenue, Memphis, TN 38163, USA.
| |
Collapse
|