51
|
Matukumalli LK, Schroeder SG. Sequence Based Gene Expression Analysis. Bioinformatics 2009. [DOI: 10.1007/978-0-387-92738-1_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022] Open
|
52
|
Grieve IC, Dickens NJ, Pravenec M, Kren V, Hubner N, Cook SA, Aitman TJ, Petretto E, Mangion J. Genome-wide co-expression analysis in multiple tissues. PLoS One 2008; 3:e4033. [PMID: 19112506 PMCID: PMC2603584 DOI: 10.1371/journal.pone.0004033] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2008] [Accepted: 11/24/2008] [Indexed: 11/18/2022] Open
Abstract
Expression quantitative trait loci (eQTLs) represent genetic control points of gene expression, and can be categorized as cis- and trans-acting, reflecting local and distant regulation of gene expression respectively. Although there is evidence of co-regulation within clusters of trans-eQTLs, the extent of co-expression patterns and their relationship with the genotypes at eQTLs are not fully understood. We have mapped thousands of cis- and trans-eQTLs in four tissues (fat, kidney, adrenal and left ventricle) in a large panel of rat recombinant inbred (RI) strains. Here we investigate the genome-wide correlation structure in expression levels of eQTL transcripts and underlying genotypes to elucidate the nature of co-regulation within cis- and trans-eQTL datasets. Across the four tissues, we consistently found statistically significant correlations of cis-regulated gene expression to be rare (<0.9% of all pairs tested). Most (>80%) of the observed significant correlations of cis-regulated gene expression are explained by correlation of the underlying genotypes. In comparison, co-expression of trans-regulated gene expression is more common, with significant correlation ranging from 2.9%-14.9% of all pairs of trans-eQTL transcripts. We observed a total of 81 trans-eQTL clusters (hot-spots), defined as consisting of > or =10 eQTLs linked to a common region, with very high levels of correlation between trans-regulated transcripts (77.2-90.2%). Moreover, functional analysis of large trans-eQTL clusters (> or =30 eQTLs) revealed significant functional enrichment among genes comprising 80% of the large clusters. The results of this genome-wide co-expression study show the effects of the eQTL genotypes on the observed patterns of correlation, and suggest that functional relatedness between genes underlying trans-eQTLs is reflected in the degree of co-expression observed in trans-eQTL clusters. Our results demonstrate the power of an integrative, systematic approach to the analysis of a large gene expression dataset to uncover underlying structure, and inform future eQTL studies.
Collapse
Affiliation(s)
- Ian C. Grieve
- MRC Clinical Sciences Centre, Imperial College, Hammersmith Hospital, London, United Kingdom
| | - Nicholas J. Dickens
- MRC Clinical Sciences Centre, Imperial College, Hammersmith Hospital, London, United Kingdom
- Institute of Cancer Research, Belmont, Sutton, Surrey, United Kingdom
| | - Michal Pravenec
- Institute of Biology and Medical Genetics, First Faculty of Medicine and General Teaching Hospital, Charles University, Prague, Czech Republic
- Institute of Physiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Vladimir Kren
- Institute of Biology and Medical Genetics, First Faculty of Medicine and General Teaching Hospital, Charles University, Prague, Czech Republic
- Institute of Physiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Norbert Hubner
- Max-Delbrűck-Center for Molecular Medicine, Berlin-Buch, Berlin, Germany
| | - Stuart A. Cook
- MRC Clinical Sciences Centre, Imperial College, Hammersmith Hospital, London, United Kingdom
- National Heart and Lung Institute, Imperial College, London, United Kingdom
| | - Timothy J. Aitman
- MRC Clinical Sciences Centre, Imperial College, Hammersmith Hospital, London, United Kingdom
| | - Enrico Petretto
- MRC Clinical Sciences Centre, Imperial College, Hammersmith Hospital, London, United Kingdom
- Division of Epidemiology, Public Health and Primary Care, Imperial College, London, United Kingdom
- * E-mail:
| | - Jonathan Mangion
- MRC Clinical Sciences Centre, Imperial College, Hammersmith Hospital, London, United Kingdom
| |
Collapse
|
53
|
Druka A, Druka I, Centeno AG, Li H, Sun Z, Thomas WTB, Bonar N, Steffenson BJ, Ullrich SE, Kleinhofs A, Wise RP, Close TJ, Potokina E, Luo Z, Wagner C, Schweizer GF, Marshall DF, Kearsey MJ, Williams RW, Waugh R. Towards systems genetic analyses in barley: Integration of phenotypic, expression and genotype data into GeneNetwork. BMC Genet 2008; 9:73. [PMID: 19017390 PMCID: PMC2630324 DOI: 10.1186/1471-2156-9-73] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2008] [Accepted: 11/18/2008] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND A typical genetical genomics experiment results in four separate data sets; genotype, gene expression, higher-order phenotypic data and metadata that describe the protocols, processing and the array platform. Used in concert, these data sets provide the opportunity to perform genetic analysis at a systems level. Their predictive power is largely determined by the gene expression dataset where tens of millions of data points can be generated using currently available mRNA profiling technologies. Such large, multidimensional data sets often have value beyond that extracted during their initial analysis and interpretation, particularly if conducted on widely distributed reference genetic materials. Besides quality and scale, access to the data is of primary importance as accessibility potentially allows the extraction of considerable added value from the same primary dataset by the wider research community. Although the number of genetical genomics experiments in different plant species is rapidly increasing, none to date has been presented in a form that allows quick and efficient on-line testing for possible associations between genes, loci and traits of interest by an entire research community. DESCRIPTION Using a reference population of 150 recombinant doubled haploid barley lines we generated novel phenotypic, mRNA abundance and SNP-based genotyping data sets, added them to a considerable volume of legacy trait data and entered them into the GeneNetwork http://www.genenetwork.org. GeneNetwork is a unified on-line analytical environment that enables the user to test genetic hypotheses about how component traits, such as mRNA abundance, may interact to condition more complex biological phenotypes (higher-order traits). Here we describe these barley data sets and demonstrate some of the functionalities GeneNetwork provides as an easily accessible and integrated analytical environment for exploring them. CONCLUSION By integrating barley genotypic, phenotypic and mRNA abundance data sets directly within GeneNetwork's analytical environment we provide simple web access to the data for the research community. In this environment, a combination of correlation analysis and linkage mapping provides the potential to identify and substantiate gene targets for saturation mapping and positional cloning. By integrating datasets from an unsequenced crop plant (barley) in a database that has been designed for an animal model species (mouse) with a well established genome sequence, we prove the importance of the concept and practice of modular development and interoperability of software engineering for biological data sets.
Collapse
Affiliation(s)
- Arnis Druka
- Scottish Crop Research Institute, Invergowrie, Dundee, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
54
|
Integration of the barley genetic and seed proteome maps for chromosome 1H, 2H, 3H, 5H and 7H. Funct Integr Genomics 2008; 9:135-43. [DOI: 10.1007/s10142-008-0101-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2008] [Revised: 10/14/2008] [Accepted: 10/18/2008] [Indexed: 11/25/2022]
|
55
|
Gilad Y, Rifkin SA, Pritchard JK. Revealing the architecture of gene regulation: the promise of eQTL studies. Trends Genet 2008; 24:408-15. [PMID: 18597885 DOI: 10.1016/j.tig.2008.06.001] [Citation(s) in RCA: 357] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2008] [Revised: 06/09/2008] [Accepted: 06/09/2008] [Indexed: 10/21/2022]
Abstract
Expression quantitative trait loci (eQTL) mapping studies have become a widely used tool for identifying genetic variants that affect gene regulation. In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining studies of variation in gene expression patterns with genome-wide genotyping. Results from recent eQTL mapping studies have revealed substantial heritable variation in gene expression within and between populations. In many cases, genetic factors that influence gene expression levels can be mapped to proximal (putatively cis) eQTLs and, less often, to distal (putatively trans) eQTLs. Beyond providing great insight into the biology of gene regulation, a combination of eQTL studies with results from traditional linkage or association studies of human disease may help predict a specific regulatory role for polymorphic sites previously associated with disease.
Collapse
Affiliation(s)
- Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| | | | | |
Collapse
|
56
|
Druka A, Potokina E, Luo Z, Bonar N, Druka I, Zhang L, Marshall DF, Steffenson BJ, Close TJ, Wise RP, Kleinhofs A, Williams RW, Kearsey MJ, Waugh R. Exploiting regulatory variation to identify genes underlying quantitative resistance to the wheat stem rust pathogen Puccinia graminis f. sp. tritici in barley. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2008; 117:261-72. [PMID: 18542913 DOI: 10.1007/s00122-008-0771-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2007] [Accepted: 04/08/2008] [Indexed: 05/13/2023]
Abstract
We previously mapped mRNA transcript abundance traits (expression-QTL or eQTL) using the Barley1 Affymetrix array and 'whole plant' tissue from 139 progeny of the Steptoe x Morex (St/Mx) reference barley mapping population. Of the 22,840 probesets (genes) on the array, 15,987 reported transcript abundance signals that were suitable for eQTL analysis, and this revealed a genome-wide distribution of 23,738 significant eQTLs. Here we have explored the potential of using these mRNA abundance eQTL traits as surrogates for the identification of candidate genes underlying the interaction between barley and the wheat stem rust fungus Puccinia graminis f. sp. tritici. We re-analysed quantitative 'resistance phenotype' data collected on this population in 1990/1991 and identified six loci associated with barley's reaction to stem rust. One of these coincided with the major stem rust resistance locus Rpg1, that we had previously positionally cloned using this population. Correlation analysis between phenotype values for rust infection and mRNA abundance values reported by the 22,840 GeneChip probe sets placed Rpg1, which is on the Barley1 GeneChip, in the top five candidate genes for the major QTL on chromosome 7H corresponding to the location of Rpg1. A second co-located with the rpg4/Rpg5 stem rust resistance locus that has been mapped in a different population and the remaining four were novel. Correlation analyses identified candidate genes for the rpg4/Rpg5 locus on chromosome 5H. By combining our data with additional published mRNA profiling data sets, we identify a putative sensory transduction histidine kinase as a strong candidate for a novel resistance locus on chromosome 2H and compile candidate gene lists for the other three loci.
Collapse
Affiliation(s)
- Arnis Druka
- Genetics Programme, Scottish Crop Research Institute, Invergowrie, Dundee, DD2 5DA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
57
|
An integrated approach to the characterization of two autochthonous lentil (Lens culinaris) landraces of Molise (south-central Italy). Heredity (Edinb) 2008; 101:136-44. [DOI: 10.1038/hdy.2008.39] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
|
58
|
Gene set enrichment in eQTL data identifies novel annotations and pathway regulators. PLoS Genet 2008; 4:e1000070. [PMID: 18464898 PMCID: PMC2346558 DOI: 10.1371/journal.pgen.1000070] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2007] [Accepted: 04/10/2008] [Indexed: 11/19/2022] Open
Abstract
Genome-wide gene expression profiling has been extensively used to generate biological hypotheses based on differential expression. Recently, many studies have used microarrays to measure gene expression levels across genetic mapping populations. These gene expression phenotypes have been used for genome-wide association analyses, an analysis referred to as expression QTL (eQTL) mapping. Here, eQTL analysis was performed in adipose tissue from 28 inbred strains of mice. We focused our analysis on "trans-eQTL bands", defined as instances in which the expression patterns of many genes were all associated to a common genetic locus. Genes comprising trans-eQTL bands were screened for enrichments in functional gene sets representing known biological pathways, and genes located at associated trans-eQTL band loci were considered candidate transcriptional modulators. We demonstrate that these patterns were enriched for previously characterized relationships between known upstream transcriptional regulators and their downstream target genes. Moreover, we used this strategy to identify both novel regulators and novel members of known pathways. Finally, based on a putative regulatory relationship identified in our analysis, we identified and validated a previously uncharacterized role for cyclin H in the regulation of oxidative phosphorylation. We believe that the specific molecular hypotheses generated in this study will reveal many additional pathway members and regulators, and that the analysis approaches described herein will be broadly applicable to other eQTL data sets.
Collapse
|
59
|
Ansel J, Bottin H, Rodriguez-Beltran C, Damon C, Nagarajan M, Fehrmann S, François J, Yvert G. Cell-to-cell stochastic variation in gene expression is a complex genetic trait. PLoS Genet 2008; 4:e1000049. [PMID: 18404214 PMCID: PMC2289839 DOI: 10.1371/journal.pgen.1000049] [Citation(s) in RCA: 129] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2008] [Accepted: 03/11/2008] [Indexed: 11/19/2022] Open
Abstract
The genetic control of common traits is rarely deterministic, with many genes contributing only to the chance of developing a given phenotype. This incomplete penetrance is poorly understood and is usually attributed to interactions between genes or interactions between genes and environmental conditions. Because many traits such as cancer can emerge from rare events happening in one or very few cells, we speculate an alternative and complementary possibility where some genotypes could facilitate these events by increasing stochastic cell-to-cell variations (or ‘noise’). As a very first step towards investigating this possibility, we studied how natural genetic variation influences the level of noise in the expression of a single gene using the yeast S. cerevisiae as a model system. Reproducible differences in noise were observed between divergent genetic backgrounds. We found that noise was highly heritable and placed under a complex genetic control. Scanning the genome, we mapped three Quantitative Trait Loci (QTL) of noise, one locus being explained by an increase in noise when transcriptional elongation was impaired. Our results suggest that the level of stochasticity in particular molecular regulations may differ between multicellular individuals depending on their genotypic background. The complex genetic architecture of noise buffering couples genetic to non-genetic robustness and provides a molecular basis to the probabilistic nature of complex traits. Although most inter-individual phenotypic variabilities are largely attributable to DNA differences, a wealth of examples illustrate how a single biological system can vary stochastically over time and between individuals. Identical twins are not identical, and similarly, clonal microbial cells differ in many aspects even when grown simultaneously in a common environment. Using yeast as a model system, we show that a population of isogenic cells all carrying genotype A showed higher cell-to-cell heterogeneity in gene expression than a population of isogenic cells of genotype B. We considered this level of intra-clonal heterogeneity as a quantitative trait and performed genetic linkage (on AxB) to search for regulators of it. This led to the demonstration that transcriptional elongation impairment increases stochastic variation in gene expression in vivo. Our results show that the two levels of inter-individual diversity, genetic and stochastic, are connected by a complex control of the former on the latter. We invite the community to revisit the interpretation of incomplete penetrance, which defines cases where a mutation does not cause the associated phenotype in all its carriers. We propose that, in the case of cancer or other diseases triggered by single cells, such mutations might increase stochastic molecular fluctuations and thereby the fraction of deviant cellular phenotypes in a human body.
Collapse
Affiliation(s)
- Juliet Ansel
- Université de Lyon, Lyon, France
- Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, CNRS, Lyon, France
- IFR128 BioSciences Lyon-Gerland, Lyon, France
| | - Hélène Bottin
- Université de Lyon, Lyon, France
- Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, CNRS, Lyon, France
- IFR128 BioSciences Lyon-Gerland, Lyon, France
| | - Camilo Rodriguez-Beltran
- Laboratoire de Biotechnologie et Bioprocédés, Institut National des Sciences Appliquées, Toulouse, France
| | - Christelle Damon
- Université de Lyon, Lyon, France
- Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, CNRS, Lyon, France
- IFR128 BioSciences Lyon-Gerland, Lyon, France
| | - Muniyandi Nagarajan
- Université de Lyon, Lyon, France
- Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, CNRS, Lyon, France
- IFR128 BioSciences Lyon-Gerland, Lyon, France
| | - Steffen Fehrmann
- Université de Lyon, Lyon, France
- Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, CNRS, Lyon, France
- IFR128 BioSciences Lyon-Gerland, Lyon, France
| | - Jean François
- Laboratoire de Biotechnologie et Bioprocédés, Institut National des Sciences Appliquées, Toulouse, France
| | - Gaël Yvert
- Université de Lyon, Lyon, France
- Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, CNRS, Lyon, France
- IFR128 BioSciences Lyon-Gerland, Lyon, France
- Laboratoire de Biotechnologie et Bioprocédés, Institut National des Sciences Appliquées, Toulouse, France
- * E-mail:
| |
Collapse
|
60
|
Argmann CA, Dierich A, Auwerx J. Uses of forward and reverse genetics in mice to study gene function. ACTA ACUST UNITED AC 2008; Chapter 29:Unit 29A.1. [PMID: 18265381 DOI: 10.1002/0471142727.mb29a01s73] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
As the focus of human genetics shifts from Mendelian traits to complex diseases, a sophisticated genetic tool kit-with space for genetics (classical, molecular, statistical, and quantitative), metabolics, proteomics, bioinformatics, and mathematics-is required to elucidate their multifactorial traits and regulatory processes. Importantly, mouse resources optimized to study the actions of isolated genetic loci on a fixed background are insufficient on their own for studying intact polygenic networks and genetic interactions, and researchers must work in the context of experimental model systems that optimally mimic the genetic structure of human populations. The success of such phenogenomic approaches depend on the efficacy by which specific mutations (gene targeting) and variability (recombinant inbreeding) can be introduced into the mouse genome, and on the optimization of phenotyping analyses of the mutant mouse lines. This unit describes the basic genetic approaches used to in the study of mouse model systems.
Collapse
Affiliation(s)
- Carmen A Argmann
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Illkirch, France
| | | | | |
Collapse
|
61
|
Foss EJ, Radulovic D, Shaffer SA, Ruderfer DM, Bedalov A, Goodlett DR, Kruglyak L. Genetic basis of proteome variation in yeast. Nat Genet 2007; 39:1369-75. [PMID: 17952072 DOI: 10.1038/ng.2007.22] [Citation(s) in RCA: 193] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2007] [Accepted: 09/20/2007] [Indexed: 11/09/2022]
Abstract
Proper regulation of protein levels is essential for health, and abnormal levels of proteins are hallmarks of many diseases. A number of studies have recently shown that messenger RNA levels vary among individuals of a species and that genetic linkage analysis can be used to identify quantitative trait loci that influence these levels. By contrast, little is known about the genetic basis of variation in protein levels in genetically diverse populations, in large part because techniques for large-scale measurements of protein abundance lag far behind those for measuring transcript abundance. Here we describe a label-free, mass spectrometry-based approach to measuring protein levels in total unfractionated cellular proteins, and we apply this approach to elucidate the genetic basis of variation in protein abundance in a cross between two diverse strains of yeast. Loci that influenced protein abundance differed from those that influenced transcript levels, emphasizing the importance of direct analysis of the proteome.
Collapse
Affiliation(s)
- Eric J Foss
- Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA
| | | | | | | | | | | | | |
Collapse
|
62
|
Rothan C, Causse M. Natural and artificially induced genetic variability in crop and model plant species for plant systems biology. EXS 2007; 97:21-53. [PMID: 17432262 DOI: 10.1007/978-3-7643-7439-6_2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The sequencing of plant genomes which was completed a few years ago for Arabidopsis thaliana and Oryza sativa is currently underway for numerous crop plants of commercial value such as maize, poplar, tomato grape or tobacco. In addition, hundreds of thousands of expressed sequence tags (ESTs) are publicly available that may well represent 40-60% of the genes present in plant genomes. Despite its importance for life sciences, genome information is only an initial step towards understanding gene function (functional genomics) and deciphering the complex relationships between individual genes in the framework of gene networks. In this chapter we introduce and discuss means of generating and identifying genetic diversity, i.e., means to genetically perturb a biological system and to subsequently analyse the systems response, e.g., the changes in plant morphology and chemical composition. Generating and identifying genetic diversity is in its own right a highly powerful resource of information and is established as an invaluable tool for systems biology.
Collapse
Affiliation(s)
- Christophe Rothan
- INRA-UMR 619 Biologie des Fruits, IBVI-INRA Bordeaux, BP 81, 71 Av. EdouardBourlaux, 33883 Villenave d'Ornon, France.
| | | |
Collapse
|
63
|
Dumas ME, Wilder SP, Bihoreau MT, Barton RH, Fearnside JF, Argoud K, D'Amato L, Wallis RH, Blancher C, Keun HC, Baunsgaard D, Scott J, Sidelmann UG, Nicholson JK, Gauguier D. Direct quantitative trait locus mapping of mammalian metabolic phenotypes in diabetic and normoglycemic rat models. Nat Genet 2007; 39:666-72. [PMID: 17435758 DOI: 10.1038/ng2026] [Citation(s) in RCA: 128] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2007] [Accepted: 03/16/2007] [Indexed: 11/09/2022]
Abstract
Characterizing the relationships between genomic and phenotypic variation is essential to understanding disease etiology. Information-dense data sets derived from pathophysiological, proteomic and transcriptomic profiling have been applied to map quantitative trait loci (QTLs). Metabolic traits, already used in QTL studies in plants, are essential phenotypes in mammalian genetics to define disease biomarkers. Using a complex mammalian system, here we show chromosomal mapping of untargeted plasma metabolic fingerprints derived from NMR spectroscopic analysis in a cross between diabetic and control rats. We propose candidate metabolites for the most significant QTLs. Metabolite profiling in congenic strains provided evidence of QTL replication. Linkage to a gut microbial metabolite (benzoate) can be explained by deletion of a uridine diphosphate glucuronosyltransferase. Mapping metabotypic QTLs provides a practical approach to understanding genome-phenotype relationships in mammals and may uncover deeper biological complexity, as extended genome (microbiome) perturbations that affect disease processes through transgenomic effects may influence QTL detection.
Collapse
Affiliation(s)
- Marc-Emmanuel Dumas
- Imperial College London, Department of Biomolecular Medicine, Division of Surgery, Oncology, Reproductive Biology and Anaesthetics, Faculty of Medicine, Sir Alexander Fleming Building, Exhibition Road, South Kensington, London SW7 2AZ, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
64
|
Bao L, Peirce JL, Zhou M, Li H, Goldowitz D, Williams RW, Lu L, Cui Y. An integrative genomics strategy for systematic characterization of genetic loci modulating phenotypes. Hum Mol Genet 2007; 16:1381-90. [PMID: 17428815 DOI: 10.1093/hmg/ddm089] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Naturally occurring genetic variations may affect certain phenotypes through influencing transcript levels of the genes that are causally related to those phenotypes. Genomic regions harboring common sequence variants that modulate gene expression can be mapped as quantitative trait loci (QTLs) using a newly developed genetical genomics approach. This enables a new strategy for systematically mapping novel genetic loci underlying various phenotypes. In this work, we started from a seed set of genes with variants that are known to affect behavioral and neurological phenotypes (as recorded in Mammalian Phenotype Ontology Database) and used microarrays to analyze their expression levels in brain samples of a panel of BXD recombinant inbred mouse strains. We then systematically mapped the QTLs controlling the expression of these genes. Candidate causal genes in the QTL intervals were evaluated for evidence of functional genetic polymorphisms. Using this method, we were able to predict novel genetic loci and causal genes for a number of behavioral and neurological phenotypes. Lines of independent evidence supporting some of our results were provided by transcription factor binding site analysis and by biomedical literature. This strategy integrates gene-phenotype relations from decades of experimental mutagenesis studies and new genomic resources to provide an approach to rapidly expand knowledge on genetic loci modulating phenotypes.
Collapse
Affiliation(s)
- Lei Bao
- Department of Molecular Sciences, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | | | | | | | | | | | | | | |
Collapse
|
65
|
Abstract
A new field of genetic analysis of global gene expression has emerged in recent years, driven by the realization that traditional techniques of linkage and association analysis can be applied to thousands of transcript levels measured by microarrays. Genetic dissection of transcript abundance has shed light on the architecture of quantitative traits, provided a new approach for connecting DNA sequence variation with phenotypic variation, and improved our understanding of transcriptional regulation and regulatory variation.
Collapse
Affiliation(s)
- Matthew V Rockman
- Lewis-Sigler Institute for Integrative Genomics and Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA
| | | |
Collapse
|
66
|
Demuth JP, Wade MJ. Experimental Methods for Measuring Gene Interactions. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2006. [DOI: 10.1146/annurev.ecolsys.37.091305.110124] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The role of epistasis in evolution has long been contentious. Resolving the issue requires empirical measurements that are statistically adequate and evolutionarily relevant. We review experimental methods for measuring epistasis, some that are commonly used but weak and others that are less frequently used but stronger. We review statistical genetic methods based on analyses of variances and means as well as molecular genetic methods for detecting gene interactions. We also highlight relevant empirical studies that illustrate the implementation of particular methods. In spite of the inherent weaknesses of most methods, epistasis is surprisingly common. We conclude with a discussion of how technologies for investigating genome-wide epistasis are bridging the gap between physiological and statistical epistasis for model organisms.
Collapse
Affiliation(s)
- Jeffery P. Demuth
- Department of Biology, Indiana University, Bloomington, Indiana 47405-3700;,
| | - Michael J. Wade
- Department of Biology, Indiana University, Bloomington, Indiana 47405-3700;,
| |
Collapse
|
67
|
Petretto E, Mangion J, Dickens NJ, Cook SA, Kumaran MK, Lu H, Fischer J, Maatz H, Kren V, Pravenec M, Hubner N, Aitman TJ. Heritability and tissue specificity of expression quantitative trait loci. PLoS Genet 2006; 2:e172. [PMID: 17054398 PMCID: PMC1617131 DOI: 10.1371/journal.pgen.0020172] [Citation(s) in RCA: 176] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2006] [Accepted: 08/28/2006] [Indexed: 11/19/2022] Open
Abstract
Variation in gene expression is heritable and has been mapped to the genome in humans and model organisms as expression quantitative trait loci (eQTLs). We applied integrated genome-wide expression profiling and linkage analysis to the regulation of gene expression in fat, kidney, adrenal, and heart tissues using the BXH/HXB panel of rat recombinant inbred strains. Here, we report the influence of heritability and allelic effect of the quantitative trait locus on detection of cis- and trans-acting eQTLs and discuss how these factors operate in a tissue-specific context. We identified several hundred major eQTLs in each tissue and found that cis-acting eQTLs are highly heritable and easier to detect than trans-eQTLs. The proportion of heritable expression traits was similar in all tissues; however, heritability alone was not a reliable predictor of whether an eQTL will be detected. We empirically show how the use of heritability as a filter reduces the ability to discover trans-eQTLs, particularly for eQTLs with small effects. Only 3% of cis- and trans-eQTLs exhibited large allelic effects, explaining more than 40% of the phenotypic variance, suggestive of a highly polygenic control of gene expression. Power calculations indicated that, across tissues, minor differences in genetic effects are expected to have a significant impact on detection of trans-eQTLs. Trans-eQTLs generally show smaller effects than cis-eQTLs and have a higher false discovery rate, particularly in more heterogeneous tissues, suggesting that small biological variability, likely relating to tissue composition, may influence detection of trans-eQTLs in this system. We delineate the effects of genetic architecture on variation in gene expression and show the sensitivity of this experimental design to tissue sampling variability in large-scale eQTL studies.
Collapse
Affiliation(s)
- Enrico Petretto
- Medical Research Council Clinical Sciences Centre, Faculty of Medicine, Imperial College, London, United Kingdom.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
68
|
Williams RW. Expression genetics and the phenotype revolution. Mamm Genome 2006; 17:496-502. [PMID: 16783631 DOI: 10.1007/s00335-006-0006-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2006] [Accepted: 02/06/2006] [Indexed: 01/22/2023]
Abstract
Genetic analysis of variation demands large numbers of individuals and even larger numbers of genotypes. The identification of alleles associated with Mendelian disorders has involved sample sizes of a thousand or more. Pervasive and common diseases that afflict human populations--cancer, heart disease, diabetes, neurodegeneration, addiction--are all polygenic and are even more demanding of large numbers. DeCode Genetics (http://www.decode.com) has harnessed the human resources of Iceland to unravel genetic and molecular causes of complex disease. The UK BioBank project (http://www.ukbiobank.ac.uk/) will incorporate 500,000 adult volunteers. The murine Collaborative Cross is the experimental equivalent of these human populations and will consist of a panel of approximately 1000 recombinant strains, expandable by intercrossing to much larger numbers of isogenic but heterozygous F(1)s. Massive projects of these types require efficient technologies. We have made enormous progress on the genotyping front, and it is now important to focus energy on devising ultrahigh-throughput methods to phenotype. Molecular phenotyping of the transcriptome has matured, and it is now possible to acquire hundreds of thousands of mRNA phenotypes at a cost matching those of SNPs. Proteomic and cell-based assays are also maturing rapidly. The acquisition of a personal genome along with a personal molecular phenome will provide an effective foundation for personalized medicine. Rodent models will be essential to test our ability to predict susceptibility and disease outcome using SNP data, molecular phenomes, and environmental exposures. These models will also be essential to test new treatments in a robust systems context that accounts for genetic variation.
Collapse
Affiliation(s)
- Robert W Williams
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, 855 Monroe Avenue, Memphis, TN 38163, USA.
| |
Collapse
|
69
|
Petretto E, Mangion J, Pravanec M, Hubner N, Aitman TJ. Integrated gene expression profiling and linkage analysis in the rat. Mamm Genome 2006; 17:480-9. [PMID: 16783629 DOI: 10.1007/s00335-005-0181-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2005] [Accepted: 01/24/2006] [Indexed: 12/01/2022]
Abstract
The combined application of genome-wide expression profiling from microarray experiments with genetic linkage analysis enables the mapping of expression quantitative trait loci (eQTLs) which are primary control points for gene expression across the genome. This approach allows for the dissection of primary and secondary genetic determinants of gene expression. The cis-acting eQTLs in practice are easier to investigate than the trans-regulated eQTLs because they are under simpler genetic control and are likely to be due to sequence variants within the gene itself or its neighboring regulatory elements. These genes are therefore candidates both for variation in gene expression and for contributions to whole-body phenotypes, particularly when these are located within known and relevant physiologic QTLs. Multiple trans-acting eQTLs tend to cluster to the same genetic location, implying shared regulatory control mechanisms that may be amenable to network analysis to identify gene clusters within the same metabolic pathway. Such clusters may ultimately underlie development of individual complex, whole-body phenotypes. The combined expression and linkage approach has been applied successfully in several mammalian species, including the rat which has specific features that demonstrate its value as a model for studying complex traits.
Collapse
Affiliation(s)
- Enrico Petretto
- MRC Clinical Sciences Centre, Faculty of Medicine, Imperial College, London W12 0NN, United Kingdom
| | | | | | | | | |
Collapse
|
70
|
Peirce JL, Li H, Wang J, Manly KF, Hitzemann RJ, Belknap JK, Rosen GD, Goodwin S, Sutter TR, Williams RW, Lu L. How replicable are mRNA expression QTL? Mamm Genome 2006; 17:643-56. [PMID: 16783644 DOI: 10.1007/s00335-005-0187-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2005] [Accepted: 03/20/2006] [Indexed: 10/24/2022]
Abstract
Applying quantitative trait analysis methods to genome-wide microarray-derived mRNA expression phenotypes in segregating populations is a valuable tool in the attempt to link high-level traits to their molecular causes. The massive multiple-testing issues involved in analyzing these data make the correct level of confidence to place in mRNA abundance quantitative trait loci (QTL) a difficult problem. We use a unique resource to directly test mRNA abundance QTL replicability in mice: paired recombinant inbred (RI) and F(2) data sets derived from C57BL/6J (B6) and DBA/2J (D2) inbred strains and phenotyped using the same Affymetrix arrays. We have one forebrain and one striatum data set pair. We describe QTL replication at varying stringencies in these data. For instance, 78% of mRNA expression QTL (eQTL) with genome-wide adjusted p < or = 0.0001 in RI data replicate at a genome-wide adjusted p < 0.05 or better. Replicated QTL are disproportionately putatively cis-acting, and approximately 75% have higher apparent expression levels associated with B6 genotypes, which may be partly due to probe set generation using B6 sequence. Finally, we note that while trans-acting QTL do not replicate well between data sets in general, at least one cluster of trans-acting QTL on distal Chr 1 is notably preserved between data sets.
Collapse
Affiliation(s)
- Jeremy L Peirce
- Center for Neuroscience, Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
71
|
Wang H, Chen YS. Changes in variance components of flanking marker genotypes under varying selection intensities. YI CHUAN XUE BAO = ACTA GENETICA SINICA 2006; 33:312-8. [PMID: 16625829 DOI: 10.1016/s0379-4172(06)60056-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Selection is practically ubiquitous during marker-QTL linkage analysis with an experimental population. Thus, it is necessary to investigate the impacts of selection upon linkage analyses in order to obtain unbiased estimates of QTL position and effect. In this article, by exploiting flanking markers through the widely applied half-sib design, we have developed the structures of three variance components, i.e., variance component between marker genotypes, polygenic variance component and recombinant variance component within marker genotypes. Changes in these variance components under varying selection intensities were investigated in this study to formulate the effects of selection on various variance components. Results showed clearly that all variance components presented were quite sensitive to changes in selection intensity. As selection intensity increased, all variance components declined by differing extents in a quadratic fashion. Comparatively speaking, the variance between marker genotypes decreased most drastically, followed by the polygenic variance within marker genotypes and then the recombinant variance within marker genotypes, which suggested a decrease of power for QTL linkage analysis. Therefore, steps should be taken to avoid as much as possible the presence of selection in real populations, so as to further eliminate the negative effects of selection on QTL linkage analysis.
Collapse
Affiliation(s)
- Hui Wang
- Guangdong Ocean University, Zhanjiang, China
| | | |
Collapse
|
72
|
|
73
|
Lan H, Chen M, Flowers JB, Yandell BS, Stapleton DS, Mata CM, Mui ETK, Flowers MT, Schueler KL, Manly KF, Williams RW, Kendziorski C, Attie AD. Combined expression trait correlations and expression quantitative trait locus mapping. PLoS Genet 2006; 2:e6. [PMID: 16424919 PMCID: PMC1331977 DOI: 10.1371/journal.pgen.0020006] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2005] [Accepted: 12/06/2005] [Indexed: 02/07/2023] Open
Abstract
Coordinated regulation of gene expression levels across a series of experimental conditions provides valuable information about the functions of correlated transcripts. The consideration of gene expression correlation over a time or tissue dimension has proved valuable in predicting gene function. Here, we consider correlations over a genetic dimension. In addition to identifying coregulated genes, the genetic dimension also supplies us with information about the genomic locations of putative regulatory loci. We calculated correlations among approximately 45,000 expression traits derived from 60 individuals in an F2 sample segregating for obesity and diabetes. By combining the correlation results with linkage mapping information, we were able to identify regulatory networks, make functional predictions for uncharacterized genes, and characterize novel members of known pathways. We found evidence of coordinate regulation of 174 G protein–coupled receptor protein signaling pathway expression traits. Of the 174 traits, 50 had their major LOD peak within 10 cM of a locus on Chromosome 2, and 81 others had a secondary peak in this region. We also characterized a Riken cDNA clone that showed strong correlation with stearoyl-CoA desaturase 1 expression. Experimental validation confirmed that this clone is involved in the regulation of lipid metabolism. We conclude that trait correlation combined with linkage mapping can reveal regulatory networks that would otherwise be missed if we studied only mRNA traits with statistically significant linkages in this small cross. The combined analysis is more sensitive compared with linkage mapping alone. In order to annotate gene function and identify potential members of regulatory networks, the authors explore correlation of expression profiles across a genetic dimension, namely genotypes segregating in a panel of 60 F2 mice derived from a cross used to explore diabetes in obese mice. They first identified 6,016 seed transcripts for which they observe that the gene expression is linked to a particular region of the genome. Then they searched for transcripts whose expression is highly correlated with the seed transcripts and tested for enrichment of common biological functions among the lists of correlated transcripts. They found and explored the properties of 1,341 sets of transcripts that share a particular “gene ontology” term. Thirty-eight seeds in the G protein–coupled receptor protein signaling pathway were correlated with 174 transcripts, all of which are also annotated as G protein–coupled receptor protein signaling pathway and 131 of which share a regulatory locus on Chromosome 2. The authors note many of these findings would have been missed by simple expression quantitative trait loci analysis without the correlation step. The approach was used to identify a common set of genes involved in lipid metabolism.
Collapse
Affiliation(s)
- Hong Lan
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Meng Chen
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Jessica B Flowers
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
- Department of Nutritional Sciences, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Brian S Yandell
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
- Department of Horticulture, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Donnie S Stapleton
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Christine M Mata
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Eric Ton-Keen Mui
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Matthew T Flowers
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Kathryn L Schueler
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Kenneth F Manly
- Departments of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Robert W Williams
- Departments of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, United States of America
| | - Christina Kendziorski
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Alan D Attie
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
74
|
Argmann CA, Chambon P, Auwerx J. Mouse phenogenomics: the fast track to "systems metabolism". Cell Metab 2005; 2:349-60. [PMID: 16330321 DOI: 10.1016/j.cmet.2005.11.002] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/10/2005] [Revised: 10/30/2005] [Accepted: 11/03/2005] [Indexed: 01/14/2023]
Abstract
With the completion of the many genomes, genetics is positioned to meet physiology. In this review, we summarize the coming of "systems metabolism" in mammals through the use of the mouse, as a model system to study metabolism. Building on mouse genetics with increasingly sophisticated clinical and molecular phenotyping strategies has enabled scientists to now tackle complex biomedical questions, such as those related to the pathogenesis of the common metabolic disorders. The ultimate goal of such strategies will be to mimic human metabolism with the click of a mouse.
Collapse
Affiliation(s)
- Carmen A Argmann
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, Centre National de la Recherche Scientifique/Institut Nationale de la Santé et de la Recherche Médicale/Université Louis Pasteur, 67404 Illkirch, France
| | | | | |
Collapse
|
75
|
Vasemägi A, Primmer CR. Challenges for identifying functionally important genetic variation: the promise of combining complementary research strategies. Mol Ecol 2005; 14:3623-42. [PMID: 16202085 DOI: 10.1111/j.1365-294x.2005.02690.x] [Citation(s) in RCA: 239] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Strategies for the identification of functional genetic variation underlying phenotypic traits of ecological and evolutionary importance have received considerable attention in the literature recently. This paper aims to bring together and compare the relative strengths and limitations of various potentially useful research strategies for dissecting functionally important genetic variation in a wide range of organisms. We briefly explore the relative strengths and limitations of traditional and emerging approaches and evaluate their potential use in free-living populations. While it is likely that much of the progress in functional genetic analyses will rely on progress in traditional model species, it is clear that with prudent choices of methods and appropriate sampling designs, much headway can be also made in a diverse range of species. We suggest that combining research approaches targeting different functional and biological levels can potentially increase understanding the genetic basis of ecological and evolutionary processes both in model and non-model organisms.
Collapse
Affiliation(s)
- A Vasemägi
- Department of Biology, University of Turku, Finland
| | | |
Collapse
|
76
|
Salvi S, Tuberosa R. To clone or not to clone plant QTLs: present and future challenges. TRENDS IN PLANT SCIENCE 2005; 10:297-304. [PMID: 15949764 DOI: 10.1016/j.tplants.2005.04.008] [Citation(s) in RCA: 229] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2004] [Revised: 03/24/2005] [Accepted: 04/26/2005] [Indexed: 05/02/2023]
Abstract
Recent technical advancements and refinement of analytical methods have enabled the loci (quantitative trait loci, QTLs) responsible for the genetic control of quantitative traits to be dissected molecularly. To date, most plant QTLs have been cloned using a positional cloning approach following identification in experimental crosses. In some cases, an association between sequence variation at a candidate gene and a phenotype has been established by analysing existing genetic accessions. These strategies can be refined using appropriate genetic materials and the latest developments in genomics platforms. We foresee that although QTL analysis and cloning addressing naturally occurring genetic variation should shed light on mechanisms of plant adaptation, a greater emphasis on approaches relying on mutagenesis and candidate gene validation is likely to accelerate the pace of discovering the genes underlying QTLs.
Collapse
Affiliation(s)
- Silvio Salvi
- Department of Agroenvironmental Science and Technology, University of Bologna, Viale Fanin, 44, 40127 Bologna, Italy
| | | |
Collapse
|
77
|
Zhang M, Montooth KL, Wells MT, Clark AG, Zhang D. Mapping multiple Quantitative Trait Loci by Bayesian classification. Genetics 2005; 169:2305-18. [PMID: 15520261 PMCID: PMC1449613 DOI: 10.1534/genetics.104.034181] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2004] [Accepted: 11/01/2004] [Indexed: 12/13/2022] Open
Abstract
We developed a classification approach to multiple quantitative trait loci (QTL) mapping built upon a Bayesian framework that incorporates the important prior information that most genotypic markers are not cotransmitted with a QTL or their QTL effects are negligible. The genetic effect of each marker is modeled using a three-component mixture prior with a class for markers having negligible effects and separate classes for markers having positive or negative effects on the trait. The posterior probability of a marker's classification provides a natural statistic for evaluating credibility of identified QTL. This approach performs well, especially with a large number of markers but a relatively small sample size. A heat map to visualize the results is proposed so as to allow investigators to be more or less conservative when identifying QTL. We validated the method using a well-characterized data set for barley heading values from the North American Barley Genome Mapping Project. Application of the method to a new data set revealed sex-specific QTL underlying differences in glucose-6-phosphate dehydrogenase enzyme activity between two Drosophila species. A simulation study demonstrated the power of this approach across levels of trait heritability and when marker data were sparse.
Collapse
Affiliation(s)
- Min Zhang
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA
| | | | | | | | | |
Collapse
|
78
|
Bjørnstad A, Westad F, Martens H. Analysis of genetic marker-phenotype relationships by jack-knifed partial least squares regression (PLSR). Hereditas 2005; 141:149-65. [PMID: 15660976 DOI: 10.1111/j.1601-5223.2004.01816.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
The utility of a relatively new multivariate method, bi-linear modelling by cross-validated partial least squares regression (PLSR), was investigated in the analysis of QTL. The distinguishing feature of PLSR is to reveal reliable covariance structures in data of different types with regard to the same set objects. Two matrices X (here: genetic markers) and Y (here: phenotypes) are interactively decomposed into latent variables (PLS components, or PCs) in a way which facilitates statistically reliable and graphically interpretable model building. Natural collinearities between input variables are utilized actively to stabilise the modelling, instead of being treated as a statistical problem. The importance of cross-validation/jack-knifing as an intuitively appealing way to avoid overfitting, is emphasized. Two datasets from chromosomal mapping studies of different complexity were chosen for illustration (QTL for tomato yield and for oat heading date). Results from PLSR analysis were compared to published results and to results using the package PLABQTL in these data sets. In all cases PLSR gave at least similar explained validation variances as the reported studies. An attractive feature is that PLSR allows the analysis of several traits/replicates in one analysis, and the direct visual identification of individuals with desirable marker genotypes. It is suggested that PLSR may be useful in structural and functional genomics and in marker assisted selection, particularly in cases with limited number of objects.
Collapse
Affiliation(s)
- Asmund Bjørnstad
- Department of Plant and Environmental Sciences, Agricultural University of Norway, As, Norway
| | | | | |
Collapse
|
79
|
Riccardi F, Gazeau P, Jacquemot MP, Vincent D, Zivy M. Deciphering genetic variations of proteome responses to water deficit in maize leaves. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2004; 42:1003-11. [PMID: 15707837 DOI: 10.1016/j.plaphy.2004.09.009] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2004] [Accepted: 09/29/2004] [Indexed: 05/20/2023]
Abstract
The proteome of the basal part of growing Zea mays leaves was analyzed from 4 to 14 d after stopping watering and in well watered controls. The relative quantity of 46 proteins was found to increase in leaves of plants submitted to water deficit. Different types of responses were observed, some proteins showing a constant increase during water deficit, while others showed stabilization after a first increase or a transient increase. Isoforms encoded by the same gene showed different responses. The response to water deficit showed genetic variation. Some increased proteins were induced specifically in one of the two studied genotypes (e.g. ASR1) while others were significantly induced in both genotypes but to a different level or with different kinetics. Analyses of relations between protein quantities, relative water content (RWC) and abscisic acid (ABA) concentration allowed us to show that the quantitative variation of some proteins (e.g. ABA45 and OSR40 proteins) was linked to differences in ABA accumulation between the genotypes. Other proteins showed genetic variations that were not related to differences in water status or ABA concentration (e.g. a cystatin). Data obtained from these experiments, together with data from other experiments, contribute to the characterization of maize proteome response to drought in different conditions and in different genotypes. This characterization allows the search for candidate proteins, i.e. for protein whose genetic variation of expression could be partly responsible for the variability of plant responses to drought.
Collapse
Affiliation(s)
- Frédérique Riccardi
- UMR de Génétique Végétale du Moulon, Inra/CNRS/UPS/INAPG, Ferme du Moulon, 91190 Gif-sur-Yvette, France
| | | | | | | | | |
Collapse
|
80
|
Montooth KL, Marden JH, Clark AG. Mapping Determinants of Variation in Energy Metabolism, Respiration and Flight in Drosophila. Genetics 2003; 165:623-35. [PMID: 14573475 PMCID: PMC1462806 DOI: 10.1093/genetics/165.2.623] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
AbstractWe employed quantitative trait locus (QTL) mapping to dissect the genetic architecture of a hierarchy of functionally related physiological traits, including metabolic enzyme activity, metabolite storage, metabolic rate, and free-flight performance in recombinant inbred lines of Drosophila melanogaster. We identified QTL underlying variation in glycogen synthase, hexokinase, phosphoglucomutase, and trehalase activity. In each case variation mapped away from the enzyme-encoding loci, indicating that trans-acting regions of the genome are important sources of variation within the metabolic network. Individual QTL associated with variation in metabolic rate and flight performance explained between 9 and 35% of the phenotypic variance. Bayesian QTL analysis identified epistatic effects underlying variation in flight velocity, metabolic rate, glycogen content, and several metabolic enzyme activities. A region on the third chromosome was associated with expression of the glucose-6-phosphate branchpoint enzymes and with metabolic rate and flight performance. These genomic regions are of special interest as they may coordinately regulate components of energy metabolism with effects on whole-organism physiological performance. The complex biochemical network is encoded by an equally complex network of interacting genetic elements with potentially pleiotropic effects. This has important consequences for the evolution of performance traits that depend upon these metabolic networks.
Collapse
Affiliation(s)
- Kristi L Montooth
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA.
| | | | | |
Collapse
|
81
|
Rockman MV. Idiomatic (gene) expressions. Bioessays 2003; 25:421-4. [PMID: 12717811 DOI: 10.1002/bies.10279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Hidden among the myriad nucleotide variants that constitute each species' gene pool are a few variants that contribute to phenotypic variation. Many of these differences that make a difference are non-coding cis-regulatory variants, which, unlike coding variants, can only be identified through laborious experimental analysis. Recently, Cowles et al.1 described a screening method that does an end-run around this problem by searching for genes whose cis regulation varies without having to find the polymorphic nucleotides that influence transcription. While we will continue to require a diverse arsenal of experimental methods, this versatile method will speed the identification of functional genetic variation.
Collapse
|
82
|
Naciri-Graven Y, Goudet J. The additive genetic variance after bottlenecks is affected by the number of loci involved in epistatic interactions. Evolution 2003; 57:706-16. [PMID: 12778542 DOI: 10.1111/j.0014-3820.2003.tb00284.x] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We investigated the role of the number of loci coding for a neutral trait on the release of additive variance for this trait after population bottlenecks. Different bottleneck sizes and durations were tested for various matrices of genotypic values, with initial conditions covering the allele frequency space. We used three different types of matrices. First, we extended Cheverud and Routman's model by defining matrices of "pure" epistasis for three and four independent loci; second, we used genotypic values drawn randomly from uniform, normal, and exponential distributions; and third we used two models of simple metabolic pathways leading to physiological epistasis. For all these matrices of genotypic values except the dominant metabolic pathway, we find that, as the number of loci increases from two to three and four, an increase in the release of additive variance is occurring. The amount of additive variance released for a given set of genotypic values is a function of the inbreeding coefficient, independently of the size and duration of the bottleneck. The level of inbreeding necessary to achieve maximum release in additive variance increases with the number of loci. We find that additive-by-additive epistasis is the type of epistasis most easily converted into additive variance. For a wide range of models, our results show that epistasis, rather than dominance, plays a significant role in the increase of additive variance following bottlenecks.
Collapse
Affiliation(s)
- Yamama Naciri-Graven
- Institute of Ecology, Biology Building, Lausanne University, CH 1015 Lausanne, Switzerland.
| | | |
Collapse
|
83
|
Schadt EE, Monks SA, Drake TA, Lusis AJ, Che N, Colinayo V, Ruff TG, Milligan SB, Lamb JR, Cavet G, Linsley PS, Mao M, Stoughton RB, Friend SH. Genetics of gene expression surveyed in maize, mouse and man. Nature 2003; 422:297-302. [PMID: 12646919 DOI: 10.1038/nature01434] [Citation(s) in RCA: 1038] [Impact Index Per Article: 49.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2002] [Accepted: 01/10/2003] [Indexed: 11/09/2022]
Abstract
Treating messenger RNA transcript abundances as quantitative traits and mapping gene expression quantitative trait loci for these traits has been pursued in gene-specific ways. Transcript abundances often serve as a surrogate for classical quantitative traits in that the levels of expression are significantly correlated with the classical traits across members of a segregating population. The correlation structure between transcript abundances and classical traits has been used to identify susceptibility loci for complex diseases such as diabetes and allergic asthma. One study recently completed the first comprehensive dissection of transcriptional regulation in budding yeast, giving a detailed glimpse of a genome-wide survey of the genetics of gene expression. Unlike classical quantitative traits, which often represent gross clinical measurements that may be far removed from the biological processes giving rise to them, the genetic linkages associated with transcript abundance affords a closer look at cellular biochemical processes. Here we describe comprehensive genetic screens of mouse, plant and human transcriptomes by considering gene expression values as quantitative traits. We identify a gene expression pattern strongly associated with obesity in a murine cross, and observe two distinct obesity subtypes. Furthermore, we find that these obesity subtypes are under the control of different loci.
Collapse
MESH Headings
- Animals
- Chromosome Mapping
- Chromosomes, Human, Pair 20/genetics
- Chromosomes, Mammalian/genetics
- Crosses, Genetic
- Female
- Genomics/methods
- Humans
- Lod Score
- Male
- Mice/genetics
- Mice, Inbred C57BL
- Mice, Inbred DBA
- Obesity/genetics
- Oligonucleotide Array Sequence Analysis
- Pedigree
- Polymorphism, Genetic/genetics
- Quantitative Trait Loci/genetics
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- Transcription, Genetic/genetics
- Tumor Cells, Cultured
- Zea mays/genetics
Collapse
Affiliation(s)
- Eric E Schadt
- Rosetta Inpharmatics, LLC, 12040 115th Avenue N.E., Kirkland, Washington 98034, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
84
|
Rifkin SA, Kim J, White KP. Evolution of gene expression in the Drosophila melanogaster subgroup. Nat Genet 2003; 33:138-44. [PMID: 12548287 DOI: 10.1038/ng1086] [Citation(s) in RCA: 265] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2002] [Accepted: 01/02/2003] [Indexed: 11/08/2022]
Abstract
Little is known about broad patterns of variation and evolution of gene expression during any developmental process. Here we investigate variation in genome-wide gene expression among Drosophila simulans, Drosophila yakuba and four strains of Drosophila melanogaster during a major developmental transition--the start of metamorphosis. Differences in gene activity between these lineages follow a phylogenetic pattern, and 27% of all of the genes in these genomes differ in their developmental gene expression between at least two strains or species. We identify, on a gene-by-gene basis, the evolutionary forces that shape this variation and show that, both within the transcriptional network that controls metamorphosis and across the whole genome, the expression changes of transcription factor genes are relatively stable, whereas those of their downstream targets are more likely to have evolved. Our results demonstrate extensive evolution of developmental gene expression among closely related species.
Collapse
Affiliation(s)
- Scott A Rifkin
- Department of Ecology and Evolutionary Biology, Yale University, PO Box 208106, New Haven, Connecticut 06520-8106, USA
| | | | | |
Collapse
|
85
|
Naciri-Graven Y, Goudet J. THE ADDITIVE GENETIC VARIANCE AFTER BOTTLENECKS IS AFFECTED BY THE NUMBER OF LOCI INVOLVED IN EPISTATIC INTERACTIONS. Evolution 2003. [DOI: 10.1554/0014-3820(2003)057[0706:tagvab]2.0.co;2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
86
|
Thiellement H, Zivy M, Plomion C. Combining proteomic and genetic studies in plants. J Chromatogr B Analyt Technol Biomed Life Sci 2002; 782:137-49. [PMID: 12458003 DOI: 10.1016/s1570-0232(02)00553-6] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Plant proteomics is still in its infancy, although numerous experiments have been undertaken since the end of the 1970s. In this review we focus on the interactions between proteomics and genetics. A given genome can express various proteomes according to differentiation, development, tissues, cells and subcellular compartments, and proteomes are modified in function of biotic and abiotic environment. These different proteomes and the way they respond to environment can be compared between genotypes, allowing the characterization of mutants or lines, the study of mutation pleiotropic effects, the genetic mapping of expressed genes. These comparisons also permit to hypothesize for "candidate proteins" that might be involved in the genetic variation of traits of economic or agronomic interest.
Collapse
Affiliation(s)
- Hervé Thiellement
- Unité Mixte de Génétique Végétale, INRA/CNRS, la Ferme du Moulon, F-91190 Gif-sur-Yvette, France.
| | | | | |
Collapse
|
87
|
Abstract
The modern generalization of sedentary life and caloric abundance has created new physiological conditions capable of changing the level of expression of a number of genes involved in fuel metabolism and body weight regulation. It is likely that the genetic variants or alleles of these genes have in the past participated in the adaptation of human physiology to its evolutionary constraints. The nature and prevalence of polymorphisms responsible for the quantitative variation of complex metabolic traits may have been different among human populations, depending on their environment and ancestral genetic background. These polymorphisms could likely explain differences in disease susceptibility and prevalence among groups of humans. From complex traits to potentially complex alleles, understanding the molecular genetic basis underlying quantitative variation will continue to be a growing concern among geneticists dealing with obesity and type 2 diabetes, the main fuel disorders of the modern era. Genomics and genetic epidemiology now allow high-level linkage and association studies to be designed. But the pooling of large trans-geographic cohorts may in fact increase the genetic heterogeneity of studied traits and dilute genotype-phenotype associations. In this article, we underscore the importance of selecting the traits to be subjected to quantitative genetic analysis. Although this is not possible for most other multifactorial diseases, obesity and type 2 diabetes can be subjected to a pregenetic dissection of complexity into simpler quantitative traits (QTs). This dissection is based on the pathogenic mechanisms, and the time course of the traits, and the individuals' age, within the predisease period rather than on descriptive parameters after disease diagnosis. We defend that this approach of phenotypes may ease future associations to be established between QTs of intermediate complexity and genetic polymorphisms.
Collapse
Affiliation(s)
- Pierre Bougnères
- Service d'Endocrinologie, Unité 561 INSERM, Hôpital Saint Vincent de Paul, Paris, France.
| |
Collapse
|
88
|
Abstract
Changes in gene expression and regulation--due in particular to the evolution of cis-regulatory DNA sequences--may underlie many evolutionary changes in phenotypes, yet little is known about the distribution of such variation in populations. We present in this study the first survey of experimentally validated functional cis-regulatory polymorphism. These data are derived from more than 140 polymorphisms involved in the regulation of 107 genes in Homo sapiens, the eukaryote species with the most available data. We find that functional cis-regulatory variation is widespread in the human genome and that the consequent variation in gene expression is twofold or greater for 63% of the genes surveyed. Transcription factor-DNA interactions are highly polymorphic, and regulatory interactions have been gained and lost within human populations. On average, humans are heterozygous at more functional cis-regulatory sites (>16,000) than at amino acid positions (<13,000), in part because of an overrepresentation among the former in multiallelic tandem repeat variation, especially (AC)(n) dinucleotide microsatellites. The role of microsatellites in gene expression variation may provide a larger store of heritable phenotypic variation, and a more rapid mutational input of such variation, than has been realized. Finally, we outline the distinctive consequences of cis-regulatory variation for the genotype-phenotype relationship, including ubiquitous epistasis and genotype-by-environment interactions, as well as underappreciated modes of pleiotropy and overdominance. Ordinary small-scale mutations contribute to pervasive variation in transcription rates and consequently to patterns of human phenotypic variation.
Collapse
|
89
|
Tuberosa R, Salvi S, Sanguineti MC, Landi P, Maccaferri M, Conti S. Mapping QTLs regulating morpho-physiological traits and yield: case studies, shortcomings and perspectives in drought-stressed maize. ANNALS OF BOTANY 2002; 89 Spec No:941-63. [PMID: 12102519 PMCID: PMC4233811 DOI: 10.1093/aob/mcf134] [Citation(s) in RCA: 141] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Comparative analysis of a number of studies in drought-stressed maize (Zea mays L.) reporting quantitative trait loci (QTLs) for abscisic acid concentration, root characteristics, other morpho-physiological traits (MPTs) and grain yield (GY) reveals their complex genetic basis and the influence of the genetic background and the environment on QTL effects. Chromosome regions (e.g. near umc11 on chromosome 1 and near csu133 on chromosome 2) with QTLs controlling a number of MPTs and GY across populations and conditions of different water supply have been identified. Examples are presented on the use of QTL information to elucidate the genetic and physiological bases of the association among MPTs and GY. The QTL approach allows us to develop hypotheses accounting for these associations which can be further tested by developing near isogenic lines (NILs) differing for the QTL alleles. NILs also allow for a more accurate assessment of the breeding value of MPTs and, in some cases, may allow for the map-based cloning of the gene(s) underlying the QTL. Although QTL analysis is still time-consuming and resource-demanding, its integration with genomics and post-genomics approaches (e.g. transcriptome, proteome and metabolome analyses) will play an increasingly important role for the identification and validation of candidate genes affecting MPTs and GY.
Collapse
Affiliation(s)
- Roberto Tuberosa
- Department of Agroenvironmental Science and Technology, University of Bologna, Italy.
| | | | | | | | | | | |
Collapse
|
90
|
Klose J, Nock C, Herrmann M, Stühler K, Marcus K, Blüggel M, Krause E, Schalkwyk LC, Rastan S, Brown SDM, Büssow K, Himmelbauer H, Lehrach H. Genetic analysis of the mouse brain proteome. Nat Genet 2002; 30:385-93. [PMID: 11912495 DOI: 10.1038/ng861] [Citation(s) in RCA: 187] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Proteome analysis is a fundamental step in systematic functional genomics. Here we have resolved 8,767 proteins from the mouse brain proteome by large-gel two-dimensional electrophoresis. We detected 1,324 polymorphic proteins from the European collaborative interspecific backcross. Of these, we mapped 665 proteins genetically and identified 466 proteins by mass spectrometry. Qualitatively polymorphic proteins, to 96%, reflect changes in conformation and/or mass. Quantitatively polymorphic proteins show a high frequency (73%) of allele-specific transmission in codominant heterozygotes. Variations in protein isoforms and protein quantity often mapped to chromosomal positions different from that of the structural gene, indicating that single proteins may act as polygenic traits. Genetic analysis of proteomes may detect the types of polymorphism that are most relevant in disease-association studies.
Collapse
Affiliation(s)
- Joachim Klose
- Institut für Humangenetik, Humboldt-Universität zu Berlin, Charité, Campus Virchow-Klinikum, Augustenburger Platz 1, D-13353 Berlin, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
91
|
Consoli L, Lefèvre A, Zivy M, de Vienne D, Damerval C. QTL analysis of proteome and transcriptome variations for dissecting the genetic architecture of complex traits in maize. PLANT MOLECULAR BIOLOGY 2002; 48:575-81. [PMID: 11999835 DOI: 10.1023/a:1014840810203] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
In this review, we present some studies on genetic analysis of proteome and transcriptome variations, which exemplify new strategies for a better understanding of the molecular and genetic bases of complex traits. A large genetic variability was revealed at the proteome expression level, which raised the possibility to predict phenotypical performance on the basis of gene product variability. This approach yielded limited results, but could be re-newed by extensive identification of proteins now allowed by mass spectrometry. The dissection of the genetic basis of the variation of individual protein amounts proves very powerful to select 'candidate' proteins, physiologically relevant for a given phenotypical trait, as shown by a study on the effect of water stress in maize. In order to investigate factors of grain quality in maize, we selected a regulatory locus known to control the expression of several storage protein genes, Opaque-2, and investigated the relationships between variability in zein amount and composition and the molecular polymorphism at this locus. Moreover, a QTL analysis revealed that the variability in Opaque-2 transcript abundance was controlled by several polymorphic trans-acting regulators unlinked to the Opaque-2 structural gene. Such genetic approaches should represent additional tools for physiological analysis of the huge amounts of data generated by transcritome and proteome projects.
Collapse
Affiliation(s)
- L Consoli
- Station de Génétique Végétale INRA/INA-PG/UPS, La Ferme du Moulon, Gif-sur-Yvette, France
| | | | | | | | | |
Collapse
|
92
|
Korol AB, Ronin YI, Itskovich AM, Peng J, Nevo E. Enhanced efficiency of quantitative trait loci mapping analysis based on multivariate complexes of quantitative traits. Genetics 2001; 157:1789-803. [PMID: 11290731 PMCID: PMC1461583 DOI: 10.1093/genetics/157.4.1789] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
An approach to increase the efficiency of mapping quantitative trait loci (QTL) was proposed earlier by the authors on the basis of bivariate analysis of correlated traits. The power of QTL detection using the log-likelihood ratio (LOD scores) grows proportionally to the broad sense heritability. We found that this relationship holds also for correlated traits, so that an increased bivariate heritability implicates a higher LOD score, higher detection power, and better mapping resolution. However, the increased number of parameters to be estimated complicates the application of this approach when a large number of traits are considered simultaneously. Here we present a multivariate generalization of our previous two-trait QTL analysis. The proposed multivariate analogue of QTL contribution to the broad-sense heritability based on interval-specific calculation of eigenvalues and eigenvectors of the residual covariance matrix allows prediction of the expected QTL detection power and mapping resolution for any subset of the initial multivariate trait complex. Permutation technique allows chromosome-wise testing of significance for the whole trait complex and the significance of the contribution of individual traits owing to: (a) their correlation with other traits, (b) dependence on the chromosome in question, and (c) both a and b. An example of application of the proposed method on a real data set of 11 traits from an experiment performed on an F(2)/F(3) mapping population of tetraploid wheat (Triticum durum x T. dicoccoides) is provided.
Collapse
Affiliation(s)
- A B Korol
- Institute of Evolution, University of Haifa, Haifa 31905, Israel.
| | | | | | | | | |
Collapse
|
93
|
Plomion C, Pionneau C, Brach J, Costa P, Baillères H. Compression wood-responsive proteins in developing xylem of maritime pine (Pinus pinaster ait.). PLANT PHYSIOLOGY 2000; 123:959-69. [PMID: 10889244 PMCID: PMC59058 DOI: 10.1104/pp.123.3.959] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/1999] [Accepted: 03/21/2000] [Indexed: 05/17/2023]
Abstract
When a conifer shoot is displaced from its vertical position, compression wood (CW) is formed on the under side and can eventually return the shoot to its original position. Changes in cell wall structure and chemistry associated with CW are likely to result from differential gene/protein expression. Two-dimensional polyacrylamide gel electrophoresis of differentiating xylem proteins was combined with the physical characterization of wooden samples to identify and characterize CW-responsive proteins. Differentiating xylem was harvested from a 22-year-old crooked maritime pine (Pinus pinaster Ait.) tree. Protein extracted from different samples were revealed by high-resolution silver stained two-dimensional polyacrylamide gel electrophoresis and analyzed with a computer-assisted system for single spot quantification. Growth strain (GS) measurements allowed xylem samples to be classified quantitatively from normal wood to CW. Regression of lignin and cellulose content on GS showed that an increase in the percentage of lignin and a decrease of the percentage of cellulose corresponded to increasing GS values, i.e. CW. Of the 137 studied spots, 19% were significantly associated with GS effect. Up-regulated proteins included 1-aminocyclopropane-1-carboxylate oxidase (an ethylene forming enzyme), a putative transcription factor, two lignification genes (caffeic O-methyltransferase and caffeoyl CoA-O-methyltransferase), members of the S-adenosyl-L-methionine-synthase gene family, and enzymes involved in nitrogen and carbon assimilation (glutamine synthetase and fructokinase). A clustered correlation analysis was performed to study simultaneously protein expression along a gradient of gravistimulated stressed xylem tissue. Proteins were found to form "expression clusters" that could identify: (a) Gene product under similar control mechanisms, (b) partner proteins, or (c) functional groups corresponding to specialized pathways. The possibility of obtaining regulatory correlations and anticorrelations between proteins provide us with a new category of homology (regulatory homology) in tracing functional relationships.
Collapse
Affiliation(s)
- C Plomion
- Institut National de la Recherche Agronomique, Equipe de Génétique et Amélioration des Arbres Forestiers, BP45, 33610 Pierroton, France.
| | | | | | | | | |
Collapse
|
94
|
Omholt SW, Plahte E, Oyehaug L, Xiang K. Gene regulatory networks generating the phenomena of additivity, dominance and epistasis. Genetics 2000; 155:969-80. [PMID: 10835414 PMCID: PMC1461103 DOI: 10.1093/genetics/155.2.969] [Citation(s) in RCA: 122] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We show how the phenomena of genetic dominance, overdominance, additivity, and epistasis are generic features of simple diploid gene regulatory networks. These regulatory network models are together sufficiently complex to catch most of the suggested molecular mechanisms responsible for generating dominant mutations. These include reduced gene dosage, expression or protein activity (haploinsufficiency), increased gene dosage, ectopic or temporarily altered mRNA expression, increased or constitutive protein activity, and dominant negative effects. As classical genetics regards the phenomenon of dominance to be generated by intralocus interactions, we have studied two one-locus models, one with a negative autoregulatory feedback loop, and one with a positive autoregulatory feedback loop. To include the phenomena of epistasis and downstream regulatory effects, a model of a three-locus signal transduction network is also analyzed. It is found that genetic dominance as well as overdominance may be an intra- as well as interlocus interaction phenomenon. In the latter case the dominance phenomenon is intimately connected to either feedback-mediated epistasis or downstream-mediated epistasis. It appears that in the intra- as well as the interlocus case there is considerable room for additive gene action, which may explain to some degree the predictive power of quantitative genetic theory, with its emphasis on this type of gene action. Furthermore, the results illuminate and reconcile the prevailing explanations of heterosis, and they support the old conjecture that the phenomenon of dominance may have an evolutionary explanation related to life history strategy.
Collapse
Affiliation(s)
- S W Omholt
- Department of Animal Science, Agricultural University of Norway, Aas.
| | | | | | | |
Collapse
|
95
|
|
96
|
Bost B, Dillmann C, de Vienne D. Fluxes and metabolic pools as model traits for quantitative genetics. I. The L-shaped distribution of gene effects. Genetics 1999; 153:2001-12. [PMID: 10581302 PMCID: PMC1460848 DOI: 10.1093/genetics/153.4.2001] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The fluxes through metabolic pathways can be considered as model quantitative traits, whose QTL are the polymorphic loci controlling the activity or quantity of the enzymes. Relying on metabolic control theory, we investigated the relationships between the variations of enzyme activity along metabolic pathways and the variations of the flux in a population with biallelic QTL. Two kinds of variations were taken into account, the variation of the average enzyme activity across the loci, and the variation of the activity of each enzyme of the pathway among the individuals of the population. We proposed analytical approximations for the flux mean and variance in the population as well as for the additive and dominance variances of the individual QTL. Monte Carlo simulations based on these approximations showed that an L-shaped distribution of the contributions of individual QTL to the flux variance (R(2)) is consistently expected in an F(2) progeny. This result could partly account for the classically observed L-shaped distribution of QTL effects for quantitative traits. The high correlation we found between R(2) value and flux control coefficients variance suggests that such a distribution is an intrinsic property of metabolic pathways due to the summation property of control coefficients.
Collapse
Affiliation(s)
- B Bost
- Station de Génétique Végétale, INRA/UPS/INAPG, Ferme du Moulon, 91190 Gif-sur-Yvette, France.
| | | | | |
Collapse
|
97
|
Thiellement H, Bahrman N, Damerval C, Plomion C, Rossignol M, Santoni V, de Vienne D, Zivy M. Proteomics for genetic and physiological studies in plants. Electrophoresis 1999; 20:2013-26. [PMID: 10451110 DOI: 10.1002/(sici)1522-2683(19990701)20:10<2013::aid-elps2013>3.0.co;2-#] [Citation(s) in RCA: 146] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Proteomics is becoming a necessity in plant biology, as it is in medicine, zoology and microbiology, for deciphering the function and role of the genes that are or will be sequenced. In this review we focus on the various, mainly genetic, applications of the proteomic tools that have been developed in recent years: characterization of individuals or lines, estimation of genetic variability within and between populations, establishment of genetic distances that can be used in phylogenetic studies, characterization of mutants and localization of the genes encoding the revealed proteins. Improvements in specifically devoted software have permitted precise quantification of the variation in amounts of proteins, leading to the concept of "protein quantity loci" which, combined with the "quantitative trait loci" approach, results in testable hypotheses regarding the role of "candidate proteins" in the metabolism or phenotype under study. This new development is exemplified by the reaction of plants to drought, a trait of major agronomic interest. The accumulation of data regarding genomic and cDNA sequencing will be connected to the protein databases currently developed in plants.
Collapse
Affiliation(s)
- H Thiellement
- Département de Botanique et Biologie Végétale, Université de Genève, Switzerland.
| | | | | | | | | | | | | | | |
Collapse
|
98
|
Davis GL, McMullen MD, Baysdorfer C, Musket T, Grant D, Staebell M, Xu G, Polacco M, Koster L, Melia-Hancock S, Houchins K, Chao S, Coe EH. A maize map standard with sequenced core markers, grass genome reference points and 932 expressed sequence tagged sites (ESTs) in a 1736-locus map. Genetics 1999; 152:1137-72. [PMID: 10388831 PMCID: PMC1460676 DOI: 10.1093/genetics/152.3.1137] [Citation(s) in RCA: 121] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We have constructed a 1736-locus maize genome map containing1156 loci probed by cDNAs, 545 probed by random genomic clones, 16 by simple sequence repeats (SSRs), 14 by isozymes, and 5 by anonymous clones. Sequence information is available for 56% of the loci with 66% of the sequenced loci assigned functions. A total of 596 new ESTs were mapped from a B73 library of 5-wk-old shoots. The map contains 237 loci probed by barley, oat, wheat, rice, or tripsacum clones, which serve as grass genome reference points in comparisons between maize and other grass maps. Ninety core markers selected for low copy number, high polymorphism, and even spacing along the chromosome delineate the 100 bins on the map. The average bin size is 17 cM. Use of bin assignments enables comparison among different maize mapping populations and experiments including those involving cytogenetic stocks, mutants, or quantitative trait loci. Integration of nonmaize markers in the map extends the resources available for gene discovery beyond the boundaries of maize mapping information into the expanse of map, sequence, and phenotype information from other grass species. This map provides a foundation for numerous basic and applied investigations including studies of gene organization, gene and genome evolution, targeted cloning, and dissection of complex traits.
Collapse
Affiliation(s)
- G L Davis
- USDA-ARS, Midwest Area, Plant Genetics Research Unit, Columbia, Missouri 65211, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
99
|
Wu R, Li B. A multiplicative-epistatic model for analyzing interspecific differences in outcrossing species. Biometrics 1999; 55:355-65. [PMID: 11318188 DOI: 10.1111/j.0006-341x.1999.00355.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Epistasis may play an important role in evolution and speciation. Under multiplicative interactions between different loci, an analytical model is proposed to estimate genetic parameters at the individual locus level that contribute to interspecific differences in outcrossing species. The multiplicative epistasis model, inferred from a number of animal and plant experiments, suggests that genotypes at a pair of loci have genotypic values equal to the product of genotypic values at the two different loci. By considering the genetic property of outcrossing species (i.e., high polymorphisms) in the multilevel family structure analysis for an intra- and interspecific factorial mating design, a method is developed to provide estimates for allele frequencies and additive and dominant effects at individual loci in each of the two parental populations, the genotypic values of newly formed heterozygotes through species combination each with one allele from a parental population and the second from the other parental population, and the numbers of genetic factors that lead to species differentiation. Use of clones offers a tremendous power to test the adequacy of the model. However, the utilization of the model with species that cannot be cloned is also discussed. An example with interspecific hybrids of two forest tree species is used to demonstrate the model.
Collapse
Affiliation(s)
- R Wu
- Department of Forestry, North Carolina State University, Raleigh 27695-8008, USA.
| | | |
Collapse
|
100
|
From Genome to Proteome. Proceedings of the 3rd Siena two-dimensional electrophoresis meeting. Siena, Italy, August 31-September 3, 1998. Electrophoresis 1999; 20:643-1119. [PMID: 10428607 DOI: 10.1002/(sici)1522-2683(19990101)20:4/5<643::aid-elps643>3.0.co;2-m] [Citation(s) in RCA: 71] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Within the framework of a pilot project on the analysis of the mouse proteome, we investigated C57BL/6 mice (Mus musculus), a standard inbred strain of the mouse, starting with the analysis of brain, liver and heart proteins. Tissue extraction and the separation of proteins were performed with techniques offering a maximum of resolution. Proteins separated were analyzed by mass spectrometry. Gene-protein identification was performed by genetic analyses using the European Collaborative Interspecific Backcross (EUCIB), established from the two mouse species Mus musculus and Mus spretus. On the basis of protein polymorphisms we mapped hundreds of genes on the mouse chromosomes, allowing us new insight into the relationship between genotype and phenotype of proteins. In particular, the results showed that protein modifications can be genetically determined, therefore representing their own class of protein phenotypes. In this context, results are discussed suggesting that phenotypes of single protein species may result from several genes. Accordingly, proteins are considered as polygenic traits. In contrast, one example demonstrates that proteins may also have pleiotropic effects: a single gene mutation (a single altered protein) may affect several other proteins. From these studies we conclude that gene-related functional proteomics will show in the future that genetic diseases, defined today by clinical symptoms and considered as etiological entireties, can be subdivided into different diseases according to different affected genes.
Collapse
|