1
|
Bhattacharya A, Cui Y. A GPU-accelerated algorithm for biclustering analysis and detection of condition-dependent coexpression network modules. Sci Rep 2017. [PMID: 28646174 PMCID: PMC5482832 DOI: 10.1038/s41598-017-04070-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
In the analysis of large-scale gene expression data, it is important to identify groups of genes with common expression patterns under certain conditions. Many biclustering algorithms have been developed to address this problem. However, comprehensive discovery of functionally coherent biclusters from large datasets remains a challenging problem. Here we propose a GPU-accelerated biclustering algorithm, based on searching for the largest Condition-dependent Correlation Subgroups (CCS) for each gene in the gene expression dataset. We compared CCS with thirteen widely used biclustering algorithms. CCS consistently outperformed all the thirteen biclustering algorithms on both synthetic and real gene expression datasets. As a correlation-based biclustering method, CCS can also be used to find condition-dependent coexpression network modules. We implemented the CCS algorithm using C and implemented the parallelized CCS algorithm using CUDA C for GPU computing. The source code of CCS is available from https://github.com/abhatta3/Condition-dependent-Correlation-Subgroups-CCS.
Collapse
Affiliation(s)
- Anindya Bhattacharya
- Department of Microbiology, Immunology and Biochemistry, Memphis, TN, 38163, USA. .,Center for Integrative and Translational Genomics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA. .,Department of Computer Science and Engineering, University of California, San Diego, CA, 92093, USA.
| | - Yan Cui
- Department of Microbiology, Immunology and Biochemistry, Memphis, TN, 38163, USA. .,Center for Integrative and Translational Genomics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA.
| |
Collapse
|
2
|
Abstract
The Bayesian Network Webserver (BNW, http://compbio.uthsc.edu/BNW ) is an integrated platform for Bayesian network modeling of biological datasets. It provides a web-based network modeling environment that seamlessly integrates advanced algorithms for probabilistic causal modeling and reasoning with Bayesian networks. BNW is designed for precise modeling of relatively small networks that contain less than 20 nodes. The structure learning algorithms used by BNW guarantee the discovery of the best (most probable) network structure given the data. To facilitate network modeling across multiple biological levels, BNW provides a very flexible interface that allows users to assign network nodes into different tiers and define the relationships between and within the tiers. This function is particularly useful for modeling systems genetics datasets that often consist of multiscalar heterogeneous genotype-to-phenotype data. BNW enables users to, within seconds or minutes, go from having a simply formatted input file containing a dataset to using a network model to make predictions about the interactions between variables and the potential effects of experimental interventions. In this chapter, we will introduce the functions of BNW and show how to model systems genetics datasets with BNW.
Collapse
|
3
|
Ghazalpour A, Bennett B, Petyuk VA, Orozco L, Hagopian R, Mungrue IN, Farber CR, Sinsheimer J, Kang HM, Furlotte N, Park CC, Wen PZ, Brewer H, Weitz K, Camp DG, Pan C, Yordanova R, Neuhaus I, Tilford C, Siemers N, Gargalovic P, Eskin E, Kirchgessner T, Smith DJ, Smith RD, Lusis AJ. Comparative analysis of proteome and transcriptome variation in mouse. PLoS Genet 2011; 7:e1001393. [PMID: 21695224 PMCID: PMC3111477 DOI: 10.1371/journal.pgen.1001393] [Citation(s) in RCA: 444] [Impact Index Per Article: 34.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2010] [Accepted: 05/10/2011] [Indexed: 12/11/2022] Open
Abstract
The relationships between the levels of transcripts and the levels of the proteins they encode have not been examined comprehensively in mammals, although previous work in plants and yeast suggest a surprisingly modest correlation. We have examined this issue using a genetic approach in which natural variations were used to perturb both transcript levels and protein levels among inbred strains of mice. We quantified over 5,000 peptides and over 22,000 transcripts in livers of 97 inbred and recombinant inbred strains and focused on the 7,185 most heritable transcripts and 486 most reliable proteins. The transcript levels were quantified by microarray analysis in three replicates and the proteins were quantified by Liquid Chromatography–Mass Spectrometry using O(18)-reference-based isotope labeling approach. We show that the levels of transcripts and proteins correlate significantly for only about half of the genes tested, with an average correlation of 0.27, and the correlations of transcripts and proteins varied depending on the cellular location and biological function of the gene. We examined technical and biological factors that could contribute to the modest correlation. For example, differential splicing clearly affects the analyses for certain genes; but, based on deep sequencing, this does not substantially contribute to the overall estimate of the correlation. We also employed genome-wide association analyses to map loci controlling both transcript and protein levels. Surprisingly, little overlap was observed between the protein- and transcript-mapped loci. We have typed numerous clinically relevant traits among the strains, including adiposity, lipoprotein levels, and tissue parameters. Using correlation analysis, we found that a low number of clinical trait relationships are preserved between the protein and mRNA gene products and that the majority of such relationships are specific to either the protein levels or transcript levels. Surprisingly, transcript levels were more strongly correlated with clinical traits than protein levels. In light of the widespread use of high-throughput technologies in both clinical and basic research, the results presented have practical as well as basic implications. An old dogma in biology states that, in every cell, the flow of biological information is from DNA to RNA to proteins and that the latter act as a working force to determine the organism's phenotype. This model predicts that changes in DNA that affect the clinical phenotype should also similarly change the cellular levels of RNA and protein levels. In this report, we test this prediction by looking at the concordance between DNA variation in population of mouse inbred strains, the RNA and protein variation in the liver tissue of these mice, and variation in metabolic phenotypes. We show that the relationship between various biological traits is not simple and that there is relatively little concordance of RNA levels and the corresponding protein levels in response to DNA perturbations. In addition, we also find that, surprisingly, metabolic traits correlate better to RNA levels than to protein levels. In light of current efforts in searching for the molecular bases of disease susceptibility in humans, our findings highlight the complexity of information flow that underlies clinical outcomes.
Collapse
Affiliation(s)
- Anatole Ghazalpour
- Department of Medicine/Division of Cardiology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
4
|
Alberts R, Lu L, Williams RW, Schughart K. Genome-wide analysis of the mouse lung transcriptome reveals novel molecular gene interaction networks and cell-specific expression signatures. Respir Res 2011; 12:61. [PMID: 21535883 PMCID: PMC3105947 DOI: 10.1186/1465-9921-12-61] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2011] [Accepted: 05/02/2011] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND The lung is critical in surveillance and initial defense against pathogens. In humans, as in mice, individual genetic differences strongly modulate pulmonary responses to infectious agents, severity of lung disease, and potential allergic reactions. In a first step towards understanding genetic predisposition and pulmonary molecular networks that underlie individual differences in disease vulnerability, we performed a global analysis of normative lung gene expression levels in inbred mouse strains and a large family of BXD strains that are widely used for systems genetics. Our goal is to provide a key community resource on the genetics of the normative lung transcriptome that can serve as a foundation for experimental analysis and allow predicting genetic predisposition and response to pathogens, allergens, and xenobiotics. METHODS Steady-state polyA+ mRNA levels were assayed across a diverse and fully genotyped panel of 57 isogenic strains using the Affymetrix M430 2.0 array. Correlations of expression levels between genes were determined. Global expression QTL (eQTL) analysis and network covariance analysis was performed using tools and resources in GeneNetwork http://www.genenetwork.org. RESULTS Expression values were highly variable across strains and in many cases exhibited a high heritability factor. Several genes which showed a restricted expression to lung tissue were identified. Using correlations between gene expression values across all strains, we defined and extended memberships of several important molecular networks in the lung. Furthermore, we were able to extract signatures of immune cell subpopulations and characterize co-variation and shared genetic modulation. Known QTL regions for respiratory infection susceptibility were investigated and several cis-eQTL genes were identified. Numerous cis- and trans-regulated transcripts and chromosomal intervals with strong regulatory activity were mapped. The Cyp1a1 P450 transcript had a strong trans-acting eQTL (LOD 11.8) on Chr 12 at 36 ± 1 Mb. This interval contains the transcription factor Ahr that has a critical mis-sense allele in the DBA/2J haplotype and evidently modulates transcriptional activation by AhR. CONCLUSIONS Large-scale gene expression analyses in genetic reference populations revealed lung-specific and immune-cell gene expression profiles and suggested specific gene regulatory interactions.
Collapse
Affiliation(s)
- Rudi Alberts
- Department of Infection Genetics, University of Veterinary Medicine Hannover, Inhoffenstr, Braunschweig, Germany
| | | | | | | |
Collapse
|
5
|
Systems genetics, bioinformatics and eQTL mapping. Genetica 2010; 138:915-24. [DOI: 10.1007/s10709-010-9480-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2010] [Accepted: 07/30/2010] [Indexed: 12/15/2022]
|
6
|
SYSGENET: a meeting report from a new European network for systems genetics. Mamm Genome 2010; 21:331-6. [PMID: 20623354 PMCID: PMC2923724 DOI: 10.1007/s00335-010-9273-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2010] [Accepted: 06/24/2010] [Indexed: 12/22/2022]
Abstract
The first scientific meeting of the newly established European SYSGENET network took place at the Helmholtz Centre for Infection Research (HZI) in Braunschweig, April 7-9, 2010. About 50 researchers working in the field of systems genetics using mouse genetic reference populations (GRP) participated in the meeting and exchanged their results, phenotyping approaches, and data analysis tools for studying systems genetics. In addition, the future of GRP resources and phenotyping in Europe was discussed.
Collapse
|
7
|
Cost-effective strategies for completing the interactome. Nat Methods 2008; 6:55-61. [PMID: 19079254 PMCID: PMC2613168 DOI: 10.1038/nmeth.1283] [Citation(s) in RCA: 75] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2008] [Accepted: 11/18/2008] [Indexed: 01/17/2023]
Abstract
Comprehensive protein interaction mapping projects are underway for many model species and humans. A key step in these projects is estimating the time, cost, and personnel required for obtaining an accurate and complete map. Here, we model the cost of interaction map completion across a spectrum of experimental designs. We show that current efforts may require up to 20 independent tests covering each protein pair to approach completion. We explore designs for reducing this cost substantially, including prioritization of protein pairs, probability thresholding, and interaction prediction. The best designs lower cost by four-fold overall and >100-fold in early stages of mapping. We demonstrate the best strategy in an ongoing project in Drosophila, in which we map 450 high-confidence interactions using 47 microtiter plates, versus thousands of plates expected using current designs. This study provides a framework for assessing the feasibility of interaction mapping projects and for future efforts to increase their efficiency.
Collapse
|
8
|
Gilad Y, Rifkin SA, Pritchard JK. Revealing the architecture of gene regulation: the promise of eQTL studies. Trends Genet 2008; 24:408-15. [PMID: 18597885 DOI: 10.1016/j.tig.2008.06.001] [Citation(s) in RCA: 357] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2008] [Revised: 06/09/2008] [Accepted: 06/09/2008] [Indexed: 10/21/2022]
Abstract
Expression quantitative trait loci (eQTL) mapping studies have become a widely used tool for identifying genetic variants that affect gene regulation. In these studies, expression levels are viewed as quantitative traits, and gene expression phenotypes are mapped to particular genomic loci by combining studies of variation in gene expression patterns with genome-wide genotyping. Results from recent eQTL mapping studies have revealed substantial heritable variation in gene expression within and between populations. In many cases, genetic factors that influence gene expression levels can be mapped to proximal (putatively cis) eQTLs and, less often, to distal (putatively trans) eQTLs. Beyond providing great insight into the biology of gene regulation, a combination of eQTL studies with results from traditional linkage or association studies of human disease may help predict a specific regulatory role for polymorphic sites previously associated with disease.
Collapse
Affiliation(s)
- Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| | | | | |
Collapse
|
9
|
Ferrara CT, Wang P, Neto EC, Stevens RD, Bain JR, Wenner BR, Ilkayeva OR, Keller MP, Blasiole DA, Kendziorski C, Yandell BS, Newgard CB, Attie AD. Genetic networks of liver metabolism revealed by integration of metabolic and transcriptional profiling. PLoS Genet 2008; 4:e1000034. [PMID: 18369453 PMCID: PMC2265422 DOI: 10.1371/journal.pgen.1000034] [Citation(s) in RCA: 163] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2007] [Accepted: 02/11/2008] [Indexed: 11/19/2022] Open
Abstract
Although numerous quantitative trait loci (QTL) influencing disease-related phenotypes have been detected through gene mapping and positional cloning, identification of the individual gene(s) and molecular pathways leading to those phenotypes is often elusive. One way to improve understanding of genetic architecture is to classify phenotypes in greater depth by including transcriptional and metabolic profiling. In the current study, we have generated and analyzed mRNA expression and metabolic profiles in liver samples obtained in an F2 intercross between the diabetes-resistant C57BL/6 leptinob/ob and the diabetes-susceptible BTBR leptinob/ob mouse strains. This cross, which segregates for genotype and physiological traits, was previously used to identify several diabetes-related QTL. Our current investigation includes microarray analysis of over 40,000 probe sets, plus quantitative mass spectrometry-based measurements of sixty-seven intermediary metabolites in three different classes (amino acids, organic acids, and acyl-carnitines). We show that liver metabolites map to distinct genetic regions, thereby indicating that tissue metabolites are heritable. We also demonstrate that genomic analysis can be integrated with liver mRNA expression and metabolite profiling data to construct causal networks for control of specific metabolic processes in liver. As a proof of principle of the practical significance of this integrative approach, we illustrate the construction of a specific causal network that links gene expression and metabolic changes in the context of glutamate metabolism, and demonstrate its validity by showing that genes in the network respond to changes in glutamine and glutamate availability. Thus, the methods described here have the potential to reveal regulatory networks that contribute to chronic, complex, and highly prevalent diseases and conditions such as obesity and diabetes. Although numerous quantitative trait loci (QTL) influencing disease-related phenotypes have been detected through gene mapping and positional cloning, identifying individual genes and their potential roles in molecular pathways leading to disease remains a challenge. In this study, we include transcriptional and metabolic profiling in genomic analyses to address this limitation. We investigated an F2 intercross between the diabetes-resistant C57BL/6 leptinob/ob and the diabetes-susceptible BTBR leptinob/ob mouse strains that segregates for genotype and diabetes-related physiological traits; blood glucose, plasma insulin and body weight. Our study shows that liver metabolites (comprised of amino acids, organic acids, and acyl-carnitines) map to distinct genetic regions, thereby indicating that tissue metabolites are heritable. We also demonstrate that genomic analysis can be integrated with liver mRNA expression and metabolite profiling data to construct causal, testable networks for control of specific metabolic processes in liver. We apply an in vitro study to confirm the validity of this integrative method, and thus provide a novel approach to reveal regulatory networks that contribute to chronic, complex, and highly prevalent diseases and conditions such as obesity and diabetes.
Collapse
Affiliation(s)
- Christine T. Ferrara
- Sarah W. Stedman Nutrition and Metabolism Center, Duke University Medical Center, Durham, North Carolina, United States of America
- Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, North Carolina, United States of America
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
- * E-mail: (CTF); (CBN); (ADA)
| | - Ping Wang
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Elias Chaibub Neto
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Robert D. Stevens
- Sarah W. Stedman Nutrition and Metabolism Center, Duke University Medical Center, Durham, North Carolina, United States of America
| | - James R. Bain
- Sarah W. Stedman Nutrition and Metabolism Center, Duke University Medical Center, Durham, North Carolina, United States of America
| | - Brett R. Wenner
- Sarah W. Stedman Nutrition and Metabolism Center, Duke University Medical Center, Durham, North Carolina, United States of America
| | - Olga R. Ilkayeva
- Sarah W. Stedman Nutrition and Metabolism Center, Duke University Medical Center, Durham, North Carolina, United States of America
| | - Mark P. Keller
- Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, North Carolina, United States of America
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Daniel A. Blasiole
- Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, North Carolina, United States of America
- Department of Biochemistry, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Christina Kendziorski
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Brian S. Yandell
- Department of Statistics, University of Wisconsin, Madison, Wisconsin, United States of America
- Department of Horticulture, University of Wisconsin, Madison, Wisconsin, United States of America
| | - Christopher B. Newgard
- Sarah W. Stedman Nutrition and Metabolism Center, Duke University Medical Center, Durham, North Carolina, United States of America
- * E-mail: (CTF); (CBN); (ADA)
| | - Alan D. Attie
- Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, North Carolina, United States of America
- * E-mail: (CTF); (CBN); (ADA)
| |
Collapse
|
10
|
Beyer A, Bandyopadhyay S, Ideker T. Integrating physical and genetic maps: from genomes to interaction networks. Nat Rev Genet 2007; 8:699-710. [PMID: 17703239 PMCID: PMC2811081 DOI: 10.1038/nrg2144] [Citation(s) in RCA: 161] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Physical and genetic mapping data have become as important to network biology as they once were to the Human Genome Project. Integrating physical and genetic networks currently faces several challenges: increasing the coverage of each type of network; establishing methods to assemble individual interaction measurements into contiguous pathway models; and annotating these pathways with detailed functional information. A particular challenge involves reconciling the wide variety of interaction types that are currently available. For this purpose, recent studies have sought to classify genetic and physical interactions along several complementary dimensions, such as ordered versus unordered, alleviating versus aggravating, and first versus second degree.
Collapse
Affiliation(s)
- Andreas Beyer
- Department of Bioengineering, University of California at San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA
| | | | | |
Collapse
|
11
|
Bao L, Peirce JL, Zhou M, Li H, Goldowitz D, Williams RW, Lu L, Cui Y. An integrative genomics strategy for systematic characterization of genetic loci modulating phenotypes. Hum Mol Genet 2007; 16:1381-90. [PMID: 17428815 DOI: 10.1093/hmg/ddm089] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Naturally occurring genetic variations may affect certain phenotypes through influencing transcript levels of the genes that are causally related to those phenotypes. Genomic regions harboring common sequence variants that modulate gene expression can be mapped as quantitative trait loci (QTLs) using a newly developed genetical genomics approach. This enables a new strategy for systematically mapping novel genetic loci underlying various phenotypes. In this work, we started from a seed set of genes with variants that are known to affect behavioral and neurological phenotypes (as recorded in Mammalian Phenotype Ontology Database) and used microarrays to analyze their expression levels in brain samples of a panel of BXD recombinant inbred mouse strains. We then systematically mapped the QTLs controlling the expression of these genes. Candidate causal genes in the QTL intervals were evaluated for evidence of functional genetic polymorphisms. Using this method, we were able to predict novel genetic loci and causal genes for a number of behavioral and neurological phenotypes. Lines of independent evidence supporting some of our results were provided by transcription factor binding site analysis and by biomedical literature. This strategy integrates gene-phenotype relations from decades of experimental mutagenesis studies and new genomic resources to provide an approach to rapidly expand knowledge on genetic loci modulating phenotypes.
Collapse
Affiliation(s)
- Lei Bao
- Department of Molecular Sciences, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Bao L, Zhou M, Wu L, Lu L, Goldowitz D, Williams RW, Cui Y. PolymiRTS Database: linking polymorphisms in microRNA target sites with complex traits. Nucleic Acids Res 2006; 35:D51-4. [PMID: 17099235 PMCID: PMC1669716 DOI: 10.1093/nar/gkl797] [Citation(s) in RCA: 132] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Polymorphism in microRNA Target Site (PolymiRTS) database is a collection of naturally occurring DNA variations in putative microRNA target sites. PolymiRTSs may affect gene expression and cause variations in complex phenotypes. The database integrates sequence polymorphism, phenotype and expression microarray data, and characterizes PolymiRTSs as potential candidates responsible for the quantitative trait locus (QTL) effects. It is a resource for studying PolymiRTSs and their implications in phenotypic variations. PolymiRTS database can be accessed at http://compbio.utmem.edu/miRSNP/.
Collapse
Affiliation(s)
- Lei Bao
- Department of Molecular SciencesMemphis, TN 38163, USA
- Center of Genomics and BioinformaticsMemphis, TN 38163, USA
| | - Mi Zhou
- Department of Molecular SciencesMemphis, TN 38163, USA
- Center of Genomics and BioinformaticsMemphis, TN 38163, USA
| | - Ligang Wu
- Skirball Institute of Biomolecular Medicine, New York University School of MedicineNY, USA
| | - Lu Lu
- Center of Genomics and BioinformaticsMemphis, TN 38163, USA
- Department of Anatomy and NeurobiologyMemphis, TN 38163, USA
- Key Laboratory of Nerve Regeneration, Nantong UniversityNantong, Jiangsu Province, China
| | - Dan Goldowitz
- Center of Genomics and BioinformaticsMemphis, TN 38163, USA
- Department of Anatomy and NeurobiologyMemphis, TN 38163, USA
| | - Robert W. Williams
- Center of Genomics and BioinformaticsMemphis, TN 38163, USA
- Department of Anatomy and NeurobiologyMemphis, TN 38163, USA
- Department of Pediatrics, University of Tennessee Health Science CenterMemphis, TN 38163, USA
| | - Yan Cui
- Department of Molecular SciencesMemphis, TN 38163, USA
- Center of Genomics and BioinformaticsMemphis, TN 38163, USA
- To whom correspondence should be addressed at Department of Molecular Sciences, University of Tennessee Health Science Center, 858 Madison Avenue, Memphis, TN 38163, USA. Tel: +1 9014483240; Fax: +1 9014487360;
| |
Collapse
|
13
|
Williams RW. Expression genetics and the phenotype revolution. Mamm Genome 2006; 17:496-502. [PMID: 16783631 DOI: 10.1007/s00335-006-0006-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2006] [Accepted: 02/06/2006] [Indexed: 01/22/2023]
Abstract
Genetic analysis of variation demands large numbers of individuals and even larger numbers of genotypes. The identification of alleles associated with Mendelian disorders has involved sample sizes of a thousand or more. Pervasive and common diseases that afflict human populations--cancer, heart disease, diabetes, neurodegeneration, addiction--are all polygenic and are even more demanding of large numbers. DeCode Genetics (http://www.decode.com) has harnessed the human resources of Iceland to unravel genetic and molecular causes of complex disease. The UK BioBank project (http://www.ukbiobank.ac.uk/) will incorporate 500,000 adult volunteers. The murine Collaborative Cross is the experimental equivalent of these human populations and will consist of a panel of approximately 1000 recombinant strains, expandable by intercrossing to much larger numbers of isogenic but heterozygous F(1)s. Massive projects of these types require efficient technologies. We have made enormous progress on the genotyping front, and it is now important to focus energy on devising ultrahigh-throughput methods to phenotype. Molecular phenotyping of the transcriptome has matured, and it is now possible to acquire hundreds of thousands of mRNA phenotypes at a cost matching those of SNPs. Proteomic and cell-based assays are also maturing rapidly. The acquisition of a personal genome along with a personal molecular phenome will provide an effective foundation for personalized medicine. Rodent models will be essential to test our ability to predict susceptibility and disease outcome using SNP data, molecular phenomes, and environmental exposures. These models will also be essential to test new treatments in a robust systems context that accounts for genetic variation.
Collapse
Affiliation(s)
- Robert W Williams
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, 855 Monroe Avenue, Memphis, TN 38163, USA.
| |
Collapse
|