51
|
Yuan Y, Xu Y, Xu J, Ball RL, Liang H. Predicting the lethal phenotype of the knockout mouse by integrating comprehensive genomic data. ACTA ACUST UNITED AC 2012; 28:1246-52. [PMID: 22419784 DOI: 10.1093/bioinformatics/bts120] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION The phenotypes of knockout mice provide crucial information for understanding the biological functions of mammalian genes. Among various knockout phenotypes, lethality is of great interest because those involved genes play essential roles. With the availability of large-scale genomic data, we aimed to assess how well the integration of various genomic features can predict the lethal phenotype of single-gene knockout mice. RESULTS We first assembled a comprehensive list of 491 candidate genomic features derived from diverse data sources. Using mouse genes with a known phenotype as the training set, we integrated the informative genomic features to predict the knockout lethality through three machine learning methods. Based on cross-validation, our models could achieve a good performance (accuracy = 73% and recall = 63%). Our results serve as a valuable practical resource in the mouse genetics research community, and also accelerate the translation of the knowledge of mouse genes into better strategies for studying human disease.
Collapse
Affiliation(s)
- Yuan Yuan
- Graduate Program in Structural and Computational Biology and Molecular Biophysics, Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | | | | | | | | |
Collapse
|
52
|
Jiménez-Lozano N, Segura J, Macías JR, Vega J, Carazo JM. Integrating human and murine anatomical gene expression data for improved comparisons. ACTA ACUST UNITED AC 2011; 28:397-402. [PMID: 22106336 DOI: 10.1093/bioinformatics/btr639] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
MOTIVATION Information concerning the gene expression pattern in four dimensions (species, genes, anatomy and developmental stage) is crucial for unraveling the roles of genes through time. There are a variety of anatomical gene expression databases, but extracting information from them can be hampered by their diversity and heterogeneity. RESULTS aGEM 3.1 (anatomic Gene Expression Mapping) addresses the issues of diversity and heterogeneity of anatomical gene expression databases by integrating six mouse gene expression resources (EMAGE, GXD, GENSAT, Allen Brain Atlas data base, EUREXPRESS and BioGPS) and three human gene expression databases (HUDSEN, Human Protein Atlas and BioGPS). Furthermore, aGEM 3.1 provides new cross analysis tools to bridge these resources. AVAILABILITY AND IMPLEMENTATION aGEM 3.1 can be queried using gene and anatomical structure. Output information is presented in a friendly format, allowing the user to display expression maps and correlation matrices for a gene or structure during development. An in-depth study of a specific developmental stage is also possible using heatmaps that relate gene expression with anatomical components. http://agem.cnb.csic.es CONTACT natalia@cnb.csic.es SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
53
|
Damiri B, Holle E, Yu X, Baldwin WS. Lentiviral-mediated RNAi knockdown yields a novel mouse model for studying Cyp2b function. Toxicol Sci 2011; 125:368-81. [PMID: 22083726 DOI: 10.1093/toxsci/kfr309] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
There are few in vivo knockout models available to study the function of Cyp2 members involved in the metabolism of endogenous and exogenous chemicals. These models may help provide insight into the cytochrome P450s (CYPs) responsible for the detoxification and activation of drugs, environmental toxicants, and endobiotics. The aim of this work is to produce a potent Cyp2b-knockdown (KD) mouse for subsequent study of Cyp2b function. We made a quintuple Cyp2b-KD mouse using lentiviral-promoted short hairpin RNA (shRNA) homologous to all five murine Cyp2b subfamily members (Cyp2b9, 2b10, 2b13, 2b19, and 2b23). The Cyp2b-KD mice are viable, fertile, and without obvious gross abnormalities except for an increase in liver weight. Expression of the three hepatic Cyp2b members, 2b9, 2b10, and 2b13, is significantly repressed as demonstrated by quantitative real-time PCR and Western blotting. The constitutive androstane receptor activator, 1,4-Bis[2-(3,5-dichloropyridyloxy)] benzene (TCPOBOP), was used to determine if shRNA-mediated Cyp2b10 repression could be outcompeted by Cyp2b10 induction. TCPOBOP-treated Cyp2b-KD mice show 80-90% less Cyp2b protein expression than TCPOBOP-treated wild-type (WT) mice, demonstrating that Cyp induction does not outcompete the repressive function of the shRNA. Untreated and TCPOBOP-treated Cyp2b-KD mice are poor metabolizers of parathion compared with WT mice. Furthermore, Cyp2b-KD mice are sensitive to parathion, an organophosphate insecticide primarily metabolized by Cyp2b enzymes, when compared with WT mice. In summary, we designed an shRNA construct that repressed the expression and activity of multiple Cyp2b enzymes. We foresee that this novel Cyp2b-KD mouse model will significantly improve our understanding of the role of Cyp2b enzymes in chemical sensitivity and drug metabolism.
Collapse
Affiliation(s)
- Basma Damiri
- Environmental Toxicology Program, Clemson University, Clemson, South Carolina 29634, USA
| | | | | | | |
Collapse
|
54
|
Eppig JT, Blake JA, Bult CJ, Kadin JA, Richardson JE. The Mouse Genome Database (MGD): comprehensive resource for genetics and genomics of the laboratory mouse. Nucleic Acids Res 2011; 40:D881-6. [PMID: 22075990 PMCID: PMC3245042 DOI: 10.1093/nar/gkr974] [Citation(s) in RCA: 216] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The Mouse Genome Database (MGD, http://www.informatics.jax.org) is the international community resource for integrated genetic, genomic and biological data about the laboratory mouse. Data in MGD are obtained through loads from major data providers and experimental consortia, electronic submissions from laboratories and from the biomedical literature. MGD maintains a comprehensive, unified, non-redundant catalog of mouse genome features generated by distilling gene predictions from NCBI, Ensembl and VEGA. MGD serves as the authoritative source for the nomenclature of mouse genes, mutations, alleles and strains. MGD is the primary source for evidence-supported functional annotations for mouse genes and gene products using the Gene Ontology (GO). MGD provides full annotation of phenotypes and human disease associations for mouse models (genotypes) using terms from the Mammalian Phenotype Ontology and disease names from the Online Mendelian Inheritance in Man (OMIM) resource. MGD is freely accessible online through our website, where users can browse and search interactively, access data in bulk using Batch Query or BioMart, download data files or use our web services Application Programming Interface (API). Improvements to MGD include expanded genome feature classifications, inclusion of new mutant allele sets and phenotype associations and extensions of GO to include new relationships and a new stream of annotations via phylogenetic-based approaches.
Collapse
|
55
|
Wei Q, Condie BG. A focused in situ hybridization screen identifies candidate transcriptional regulators of thymic epithelial cell development and function. PLoS One 2011; 6:e26795. [PMID: 22087235 PMCID: PMC3210126 DOI: 10.1371/journal.pone.0026795] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2011] [Accepted: 10/04/2011] [Indexed: 12/16/2022] Open
Abstract
Background Thymic epithelial cells (TECs) are necessary for normal T cell development. Currently, one transcription factor, Foxn1 is known to be necessary for the progression of fetal TEC differentiation. However, some aspects of fetal TEC differentiation occur in Foxn1 mutants, suggesting the existence of additional transcriptional regulators of TEC differentiation. The goal of this study was to identify some of the additional candidate transcription factors that may be involved in the specification and/or differentiation of TECs during fetal development. Methodology/Principal Findings We identified candidate fetal TEC transcriptional regulators via data and text mining. From our data mining we selected the transcription factors Foxg1, Isl1, Gata3, Nkx2-5, Nkx2-6 and Sox2 for further studies. Whole mount in situ hybridizations confirmed the expression of these transcription factors within subdomains of the third pharyngeal pouch from E9.5–E10.5. By E11.5 days Foxg1 and Isl1 transcripts were the only mRNAs from this group of genes detected exclusively within the thymus domain of the third pouch. Based on this initial in situ hybridization analysis, we focused on defining the expression of Foxg1 and Isl1 during multiple stages of thymus development and TEC differentiation. We found that Foxg1 and Isl1 are specifically expressed in differentiating TECs during fetal and postnatal stages of thymus development. In addition, we found differential expression of Islet1 and Foxn1 within the fetal and postnatal TEC population. Conclusions/Significance Our studies have identified two developmental transcription factors that are excellent candidate regulators of thymic epithelial cell specification and differentiation during fetal development. Our results suggest that Foxg1 and Isl1 may play a role in the regulation of TEC differentiation during fetal and postnatal stages. Our results also demonstrate heterogeneity of TECs marked by the differential expression of transcription factors, potentially providing new insights into the regulation of TEC differentiation.
Collapse
Affiliation(s)
- Qiaozhi Wei
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
| | - Brian G. Condie
- Department of Genetics, University of Georgia, Athens, Georgia, United States of America
- * E-mail:
| |
Collapse
|
56
|
Eichenlaub MP, Ettwiller L. De novo genesis of enhancers in vertebrates. PLoS Biol 2011; 9:e1001188. [PMID: 22069375 PMCID: PMC3206014 DOI: 10.1371/journal.pbio.1001188] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2011] [Accepted: 09/22/2011] [Indexed: 02/02/2023] Open
Abstract
Whole genome duplication in teleost fish reveals that a few changes in non-regulatory genomic sequences are a source for generating new enhancers. Evolutionary innovation relies partially on changes in gene regulation. While a growing body of evidence demonstrates that such innovation is generated by functional changes or translocation of regulatory elements via mobile genetic elements, the de novo generation of enhancers from non-regulatory/non-mobile sequences has, to our knowledge, not previously been demonstrated. Here we show evidence for the de novo genesis of enhancers in vertebrates. For this, we took advantage of the massive gene loss following the last whole genome duplication in teleosts to systematically identify regions that have lost their coding capacity but retain sequence conservation with mammals. We found that these regions show enhancer activity while the orthologous coding regions have no regulatory activity. These results demonstrate that these enhancers have been de novo generated in fish. By revealing that minor changes in non-regulatory sequences are sufficient to generate new enhancers, our study highlights an important playground for creating new regulatory variability and evolutionary innovation. The genome of each living organism contains thousands of genes, and the precise control of the timing and location of expression of these genes is key for normal development and homeostasis of each individual. Despite the oftentimes high genetic similarity between organisms, the source of phenotypic differences, for example between human and mouse, is thought to originate mainly from changes in how and when genes are expressed. This is partially determined by enhancers, that contribute to the control of gene expression. For decades, duplication of existing genomic enhancers, mobile elements, and changes in the sequence of existing enhancers were believed to be the major ways of increasing the number and modifying the activity of enhancers. In this study, we show that enhancers don't have to be derived from pre-existing ones but can also appear de novo in regions of the genome that were previously not regulating gene expression. We analyzed teleost fish genomes and found three regions for which a limited number of changes in the DNA sequence was sufficient to generate new enhancers. We predict that such a process is frequent in vertebrate genomes, making de novo generation of enhancers an important mechanism for creating variation in gene expression.
Collapse
Affiliation(s)
| | - Laurence Ettwiller
- Centre for Organismal Studies, University of Heidelberg, Heidelberg, Germany
- * E-mail:
| |
Collapse
|
57
|
Ormestad M, Martindale MQ, Röttinger E. A comparative gene expression database for invertebrates. EvoDevo 2011; 2:17. [PMID: 21861937 PMCID: PMC3180427 DOI: 10.1186/2041-9139-2-17] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2011] [Accepted: 08/24/2011] [Indexed: 12/02/2022] Open
Abstract
Background As whole genome and transcriptome sequencing gets cheaper and faster, a great number of 'exotic' animal models are emerging, rapidly adding valuable data to the ever-expanding Evo-Devo field. All these new organisms serve as a fantastic resource for the research community, but the sheer amount of data, some published, some not, makes detailed comparison of gene expression patterns very difficult to summarize - a problem sometimes even noticeable within a single lab. The need to merge existing data with new information in an organized manner that is publicly available to the research community is now more necessary than ever. Description In order to offer a homogenous way of storing and handling gene expression patterns from a variety of organisms, we have developed the first web-based comparative gene expression database for invertebrates that allows species-specific as well as cross-species gene expression comparisons. The database can be queried by gene name, developmental stage and/or expression domains. Conclusions This database provides a unique tool for the Evo-Devo research community that allows the retrieval, analysis and comparison of gene expression patterns within or among species. In addition, this database enables a quick identification of putative syn-expression groups that can be used to initiate, among other things, gene regulatory network (GRN) projects.
Collapse
Affiliation(s)
- Mattias Ormestad
- Kewalo Marine Laboratory, PBRC, University of Hawaii at Manoa, 41, Ahui Street, Honolulu, 96734, HI, USA.
| | | | | |
Collapse
|
58
|
A gene-phenotype network for the laboratory mouse and its implications for systematic phenotyping. PLoS One 2011; 6:e19693. [PMID: 21625554 PMCID: PMC3098258 DOI: 10.1371/journal.pone.0019693] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2011] [Accepted: 04/11/2011] [Indexed: 01/22/2023] Open
Abstract
The laboratory mouse is the pre-eminent model organism for the dissection of human disease pathways. With the advent of a comprehensive panel of gene knockouts, projects to characterise the phenotypes of all knockout lines are being initiated. The range of genotype-phenotype associations can be represented using the Mammalian Phenotype ontology. Using publicly available data annotated with this ontology we have constructed gene and phenotype networks representing these associations. These networks show a scale-free, hierarchical and modular character and community structure. They also exhibit enrichment for gene coexpression, protein-protein interactions and Gene Ontology annotation similarity. Close association between gene communities and some high-level ontology terms suggests that systematic phenotyping can provide a direct insight into underlying pathways. However some phenotypes are distributed more diffusely across gene networks, likely reflecting the pleiotropic roles of many genes. Phenotype communities show a many-to-many relationship to human disease communities, but stronger overlap at more granular levels of description. This may suggest that systematic phenotyping projects should aim for high granularity annotations to maximise their relevance to human disease.
Collapse
|
59
|
Murray SA. Mouse resources for craniofacial research. Genesis 2011; 49:190-9. [PMID: 21309071 DOI: 10.1002/dvg.20722] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2010] [Revised: 01/06/2011] [Accepted: 01/16/2011] [Indexed: 01/22/2023]
Abstract
The mouse, as a genetically defined and easily manipulated model organism, has played a critical role in unraveling the mechanisms of craniofacial development and dysmorphology. While numerous gene knockout strains that display craniofacial abnormalities and essential recombinase tool strains with craniofacial-specific expression have been generated, many are absent from public repositories. Large-scale, international resource-generating initiatives promise to address this concern, providing a comprehensive set of targeted mutations and a suite of new Cre driver strains. In addition, panels of genetically defined strains provide tools to dissect the multigenic, complex nature of craniofacial development, adding to the foundation of information gained from single gene studies. Continued progress will require awareness and access to these essential mouse resources. In this review, current mouse resources, large-scale efforts, and potential future directions will be outlined and discussed.
Collapse
|
60
|
Ringwald M, Eppig JT. Mouse mutants and phenotypes: accessing information for the study of mammalian gene function. Methods 2010; 53:405-10. [PMID: 21185380 DOI: 10.1016/j.ymeth.2010.12.024] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2010] [Revised: 12/01/2010] [Accepted: 12/17/2010] [Indexed: 02/02/2023] Open
Abstract
Recent advances in high-throughput gene targeting and conditional mutagenesis are creating new and powerful resources to study the in vivo function of mammalian genes using the mouse as an experimental model. Mutant ES cells and mice are being generated at a rapid rate to study the molecular and phenotypic consequences of genetic mutations, and to correlate these study results with human disease conditions. Likewise, classical genetics approaches to identify mutations in the mouse genome that cause specific phenotypes have become more effective. Here, we describe methods to quickly obtain information on what mutant ES cells and mice are available, including recombinase driver lines for the generation of conditional mutants. Further, we describe means to access genetic and phenotypic data that identify mouse models for specific human diseases.
Collapse
Affiliation(s)
- Martin Ringwald
- The Jackson Laboratory, 600 Main Street, Bar Harbor, ME 04609, USA.
| | | |
Collapse
|