1
|
Arowolo O, Suvorov A. Underexplored Molecular Mechanisms of Toxicity. J Xenobiot 2024; 14:939-949. [PMID: 39051348 PMCID: PMC11270369 DOI: 10.3390/jox14030052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/01/2024] [Accepted: 07/15/2024] [Indexed: 07/27/2024] Open
Abstract
Social biases may concentrate the attention of researchers on a small number of well-known molecules/mechanisms leaving others underexplored. In accordance with this view, central to mechanistic toxicology is a narrow range of molecular pathways that are assumed to be involved in a significant part of the responses to toxicity. It is unclear, however, if there are other molecular mechanisms which play an important role in toxicity events but are overlooked by toxicology. To identify overlooked genes sensitive to chemical exposures, we used publicly available databases. First, we used data on the published chemical-gene interactions for 17,338 genes to estimate their sensitivity to chemical exposures. Next, we extracted data on publication numbers per gene for 19,243 human genes from the Find My Understudied Genes database. Thresholds were applied to both datasets using our algorithm to identify chemically sensitive and chemically insensitive genes and well-studied and underexplored genes. A total of 1110 underexplored genes highly sensitive to chemical exposures were used in GSEA and Shiny GO analyses to identify enriched biological categories. The metabolism of fatty acids, amino acids, and glucose were identified as underexplored molecular mechanisms sensitive to chemical exposures. These findings suggest that future effort is needed to uncover the role of xenobiotics in the current epidemics of metabolic diseases.
Collapse
Affiliation(s)
| | - Alexander Suvorov
- Department of Environmental Health Sciences, School of Public Health and Health Sciences, University of Massachusetts, 686 North Pleasant Street, Amherst, MA 01003, USA;
| |
Collapse
|
2
|
Ekwudo MN, Malek MC, Anderson CE, Yampolsky LY. The interplay between prior selection, mild intermittent exposure, and acute severe exposure in phenotypic and transcriptional response to hypoxia. Ecol Evol 2022; 12:e9319. [PMID: 36248677 PMCID: PMC9548574 DOI: 10.1002/ece3.9319] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 09/01/2022] [Accepted: 09/02/2022] [Indexed: 11/16/2022] Open
Abstract
Hypoxia has profound and diverse effects on aerobic organisms, disrupting oxidative phosphorylation and activating several protective pathways. Predictions have been made that exposure to mild intermittent hypoxia may be protective against more severe exposure and may extend lifespan. Here we report the lifespan effects of chronic, mild, intermittent hypoxia, and short‐term survival in acute severe hypoxia in four clones of Daphnia magna originating from either permanent or intermittent habitats. We test the hypothesis that acclimation to chronic mild intermittent hypoxia can extend lifespan through activation of antioxidant and stress‐tolerance pathways and increase survival in acute severe hypoxia through activation of oxygen transport and storage proteins and adjustment to carbohydrate metabolism. Unexpectedly, we show that chronic hypoxia extended the lifespan in the two clones originating from intermittent habitats but had the opposite effect in the two clones from permanent habitats, which also showed lower tolerance to acute hypoxia. Exposure to chronic hypoxia did not protect against acute hypoxia; to the contrary, Daphnia from the chronic hypoxia treatment had lower acute hypoxia tolerance than normoxic controls. Few transcripts changed their abundance in response to the chronic hypoxia treatment in any of the clones. After 12 h of acute hypoxia treatment, the transcriptional response was more pronounced, with numerous protein‐coding genes with functionality in oxygen transport, mitochondrial and respiratory metabolism, and gluconeogenesis, showing upregulation. While clones from intermittent habitats showed somewhat stronger differential expression in response to acute hypoxia than those from permanent habitats, contrary to predictions, there were no significant hypoxia‐by‐habitat of origin or chronic‐by‐acute treatment interactions. GO enrichment analysis revealed a possible hypoxia tolerance role by accelerating the molting cycle and regulating neuron survival through upregulation of cuticular proteins and neurotrophins, respectively.
Collapse
Affiliation(s)
- Millicent N. Ekwudo
- Department of Biological SciencesEast Tennessee State UniversityJohnson CityTennesseeUSA,Ann Romney Center for Neurologic Diseases, Brigham and Women's HospitalHarvard Medical SchoolBostonMassachusettsUSA
| | - Morad C. Malek
- Department of Biological SciencesEast Tennessee State UniversityJohnson CityTennesseeUSA
| | - Cora E. Anderson
- Department of Biological SciencesEast Tennessee State UniversityJohnson CityTennesseeUSA,Department of Biological SciencesUniversity of Notre DameNotre DameIndianaUSA
| | - Lev Y. Yampolsky
- Department of Biological SciencesEast Tennessee State UniversityJohnson CityTennesseeUSA
| |
Collapse
|
3
|
Hayes WB, Mamano N. SANA NetGO: a combinatorial approach to using Gene Ontology (GO) terms to score network alignments. Bioinformatics 2019; 34:1345-1352. [PMID: 29228175 DOI: 10.1093/bioinformatics/btx716] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Accepted: 12/04/2017] [Indexed: 01/05/2023] Open
Abstract
Motivation Gene Ontology (GO) terms are frequently used to score alignments between protein-protein interaction (PPI) networks. Methods exist to measure GO similarity between proteins in isolation, but proteins in a network alignment are not isolated: each pairing is dependent on every other via the alignment itself. Existing measures fail to take into account the frequency of GO terms across networks, instead imposing arbitrary rules on when to allow GO terms. Results Here we develop NetGO, a new measure that naturally weighs infrequent, informative GO terms more heavily than frequent, less informative GO terms, without arbitrary cutoffs, instead downweighting GO terms according to their frequency in the networks being aligned. This is a global measure applicable only to alignments, independent of pairwise GO measures, in the same sense that the edge-based EC or S3 scores are global measures of topological similarity independent of pairwise topological similarities. We demonstrate the superiority of NetGO in alignments of predetermined quality and show that NetGO correlates with alignment quality better than any existing GO-based alignment measures. We also demonstrate that NetGO provides a measure of taxonomic similarity between species, consistent with existing taxonomic measuresa feature not shared with existing GObased network alignment measures. Finally, we re-score alignments produced by almost a dozen aligners from a previous study and show that NetGO does a better job at separating good alignments from bad ones. Availability and implementation Available as part of SANA. Contact whayes@uci.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Wayne B Hayes
- Department of Computer Science, University of California, Irvine, CA 92697-3435, USA
| | - Nil Mamano
- Department of Computer Science, University of California, Irvine, CA 92697-3435, USA
| |
Collapse
|
4
|
Zheng C, Ma JQ, Ma CL, Shen SY, Liu YF, Chen L. Regulation of Growth and Flavonoid Formation of Tea Plants ( Camellia sinensis) by Blue and Green Light. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2019; 67:2408-2419. [PMID: 30721059 DOI: 10.1021/acs.jafc.8b07050] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
The effects of blue (BL) and green light (GL) treatment during the dark period were examined in Camellia sinensis as a first step to understanding the spectral effects of artificial BL and GL on plant secondary metabolism and light signaling interactions. BL could induce the expression of CRY2/3, SPAs, HY5, and R2R3-MYBs to promote the accumulation of anthocyanins and catechins in tea plants. GL, on the other hand, could stimulate the accumulation of several functional substances (e.g., procyanidin B2/B3 and l-ascorbate) and temper these BL responses via down-regulation of CRY2/3 and PHOT2. Furthermore, the molecular events that triggered by BL and GL signals were partly overlapped with abiotic/biotic stress responses. We indicate the possibility of a targeted use of BL and GL to regulate the amount of functional metabolites to enhance tea quality and taste, and to potentially trigger defense mechanisms of tea plants.
Collapse
Affiliation(s)
- Chao Zheng
- Key Laboratory of Tea Biology and Resources Utilization, Ministry of Agriculture and Rural Affairs , Tea Research Institute of the Chinese Academy of Agricultural Sciences , Hangzhou , China
| | - Jian-Qiang Ma
- Key Laboratory of Tea Biology and Resources Utilization, Ministry of Agriculture and Rural Affairs , Tea Research Institute of the Chinese Academy of Agricultural Sciences , Hangzhou , China
| | - Chun-Lei Ma
- Key Laboratory of Tea Biology and Resources Utilization, Ministry of Agriculture and Rural Affairs , Tea Research Institute of the Chinese Academy of Agricultural Sciences , Hangzhou , China
| | - Si-Yan Shen
- Key Laboratory of Tea Biology and Resources Utilization, Ministry of Agriculture and Rural Affairs , Tea Research Institute of the Chinese Academy of Agricultural Sciences , Hangzhou , China
| | - Yu-Fei Liu
- Key Laboratory of Tea Biology and Resources Utilization, Ministry of Agriculture and Rural Affairs , Tea Research Institute of the Chinese Academy of Agricultural Sciences , Hangzhou , China
| | - Liang Chen
- Key Laboratory of Tea Biology and Resources Utilization, Ministry of Agriculture and Rural Affairs , Tea Research Institute of the Chinese Academy of Agricultural Sciences , Hangzhou , China
| |
Collapse
|
5
|
Campos B, Fletcher D, Piña B, Tauler R, Barata C. Differential gene transcription across the life cycle in Daphnia magna using a new all genome custom-made microarray. BMC Genomics 2018; 19:370. [PMID: 29776339 PMCID: PMC5960145 DOI: 10.1186/s12864-018-4725-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Accepted: 04/25/2018] [Indexed: 11/23/2022] Open
Abstract
Background Unravelling the link between genes and environment across the life cycle is a challenging goal that requires model organisms with well-characterized life-cycles, ecological interactions in nature, tractability in the laboratory, and available genomic tools. Very few well-studied invertebrate model species meet these requirements, being the waterflea Daphnia magna one of them. Here we report a full genome transcription profiling of D. magna during its life-cycle. The study was performed using a new microarray platform designed from the complete set of gene models representing the whole transcribed genome of D. magna. Results Up to 93% of the existing 41,317 D. magna gene models showed differential transcription patterns across the developmental stages of D. magna, 59% of which were functionally annotated. Embryos showed the highest number of unique transcribed genes, mainly related to DNA, RNA, and ribosome biogenesis, likely related to cellular proliferation and morphogenesis of the several body organs. Adult females showed an enrichment of transcripts for genes involved in reproductive processes. These female-specific transcripts were essentially absent in males, whose transcriptome was enriched in specific genes of male sexual differentiation genes, like doublesex. Conclusion Our results define major characteristics of transcriptional programs involved in the life-cycle, differentiate males and females, and show that large scale gene-transcription data collected in whole animals can be used to identify genes involved in specific biological and biochemical processes. Electronic supplementary material The online version of this article (10.1186/s12864-018-4725-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Bruno Campos
- IDAEA-CSIC: Institute of Environmental Diagnosis and Water Research, CSIC, Barcelona, Spain.
| | | | - Benjamín Piña
- IDAEA-CSIC: Institute of Environmental Diagnosis and Water Research, CSIC, Barcelona, Spain
| | - Romà Tauler
- IDAEA-CSIC: Institute of Environmental Diagnosis and Water Research, CSIC, Barcelona, Spain
| | - Carlos Barata
- IDAEA-CSIC: Institute of Environmental Diagnosis and Water Research, CSIC, Barcelona, Spain
| |
Collapse
|
6
|
Rago A, Gilbert DG, Choi JH, Sackton TB, Wang X, Kelkar YD, Werren JH, Colbourne JK. OGS2: genome re-annotation of the jewel wasp Nasonia vitripennis. BMC Genomics 2016; 17:678. [PMID: 27561358 PMCID: PMC5000498 DOI: 10.1186/s12864-016-2886-9] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2015] [Accepted: 07/06/2016] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Nasonia vitripennis is an emerging insect model system with haplodiploid genetics. It holds a key position within the insect phylogeny for comparative, evolutionary and behavioral genetic studies. The draft genomes for N. vitripennis and two sibling species were published in 2010, yet a considerable amount of transcriptiome data have since been produced thereby enabling improvements to the original (OGS1.2) annotated gene set. We describe and apply the EvidentialGene method used to produce an updated gene set (OGS2). We also carry out comparative analyses showcasing the usefulness of the revised annotated gene set. RESULTS The revised annotation (OGS2) now consists of 24,388 genes with supporting evidence, compared to 18,850 for OGS1.2. Improvements include the nearly complete annotation of untranslated regions (UTR) for 97 % of the genes compared to 28 % of genes for OGS1.2. The fraction of RNA-Seq validated introns also grow from 85 to 98 % in this latest gene set. The EST and RNA-Seq expression data provide support for several non-protein coding loci and 7712 alternative transcripts for 4146 genes. Notably, we report 180 alternative transcripts for the gene lola. Nasonia now has among the most complete insect gene set; only 27 conserved single copy orthologs in arthropods are missing from OGS2. Its genome also contains 2.1-fold more duplicated genes and 1.4-fold more single copy genes than the Drosophila melanogaster genome. The Nasonia gene count is larger than those of other sequenced hymenopteran species, owing both to improvements in the genome annotation and to unique genes in the wasp lineage. We identify 1008 genes and 171 gene families that deviate significantly from other hymenopterans in their rates of protein evolution and duplication history, respectively. We also provide an analysis of alternative splicing that reveals that genes with no annotated isoforms are characterized by shorter transcripts, fewer introns, faster protein evolution and higher probabilities of duplication than genes having alternative transcripts. CONCLUSIONS Genome-wide expression data greatly improves the annotation of the N. vitripennis genome, by increasing the gene count, reducing the number of missing genes and providing more comprehensive data on splicing and gene structure. The improved gene set identifies lineage-specific genomic features tied to Nasonia's biology, as well as numerous novel genes. OGS2 and its associated search tools are available at http://arthropods.eugenes.org/EvidentialGene/nasonia/ , www.hymenopteragenome.org/nasonia/ and waspAtlas: www.tinyURL.com/waspAtlas . The EvidentialGene pipeline is available at https://sourceforge.net/projects/evidentialgene/ .
Collapse
Affiliation(s)
- Alfredo Rago
- Environmental Genomics Group, School of Biosciences, University of Birmingham, Birmingham, UK
| | | | - Jeong-Hyeon Choi
- Cancer Center, Department of Biostatistics and Epidemiology, Medical College of Georgia, Georgia Regents University, Augusta, USA
| | - Timothy B. Sackton
- Department of Organismic and Evolutionary Biology, and FAS Informatics Group, Harvard University, Cambridge, USA
| | - Xu Wang
- Department of Molecular Biology and Genetics, Cornell Center for Comparative and Population Genomics, Cornell University, Ithaca, USA
| | - Yogeshwar D. Kelkar
- Department of Biostatistics and Computational Biology, University of Rochester Medical School, Rochester, USA
| | - John H. Werren
- Department of Biology, University of Rochester, Rochester, USA
| | - John K. Colbourne
- Environmental Genomics Group, School of Biosciences, University of Birmingham, Birmingham, UK
| |
Collapse
|
7
|
Daphnia magna transcriptome by RNA-Seq across 12 environmental stressors. Sci Data 2016; 3:160030. [PMID: 27164179 PMCID: PMC4862326 DOI: 10.1038/sdata.2016.30] [Citation(s) in RCA: 71] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 04/19/2016] [Indexed: 11/08/2022] Open
Abstract
The full exploration of gene-environment interactions requires model organisms with well-characterized ecological interactions in their natural environment, manipulability in the laboratory and genomic tools. The waterflea Daphnia magna is an established ecological and toxicological model species, central to the food webs of freshwater lentic habitats and sentinel for water quality. Its tractability and cyclic parthenogenetic life-cycle are ideal to investigate links between genes and the environment. Capitalizing on this unique model system, the STRESSFLEA consortium generated a comprehensive RNA-Seq data set by exposing two inbred genotypes of D. magna and a recombinant cross of these genotypes to a range of environmental perturbations. Gene models were constructed from the transcriptome data and mapped onto the draft genome of D. magna using EvidentialGene. The transcriptome data generated here, together with the available draft genome sequence of D. magna and a high-density genetic map will be a key asset for future investigations in environmental genomics.
Collapse
|
8
|
Xiao G, Zhou B. What can flies tell us about zinc homeostasis? Arch Biochem Biophys 2016; 611:134-141. [PMID: 27136711 DOI: 10.1016/j.abb.2016.04.016] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2016] [Revised: 04/27/2016] [Accepted: 04/29/2016] [Indexed: 12/14/2022]
Abstract
Zinc is an essential micronutrient for all organisms. For multicellular organisms, zinc uptake, storage, distribution and export are tightly regulated at both cellular and organismal levels, to cope with the multiple requirements versus the toxicity of the metal ion. During the past decade, the fruit fly Drosophila melanogaster has become an important model organism for the elucidation of metazoan zinc homeostasis. This review describes our current knowledge of various zinc transporters in Drosophila, with an emphasis on the process of dietary zinc uptake in the fly. We also discuss how Drosophila was used as a model to facilitate our understanding of the role of zinc in neurodegenerative diseases.
Collapse
Affiliation(s)
- Guiran Xiao
- College of Food Science and Engineering, Hefei University of Technology, Hefei, Anhui 230009, China; State Key Laboratory of Membrane Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Bing Zhou
- State Key Laboratory of Membrane Biology, School of Life Sciences, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
9
|
Zou D, Ma L, Yu J, Zhang Z. Biological databases for human research. GENOMICS PROTEOMICS & BIOINFORMATICS 2015; 13:55-63. [PMID: 25712261 PMCID: PMC4411498 DOI: 10.1016/j.gpb.2015.01.006] [Citation(s) in RCA: 69] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 01/01/2015] [Revised: 01/16/2015] [Accepted: 01/16/2015] [Indexed: 01/01/2023]
Abstract
The completion of the Human Genome Project lays a foundation for systematically studying the human genome from evolutionary history to precision medicine against diseases. With the explosive growth of biological data, there is an increasing number of biological databases that have been developed in aid of human-related research. Here we present a collection of human-related biological databases and provide a mini-review by classifying them into different categories according to their data types. As human-related databases continue to grow not only in count but also in volume, challenges are ahead in big data storage, processing, exchange and curation.
Collapse
Affiliation(s)
- Dong Zou
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Lina Ma
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Jun Yu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
| | - Zhang Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.
| |
Collapse
|
10
|
Abstract
Copper (Cu) is an essential redox active metal that is potentially toxic in excess. Multicellular organisms acquire Cu from the diet and must regulate uptake, storage, distribution and export of Cu at both the cellular and organismal levels. Systemic Cu deficiency can be fatal, as seen in Menkes disease patients. Conversely Cu toxicity occurs in patients with Wilson disease. Cu dyshomeostasis has also been implicated in neurodegenerative disorders such as Alzheimer's disease. Over the last decade, the fly Drosophila melanogaster has become an important model organism for the elucidation of eukaryotic Cu regulatory mechanisms. Gene discovery approaches with Drosophila have identified novel genes with conserved protein functions relevant to Cu homeostasis in humans. This review focuses on our current understanding of Cu uptake, distribution and export in Drosophila and the implications for mammals.
Collapse
Affiliation(s)
- Adam Southon
- Department of Genetics, University of Melbourne, Parkville, Australia.
| | | | | |
Collapse
|
11
|
Lee C, Huang CH. LASAGNA: a novel algorithm for transcription factor binding site alignment. BMC Bioinformatics 2013; 14:108. [PMID: 23522376 PMCID: PMC3747862 DOI: 10.1186/1471-2105-14-108] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 03/08/2013] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Scientists routinely scan DNA sequences for transcription factor (TF) binding sites (TFBSs). Most of the available tools rely on position-specific scoring matrices (PSSMs) constructed from aligned binding sites. Because of the resolutions of assays used to obtain TFBSs, databases such as TRANSFAC, ORegAnno and PAZAR store unaligned variable-length DNA segments containing binding sites of a TF. These DNA segments need to be aligned to build a PSSM. While the TRANSFAC database provides scoring matrices for TFs, nearly 78% of the TFs in the public release do not have matrices available. As work on TFBS alignment algorithms has been limited, it is highly desirable to have an alignment algorithm tailored to TFBSs. RESULTS We designed a novel algorithm named LASAGNA, which is aware of the lengths of input TFBSs and utilizes position dependence. Results on 189 TFs of 5 species in the TRANSFAC database showed that our method significantly outperformed ClustalW2 and MEME. We further compared a PSSM method dependent on LASAGNA to an alignment-free TFBS search method. Results on 89 TFs whose binding sites can be located in genomes showed that our method is significantly more precise at fixed recall rates. Finally, we described LASAGNA-ChIP, a more sophisticated version for ChIP (Chromatin immunoprecipitation) experiments. Under the one-per-sequence model, it showed comparable performance with MEME in discovering motifs in ChIP-seq peak sequences. CONCLUSIONS We conclude that the LASAGNA algorithm is simple and effective in aligning variable-length binding sites. It has been integrated into a user-friendly webtool for TFBS search and visualization called LASAGNA-Search. The tool currently stores precomputed PSSM models for 189 TFs and 133 TFs built from TFBSs in the TRANSFAC Public database (release 7.0) and the ORegAnno database (08Nov10 dump), respectively. The webtool is available at http://biogrid.engr.uconn.edu/lasagna_search/.
Collapse
Affiliation(s)
- Chih Lee
- Department of Computer Science and Engineering, University of Connecticut,
Fairfield Road, Storrs, CT 06269, USA
| | - Chun-Hsi Huang
- Department of Computer Science and Engineering, University of Connecticut,
Fairfield Road, Storrs, CT 06269, USA
| |
Collapse
|
12
|
Sahoo D, Dill DL, Gentles AJ, Tibshirani R, Plevritis SK. Boolean implication networks derived from large scale, whole genome microarray datasets. Genome Biol 2008; 9:R157. [PMID: 18973690 PMCID: PMC2760884 DOI: 10.1186/gb-2008-9-10-r157] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2008] [Revised: 09/06/2008] [Accepted: 10/30/2008] [Indexed: 11/23/2022] Open
Abstract
A method for analysis of microarray data is presented that extracts statistically significant Boolean implication relationships between pairs of genes. We describe a method for extracting Boolean implications (if-then relationships) in very large amounts of gene expression microarray data. A meta-analysis of data from thousands of microarrays for humans, mice, and fruit flies finds millions of implication relationships between genes that would be missed by other methods. These relationships capture gender differences, tissue differences, development, and differentiation. New relationships are discovered that are preserved across all three species.
Collapse
Affiliation(s)
- Debashis Sahoo
- Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | | | | | | | | |
Collapse
|
13
|
Towards defining the nuclear proteome. Genome Biol 2008; 9:R15. [PMID: 18211718 PMCID: PMC2395251 DOI: 10.1186/gb-2008-9-1-r15] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2007] [Revised: 12/19/2007] [Accepted: 01/23/2008] [Indexed: 11/17/2022] Open
Abstract
Direct evidence is reported for 2,568 mammalian proteins within the nuclear proteome, consisting of at least 14% of the entire proteome. Background The nucleus is a complex cellular organelle and accurately defining its protein content is essential before any systematic characterization can be considered. Results We report direct evidence for 2,568 mammalian proteins within the nuclear proteome: the nuclear subcellular localization of 1,529 proteins based on a high-throughput subcellular localization protocol of full-length proteins and an additional 1,039 proteins for which clear experimental evidence is documented in published literature. This is direct evidence that the nuclear proteome consists of at least 14% of the entire proteome. This dataset was used to evaluate computational approaches designed to identify additional nuclear proteins. Conclusion This represents direct experimental evidence that the nuclear proteome consists of at least 14% of the entire proteome. This high-quality nuclear proteome dataset was used to evaluate computational approaches designed to identify additional nuclear proteins. Based on this analysis, researchers can determine the stringency and types of lines of evidence they consider to infer the size and complement of the nuclear proteome.
Collapse
|
14
|
Joshua AM, Boutros PC. Web-based resources for clinical bioinformatics. METHODS IN MOLECULAR MEDICINE 2008; 141:309-329. [PMID: 18453097 DOI: 10.1007/978-1-60327-148-6_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
In the post-Human Genome Project era, awareness of the resources available through the internet is essential to both molecular biologists and clinicians. An overview of the main databases and analytical tools described in this chapter is important to understand the principles upon which hypotheses are generated, experiments are based and conclusions reached. Similarly, an introduction to the terminology of these resources often facilitates their use and adoption into practice. This chapter covers database resources such as NCBI/ Entrez, Ensembl and UCSC as well as analytical tools for sequence alignment, promoter analysis and molecular interactions.
Collapse
Affiliation(s)
- Anthony M Joshua
- Department of Medical Oncology, Princess Margaret Hospital, Toronto, Canada
| | | |
Collapse
|
15
|
Samarghitean C, Väliaho J, Vihinen M. IDR knowledge base for primary immunodeficiencies. Immunome Res 2007; 3:6. [PMID: 17394641 PMCID: PMC1854887 DOI: 10.1186/1745-7580-3-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2007] [Accepted: 03/29/2007] [Indexed: 11/29/2022] Open
Abstract
Background The ImmunoDeficiency Resource (IDR) is a knowledge base for the integration of the clinical, biochemical, genetic, genomic, proteomic, structural, and computational data of primary immunodeficiencies. The need for the IDR arises from the lack of structured and systematic information about primary immunodeficiencies on the Internet, and from the lack of a common platform which enables doctors, researchers, students, nurses and patients to find out validated information about these diseases. Description The IDR knowledge base, first released in 1999, has grown substantially. It contains information for 158 diseases, both from a clinical as well as molecular point of view. The database and the user interface have been reformatted. This new IDR release has a richer and more complete breadth, depth and scope. The service provides the most complete and up-to-date dataset. The IDR has been integrated with several internal and external databases and services. The contents of the IDR are validated and selected for different types of users (doctors, nurses, researchers and students, as well as patients and their families). The search engine has been improved and allows either a detailed or a broad search from a simple user interface. Conclusion The IDR is the first knowledge base specifically designed to capture in a systematic and validated way both clinical and molecular information for primary immunodeficiencies. The service is freely available at http://bioinf.uta.fi/idr and is regularly updated. The IDR facilitates primary immunodeficiencies informatics and helps to parameterise in silico modelling of these diseases. The IDR is useful also as an advanced education tool for medical students, and physicians.
Collapse
Affiliation(s)
- Crina Samarghitean
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
| | - Jouni Väliaho
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
| | - Mauno Vihinen
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
- Research Unit, Tampere University Hospital, FI-33520 Tampere, Finland
| |
Collapse
|
16
|
Hardwidge PR, Donohoe S, Aebersold R, Finlay BB. Proteomic analysis of the binding partners to enteropathogenic Escherichia coli virulence proteins expressed in Saccharomyces cerevisiae. Proteomics 2006; 6:2174-9. [PMID: 16552782 DOI: 10.1002/pmic.200500523] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Enteropathogenic Escherichia coli (EPEC) is an enteric human pathogen responsible for much worldwide morbidity and mortality. EPEC uses a type III secretion system to inject bacterial proteins into the cytosol of intestinal epithelial cells to cause diarrheal disease. We are interested in determining the host proteins to which EPEC translocator and effector proteins bind during infection. To facilitate protein enrichment, we created fusions between GST and EPEC virulence proteins, and expressed these fusions individually in Saccharomyces cerevisiae. The biology of S. cerevisiae is well understood and often employed as a model eukaryote to study the function of bacterial virulence factors. We isolated the yeast proteins that interact with individual EPEC proteins by affinity purifying against the GST tag. These complexes were subjected to ICAT combined with ESI-MS/MS. Database searching of sequenced peptides provided a list of proteins that bound specifically to each EPEC virulence protein. The dataset suggests several potential mammalian targets of these proteins that may guide future experimentation.
Collapse
Affiliation(s)
- Philip R Hardwidge
- Veterinary Science Department, South Dakota State University, Brookings, SD, USA.
| | | | | | | |
Collapse
|
17
|
Alesci S, Manoli I, Michopoulos VJ, Brouwers FM, Le H, Gold PW, Blackman MR, Rennert OM, Su YA, Chrousos GP. Development of a human mitochondria-focused cDNA microarray (hMitChip) and validation in skeletal muscle cells: implications for pharmaco- and mitogenomics. THE PHARMACOGENOMICS JOURNAL 2006; 6:333-42. [PMID: 16534508 DOI: 10.1038/sj.tpj.6500377] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Mitochondrial research has influenced our understanding of human evolution, physiology and pathophysiology. Mitochondria, intracellular organelles widely known as 'energy factories' of the cell, also play fundamental roles in intermediary metabolism, steroid hormone and heme biosyntheses, calcium signaling, generation of radical oxygen species, and apoptosis. Mitochondria possess a distinct DNA (mitochondrial DNA); yet, the vast majority of mitochondrial proteins are encoded by the nuclear DNA. Mitochondria-related genetic defects have been described in a variety of mostly rare, often fatal, primary mitochondrial disorders; furthermore, they are increasingly reported in association with many common morbid conditions, such as cancer, obesity, diabetes and neurodegenerative disorders, although their role remains unclear. This study describes the creation of a human mitochondria-focused cDNA microarray (hMitChip) and its validation in human skeletal muscle cells treated with glucocorticoids. We suggest that hMitChip is a reliable and novel tool that will prove useful for systematically studying the contribution of mitochondrial genomics to human health and disease.
Collapse
Affiliation(s)
- S Alesci
- Clinical Neuroendocrinology Branch, NIMH, NIH, Bethesda, MD 20892-1284, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Kokocinski F, Delhomme N, Wrobel G, Hummerich L, Toedt G, Lichter P. FACT--a framework for the functional interpretation of high-throughput experiments. BMC Bioinformatics 2005; 6:161. [PMID: 15985174 PMCID: PMC1189078 DOI: 10.1186/1471-2105-6-161] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2005] [Accepted: 06/28/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Interpreting the results of high-throughput experiments, such as those obtained from DNA-microarrays, is an often time-consuming task due to the high number of data-points that need to be analyzed in parallel. It is usually a matter of extensive testing and unknown beforehand, which of the possible approaches for the functional analysis will be the most informative. RESULTS To address this problem, we have developed the Flexible Annotation and Correlation Tool (FACT). FACT allows for detection of important patterns in large data sets by simplifying the integration of heterogeneous data sources and the subsequent application of different algorithms for statistical evaluation or visualization of the annotated data. The system is constantly extended to include additional annotation data and comparison methods. CONCLUSION FACT serves as a highly flexible framework for the explorative analysis of large genomic and proteomic result sets. The program can be used online; open source code and supplementary information are available at http://www.factweb.de.
Collapse
Affiliation(s)
- Felix Kokocinski
- Molecular Genetics, Deutsches Krebsforschungszentrum, 69115 Heidelberg, Germany
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1HH, UK
| | - Nicolas Delhomme
- Molecular Genetics, Deutsches Krebsforschungszentrum, 69115 Heidelberg, Germany
| | - Gunnar Wrobel
- Molecular Genetics, Deutsches Krebsforschungszentrum, 69115 Heidelberg, Germany
| | - Lars Hummerich
- Molecular Genetics, Deutsches Krebsforschungszentrum, 69115 Heidelberg, Germany
| | - Grischa Toedt
- Molecular Genetics, Deutsches Krebsforschungszentrum, 69115 Heidelberg, Germany
| | - Peter Lichter
- Molecular Genetics, Deutsches Krebsforschungszentrum, 69115 Heidelberg, Germany
| |
Collapse
|
19
|
Väliaho J, Riikonen P, Vihinen M. Distribution of immunodeficiency fact files with XML--from Web to WAP. BMC Med Inform Decis Mak 2005; 5:21. [PMID: 15978138 PMCID: PMC1184081 DOI: 10.1186/1472-6947-5-21] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2004] [Accepted: 06/26/2005] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND Although biomedical information is growing rapidly, it is difficult to find and retrieve validated data especially for rare hereditary diseases. There is an increased need for services capable of integrating and validating information as well as proving it in a logically organized structure. A XML-based language enables creation of open source databases for storage, maintenance and delivery for different platforms. METHODS Here we present a new data model called fact file and an XML-based specification Inherited Disease Markup Language (IDML), that were developed to facilitate disease information integration, storage and exchange. The data model was applied to primary immunodeficiencies, but it can be used for any hereditary disease. Fact files integrate biomedical, genetic and clinical information related to hereditary diseases. RESULTS IDML and fact files were used to build a comprehensive Web and WAP accessible knowledge base ImmunoDeficiency Resource (IDR) available at http://bioinf.uta.fi/idr/. A fact file is a user oriented user interface, which serves as a starting point to explore information on hereditary diseases. CONCLUSION The IDML enables the seamless integration and presentation of genetic and disease information resources in the Internet. IDML can be used to build information services for all kinds of inherited diseases. The open source specification and related programs are available at http://bioinf.uta.fi/idml/.
Collapse
Affiliation(s)
- Jouni Väliaho
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
| | - Pentti Riikonen
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
- Department of Information Technology, University of Turku, FI-20520 Turku, Finland
| | - Mauno Vihinen
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
- Research Unit, Tampere University Hospital, FI-33520 Tampere, Finland
| |
Collapse
|
20
|
Colbourne JK, Singan VR, Gilbert DG. wFleaBase: the Daphnia genome database. BMC Bioinformatics 2005; 6:45. [PMID: 15752432 PMCID: PMC555599 DOI: 10.1186/1471-2105-6-45] [Citation(s) in RCA: 102] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2004] [Accepted: 03/07/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND wFleaBase is a database with the necessary infrastructure to curate, archive and share genetic, molecular and functional genomic data and protocols for an emerging model organism, the microcrustacean Daphnia. Commonly known as the water-flea, Daphnia's ecological merit is unequaled among metazoans, largely because of its sentinel role within freshwater ecosystems and over 200 years of biological investigations. By consequence, the Daphnia Genomics Consortium (DGC) has launched an interdisciplinary research program to create the resources needed to study genes that affect ecological and evolutionary success in natural environments. DISCUSSION These tools include the genome database wFleaBase, which currently contains functions to search and extract information from expressed sequenced tags, genome survey sequences and full genome sequencing projects. This new database is built primarily from core components of the Generic Model Organism Database project, and related bioinformatics tools. SUMMARY Over the coming year, preliminary genetic maps and the nearly complete genomic sequence of Daphnia pulex will be integrated into wFleaBase, including gene predictions and ortholog assignments based on sequence similarities with eukaryote genes of known function. wFleaBase aims to serve a large ecological and evolutionary research community. Our challenge is to rapidly expand its content and to ultimately integrate genetic and functional genomic information with population-level responses to environmental challenges. URL: http://wfleabase.org/.
Collapse
Affiliation(s)
- John K Colbourne
- Center for Genomics and Bioinformatics, Indiana University, Bloomington, Indiana USA
| | - Vasanth R Singan
- School of Informatics, Indiana University, Bloomington, Indiana USA
| | - Don G Gilbert
- Department of Biology, Indiana University, Bloomington, Indiana USA
| |
Collapse
|
21
|
Matthews KA, Kaufman TC, Gelbart WM. Research resources for Drosophila: the expanding universe. Nat Rev Genet 2005; 6:179-93. [PMID: 15738962 DOI: 10.1038/nrg1554] [Citation(s) in RCA: 90] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Drosophila melanogaster has been the subject of research into central questions about biological mechanisms for almost a century. The experimental tools and resources that are available or under development for D. melanogaster and its related species, particularly those for genomic analysis, are truly outstanding. Here we review three types of resource that have been developed for D. melanogaster research: databases and other sources of information, biological materials and experimental services. These resources are there to be exploited and we hope that this guide will encourage new uses for D. melanogaster information, materials and services, both by those new to flies and by experienced D. melanogaster researchers.
Collapse
Affiliation(s)
- Kathleen A Matthews
- Department of Biology, Indiana University, Bloomington, Indiana 47405-3700, USA.
| | | | | |
Collapse
|
22
|
Abstract
MOTIVATION As more whole genome sequences become available, comparing multiple genomes at the sequence level can provide insight into new biological discovery. However, there are significant challenges for genome comparison. The challenge includes requirement for computational resources owing to the large volume of genome data. More importantly, since the choice of genomes to be compared is entirely subjective, there are too many choices for genome comparison. For these reasons, there is pressing need for bioinformatics systems for comparing multiple genomes where users can choose genomes to be compared freely. RESULTS PLATCOM (Platform for Computational Comparative Genomics) is an integrated system for the comparative analysis of multiple genomes. The system is built on several public databases and a suite of genome analysis applications are provided as exemplary genome data mining tools over these internal databases. Researchers are able to visually investigate genomic sequence similarities, conserved gene neighborhoods, conserved metabolic pathways and putative gene fusion events among a set of selected multiple genomes. AVAILABILITY http://platcom.informatics.indiana.edu/platcom
Collapse
Affiliation(s)
- Kwangmin Choi
- School of Informatics, Indiana University, Bloomington, IN 47404, USA
| | | | | | | |
Collapse
|
23
|
Maslov S, Sneppen K, Eriksen KA, Yan KK. Upstream plasticity and downstream robustness in evolution of molecular networks. BMC Evol Biol 2004; 4:9. [PMID: 15070432 PMCID: PMC385226 DOI: 10.1186/1471-2148-4-9] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2003] [Accepted: 03/08/2004] [Indexed: 12/02/2022] Open
Abstract
Background Gene duplication followed by the functional divergence of the resulting pair of paralogous proteins is a major force shaping molecular networks in living organisms. Recent species-wide data for protein-protein interactions and transcriptional regulations allow us to assess the effect of gene duplication on robustness and plasticity of these molecular networks. Results We demonstrate that the transcriptional regulation of duplicated genes in baker's yeast Saccharomyces cerevisiae diverges fast so that on average they lose 3% of common transcription factors for every 1% divergence of their amino acid sequences. The set of protein-protein interaction partners of their protein products changes at a slower rate exhibiting a broad plateau for amino acid sequence similarity above 70%. The stability of functional roles of duplicated genes at such relatively low sequence similarity is further corroborated by their ability to substitute for each other in single gene knockout experiments in yeast and RNAi experiments in a nematode worm Caenorhabditis elegans. We also quantified the divergence rate of physical interaction neighborhoods of paralogous proteins in a bacterium Helicobacter pylori and a fly Drosophila melanogaster. However, in the absence of system-wide data on transcription factors' binding in these organisms we could not compare this rate to that of transcriptional regulation of duplicated genes. Conclusions For all molecular networks studied in this work we found that even the most distantly related paralogous proteins with amino acid sequence identities around 20% on average have more similar positions within a network than a randomly selected pair of proteins. For yeast we also found that the upstream regulation of genes evolves more rapidly than downstream functions of their protein products. This is in accordance with a view which puts regulatory changes as one of the main driving forces of the evolution. In this context a very important open question is to what extent our results obtained for homologous genes within a single species (paralogs) carries over to homologous proteins in different species (orthologs).
Collapse
Affiliation(s)
- Sergei Maslov
- Department of Physics, Brookhaven National Laboratory, Upton, New York 11973, USA
| | - Kim Sneppen
- Nordita, Blegdamsvej 17, 2100 Copenhagen Ø, Denmark
| | - Kasper Astrup Eriksen
- Department of Physics, Brookhaven National Laboratory, Upton, New York 11973, USA
- Department of Theoretical Physics, Lund University, Sölvegatan 14A, SE-223 62 Lund, Sweden
| | - Koon-Kiu Yan
- Department of Physics, Brookhaven National Laboratory, Upton, New York 11973, USA
- Department of Physics and Astronomy, Stony Brook University, Stony Brook, New York 11794, USA
| |
Collapse
|
24
|
Lenhard B, Wahlestedt C, Wasserman WW. GeneLynx mouse: integrated portal to the mouse genome. Genome Res 2003; 13:1501-4. [PMID: 12819149 PMCID: PMC403699 DOI: 10.1101/gr.951403] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
GeneLynx Mouse is a meta-database providing an extensive collection of hyperlinks to mouse gene-specific information in diverse databases available via the Internet. The GeneLynx project is based on the simple notion that given any gene-specific identifier (e.g., accession number, gene name, text, or sequence), scientists should be able to access a single location that provides a set of links to all the publicly available information pertinent to the specified gene. The recent climax in the mouse genome and RIKEN cDNA sequencing projects provided the data necessary for the development of a gene-centric mouse information portal based on the GeneLynx ideals. Clusters of RIKEN cDNA sequences were used to define the initial set of mouse genes. Like its human counterpart, GeneLynx Mouse is designed as an extensible relational database with an intuitive and user-friendly Web interface. Data is automatically extracted from diverse resources, using appropriate approaches to maximize the coverage. To promote cross-database interoperability, an indexing utility is provided to facilitate the establishment of hyperlinks in external databases. As a result of the integration of the human and mouse systems, GeneLynx now serves as a powerful comparative genomics data mining resource. GeneLynx Mouse can be freely accessed at http://mouse.genelynx.org.
Collapse
Affiliation(s)
- Boris Lenhard
- Center for Genomics and Bioinformatics, Karolinska Institutet, 17177 Stockholm, Sweden.
| | | | | |
Collapse
|
25
|
Abstract
Recent years have seen an explosion in the amount of available biological data. More and more genomes are being sequenced and annotated, and protein and gene interaction data are accumulating. Biological databases have been invaluable for managing these data and for making them accessible. Depending on the data that they contain, the databases fulfil different functions. But, although they are architecturally similar, so far their integration has proved problematic.
Collapse
Affiliation(s)
- Lincoln D Stein
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, New York 11724, USA.
| |
Collapse
|
26
|
Abstract
The recent sequencing of the Anopheles gambiae genome showcases the genetic breadth of insects and a trend towards sequencing organisms directly involved with human welfare. We describe traits in other insect species that make them important candidates for genomics projects, and review several recent workshops aimed at uniting researchers working with insect species to efficiently address problems in medicine, biotechnology, and agriculture.
Collapse
Affiliation(s)
- Jay D Evans
- USDA-ARS Bee Research Lab, Beltsville, MD 20705, USA.
| | | |
Collapse
|
27
|
Safran M, Chalifa-Caspi V, Shmueli O, Olender T, Lapidot M, Rosen N, Shmoish M, Peter Y, Glusman G, Feldmesser E, Adato A, Peter I, Khen M, Atarot T, Groner Y, Lancet D. Human Gene-Centric Databases at the Weizmann Institute of Science: GeneCards, UDB, CroW 21 and HORDE. Nucleic Acids Res 2003; 31:142-6. [PMID: 12519968 PMCID: PMC165497 DOI: 10.1093/nar/gkg050] [Citation(s) in RCA: 158] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Recent enhancements and current research in the GeneCards (GC) (http://bioinfo.weizmann.ac.il/cards/) project are described, including the addition of gene expression profiles and integrated gene locations. Also highlighted are the contributions of specialized associated human gene-centric databases developed at the Weizmann Institute. These include the Unified Database (UDB) (http://bioinfo.weizmann.ac.il/udb) for human genome mapping, the human Chromosome 21 database at the Weizmann Insti-tute (CroW 21) (http://bioinfo.weizmann.ac.il/crow21), and the Human Olfactory Receptor Data Explora-torium (HORDE) (http://bioinfo.weizmann.ac.il/HORDE). The synergistic relationships amongst these efforts have positively impacted the quality, quantity and usefulness of the GeneCards gene compendium.
Collapse
Affiliation(s)
- Marilyn Safran
- Department of Biological Services (Bioinformatics Unit), The Weizmann Institute of Science, 76100 Rehovot, Israel.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|