51
|
Tochitani S, Hayashizaki Y. Functional screening revisited in the postgenomic era. MOLECULAR BIOSYSTEMS 2007; 3:195-207. [PMID: 17308666 DOI: 10.1039/b614882b] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Functional screening can reveal a hidden function of a gene. cDNA library-based functional screening has flourished in various fields of biology so far, such as cancer biology, developmental biology and neuroscience. In the postgenomic era, however, various sequence database and public full-length cDNA resources are available, which now allow us to perform more straightforward, gene-oriented screening. Furthermore, the advent of RNA interference techniques has made it possible to perform effective loss-of-function screening. Gene-based functional screening is able to bridge the gap between genes and biological phenomena and raise important biological questions which should be tackled by integration of 'omic' datasets. These possible roles of functional screening will become more and more important in modern molecular biology moving toward the system level understanding of living organisms.
Collapse
Affiliation(s)
- Shiro Tochitani
- RNA Resource Exploration Laboratory, Functional RNA Research Program, Frontier Research System, RIKEN, Yokohama, Kanagawa, Japan.
| | | |
Collapse
|
52
|
Analytical methods from the perspective of method standardization. TOPICS IN CURRENT GENETICS 2007. [DOI: 10.1007/4735_2007_0217] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/08/2022]
|
53
|
Kulkarni-Kale U, Bhosle SG, Manjari GS, Joshi M, Bansode S, Kolaskar AS. Curation of viral genomes: challenges, applications and the way forward. BMC Bioinformatics 2006; 7 Suppl 5:S12. [PMID: 17254296 PMCID: PMC1764468 DOI: 10.1186/1471-2105-7-s5-s12] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Whole genome sequence data is a step towards generating the 'parts list' of life to understand the underlying principles of Biocomplexity. Genome sequencing initiatives of human and model organisms are targeted efforts towards understanding principles of evolution with an application envisaged to improve human health. These efforts culminated in the development of dedicated resources. Whereas a large number of viral genomes have been sequenced by groups or individuals with an interest to study antigenic variation amongst strains and species. These independent efforts enabled viruses to attain the status of 'best-represented taxa' with the highest number of genomes. However, due to lack of concerted efforts, viral genomic sequences merely remained as entries in the public repositories until recently. RESULTS VirGen is a curated resource of viral genomes and their analyses. Since its first release, it has grown both in terms of coverage of viral families and development of new modules for annotation and analysis. The current release (2.0) includes data for twenty-five families with broad host range as against eight in the first release. The taxonomic description of viruses in VirGen is in accordance with the ICTV nomenclature. A well-characterised strain is identified as a 'representative entry' for every viral species. This non-redundant dataset is used for subsequent annotation and analyses using sequenced-based Bioinformatics approaches. VirGen archives precomputed data on genome and proteome comparisons. A new data module that provides structures of viral proteins available in PDB has been incorporated recently. One of the unique features of VirGen is predicted conformational and sequential epitopes of known antigenic proteins using in-house developed algorithms, a step towards reverse vaccinology. CONCLUSION Structured organization of genomic data facilitates use of data mining tools, which provides opportunities for knowledge discovery. One of the approaches to achieve this goal is to carry out functional annotations using comparative genomics. VirGen, a comprehensive viral genome resource that serves as an annotation and analysis pipeline has been developed for the curation of public domain viral genome data http://bioinfo.ernet.in/virgen/virgen.html. Various steps in the curation and annotation of the genomic data and applications of the value-added derived data are substantiated with case studies.
Collapse
Affiliation(s)
| | | | | | - Manali Joshi
- Bioinformatics Centre, University of Pune, Pune 411 007 India
| | - Sandeep Bansode
- Bioinformatics Centre, University of Pune, Pune 411 007 India
| | | |
Collapse
|
54
|
Wishart DS. Discovering drug targets through the web. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY D-GENOMICS & PROTEOMICS 2006; 2:9-17. [PMID: 20483274 DOI: 10.1016/j.cbd.2006.01.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2005] [Revised: 01/28/2006] [Accepted: 01/30/2006] [Indexed: 11/25/2022]
Abstract
Traditionally, drug-target discovery is a "wet-bench" experimental process, depending on carefully designed genetic screens, biochemical tests and cellular assays to identify proteins and genes that are associated with a particular disease or condition. However, recent advances in DNA sequencing, transcript profiling, protein identification and protein quantification are leading to a flood of genomic and proteomic data that is, or potentially could be, linked to disease data. The quantity of data generated by these high throughput methods is forcing scientists to re-think the way they do traditional drug-target discovery. In particular it is leading them more and more towards identifying potential drug targets using computers. In fact, drug-target identification is now being done as much on the desk-top as on the bench-top. This review focuses on describing how drug-target discovery can be done in silico (i.e. via computer) using a variety of bioinformatic resources that are freely available on the web. Specifically, it highlights a number of web-accessible sequence databases, automated genome annotation tools, text mining tools; and integrated drug/sequence databases that can be used to identify drug targets for both endogenous (genetic and epigenetic) diseases as well as exogenous (infectious) diseases.
Collapse
Affiliation(s)
- David S Wishart
- Departments of Computing Science and Biological Sciences, University of Alberta, Edmonton, AB, Canada T6G 2E8
| |
Collapse
|
55
|
Messersmith DJ, Benson DA, Geer RC. A Web-based assessment of bioinformatics end-user support services at US universities. J Med Libr Assoc 2006; 94:299-305, E156-87. [PMID: 16888663 PMCID: PMC1525314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
OBJECTIVES This study was conducted to gauge the availability of bioinformatics end-user support services at US universities and to identify the providers of those services. The study primarily focused on the availability of short-term workshops that introduce users to molecular biology databases and analysis software. METHODS Websites of selected US universities were reviewed to determine if bioinformatics educational workshops were offered, and, if so, what organizational units in the universities provided them. RESULTS Of 239 reviewed universities, 72 (30%) offered bioinformatics educational workshops. These workshops were located at libraries (N = 15), bioinformatics centers (N = 38), or other facilities (N = 35). No such training was noted on the sites of 167 universities (70%). Of the 115 bioinformatics centers identified, two-thirds did not offer workshops. CONCLUSIONS This analysis of university Websites indicates that a gap may exist in the availability of workshops and related training to assist researchers in the use of bioinformatics resources, representing a potential opportunity for libraries and other facilities to provide training and assistance for this growing user group.
Collapse
Affiliation(s)
| | - Dennis A. Benson
- National Center for Biotechnology Information, National Library of Medicine, 8600 Rockville Pike, Building 38A, Room 3N307, Bethesda, Maryland 20894
| | - Renata C. Geer
- National Center for Biotechnology Information, National Library of Medicine, 8600 Rockville Pike, Building 38A, Room 35314, Bethesda, Maryland 20894
| |
Collapse
|
56
|
Abstract
The study of a collection of metabolites as a whole (metabolome), as opposed to isolated small molecules, is a fast-growing field promising to take us one step further towards understanding cell biology, and relating the genetic capabilities of an organism to its observed phenotype. The new sciences of metabolomics and metabonomics can exploit a variety of existing experimental and computational methods, but they also require new technology that can deal with both the amount and the diversity of the data relating to the rich world of metabolites. More specifically, the collaboration between bioinformaticians and chemoinformaticians promises to advance our view of cognate molecules, by shedding light on their atomic structure and properties. Modelling of the interactions of metabolites with other entities in the cell, and eventually complete modelling of reaction pathways will be essential for analysis of the experimental data, and prediction of an organism's response to environmental challenges.
Collapse
Affiliation(s)
- Irene Nobeli
- Randall Division of Cell and Molecular Biophysics, New Hunt's House, King's College London, UK.
| | | |
Collapse
|
57
|
Whitfield EJ, Pruess M, Apweiler R. Bioinformatics database infrastructure for biotechnology research. J Biotechnol 2006; 124:629-39. [PMID: 16757051 DOI: 10.1016/j.jbiotec.2006.04.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2005] [Revised: 03/06/2006] [Accepted: 04/03/2006] [Indexed: 10/24/2022]
Abstract
Many databases are available that provide valuable data resources for the biotechnological researcher. According to their core data, they can be divided into different types. Some databases provide primary data, like all published nucleotide sequences, others deal with protein sequences. In addition to these two basic types of databases, a huge number of more specialized resources are available, like databases about protein structures, protein identification, special features of genes and/or proteins, or certain organisms. Furthermore, some resources offer integrated views on different types of data, allowing the user to do easy customized queries over large datasets and to compare different types of data.
Collapse
Affiliation(s)
- Eleanor J Whitfield
- EMBL-EBI, Wellcome Trust Genome Campus, Hinxton Hall, Hinxton, Cambs CB10 1SD, UK.
| | | | | |
Collapse
|
58
|
Okuno Y, Yang J, Taneishi K, Yabuuchi H, Tsujimoto G. GLIDA: GPCR-ligand database for chemical genomic drug discovery. Nucleic Acids Res 2006; 34:D673-7. [PMID: 16381956 PMCID: PMC1347391 DOI: 10.1093/nar/gkj028] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
G-protein coupled receptors (GPCRs) represent one of the most important families of drug targets in pharmaceutical development. GPCR-LIgand DAtabase (GLIDA) is a novel public GPCR-related chemical genomic database that is primarily focused on the correlation of information between GPCRs and their ligands. It provides correlation data between GPCRs and their ligands, along with chemical information on the ligands, as well as access information to the various web databases regarding GPCRs. These data are connected with each other in a relational database, allowing users in the field of GPCR-related drug discovery to easily retrieve such information from either biological or chemical starting points. GLIDA includes structure similarity search functions for the GPCRs and for their ligands. Thus, GLIDA can provide correlation maps linking the searched homologous GPCRs (or ligands) with their ligands (or GPCRs). By analyzing the correlation patterns between GPCRs and ligands, we can gain more detailed knowledge about their interactions and improve drug design efforts by focusing on inferred candidates for GPCR-specific drugs. GLIDA is publicly available at . We hope that it will prove very useful for chemical genomic research and GPCR-related drug discovery.
Collapse
Affiliation(s)
- Yasushi Okuno
- Department of Genomic Drug Discovery Science, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimo-Adachi-cho, Sakyo-ku, Kyoto 606-8501, Japan.
| | | | | | | | | |
Collapse
|
59
|
Cochrane G, Aldebert P, Althorpe N, Andersson M, Baker W, Baldwin A, Bates K, Bhattacharyya S, Browne P, van den Broek A, Castro M, Duggan K, Eberhardt R, Faruque N, Gamble J, Kanz C, Kulikova T, Lee C, Leinonen R, Lin Q, Lombard V, Lopez R, McHale M, McWilliam H, Mukherjee G, Nardone F, Pastor MPG, Sobhany S, Stoehr P, Tzouvara K, Vaughan R, Wu D, Zhu W, Apweiler R. EMBL Nucleotide Sequence Database: developments in 2005. Nucleic Acids Res 2006; 34:D10-5. [PMID: 16381823 PMCID: PMC1347492 DOI: 10.1093/nar/gkj130] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The EMBL Nucleotide Sequence Database (www.ebi.ac.uk/embl) at the EMBL European Bioinformatics Institute, UK, offers a comprehensive set of publicly available nucleotide sequence and annotation, freely accessible to all. Maintained in collaboration with partners DDBJ and GenBank, coverage includes whole genome sequencing project data, directly submitted sequence, sequence recorded in support of patent applications and much more. The database continues to offer submission tools, data retrieval facilities and user support. In 2005, the volume of data offered has continued to grow exponentially. In addition to the newly presented data, the database encompasses a range of new data types generated by novel technologies, offers enhanced presentation and searchability of the data and has greater integration with other data resources offered at the EBI and elsewhere. In stride with these developing data types, the database has continued to develop submission and retrieval tools to maximise the information content of submitted data and to offer the simplest possible submission routes for data producers. New developments, the submission process, data retrieval and access to support are presented in this paper, along with links to sources of further information.
Collapse
Affiliation(s)
- Guy Cochrane
- EMBL Outstation-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
60
|
Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res 2006; 34:D668-72. [PMID: 16381955 PMCID: PMC1347430 DOI: 10.1093/nar/gkj067] [Citation(s) in RCA: 2412] [Impact Index Per Article: 134.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
DrugBank is a unique bioinformatics/cheminformatics resource that combines detailed drug (i.e. chemical) data with comprehensive drug target (i.e. protein) information. The database contains >4100 drug entries including >800 FDA approved small molecule and biotech drugs as well as >3200 experimental drugs. Additionally, >14,000 protein or drug target sequences are linked to these drug entries. Each DrugCard entry contains >80 data fields with half of the information being devoted to drug/chemical data and the other half devoted to drug target or protein data. Many data fields are hyperlinked to other databases (KEGG, PubChem, ChEBI, PDB, Swiss-Prot and GenBank) and a variety of structure viewing applets. The database is fully searchable supporting extensive text, sequence, chemical structure and relational query searches. Potential applications of DrugBank include in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction, drug interaction prediction and general pharmaceutical education. DrugBank is available at http://redpoll.pharmacy.ualberta.ca/drugbank/.
Collapse
Affiliation(s)
- David S Wishart
- Department of Computing Science, University of Alberta, Edmonton, AB, Canada T6G 2E8.
| | | | | | | | | | | | | | | |
Collapse
|
61
|
Ellis LBM, Roe D, Wackett LP. The University of Minnesota Biocatalysis/Biodegradation Database: the first decade. Nucleic Acids Res 2006; 34:D517-21. [PMID: 16381924 PMCID: PMC1347439 DOI: 10.1093/nar/gkj076] [Citation(s) in RCA: 135] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
As the University of Minnesota Biocatalysis/Biodegradation Database (UM-BBD, ) starts its second decade, it includes information on over 900 compounds, over 600 enzymes, nearly 1000 reactions and about 350 microorganism entries. Its Biochemical Periodic Tables have grown to include biological information for almost all stable, non-noble-gas elements (). Its Pathway Prediction System (PPS) () is now an internationally recognized, open system for predicting microbial catabolism of organic compounds. Graphical display of PPS rules, a stand-alone version of the PPS and guidance for PPS users are being developed. The next decade should see the PPS, and the UM-BBD on which it is based, find increasing use by national and international government agencies, commercial organizations and educational institutions.
Collapse
Affiliation(s)
- Lynda B M Ellis
- Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, Mayo Mail Code 609, 420 SE Delaware Street, MN 55455, USA.
| | | | | |
Collapse
|
62
|
Burgun A. Desiderata for domain reference ontologies in biomedicine. J Biomed Inform 2005; 39:307-13. [PMID: 16266830 DOI: 10.1016/j.jbi.2005.09.002] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2005] [Revised: 09/09/2005] [Accepted: 09/20/2005] [Indexed: 11/28/2022]
Abstract
Domain reference ontologies represent knowledge about a particular part of the world in a way that is independent from specific objectives, through a theory of the domain. An example of reference ontology in biomedical informatics is the Foundational Model of Anatomy (FMA), an ontology of anatomy that covers the entire range of macroscopic, microscopic, and subcellular anatomy. The purpose of this paper is to explore how two domain reference ontologies--the FMA and the Chemical Entities of Biological Interest (ChEBI) ontology, can be used (i) to align existing terminologies, (ii) to infer new knowledge in ontologies of more complex entities, and (iii) to manage and help reasoning about individual data. We analyze those kinds of usages of these two domain reference ontologies and suggest desiderata for reference ontologies in biomedicine. While a number of groups and communities have investigated general requirements for ontology design and desiderata for controlled medical vocabularies, we are focusing on application purposes. We suggest five desirable characteristics for reference ontologies: good lexical coverage, good coverage in terms of relations, compatibility with standards, modularity, and ability to represent variation in reality.
Collapse
Affiliation(s)
- Anita Burgun
- EA 3888, Faculté de Médecine, IFR 140, Université de Rennes I, France.
| |
Collapse
|
63
|
Feldman HJ, Dumontier M, Ling S, Haider N, Hogue CWV. CO: A chemical ontology for identification of functional groups and semantic comparison of small molecules. FEBS Lett 2005; 579:4685-91. [PMID: 16098521 DOI: 10.1016/j.febslet.2005.07.039] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2005] [Revised: 07/21/2005] [Accepted: 07/21/2005] [Indexed: 11/20/2022]
Abstract
A novel chemical ontology based on chemical functional groups automatically, objectively assigned by a computer program, was developed to categorize small molecules. It has been applied to PubChem and the small molecule interaction database to demonstrate its utility as a basic pharmacophore search system. Molecules can be compared using a semantic similarity score based on functional group assignments rather than 3D shape, which succeeds in identifying small molecules known to bind a common binding site. This ontology will serve as a powerful tool for searching chemical databases and identifying key functional groups responsible for biological activities.
Collapse
Affiliation(s)
- Howard J Feldman
- The Blueprint Initiative of the Samuel Lunenfeld Research Institute, Mount Sinai Hospital, 600 University Ave., Toronto, ON, Canada
| | | | | | | | | |
Collapse
|