1
|
CoMentG: comprehensive retrieval of generic relationships between biomedical concepts from the scientific literature. Database (Oxford) 2024; 2024:baae025. [PMID: 38564426 PMCID: PMC10986793 DOI: 10.1093/database/baae025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 03/01/2024] [Accepted: 03/15/2024] [Indexed: 04/04/2024]
Abstract
The CoMentG resource contains millions of relationships between terms of biomedical interest obtained from the scientific literature. At the core of the system is a methodology for detecting significant co-mentions of concepts in the entire PubMed corpus. That method was applied to nine sets of terms covering the most important classes of biomedical concepts: diseases, symptoms/clinical signs, molecular functions, biological processes, cellular compartments, anatomic parts, cell types, bacteria and chemical compounds. We obtained more than 7 million relationships between more than 74 000 terms, and many types of relationships were not available in any other resource. As the terms were obtained from widely used resources and ontologies, the relationships are given using the standard identifiers provided by them and hence can be linked to other data. A web interface allows users to browse these associations, searching for relationships for a set of terms of interests provided as input, such as between a disease and their associated symptoms, underlying molecular processes or affected tissues. The results are presented in an interactive interface where the user can explore the reported relationships in different ways and follow links to other resources. Database URL: https://csbg.cnb.csic.es/CoMentG/.
Collapse
|
2
|
Each Cellular Compartment Has a Characteristic Protein Reactive Cysteine Ratio Determining Its Sensitivity to Oxidation. Antioxidants (Basel) 2023; 12:1274. [PMID: 37372004 DOI: 10.3390/antiox12061274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 06/05/2023] [Accepted: 06/07/2023] [Indexed: 06/29/2023] Open
Abstract
Signaling and detoxification of Reactive Oxygen Species (ROS) are important patho-physiologcal processes. Despite this, we lack comprehensive information on individual cells and cellular structures and functions affected by ROS, which is essential to build quantitative models of the effects of ROS. The thiol groups from cysteines (Cys) in proteins play a major role in redox defense, signaling, and protein function. In this study, we show that the proteins in each subcellular compartment contain a characteristic Cys amount. Using a fluorescent assay for -SH in thiolate form and amino groups in proteins, we show that the thiolate content correlates with ROS sensitivity and signaling properties of each compartment. The highest absolute thiolate concentration was found in the nucleolus, followed by the nucleoplasm and cytoplasm whereas protein thiolate groups per protein showed an inverse pattern. In the nucleoplasm, protein reactive thiols concentrated in SC35 speckles, SMN, and the IBODY that accumulated oxidized RNA. Our findings have important functional consequences, and explain differential sensitivity to ROS.
Collapse
|
3
|
PMIDigest: Interactive Review of Large Collections of PubMed Entries to Distill Relevant Information. Genes (Basel) 2023; 14:genes14040942. [PMID: 37107700 PMCID: PMC10137743 DOI: 10.3390/genes14040942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 03/14/2023] [Accepted: 04/18/2023] [Indexed: 04/29/2023] Open
Abstract
Scientific knowledge is being accumulated in the biomedical literature at an unprecedented pace. The most widely used database with biomedicine-related article abstracts, PubMed, currently contains more than 36 million entries. Users performing searches in this database for a subject of interest face thousands of entries (articles) that are difficult to process manually. In this work, we present an interactive tool for automatically digesting large sets of PubMed articles: PMIDigest (PubMed IDs digester). The system allows for classification/sorting of articles according to different criteria, including the type of article and different citation-related figures. It also calculates the distribution of MeSH (medical subject headings) terms for categories of interest, providing in a picture of the themes addressed in the set. These MeSH terms are highlighted in the article abstracts in different colors depending on the category. An interactive representation of the interarticle citation network is also presented in order to easily locate article "clusters" related to particular subjects, as well as their corresponding "hub" articles. In addition to PubMed articles, the system can also process a set of Scopus or Web of Science entries. In summary, with this system, the user can have a "bird's eye view" of a large set of articles and their main thematic tendencies and obtain additional information not evident in a plain list of abstracts.
Collapse
|
4
|
Phosphorylation-Related Crosstalk Between Distant Regions of the Core Region of the Coat Protein Contributes to Virion Assembly of Plum Pox Virus. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2020; 33:653-667. [PMID: 31859600 DOI: 10.1094/mpmi-10-19-0305-r] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Eukaryotic proteins are often targets of posttranslational modifications (PTMs). Capsid protein (CP) of plum pox virus (PPV), a member of genus Potyvirus, has been reported to be prone to phosphorylation in four serines at the N-terminal region. CP phosphorylation has been proposed to influence PPV infection by regulating CP accumulation in coordination with a second PTM, O-GlcNAcylation. In this study, a further proteomic characterization of PPV CP phosphorylation revealed additional phospho-targets, thus evidencing even greater complexity of the network of PTMs affecting this protein. In particular, two new phosphorylation targets, T254 and T313, at protein distal core, appear to be highly relevant for infection. Although abolishing phosphorylation at these positions does not have a severe effect on infectivity or viral accumulation, phospho-mimicking at either of these targets disrupts cell-to-cell movement. Strand-specific reverse transcription-quantitative PCR analysis and fractionation by centrifugation in a continuous sucrose gradient enabled us to conclude that such a deleterious effect is not related to failures in replication but is a consequence of inaccurate virion assembly. The analysis of spontaneous compensatory mutations at the CP core identified in a multiple phospho-mimicking mutant disclosed a functional dialogue between distant phospho-targets, which was further supported by an in silico PPV virion model, built on the watermelon mosaic virus atomic structure. Therefore, whereas joint and opposite action of O-GlcNAcylation and phosphorylation at the N-terminal disordered protrusion of CP appears to regulate protein stability, we propose that phosphorylations at the core region control assembly and disassembly of viral particles.
Collapse
|
5
|
Complex genetic and epigenetic regulation deviates gene expression from a unifying global transcriptional program. PLoS Comput Biol 2019; 15:e1007353. [PMID: 31527866 PMCID: PMC6764696 DOI: 10.1371/journal.pcbi.1007353] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 09/27/2019] [Accepted: 08/21/2019] [Indexed: 11/18/2022] Open
Abstract
Environmental or genetic perturbations lead to gene expression changes. While most analyses of these changes emphasize the presence of qualitative differences on just a few genes, we now know that changes are widespread. This large-scale variation has been linked to the exclusive influence of a global transcriptional program determined by the new physiological state of the cell. However, given the sophistication of eukaryotic regulation, we expect to have a complex architecture of specific control affecting this program. Here, we examine this architecture. Using data of Saccharomyces cerevisiae expression in different nutrient conditions, we first propose a five-sector genome partition, which integrates earlier models of resource allocation, as a framework to examine the deviations from the global control. In this scheme, we recognize invariant genes, whose regulation is dominated by physiology, specific genes, which substantially depart from it, and two additional classes that contain the frequently assumed growth-dependent genes. Whereas the invariant class shows a considerable absence of specific regulation, the rest is enriched by regulation at the level of transcription factors (TFs) and epigenetic modulators. We nevertheless find markedly different strategies in how these classes deviate. On the one hand, there are TFs that act in a unique way between partition constituents, and on the other, the action of chromatin modifiers is significantly diverse. The balance between regulatory strategies ultimately modulates the action of the general transcription machinery and therefore limits the possibility of establishing a unifying program of expression change at a genomic scale. How can we understand expression changes observed as a result of environmental or genetic perturbations? This issue has been conventionally answered by examining small groups of genes whose expression becomes qualitatively altered after these perturbations. But this approach is too simplistic, as we now know that extensive variation is typically observed. To explain this variation, recent works proposed a model in which genome-wide changes are explained by the action of a general program of transcription. Our manuscript reasons that given the complexity of eukaryotic transcriptional control, a unifying program of regulation cannot be achievable. Instead, we propose within an integrated framework of resource allocation that a rich structure of deviations from it exists and that by characterizing these deviations we can fully appreciate large-scale expression change.
Collapse
|
6
|
Bacterial Feature Finder (BaFF)-a system for extracting features overrepresented in sets of prokaryotic organisms. Bioinformatics 2019; 35:3482-3483. [PMID: 30844057 DOI: 10.1093/bioinformatics/btz099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Revised: 12/21/2018] [Accepted: 02/12/2019] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The results of some experimental and computational techniques are given in terms of large sets of organisms, especially prokaryotic. While their distinctive features can provide useful data regarding specific phenomenon, there are no automated tools for extracting them. RESULTS We present here the Bacterial Feature Finder web server, a tool to automatically interrogate sets of prokaryotic organisms provided by the user to evaluate their specific biological features. At the core of the system is a searchable database of qualitative and quantitative features compiled for more than 23 000 prokaryotic organisms. Both the input set of organisms and the background set used to calculate the enriched features can be directly provided by the user, or they can be obtained by searching the database. The results are presented via an interactive graphical interface, with links to external resources. AVAILABILITY AND IMPLEMENTATION The web server is freely available at http://csbg.cnb.csic.es/BaFF. It has been tested in the main web browsers and does not require any especial plug-ins or additional software. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
7
|
Factors affecting interactome-based prediction of human genes associated with clinical signs. BMC Bioinformatics 2017; 18:340. [PMID: 28715999 PMCID: PMC5514523 DOI: 10.1186/s12859-017-1754-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Accepted: 07/12/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Clinical signs are a fundamental aspect of human pathologies. While disease diagnosis is problematic or impossible in many cases, signs are easier to perceive and categorize. Clinical signs are increasingly used, together with molecular networks, to prioritize detected variants in clinical genomics pipelines, even if the patient is still undiagnosed. Here we analyze the ability of these network-based methods to predict genes that underlie clinical signs from the human interactome. RESULTS Our analysis reveals that these approaches can locate genes associated with clinical signs with variable performance that depends on the sign and associated disease. We analyzed several clinical and biological factors that explain these variable results, including number of genes involved (mono- vs. oligogenic diseases), mode of inheritance, type of clinical sign and gene product function. CONCLUSIONS Our results indicate that the characteristics of the clinical signs and their related diseases should be considered for interpreting the results of network-prediction methods, such as those aimed at discovering disease-related genes and variants. These results are important due the increasing use of clinical signs as an alternative to diseases for studying the molecular basis of human pathologies.
Collapse
|
8
|
MBROLE 2.0-functional enrichment of chemical compounds. Nucleic Acids Res 2016; 44:W201-4. [PMID: 27084944 PMCID: PMC4987872 DOI: 10.1093/nar/gkw253] [Citation(s) in RCA: 135] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2016] [Accepted: 04/02/2016] [Indexed: 12/24/2022] Open
Abstract
Metabolites Biological Role (MBROLE) is a server that performs functional enrichment analysis of a list of chemical compounds derived from a metabolomics experiment, which allows this list to be interpreted in biological terms. Since its release in 2011, MBROLE has been used by different groups worldwide to analyse metabolomics experiments from a variety of organisms. Here we present the latest version of the system, MBROLE2, accessible at http://csbg.cnb.csic.es/mbrole2. MBROLE2 has been supplemented with 10 databases not available in the previous version, which allow analysis over a larger, richer set of vocabularies including metabolite–protein and drug–protein interactions. This new version performs automatic conversion of compound identifiers from different databases, thus simplifying usage. In addition, the user interface has been redesigned to generate an interactive, more intuitive representation of the results.
Collapse
|
9
|
Practical analysis of specificity-determining residues in protein families. Brief Bioinform 2015; 17:255-61. [DOI: 10.1093/bib/bbv045] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Accepted: 06/15/2015] [Indexed: 12/17/2022] Open
|
10
|
Structural characterization of T7 tail machinery reveals a conserved tubular structure among otherPodoviridaefamily members and suggests a common mechanism for DNA delivery. BACTERIOPHAGE 2014. [DOI: 10.4161/bact.27011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
11
|
Strigolactone promotes degradation of DWARF14, an α/β hydrolase essential for strigolactone signaling in Arabidopsis. THE PLANT CELL 2014; 26:1134-50. [PMID: 24610723 PMCID: PMC4001374 DOI: 10.1105/tpc.114.122903] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Revised: 02/05/2014] [Accepted: 02/11/2014] [Indexed: 05/18/2023]
Abstract
Strigolactones (SLs) are phytohormones that play a central role in regulating shoot branching. SL perception and signaling involves the F-box protein MAX2 and the hydrolase DWARF14 (D14), proposed to act as an SL receptor. We used strong loss-of-function alleles of the Arabidopsis thaliana D14 gene to characterize D14 function from early axillary bud development through to lateral shoot outgrowth and demonstrated a role of this gene in the control of flowering time. Our data show that D14 distribution in vivo overlaps with that reported for MAX2 at both the tissue and subcellular levels, allowing physical interactions between these proteins. Our grafting studies indicate that neither D14 mRNA nor the protein move over a long range upwards in the plant. Like MAX2, D14 is required locally in the aerial part of the plant to suppress shoot branching. We also identified a mechanism of SL-induced, MAX2-dependent proteasome-mediated degradation of D14. This negative feedback loop would cause a substantial drop in SL perception, which would effectively limit SL signaling duration and intensity.
Collapse
|
12
|
Molecular determinants of the ATP hydrolysis asymmetry of the CCT chaperonin complex. Proteins 2014; 82:703-7. [PMID: 24420718 DOI: 10.1002/prot.24510] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2013] [Revised: 12/16/2013] [Accepted: 12/26/2013] [Indexed: 12/20/2022]
Abstract
The eukaryotic cytosolic chaperonin CCT is a molecular machine involved in assisting the folding of proteins involved in important cellular processes. Like other chaperonins, CCT is formed by a double-ring structure but, unlike all of them, each ring is composed of eight different, albeit homologous subunits. This complexity has probably to do with the specificity in substrate interaction and with the mechanism of protein folding that takes place during the chaperonin functional cycle, but its detailed molecular basis remains unknown. We have analyzed the known proteomes in search of residues that are differentially conserved in the eight subunits, as predictors of functional specificity (specificity-determining positions; SDPs). We have found that most of these SDPs are located near the ATP binding site, and that they define four CCT clusters, corresponding to subunits CCT3, CCT6, CCT8 and CCT1/2/4/5/7. Our results point to a spatial organisation of the CCT subunits in two opposite areas of the ring and provide a molecular explanation for the previously described asymmetry in the hydrolysis of ATP.
Collapse
|
13
|
Functional signature for the recognition of specific target mRNAs by human Staufen1 protein. Nucleic Acids Res 2014; 42:4516-26. [PMID: 24470147 PMCID: PMC3985646 DOI: 10.1093/nar/gku073] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Cellular messenger RNAs (mRNAs) are associated to proteins in the form of ribonucleoprotein particles. The double-stranded RNA-binding (DRB) proteins play important roles in mRNA synthesis, modification, activity and decay. Staufen is a DRB protein involved in the localized translation of specific mRNAs during Drosophila early development. The human Staufen1 (hStau1) forms RNA granules that contain translation regulation proteins as well as cytoskeleton and motor proteins to allow the movement of the granule on microtubules, but the mechanisms of hStau1-RNA recognition are still unclear. Here we used a combination of affinity chromatography, RNAse-protection, deep-sequencing and bioinformatic analyses to identify mRNAs differentially associated to hStau1 or a mutant protein unable to bind RNA and, in this way, defined a collection of mRNAs specifically associated to wt hStau1. A common sequence signature consisting of two opposite-polarity Alu motifs was present in the hStau1-associated mRNAs and was shown to be sufficient for binding to hStau1 and hStau1-dependent stimulation of protein expression. Our results unravel how hStau1 identifies a wide spectrum of cellular target mRNAs to control their localization, expression and fate.
Collapse
|
14
|
Abstract
Most bacterial viruses need a specialized machinery, called "tail," to inject their genomes inside the bacterial cytoplasm without disrupting the cellular integrity. Bacteriophage T7 is a well characterized member of the Podoviridae family infecting Escherichia coli, and it has a short noncontractile tail that assembles sequentially on the viral head after DNA packaging. The T7 tail is a complex of around 2.7 MDa composed of at least four proteins as follows: the connector (gene product 8, gp8), the tail tubular proteins gp11 and gp12, and the fibers (gp17). Using cryo-electron microscopy and single particle image reconstruction techniques, we have determined the precise topology of the tail proteins by comparing the structure of the T7 tail extracted from viruses and a complex formed by recombinant gp8, gp11, and gp12 proteins. Furthermore, the order of assembly of the structural components within the complex was deduced from interaction assays with cloned and purified tail proteins. The existence of common folds among similar tail proteins allowed us to obtain pseudo-atomic threaded models of gp8 (connector) and gp11 (gatekeeper) proteins, which were docked into the corresponding cryo-EM volumes of the tail complex. This pseudo-atomic model of the connector-gatekeeper interaction revealed the existence of a common molecular architecture among viruses belonging to the three tailed bacteriophage families, strongly suggesting that a common molecular mechanism has been favored during evolution to coordinate the transition between DNA packaging and tail assembly.
Collapse
|
15
|
Tools for the functional interpretation of metabolomic experiments. Brief Bioinform 2012; 14:737-44. [DOI: 10.1093/bib/bbs055] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
|
16
|
|
17
|
Genetic polymorphisms among C57BL/6 mouse inbred strains. Transgenic Res 2010; 20:481-9. [DOI: 10.1007/s11248-010-9403-8] [Citation(s) in RCA: 140] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2010] [Accepted: 05/10/2010] [Indexed: 11/24/2022]
|
18
|
Common conventions for interchange and archiving of three-dimensional electron microscopy information in structural biology. J Struct Biol 2005; 151:196-207. [PMID: 16043364 DOI: 10.1016/j.jsb.2005.06.001] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2005] [Revised: 06/02/2005] [Accepted: 06/07/2005] [Indexed: 11/23/2022]
Abstract
Three-dimensional electron microscopy (3DEM) has made significant contributions to structural biology. To accomplish this feat, many image-processing software packages were developed by various laboratories. The independent development of methods naturally implied the adoption of dissimilar conventions-penalizing users who want to take advantage of the wealth of algorithms from different packages. In addition, a public repository of 3DEM research results, the EM Data Bank, is now established. In an era where information exchange is important, standardizing conventions is a necessity. The 3DEM field requires a consistent set of conventions. We propose a set of common conventions named the "3DEM Image Conventions." They are designed as a standardized approach to image interpretation and presentation. In this regard, the conventions serve as a first step on which to build data-exchange solutions among existing software packages and as a vehicle for homogenous data representation in data archives, such as the EM Data Bank.
Collapse
|
19
|
EMDep: a web-based system for the deposition and validation of high-resolution electron microscopy macromolecular structural information. J Struct Biol 2004; 144:228-37. [PMID: 14643225 DOI: 10.1016/j.jsb.2003.09.009] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
This paper describes the design and implementation of a Web-based deposition system, EMDep, for macro-molecular volumes determined by electron microscopy and deposited at the European Bioinformatics Institute (EBI) for inclusion in the Electron Microscopy Data Base (EMDB). EMDep is a flexible and portable system (http://www.ebi.ac.uk/msd-srv/emdep/) that allows for the acceptance and validation of data, by an interactive depositor-driven operation. The system takes full advantage of the knowledge and expertise of the experimenters, rather than relying on the database curators, for the complete and accurate description of the structural experiment and its results.
Collapse
|
20
|
Abstract
MOTIVATION As in many other fields of science, computational methods in molecular biology need to intersperse information access and algorithm execution in a computational workflow. Users often find difficulties when transferring data between data sources and applications. In most cases there is no standard solution for workflow design and execution and tailored scripting mechanisms are implemented in a case by case basis. RESULTS In this paper, we present a general purpose 'programmable integrator' that can access information from a variety of sources in a coordinated manner. Its usefulness in complex bioinformatics applications is claimed and supported by some application examples. AVAILABILITY Tools are freely available to non-profit educations and research institutions. Usage by commercial organizations requires a license agreement. Software requirements: Java v1.3 (http://java.sun.com), Xerces XML Parser (http://xml.apache.org/xerces-j) and Kweelt implementation of XQuery (http://kweelt.sourceforge.net/).
Collapse
|
21
|
Abstract
FEMME (Feature Extraction in a Multi-resolution Macromolecular Environment: http://www.biocomp.cnb.uam.es/FEMME/) database version 1.0 is a new bioinformatics data resource that collects topologic and geometric information obtained from macromolecular structures solved by three-dimensional electron microscopy (3D-EM). Although the FEMME database is focused on medium resolution data, the methodology employed (based on the so-called alpha-shape theory) is applicable to atomic resolution data as well. The alpha-shape representation allows the automatic extraction of structural features from 3D-EM volumes and their subsequent characterisation. FEMME is being populated with 3D-EM data stored in the electron microscopy database EMD-DB (http://www.ebi.ac.uk/msd/). However, and since the number of entries in EMD-DB is still relatively small, FEMME is also being populated in this initial phase with structural data from PDB and PQS databases (http://www.rcsb.org/pdb/ and pqs.ebi.ac.uk/, respectively) whose resolution has been lowered accordingly. Each FEMME entry contains macromolecular geometry and topology information with a detailed description of its structural features. Moreover, FEMME data have facilitated the study and development of a method to retrieve macromolecular structures by their structural content based on the combined use of spin images and neural networks with encouraging results. Therefore, the FEMME database constitutes a powerful tool that provides a uniform and automatic way of analysing volumes coming from 3D-EM that will hopefully help the scientific community to perform wide structural comparisons.
Collapse
|
22
|
The need for a shared database infrastructure: combining X-ray crystallography and electron microscopy. EUROPEAN BIOPHYSICS JOURNAL : EBJ 2001; 29:457-62. [PMID: 11081407 DOI: 10.1007/s002490000089] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Advances in structural biology are opening greater opportunities for understanding biological structures from the cellular to the atomic level. Particularly promising are the links that can be established between the information provided by electron microscopy and the atomic structures derived from X-ray crystallography and nuclear magnetic resonance spectroscopy. Combining such different kinds of structural data can result in novel biological information on the interaction of biomolecules in large supramolecular assemblies. As a consequence, the need to develop new databases in the field of structural biology that allow for an integrated access to data from all the experimental techniques is becoming critical. Pilot studies performed in recent years have already established a solid background as far as the basic information that an integrated macromolecular structure database should contain, as well as the basic principles for integration. These efforts started in the context of the BioImage project, and resulted in a first complete database prototype that provided a versatile platform for the linking of atomic models or X-ray diffraction data with electron microscopy information. Analysis of the requirements needed to combine data at different levels of resolution have resulted in sets of specifications that make possible the integration of all these different types in the context of a web environment. The case of a structural study linking electron microscopy and X-ray data, which is already contained within the BioImage data base and in the Protein Data Bank, is used here to illustrate the current approach, while a general discussion highlights the urgent need for integrated databases.
Collapse
|
23
|
Design and realization of an on-line database for multidimensional microscopic images of biological specimens. J Struct Biol 1999; 125:103-11. [PMID: 10222267 DOI: 10.1006/jsbi.1999.4092] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The BioImage database is a new scientific database for multidimensional microscopic images of biological specimens, which is available through the World Wide Web (WWW). The development of this database has followed an iterative approach, in which requirements and functionality have been revised and extended. The complexity and innovative use of the data meant that technical and biological expertise has been crucial in the initial design of the data model. A controlled vocabulary was introduced to ensure data consistency. Pointers are used to reference information stored in other databases. The data model was built using InfoModeler as a database design tool. The database management system is the Informix Dynamic Server with Universal Data Option. This object-relational system allows the handling of complex data using features such as collection types, inheritance, and user-defined data types. Informix datablades are used to provide additional functionality: the Web Integration Option enables WWW access to the database; the Video Foundation Blade provides functionality for video handling.
Collapse
|
24
|
Abstract
Nowadays it is possible to unravel complex information at all levels of cellular organization by obtaining multi-dimensional image information. At the macromolecular level, three-dimensional (3D) electron microscopy, together with other techniques, is able to reach resolutions at the nanometer or subnanometer level. The information is delivered in the form of 3D volumes containing samples of a given function, for example, the electron density distribution within a given macromolecule. The same situation happens at the cellular level with the new forms of light microscopy, particularly confocal microscopy, all of which produce biological 3D volume information. Furthermore, it is possible to record sequences of images over time (videos), as well as sequences of volumes, bringing key information on the dynamics of living biological systems. It is in this context that work on BioImage started two years ago, and that its first version is now presented here. In essence, BioImage is a database specifically designed to contain multi-dimensional images, perform queries and interactively work with the resulting multi-dimensional information on the World Wide Web, as well as accomplish the required cross-database links. Two sister home pages of BioImage can be accessed at http://www. bioimage.org and http://www-embl.bioimage.org
Collapse
|
25
|
|