1
|
Timón-Reina S, Rincón M, Martínez-Tomás R. An overview of graph databases and their applications in the biomedical domain. Database (Oxford) 2021; 2021:baab026. [PMID: 34003247 PMCID: PMC8130509 DOI: 10.1093/database/baab026] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Revised: 03/24/2021] [Accepted: 04/30/2021] [Indexed: 01/18/2023]
Abstract
Over the past couple of decades, the explosion of densely interconnected data has stimulated the research, development and adoption of graph database technologies. From early graph models to more recent native graph databases, the landscape of implementations has evolved to cover enterprise-ready requirements. Because of the interconnected nature of its data, the biomedical domain has been one of the early adopters of graph databases, enabling more natural representation models and better data integration workflows, exploration and analysis facilities. In this work, we survey the literature to explore the evolution, performance and how the most recent graph database solutions are applied in the biomedical domain, compiling a great variety of use cases. With this evidence, we conclude that the available graph database management systems are fit to support data-intensive, integrative applications, targeted at both basic research and exploratory tasks closer to the clinic.
Collapse
Affiliation(s)
- Santiago Timón-Reina
- Departamento de Inteligencia Artificial, Universidad Nacional de Educación a Distancia (UNED), C/Juan del Rosal, 16 Ciudad Universitaria, Madrid 28040, Spain
| | - Mariano Rincón
- Departamento de Inteligencia Artificial, Universidad Nacional de Educación a Distancia (UNED), C/Juan del Rosal, 16 Ciudad Universitaria, Madrid 28040, Spain
| | - Rafael Martínez-Tomás
- Departamento de Inteligencia Artificial, Universidad Nacional de Educación a Distancia (UNED), C/Juan del Rosal, 16 Ciudad Universitaria, Madrid 28040, Spain
| |
Collapse
|
2
|
Kobren SN, Singh M. Systematic domain-based aggregation of protein structures highlights DNA-, RNA- and other ligand-binding positions. Nucleic Acids Res 2019; 47:582-593. [PMID: 30535108 PMCID: PMC6344845 DOI: 10.1093/nar/gky1224] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Accepted: 11/26/2018] [Indexed: 02/06/2023] Open
Abstract
Domains are fundamental subunits of proteins, and while they play major roles in facilitating protein–DNA, protein–RNA and other protein–ligand interactions, a systematic assessment of their various interaction modes is still lacking. A comprehensive resource identifying positions within domains that tend to interact with nucleic acids, small molecules and other ligands would expand our knowledge of domain functionality as well as aid in detecting ligand-binding sites within structurally uncharacterized proteins. Here, we introduce an approach to identify per-domain-position interaction ‘frequencies’ by aggregating protein co-complex structures by domain and ascertaining how often residues mapping to each domain position interact with ligands. We perform this domain-based analysis on ∼91000 co-complex structures, and infer positions involved in binding DNA, RNA, peptides, ions or small molecules across 4128 domains, which we refer to collectively as the InteracDome. Cross-validation testing reveals that ligand-binding positions for 2152 domains are highly consistent and can be used to identify residues facilitating interactions in ∼63–69% of human genes. Our resource of domain-inferred ligand-binding sites should be a great aid in understanding disease etiology: whereas these sites are enriched in Mendelian-associated and cancer somatic mutations, they are depleted in polymorphisms observed across healthy populations. The InteracDome is available at http://interacdome.princeton.edu.
Collapse
Affiliation(s)
- Shilpa Nadimpalli Kobren
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, MA 02115, USA
| | - Mona Singh
- Department of Computer Science, Princeton University, 35 Olden Street, Princeton, NJ 08544, USA.,Lewis-Sigler Institute for Integrative Genomics, Princeton University, Carl Icahn Laboratory, Princeton, NJ 08544, USA
| |
Collapse
|
3
|
Enkirch T, Sauber S, Anderson DE, Gan ES, Kenanov D, Maurer-Stroh S, von Messling V. Identification and in vivo Efficacy Assessment of Approved Orally Bioavailable Human Host Protein-Targeting Drugs With Broad Anti-influenza A Activity. Front Immunol 2019; 10:1097. [PMID: 31244822 PMCID: PMC6563844 DOI: 10.3389/fimmu.2019.01097] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 04/30/2019] [Indexed: 12/12/2022] Open
Abstract
The high genetic variability of influenza A viruses poses a continual challenge to seasonal and pandemic vaccine development, leaving antiviral drugs as the first line of defense against antigenically different strains or new subtypes. As resistance against drugs targeting viral proteins emerges rapidly, we assessed the antiviral activity of already approved drugs that target cellular proteins involved in the viral life cycle and were orally bioavailable. Out of 15 candidate compounds, four were able to inhibit infection by 10- to 100-fold without causing toxicity, in vitro. Two of the drugs, dextromethorphan and ketotifen, displayed a 50% effective dose between 5 and 50 μM, not only for the classic H1N1 PR8 strain, but also for a pandemic H1N1 and a seasonal H3N2 strain. Efficacy assessment in mice revealed that dextromethorphan consistently resulted in a significant reduction of viral lung titers and also enhanced the efficacy of oseltamivir. Dextromethorphan treatment of ferrets infected with a pandemic H1N1 strain led to a reduction in clinical disease severity, but no effect on viral titer was observed. In addition to identifying dextromethorphan as a potential influenza treatment option, our study illustrates the feasibility of a bioinformatics-driven rational approach for repurposing approved drugs against infectious diseases.
Collapse
Affiliation(s)
- Theresa Enkirch
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, Singapore, Singapore.,Veterinary Medicine Division, Paul-Ehrlich-Institut, Langen, Germany
| | - Svenja Sauber
- Veterinary Medicine Division, Paul-Ehrlich-Institut, Langen, Germany
| | - Danielle E Anderson
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, Singapore, Singapore
| | - Esther S Gan
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, Singapore, Singapore
| | - Dimitar Kenanov
- Biomolecular Function Discovery Division, Bioinformatics Institute, Agency for Science, Technology and Research, Singapore, Singapore
| | - Sebastian Maurer-Stroh
- Biomolecular Function Discovery Division, Bioinformatics Institute, Agency for Science, Technology and Research, Singapore, Singapore.,Department of Biological Sciences, National University of Singapore, Singapore, Singapore
| | - Veronika von Messling
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, Singapore, Singapore.,Veterinary Medicine Division, Paul-Ehrlich-Institut, Langen, Germany
| |
Collapse
|
4
|
Pavlopoulos GA, Kontou PI, Pavlopoulou A, Bouyioukos C, Markou E, Bagos PG. Bipartite graphs in systems biology and medicine: a survey of methods and applications. Gigascience 2018; 7:1-31. [PMID: 29648623 PMCID: PMC6333914 DOI: 10.1093/gigascience/giy014] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2017] [Revised: 01/15/2018] [Accepted: 02/13/2018] [Indexed: 11/14/2022] Open
Abstract
The latest advances in high-throughput techniques during the past decade allowed the systems biology field to expand significantly. Today, the focus of biologists has shifted from the study of individual biological components to the study of complex biological systems and their dynamics at a larger scale. Through the discovery of novel bioentity relationships, researchers reveal new information about biological functions and processes. Graphs are widely used to represent bioentities such as proteins, genes, small molecules, ligands, and others such as nodes and their connections as edges within a network. In this review, special focus is given to the usability of bipartite graphs and their impact on the field of network biology and medicine. Furthermore, their topological properties and how these can be applied to certain biological case studies are discussed. Finally, available methodologies and software are presented, and useful insights on how bipartite graphs can shape the path toward the solution of challenging biological problems are provided.
Collapse
Affiliation(s)
- Georgios A Pavlopoulos
- Lawrence Berkeley Labs, DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Panagiota I Kontou
- University of Thessaly, Department of Computer Science and Biomedical Informatics, Papasiopoulou 2–4, Lamia, 35100, Greece
| | - Athanasia Pavlopoulou
- Izmir International Biomedicine and Genome Institute (iBG-Izmir), Dokuz Eylül University, 35340, Turkey
| | - Costas Bouyioukos
- Université Paris Diderot, Sorbonne Paris Cité, Epigenetics and Cell Fate, UMR7216, CNRS, France
| | - Evripides Markou
- University of Thessaly, Department of Computer Science and Biomedical Informatics, Papasiopoulou 2–4, Lamia, 35100, Greece
| | - Pantelis G Bagos
- University of Thessaly, Department of Computer Science and Biomedical Informatics, Papasiopoulou 2–4, Lamia, 35100, Greece
| |
Collapse
|
5
|
Li W, Freudenberg J, Oswald M. Principles for the organization of gene-sets. Comput Biol Chem 2015; 59 Pt B:139-49. [PMID: 26188561 DOI: 10.1016/j.compbiolchem.2015.04.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Accepted: 04/08/2015] [Indexed: 12/23/2022]
Abstract
A gene-set, an important concept in microarray expression analysis and systems biology, is a collection of genes and/or their products (i.e. proteins) that have some features in common. There are many different ways to construct gene-sets, but a systematic organization of these ways is lacking. Gene-sets are mainly organized ad hoc in current public-domain databases, with group header names often determined by practical reasons (such as the types of technology in obtaining the gene-sets or a balanced number of gene-sets under a header). Here we aim at providing a gene-set organization principle according to the level at which genes are connected: homology, physical map proximity, chemical interaction, biological, and phenotypic-medical levels. We also distinguish two types of connections between genes: actual connection versus sharing of a label. Actual connections denote direct biological interactions, whereas shared label connection denotes shared membership in a group. Some extensions of the framework are also addressed such as overlapping of gene-sets, modules, and the incorporation of other non-protein-coding entities such as microRNAs.
Collapse
Affiliation(s)
- Wentian Li
- The Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institute for Medical Research, North Shore LIJ Health System, Manhasset, NY, USA.
| | - Jan Freudenberg
- The Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institute for Medical Research, North Shore LIJ Health System, Manhasset, NY, USA
| | - Michaela Oswald
- The Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institute for Medical Research, North Shore LIJ Health System, Manhasset, NY, USA
| |
Collapse
|
6
|
van Dam S, Craig T, de Magalhães JP. GeneFriends: a human RNA-seq-based gene and transcript co-expression database. Nucleic Acids Res 2014; 43:D1124-32. [PMID: 25361971 PMCID: PMC4383890 DOI: 10.1093/nar/gku1042] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Co-expression networks have proven effective at assigning putative functions to genes based on the functional annotation of their co-expressed partners, in candidate gene prioritization studies and in improving our understanding of regulatory networks. The growing number of genome resequencing efforts and genome-wide association studies often identify loci containing novel genes and there is a need to infer their functions and interaction partners. To facilitate this we have expanded GeneFriends, an online database that allows users to identify co-expressed genes with one or more user-defined genes. This expansion entails an RNA-seq-based co-expression map that includes genes and transcripts that are not present in the microarray-based co-expression maps, including over 10 000 non-coding RNAs. The results users obtain from GeneFriends include a co-expression network as well as a summary of the functional enrichment among the co-expressed genes. Novel insights can be gathered from this database for different splice variants and ncRNAs, such as microRNAs and lincRNAs. Furthermore, our updated tool allows candidate transcripts to be linked to diseases and processes using a guilt-by-association approach. GeneFriends is freely available from http://www.GeneFriends.org and can be used to quickly identify and rank candidate targets relevant to the process or disease under study.
Collapse
Affiliation(s)
- Sipko van Dam
- Integrative Genomics of Ageing Group, Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Thomas Craig
- Integrative Genomics of Ageing Group, Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - João Pedro de Magalhães
- Integrative Genomics of Ageing Group, Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| |
Collapse
|
7
|
Xu J, Lim SBH, Ng MY, Ali SM, Kausalya JP, Limviphuvadh V, Maurer-Stroh S, Hunziker W. ZO-1 regulates Erk, Smad1/5/8, Smad2, and RhoA activities to modulate self-renewal and differentiation of mouse embryonic stem cells. Stem Cells 2013; 30:1885-900. [PMID: 22782886 DOI: 10.1002/stem.1172] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
ZO-1/Tjp1 is a cytosolic adaptor that links tight junction (TJ) transmembrane proteins to the actin cytoskeleton and has also been implicated in regulating cell proliferation and differentiation by interacting with transcriptional regulators and signaling proteins. To explore possible roles for ZO-1 in mouse embryonic stem cells (mESCs), we inactivated the ZO-1 locus by homologous recombination. The lack of ZO-1 was found to affect mESC self-renewal and differentiation in the presence of leukemia-inhibiting factor (LIF) and Bmp4 or following removal of the growth factors. Our data suggest that ZO-1 suppresses Stat3 and Smad1/5/8 activities and sustains extracellular-signal-regulated kinase (Erk) activity to promote mESC differentiation. Interestingly, Smad2, critical for human but not mESC self-renewal, was hyperactivated in ZO-1(-/-) mESCs and RhoA protein levels were concomitantly enhanced, suggesting attenuation of the noncanonical transforming growth factor β (Tgfβ)/Activin/Nodal pathway that mediates ubiquitination and degradation of RhoA via the TJ proteins Occludin, Par6, and Smurf1 and activation of the canonical Smad2-dependent pathway. Furthermore, Bmp4-induced differentiation of mESCs in the absence of LIF was suppressed in ZO-1(-/-) mESCs, but differentiation down the neural or cardiac lineages was not disturbed. These findings reveal novel roles for ZO-1 in mESC self-renewal, pluripotency, and differentiation by influencing several signaling networks that regulate these processes. Possible implications for the differing relevance of Smad2 in mESC and human ESC self-renewal and how ZO-1 may connect to the different pathways are discussed.
Collapse
Affiliation(s)
- Jianliang Xu
- Epithelial Cell Biology Laboratory, Institute of Molecular and Cell Biology, Singapore
| | | | | | | | | | | | | | | |
Collapse
|
8
|
Li X, Wang X, Snyder M. Systematic investigation of protein-small molecule interactions. IUBMB Life 2012; 65:2-8. [PMID: 23225626 DOI: 10.1002/iub.1111] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Accepted: 10/03/2012] [Indexed: 01/01/2023]
Abstract
Cell signaling is extensively wired between cellular components to sustain cell proliferation, differentiation, and adaptation. The interaction network is often manifested in how protein function is regulated through interacting with other cellular components including small molecule metabolites. While many biochemical interactions have been established as reactions between protein enzymes and their substrates and products, much less is known at the system level about how small metabolites regulate protein functions through allosteric binding. In the past decade, study of protein-small molecule interactions has been lagging behind other types of interactions. Recent technological advances have explored several high-throughput platforms to reveal many "unexpected" protein-small molecule interactions that could have profound impact on our understanding of cell signaling. These interactions will help bridge gaps in existing regulatory loops of cell signaling and serve as new targets for medical intervention. In this review, we summarize recent advances of systematic investigation of protein-metabolite/small molecule interactions, and discuss the impact of such studies and their potential impact on both biological researches and medicine.
Collapse
Affiliation(s)
- Xiyan Li
- Department of Genetics, Stanford University, Stanford, CA, USA.
| | | | | |
Collapse
|
9
|
Sirota FL, Batagov A, Schneider G, Eisenhaber B, Eisenhaber F, Maurer-Stroh S. Beware of moving targets: reference proteome content fluctuates substantially over the years. J Bioinform Comput Biol 2012; 10:1250020. [PMID: 22867629 DOI: 10.1142/s0219720012500205] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Reference proteomes are generated by increasingly sophisticated annotation pipelines as part of regular genome build releases; yet, the corresponding changes in reference proteomes' content are dramatic. In the history of the NCBI-curated human proteome, the total number of entries has remained roughly constant but approximately half of the proteins from the 2003 build 33 are no longer represented by entries in current releases, while about the same number of new proteins have been added (for sequence identity thresholds 50-90%). Although mostly hypothetical proteins are affected, there are also spectacular cases of entry removal/addition of well studied proteins. The changes between the 2003 and recent human proteomes are in a similar order of magnitude as the differences between recent human and chimpanzee proteome releases. As an application example, we show that the proteome fluctuations affect the interpretation (about 74% of hits) of organelle-specific mass-spectrometry data. Although proteome quality tends to improve with more recent releases as, for example, the fraction of proteins with functional annotation has increased over time, existing evidence implies that, apparently, the proteome content still remains incomplete, not just pertaining to isoforms/sequence variants but also to proteins and their families that are clearly distinct.
Collapse
Affiliation(s)
- Fernanda L Sirota
- Bioinformatics Institute (BII), Agency for Science and Technology (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore.
| | | | | | | | | | | |
Collapse
|
10
|
García-Alonso L, Alonso R, Vidal E, Amadoz A, de María A, Minguez P, Medina I, Dopazo J. Discovering the hidden sub-network component in a ranked list of genes or proteins derived from genomic experiments. Nucleic Acids Res 2012; 40:e158. [PMID: 22844098 PMCID: PMC3488210 DOI: 10.1093/nar/gks699] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Genomic experiments (e.g. differential gene expression, single-nucleotide polymorphism association) typically produce ranked list of genes. We present a simple but powerful approach which uses protein–protein interaction data to detect sub-networks within such ranked lists of genes or proteins. We performed an exhaustive study of network parameters that allowed us concluding that the average number of components and the average number of nodes per component are the parameters that best discriminate between real and random networks. A novel aspect that increases the efficiency of this strategy in finding sub-networks is that, in addition to direct connections, also connections mediated by intermediate nodes are considered to build up the sub-networks. The possibility of using of such intermediate nodes makes this approach more robust to noise. It also overcomes some limitations intrinsic to experimental designs based on differential expression, in which some nodes are invariant across conditions. The proposed approach can also be used for candidate disease-gene prioritization. Here, we demonstrate the usefulness of the approach by means of several case examples that include a differential expression analysis in Fanconi Anemia, a genome-wide association study of bipolar disorder and a genome-scale study of essentiality in cancer genes. An efficient and easy-to-use web interface (available at http://www.babelomics.org) based on HTML5 technologies is also provided to run the algorithm and represent the network.
Collapse
Affiliation(s)
- Luz García-Alonso
- Department of Bioinformatics, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | | | | | | | | | | | | | | |
Collapse
|
11
|
Planas-Iglesias J, Guney E, García-García J, Robertson KA, Raza S, Freeman TC, Ghazal P, Oliva B. Extending signaling pathways with protein-interaction networks. Application to apoptosis. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2012; 16:245-56. [PMID: 22385281 DOI: 10.1089/omi.2011.0130] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Cells exploit signaling pathways during responses to environmental changes, and these processes are often modulated during disease. Particularly, relevant human pathologies such as cancer or viral infections require downregulating apoptosis signaling pathways to progress. As a result, the identification of proteins responsible for these changes is essential for the diagnostics and development of therapeutics. Transferring functional annotation within protein interaction networks has proven useful to identify such proteins, although this is not a trivial task. Here, we used different scoring methods to transfer annotation from 53 well-studied members of the human apoptosis pathways (as known by 2005) to their protein interactors. All scoring methods produced significant predictions (compared to a random negative model), but its number was too large to be useful. Thus, we made a final prediction using specific combinations of scoring methods and compared it to the proteins related to apoptosis signaling pathways during the last 5 years. We propose 273 candidate proteins that may be relevant in apoptosis signaling pathways. Although some of them have known functions consistent with their proposed apoptotsis involvement, the majority have not been annotated yet, leaving room for further experimental studies. We provide our predictions at http://sbi.imim.es/web/Apoptosis.php.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Structural Bioinformatics Group, GRIB, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Prussia A, Thepchatri P, Snyder JP, Plemper RK. Systematic approaches towards the development of host-directed antiviral therapeutics. Int J Mol Sci 2011; 12:4027-52. [PMID: 21747723 PMCID: PMC3131607 DOI: 10.3390/ijms12064027] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2011] [Revised: 06/03/2011] [Accepted: 06/03/2011] [Indexed: 12/11/2022] Open
Abstract
Since the onset of antiviral therapy, viral resistance has compromised the clinical value of small-molecule drugs targeting pathogen components. As intracellular parasites, viruses complete their life cycle by hijacking a multitude of host-factors. Aiming at the latter rather than the pathogen directly, host-directed antiviral therapy has emerged as a concept to counteract evolution of viral resistance and develop broad-spectrum drug classes. This approach is propelled by bioinformatics analysis of genome-wide screens that greatly enhance insights into the complex network of host-pathogen interactions and generate a shortlist of potential gene targets from a multitude of candidates, thus setting the stage for a new era of rational identification of drug targets for host-directed antiviral therapies. With particular emphasis on human immunodeficiency virus and influenza virus, two major human pathogens, we review screens employed to elucidate host-pathogen interactions and discuss the state of database ontology approaches applicable to defining a therapeutic endpoint. The value of this strategy for drug discovery is evaluated, and perspectives for bioinformatics-driven hit identification are outlined.
Collapse
Affiliation(s)
- Andrew Prussia
- Department of Chemistry, Emory University, Atlanta, GA 30322, USA; E-Mails: (A.P.); (J.P.S.)
- Emory Institute for Drug Discovery (EIDD), Emory University, Atlanta, GA 30322, USA
| | - Pahk Thepchatri
- Department of Chemistry, Emory University, Atlanta, GA 30322, USA; E-Mails: (A.P.); (J.P.S.)
- Emory Institute for Drug Discovery (EIDD), Emory University, Atlanta, GA 30322, USA
- Authors to whom correspondence should be addressed; E-Mails: (P.T.); (R.K.P.); Tel.: +1-404-593-0547 (P.T.); +1-404-727-1605 (R.K.P.); Fax: +1-404-727-9223 (P.T.); +1-404-727-9223 (R.K.P.)
| | - James P. Snyder
- Department of Chemistry, Emory University, Atlanta, GA 30322, USA; E-Mails: (A.P.); (J.P.S.)
- Emory Institute for Drug Discovery (EIDD), Emory University, Atlanta, GA 30322, USA
| | - Richard K. Plemper
- Department of Pediatrics, Emory University, Atlanta, GA 30322, USA
- Children’s Healthcare of Atlanta, Atlanta, GA 30322, USA
- Authors to whom correspondence should be addressed; E-Mails: (P.T.); (R.K.P.); Tel.: +1-404-593-0547 (P.T.); +1-404-727-1605 (R.K.P.); Fax: +1-404-727-9223 (P.T.); +1-404-727-9223 (R.K.P.)
| |
Collapse
|
13
|
Gu H, Zhu P, Jiao Y, Meng Y, Chen M. PRIN: a predicted rice interactome network. BMC Bioinformatics 2011; 12:161. [PMID: 21575196 PMCID: PMC3118165 DOI: 10.1186/1471-2105-12-161] [Citation(s) in RCA: 131] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2010] [Accepted: 05/16/2011] [Indexed: 12/22/2022] Open
Abstract
Background Protein-protein interactions play a fundamental role in elucidating the molecular mechanisms of biomolecular function, signal transductions and metabolic pathways of living organisms. Although high-throughput technologies such as yeast two-hybrid system and affinity purification followed by mass spectrometry are widely used in model organisms, the progress of protein-protein interactions detection in plants is rather slow. With this motivation, our work presents a computational approach to predict protein-protein interactions in Oryza sativa. Results To better understand the interactions of proteins in Oryza sativa, we have developed PRIN, a Predicted Rice Interactome Network. Protein-protein interaction data of PRIN are based on the interologs of six model organisms where large-scale protein-protein interaction experiments have been applied: yeast (Saccharomyces cerevisiae), worm (Caenorhabditis elegans), fruit fly (Drosophila melanogaster), human (Homo sapiens), Escherichia coli K12 and Arabidopsis thaliana. With certain quality controls, altogether we obtained 76,585 non-redundant rice protein interaction pairs among 5,049 rice proteins. Further analysis showed that the topology properties of predicted rice protein interaction network are more similar to yeast than to the other 5 organisms. This may not be surprising as the interologs based on yeast contribute nearly 74% of total interactions. In addition, GO annotation, subcellular localization information and gene expression data are also mapped to our network for validation. Finally, a user-friendly web interface was developed to offer convenient database search and network visualization. Conclusions PRIN is the first well annotated protein interaction database for the important model plant Oryza sativa. It has greatly extended the current available protein-protein interaction data of rice with a computational approach, which will certainly provide further insights into rice functional genomics and systems biology. PRIN is available online at http://bis.zju.edu.cn/prin/.
Collapse
Affiliation(s)
- Haibin Gu
- Department of Bioinformatics, State Key Laboratory of Plant Physiology and Biochemistry, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | | | | | | | | |
Collapse
|
14
|
Dietz KJ, Jacquot JP, Harris G. Hubs and bottlenecks in plant molecular signalling networks. THE NEW PHYTOLOGIST 2010; 188:919-38. [PMID: 20958306 DOI: 10.1111/j.1469-8137.2010.03502.x] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Conditional control of plant cell function and development relies on appropriate signal perception, signal integration and processing. The development of high throughput technologies such as proteomics and interactomics has enabled the identification of protein interaction networks that mediate signal processing from inputs to appropriate outputs. Such networks can be depicted in graphical representations using nodes and edges allowing for the immediate visualization and analysis of the network's topology. Hubs are network elements characterized by many edges (often degree grade k ≥ 5) which confer a degree of topological importance to them. The review introduces the concept of networks, hubs and bottlenecks and describes four examples from plant science in more detail, namely hubs in the redox regulatory network of the chloroplast with ferredoxin, thioredoxin and peroxiredoxin, in mitogen activated protein (MAP) kinase signal processing, in photomorphogenesis with the COP9 signalosome, COP1 and CDD, and monomeric GTPase function. Some guidance is provided to appropriate internet resources, web repositories, databases and their use. Plant networks can be generated from existing public databases and this type of analysis is valuable in support of existing hypotheses, or to allow for the generation of new concepts or ideas. However, intensive manual curating of in silico networks is still always necessary.
Collapse
Affiliation(s)
- Karl-Josef Dietz
- Plant Biochemistry and Physiology, Bielefeld University, D-33501 Bielefeld, Germany.
| | | | | |
Collapse
|
15
|
Abstract
Binary subcomplexes in proteins database (BISC) is a new protein-protein interaction (PPI) database linking up the two communities most active in their characterization: structural biology and functional genomics researchers. The BISC resource offers users (i) a structural perspective and related information about binary subcomplexes (i.e. physical direct interactions between proteins) that are either structurally characterized or modellable entries in the main functional genomics PPI databases BioGRID, IntAct and HPRD; (ii) selected web services to further investigate the validity of postulated PPI by inspection of their hypothetical modelled interfaces. Among other uses we envision that this resource can help identify possible false positive PPI in current database records. BISC is freely available at http://bisc.cse.ucsc.edu.
Collapse
|