1
|
Duhan N, Kaundal R. HuCoPIA: An Atlas of Human vs. SARS-CoV-2 Interactome and the Comparative Analysis with Other Coronaviridae Family Viruses. Viruses 2023; 15:492. [PMID: 36851706 PMCID: PMC9962590 DOI: 10.3390/v15020492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 02/01/2023] [Accepted: 02/04/2023] [Indexed: 02/12/2023] Open
Abstract
SARS-CoV-2, a novel betacoronavirus strain, has caused a pandemic that has claimed the lives of nearly 6.7M people worldwide. Vaccines and medicines are being developed around the world to reduce the disease spread, fatality rates, and control the new variants. Understanding the protein-protein interaction mechanism of SARS-CoV-2 in humans, and their comparison with the previous SARS-CoV and MERS strains, is crucial for these efforts. These interactions might be used to assess vaccination effectiveness, diagnose exposure, and produce effective biotherapeutics. Here, we present the HuCoPIA database, which contains approximately 100,000 protein-protein interactions between humans and three strains (SARS-CoV-2, SARS-CoV, and MERS) of betacoronavirus. The interactions in the database are divided into common interactions between all three strains and those unique to each strain. It also contains relevant functional annotation information of human proteins. The HuCoPIA database contains SARS-CoV-2 (41,173), SARS-CoV (31,997), and MERS (26,862) interactions, with functional annotation of human proteins like subcellular localization, tissue-expression, KEGG pathways, and Gene ontology information. We believe HuCoPIA will serve as an invaluable resource to diverse experimental biologists, and will help to advance the research in better understanding the mechanism of betacoronaviruses.
Collapse
Affiliation(s)
- Naveen Duhan
- Department of Plants, Soils, and Climate/Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
- Bioinformatics Facility, Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate/Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
- Bioinformatics Facility, Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
- Department of Computer Science, College of Science, Utah State University, Logan, UT 84322, USA
| |
Collapse
|
2
|
TritiKBdb: A Functional Annotation Resource for Deciphering the Complete Interaction Networks in Wheat-Karnal Bunt Pathosystem. Int J Mol Sci 2022; 23:ijms23137455. [PMID: 35806459 PMCID: PMC9267065 DOI: 10.3390/ijms23137455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 06/30/2022] [Accepted: 06/30/2022] [Indexed: 02/01/2023] Open
Abstract
The study of molecular interactions, especially the inter-species protein-protein interactions, is crucial for understanding the disease infection mechanism in plants. These interactions play an important role in disease infection and host immune responses against pathogen attack. Among various critical fungal diseases, the incidences of Karnal bunt (Tilletia indica) around the world have hindered the export of the crops such as wheat from infected regions, thus causing substantial economic losses. Due to sparse information on T. indica, limited insight is available with regard to gaining in-depth knowledge of the interaction mechanisms between the host and pathogen proteins during the disease infection process. Here, we report the development of a comprehensive database and webserver, TritiKBdb, that implements various tools to study the protein-protein interactions in the Triticum species-Tilletia indica pathosystem. The novel ‘interactomics’ tool allows the user to visualize/compare the networks of the predicted interactions in an enriched manner. TritiKBdb is a user-friendly database that provides functional annotations such as subcellular localization, available domains, KEGG pathways, and GO terms of the host and pathogen proteins. Additionally, the information about the host and pathogen proteins that serve as transcription factors and effectors, respectively, is also made available. We believe that TritiKBdb will serve as a beneficial resource for the research community, and aid the community in better understanding the infection mechanisms of Karnal bunt and its interactions with wheat. The database is freely available for public use at http://bioinfo.usu.edu/tritikbdb/.
Collapse
|
3
|
Zhang HB, Ding XB, Jin J, Guo WP, Yang QL, Chen PC, Yao H, Ruan L, Tao YT, Chen X. Predicted mouse interactome and network-based interpretation of differentially expressed genes. PLoS One 2022; 17:e0264174. [PMID: 35390003 PMCID: PMC8989236 DOI: 10.1371/journal.pone.0264174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Accepted: 02/04/2022] [Indexed: 11/18/2022] Open
Abstract
The house mouse or Mus musculus has become a premier mammalian model for genetic research due to its genetic and physiological similarities to humans. It brought mechanistic insights into numerous human diseases and has been routinely used to assess drug efficiency and toxicity, as well as to predict patient responses. To facilitate molecular mechanism studies in mouse, we present the Mouse Interactome Database (MID, Version 1), which includes 155,887 putative functional associations between mouse protein-coding genes inferred from functional association evidence integrated from 9 public databases. These putative functional associations are expected to cover 19.32% of all mouse protein interactions, and 26.02% of these function associations may represent protein interactions. On top of MID, we developed a gene set linkage analysis (GSLA) web tool to annotate potential functional impacts from observed differentially expressed genes. Two case studies show that the MID/GSLA system provided precise and informative annotations that other widely used gene set annotation tools, such as PANTHER and DAVID, did not. Both MID and GSLA are accessible through the website http://mouse.biomedtzc.cn.
Collapse
Affiliation(s)
- Hai-Bo Zhang
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, Taizhou, China
| | - Xiao-Bao Ding
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, Taizhou, China
| | - Jie Jin
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, Taizhou, China
| | - Wen-Ping Guo
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, Taizhou, China
| | - Qiao-Lei Yang
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China
| | - Peng-Cheng Chen
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China
| | - Heng Yao
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China
| | - Li Ruan
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, Taizhou, China
| | - Yu-Tian Tao
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, Taizhou, China
- * E-mail: (YTT); (XC)
| | - Xin Chen
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, Taizhou, China
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China
- Joint Institute for Genetics and Genome Medicine between Zhejiang University and University of Toronto, Zhejiang University, Hangzhou, China
- * E-mail: (YTT); (XC)
| |
Collapse
|
4
|
Deciphering the Host-Pathogen Interactome of the Wheat-Common Bunt System: A Step towards Enhanced Resilience in Next Generation Wheat. Int J Mol Sci 2022; 23:ijms23052589. [PMID: 35269732 PMCID: PMC8910311 DOI: 10.3390/ijms23052589] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 02/09/2022] [Indexed: 02/05/2023] Open
Abstract
Common bunt, caused by two fungal species, Tilletia caries and Tilletia laevis, is one of the most potentially destructive diseases of wheat. Despite the availability of synthetic chemicals against the disease, organic agriculture relies greatly on resistant cultivars. Using two computational approaches—interolog and domain-based methods—a total of approximately 58 M and 56 M probable PPIs were predicted in T. aestivum–T. caries and T. aestivum–T. laevis interactomes, respectively. We also identified 648 and 575 effectors in the interactions from T. caries and T. laevis, respectively. The major host hubs belonged to the serine/threonine protein kinase, hsp70, and mitogen-activated protein kinase families, which are actively involved in plant immune signaling during stress conditions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis of the host proteins revealed significant GO terms (O-methyltransferase activity, regulation of response to stimulus, and plastid envelope) and pathways (NF-kappa B signaling and the MAPK signaling pathway) related to plant defense against pathogens. Subcellular localization suggested that most of the pathogen proteins target the host in the plastid. Furthermore, a comparison between unique T. caries and T. laevis proteins was carried out. We also identified novel host candidates that are resistant to disease. Additionally, the host proteins that serve as transcription factors were also predicted.
Collapse
|
5
|
Kataria R, Kaundal R. Deciphering the Crosstalk Mechanisms of Wheat-Stem Rust Pathosystem: Genome-Scale Prediction Unravels Novel Host Targets. FRONTIERS IN PLANT SCIENCE 2022; 13:895480. [PMID: 35800602 PMCID: PMC9253690 DOI: 10.3389/fpls.2022.895480] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 05/31/2022] [Indexed: 05/04/2023]
Abstract
Triticum aestivum (wheat), a major staple food grain, is affected by various biotic stresses. Among these, fungal diseases cause about 15-20% of yield loss, worldwide. In this study, we performed a comparative analysis of protein-protein interactions between two Puccinia graminis races (Pgt 21-0 and Pgt Ug99) that cause stem (black) rust in wheat. The available molecular techniques to study the host-pathogen interaction mechanisms are expensive and labor-intensive. We implemented two computational approaches (interolog and domain-based) for the prediction of PPIs and performed various functional analysis to determine the significant differences between the two pathogen races. The analysis revealed that T. aestivum-Pgt 21-0 and T. aestivum-Pgt Ug99 interactomes consisted of ∼90M and ∼56M putative PPIs, respectively. In the predicted PPIs, we identified 115 Pgt 21-0 and 34 Pgt Ug99 potential effectors that were highly involved in pathogen virulence and development. Functional enrichment analysis of the host proteins revealed significant GO terms and KEGG pathways such as O-methyltransferase activity (GO:0008171), regulation of signal transduction (GO:0009966), lignin metabolic process (GO:0009808), plastid envelope (GO:0009526), plant-pathogen interaction pathway (ko04626), and MAPK pathway (ko04016) that are actively involved in plant defense and immune signaling against the biotic stresses. Subcellular localization analysis anticipated the host plastid as a primary target for pathogen attack. The highly connected host hubs in the protein interaction network belonged to protein kinase domain including Ser/Thr protein kinase, MAPK, and cyclin-dependent kinase. We also identified 5,577 transcription factors in the interactions, associated with plant defense during biotic stress conditions. Additionally, novel host targets that are resistant to stem rust disease were also identified. The present study elucidates the functional differences between Pgt 21-0 and Pgt Ug99, thus providing the researchers with strain-specific information for further experimental validation of the interactions, and the development of durable, disease-resistant crop lines.
Collapse
Affiliation(s)
- Raghav Kataria
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
- Bioinformatics Facility, Center for Integrated BioSystems, Utah State University, Logan, UT, United States
- Department of Computer Science, College of Science, Utah State University, Logan, UT, United States
- *Correspondence: Rakesh Kaundal,
| |
Collapse
|
6
|
Gupta SK, Ponte-Sucre A, Bencurova E, Dandekar T. An Ebola, Neisseria and Trypanosoma human protein interaction census reveals a conserved human protein cluster targeted by various human pathogens. Comput Struct Biotechnol J 2021; 19:5292-5308. [PMID: 34745452 PMCID: PMC8531761 DOI: 10.1016/j.csbj.2021.09.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 09/14/2021] [Accepted: 09/15/2021] [Indexed: 12/28/2022] Open
Abstract
Filovirus ebolavirus (ZE; Zaire ebolavirus, Bundibugyo ebolavirus), Neisseria meningitidis (NM), and Trypanosoma brucei (Tb) are serious infectious pathogens, spanning viruses, bacteria and protists and all may target the blood and central nervous system during their life cycle. NM and Tb are extracellular pathogens while ZE is obligatory intracellular, targetting immune privileged sites. By using interactomics and comparative evolutionary analysis we studied whether conserved human proteins are targeted by these pathogens. We examined 2797 unique pathogen-targeted human proteins. The information derived from orthology searches of experimentally validated protein-protein interactions (PPIs) resulted both in unique and shared PPIs for each pathogen. Comparing and analyzing conserved and pathogen-specific infection pathways for NM, TB and ZE, we identified human proteins predicted to be targeted in at least two of the compared host-pathogen networks. However, four proteins were common to all three host-pathogen interactomes: the elongation factor 1-alpha 1 (EEF1A1), the SWI/SNF complex subunit SMARCC2 (matrix-associated actin-dependent regulator of chromatin subfamily C), the dolichyl-diphosphooligosaccharide--protein glycosyltransferase subunit 1 (RPN1), and the tubulin beta-5 chain (TUBB). These four human proteins all are also involved in cytoskeleton and its regulation and are often addressed by various human pathogens. Specifically, we found (i) 56 human pathogenic bacteria and viruses that target these four proteins, (ii) the well researched new pandemic pathogen SARS-CoV-2 targets two of these four human proteins and (iii) nine human pathogenic fungi (yet another evolutionary distant organism group) target three of the conserved proteins by 130 high confidence interactions.
Collapse
Affiliation(s)
- Shishir K Gupta
- Functional Genomics & Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, 97074 Würzburg, Germany
- Evolutionary Genomics Group, Center for Computational and Theoretical Biology, University of Würzburg, 97078 Würzburg, Germany
| | - Alicia Ponte-Sucre
- Laboratorio de Fisiología Molecular, Instituto de Medicina Experimental, Escuela Luis Razetti, Universidad Central de Venezuela, Caracas, Venezuela
- Medical Mission Institute, Hermann-Schell-Str. 7, 97074 Würzburg, Germany
| | - Elena Bencurova
- Functional Genomics & Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, 97074 Würzburg, Germany
| | - Thomas Dandekar
- Functional Genomics & Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, University of Würzburg, 97074 Würzburg, Germany
- EMBL Heidelberg, BioComputing Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| |
Collapse
|
7
|
Hollander M, Do T, Will T, Helms V. Detecting Rewiring Events in Protein-Protein Interaction Networks Based on Transcriptomic Data. FRONTIERS IN BIOINFORMATICS 2021; 1:724297. [PMID: 36303788 PMCID: PMC9581068 DOI: 10.3389/fbinf.2021.724297] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Accepted: 08/23/2021] [Indexed: 12/25/2022] Open
Abstract
Proteins rarely carry out their cellular functions in isolation. Instead, eukaryotic proteins engage in about six interactions with other proteins on average. The aggregated protein interactome of an organism forms a “hairy ball”-type protein-protein interaction (PPI) network. Yet, in a typical human cell, only about half of all proteins are expressed at a particular time. Hence, it has become common practice to prune the full PPI network to the subset of expressed proteins. If RNAseq data is available, one can further resolve the specific protein isoforms present in a cell or tissue. Here, we review various approaches, software tools and webservices that enable users to construct context-specific or tissue-specific PPI networks and how these are rewired between two cellular conditions. We illustrate their different functionalities on the example of the interactions involving the human TNR6 protein. In an outlook, we describe how PPI networks may be integrated with epigenetic data or with data on the activity of splicing factors.
Collapse
|
8
|
Zhou Y, Chen H, Li S, Chen M. mPPI: a database extension to visualize structural interactome in a one-to-many manner. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6307707. [PMID: 34156447 DOI: 10.1093/database/baab036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 05/10/2021] [Accepted: 05/28/2021] [Indexed: 01/02/2023]
Abstract
Protein-protein interaction (PPI) databases with structural information are useful to investigate biological functions at both systematic and atomic levels. However, most existing PPI databases only curate binary interactome. From the perspective of the display and function of PPI, as well as the structural binding interface, the related database and resources are summarized. We developed a database extension, named mPPI, for PPI structural visualization. Comparing with the existing structural interactomes that curate resolved PPI conformation in pairs, mPPI can visualize target protein and its multiple interactors simultaneously, which facilitates multi-target drug discovery and structure prediction of protein macro-complexes. By employing a protein-protein docking algorithm, mPPI largely extends the coverage of structural interactome from experimentally resolved complexes. mPPI is designed to be a customizable and convenient plugin for PPI databases. It possesses wide potential applications for various PPI databases, and it has been used for a neurodegenerative disease-related PPI database as demonstration. Scripts and implementation guidelines of mPPI are documented at the database tool website. Database URL http://bis.zju.edu.cn/mppi/.
Collapse
Affiliation(s)
- Yekai Zhou
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.,Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| | - Hongjun Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Sida Li
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ming Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China.,Bioinformatics Center, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
9
|
Guo WP, Ding XB, Jin J, Zhang HB, Yang QL, Chen PC, Yao H, Ruan LI, Tao YT, Chen X. HIR V2: a human interactome resource for the biological interpretation of differentially expressed genes via gene set linkage analysis. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6156843. [PMID: 33677507 PMCID: PMC7937034 DOI: 10.1093/database/baab009] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 02/09/2021] [Accepted: 02/19/2021] [Indexed: 12/17/2022]
Abstract
To facilitate biomedical studies of disease mechanisms, a high-quality interactome that connects functionally related genes is needed to help investigators formulate pathway hypotheses and to interpret the biological logic of a phenotype at the biological process level. Interactions in the updated version of the human interactome resource (HIR V2) were inferred from 36 mathematical characterizations of six types of data that suggest functional associations between genes. This update of the HIR consists of 88 069 pairs of genes (23.2% functional interactions of HIR V2 are in common with the previous version of HIR), representing functional associations that are of strengths similar to those between well-studied protein interactions. Among these functional interactions, 57% may represent protein interactions, which are expected to cover 32% of the true human protein interactome. The gene set linkage analysis (GSLA) tool is developed based on the high-quality HIR V2 to identify the potential functional impacts of the observed transcriptomic changes, helping to elucidate their biological significance and complementing the currently widely used enrichment-based gene set interpretation tools. A case study shows that the annotations reported by the HIR V2/GSLA system are more comprehensive and concise compared to those obtained by the widely used gene set annotation tools such as PANTHER and DAVID. The HIR V2 and GSLA are available at http://human.biomedtzc.cn.
Collapse
Affiliation(s)
- Wen-Ping Guo
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou City, Zhejiang Province, Taizhou 318000, China
| | - Xiao-Bao Ding
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou City, Zhejiang Province, Taizhou 318000, China
| | - Jie Jin
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou City, Zhejiang Province, Taizhou 318000, China
| | - Hai-Bo Zhang
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou City, Zhejiang Province, Taizhou 318000, China
| | - Qiao-Lei Yang
- Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhangtang Road, Xihu District, Hangzhou City, Zhejiang Province, Hangzhou 310058, China
| | - Peng-Cheng Chen
- Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhangtang Road, Xihu District, Hangzhou City, Zhejiang Province, Hangzhou 310058, China
| | - Heng Yao
- Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhangtang Road, Xihu District, Hangzhou City, Zhejiang Province, Hangzhou 310058, China
| | - L I Ruan
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou City, Zhejiang Province, Taizhou 318000, China
| | - Yu-Tian Tao
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou City, Zhejiang Province, Taizhou 318000, China
| | - Xin Chen
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou City, Zhejiang Province, Taizhou 318000, China.,Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhangtang Road, Xihu District, Hangzhou City, Zhejiang Province, Hangzhou 310058, China.,Joint Institute for Genetics and Genome Medicine between Zhejiang University and University of Toronto, Zhejiang University, 866 Yuhangtang Road, Xihu District, Hangzhou City, Zhejiang Province, Hangzhou 310058, China
| |
Collapse
|
10
|
Ding XB, Jin J, Tao YT, Guo WP, Ruan L, Yang QL, Chen PC, Yao H, Zhang HB, Chen X. Predicted Drosophila Interactome Resource and web tool for functional interpretation of differentially expressed genes. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2020:5756140. [PMID: 32103267 DOI: 10.1093/database/baaa005] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 12/03/2019] [Accepted: 01/13/2020] [Indexed: 12/14/2022]
Abstract
Drosophila melanogaster is a well-established model organism that is widely used in genetic studies. This species enjoys the availability of a wide range of research tools, well-annotated reference databases and highly similar gene circuitry to other insects. To facilitate molecular mechanism studies in Drosophila, we present the Predicted Drosophila Interactome Resource (PDIR), a database of high-quality predicted functional gene interactions. These interactions were inferred from evidence in 10 public databases providing information for functional gene interactions from diverse perspectives. The current version of PDIR includes 102 835 putative functional associations with balanced sensitivity and specificity, which are expected to cover 22.56% of all Drosophila protein interactions. This set of functional interactions is a good reference for hypothesis formulation in molecular mechanism studies. At the same time, these interactions also serve as a high-quality reference interactome for gene set linkage analysis (GSLA), which is a web tool for the interpretation of the potential functional impacts of a set of changed genes observed in transcriptomics analyses. In a case study, we show that the PDIR/GSLA system was able to produce a more comprehensive and concise interpretation of the collective functional impact of multiple simultaneously changed genes compared with the widely used gene set annotation tools, including PANTHER and David. PDIR and its associated GSLA service can be accessed at http://drosophila.biomedtzc.cn.
Collapse
Affiliation(s)
- Xiao-Bao Ding
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou 318000, China
| | - Jie Jin
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou 318000, China
| | - Yu-Tian Tao
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou 318000, China
| | - Wen-Ping Guo
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou 318000, China
| | - Li Ruan
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou 318000, China
| | - Qiao-Lei Yang
- Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhantang Rd, Hangzhou 310058, China
| | - Peng-Cheng Chen
- Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhantang Rd, Hangzhou 310058, China
| | - Heng Yao
- Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhantang Rd, Hangzhou 310058, China
| | - Hai-Bo Zhang
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou 318000, China
| | - Xin Chen
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou 318000, China.,Institute of Pharmaceutical Biotechnology and the First Affiliated Hospital Department of Radiation Oncology, Zhejiang University School of Medicine, 866 Yuhantang Rd, Hangzhou 310058, China.,Joint Institute for Genetics and Genome Medicine between Zhejiang University and University of Toronto, Zhejiang University, 866 Yuhantang Rd, Hangzhou 310058, China
| |
Collapse
|
11
|
Abstract
Proteins are major functional molecules that physically and functionally interact to carry out cellular processes. The physical interactions are generally mediated by domain-level interactions. Thus, novel protein-protein interactions can be predicted using various computational methods based on domain-domain interactions, using resolved structures of protein complexes. Functional protein interactions can be inferred based on shared domains between proteins, since proteins involved in the same biological processes tend to harbor common domains. We recently developed a method of inferring functional interactions between proteins using associations between their domain compositions, which can be represented as domain profiles. Since the method requires only protein domain annotations, it can be easily applied to any species with a sequenced genome. Here, we describe in detail the method of generating domain profiles for proteins and measuring the association between them to infer functional interactions between proteins. We also demonstrate that domain profile association can be used to successfully construct a large-scale functional network of human proteins.
Collapse
Affiliation(s)
- Jung Eun Shim
- Yonsei Biomedical Research Institute, Yonsei University College of Medicine, Seoul, South Korea.
| | - Insuk Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, South Korea.
| |
Collapse
|
12
|
Pathogen and Host-Pathogen Protein Interactions Provide a Key to Identify Novel Drug Targets. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11607-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
|
13
|
Kataria R, Duhan N, Kaundal R. Computational Systems Biology of Alfalfa - Bacterial Blight Host-Pathogen Interactions: Uncovering the Complex Molecular Networks for Developing Durable Disease Resistant Crop. FRONTIERS IN PLANT SCIENCE 2021; 12:807354. [PMID: 35251063 PMCID: PMC8891223 DOI: 10.3389/fpls.2021.807354] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 12/29/2021] [Indexed: 05/04/2023]
Abstract
Medicago sativa (also known as alfalfa), a forage legume, is widely cultivated due to its high yield and high-value hay crop production. Infectious diseases are a major threat to the crops, owing to huge economic losses to the agriculture industry, worldwide. The protein-protein interactions (PPIs) between the pathogens and their hosts play a critical role in understanding the molecular basis of pathogenesis. Pseudomonas syringae pv. syringae ALF3 suppresses the plant's innate immune response by secreting type III effector proteins into the host cell, causing bacterial stem blight in alfalfa. The alfalfa-P. syringae system has little information available for PPIs. Thus, to understand the infection mechanism, we elucidated the genome-scale host-pathogen interactions (HPIs) between alfalfa and P. syringae using two computational approaches: interolog-based and domain-based method. A total of ∼14 M putative PPIs were predicted between 50,629 alfalfa proteins and 2,932 P. syringae proteins by combining these approaches. Additionally, ∼0.7 M consensus PPIs were also predicted. The functional analysis revealed that P. syringae proteins are highly involved in nucleotide binding activity (GO:0000166), intracellular organelle (GO:0043229), and translation (GO:0006412) while alfalfa proteins are involved in cellular response to chemical stimulus (GO:0070887), oxidoreductase activity (GO:0016614), and Golgi apparatus (GO:0005794). According to subcellular localization predictions, most of the pathogen proteins targeted host proteins within the cytoplasm and nucleus. In addition, we discovered a slew of new virulence effectors in the predicted HPIs. The current research describes an integrated approach for deciphering genome-scale host-pathogen PPIs between alfalfa and P. syringae, allowing the researchers to better understand the pathogen's infection mechanism and develop pathogen-resistant lines.
Collapse
Affiliation(s)
- Raghav Kataria
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
| | - Naveen Duhan
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
- Bioinformatics Facility, Center for Integrated Biosystems, Utah State University, Logan, UT, United States
- Department of Computer Science, College of Science, Utah State University, Logan, UT, United States
- *Correspondence: Rakesh Kaundal, ;
| |
Collapse
|
14
|
Chen PC, Ruan L, Jin J, Tao YT, Ding XB, Zhang HB, Guo WP, Yang QL, Yao H, Chen X. Predicted functional interactome of Caenorhabditis elegans and a web tool for the functional interpretation of differentially expressed genes. Biol Direct 2020; 15:20. [PMID: 33076954 PMCID: PMC7574172 DOI: 10.1186/s13062-020-00271-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2019] [Accepted: 09/23/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The nematode worm, Caenorhabditis elegans, is a saprophytic species that has been emerging as a standard model organism since the early 1960s. This species is useful in numerous fields, including developmental biology, neurobiology, and ageing. A high-quality comprehensive molecular interaction network is needed to facilitate molecular mechanism studies in C. elegans. RESULTS We present the predicted functional interactome of Caenorhabditis elegans (FIC), which integrates functional association data from 10 public databases to infer functional gene interactions on diverse functional perspectives. In this work, FIC includes 108,550 putative functional associations with balanced sensitivity and specificity, which are expected to cover 21.42% of all C. elegans protein interactions, and 29.25% of these associations may represent protein interactions. Based on FIC, we developed a gene set linkage analysis (GSLA) web tool to interpret potential functional impacts from a set of differentially expressed genes observed in transcriptome analyses. CONCLUSION We present the predicted C. elegans interactome database FIC, which is a high-quality database of predicted functional interactions among genes. The functional interactions in FIC serve as a good reference interactome for GSLA to annotate differentially expressed genes for their potential functional impacts. In a case study, the FIC/GSLA system shows more comprehensive and concise annotations compared to other widely used gene set annotation tools, including PANTHER and DAVID. FIC and its associated GSLA are available at the website http://worm.biomedtzc.cn .
Collapse
Affiliation(s)
- Peng-Cheng Chen
- Institute of Pharmaceutical Biotechnology of Zhejiang University School of Medicine and Department of Radiology of the First Affiliated Hospital, Hangzhou, 310058, China
| | - Li Ruan
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China
| | - Jie Jin
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China
| | - Yu-Tian Tao
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China
| | - Xiao-Bao Ding
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China
| | - Hai-Bo Zhang
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China
| | - Wen-Ping Guo
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China
| | - Qiao-Lei Yang
- Institute of Pharmaceutical Biotechnology of Zhejiang University School of Medicine and Department of Radiology of the First Affiliated Hospital, Hangzhou, 310058, China
| | - Heng Yao
- Institute of Pharmaceutical Biotechnology of Zhejiang University School of Medicine and Department of Radiology of the First Affiliated Hospital, Hangzhou, 310058, China
| | - Xin Chen
- Institute of Pharmaceutical Biotechnology of Zhejiang University School of Medicine and Department of Radiology of the First Affiliated Hospital, Hangzhou, 310058, China. .,Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, 318000, China. .,Joint Institute for Genetics and Genome Medicine between Zhejiang University and University of Toronto, Zhejiang University, Hangzhou, 310058, China.
| |
Collapse
|
15
|
Jin J, Tao YT, Ding XB, Guo WP, Ruan L, Yang QL, Chen PC, Yao H, Zhang HB, Chen X. Predicted yeast interactome and network-based interpretation of transcriptionally changed genes. Yeast 2020; 37:573-583. [PMID: 32738156 DOI: 10.1002/yea.3516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 06/25/2020] [Accepted: 07/26/2020] [Indexed: 11/06/2022] Open
Abstract
Saccharomyces cerevisiae, budding yeast, is a widely used model organism and research tool in genetics studies. Many efforts have been directed at constructing a high-quality comprehensive molecular interaction network to elucidate the design logic of the gene circuitries in this classic model organism. In this work, we present the yeast interactome resource (YIR), which includes 22,238 putative functional gene interactions inferred from functional gene association data integrated from 10 databases focusing on diverse functional perspectives. These putative functional gene interactions are expected to cover 18.84% of yeast protein interactions, and 38.49% may represent protein interactions. Based on the YIR, a gene set linkage analysis (GSLA) web tool was developed to annotate the potential functional impacts of a set of transcriptionally changed genes. In a case study, we show that the YIR/GSLA system produced more extensive and concise annotations compared with widely used gene set annotation tools, including PANTHER and DAVID. Both YIR and GSLA are accessible through the website http://yeast.biomedtzc.cn.
Collapse
Affiliation(s)
- Jie Jin
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, China
| | - Yu-Tian Tao
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, China
| | - Xiao-Bao Ding
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, China
| | - Wen-Ping Guo
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, China
| | - Li Ruan
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, China
| | - Qiao-Lei Yang
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China
| | - Peng-Cheng Chen
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China
| | - Heng Yao
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China
| | - Hai-Bo Zhang
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, China
| | - Xin Chen
- Institute of Big Data and Artificial Intelligence in Medicine, School of Electronics and Information Engineering, Taizhou University, Taizhou, China.,Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, Hangzhou, China.,Joint Institute for Genetics and Genome Medicine, Zhejiang University and University of Toronto, Zhejiang University, Hangzhou, China
| |
Collapse
|
16
|
Loaiza CD, Duhan N, Lister M, Kaundal R. In silico prediction of host-pathogen protein interactions in melioidosis pathogen Burkholderia pseudomallei and human reveals novel virulence factors and their targets. Brief Bioinform 2020; 22:5842243. [PMID: 32444871 DOI: 10.1093/bib/bbz162] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 11/13/2019] [Accepted: 11/20/2019] [Indexed: 12/13/2022] Open
Abstract
The aerobic, Gram-negative motile bacillus, Burkholderia pseudomallei is a facultative intracellular bacterium causing melioidosis, a critical disease of public health importance, which is widely endemic in the tropics and subtropical regions of the world. Melioidosis is associated with high case fatality rates in animals and humans; even with treatment, its mortality is 20-50%. It also infects plants and is designated as a biothreat agent. B. pseudomallei is pathogenic due to its ability to invade, resist factors in serum and survive intracellularly. Despite its importance, to date only a few effector proteins have been functionally characterized, and there is not much information regarding the host-pathogen protein-protein interactions (PPI) of this system, which are important to studying infection mechanisms and thereby develop prevention measures. We explored two computational approaches, the homology-based interolog and the domain-based method, to predict genome-scale host-pathogen interactions (HPIs) between two different strains of B. pseudomallei (prototypical, and highly virulent) and human. In total, 76 335 common HPIs (between the two strains) were predicted involving 8264 human and 1753 B. pseudomallei proteins. Among the unique PPIs, 14 131 non-redundant HPIs were found to be unique between the prototypical strain and human, compared to 3043 non-redundant HPIs between the highly virulent strain and human. The protein hubs analysis showed that most B. pseudomallei proteins formed a hub with human dnaK complex proteins associated with tuberculosis, a disease similar in symptoms to melioidosis. In addition, drug-binding and carbohydrate-binding mechanisms were found overrepresented within the host-pathogen network, and metabolic pathways were frequently activated according to the pathway enrichment. Subcellular localization analysis showed that most of the pathogen proteins are targeting human proteins inside cytoplasm and nucleus. We also discovered the host targets of the drug-related pathogen proteins and proteins that form T3SS and T6SS in B. pseudomallei. Additionally, a comparison between the unique PPI patterns present in the prototypical and highly virulent strains was performed. The current study is the first report on developing a genome-scale host-pathogen protein interaction networks between the human and B. pseudomallei, a critical biothreat agent. We have identified novel virulence factors and their interacting partners in the human proteome. These PPIs can be further validated by high-throughput experiments and may give new insights on how B. pseudomallei interacts with its host, which will help medical researchers in developing better prevention measures.
Collapse
Affiliation(s)
- Cristian D Loaiza
- Center for Integrated BioSystems/Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, USA
| | - Naveen Duhan
- Center for Integrated BioSystems/Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, USA
| | - Matthew Lister
- Bioinformatics Facility, Center for Integrated BioSystems, Utah State University, USA
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate/Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322 USA
| |
Collapse
|
17
|
Gupta SK, Srivastava M, Osmanoglu Ö, Dandekar T. Genome-wide inference of the Camponotus floridanus protein-protein interaction network using homologous mapping and interacting domain profile pairs. Sci Rep 2020; 10:2334. [PMID: 32047225 PMCID: PMC7012867 DOI: 10.1038/s41598-020-59344-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 01/22/2020] [Indexed: 12/18/2022] Open
Abstract
Apart from some model organisms, the interactome of most organisms is largely unidentified. High-throughput experimental techniques to determine protein-protein interactions (PPIs) are resource intensive and highly susceptible to noise. Computational methods of PPI determination can accelerate biological discovery by identifying the most promising interacting pairs of proteins and by assessing the reliability of identified PPIs. Here we present a first in-depth study describing a global view of the ant Camponotus floridanus interactome. Although several ant genomes have been sequenced in the last eight years, studies exploring and investigating PPIs in ants are lacking. Our study attempts to fill this gap and the presented interactome will also serve as a template for determining PPIs in other ants in future. Our C. floridanus interactome covers 51,866 non-redundant PPIs among 6,274 proteins, including 20,544 interactions supported by domain-domain interactions (DDIs), 13,640 interactions supported by DDIs and subcellular localization, and 10,834 high confidence interactions mediated by 3,289 proteins. These interactions involve and cover 30.6% of the entire C. floridanus proteome.
Collapse
Affiliation(s)
- Shishir K Gupta
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany.,Department of Microbiology, Biocenter, Am Hubland, D-97074, Würzburg, Germany
| | - Mugdha Srivastava
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany
| | - Özge Osmanoglu
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany
| | - Thomas Dandekar
- Functional Genomics and Systems Biology Group, Department of Bioinformatics, Biocenter, Am Hubland, D-97074, Würzburg, Germany. .,EMBL Heidelberg, BioComputing Unit, Meyerhofstraße 1, 69117, Heidelberg, Germany.
| |
Collapse
|
18
|
Tao YT, Ding XB, Jin J, Zhang HB, Guo WP, Ruan L, Yang QL, Chen PC, Yao H, Chen X. Predicted rat interactome database and gene set linkage analysis. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5996022. [PMID: 33216897 PMCID: PMC7678787 DOI: 10.1093/database/baaa086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 08/21/2020] [Accepted: 09/10/2020] [Indexed: 11/13/2022]
Abstract
Rattus norvegicus, or the rat, has been widely used as animal models for a diversity of human diseases in the last 150 years. The rat, as a disease model, has the advantage of relatively large body size and highly similar physiology to humans. In drug discovery, rat models are routinely used in drug efficacy and toxicity assessments. To facilitate molecular pharmacology studies in rats, we present the predicted rat interactome database (PRID), which is a database of high-quality predicted functional gene interactions with balanced sensitivity and specificity. PRID integrates functional gene association data from 10 public databases and infers 305 939 putative functional associations, which are expected to include 13.02% of all rat protein interactions, and 52.59% of these function associations may represent protein interactions. This set of functional interactions may not only facilitate hypothesis formulation in molecular mechanism studies, but also serve as a reference interactome for users to perform gene set linkage analysis (GSLA), which is a web-based tool to infer the potential functional impacts of a set of changed genes observed in transcriptomics analyses. In a case study, we show that GSLA based on PRID may provide more precise and informative annotations for investigators to understand the physiological mechanisms underlying a phenotype and lead investigators to testable hypotheses for further studies. Widely used functional annotation tools such as Gene Ontology (GO) analysis, and Database for Annotation, Visualization and Integrated Discovery (DAVID) did not provide similar insights. Database URL: http://rat.biomedtzc.cn.
Collapse
Affiliation(s)
- Yu-Tian Tao
- Institute of Big data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou, 318000, China
| | - Xiao-Bao Ding
- Institute of Big data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou, 318000, China
| | - Jie Jin
- Institute of Big data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou, 318000, China
| | - Hai-Bo Zhang
- Institute of Big data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou, 318000, China
| | - Wen-Ping Guo
- Institute of Big data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou, 318000, China
| | - Li Ruan
- Institute of Big data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou, 318000, China
| | - Qiao-Lei Yang
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China
| | - Peng-Cheng Chen
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China
| | - Heng Yao
- Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China
| | - Xin Chen
- Institute of Big data and Artificial Intelligence in Medicine, School of Electronics & Information Engineering, Taizhou University, 1139 Shifu Avenue, Taizhou, 318000, China.,Institute of Pharmaceutical Biotechnology, School of Medicine, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China.,Joint Institute for Genetics and Genome Medicine between Zhejiang University and University of Toronto, Zhejiang University, 866 Yuhangtang Road, Hangzhou, 310058, China
| |
Collapse
|
19
|
Li Y, Zhang Y, Li X, Yi S, Xu J. Gain-of-Function Mutations: An Emerging Advantage for Cancer Biology. Trends Biochem Sci 2019; 44:659-674. [PMID: 31047772 DOI: 10.1016/j.tibs.2019.03.009] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 03/21/2019] [Accepted: 03/26/2019] [Indexed: 02/08/2023]
Abstract
Advances in next-generation sequencing have identified thousands of genomic variants that perturb the normal functions of proteins, further contributing to diverse phenotypic consequences in cancer. Elucidating the functional pathways altered by loss-of-function (LOF) or gain-of-function (GOF) mutations will be crucial for prioritizing cancer-causing variants and their resultant therapeutic liabilities. In this review, we highlight the fundamental function of GOF mutations and discuss the potential mechanistic effects in the context of signaling networks. We also summarize advances in experimental and computational resources, which will dramatically help with studies on the functional and phenotypic consequences of mutations. Together, systematic investigations of the function of GOF mutations will provide an important missing piece for cancer biology and precision therapy.
Collapse
Affiliation(s)
- Yongsheng Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China; Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX 78712, USA
| | - Yunpeng Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China; College of Bioinformatics, Hainan Medical University, Haikou 570100, China.
| | - Song Yi
- Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX 78712, USA; Department of Biomedical Engineering, Cockrell School of Engineering, The University of Texas at Austin, Austin, TX 78712, USA.
| | - Juan Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
| |
Collapse
|
20
|
Bencurova E, Gupta SK, Oskoueian E, Bhide M, Dandekar T. Omics and bioinformatics applied to vaccine development against Borrelia. Mol Omics 2018; 14:330-340. [PMID: 30113617 DOI: 10.1039/c8mo00130h] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Borrelia burgdorferi is an extracellular spirochete that causes Lyme disease. Currently, no effective vaccine is available for humans and animals except for dogs. In the present study, an extensive bioinformatics pipeline was established to predict new candidates that can be used for vaccine development including building the protein-protein interaction network based on orthologues of experimentally verified protein-protein interaction networks, elucidation of the proteins involved in the immune response, selection of the topologically-interesting proteins and their prioritization based on their antigenicity. Proteomic network analysis yielded an interactome network with 120 nodes with 97 interactions. Proteins were selected to obtain a subnet containing only the borrelial membrane proteins and immune-related host proteins. This strategy resulted in the selection of 15 borrelial targets, which were subjected to extensive bioinformatics analysis to predict their antigenic properties. Based on the strategy applied in this study the proteins encoded by erpX (ErpX proteins, UniProt ID: H7C7L6), erpL (ErpL protein, UniProt ID: H7C7M3) and erpY (ErpY protein, UniProt ID: Q9S0D9) are suggested as a novel set of vaccine targets to control Lyme disease. Moreover, five different tools were used to validate their antigenicity regarding B-cells. The combination of all these proteins in a vaccine should allow improved protection against Borrelia infection.
Collapse
Affiliation(s)
- Elena Bencurova
- Department of Bioinformatics, Biocenter, Am Hubland, D-97074 Wuerzburg, Germany.
| | | | | | | | | |
Collapse
|
21
|
Context-dependent prediction of protein complexes by SiComPre. NPJ Syst Biol Appl 2018; 4:37. [PMID: 30245847 PMCID: PMC6141528 DOI: 10.1038/s41540-018-0073-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Revised: 08/21/2018] [Accepted: 08/29/2018] [Indexed: 11/09/2022] Open
Abstract
Most cellular processes are regulated by groups of proteins interacting together to form protein complexes. Protein compositions vary between different tissues or disease conditions enabling or preventing certain protein-protein interactions and resulting in variations in the complexome. Quantitative and qualitative characterization of context-specific protein complexes will help to better understand context-dependent variations in the physiological behavior of cells. Here, we present SiComPre 1.0, a computational tool that predicts context-specific protein complexes by integrating multi-omics sources. SiComPre outperforms other protein complex prediction tools in qualitative predictions and is unique in giving quantitative predictions on the complexome depending on the specific interactions and protein abundances defined by the user. We provide tutorials and examples on the complexome prediction of common model organisms, various human tissues and how the complexome is affected by drug treatment.
Collapse
|
22
|
Abstract
Cellular functions are often performed by multiprotein structures called protein complexes. These complexes are dynamic structures that evolve during the cell cycle or in response to external and internal stimuli, and are tightly regulated by protein expression in different tissues resulting in quantitative and qualitative variation of protein complexes. Advances in high-throughput techniques, such as mass-spectrometry and yeast two-hybrid provided a large amount of data on protein-protein interactions. This sparked the development of computational methods able to predict protein complex formation under a variety of biological and clinical conditions. However, the challenges that need to be addressed for successful computational protein complex prediction are highly complex.The post-genomic era saw an emerging number of algorithms and software, which are able to predict protein complexes from protein-protein interaction networks and a variety of other sources. Despite the high capacity of these methods to qualitatively predict protein complexes, they could provide only limited or no quantitative information of the predicted complexes. Recently, a new large-scale simulation of protein complexes was able to achieve this task by simulating protein complex formation on the proteome scale.In this chapter, we review representative methods that can predict multiple protein complexes at different scales and discuss how these can be combined with emerging sources of data in order to improve protein complex characterization.
Collapse
|
23
|
Frenkel-Morgenstern M, Gorohovski A, Tagore S, Sekar V, Vazquez M, Valencia A. ChiPPI: a novel method for mapping chimeric protein-protein interactions uncovers selection principles of protein fusion events in cancer. Nucleic Acids Res 2017; 45:7094-7105. [PMID: 28549153 PMCID: PMC5499553 DOI: 10.1093/nar/gkx423] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Accepted: 05/07/2017] [Indexed: 12/20/2022] Open
Abstract
Fusion proteins, comprising peptides deriving from the translation of two parental genes, are produced in cancer by chromosomal aberrations. The expressed fusion protein incorporates domains of both parental proteins. Using a methodology that treats discrete protein domains as binding sites for specific domains of interacting proteins, we have cataloged the protein interaction networks for 11 528 cancer fusions (ChiTaRS-3.1). Here, we present our novel method, chimeric protein–protein interactions (ChiPPI) that uses the domain–domain co-occurrence scores in order to identify preserved interactors of chimeric proteins. Mapping the influence of fusion proteins on cell metabolism and pathways reveals that ChiPPI networks often lose tumor suppressor proteins and gain oncoproteins. Furthermore, fusions often induce novel connections between non-interactors skewing interaction networks and signaling pathways. We compared fusion protein PPI networks in leukemia/lymphoma, sarcoma and solid tumors finding distinct enrichment patterns for each disease type. While certain pathways are enriched in all three diseases (Wnt, Notch and TGF β), there are distinct patterns for leukemia (EGFR signaling, DNA replication and CCKR signaling), for sarcoma (p53 pathway and CCKR signaling) and solid tumors (FGFR and EGFR signaling). Thus, the ChiPPI method represents a comprehensive tool for studying the anomaly of skewed cellular networks produced by fusion proteins in cancer.
Collapse
Affiliation(s)
| | | | - Somnath Tagore
- Faculty of Medicine, Bar-Ilan-University, Henrietta Szold 8, Safed 1311502, Israel
| | - Vaishnovi Sekar
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), M.F.Almagro 3, 28029 Madrid, Spain
| | - Miguel Vazquez
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), M.F.Almagro 3, 28029 Madrid, Spain
| | - Alfonso Valencia
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), M.F.Almagro 3, 28029 Madrid, Spain
| |
Collapse
|
24
|
Shatnawi M. Protein-Protein Interaction Prediction: Recent Advances. 2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA) 2017:69-73. [DOI: 10.1109/dexa.2017.30] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
25
|
Oh K, Yi GS. Prediction of scaffold proteins based on protein interaction and domain architectures. BMC Bioinformatics 2016; 17 Suppl 6:220. [PMID: 27490120 PMCID: PMC4965726 DOI: 10.1186/s12859-016-1079-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Background Scaffold proteins are known for being crucial regulators of various cellular functions by assembling multiple proteins involved in signaling and metabolic pathways. Identification of scaffold proteins and the study of their molecular mechanisms can open a new aspect of cellular systemic regulation and the results can be applied in the field of medicine and engineering. Despite being highlighted as the regulatory roles of dozens of scaffold proteins, there was only one known computational approach carried out so far to find scaffold proteins from interactomes. However, there were limitations in finding diverse types of scaffold proteins because their criteria were restricted to the classical scaffold proteins. In this paper, we will suggest a systematic approach to predict massive scaffold proteins from interactomes and to characterize the roles of scaffold proteins comprehensively. Results From a total of 10,419 basic scaffold protein candidates in protein interactomes, we classified them into three classes according to the structural evidences for scaffolding, such as domain architectures, domain interactions and protein complexes. Finally, we could define 2716 highly reliable scaffold protein candidates and their characterized functional features. To assess the accuracy of our prediction, the gold standard positive and negative data sets were constructed. We prepared 158 gold standard positive data and 844 gold standard negative data based on the functional information from Gene Ontology consortium. The precision, sensitivity and specificity of our testing was 80.3, 51.0, and 98.5 % respectively. Through the function enrichment analysis of highly reliable scaffold proteins, we could confirm the significantly enriched functions that are related to scaffold protein binding. We also identified functional association between scaffold proteins and their recruited proteins. Furthermore, we checked that the disease association of scaffold proteins is higher than kinases. Conclusions In conclusion, we could predict larger volume of scaffold proteins and analyzed their functional characteristics. Deeper understandings about the roles of scaffold proteins from this study will provide a higher opportunity to find therapeutic or engineering applications of scaffold proteins using their functional characteristics. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1079-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kimin Oh
- Department of Bio and Brain Engineering, KAIST, Daejeon, Korea
| | - Gwan-Su Yi
- Department of Bio and Brain Engineering, KAIST, Daejeon, Korea.
| |
Collapse
|
26
|
Kaltdorf M, Srivastava M, Gupta SK, Liang C, Binder J, Dietl AM, Meir Z, Haas H, Osherov N, Krappmann S, Dandekar T. Systematic Identification of Anti-Fungal Drug Targets by a Metabolic Network Approach. Front Mol Biosci 2016; 3:22. [PMID: 27379244 PMCID: PMC4911368 DOI: 10.3389/fmolb.2016.00022] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Accepted: 05/24/2016] [Indexed: 11/13/2022] Open
Abstract
New antimycotic drugs are challenging to find, as potential target proteins may have close human orthologs. We here focus on identifying metabolic targets that are critical for fungal growth and have minimal similarity to targets among human proteins. We compare and combine here: (I) direct metabolic network modeling using elementary mode analysis and flux estimates approximations using expression data, (II) targeting metabolic genes by transcriptome analysis of condition-specific highly expressed enzymes, and (III) analysis of enzyme structure, enzyme interconnectedness ("hubs"), and identification of pathogen-specific enzymes using orthology relations. We have identified 64 targets including metabolic enzymes involved in vitamin synthesis, lipid, and amino acid biosynthesis including 18 targets validated from the literature, two validated and five currently examined in own genetic experiments, and 38 further promising novel target proteins which are non-orthologous to human proteins, involved in metabolism and are highly ranked drug targets from these pipelines.
Collapse
Affiliation(s)
- Martin Kaltdorf
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Mugdha Srivastava
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Shishir K Gupta
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Chunguang Liang
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| | - Jasmin Binder
- Microbiology Institute - Clinical Microbiology, Immunology and Hygiene, Friedrich-Alexander University Erlangen-Nürnberg, University Hospital of Erlangen Erlangen, Germany
| | - Anna-Maria Dietl
- Division of Molecular Biology/Biocenter, Medical University Innsbruck Innsbruck, Austria
| | - Zohar Meir
- Aspergillus and Antifungal Research Laboratory, Department of Clinical Microbiology and Immunology, Sackler School of Medicine, Tel Aviv University Tel-Aviv, Israel
| | - Hubertus Haas
- Division of Molecular Biology/Biocenter, Medical University Innsbruck Innsbruck, Austria
| | - Nir Osherov
- Aspergillus and Antifungal Research Laboratory, Department of Clinical Microbiology and Immunology, Sackler School of Medicine, Tel Aviv University Tel-Aviv, Israel
| | - Sven Krappmann
- Microbiology Institute - Clinical Microbiology, Immunology and Hygiene, Friedrich-Alexander University Erlangen-Nürnberg, University Hospital of Erlangen Erlangen, Germany
| | - Thomas Dandekar
- Department of Bioinformatics, Biocenter, University of Würzburg Würzburg, Germany
| |
Collapse
|
27
|
Shim JE, Lee I. Weighted mutual information analysis substantially improves domain-based functional network models. Bioinformatics 2016; 32:2824-30. [PMID: 27207946 PMCID: PMC5018372 DOI: 10.1093/bioinformatics/btw320] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2016] [Accepted: 05/16/2016] [Indexed: 11/30/2022] Open
Abstract
Motivation: Functional protein–protein interaction (PPI) networks elucidate molecular pathways underlying complex phenotypes, including those of human diseases. Extrapolation of domain–domain interactions (DDIs) from known PPIs is a major domain-based method for inferring functional PPI networks. However, the protein domain is a functional unit of the protein. Therefore, we should be able to effectively infer functional interactions between proteins based on the co-occurrence of domains. Results: Here, we present a method for inferring accurate functional PPIs based on the similarity of domain composition between proteins by weighted mutual information (MI) that assigned different weights to the domains based on their genome-wide frequencies. Weighted MI outperforms other domain-based network inference methods and is highly predictive for pathways as well as phenotypes. A genome-scale human functional network determined by our method reveals numerous communities that are significantly associated with known pathways and diseases. Domain-based functional networks may, therefore, have potential applications in mapping domain-to-pathway or domain-to-phenotype associations. Availability and Implementation: Source code for calculating weighted mutual information based on the domain profile matrix is available from www.netbiolab.org/w/WMI. Contact:Insuklee@yonsei.ac.kr Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jung Eun Shim
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, Korea
| | - Insuk Lee
- Department of Biotechnology, College of Life Science and Biotechnology, Yonsei University, Seoul, Korea
| |
Collapse
|
28
|
Integration of multiple biological features yields high confidence human protein interactome. J Theor Biol 2016; 403:85-96. [PMID: 27196966 DOI: 10.1016/j.jtbi.2016.05.020] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2016] [Revised: 05/01/2016] [Accepted: 05/11/2016] [Indexed: 01/05/2023]
Abstract
The biological function of a protein is usually determined by its physical interaction with other proteins. Protein-protein interactions (PPIs) are identified through various experimental methods and are stored in curated databases. The noisiness of the existing PPI data is evident, and it is essential that a more reliable data is generated. Furthermore, the selection of a set of PPIs at different confidence levels might be necessary for many studies. Although different methodologies were introduced to evaluate the confidence scores for binary interactions, a highly reliable, almost complete PPI network of Homo sapiens is not proposed yet. The quality and coverage of human protein interactome need to be improved to be used in various disciplines, especially in biomedicine. In the present work, we propose an unsupervised statistical approach to assign confidence scores to PPIs of H. sapiens. To achieve this goal PPI data from six different databases were collected and a total of 295,288 non-redundant interactions between 15,950 proteins were acquired. The present scoring system included the context information that was assigned to PPIs derived from eight biological attributes. A high confidence network, which included 147,923 binary interactions between 13,213 proteins, had scores greater than the cutoff value of 0.80, for which sensitivity, specificity, and coverage were 94.5%, 80.9%, and 82.8%, respectively. We compared the present scoring method with others for evaluation. Reducing the noise inherent in experimental PPIs via our scoring scheme increased the accuracy significantly. As it was demonstrated through the assessment of process and cancer subnetworks, this study allows researchers to construct and analyze context-specific networks via valid PPI sets and one can easily achieve subnetworks around proteins of interest at a specified confidence level.
Collapse
|
29
|
Will T, Helms V. PPIXpress: construction of condition-specific protein interaction networks based on transcript expression. Bioinformatics 2015; 32:571-8. [PMID: 26508756 DOI: 10.1093/bioinformatics/btv620] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2015] [Accepted: 10/20/2015] [Indexed: 12/13/2022] Open
Abstract
UNLABELLED Protein-protein interaction networks are an important component of modern systems biology. Yet, comparatively few efforts have been made to tailor their topology to the actual cellular condition being studied. Here, we present a network construction method that exploits expression data at the transcript-level and thus reveals alterations in protein connectivity not only caused by differential gene expression but also by alternative splicing. We achieved this by establishing a direct correspondence between individual protein interactions and underlying domain interactions in a complete but condition-unspecific protein interaction network. This knowledge was then used to infer the condition-specific presence of interactions from the dominant protein isoforms. When we compared contextualized interaction networks of matched normal and tumor samples in breast cancer, our transcript-based construction identified more significant alterations that affected proteins associated with cancerogenesis than a method that only uses gene expression data. The approach is provided as the user-friendly tool PPIXpress. AVAILABILITY AND IMPLEMENTATION PPIXpress is available at https://sourceforge.net/projects/ppixpress/.
Collapse
Affiliation(s)
- Thorsten Will
- Center for Bioinformatics and Graduate School of Computer Science, Saarland University, Saarbrücken, Germany
| | | |
Collapse
|
30
|
Abstract
Motivation: Eukaryotic gene expression is controlled through molecular logic circuits that combine regulatory signals of many different factors. In particular, complexation of transcription factors (TFs) and other regulatory proteins is a prevailing and highly conserved mechanism of signal integration within critical regulatory pathways and enables us to infer controlled genes as well as the exerted regulatory mechanism. Common approaches for protein complex prediction that only use protein interaction networks, however, are designed to detect self-contained functional complexes and have difficulties to reveal dynamic combinatorial assemblies of physically interacting proteins. Results: We developed the novel algorithm DACO that combines protein–protein interaction networks and domain–domain interaction networks with the cluster-quality metric cohesiveness. The metric is locally maximized on the holistic level of protein interactions, and connectivity constraints on the domain level are used to account for the exclusive and thus inherently combinatorial nature of the interactions within such assemblies. When applied to predicting TF complexes in the yeast Saccharomyces cerevisiae, the proposed approach outperformed popular complex prediction methods by far. Furthermore, we were able to assign many of the predictions to target genes, as well as to a potential regulatory effect in agreement with literature evidence. Availability and implementation: A prototype implementation is freely available at https://sourceforge.net/projects/dacoalgorithm/. Contact:volkhard.helms@bioinformatik.uni-saarland.de Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Thorsten Will
- Center for Bioinformatics, Campus Building E2.1, Saarland University, D-66123 Saarbrücken, Germany
| | - Volkhard Helms
- Center for Bioinformatics, Campus Building E2.1, Saarland University, D-66123 Saarbrücken, Germany
| |
Collapse
|
31
|
Palmeri A, Ferrè F, Helmer-Citterich M. Exploiting holistic approaches to model specificity in protein phosphorylation. Front Genet 2014; 5:315. [PMID: 25324856 PMCID: PMC4179730 DOI: 10.3389/fgene.2014.00315] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2014] [Accepted: 08/21/2014] [Indexed: 12/27/2022] Open
Abstract
Phosphate plays a chemically unique role in shaping cellular signaling of all current living systems, especially eukaryotes. Protein phosphorylation has been studied at several levels, from the near-site context, both in sequence and structure, to the crowded cellular environment, and ultimately to the systems-level perspective. Despite the tremendous advances in mass spectrometry and efforts dedicated to the development of ad hoc highly sophisticated methods, phosphorylation site inference and associated kinase identification are still unresolved problems in kinome biology. The sequence and structure of the substrate near-site context are not sufficient alone to model the in vivo phosphorylation rules, and they should be integrated with orthogonal information in all possible applications. Here we provide an overview of the different contexts that contribute to protein phosphorylation, discussing their potential impact in phosphorylation site annotation and in predicting kinase-substrate specificity.
Collapse
Affiliation(s)
- Antonio Palmeri
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| | - Fabrizio Ferrè
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| | - Manuela Helmer-Citterich
- Department of Biology, Centre for Molecular Bioinformatics, University of Rome Tor Vergata Rome, Italy
| |
Collapse
|
32
|
Abstract
The past decade has seen a dramatic expansion in the number and range of techniques available to obtain genome-wide information and to analyze this information so as to infer both the functions of individual molecules and how they interact to modulate the behavior of biological systems. Here, we review these techniques, focusing on the construction of physical protein-protein interaction networks, and highlighting approaches that incorporate protein structure, which is becoming an increasingly important component of systems-level computational techniques. We also discuss how network analyses are being applied to enhance our basic understanding of biological systems and their disregulation, as well as how these networks are being used in drug development.
Collapse
Affiliation(s)
- Donald Petrey
- Center for Computational Biology and Bioinformatics, Department of Systems Biology
| | | |
Collapse
|
33
|
The domain landscape of virus-host interactomes. BIOMED RESEARCH INTERNATIONAL 2014; 2014:867235. [PMID: 24991570 PMCID: PMC4065681 DOI: 10.1155/2014/867235] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Accepted: 03/19/2014] [Indexed: 12/31/2022]
Abstract
Viral infections result in millions of deaths in the world today. A thorough analysis of virus-host interactomes may reveal insights into viral infection and pathogenic strategies. In this study, we presented a landscape of virus-host interactomes based on protein domain interaction. Compared to the analysis at protein level, this domain-domain interactome provided a unique abstraction of protein-protein interactome. Through comparisons among DNA, RNA, and retrotranscribing viruses, we identified a core of human domains, that viruses used to hijack the cellular machinery and evade the immune system, which might be promising antiviral drug targets. We showed that viruses preferentially interacted with host hub and bottleneck domains, and the degree and betweenness centrality among three categories of viruses are significantly different. Further analysis at functional level highlighted that different viruses perturbed the host cellular molecular network by common and unique strategies. Most importantly, we creatively proposed a viral disease network among viral domains, human domains and the corresponding diseases, which uncovered several unknown virus-disease relationships that needed further verification. Overall, it is expected that the findings will help to deeply understand the viral infection and contribute to the development of antiviral therapy.
Collapse
|
34
|
Abstract
Background Membrane proteins perform essential roles in diverse cellular functions and are regarded as major pharmaceutical targets. The significance of membrane proteins has led to the developing dozens of resources related with membrane proteins. However, most of these resources are built for specific well-known membrane protein groups, making it difficult to find common and specific features of various membrane protein groups. Methods We collected human membrane proteins from the dispersed resources and predicted novel membrane protein candidates by using ortholog information and our membrane protein classifiers. The membrane proteins were classified according to the type of interaction with the membrane, subcellular localization, and molecular function. We also made new feature dataset to characterize the membrane proteins in various aspects including membrane protein topology, domain, biological process, disease, and drug. Moreover, protein structure and ICD-10-CM based integrated disease and drug information was newly included. To analyze the comprehensive information of membrane proteins, we implemented analysis tools to identify novel sequence and functional features of the classified membrane protein groups and to extract features from protein sequences. Results We constructed HMPAS with 28,509 collected known membrane proteins and 8,076 newly predicted candidates. This system provides integrated information of human membrane proteins individually and in groups organized by 45 subcellular locations and 1,401 molecular functions. As a case study, we identified associations between the membrane proteins and diseases and present that membrane proteins are promising targets for diseases related with nervous system and circulatory system. A web-based interface of this system was constructed to facilitate researchers not only to retrieve organized information of individual proteins but also to use the tools to analyze the membrane proteins. Conclusions HMPAS provides comprehensive information about human membrane proteins including specific features of certain membrane protein groups. In this system, user can acquire the information of individual proteins and specified groups focused on their conserved sequence features, involved cellular processes, and diseases. HMPAS may contribute as a valuable resource for the inference of novel cellular mechanisms and pharmaceutical targets associated with the human membrane proteins. HMPAS is freely available at http://fcode.kaist.ac.kr/hmpas.
Collapse
|
35
|
Braun P, Aubourg S, Van Leene J, De Jaeger G, Lurin C. Plant protein interactomes. ANNUAL REVIEW OF PLANT BIOLOGY 2013; 64:161-87. [PMID: 23330791 DOI: 10.1146/annurev-arplant-050312-120140] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Protein-protein interactions are a critical element of biological systems, and the analysis of interaction partners can provide valuable hints about unknown functions of a protein. In recent years, several large-scale protein interaction studies have begun to unravel the complex networks through which plant proteins exert their functions. Two major classes of experimental approaches are used for protein interaction mapping: analysis of direct interactions using binary methods such as yeast two-hybrid or split ubiquitin, and analysis of protein complexes through affinity purification followed by mass spectrometry. In addition, bioinformatics predictions can suggest interactions that have evaded detection by other methods or those of proteins that have not been investigated. Here we review the major approaches to construct, analyze, use, and carry out quality control on plant protein interactome networks. We present experimental and computational approaches for large-scale mapping, methods for validation or smaller-scale functional studies, important bioinformatics resources, and findings from recently published large-scale plant interactome network maps.
Collapse
Affiliation(s)
- Pascal Braun
- Department of Plant Systems Biology, Center for Life and Food Sciences Weihenstephan, Technische Universität München (TUM), 85354 Freising-Weihenstephan, Germany.
| | | | | | | | | |
Collapse
|