1
|
Al-gafari M, Jagadeesan SK, Kazmirchuk TDD, Takallou S, Wang J, Hajikarimlou M, Ramessur NB, Darwish W, Bradbury-Jost C, Moteshareie H, Said KB, Samanfar B, Golshani A. Investigating the Activities of CAF20 and ECM32 in the Regulation of PGM2 mRNA Translation. BIOLOGY 2024; 13:884. [PMID: 39596839 PMCID: PMC11592143 DOI: 10.3390/biology13110884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 10/17/2024] [Accepted: 10/22/2024] [Indexed: 11/29/2024]
Abstract
Translation is a fundamental process in biology, and understanding its mechanisms is crucial to comprehending cellular functions and diseases. The regulation of this process is closely linked to the structure of mRNA, as these regions prove vital to modulating translation efficiency and control. Thus, identifying and investigating these fundamental factors that influence the processing and unwinding of structured mRNAs would be of interest due to the widespread impact in various fields of biology. To this end, we employed a computational approach and identified genes that may be involved in the translation of structured mRNAs. The approach is based on the enrichment of interactions and co-expression of genes with those that are known to influence translation and helicase activity. The in silico prediction found CAF20 and ECM32 to be highly ranked candidates that may play a role in unwinding mRNA. The activities of neither CAF20 nor ECM32 have previously been linked to the translation of PGM2 mRNA or other structured mRNAs. Our follow-up investigations with these two genes provided evidence of their participation in the translation of PGM2 mRNA and several other synthetic structured mRNAs.
Collapse
Affiliation(s)
- Mustafa Al-gafari
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Sasi Kumar Jagadeesan
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Thomas David Daniel Kazmirchuk
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Sarah Takallou
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Jiashu Wang
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Maryam Hajikarimlou
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Nishka Beersing Ramessur
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
| | - Waleed Darwish
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
| | - Calvin Bradbury-Jost
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Houman Moteshareie
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
- Healthy Environments and Consumer Safety Branch, Health Canada, Ottawa, ON K1A 0K9, Canada
| | - Kamaledin B. Said
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Department of Pathology and Microbiology, College of Medicine, University of Hail, Hail P.O. Box 2240, Saudi Arabia
| | - Bahram Samanfar
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre (ORDC), Ottawa, ON K1A 0C6, Canada
| | - Ashkan Golshani
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada; (M.A.-g.); (S.K.J.); (T.D.D.K.); (S.T.); (J.W.); (M.H.); (N.B.R.); (W.D.); (C.B.-J.); (K.B.S.)
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| |
Collapse
|
2
|
Rosilan NF, Waiho K, Fazhan H, Sung YY, Zakaria NH, Afiqah-Aleng N, Mohamed-Hussein ZA. Current trends of host-pathogen relationship in shrimp infectious disease via computational protein-protein interaction: A bibliometric analysis. FISH & SHELLFISH IMMUNOLOGY 2023; 142:109171. [PMID: 37858788 DOI: 10.1016/j.fsi.2023.109171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Revised: 10/12/2023] [Accepted: 10/16/2023] [Indexed: 10/21/2023]
Abstract
Protein-protein interactions (PPIs) are essential for understanding cell physiology in normal and pathological conditions, as they might involve in all cellular processes. PPIs have been widely used to elucidate the pathobiology of human and plant diseases. Therefore, they can also be used to unveil the pathobiology of infectious diseases in shrimp, which is one of the high-risk factors influencing the success or failure of shrimp production. PPI network analysis, specifically host-pathogen PPI (HP-PPI), provides insights into the molecular interactions between the shrimp and pathogens. This review quantitatively analyzed the research trends within this field through bibliometric analysis using specific keywords, countries, authors, organizations, journals, and documents. This analysis has screened 206 records from the Scopus database for determining eligibility, resulting in 179 papers that were retrieved for bibliometric analysis. The analysis revealed that China and Thailand were the driving forces behind this specific field of research and frequently collaborated with the United States. Aquaculture and Diseases of Aquatic Organisms were the prominent sources for publications in this field. The main keywords identified included "white spot syndrome virus," "WSSV," and "shrimp." We discovered that studies on HP-PPI are currently quite scarce. As a result, we further discussed the significance of HP-PPI by highlighting various approaches that have been previously adopted. These findings not only emphasize the importance of HP-PPI but also pave the way for future researchers to explore the pathogenesis of infectious diseases in shrimp. By doing so, preventative measures and enhanced treatment strategies can be identified.
Collapse
Affiliation(s)
- Nur Fathiah Rosilan
- Institute of Climate Adaptation and Marine Biotechnology (ICAMB), Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia
| | - Khor Waiho
- Higher Institution Centre of Excellence (HICoE), Institute of Tropical Aquaculture and Fisheries, Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia; Centre for Chemical Biology, Universiti Sains Malaysia, Minden, 11900, Penang, Malaysia; Department of Aquaculture, Faculty of Fisheries, Kasetsart University, 10900, Bangkok, Thailand
| | - Hanafiah Fazhan
- Higher Institution Centre of Excellence (HICoE), Institute of Tropical Aquaculture and Fisheries, Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia; Centre for Chemical Biology, Universiti Sains Malaysia, Minden, 11900, Penang, Malaysia; Department of Aquaculture, Faculty of Fisheries, Kasetsart University, 10900, Bangkok, Thailand
| | - Yeong Yik Sung
- Institute of Climate Adaptation and Marine Biotechnology (ICAMB), Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia
| | - Nor Hafizah Zakaria
- Institute of Climate Adaptation and Marine Biotechnology (ICAMB), Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia.
| | - Nor Afiqah-Aleng
- Institute of Climate Adaptation and Marine Biotechnology (ICAMB), Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia.
| | - Zeti-Azura Mohamed-Hussein
- UKM Medical Molecular Biology Institute, UKM Medical Centre, Jalan Yaacob Latiff, 56000, Cheras, Kuala Lumpur, Malaysia; Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600, UKM Bangi, Selangor, Malaysia
| |
Collapse
|
3
|
Duhan N, Kaundal R. HuCoPIA: An Atlas of Human vs. SARS-CoV-2 Interactome and the Comparative Analysis with Other Coronaviridae Family Viruses. Viruses 2023; 15:492. [PMID: 36851706 PMCID: PMC9962590 DOI: 10.3390/v15020492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 02/01/2023] [Accepted: 02/04/2023] [Indexed: 02/12/2023] Open
Abstract
SARS-CoV-2, a novel betacoronavirus strain, has caused a pandemic that has claimed the lives of nearly 6.7M people worldwide. Vaccines and medicines are being developed around the world to reduce the disease spread, fatality rates, and control the new variants. Understanding the protein-protein interaction mechanism of SARS-CoV-2 in humans, and their comparison with the previous SARS-CoV and MERS strains, is crucial for these efforts. These interactions might be used to assess vaccination effectiveness, diagnose exposure, and produce effective biotherapeutics. Here, we present the HuCoPIA database, which contains approximately 100,000 protein-protein interactions between humans and three strains (SARS-CoV-2, SARS-CoV, and MERS) of betacoronavirus. The interactions in the database are divided into common interactions between all three strains and those unique to each strain. It also contains relevant functional annotation information of human proteins. The HuCoPIA database contains SARS-CoV-2 (41,173), SARS-CoV (31,997), and MERS (26,862) interactions, with functional annotation of human proteins like subcellular localization, tissue-expression, KEGG pathways, and Gene ontology information. We believe HuCoPIA will serve as an invaluable resource to diverse experimental biologists, and will help to advance the research in better understanding the mechanism of betacoronaviruses.
Collapse
Affiliation(s)
- Naveen Duhan
- Department of Plants, Soils, and Climate/Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
- Bioinformatics Facility, Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate/Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
- Bioinformatics Facility, Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University, Logan, UT 84322, USA
- Department of Computer Science, College of Science, Utah State University, Logan, UT 84322, USA
| |
Collapse
|
4
|
Liu Y, Cummins SF, Zhao M. A Genomics Resource for 12 Edible Seaweeds to Predict Seaweed-Secreted Peptides with Potential Anti-Cancer Function. BIOLOGY 2022; 11:biology11101458. [PMID: 36290362 PMCID: PMC9598510 DOI: 10.3390/biology11101458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 09/25/2022] [Accepted: 09/27/2022] [Indexed: 11/06/2022]
Abstract
Seaweeds are multicellular marine macroalgae with natural compounds that have potential anticancer activity. To date, the identification of those compounds has relied on purification and assay, yet few have been documented. Additionally, the genomes and associated proteomes of edible seaweeds that have been identified thus far are scattered among different resources and with no systematic summary available, which hinders the development of a large-scale omics analysis. To enable this, we constructed a comprehensive genomics resource for the edible seaweeds. These data could be used for systematic metabolomics and a proteome search for anti-cancer compound and peptides. In brief, we integrated and annotated 12 publicly available edible seaweed genomes (8 species and 268,071 proteins). In addition, we integrate the new seaweed genomic resources with established cancer bioinformatics pipelines to help identify potential seaweed proteins that could help mitigate the development of cancer. We present 7892 protein domains that were predicted to be associated with cancer proteins based on a protein domain-domain interaction. The most enriched protein families were associated with protein phosphorylation and insulin signalling, both of which are recognised to be crucial molecular components for patient survival in various cancers. In addition, we found 6692 seaweed proteins that could interact with over 100 tumour suppressor proteins, of which 147 are predicted to be secreted proteins. In conclusion, our genomics resource not only may be helpful in exploring the genomics features of these edible seaweed but also may provide a new avenue to explore the molecular mechanisms for seaweed-associated inhibition of human cancer development.
Collapse
Affiliation(s)
- Yining Liu
- The School of Public Health, Institute for Chemical Carcinogenesis, Guangzhou Medical University, Guangzhou 510180, China
| | - Scott F. Cummins
- Seaweed Research Group, University of the Sunshine Coast, Maroochydore, QLD 4558, Australia
- School of Science, Technology and Engineering, University of the Sunshine Coast, Maroochydore, QLD 4558, Australia
| | - Min Zhao
- Seaweed Research Group, University of the Sunshine Coast, Maroochydore, QLD 4558, Australia
- School of Science, Technology and Engineering, University of the Sunshine Coast, Maroochydore, QLD 4558, Australia
- Correspondence: ; Tel.: +61-07-54563402
| |
Collapse
|
5
|
Sharma MK, Srivastav VK, Joshi CK, Kumar M, Sharma DK. Annotated protein network analysis linking oral diseases. Bioinformation 2022; 18:724-729. [PMID: 37323560 PMCID: PMC10266365 DOI: 10.6026/97320630018724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 08/31/2022] [Accepted: 08/31/2022] [Indexed: 09/28/2024] Open
Abstract
Oral cancer is becoming more common, and it threatens to be a serious worldwide medical issue. Hence, it is of interest to elucidate the networks between proteins and biologically active compounds, as well as their functional annotations, and cell signaling pathways. The online STRING software was used to create a molecular genetics interaction network named AZURIN on oral bacterial proteins. We also used the cystoscope software to identify 11 nodes and 16 edges with an average node order of 2.91. Thus, we document data on the interaction of protein networks with other proteins for identifying potential therapeutic drug candidates linked to oral disease.
Collapse
Affiliation(s)
- Mukesh Kumar Sharma
- Department of Biotechnology, Maharaj Vinayak Global University, Jaipur Rajasthan, India
- Department of Botany, Vishwa Bharti PG College, Sikar, Rajasthan India
| | - Vivek Kumar Srivastav
- Department of Biotechnology, Maharaj Vinayak Global University, Jaipur Rajasthan, India
| | | | - Mohan Kumar
- Gyan Joyti College of Pharmacy and Nursing School, Hazaribagh, Jharkhand, India
| | - Deepak Kumar Sharma
- Department of Conservative and Endodontics, Jaipur Dental College, Jaipur, Rajasthan India
| |
Collapse
|
6
|
TritiKBdb: A Functional Annotation Resource for Deciphering the Complete Interaction Networks in Wheat-Karnal Bunt Pathosystem. Int J Mol Sci 2022; 23:ijms23137455. [PMID: 35806459 PMCID: PMC9267065 DOI: 10.3390/ijms23137455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 06/30/2022] [Accepted: 06/30/2022] [Indexed: 02/01/2023] Open
Abstract
The study of molecular interactions, especially the inter-species protein-protein interactions, is crucial for understanding the disease infection mechanism in plants. These interactions play an important role in disease infection and host immune responses against pathogen attack. Among various critical fungal diseases, the incidences of Karnal bunt (Tilletia indica) around the world have hindered the export of the crops such as wheat from infected regions, thus causing substantial economic losses. Due to sparse information on T. indica, limited insight is available with regard to gaining in-depth knowledge of the interaction mechanisms between the host and pathogen proteins during the disease infection process. Here, we report the development of a comprehensive database and webserver, TritiKBdb, that implements various tools to study the protein-protein interactions in the Triticum species-Tilletia indica pathosystem. The novel ‘interactomics’ tool allows the user to visualize/compare the networks of the predicted interactions in an enriched manner. TritiKBdb is a user-friendly database that provides functional annotations such as subcellular localization, available domains, KEGG pathways, and GO terms of the host and pathogen proteins. Additionally, the information about the host and pathogen proteins that serve as transcription factors and effectors, respectively, is also made available. We believe that TritiKBdb will serve as a beneficial resource for the research community, and aid the community in better understanding the infection mechanisms of Karnal bunt and its interactions with wheat. The database is freely available for public use at http://bioinfo.usu.edu/tritikbdb/.
Collapse
|
7
|
Deciphering the Host-Pathogen Interactome of the Wheat-Common Bunt System: A Step towards Enhanced Resilience in Next Generation Wheat. Int J Mol Sci 2022; 23:ijms23052589. [PMID: 35269732 PMCID: PMC8910311 DOI: 10.3390/ijms23052589] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 02/09/2022] [Indexed: 02/05/2023] Open
Abstract
Common bunt, caused by two fungal species, Tilletia caries and Tilletia laevis, is one of the most potentially destructive diseases of wheat. Despite the availability of synthetic chemicals against the disease, organic agriculture relies greatly on resistant cultivars. Using two computational approaches—interolog and domain-based methods—a total of approximately 58 M and 56 M probable PPIs were predicted in T. aestivum–T. caries and T. aestivum–T. laevis interactomes, respectively. We also identified 648 and 575 effectors in the interactions from T. caries and T. laevis, respectively. The major host hubs belonged to the serine/threonine protein kinase, hsp70, and mitogen-activated protein kinase families, which are actively involved in plant immune signaling during stress conditions. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis of the host proteins revealed significant GO terms (O-methyltransferase activity, regulation of response to stimulus, and plastid envelope) and pathways (NF-kappa B signaling and the MAPK signaling pathway) related to plant defense against pathogens. Subcellular localization suggested that most of the pathogen proteins target the host in the plastid. Furthermore, a comparison between unique T. caries and T. laevis proteins was carried out. We also identified novel host candidates that are resistant to disease. Additionally, the host proteins that serve as transcription factors were also predicted.
Collapse
|
8
|
OUP accepted manuscript. Brief Funct Genomics 2022; 21:243-269. [DOI: 10.1093/bfgp/elac007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 03/17/2022] [Accepted: 03/18/2022] [Indexed: 11/14/2022] Open
|
9
|
Kataria R, Kaundal R. Deciphering the Crosstalk Mechanisms of Wheat-Stem Rust Pathosystem: Genome-Scale Prediction Unravels Novel Host Targets. FRONTIERS IN PLANT SCIENCE 2022; 13:895480. [PMID: 35800602 PMCID: PMC9253690 DOI: 10.3389/fpls.2022.895480] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 05/31/2022] [Indexed: 05/04/2023]
Abstract
Triticum aestivum (wheat), a major staple food grain, is affected by various biotic stresses. Among these, fungal diseases cause about 15-20% of yield loss, worldwide. In this study, we performed a comparative analysis of protein-protein interactions between two Puccinia graminis races (Pgt 21-0 and Pgt Ug99) that cause stem (black) rust in wheat. The available molecular techniques to study the host-pathogen interaction mechanisms are expensive and labor-intensive. We implemented two computational approaches (interolog and domain-based) for the prediction of PPIs and performed various functional analysis to determine the significant differences between the two pathogen races. The analysis revealed that T. aestivum-Pgt 21-0 and T. aestivum-Pgt Ug99 interactomes consisted of ∼90M and ∼56M putative PPIs, respectively. In the predicted PPIs, we identified 115 Pgt 21-0 and 34 Pgt Ug99 potential effectors that were highly involved in pathogen virulence and development. Functional enrichment analysis of the host proteins revealed significant GO terms and KEGG pathways such as O-methyltransferase activity (GO:0008171), regulation of signal transduction (GO:0009966), lignin metabolic process (GO:0009808), plastid envelope (GO:0009526), plant-pathogen interaction pathway (ko04626), and MAPK pathway (ko04016) that are actively involved in plant defense and immune signaling against the biotic stresses. Subcellular localization analysis anticipated the host plastid as a primary target for pathogen attack. The highly connected host hubs in the protein interaction network belonged to protein kinase domain including Ser/Thr protein kinase, MAPK, and cyclin-dependent kinase. We also identified 5,577 transcription factors in the interactions, associated with plant defense during biotic stress conditions. Additionally, novel host targets that are resistant to stem rust disease were also identified. The present study elucidates the functional differences between Pgt 21-0 and Pgt Ug99, thus providing the researchers with strain-specific information for further experimental validation of the interactions, and the development of durable, disease-resistant crop lines.
Collapse
Affiliation(s)
- Raghav Kataria
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
- Bioinformatics Facility, Center for Integrated BioSystems, Utah State University, Logan, UT, United States
- Department of Computer Science, College of Science, Utah State University, Logan, UT, United States
- *Correspondence: Rakesh Kaundal,
| |
Collapse
|
10
|
Sudhakar P, Machiels K, Verstockt B, Korcsmaros T, Vermeire S. Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions. Front Microbiol 2021; 12:618856. [PMID: 34046017 PMCID: PMC8148342 DOI: 10.3389/fmicb.2021.618856] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 03/19/2021] [Indexed: 12/11/2022] Open
Abstract
The microbiome, by virtue of its interactions with the host, is implicated in various host functions including its influence on nutrition and homeostasis. Many chronic diseases such as diabetes, cancer, inflammatory bowel diseases are characterized by a disruption of microbial communities in at least one biological niche/organ system. Various molecular mechanisms between microbial and host components such as proteins, RNAs, metabolites have recently been identified, thus filling many gaps in our understanding of how the microbiome modulates host processes. Concurrently, high-throughput technologies have enabled the profiling of heterogeneous datasets capturing community level changes in the microbiome as well as the host responses. However, due to limitations in parallel sampling and analytical procedures, big gaps still exist in terms of how the microbiome mechanistically influences host functions at a system and community level. In the past decade, computational biology and machine learning methodologies have been developed with the aim of filling the existing gaps. Due to the agnostic nature of the tools, they have been applied in diverse disease contexts to analyze and infer the interactions between the microbiome and host molecular components. Some of these approaches allow the identification and analysis of affected downstream host processes. Most of the tools statistically or mechanistically integrate different types of -omic and meta -omic datasets followed by functional/biological interpretation. In this review, we provide an overview of the landscape of computational approaches for investigating mechanistic interactions between individual microbes/microbiome and the host and the opportunities for basic and clinical research. These could include but are not limited to the development of activity- and mechanism-based biomarkers, uncovering mechanisms for therapeutic interventions and generating integrated signatures to stratify patients.
Collapse
Affiliation(s)
- Padhmanand Sudhakar
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Kathleen Machiels
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
| | - Bram Verstockt
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| | - Tamas Korcsmaros
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Séverine Vermeire
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| |
Collapse
|
11
|
Louadi Z, Yuan K, Gress A, Tsoy O, Kalinina OV, Baumbach J, Kacprowski T, List M. DIGGER: exploring the functional role of alternative splicing in protein interactions. Nucleic Acids Res 2021; 49:D309-D318. [PMID: 32976589 PMCID: PMC7778957 DOI: 10.1093/nar/gkaa768] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 09/01/2020] [Accepted: 09/04/2020] [Indexed: 12/20/2022] Open
Abstract
Alternative splicing plays a major role in regulating the functional repertoire of the proteome. However, isoform-specific effects to protein-protein interactions (PPIs) are usually overlooked, making it impossible to judge the functional role of individual exons on a systems biology level. We overcome this barrier by integrating protein-protein interactions, domain-domain interactions and residue-level interactions information to lift exon expression analysis to a network level. Our user-friendly database DIGGER is available at https://exbio.wzw.tum.de/digger and allows users to seamlessly switch between isoform and exon-centric views of the interactome and to extract sub-networks of relevant isoforms, making it an essential resource for studying mechanistic consequences of alternative splicing.
Collapse
Affiliation(s)
- Zakaria Louadi
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - Kevin Yuan
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - Alexander Gress
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), 66123 Saarbrücken, Germany
| | - Olga Tsoy
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - Olga V Kalinina
- Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research (HZI), 66123 Saarbrücken, Germany.,Faculty of Medicine, Saarland University, 66421 Homburg, Germany
| | - Jan Baumbach
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany.,Department of Mathematics and Computer Science, University of Southern Denmark, 5230 Odense M, Denmark
| | - Tim Kacprowski
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| | - Markus List
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, 85354 Freising, Germany
| |
Collapse
|
12
|
Pathogen and Host-Pathogen Protein Interactions Provide a Key to Identify Novel Drug Targets. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11607-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
|
13
|
Kataria R, Duhan N, Kaundal R. Computational Systems Biology of Alfalfa - Bacterial Blight Host-Pathogen Interactions: Uncovering the Complex Molecular Networks for Developing Durable Disease Resistant Crop. FRONTIERS IN PLANT SCIENCE 2021; 12:807354. [PMID: 35251063 PMCID: PMC8891223 DOI: 10.3389/fpls.2021.807354] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 12/29/2021] [Indexed: 05/04/2023]
Abstract
Medicago sativa (also known as alfalfa), a forage legume, is widely cultivated due to its high yield and high-value hay crop production. Infectious diseases are a major threat to the crops, owing to huge economic losses to the agriculture industry, worldwide. The protein-protein interactions (PPIs) between the pathogens and their hosts play a critical role in understanding the molecular basis of pathogenesis. Pseudomonas syringae pv. syringae ALF3 suppresses the plant's innate immune response by secreting type III effector proteins into the host cell, causing bacterial stem blight in alfalfa. The alfalfa-P. syringae system has little information available for PPIs. Thus, to understand the infection mechanism, we elucidated the genome-scale host-pathogen interactions (HPIs) between alfalfa and P. syringae using two computational approaches: interolog-based and domain-based method. A total of ∼14 M putative PPIs were predicted between 50,629 alfalfa proteins and 2,932 P. syringae proteins by combining these approaches. Additionally, ∼0.7 M consensus PPIs were also predicted. The functional analysis revealed that P. syringae proteins are highly involved in nucleotide binding activity (GO:0000166), intracellular organelle (GO:0043229), and translation (GO:0006412) while alfalfa proteins are involved in cellular response to chemical stimulus (GO:0070887), oxidoreductase activity (GO:0016614), and Golgi apparatus (GO:0005794). According to subcellular localization predictions, most of the pathogen proteins targeted host proteins within the cytoplasm and nucleus. In addition, we discovered a slew of new virulence effectors in the predicted HPIs. The current research describes an integrated approach for deciphering genome-scale host-pathogen PPIs between alfalfa and P. syringae, allowing the researchers to better understand the pathogen's infection mechanism and develop pathogen-resistant lines.
Collapse
Affiliation(s)
- Raghav Kataria
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
| | - Naveen Duhan
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
| | - Rakesh Kaundal
- Department of Plants, Soils, and Climate, College of Agriculture and Applied Sciences, Utah State University, Logan, UT, United States
- Bioinformatics Facility, Center for Integrated Biosystems, Utah State University, Logan, UT, United States
- Department of Computer Science, College of Science, Utah State University, Logan, UT, United States
- *Correspondence: Rakesh Kaundal, ;
| |
Collapse
|
14
|
Bhat AS, Kinch LN, Grishin NV. β-Strand-mediated interactions of protein domains. Proteins 2020; 88:1513-1527. [PMID: 32543729 PMCID: PMC8018532 DOI: 10.1002/prot.25970] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Revised: 03/10/2020] [Accepted: 06/06/2020] [Indexed: 01/14/2023]
Abstract
Protein domains exist by themselves or in combination with other domains to form complex multidomain proteins. Defining domain boundaries in proteins is essential for understanding their evolution and function but is not trivial. More specifically, partitioning domains that interact by forming a single β-sheet is known to be particularly troublesome for automatic structure-based domain decomposition pipelines. Here, we study edge-to-edge β-strand interactions between domains in a protein chain, to help define the boundaries for some more difficult cases where a single β-sheet spanning over two domains gives an appearance of one. We give a number of examples where β-strands belonging to a single β-sheet do not belong to a single domain and highlight the difficulties of automatic domain parsers on these examples. This work can be used as a baseline for defining domain boundaries in homologous proteins or proteins with similar domain interactions in the future.
Collapse
Affiliation(s)
- Archana S. Bhat
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9050
| | - Lisa N. Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9050
| | - Nick V. Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9050
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9050
| |
Collapse
|
15
|
Mishra SK, Muthye V, Kandoi G. Computational Methods for Predicting Functions at the mRNA Isoform Level. Int J Mol Sci 2020; 21:ijms21165686. [PMID: 32784445 PMCID: PMC7460821 DOI: 10.3390/ijms21165686] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 08/05/2020] [Accepted: 08/06/2020] [Indexed: 11/16/2022] Open
Abstract
Multiple mRNA isoforms of the same gene are produced via alternative splicing, a biological mechanism that regulates protein diversity while maintaining genome size. Alternatively spliced mRNA isoforms of the same gene may sometimes have very similar sequence, but they can have significantly diverse effects on cellular function and regulation. The products of alternative splicing have important and diverse functional roles, such as response to environmental stress, regulation of gene expression, human heritable, and plant diseases. The mRNA isoforms of the same gene can have dramatically different functions. Despite the functional importance of mRNA isoforms, very little has been done to annotate their functions. The recent years have however seen the development of several computational methods aimed at predicting mRNA isoform level biological functions. These methods use a wide array of proteo-genomic data to develop machine learning-based mRNA isoform function prediction tools. In this review, we discuss the computational methods developed for predicting the biological function at the individual mRNA isoform level.
Collapse
|
16
|
Andrighetti T, Bohar B, Lemke N, Sudhakar P, Korcsmaros T. MicrobioLink: An Integrated Computational Pipeline to Infer Functional Effects of Microbiome-Host Interactions. Cells 2020; 9:cells9051278. [PMID: 32455748 PMCID: PMC7291277 DOI: 10.3390/cells9051278] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 05/15/2020] [Accepted: 05/19/2020] [Indexed: 02/07/2023] Open
Abstract
Microbiome–host interactions play significant roles in health and in various diseases including autoimmune disorders. Uncovering these inter-kingdom cross-talks propels our understanding of disease pathogenesis and provides useful leads on potential therapeutic targets. Despite the biological significance of microbe–host interactions, there is a big gap in understanding the downstream effects of these interactions on host processes. Computational methods are expected to fill this gap by generating, integrating, and prioritizing predictions—as experimental detection remains challenging due to feasibility issues. Here, we present MicrobioLink, a computational pipeline to integrate predicted interactions between microbial and host proteins together with host molecular networks. Using the concept of network diffusion, MicrobioLink can analyse how microbial proteins in a certain context are influencing cellular processes by modulating gene or protein expression. We demonstrated the applicability of the pipeline using a case study. We used gut metaproteomic data from Crohn’s disease patients and healthy controls to uncover the mechanisms by which the microbial proteins can modulate host genes which belong to biological processes implicated in disease pathogenesis. MicrobioLink, which is agnostic of the microbial protein sources (bacterial, viral, etc.), is freely available on GitHub.
Collapse
Affiliation(s)
- Tahila Andrighetti
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK; (T.A.); (B.B.)
- Institute of Biosciences, São Paulo University (UNESP), Botucatu 18618-689, SP, Brazil;
| | - Balazs Bohar
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK; (T.A.); (B.B.)
- Department of Genetics, Eötvös Loránd University, Budapest 1117, Hungary
| | - Ney Lemke
- Institute of Biosciences, São Paulo University (UNESP), Botucatu 18618-689, SP, Brazil;
| | - Padhmanand Sudhakar
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK; (T.A.); (B.B.)
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
- Department of Chronic Diseases, Metabolism and Ageing, KU Leuven BE-3000, Leuven, Belgium
- Correspondence: (T.K.); (P.S.)
| | - Tamas Korcsmaros
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK; (T.A.); (B.B.)
- Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
- Correspondence: (T.K.); (P.S.)
| |
Collapse
|
17
|
Poverennaya EV, Kiseleva OI, Ivanov AS, Ponomarenko EA. Methods of Computational Interactomics for Investigating Interactions of Human Proteoforms. BIOCHEMISTRY (MOSCOW) 2020; 85:68-79. [PMID: 32079518 DOI: 10.1134/s000629792001006x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Human genome contains ca. 20,000 protein-coding genes that could be translated into millions of unique protein species (proteoforms). Proteoforms coded by a single gene often have different functions, which implies different protein partners. By interacting with each other, proteoforms create a network reflecting the dynamics of cellular processes in an organism. Perturbations of protein-protein interactions change the network topology, which often triggers pathological processes. Studying proteoforms is a relatively new research area in proteomics, and this is why there are comparatively few experimental studies on the interaction of proteoforms. Bioinformatics tools can facilitate such studies by providing valuable complementary information to the experimental data and, in particular, expanding the possibilities of the studies of proteoform interactions.
Collapse
Affiliation(s)
| | - O I Kiseleva
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
| | - A S Ivanov
- Institute of Biomedical Chemistry, Moscow, 119121, Russia
| | | |
Collapse
|
18
|
Marini S, Vitali F, Rampazzi S, Demartini A, Akutsu T. Protease target prediction via matrix factorization. Bioinformatics 2019; 35:923-929. [PMID: 30169576 DOI: 10.1093/bioinformatics/bty746] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Revised: 08/20/2018] [Accepted: 08/27/2018] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Protein cleavage is an important cellular event, involved in a myriad of processes, from apoptosis to immune response. Bioinformatics provides in silico tools, such as machine learning-based models, to guide the discovery of targets for the proteases responsible for protein cleavage. State-of-the-art models have a scope limited to specific protease families (such as Caspases), and do not explicitly include biological or medical knowledge (such as the hierarchical protein domain similarity or gene-gene interactions). To fill this gap, we present a novel approach for protease target prediction based on data integration. RESULTS By representing protease-protein target information in the form of relational matrices, we design a model (i) that is general and not limited to a single protease family, and (b) leverages on the available knowledge, managing extremely sparse data from heterogeneous data sources, including primary sequence, pathways, domains and interactions. When compared with other algorithms on test data, our approach provides a better performance even for models specifically focusing on a single protease family. AVAILABILITY AND IMPLEMENTATION https://gitlab.com/smarini/MaDDA/ (Matlab code and utilized data.). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Simone Marini
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Francesca Vitali
- Department of Medicine, Center for Biomedical Informatics and Biostatistics, BIO5 Institute), University of Arizona, Tucson, AZ, USA
| | - Sara Rampazzi
- Department of Computer Science and Engineering, University of Michigan, Ann Arbor, MI, USA
| | - Andrea Demartini
- Department of Electrical Computer and Biomedical Engineering, University of Pavia, Pavia, Italy
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, Japan
| |
Collapse
|
19
|
Sudhakar P, Jacomin AC, Hautefort I, Samavedam S, Fatemian K, Ari E, Gul L, Demeter A, Jones E, Korcsmaros T, Nezis IP. Targeted interplay between bacterial pathogens and host autophagy. Autophagy 2019; 15:1620-1633. [PMID: 30909843 PMCID: PMC6693458 DOI: 10.1080/15548627.2019.1590519] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2017] [Revised: 02/21/2019] [Accepted: 03/01/2019] [Indexed: 12/12/2022] Open
Abstract
Due to the critical role played by autophagy in pathogen clearance, pathogens have developed diverse strategies to subvert it. Despite previous key findings of bacteria-autophagy interplay, asystems-level insight into selective targeting by the host and autophagy modulation by the pathogens is lacking. We predicted potential interactions between human autophagy proteins and effector proteins from 56 pathogenic bacterial species by identifying bacterial proteins predicted to have recognition motifs for selective autophagy receptors SQSTM1/p62, CALCOCO2/NDP52 and MAP1LC3/LC3. Using structure-based interaction prediction, we identified bacterial proteins capable to modify core autophagy components. Our analysis revealed that autophagy receptors in general potentially target mostly genus-specific proteins, and not those present in multiple genera. The complementarity between the predicted SQSTM1/p62 and CALCOCO2/NDP52 targets, which has been shown for Salmonella, Listeria and Shigella, could be observed across other pathogens. This complementarity potentially leaves the host more susceptible to chronic infections upon the mutation of autophagy receptors. Proteins derived from enterotoxigenic and non-toxigenic Bacillus outer membrane vesicles indicated that autophagy targets pathogenic proteins rather than non-pathogenic ones. We also observed apathogen-specific pattern as to which autophagy phase could be modulated by specific genera. We found intriguing examples of bacterial proteins that could modulate autophagy, and in turn being targeted by autophagy as ahost defense mechanism. We confirmed experimentally an interplay between a Salmonella protease, YhjJ and autophagy. Our comparative meta-analysis points out key commonalities and differences in how pathogens could affect autophagy and how autophagy potentially recognizes these pathogenic effectors. Abbreviations: ATG5: autophagy related 5; CALCOCO2/NDP52: calcium binding and coiled-coil domain 2; GST: glutathione S-transferase; LIR: MAP1LC3/LC3-interacting region; MAP1LC3/LC3: microtubule associated protein 1 light chain 3 alpha; OMV: outer membrane vesicles; SQSTM1/p62: sequestosome 1; SCV: Salmonella containing vesicle; TECPR1: tectonin beta-propeller repeat containing 1; YhjJ: hypothetical zinc-protease.
Collapse
Affiliation(s)
- Padhmanand Sudhakar
- Earlham Institute, Norwich Research Park, Norwich, UK
- Gut Health and Microbes Programme, Quadram Institute, Norwich Research Park, Norwich, UK
- Department of Chronic Diseases, Metabolism and Ageing, KU Leuven, Leuven, Belgium
| | | | | | - Siva Samavedam
- School of Life Sciences, University of Warwick, Coventry, UK
| | - Koorosh Fatemian
- School of Life Sciences, University of Warwick, Coventry, UK
- Current affiliation:Exaelements LTD, Coventry, UK
| | - Eszter Ari
- Department of Genetics, Eotvos Lorand University, Budapest, Hungary
- Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Leila Gul
- Earlham Institute, Norwich Research Park, Norwich, UK
| | - Amanda Demeter
- Earlham Institute, Norwich Research Park, Norwich, UK
- Gut Health and Microbes Programme, Quadram Institute, Norwich Research Park, Norwich, UK
- Department of Genetics, Eotvos Lorand University, Budapest, Hungary
| | - Emily Jones
- Earlham Institute, Norwich Research Park, Norwich, UK
- Gut Health and Microbes Programme, Quadram Institute, Norwich Research Park, Norwich, UK
| | - Tamas Korcsmaros
- Earlham Institute, Norwich Research Park, Norwich, UK
- Gut Health and Microbes Programme, Quadram Institute, Norwich Research Park, Norwich, UK
| | | |
Collapse
|
20
|
Kobren SN, Singh M. Systematic domain-based aggregation of protein structures highlights DNA-, RNA- and other ligand-binding positions. Nucleic Acids Res 2019; 47:582-593. [PMID: 30535108 PMCID: PMC6344845 DOI: 10.1093/nar/gky1224] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Accepted: 11/26/2018] [Indexed: 02/06/2023] Open
Abstract
Domains are fundamental subunits of proteins, and while they play major roles in facilitating protein–DNA, protein–RNA and other protein–ligand interactions, a systematic assessment of their various interaction modes is still lacking. A comprehensive resource identifying positions within domains that tend to interact with nucleic acids, small molecules and other ligands would expand our knowledge of domain functionality as well as aid in detecting ligand-binding sites within structurally uncharacterized proteins. Here, we introduce an approach to identify per-domain-position interaction ‘frequencies’ by aggregating protein co-complex structures by domain and ascertaining how often residues mapping to each domain position interact with ligands. We perform this domain-based analysis on ∼91000 co-complex structures, and infer positions involved in binding DNA, RNA, peptides, ions or small molecules across 4128 domains, which we refer to collectively as the InteracDome. Cross-validation testing reveals that ligand-binding positions for 2152 domains are highly consistent and can be used to identify residues facilitating interactions in ∼63–69% of human genes. Our resource of domain-inferred ligand-binding sites should be a great aid in understanding disease etiology: whereas these sites are enriched in Mendelian-associated and cancer somatic mutations, they are depleted in polymorphisms observed across healthy populations. The InteracDome is available at http://interacdome.princeton.edu.
Collapse
Affiliation(s)
- Shilpa Nadimpalli Kobren
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck Street, Boston, MA 02115, USA
| | - Mona Singh
- Department of Computer Science, Princeton University, 35 Olden Street, Princeton, NJ 08544, USA.,Lewis-Sigler Institute for Integrative Genomics, Princeton University, Carl Icahn Laboratory, Princeton, NJ 08544, USA
| |
Collapse
|
21
|
Hillier C, Pardo M, Yu L, Bushell E, Sanderson T, Metcalf T, Herd C, Anar B, Rayner JC, Billker O, Choudhary JS. Landscape of the Plasmodium Interactome Reveals Both Conserved and Species-Specific Functionality. Cell Rep 2019; 28:1635-1647.e5. [PMID: 31390575 PMCID: PMC6693557 DOI: 10.1016/j.celrep.2019.07.019] [Citation(s) in RCA: 40] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 05/28/2019] [Accepted: 07/08/2019] [Indexed: 11/16/2022] Open
Abstract
Malaria represents a major global health issue, and the identification of new intervention targets remains an urgent priority. This search is hampered by more than one-third of the genes of malaria-causing Plasmodium parasites being uncharacterized. We report a large-scale protein interaction network in Plasmodium schizonts, generated by combining blue native-polyacrylamide electrophoresis with quantitative mass spectrometry and machine learning. This integrative approach, spanning 3 species, identifies >20,000 putative protein interactions, organized into 600 protein clusters. We validate selected interactions, assigning functions in chromatin regulation to previously unannotated proteins and suggesting a role for an EELM2 domain-containing protein and a putative microrchidia protein as mechanistic links between AP2-domain transcription factors and epigenetic regulation. Our interactome represents a high-confidence map of the native organization of core cellular processes in Plasmodium parasites. The network reveals putative functions for uncharacterized proteins, provides mechanistic and structural insight, and uncovers potential alternative therapeutic targets.
Collapse
Affiliation(s)
- Charles Hillier
- Developmental Biology Unit, European Molecular Biology Laboratory, 69117 Heidelberg, Germany
| | - Mercedes Pardo
- Functional Proteomics, The Institute of Cancer Research, London SW7 3RP, UK.
| | - Lu Yu
- Functional Proteomics, The Institute of Cancer Research, London SW7 3RP, UK
| | - Ellen Bushell
- Department of Molecular Biology, The Laboratory for Molecular Infection Medicine Sweden, Umeå University, 901 87 Umeå, Sweden
| | - Theo Sanderson
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Tom Metcalf
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Colin Herd
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Burcu Anar
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Julian C Rayner
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Oliver Billker
- Department of Molecular Biology, The Laboratory for Molecular Infection Medicine Sweden, Umeå University, 901 87 Umeå, Sweden.
| | - Jyoti S Choudhary
- Functional Proteomics, The Institute of Cancer Research, London SW7 3RP, UK.
| |
Collapse
|
22
|
Jones EJ, Matthews ZJ, Gul L, Sudhakar P, Treveil A, Divekar D, Buck J, Wrzesinski T, Jefferson M, Armstrong SD, Hall LJ, Watson AJM, Carding SR, Haerty W, Di Palma F, Mayer U, Powell PP, Hautefort I, Wileman T, Korcsmaros T. Integrative analysis of Paneth cell proteomic and transcriptomic data from intestinal organoids reveals functional processes dependent on autophagy. Dis Model Mech 2019; 12:dmm037069. [PMID: 30814064 PMCID: PMC6451430 DOI: 10.1242/dmm.037069] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 02/01/2019] [Indexed: 12/12/2022] Open
Abstract
Paneth cells are key epithelial cells that provide an antimicrobial barrier and maintain integrity of the small-intestinal stem cell niche. Paneth cell abnormalities are unfortunately detrimental to gut health and are often associated with digestive pathologies such as Crohn's disease or infections. Similar alterations are observed in individuals with impaired autophagy, a process that recycles cellular components. The direct effect of autophagy impairment on Paneth cells has not been analysed. To investigate this, we generated a mouse model lacking Atg16l1 specifically in intestinal epithelial cells, making these cells impaired in autophagy. Using three-dimensional intestinal organoids enriched for Paneth cells, we compared the proteomic profiles of wild-type and autophagy-impaired organoids. We used an integrated computational approach combining protein-protein interaction networks, autophagy-targeted proteins and functional information to identify the mechanistic link between autophagy impairment and disrupted pathways. Of the 284 altered proteins, 198 (70%) were more abundant in autophagy-impaired organoids, suggesting reduced protein degradation. Interestingly, these differentially abundant proteins comprised 116 proteins (41%) that are predicted targets of the selective autophagy proteins p62, LC3 and ATG16L1. Our integrative analysis revealed autophagy-mediated mechanisms that degrade key proteins in Paneth cell functions, such as exocytosis, apoptosis and DNA damage repair. Transcriptomic profiling of additional organoids confirmed that 90% of the observed changes upon autophagy alteration have effects at the protein level, not on gene expression. We performed further validation experiments showing differential lysozyme secretion, confirming our computationally inferred downregulation of exocytosis. Our observations could explain how protein-level alterations affect Paneth cell homeostatic functions upon autophagy impairment.This article has an associated First Person interview with the joint first authors of the paper.
Collapse
Affiliation(s)
- Emily J Jones
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Zoe J Matthews
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Lejla Gul
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Padhmanand Sudhakar
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
| | - Agatha Treveil
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
| | - Devina Divekar
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Jasmine Buck
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | | | - Matthew Jefferson
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Stuart D Armstrong
- National Institute of Health Research, University of Liverpool, Liverpool L3 5RF, UK
| | - Lindsay J Hall
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
| | - Alastair J M Watson
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Simon R Carding
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Wilfried Haerty
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | | | - Ulrike Mayer
- School of Biological Sciences, University of East Anglia, Norwich NR4 7TJ, UK
| | - Penny P Powell
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | | | - Tom Wileman
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
- Norwich Medical School, University of East Anglia, Norwich NR4 7TJ, UK
| | - Tamas Korcsmaros
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
- Quadram Institute, Norwich Research Park, Norwich NR4 7UA, UK
| |
Collapse
|
23
|
Leite DMC, Brochet X, Resch G, Que YA, Neves A, Peña-Reyes C. Computational prediction of inter-species relationships through omics data analysis and machine learning. BMC Bioinformatics 2018; 19:420. [PMID: 30453987 PMCID: PMC6245486 DOI: 10.1186/s12859-018-2388-7] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Antibiotic resistance and its rapid dissemination around the world threaten the efficacy of currently-used medical treatments and call for novel, innovative approaches to manage multi-drug resistant infections. Phage therapy, i.e., the use of viruses (phages) to specifically infect and kill bacteria during their life cycle, is one of the most promising alternatives to antibiotics. It is based on the correct matching between a target pathogenic bacteria and the therapeutic phage. Nevertheless, correctly matching them is a major challenge. Currently, there is no systematic method to efficiently predict whether phage-bacterium interactions exist and these pairs must be empirically tested in laboratory. Herein, we present our approach for developing a computational model able to predict whether a given phage-bacterium pair can interact based on their genome. RESULTS Based on public data from GenBank and phagesDB.org, we collected more than a thousand positive phage-bacterium interactions with their complete genomes. In addition, we generated putative negative (i.e., non-interacting) pairs. We extracted, from the collected genomes, a set of informative features based on the distribution of predictive protein-protein interactions and on their primary structure (e.g. amino-acid frequency, molecular weight and chemical composition of each protein). With these features, we generated multiple candidate datasets to train our algorithms. On this base, we built predictive models exhibiting predictive performance of around 90% in terms of F1-score, sensitivity, specificity, and accuracy, obtained on the test set with 10-fold cross-validation. CONCLUSION These promising results reinforce the hypothesis that machine learning techniques may produce highly-predictive models accelerating the search of interacting phage-bacteria pairs.
Collapse
Affiliation(s)
- Diogo Manuel Carvalho Leite
- School of Business and Engineering Vaud (HEIG-VD), University of Applied Sciences Western Switzerland (HES-SO), Route. de Cheseaux 1, Yverdon-Les-Bains, 1400 Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Xavier Brochet
- School of Business and Engineering Vaud (HEIG-VD), University of Applied Sciences Western Switzerland (HES-SO), Route. de Cheseaux 1, Yverdon-Les-Bains, 1400 Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Grégory Resch
- Department of Fundamental Microbiology, University of Lausanne, Lausanne, 1015 Switzerland
| | - Yok-Ai Que
- Department of Intensive Care Medicine, Bern University Hospital (Inselspital), Freiburgstrasse, Bern, 3010 Switzerland
| | - Aitana Neves
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Carlos Peña-Reyes
- School of Business and Engineering Vaud (HEIG-VD), University of Applied Sciences Western Switzerland (HES-SO), Route. de Cheseaux 1, Yverdon-Les-Bains, 1400 Switzerland
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
24
|
Caetano-Anolles K, Kim K, Kwak W, Sung S, Kim H, Choi BH, Lim D. Genome sequencing and protein domain annotations of Korean Hanwoo cattle identify Hanwoo-specific immunity-related and other novel genes. BMC Genet 2018; 19:37. [PMID: 29843617 PMCID: PMC5975384 DOI: 10.1186/s12863-018-0623-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 05/14/2018] [Indexed: 12/30/2022] Open
Abstract
Background Identification of genetic mechanisms and idiosyncrasies at the breed-level can provide valuable information for potential use in evolutionary studies, medical applications, and breeding of selective traits. Here, we analyzed genomic data collected from 136 Korean Native cattle, known as Hanwoo, using advanced statistical methods. Results Results revealed Hanwoo-specific protein domains which were largely characterized by immunoglobulin function. Furthermore, domain interactions of novel Hanwoo-specific genes reveal additional links to immunity. Novel Hanwoo-specific genes linked to muscle and other functions were identified, including protein domains with functions related to energy, fat storage, and muscle function that may provide insight into the mechanisms behind Hanwoo cattle’s uniquely high percentage of intramuscular fat and fat marbling. Conclusion The identification of Hanwoo-specific genes linked to immunity are potentially useful for future medical research and selective breeding. The significant genomic variations identified here can crucially identify genetic novelties that are arising from useful adaptations. These results will allow future researchers to compare and classify breeds, identify important genetic markers, and develop breeding strategies to further improve significant traits. Electronic supplementary material The online version of this article (10.1186/s12863-018-0623-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kelsey Caetano-Anolles
- Department of Agricultural Biotechnology and Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 151-921, Republic of Korea
| | - Kwondo Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, 151-741, Republic of Korea
| | - Woori Kwak
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, 151-741, Republic of Korea.,CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea
| | - Samsun Sung
- CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea
| | - Heebal Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, 151-741, Republic of Korea.,Department of Agricultural Biotechnology and Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 151-921, Republic of Korea.,CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea
| | - Bong-Hwan Choi
- Animal Genomics & Bioinformatics Division, National Institute of Animal Science, RDA, 77 Chuksan-gil, Kwonsun-gu, Suwon, 441-706, Republic of Korea
| | - Dajeong Lim
- Animal Genomics & Bioinformatics Division, National Institute of Animal Science, RDA, 77 Chuksan-gil, Kwonsun-gu, Suwon, 441-706, Republic of Korea.
| |
Collapse
|
25
|
Climente-González H, Porta-Pardo E, Godzik A, Eyras E. The Functional Impact of Alternative Splicing in Cancer. Cell Rep 2017; 20:2215-2226. [DOI: 10.1016/j.celrep.2017.08.012] [Citation(s) in RCA: 376] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 07/15/2017] [Accepted: 07/26/2017] [Indexed: 12/29/2022] Open
|
26
|
Chang JW, Zhou YQ, Ul Qamar MT, Chen LL, Ding YD. Prediction of Protein-Protein Interactions by Evidence Combining Methods. Int J Mol Sci 2016; 17:ijms17111946. [PMID: 27879651 PMCID: PMC5133940 DOI: 10.3390/ijms17111946] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Revised: 11/15/2016] [Accepted: 11/15/2016] [Indexed: 12/27/2022] Open
Abstract
Most cellular functions involve proteins' features based on their physical interactions with other partner proteins. Sketching a map of protein-protein interactions (PPIs) is therefore an important inception step towards understanding the basics of cell functions. Several experimental techniques operating in vivo or in vitro have made significant contributions to screening a large number of protein interaction partners, especially high-throughput experimental methods. However, computational approaches for PPI predication supported by rapid accumulation of data generated from experimental techniques, 3D structure definitions, and genome sequencing have boosted the map sketching of PPIs. In this review, we shed light on in silico PPI prediction methods that integrate evidence from multiple sources, including evolutionary relationship, function annotation, sequence/structure features, network topology and text mining. These methods are developed for integration of multi-dimensional evidence, for designing the strategies to predict novel interactions, and for making the results consistent with the increase of prediction coverage and accuracy.
Collapse
Affiliation(s)
- Ji-Wei Chang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Yan-Qing Zhou
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Muhammad Tahir Ul Qamar
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Ling-Ling Chen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Yu-Duan Ding
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| |
Collapse
|
27
|
Qin GM, Li RY, Zhao XM. Identifying Disease Associated miRNAs Based on Protein Domains. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016; 13:1027-1035. [PMID: 26829801 DOI: 10.1109/tcbb.2016.2515608] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
MicroRNAs (miRNAs) are a class of small endogenous non-coding genes, acting as regulators in the post-transcriptional processes. Recently, the miRNAs are found to be widely involved in different types of diseases. Therefore, the identification of disease associated miRNAs can help understand the mechanisms that underlie the disease and identify new biomarkers. However, it is not easy to identify the miRNAs related to diseases due to its extensive involvements in various biological processes. In this work, we present a new approach to identify disease associated miRNAs based on domains, the functional and structural blocks of proteins. The results on real datasets demonstrate that our method can effectively identify disease related miRNAs with high precision.
Collapse
|
28
|
Iddamalgoda L, Das PS, Aponso A, Sundararajan VS, Suravajhala P, Valadi JK. Data Mining and Pattern Recognition Models for Identifying Inherited Diseases: Challenges and Implications. Front Genet 2016; 7:136. [PMID: 27559342 PMCID: PMC4979376 DOI: 10.3389/fgene.2016.00136] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2016] [Accepted: 07/15/2016] [Indexed: 01/02/2023] Open
Abstract
Data mining and pattern recognition methods reveal interesting findings in genetic studies, especially on how the genetic makeup is associated with inherited diseases. Although researchers have proposed various data mining models for biomedical approaches, there remains a challenge in accurately prioritizing the single nucleotide polymorphisms (SNP) associated with the disease. In this commentary, we review the state-of-art data mining and pattern recognition models for identifying inherited diseases and deliberate the need of binary classification- and scoring-based prioritization methods in determining causal variants. While we discuss the pros and cons associated with these methods known, we argue that the gene prioritization methods and the protein interaction (PPI) methods in conjunction with the K nearest neighbors' could be used in accurately categorizing the genetic factors in disease causation.
Collapse
Affiliation(s)
- Lahiru Iddamalgoda
- Department of Computing, Informatics Institute of Technology, University of WestminsterColombo, Sri Lanka
| | - Partha S. Das
- Department of Microbiology, Bioinformatics Infrastructure Facility, Vidyasagar UniversityMidnapore, India
- Bioinformatics, Bioclues OrganizationHyderabad, India
| | - Achala Aponso
- Department of Computing, Informatics Institute of Technology, University of WestminsterColombo, Sri Lanka
| | - Vijayaraghava S. Sundararajan
- Bioinformatics, Bioclues OrganizationHyderabad, India
- Environmental Health Institute, National Environment Agency, SingaporeSingapore
| | - Prashanth Suravajhala
- Bioinformatics, Bioclues OrganizationHyderabad, India
- Molecular Biology and Genetics, Quantitative Genetics and Genomics, Aarhus UniversityTjele, Denmark
- Bioinformatics, Bioinformatics OrganizationHudson, MA, USA
| | | |
Collapse
|
29
|
Bhardwaj J, Gangwar I, Panzade G, Shankar R, Yadav SK. Global De Novo Protein-Protein Interactome Elucidates Interactions of Drought-Responsive Proteins in Horse Gram (Macrotyloma uniflorum). J Proteome Res 2016; 15:1794-809. [PMID: 27161830 DOI: 10.1021/acs.jproteome.5b01114] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Inspired by the availability of de novo transcriptome of horse gram (Macrotyloma uniflorum) and recent developments in systems biology studies, the first ever global protein-protein interactome (PPI) map was constructed for this highly drought-tolerant legume. Large-scale studies of PPIs and the constructed database would provide rationale behind the interplay at cascading translational levels for drought stress-adaptive mechanisms in horse gram. Using a bidirectional approach (interolog and domain-based), a high-confidence interactome map and database for horse gram was constructed. Available transcriptomic information for shoot and root tissues of a sensitive (M-191; genotype 1) and a drought-tolerant (M-249; genotype 2) genotype of horse gram was utilized to draw comparative PPI subnetworks under drought stress. High-confidence 6804 interactions were predicted among 1812 proteins covering about one-fourth of the horse gram proteome. The highest number of interactions (33.86%) in horse gram interactome matched with Arabidopsis PPI data. The top five hub nodes mostly included ubiquitin and heat-shock-related proteins. Higher numbers of PPIs were found to be responsive in shoot tissue (416) and root tissue (2228) of genotype 2 compared with shoot tissue (136) and root tissue (579) of genotype 1. Characterization of PPIs using gene ontology analysis revealed that kinase and transferase activities involved in signal transduction, cellular processes, nucleocytoplasmic transport, protein ubiquitination, and localization of molecules were most responsive to drought stress. Hence, these could be framed in stress adaptive mechanisms of horse gram. Being the first legume global PPI map, it would provide new insights into gene and protein regulatory networks for drought stress tolerance mechanisms in horse gram. Information compiled in the form of database (MauPIR) will provide the much needed high-confidence systems biology information for horse gram genes, proteins, and involved processes. This information would ease the effort and increase the efficacy for similar studies on other legumes. Public access is available at http://14.139.59.221/MauPIR/ .
Collapse
Affiliation(s)
| | | | | | | | - Sudesh Kumar Yadav
- Center of Innovative and Applied Bioprocessing (CIAB) , Mohali 160071, Punjab, India
| |
Collapse
|
30
|
Prediction of human protein–protein interaction by a domain-based approach. J Theor Biol 2016; 396:144-53. [PMID: 26925814 DOI: 10.1016/j.jtbi.2016.02.026] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2015] [Revised: 01/29/2016] [Accepted: 02/20/2016] [Indexed: 02/04/2023]
|
31
|
Zhu G, Wu A, Xu XJ, Xiao PP, Lu L, Liu J, Cao Y, Chen L, Wu J, Zhao XM. PPIM: A Protein-Protein Interaction Database for Maize. PLANT PHYSIOLOGY 2016; 170:618-26. [PMID: 26620522 PMCID: PMC4734591 DOI: 10.1104/pp.15.01821] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 11/25/2015] [Indexed: 05/18/2023]
Abstract
Maize (Zea mays) is one of the most important crops worldwide. To understand the biological processes underlying various traits of the crop (e.g. yield and response to stress), a detailed protein-protein interaction (PPI) network is highly demanded. Unfortunately, there are very few such PPIs available in the literature. Therefore, in this work, we present the Protein-Protein Interaction Database for Maize (PPIM), which covers 2,762,560 interactions among 14,000 proteins. The PPIM contains not only accurately predicted PPIs but also those molecular interactions collected from the literature. The database is freely available at http://comp-sysbio.org/ppim with a user-friendly powerful interface. We believe that the PPIM resource can help biologists better understand the maize crop.
Collapse
Affiliation(s)
- Guanghui Zhu
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Aibo Wu
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Xin-Jian Xu
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Pei-Pei Xiao
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Le Lu
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Jingdong Liu
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Yongwei Cao
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Luonan Chen
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Jun Wu
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| | - Xing-Ming Zhao
- Department of Computer Science and Technology, Tongji University, Shanghai 201804, China (G.Z., P.-P.X., J.W., X.-M.Z.);Key Laboratory of Food Safety Research, Institute for Nutritional Sciences (A.W.), and Key Laboratory of Systems Biology (L.C.), Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China;Department of Mathematics, Shanghai University, Shanghai 200444, China (X.-J.X.); andMonsanto Company, St. Louis, Missouri 63167 (L.L., J.L., Y.C.)
| |
Collapse
|
32
|
Türei D, Földvári-Nagy L, Fazekas D, Módos D, Kubisch J, Kadlecsik T, Demeter A, Lenti K, Csermely P, Vellai T, Korcsmáros T. Autophagy Regulatory Network - a systems-level bioinformatics resource for studying the mechanism and regulation of autophagy. Autophagy 2015; 11:155-65. [PMID: 25635527 PMCID: PMC4502651 DOI: 10.4161/15548627.2014.994346] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Autophagy is a complex cellular process having multiple roles, depending on tissue, physiological, or pathological conditions. Major post-translational regulators of autophagy are well known, however, they have not yet been collected comprehensively. The precise and context-dependent regulation of autophagy necessitates additional regulators, including transcriptional and post-transcriptional components that are listed in various datasets. Prompted by the lack of systems-level autophagy-related information, we manually collected the literature and integrated external resources to gain a high coverage autophagy database. We developed an online resource, Autophagy Regulatory Network (ARN; http://autophagy-regulation.org), to provide an integrated and systems-level database for autophagy research. ARN contains manually curated, imported, and predicted interactions of autophagy components (1,485 proteins with 4,013 interactions) in humans. We listed 413 transcription factors and 386 miRNAs that could regulate autophagy components or their protein regulators. We also connected the above-mentioned autophagy components and regulators with signaling pathways from the SignaLink 2 resource. The user-friendly website of ARN allows researchers without computational background to search, browse, and download the database. The database can be downloaded in SQL, CSV, BioPAX, SBML, PSI-MI, and in a Cytoscape CYS file formats. ARN has the potential to facilitate the experimental validation of novel autophagy components and regulators. In addition, ARN helps the investigation of transcription factors, miRNAs and signaling pathways implicated in the control of the autophagic pathway. The list of such known and predicted regulators could be important in pharmacological attempts against cancer and neurodegenerative diseases.
Collapse
Affiliation(s)
- Dénes Türei
- a Department of Genetics ; Eötvös Loránd University ; Budapest , Hungary
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Pushing the annotation of cellular activities to a higher resolution: Predicting functions at the isoform level. Methods 2015; 93:110-8. [PMID: 26238263 DOI: 10.1016/j.ymeth.2015.07.016] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 07/20/2015] [Accepted: 07/29/2015] [Indexed: 12/23/2022] Open
Abstract
In past decades, the experimental determination of protein functions was expensive and time-consuming, so numerous computational methods were developed to speed up and guide the process. However, most of these methods predict protein functions at the gene level and do not consider the fact that protein isoforms (translated from alternatively spliced transcripts), not genes, are the actual function carriers. Now, high-throughput RNA-seq technology is providing unprecedented opportunities to unravel protein functions at the isoform level. In this article, we review recent progress in the high-resolution functional annotations of protein isoforms, focusing on two methods developed by the authors. Both methods can integrate multiple RNA-seq datasets for comprehensively characterizing functions of protein isoforms.
Collapse
|
34
|
Liu W, Wu H, Chen L, Wen Y, Kong X, Gao WQ. Park7 interacts with p47(phox) to direct NADPH oxidase-dependent ROS production and protect against sepsis. Cell Res 2015; 25:691-706. [PMID: 26021615 DOI: 10.1038/cr.2015.63] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2014] [Revised: 03/24/2015] [Accepted: 04/07/2015] [Indexed: 01/05/2023] Open
Abstract
Inappropriate inflammation responses contribute to mortality during sepsis. Through Toll-like receptors (TLRs), reactive oxygen species (ROS) produced by NADPH oxidase could modulate the inflammation responses. Parkinson disease (autosomal recessive, early onset) 7 (Park7) has a cytoprotective role by eliminating ROS. However, whether Park7 could modulate inflammation responses and mortality in sepsis is unclear. Here, we show that, compared with wild-type mice, Park7(-/-) mice had significantly increased mortality and bacterial burdens in sepsis model along with markedly decreased systemic and local inflammation, and drastically impaired macrophage phagocytosis and bacterial killing abilities. Surprisingly, LPS and phorbol-12-myristate-13-acetate stimulation failed to induce ROS and proinflammatory cytokine production in Park7(-/-) macrophages and Park7-deficient RAW264.7 cells. Through its C-terminus, Park7 binds to p47(phox), a subunit of the NADPH oxidase, to promote NADPH oxidase-dependent production of ROS. Restoration of Park7 expression rescues ROS production and improves survival in LPS-induced sepsis. Together, our study shows that Park7 has a protective role against sepsis by controlling macrophage activation, NADPH oxidase activation and inflammation responses.
Collapse
Affiliation(s)
- Wenjun Liu
- State Key Laboratory for Oncogenes and Related Genes, Renji-Med X Clinical Stem Cell Research Center, Ren Ji Hospital, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Hailong Wu
- State Key Laboratory for Oncogenes and Related Genes, Renji-Med X Clinical Stem Cell Research Center, Ren Ji Hospital, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Lili Chen
- State Key Laboratory for Oncogenes and Related Genes, Renji-Med X Clinical Stem Cell Research Center, Ren Ji Hospital, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Yankai Wen
- State Key Laboratory for Oncogenes and Related Genes, Renji-Med X Clinical Stem Cell Research Center, Ren Ji Hospital, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Xiaoni Kong
- State Key Laboratory for Oncogenes and Related Genes, Renji-Med X Clinical Stem Cell Research Center, Ren Ji Hospital, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| | - Wei-Qiang Gao
- State Key Laboratory for Oncogenes and Related Genes, Renji-Med X Clinical Stem Cell Research Center, Ren Ji Hospital, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
| |
Collapse
|
35
|
Tseng YT, Li W, Chen CH, Zhang S, Chen JJW, Zhou X, Liu CC. IIIDB: a database for isoform-isoform interactions and isoform network modules. BMC Genomics 2015; 16 Suppl 2:S10. [PMID: 25707505 PMCID: PMC4331710 DOI: 10.1186/1471-2164-16-s2-s10] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND Protein-protein interactions (PPIs) are key to understanding diverse cellular processes and disease mechanisms. However, current PPI databases only provide low-resolution knowledge of PPIs, in the sense that "proteins" of currently known PPIs generally refer to "genes." It is known that alternative splicing often impacts PPI by either directly affecting protein interacting domains, or by indirectly impacting other domains, which, in turn, impacts the PPI binding. Thus, proteins translated from different isoforms of the same gene can have different interaction partners. RESULTS Due to the limitations of current experimental capacities, little data is available for PPIs at the resolution of isoforms, although such high-resolution data is crucial to map pathways and to understand protein functions. In fact, alternative splicing can often change the internal structure of a pathway by rearranging specific PPIs. To fill the gap, we systematically predicted genome-wide isoform-isoform interactions (IIIs) using RNA-seq datasets, domain-domain interaction and PPIs. Furthermore, we constructed an III database (IIIDB) that is a resource for studying PPIs at isoform resolution. To discover functional modules in the III network, we performed III network clustering, and then obtained 1025 isoform modules. To evaluate the module functionality, we performed the GO/pathway enrichment analysis for each isoform module. CONCLUSIONS The IIIDB provides predictions of human protein-protein interactions at the high resolution of transcript isoforms that can facilitate detailed understanding of protein functions and biological pathways. The web interface allows users to search for IIIs or III network modules. The IIIDB is freely available at http://syslab.nchu.edu.tw/IIIDB.
Collapse
|
36
|
Martínez V, Cano C, Blanco A. ProphNet: a generic prioritization method through propagation of information. BMC Bioinformatics 2014; 15 Suppl 1:S5. [PMID: 24564336 PMCID: PMC4015146 DOI: 10.1186/1471-2105-15-s1-s5] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Background Prioritization methods have become an useful tool for mining large amounts of data to suggest promising hypotheses in early research stages. Particularly, network-based prioritization tools use a network representation for the interactions between different biological entities to identify novel indirect relationships. However, current network-based prioritization tools are strongly tailored to specific domains of interest (e.g. gene-disease prioritization) and they do not allow to consider networks with more than two types of entities (e.g. genes and diseases). Therefore, the direct application of these methods to accomplish new prioritization tasks is limited. Results This work presents ProphNet, a generic network-based prioritization tool that allows to integrate an arbitrary number of interrelated biological entities to accomplish any prioritization task. We tested the performance of ProphNet in comparison with leading network-based prioritization methods, namely rcNet and DomainRBF, for gene-disease and domain-disease prioritization, respectively. The results obtained by ProphNet show a significant improvement in terms of sensitivity and specificity for both tasks. We also applied ProphNet to disease-gene prioritization on Alzheimer, Diabetes Mellitus Type 2 and Breast Cancer to validate the results and identify putative candidate genes involved in these diseases. Conclusions ProphNet works on top of any heterogeneous network by integrating information of different types of biological entities to rank entities of a specific type according to their degree of relationship with a query set of entities of another type. Our method works by propagating information across data networks and measuring the correlation between the propagated values for a query and a target sets of entities. ProphNet is available at: http://genome2.ugr.es/prophnet. A Matlab implementation of the algorithm is also available at the website.
Collapse
|
37
|
Galperin MY, Koonin EV. Comparative Genomics Approaches to Identifying Functionally Related Genes. ALGORITHMS FOR COMPUTATIONAL BIOLOGY 2014. [DOI: 10.1007/978-3-319-07953-0_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
38
|
Bandyopadhyay S, Mallick K. A New Path Based Hybrid Measure for Gene Ontology Similarity. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:116-127. [PMID: 26355512 DOI: 10.1109/tcbb.2013.149] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Gene Ontology (GO) consists of a controlled vocabulary of terms, annotating a gene or gene product, structured in a directed acyclic graph. In the graph, semantic relations connect the terms, that represent the knowledge of functional description and cellular component information of gene products. GO similarity gives us a numerical representation of biological relationship between a gene set, which can be used to infer various biological facts such as protein interaction, structural similarity, gene clustering, etc. Here we introduce a new shortest path based hybrid measure of ontological similarity between two terms which combines both structure of the GO graph and information content of the terms. Here the similarity between two terms t1 and t2, referred to as GOSim(PBHM)(t1,t2), has two components; one obtained from the common ancestors of t1 and t2. The other from their remaining ancestors. The proposed path based hybrid measure does not suffer from the well-known shallow annotation problem. Its superiority with respect to some other popular measures is established for protein protein interaction prediction, correlation with gene expression and functional classification of genes in a biological pathway. Finally, the proposed measure is utilized to compute the average GO similarity score among the genes that are experimentally validated targets of some microRNAs. Results demonstrate that the targets of a given miRNA have a high degree of similarity in the biological process category of GO.
Collapse
|
39
|
Garamszegi S, Franzosa EA, Xia Y. Signatures of pleiotropy, economy and convergent evolution in a domain-resolved map of human-virus protein-protein interaction networks. PLoS Pathog 2013; 9:e1003778. [PMID: 24339775 PMCID: PMC3855575 DOI: 10.1371/journal.ppat.1003778] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Accepted: 10/06/2013] [Indexed: 01/09/2023] Open
Abstract
A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology.
Collapse
Affiliation(s)
- Sara Garamszegi
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
| | | | | |
Collapse
|
40
|
Praveen P, Fröhlich H. Boosting probabilistic graphical model inference by incorporating prior knowledge from multiple sources. PLoS One 2013; 8:e67410. [PMID: 23826291 PMCID: PMC3691143 DOI: 10.1371/journal.pone.0067410] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 05/17/2013] [Indexed: 12/29/2022] Open
Abstract
Inferring regulatory networks from experimental data via probabilistic graphical models is a popular framework to gain insights into biological systems. However, the inherent noise in experimental data coupled with a limited sample size reduces the performance of network reverse engineering. Prior knowledge from existing sources of biological information can address this low signal to noise problem by biasing the network inference towards biologically plausible network structures. Although integrating various sources of information is desirable, their heterogeneous nature makes this task challenging. We propose two computational methods to incorporate various information sources into a probabilistic consensus structure prior to be used in graphical model inference. Our first model, called Latent Factor Model (LFM), assumes a high degree of correlation among external information sources and reconstructs a hidden variable as a common source in a Bayesian manner. The second model, a Noisy-OR, picks up the strongest support for an interaction among information sources in a probabilistic fashion. Our extensive computational studies on KEGG signaling pathways as well as on gene expression data from breast cancer and yeast heat shock response reveal that both approaches can significantly enhance the reconstruction accuracy of Bayesian Networks compared to other competing methods as well as to the situation without any prior. Our framework allows for using diverse information sources, like pathway databases, GO terms and protein domain data, etc. and is flexible enough to integrate new sources, if available.
Collapse
Affiliation(s)
- Paurush Praveen
- University of Bonn, Bonn-Aachen International Center for IT, Bonn, Germany.
| | | |
Collapse
|
41
|
Memišević V, Wallqvist A, Reifman J. Reconstituting protein interaction networks using parameter-dependent domain-domain interactions. BMC Bioinformatics 2013; 14:154. [PMID: 23651452 PMCID: PMC3660195 DOI: 10.1186/1471-2105-14-154] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2012] [Accepted: 04/05/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND We can describe protein-protein interactions (PPIs) as sets of distinct domain-domain interactions (DDIs) that mediate the physical interactions between proteins. Experimental data confirm that DDIs are more consistent than their corresponding PPIs, lending support to the notion that analyses of DDIs may improve our understanding of PPIs and lead to further insights into cellular function, disease, and evolution. However, currently available experimental DDI data cover only a small fraction of all existing PPIs and, in the absence of structural data, determining which particular DDI mediates any given PPI is a challenge. RESULTS We present two contributions to the field of domain interaction analysis. First, we introduce a novel computational strategy to merge domain annotation data from multiple databases. We show that when we merged yeast domain annotations from six annotation databases we increased the average number of domains per protein from 1.05 to 2.44, bringing it closer to the estimated average value of 3. Second, we introduce a novel computational method, parameter-dependent DDI selection (PADDS), which, given a set of PPIs, extracts a small set of domain pairs that can reconstruct the original set of protein interactions, while attempting to minimize false positives. Based on a set of PPIs from multiple organisms, our method extracted 27% more experimentally detected DDIs than existing computational approaches. CONCLUSIONS We have provided a method to merge domain annotation data from multiple sources, ensuring large and consistent domain annotation for any given organism. Moreover, we provided a method to extract a small set of DDIs from the underlying set of PPIs and we showed that, in contrast to existing approaches, our method was not biased towards DDIs with low or high occurrence counts. Finally, we used these two methods to highlight the influence of the underlying annotation density on the characteristics of extracted DDIs. Although increased annotations greatly expanded the possible DDIs, the lack of knowledge of the true biological false positive interactions still prevents an unambiguous assignment of domain interactions responsible for all protein network interactions.Executable files and examples are given at: http://www.bhsai.org/downloads/padds/
Collapse
Affiliation(s)
- Vesna Memišević
- Department of Defense Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Materiel Command, Fort Detrick, MD 21702, USA
| | | | | |
Collapse
|
42
|
Understanding Protein–Protein Interactions Using Local Structural Features. J Mol Biol 2013; 425:1210-24. [DOI: 10.1016/j.jmb.2013.01.014] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2012] [Revised: 01/08/2013] [Accepted: 01/14/2013] [Indexed: 11/21/2022]
|
43
|
González AJ, Liao L, Wu CH. Prediction of contact matrix for protein-protein interaction. ACTA ACUST UNITED AC 2013; 29:1018-25. [PMID: 23418186 DOI: 10.1093/bioinformatics/btt076] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
MOTIVATION Prediction of protein-protein interaction has become an important part of systems biology in reverse engineering the biological networks for better understanding the molecular biology of the cell. Although significant progress has been made in terms of prediction accuracy, most computational methods only predict whether two proteins interact but not their interacting residues-the information that can be very valuable for understanding the interaction mechanisms and designing modulation of the interaction. In this work, we developed a computational method to predict the interacting residue pairs-contact matrix for interacting protein domains, whose rows and columns correspond to the residues in the two interacting domains respectively and whose values (1 or 0) indicate whether the corresponding residues (do or do not) interact. RESULTS Our method is based on supervised learning using support vector machines. For each domain involved in a given domain-domain interaction (DDI), an interaction profile hidden Markov model (ipHMM) is first built for the domain family, and then each residue position for a member domain sequence is represented as a 20-dimension vector of Fisher scores, characterizing how similar it is as compared with the family profile at that position. Each element of the contact matrix for a sequence pair is now represented by a feature vector from concatenating the vectors of the two corresponding residues, and the task is to predict the element value (1 or 0) from the feature vector. A support vector machine is trained for a given DDI, using either a consensus contact matrix or contact matrices for individual sequence pairs, and is tested by leave-one-out cross validation. The performance averaged over a set of 115 DDIs collected from the 3 DID database shows significant improvement (sensitivity up to 85%, and specificity up to 85%), as compared with a multiple sequence alignment-based method (sensitivity 57%, and specificity 78%) previously reported in the literature. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Alvaro J González
- Department of Computer and Information Sciences, University of Delaware, Newark, DE 19716, USA
| | | | | |
Collapse
|
44
|
Yin F, Liu X, Li D, Wang Q, Zhang W, Li L. Bioinformatic analysis of chemokine (C-C motif) ligand 21 and SPARC-like protein 1 revealing their associations with drug resistance in ovarian cancer. Int J Oncol 2013; 42:1305-16. [PMID: 23404140 DOI: 10.3892/ijo.2013.1819] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 01/14/2013] [Indexed: 11/06/2022] Open
Abstract
Chemokine (C-C motif) ligand 21 (CCL21) and SPARC-like protein 1 (SPARCL1/MAST9/hevin/SC-1) are associated with various biological behavior in the development of cancers. Although the expression of CCL21 and SPARCL1 is downregulated in many solid tumors, their roles in ovarian cancer and their associations with drug resistance have rarely been studied. We performed a comprehensive bioinformatic analysis consisting of motif analysis, literature co-occurrence, gene/protein-gene/protein interaction network, protein-small molecule interaction network, and microRNAs enrichments which revealed that CCL21 and SPARCL1 directly or indirectly interact with a number of genes, proteins, small molecules and pathways associated with drug resistance in ovarian and other cancers. These results suggested that CCL21 and SPARCL1 may contribute to drug resistance in ovarian cancer. This study provided important information for further investigation of drug resistance-related functions of CCL21 and SPARCL1 in ovarian cancer.
Collapse
Affiliation(s)
- Fuqiang Yin
- Department of Gynecologic Oncology, Affiliated Tumor Hospital of Guangxi Medical University, Nanning, Guangxi 530021, P.R. China
| | | | | | | | | | | |
Collapse
|
45
|
Rezende AM, Folador EL, Resende DDM, Ruiz JC. Computational prediction of protein-protein interactions in Leishmania predicted proteomes. PLoS One 2012; 7:e51304. [PMID: 23251492 PMCID: PMC3519578 DOI: 10.1371/journal.pone.0051304] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2012] [Accepted: 10/31/2012] [Indexed: 11/18/2022] Open
Abstract
The Trypanosomatids parasites Leishmania braziliensis, Leishmania major and Leishmania infantum are important human pathogens. Despite of years of study and genome availability, effective vaccine has not been developed yet, and the chemotherapy is highly toxic. Therefore, it is clear just interdisciplinary integrated studies will have success in trying to search new targets for developing of vaccines and drugs. An essential part of this rationale is related to protein-protein interaction network (PPI) study which can provide a better understanding of complex protein interactions in biological system. Thus, we modeled PPIs for Trypanosomatids through computational methods using sequence comparison against public database of protein or domain interaction for interaction prediction (Interolog Mapping) and developed a dedicated combined system score to address the predictions robustness. The confidence evaluation of network prediction approach was addressed using gold standard positive and negative datasets and the AUC value obtained was 0.94. As result, 39,420, 43,531 and 45,235 interactions were predicted for L. braziliensis, L. major and L. infantum respectively. For each predicted network the top 20 proteins were ranked by MCC topological index. In addition, information related with immunological potential, degree of protein sequence conservation among orthologs and degree of identity compared to proteins of potential parasite hosts was integrated. This information integration provides a better understanding and usefulness of the predicted networks that can be valuable to select new potential biological targets for drug and vaccine development. Network modularity which is a key when one is interested in destabilizing the PPIs for drug or vaccine purposes along with multiple alignments of the predicted PPIs were performed revealing patterns associated with protein turnover. In addition, around 50% of hypothetical protein present in the networks received some degree of functional annotation which represents an important contribution since approximately 60% of Leishmania predicted proteomes has no predicted function.
Collapse
Affiliation(s)
- Antonio M. Rezende
- Laboratório de Parasitologia Celular e Molecular, Centro de Pesquisa René Rachou – FIOCRUZ, Belo Horizonte, Minas Gerais, Brazil
- Departamento de Bioquímica e Imunologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brazil
- * E-mail: (AMR); (JCR)
| | - Edson L. Folador
- Laboratório de Parasitologia Celular e Molecular, Centro de Pesquisa René Rachou – FIOCRUZ, Belo Horizonte, Minas Gerais, Brazil
- Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Daniela de M. Resende
- Laboratório de Parasitologia Celular e Molecular, Centro de Pesquisa René Rachou – FIOCRUZ, Belo Horizonte, Minas Gerais, Brazil
- Laboratório de Pesquisas Clínicas, Universidade Federal de Ouro Preto, Ouro Preto, Minas Gerais, Brazil
| | - Jeronimo C. Ruiz
- Laboratório de Parasitologia Celular e Molecular, Centro de Pesquisa René Rachou – FIOCRUZ, Belo Horizonte, Minas Gerais, Brazil
- * E-mail: (AMR); (JCR)
| |
Collapse
|
46
|
Lu Y, Zhang Y, Hang X, Qu W, Lubec G, Chen C, Zhang C. Genome-wide computational identification of bicistronic mRNA in humans. Amino Acids 2012; 44:597-606. [PMID: 22945903 DOI: 10.1007/s00726-012-1380-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2012] [Accepted: 07/26/2012] [Indexed: 11/30/2022]
Abstract
Mammalian bicistronic mRNA is a recently discovered mammalian gene structure. Several reported cases of mammalian bicistronic mRNA indicated that genes of this structure play roles in some important biological processes. However, a genome-wide computational identification of bicistronic mRNA in mammalian genome, such as human genome, is still lacking. Here we used a comparative genomics approach to identify the frequency of human bicistronic mRNA. We then validated the result by using a new support vector machine (SVM) model. We identified 43 human bicistronic mRNAs in 30 distinct genes. Our literature analysis shows that our method recovered 100 % (6/6) of the previously known bicistronic mRNAs which had been experimentally confirmed by other groups. Our graph theory-based analysis and GO analysis indicated that human bicistronic mRNAs are prone to produce different yet closely functionally related proteins. In addition, we also described and analyzed three different mechanisms of ORF fusion. Our method of identifying bicistronic mRNAs in human genome provides a model for the computational identification of characteristic gene structures in mammalian genomes. We anticipate that our data will facilitate further molecular characterization and functional study of human bicistronic mRNA.
Collapse
Affiliation(s)
- Yiming Lu
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Cognitive and Mental Health Research Center, Beijing 100850, China
| | | | | | | | | | | | | |
Collapse
|
47
|
Winter C, Henschel A, Tuukkanen A, Schroeder M. Protein interactions in 3D: From interface evolution to drug discovery. J Struct Biol 2012; 179:347-58. [DOI: 10.1016/j.jsb.2012.04.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Revised: 03/27/2012] [Accepted: 04/18/2012] [Indexed: 11/25/2022]
|
48
|
Arnold R, Boonen K, Sun MG, Kim PM. Computational analysis of interactomes: current and future perspectives for bioinformatics approaches to model the host-pathogen interaction space. Methods 2012; 57:508-18. [PMID: 22750305 PMCID: PMC7128575 DOI: 10.1016/j.ymeth.2012.06.011] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2012] [Revised: 06/20/2012] [Accepted: 06/21/2012] [Indexed: 11/05/2022] Open
Abstract
Bacterial and viral pathogens affect their eukaryotic host partly by interacting with proteins of the host cell. Hence, to investigate infection from a systems' perspective we need to construct complete and accurate host-pathogen protein-protein interaction networks. Because of the paucity of available data and the cost associated with experimental approaches, any construction and analysis of such a network in the near future has to rely on computational predictions. Specifically, this challenge consists of a number of sub-problems: First, prediction of possible pathogen interactors (e.g. effector proteins) is necessary for bacteria and protozoa. Second, the prospective host binding partners have to be determined and finally, the impact on the host cell analyzed. This review gives an overview of current bioinformatics approaches to obtain and understand host-pathogen interactions. As an application example of the methods covered, we predict host-pathogen interactions of Salmonella and discuss the value of these predictions as a prospective for further research.
Collapse
Affiliation(s)
- Roland Arnold
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
| | - Kurt Boonen
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
| | - Mark G.F. Sun
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
| | - Philip M. Kim
- Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada M5S 3E1
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON, Canada M5S 3E1
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada M5S 3E1
- Department of Computer Science, University of Toronto, Toronto, ON, Canada M5S 3E1
| |
Collapse
|
49
|
Zhang M, Su S, Bhatnagar RK, Hassett DJ, Lu LJ. Prediction and analysis of the protein interactome in Pseudomonas aeruginosa to enable network-based drug target selection. PLoS One 2012; 7:e41202. [PMID: 22848443 PMCID: PMC3404098 DOI: 10.1371/journal.pone.0041202] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2012] [Accepted: 06/18/2012] [Indexed: 01/23/2023] Open
Abstract
Pseudomonas aeruginosa (PA) is a ubiquitous opportunistic pathogen that is capable of causing highly problematic, chronic infections in cystic fibrosis and chronic obstructive pulmonary disease patients. With the increased prevalence of multi-drug resistant PA, the conventional “one gene, one drug, one disease” paradigm is losing effectiveness. Network pharmacology, on the other hand, may hold the promise of discovering new drug targets to treat a variety of PA infections. However, given the urgent need for novel drug target discovery, a PA protein-protein interaction (PPI) network of high accuracy and coverage, has not yet been constructed. In this study, we predicted a genome-scale PPI network of PA by integrating various genomic features of PA proteins/genes by a machine learning-based approach. A total of 54,107 interactions covering 4,181 proteins in PA were predicted. A high-confidence network combining predicted high-confidence interactions, a reference set and verified interactions that consist of 3,343 proteins and 19,416 potential interactions was further assembled and analyzed. The predicted interactome network from this study is the first large-scale PPI network in PA with significant coverage and high accuracy. Subsequent analysis, including validations based on existing small-scale PPI data and the network structure comparison with other model organisms, shows the validity of the predicted PPI network. Potential drug targets were identified and prioritized based on their essentiality and topological importance in the high-confidence network. Host-pathogen protein interactions between human and PA were further extracted and analyzed. In addition, case studies were performed on protein interactions regarding anti-sigma factor MucA, negative periplasmic alginate regulator MucB, and the transcriptional regulator RhlR. A web server to access the predicted PPI dataset is available at http://research.cchmc.org/PPIdatabase/.
Collapse
Affiliation(s)
- Minlu Zhang
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Research Foundation, Cincinnati, Ohio, United States of America
- School of Computing Sciences and Informatics, University of Cincinnati, Cincinnati, Ohio, United States of America
| | - Shengchang Su
- Department of Molecular Genetics, Biochemistry and Microbiology, College of Medicine, University of Cincinnati, Cincinnati, Ohio, United States of America
| | - Raj K. Bhatnagar
- School of Electronic and Computer Systems, University of Cincinnati, Cincinnati, Ohio, United States of America
| | - Daniel J. Hassett
- Department of Molecular Genetics, Biochemistry and Microbiology, College of Medicine, University of Cincinnati, Cincinnati, Ohio, United States of America
| | - Long J. Lu
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Research Foundation, Cincinnati, Ohio, United States of America
- School of Computing Sciences and Informatics, University of Cincinnati, Cincinnati, Ohio, United States of America
- Department of Environmental Health, University of Cincinnati, Cincinnati, Ohio, United States of America
- * E-mail:
| |
Collapse
|
50
|
Jang WH, Jung SH, Han DS. A computational model for predicting protein interactions based on multidomain collaboration. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1081-1090. [PMID: 22508910 DOI: 10.1109/tcbb.2012.55] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Recently, several domain-based computational models for predicting protein-protein interactions (PPIs) have been proposed. The conventional methods usually infer domain or domain combination (DC) interactions from already known interacting sets of proteins, and then predict PPIs using the information. However, the majority of these models often have limitations in providing detailed information on which domain pair (single domain interaction) or DC pair (multidomain interaction) will actually interact for the predicted protein interaction. Therefore, a more comprehensive and concrete computational model for the prediction of PPIs is needed. We developed a computational model to predict PPIs using the information of intraprotein domain cohesion and interprotein DC coupling interaction. A method of identifying the primary interacting DC pair was also incorporated into the model in order to infer actual participants in a predicted interaction. Our method made an apparent improvement in the PPI prediction accuracy, and the primary interacting DC pair identification was valid specifically in predicting multidomain protein interactions. In this paper, we demonstrate that 1) the intraprotein domain cohesion is meaningful in improving the accuracy of domain-based PPI prediction, 2) a prediction model incorporating the intradomain cohesion enables us to identify the primary interacting DC pair, and 3) a hybrid approach using the intra/interdomain interaction information can lead to a more accurate prediction.
Collapse
Affiliation(s)
- Woo-Hyuk Jang
- Department of Information and Communications Engineering, Korea Advanced Institute of Science and Technology, Kaist, 335 Gwahak-ro, Yuseong-gu, Daejeon 305-701, Korea.
| | | | | |
Collapse
|