1
|
OUP accepted manuscript. Brief Funct Genomics 2022; 21:243-269. [DOI: 10.1093/bfgp/elac007] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 03/17/2022] [Accepted: 03/18/2022] [Indexed: 11/14/2022] Open
|
2
|
James K, Olson PD. The tapeworm interactome: inferring confidence scored protein-protein interactions from the proteome of Hymenolepis microstoma. BMC Genomics 2020; 21:346. [PMID: 32380953 PMCID: PMC7204028 DOI: 10.1186/s12864-020-6710-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 03/30/2020] [Indexed: 12/14/2022] Open
Abstract
Background Reference genome and transcriptome assemblies of helminths have reached a level of completion whereby secondary analyses that rely on accurate gene estimation or syntenic relationships can be now conducted with a high level of confidence. Recent public release of the v.3 assembly of the mouse bile-duct tapeworm, Hymenolepis microstoma, provides chromosome-level characterisation of the genome and a stabilised set of protein coding gene models underpinned by bioinformatic and empirical data. However, interactome data have not been produced. Conserved protein-protein interactions in other organisms, termed interologs, can be used to transfer interactions between species, allowing systems-level analysis in non-model organisms. Results Here, we describe a probabilistic, integrated network of interologs for the H. microstoma proteome, based on conserved protein interactions found in eukaryote model species. Almost a third of the 10,139 gene models in the v.3 assembly could be assigned interaction data and assessment of the resulting network indicates that topologically-important proteins are related to essential cellular pathways, and that the network clusters into biologically meaningful components. Moreover, network parameters are similar to those of single-species interaction networks that we constructed in the same way for S. cerevisiae, C. elegans and H. sapiens, demonstrating that information-rich, system-level analyses can be conducted even on species separated by a large phylogenetic distance from the major model organisms from which most protein interaction evidence is based. Using the interolog network, we then focused on sub-networks of interactions assigned to discrete suites of genes of interest, including signalling components and transcription factors, germline multipotency genes, and genes differentially-expressed between larval and adult worms. Results show not only an expected bias toward highly-conserved proteins, such as components of intracellular signal transduction, but in some cases predicted interactions with transcription factors that aid in identifying their target genes. Conclusions With key helminth genomes now complete, systems-level analyses can provide an important predictive framework to guide basic and applied research on helminths and will become increasingly informative as new protein-protein interaction data accumulate.
Collapse
Affiliation(s)
- Katherine James
- Department of Applied Sciences, Northumbria University, Newcastle Upon Tyne, UK. .,Department of Life Sciences, The Natural History Museum, Cromwell Road, London, UK.
| | - Peter D Olson
- Department of Life Sciences, The Natural History Museum, Cromwell Road, London, UK
| |
Collapse
|
3
|
Cuesta-Astroz Y, Santos A, Oliveira G, Jensen LJ. Analysis of Predicted Host-Parasite Interactomes Reveals Commonalities and Specificities Related to Parasitic Lifestyle and Tissues Tropism. Front Immunol 2019; 10:212. [PMID: 30815000 PMCID: PMC6381214 DOI: 10.3389/fimmu.2019.00212] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 01/24/2019] [Indexed: 01/03/2023] Open
Abstract
The study of molecular host–parasite interactions is essential to understand parasitic infection and adaptation within the host system. As well, prevention and treatment of infectious diseases require a clear understanding of the molecular crosstalk between parasites and their hosts. Yet, large-scale experimental identification of host–parasite molecular interactions remains challenging, and the use of computational predictions becomes then necessary. Here, we propose a computational integrative approach to predict host—parasite protein—protein interaction (PPI) networks resulting from the human infection by 15 different eukaryotic parasites. We used an orthology-based approach to transfer high-confidence intraspecies interactions obtained from the STRING database to the corresponding interspecies homolog protein pairs in the host–parasite system. Our approach uses either the parasites predicted secretome and membrane proteins, or only the secretome, depending on whether they are uni- or multi-cellular, respectively, to reduce the number of false predictions. Moreover, the host proteome is filtered for proteins expressed in selected cellular localizations and tissues supporting the parasite growth. We evaluated the inferred interactions by analyzing the enriched biological processes and pathways in the predicted networks and their association with known parasitic invasion and evasion mechanisms. The resulting PPI networks were compared across parasites to identify common mechanisms that may define a global pathogenic hallmark. We also provided a study case focusing on a closer examination of the human–S. mansoni predicted interactome, detecting central proteins that have relevant roles in the human–S. mansoni network, and identifying tissue-specific interactions with key roles in the life cycle of the parasite. The predicted PPI networks can be visualized and downloaded at http://orthohpi.jensenlab.org.
Collapse
Affiliation(s)
- Yesid Cuesta-Astroz
- Instituto René Rachou, Fundação Oswaldo Cruz - FIOCRUZ, Belo Horizonte, Brazil
| | - Alberto Santos
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | | | - Lars J Jensen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
4
|
Lu L, McCurdy S, Huang S, Zhu X, Peplowska K, Tiirikainen M, Boisvert WA, Garmire LX. Time Series miRNA-mRNA integrated analysis reveals critical miRNAs and targets in macrophage polarization. Sci Rep 2016; 6:37446. [PMID: 27981970 PMCID: PMC5159803 DOI: 10.1038/srep37446] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Accepted: 10/25/2016] [Indexed: 01/13/2023] Open
Abstract
Polarization of macrophages is regulated through complex signaling networks. Correlating miRNA and mRNA expression over time after macrophage polarization has not yet been investigated. We used paired RNA-Seq and miRNA-Seq experiments to measure the mRNA and miRNA expression in bone marrow-derived macrophages over a time-series of 8 hours. Bioinformatics analysis identified 31 differentially expressed miRNAs between M1 and M2 polarized macrophages. The top 4 M1 miRNAs (miR-155-3p, miR-155-5p, miR-147-3p and miR-9-5p) and top 4 M2 miRNAs (miR-27a-5p, let-7c-1-3p, miR-23a-5p and miR-23b-5p) were validated by qPCR. Interestingly, M1 specific miRNAs could be categorized to early- and late-response groups, in which three new miRNAs miR-1931, miR-3473e and miR-5128 were validated as early-response miRNAs. M1 polarization led to the enrichment of genes involved in immune responses and signal transduction, whereas M2 polarization enriched genes involved in cell cycle and metabolic processes. C2H2 zinc-finger family members are key targets of DE miRNAs. The integrative analysis between miRNAs and mRNAs demonstrates the regulations of miRNAs on nearly four thousand differentially expressed genes and most of the biological pathways enriched in macrophage polarization. In summary, this study elucidates the expression profiles of miRNAs and their potential targetomes during macrophage polarization.
Collapse
Affiliation(s)
- Liangqun Lu
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Sara McCurdy
- Center for Cardiovascular Research John A. Burns School of Medicine, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Sijia Huang
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Xun Zhu
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Karolina Peplowska
- Genomics Shared Resource, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
| | - Maarit Tiirikainen
- Genomics Shared Resource, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
| | - William A. Boisvert
- Center for Cardiovascular Research John A. Burns School of Medicine, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| | - Lana X. Garmire
- Molecular Biosciences and Bioengineering Graduate Program, University of Hawaii at Manoa, Honolulu, HI 96822, USA
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI 96813, USA
| |
Collapse
|
5
|
Yu Y, Zhang X, Li B, Zhang Y, Liu J, Li H, Chen Y, Wang P, Kang R, Wu H, Wang Z. Entropy-based divergent and convergent modular pattern reveals additive and synergistic anticerebral ischemia mechanisms. Exp Biol Med (Maywood) 2016; 241:2063-2074. [PMID: 27480252 DOI: 10.1177/1535370216662361] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Module-based network analysis of diverse pharmacological mechanisms is critical to systematically understand combination therapies and disease outcomes. We first constructed drug-target ischemic networks in baicalin, jasminoidin, ursodeoxycholic acid, and their combinations baicalin and jasminoidin as well as jasminoidin and ursodeoxycholic acid groups and identified modules using the entropy-based clustering algorithm. The modules 11, 7, 4, 8 and 3 were identified as baicalin, jasminoidin, ursodeoxycholic acid, baicalin and jasminoidin and jasminoidin and ursodeoxycholic acid-emerged responsive modules, while 12, 8, 15, 17 and 9 were identified as disappeared responsive modules based on variation of topological similarity, respectively. No overlapping differential biological processes were enriched between baicalin and jasminoidin and jasminoidin and ursodeoxycholic acid pure emerged responsive modules, but two were enriched by their co-disappeared responsive modules including nucleotide-excision repair and epithelial structure maintenance. We found an additive effect of baicalin and jasminoidin in a divergent pattern and a synergistic effect of jasminoidin and ursodeoxycholic acid in a convergent pattern on "central hit strategy" of regulating inflammation against cerebral ischemia. The proposed module-based approach may provide us a holistic view to understand multiple pharmacological mechanisms associated with differential phenotypes from the standpoint of modular pharmacology.
Collapse
Affiliation(s)
- Yanan Yu
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Xiaoxu Zhang
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Bing Li
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Yingying Zhang
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Jun Liu
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Haixia Li
- 2 Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing 100053, China
| | - Yinying Chen
- 2 Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing 100053, China
| | - Pengqian Wang
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Ruixia Kang
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Hongli Wu
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| | - Zhong Wang
- 1 Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Dongzhimennei, Beijing 100700, China
| |
Collapse
|
6
|
Chen YY, Yu YN, Zhang YY, Li B, Liu J, Li DF, Wu P, Wang J, Wang Z, Wang YY. Quantitative Determination of Flexible Pharmacological Mechanisms Based On Topological Variation in Mice Anti-Ischemic Modular Networks. PLoS One 2016; 11:e0158379. [PMID: 27383195 PMCID: PMC4934924 DOI: 10.1371/journal.pone.0158379] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Accepted: 05/12/2016] [Indexed: 12/29/2022] Open
Abstract
Targeting modules or signalings may open a new path to understanding the complex pharmacological mechanisms of reversing disease processes. However, determining how to quantify the structural alteration of these signalings or modules in pharmacological networks poses a great challenge towards realizing rational drug use in clinical medicine. Here, we explore a novel approach for dynamic comparative and quantitative analysis of the topological structural variation of modules in molecular networks, proposing the concept of allosteric modules (AMs). Based on the ischemic brain of mice, we optimize module distribution in different compound-dependent modular networks by using the minimum entropy criterion and then calculate the variation in similarity values of AMs under various conditions using a novel method of SimiNEF. The diverse pharmacological dynamic stereo-scrolls of AMs with functional gradient alteration, which consist of five types of AMs, may robustly deconstruct modular networks under the same ischemic conditions. The concept of AMs can not only integrate the responsive mechanisms of different compounds based on topological cascading variation but also obtain valuable structural information about disease and pharmacological networks beyond pathway analysis. We thereby provide a new systemic quantitative strategy for rationally determining pharmacological mechanisms of altered modular networks based on topological variation.
Collapse
Affiliation(s)
- Yin-ying Chen
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
- Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ya-nan Yu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Ying-ying Zhang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Bing Li
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Jun Liu
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
| | - Dong-feng Li
- School of Mathematical Sciences, Peking University, Beijing, China
| | - Ping Wu
- Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
| | - Jie Wang
- Guang'anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
- * E-mail: (JW); (ZW); (YYW)
| | - Zhong Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
- * E-mail: (JW); (ZW); (YYW)
| | - Yong-yan Wang
- Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing, China
- * E-mail: (JW); (ZW); (YYW)
| |
Collapse
|
7
|
Wu M, Kwoh CK, Li X, Zheng J. Finding trans-regulatory genes and protein complexes modulating meiotic recombination hotspots of human, mouse and yeast. BMC SYSTEMS BIOLOGY 2014; 8:107. [PMID: 25208583 PMCID: PMC4236725 DOI: 10.1186/s12918-014-0107-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/10/2013] [Accepted: 07/11/2014] [Indexed: 11/18/2022]
Abstract
Background The regulatory mechanism of recombination is one of the most fundamental problems in genomics, with wide applications in genome wide association studies (GWAS), birth-defect diseases, molecular evolution, cancer research, etc. Recombination events cluster into short genomic regions called “recombination hotspots”. Recently, a zinc finger protein PRDM9 was reported to regulate recombination hotspots in human and mouse genomes. In addition, a 13-mer motif contained in the binding sites of PRDM9 is found to be enriched in human hotspots. However, this 13-mer motif only covers a fraction of hotspots, indicating that PRDM9 is not the only regulator of recombination hotspots. Therefore, the challenge of discovering other regulators of recombination hotspots becomes significant. Furthermore, recombination is a complex process. Hence, multiple proteins acting as machinery, rather than individual proteins, are more likely to carry out this process in a precise and stable manner. Therefore, the extension of the prediction of individual trans-regulators to protein complexes is also highly desired. Results In this paper, we introduce a pipeline to identify genes and protein complexes associated with recombination hotspots. First, we prioritize proteins associated with hotspots based on their preference of binding to hotspots and coldspots. Second, using the above identified genes as seeds, we apply the Random Walk with Restart algorithm (RWR) to propagate their influences to other proteins in protein-protein interaction (PPI) networks. Hence, many proteins without DNA-binding information will also be assigned a score to implicate their roles in recombination hotspots. Third, we construct sub-PPI networks induced by top genes ranked by RWR for various species (e.g., yeast, human and mouse) and detect protein complexes in those sub-PPI networks. Conclusions The GO term analysis show that our prioritizing methods and the RWR algorithm are capable of identifying novel genes associated with recombination hotspots. The trans-regulators predicted by our pipeline are enriched with epigenetic functions (e.g., histone modifications), demonstrating the epigenetic regulatory mechanisms of recombination hotspots. The identified protein complexes also provide us with candidates to further investigate the molecular machineries for recombination hotspots. Moreover, the experimental data and results are available on our web site http://www.ntu.edu.sg/home/zhengjie/data/RecombinationHotspot/NetPipe/.
Collapse
|
8
|
Rodgers-Melnick E, Culp M, DiFazio SP. Predicting whole genome protein interaction networks from primary sequence data in model and non-model organisms using ENTS. BMC Genomics 2013; 14:608. [PMID: 24015873 PMCID: PMC3848842 DOI: 10.1186/1471-2164-14-608] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2012] [Accepted: 09/04/2013] [Indexed: 01/10/2023] Open
Abstract
Background The large-scale identification of physical protein-protein interactions (PPIs) is an important step toward understanding how biological networks evolve and generate emergent phenotypes. However, experimental identification of PPIs is a laborious and error-prone process, and current methods of PPI prediction tend to be highly conservative or require large amounts of functional data that may not be available for newly-sequenced organisms. Results In this study we demonstrate a random-forest based technique, ENTS, for the computational prediction of protein-protein interactions based only on primary sequence data. Our approach is able to efficiently predict interactions on a whole-genome scale for any eukaryotic organism, using pairwise combinations of conserved domains and predicted subcellular localization of proteins as input features. We present the first predicted interactome for the forest tree Populus trichocarpa in addition to the predicted interactomes for Saccharomyces cerevisiae, Homo sapiens, Mus musculus, and Arabidopsis thaliana. Comparing our approach to other PPI predictors, we find that ENTS performs comparably to or better than a number of existing approaches, including several that utilize a variety of functional information for their predictions. We also find that the predicted interactions are biologically meaningful, as indicated by similarity in functional annotations and enrichment of co-expressed genes in public microarray datasets. Furthermore, we demonstrate some of the biological insights that can be gained from these predicted interaction networks. We show that the predicted interactions yield informative groupings of P. trichocarpa metabolic pathways, literature-supported associations among human disease states, and theory-supported insight into the evolutionary dynamics of duplicated genes in paleopolyploid plants. Conclusion We conclude that the ENTS classifier will be a valuable tool for the de novo annotation of genome sequences, providing initial clues about regulatory and metabolic network topology, and revealing relationships that are not immediately obvious from traditional homology-based annotations.
Collapse
Affiliation(s)
- Eli Rodgers-Melnick
- Department of Biology, West Virginia University, Morgantown, West Virginia, 26506, USA.
| | | | | |
Collapse
|
9
|
Gibbs DL, Baratt A, Baric RS, Kawaoka Y, Smith RD, Orwoll ES, Katze MG, McWeeney SK. Protein co-expression network analysis (ProCoNA). J Clin Bioinforma 2013; 3:11. [PMID: 23724967 PMCID: PMC3695838 DOI: 10.1186/2043-9113-3-11] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2013] [Accepted: 05/23/2013] [Indexed: 12/20/2022] Open
Abstract
Background Biological networks are important for elucidating disease etiology due to their ability to model complex high dimensional data and biological systems. Proteomics provides a critical data source for such models, but currently lacks robust de novo methods for network construction, which could bring important insights in systems biology. Results We have evaluated the construction of network models using methods derived from weighted gene co-expression network analysis (WGCNA). We show that approximately scale-free peptide networks, composed of statistically significant modules, are feasible and biologically meaningful using two mouse lung experiments and one human plasma experiment. Within each network, peptides derived from the same protein are shown to have a statistically higher topological overlap and concordance in abundance, which is potentially important for inferring protein abundance. The module representatives, called eigenpeptides, correlate significantly with biological phenotypes. Furthermore, within modules, we find significant enrichment for biological function and known interactions (gene ontology and protein-protein interactions). Conclusions Biological networks are important tools in the analysis of complex systems. In this paper we evaluate the application of weighted co-expression network analysis to quantitative proteomics data. Protein co-expression networks allow novel approaches for biological interpretation, quality control, inference of protein abundance, a framework for potentially resolving degenerate peptide-protein mappings, and a biomarker signature discovery.
Collapse
Affiliation(s)
- David L Gibbs
- Division of Bioinformatics and Computational Biology, Oregon Health & Science University, 3181 S,W, Sam Jackson Park Rd, Portland, OR 97239, USA.
| | | | | | | | | | | | | | | |
Collapse
|
10
|
Pálfy M, Farkas IJ, Vellai T, Korcsmáros T. Uniform curation protocol of metazoan signaling pathways to predict novel signaling components. Methods Mol Biol 2013; 1021:285-297. [PMID: 23715991 DOI: 10.1007/978-1-62703-450-0_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
A relatively large number of signaling databases available today have strongly contributed to our understanding of signaling pathway properties. However, pathway comparisons both within and across databases are currently severely hampered by the large variety of data sources and the different levels of detail of their information content (on proteins and interactions). In this chapter, we present a protocol for a uniform curation method of signaling pathways, which intends to overcome this insufficiency. This uniformly curated database called SignaLink ( http://signalink.org ) allows us to systematically transfer pathway annotations between different species, based on orthology, and thereby to predict novel signaling pathway components. Thus, this method enables the compilation of a comprehensive signaling map of a given species and identification of new potential drug targets in humans. We strongly believe that the strict curation protocol we have established to compile a signaling pathway database can also be applied for the compilation of other (e.g., metabolic) databases. Similarly, the detailed guide to the orthology-based prediction of novel signaling components across species may also be utilized for predicting components of other biological processes.
Collapse
Affiliation(s)
- Máté Pálfy
- Department of Genetics, Eötvös Loránd University, Budapest, Hungary
| | | | | | | |
Collapse
|
11
|
Lewis ACF, Jones NS, Porter MA, Deane CM. What evidence is there for the homology of protein-protein interactions? PLoS Comput Biol 2012; 8:e1002645. [PMID: 23028270 PMCID: PMC3447968 DOI: 10.1371/journal.pcbi.1002645] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2011] [Accepted: 06/21/2012] [Indexed: 12/17/2022] Open
Abstract
The notion that sequence homology implies functional similarity underlies much of computational biology. In the case of protein-protein interactions, an interaction can be inferred between two proteins on the basis that sequence-similar proteins have been observed to interact. The use of transferred interactions is common, but the legitimacy of such inferred interactions is not clear. Here we investigate transferred interactions and whether data incompleteness explains the lack of evidence found for them. Using definitions of homology associated with functional annotation transfer, we estimate that conservation rates of interactions are low even after taking interactome incompleteness into account. For example, at a blastp E-value threshold of 10(-70), we estimate the conservation rate to be about 11 % between S. cerevisiae and H. sapiens. Our method also produces estimates of interactome sizes (which are similar to those previously proposed). Using our estimates of interaction conservation we estimate the rate at which protein-protein interactions are lost across species. To our knowledge, this is the first such study based on large-scale data. Previous work has suggested that interactions transferred within species are more reliable than interactions transferred across species. By controlling for factors that are specific to within-species interaction prediction, we propose that the transfer of interactions within species might be less reliable than transfers between species. Protein-protein interactions appear to be very rarely conserved unless very high sequence similarity is observed. Consequently, inferred interactions should be used with care.
Collapse
Affiliation(s)
- Anna C. F. Lewis
- Department of Statistics, University of Oxford, Oxford, United Kingdom
- Systems Biology Doctoral Training Centre, University of Oxford, Oxford, United Kingdom
| | - Nick S. Jones
- Department of Mathematics, Imperial College, London, United Kingdom
- Department of Physics, University of Oxford, Oxford, United Kingdom
- CABDyN Complexity Centre, University of Oxford, Oxford, United Kingdom
- Oxford Centre for Integrative Systems Biology, University of Oxford, Oxford, United Kingdom
| | - Mason A. Porter
- CABDyN Complexity Centre, University of Oxford, Oxford, United Kingdom
- Oxford Centre for Industrial and Applied Mathematics, Mathematical Institute, University of Oxford, Oxford, United Kingdom
| | - Charlotte M. Deane
- Department of Statistics, University of Oxford, Oxford, United Kingdom
- Oxford Centre for Integrative Systems Biology, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
12
|
Korcsmáros T, Szalay MS, Rovó P, Palotai R, Fazekas D, Lenti K, Farkas IJ, Csermely P, Vellai T. Signalogs: orthology-based identification of novel signaling pathway components in three metazoans. PLoS One 2011; 6:e19240. [PMID: 21559328 PMCID: PMC3086880 DOI: 10.1371/journal.pone.0019240] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2010] [Accepted: 03/29/2011] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Uncovering novel components of signal transduction pathways and their interactions within species is a central task in current biological research. Orthology alignment and functional genomics approaches allow the effective identification of signaling proteins by cross-species data integration. Recently, functional annotation of orthologs was transferred across organisms to predict novel roles for proteins. Despite the wide use of these methods, annotation of complete signaling pathways has not yet been transferred systematically between species. PRINCIPAL FINDINGS Here we introduce the concept of 'signalog' to describe potential novel signaling function of a protein on the basis of the known signaling role(s) of its ortholog(s). To identify signalogs on genomic scale, we systematically transferred signaling pathway annotations among three animal species, the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster, and humans. Using orthology data from InParanoid and signaling pathway information from the SignaLink database, we predict 88 worm, 92 fly, and 73 human novel signaling components. Furthermore, we developed an on-line tool and an interactive orthology network viewer to allow users to predict and visualize components of orthologous pathways. We verified the novelty of the predicted signalogs by literature search and comparison to known pathway annotations. In C. elegans, 6 out of the predicted novel Notch pathway members were validated experimentally. Our approach predicts signaling roles for 19 human orthodisease proteins and 5 known drug targets, and suggests 14 novel drug target candidates. CONCLUSIONS Orthology-based pathway membership prediction between species enables the identification of novel signaling pathway components that we referred to as signalogs. Signalogs can be used to build a comprehensive signaling network in a given species. Such networks may increase the biomedical utilization of C. elegans and D. melanogaster. In humans, signalogs may identify novel drug targets and new signaling mechanisms for approved drugs.
Collapse
Affiliation(s)
- Tamás Korcsmáros
- Department of Genetics, Eötvös Loránd University, Budapest, Hungary
| | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Abstract
Bioinformatic methods to predict protein-protein interactions (PPI) via coevolutionary analysis have -positioned themselves to compete alongside established in vitro methods, despite a lack of understanding for the underlying molecular mechanisms of the coevolutionary process. Investigating the alignment of coevolutionary predictions of PPI with experimental data can focus the effective scope of prediction and lead to better accuracies. A new rate-based coevolutionary method, MMM, preferentially finds obligate interacting proteins that form complexes, conforming to results from studies based on coimmunoprecipitation coupled with mass spectrometry. Using gold-standard databases as a benchmark for accuracy, MMM surpasses methods based on abundance ratios, suggesting that correlated evolutionary rates may yet be better than coexpression at predicting interacting proteins. At the level of protein domains, -coevolution is difficult to detect, even with MMM, except when considering small-scale experimental data involving proteins with multiple domains. Overall, these findings confirm that coevolutionary -methods can be confidently used in predicting PPI, either independently or as drivers of coimmunoprecipitation experiments.
Collapse
|
14
|
Abstract
Coevolution maintains interactions between phenotypic traits through the process of reciprocal natural selection. Detecting molecular coevolution can expose functional interactions between molecules in the cell, generating insights into biological processes, pathways, and the networks of interactions important for cellular function. Prediction of interaction partners from different protein families exploits the property that interacting proteins can follow similar patterns and relative rates of evolution. Current methods for detecting coevolution based on the similarity of phylogenetic trees or evolutionary distance matrices have, however, been limited by requiring coevolution over the entire evolutionary history considered and are inaccurate in the presence of paralogous copies. We present a novel method for determining coevolving protein partners by finding the largest common submatrix in a given pair of distance matrices, with the size of the largest common submatrix measuring the strength of coevolution. This approach permits us to consider matrices of different size and scale, to find lineage-specific coevolution, and to predict multiple interaction partners. We used MatrixMatchMaker to predict protein-protein interactions in the human genome. We show that proteins that are known to interact physically are more strongly coevolving than proteins that simply belong to the same biochemical pathway. The human coevolution network is highly connected, suggesting many more protein-protein interactions than are currently known from high-throughput and other experimental evidence. These most strongly coevolving proteins suggest interactions that have been maintained over long periods of evolutionary time, and that are thus likely to be of fundamental importance to cellular function.
Collapse
Affiliation(s)
- Elisabeth R M Tillier
- Department of Medical Biophysics, University of Toronto, Ontario Cancer Institute, University Health Network, Canada.
| | | |
Collapse
|
15
|
Li X, Cai H, Xu J, Ying S, Zhang Y. A mouse protein interactome through combined literature mining with multiple sources of interaction evidence. Amino Acids 2009; 38:1237-52. [PMID: 19669079 DOI: 10.1007/s00726-009-0335-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2009] [Accepted: 07/24/2009] [Indexed: 11/25/2022]
Abstract
Protein-protein interactions (PPIs) play crucial roles in a number of biological processes. Recently, protein interaction networks (PINs) for several model organisms and humans have been generated, but few large-scale researches for mice have ever been made neither experimentally nor computationally. In the work, we undertook an effort to map a mouse PIN, in which protein interactions are hidden in enormous amount of biomedical literatures. Following a co-occurrence-based text-mining approach, a probabilistic model--naïve Bayesian was used to filter false-positive interactions by integrating heterogeneous kinds of evidence from genomic and proteomic datasets. A support vector machine algorithm was further used to choose protein pairs with physical interactions. By comparing with the currently available PPI datasets from several model organisms and humans, it showed that the derived mouse PINs have similar topological properties at the global level, but a high local divergence. The mouse protein interaction dataset is stored in the Mouse protein-protein interaction DataBase (MppDB) that is useful source of information for system-level understanding of gene function and biological processes in mammals. Access to the MppDB database is public available at http://bio.scu.edu.cn/mppi.
Collapse
Affiliation(s)
- Xiao Li
- Sichuan Key Laboratory of Molecular Biology and Biotechnology, Ministry of Education Key Laboratory for Bio-resource and Eco-environment, College of Life Sciences, Sichuan University, 610065, Chengdu, People's Republic of China.
| | | | | | | | | |
Collapse
|