1
|
Windels SFL, Malod-Dognin N, Pržulj N. Graphlet eigencentralities capture novel central roles of genes in pathways. PLoS One 2022; 17:e0261676. [PMID: 35077468 PMCID: PMC8789115 DOI: 10.1371/journal.pone.0261676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2021] [Accepted: 12/07/2021] [Indexed: 01/16/2023] Open
Abstract
MOTIVATION Graphlet adjacency extends regular node adjacency in a network by considering a pair of nodes being adjacent if they participate in a given graphlet (small, connected, induced subgraph). Graphlet adjacencies captured by different graphlets were shown to contain complementary biological functions and cancer mechanisms. To further investigate the relationships between the topological features of genes participating in molecular networks, as captured by graphlet adjacencies, and their biological functions, we build more descriptive pathway-based approaches. CONTRIBUTION We introduce a new graphlet-based definition of eigencentrality of genes in a pathway, graphlet eigencentrality, to identify pathways and cancer mechanisms described by a given graphlet adjacency. We compute the centrality of genes in a pathway either from the local perspective of the pathway or from the global perspective of the entire network. RESULTS We show that in molecular networks of human and yeast, different local graphlet adjacencies describe different pathways (i.e., all the genes that are functionally important in a pathway are also considered topologically important by their local graphlet eigencentrality). Pathways described by the same graphlet adjacency are functionally similar, suggesting that each graphlet adjacency captures different pathway topology and function relationships. Additionally, we show that different graphlet eigencentralities describe different cancer driver genes that play central roles in pathways, or in the crosstalk between them (i.e. we can predict cancer driver genes participating in a pathway by their local or global graphlet eigencentrality). This result suggests that by considering different graphlet eigencentralities, we can capture different functional roles of genes in and between pathways.
Collapse
Affiliation(s)
- Sam F. L. Windels
- Department of Computer Science, University College London, London, United Kingdom
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Noël Malod-Dognin
- Department of Computer Science, University College London, London, United Kingdom
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Nataša Pržulj
- Department of Computer Science, University College London, London, United Kingdom
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
- ICREA, Barcelona, Spain
| |
Collapse
|
2
|
Zhu J, Teolis S, Biassou N, Tabb A, Jabin PE, Lavi O. Tracking the Adaptation and Compensation Processes of Patients' Brain Arterial Network to an Evolving Glioblastoma. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:488-501. [PMID: 32750811 DOI: 10.1109/tpami.2020.3008379] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The brain's vascular network dynamically affects its development and core functions. It rapidly responds to abnormal conditions by adjusting properties of the network, aiding stabilization and regulation of brain activities. Tracking prominent arterial changes has clear clinical and surgical advantages. However, the arterial network functions as a system; thus, local changes may imply global compensatory effects that could impact the dynamic progression of a disease. We developed automated personalized system-level analysis methods of the compensatory arterial changes and mean blood flow behavior from a patient's clinical images. By applying our approach to data from a patient with aggressive brain cancer compared with healthy individuals, we found unique spatiotemporal patterns of the arterial network that could assist in predicting the evolution of glioblastoma over time. Our personalized approach provides a valuable analysis tool that could augment current clinical assessments of the progression of glioblastoma and other neurological disorders affecting the brain.
Collapse
|
3
|
Liu Y, Liang H, Zou Q, He Z. Significance-Based Essential Protein Discovery. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:633-642. [PMID: 32750873 DOI: 10.1109/tcbb.2020.3004364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The identification of essential proteins is an important problem in bioinformatics. During the past decades, many centrality measures and algorithms have been proposed to address this issue. However, existing methods still deserve the following drawbacks: (1) the lack of a context-free and readily interpretable quantification of their centrality values; (2) the difficulty of specifying a proper threshold for their centrality values; (3) the incapability of controlling the quality of reported essential proteins in a statistically sound manner. To overcome the limitations of existing solutions, we tackle the essential protein discovery problem from a significance testing perspective. More precisely, the essential protein discovery problem is formulated as a multiple hypothesis testing problem, where the null hypothesis is that each protein is not an essential protein. To quantify the statistical significance of each protein, we present a p-value calculation method in which both the degree and the local clustering coefficient are used as the test statistic and the Erdös-Rényi model is employed as the random graph model. After calculating the p-value for each protein, the false discovery rate is used as the error rate in the multiple testing correction procedure. Our significance-based essential protein discovery method is named as SigEP, which is tested on both simulated networks and real PPI networks. The experimental results show that our method is able to achieve better performance than those competing algorithms.
Collapse
|
4
|
Chen X, Luo L, Shen C, Ding P, Luo J. An In Silico Method for Predicting Drug Synergy Based on Multitask Learning. Interdiscip Sci 2021; 13:299-311. [PMID: 33611781 DOI: 10.1007/s12539-021-00422-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 01/29/2021] [Accepted: 02/07/2021] [Indexed: 12/20/2022]
Abstract
To make better use of all kinds of knowledge to predict drug synergy, it is crucial to successfully establish a drug synergy prediction model and leverage the reconstruction of sparse known drug targets. Therefore, we present an in silico method that predicts the synergy scores of drug pairs based on multitask learning (DSML) that could fuse drug targets, protein-protein interactions, anatomical therapeutic chemical codes, a priori knowledge of drug combinations. To simultaneously reconstruct drug-target protein interactions and synergistic drug combinations, DSML benefits indirectly from the associations with relation through proteins. In cross-validation experiments, DSML improved the ability to predict drug synergy. Moreover, the reconstruction of drug-target interactions and the incorporation of multisource knowledge significantly improved drug combination predictions by a large margin. The potential drug combinations predicted by DSML demonstrate its ability to predict drug synergy.
Collapse
Affiliation(s)
- Xin Chen
- School of Computer Science, University of South China, Hengyang, 421001, Hunan, China
| | - Lingyun Luo
- School of Computer Science, University of South China, Hengyang, 421001, Hunan, China.,Hunan Medical Big Data International Sci.&Tech. Innovation Cooperation Base, Hengyang, 421000, Hunan, China
| | - Cong Shen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Pingjian Ding
- School of Computer Science, University of South China, Hengyang, 421001, Hunan, China. .,Hunan Medical Big Data International Sci.&Tech. Innovation Cooperation Base, Hengyang, 421000, Hunan, China.
| | - Jiawei Luo
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| |
Collapse
|
5
|
Zaidi SS, Naqvi RZ, Asif M, Strickler S, Shakir S, Shafiq M, Khan AM, Amin I, Mishra B, Mukhtar MS, Scheffler BE, Scheffler JA, Mueller LA, Mansoor S. Molecular insight into cotton leaf curl geminivirus disease resistance in cultivated cotton (Gossypium hirsutum). PLANT BIOTECHNOLOGY JOURNAL 2020; 18:691-706. [PMID: 31448544 PMCID: PMC7004920 DOI: 10.1111/pbi.13236] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 08/01/2019] [Accepted: 08/15/2019] [Indexed: 05/21/2023]
Abstract
Cultivated cotton (Gossypium hirsutum) is the most important fibre crop in the world. Cotton leaf curl disease (CLCuD) is the major limiting factor and a threat to textile industry in India and Pakistan. All the local cotton cultivars exhibit moderate to no resistance against CLCuD. In this study, we evaluated an exotic cotton accession Mac7 as a resistance source to CLCuD by challenging it with viruliferous whiteflies and performing qPCR to evaluate the presence/absence and relative titre of CLCuD-associated geminiviruses/betasatellites. The results indicated that replication of pathogenicity determinant betasatellite is significantly attenuated in Mac7 and probably responsible for resistance phenotype. Afterwards, to decipher the genetic basis of CLCuD resistance in Mac7, we performed RNA sequencing on CLCuD-infested Mac7 and validated RNA-Seq data with qPCR on 24 independent genes. We performed co-expression network and pathway analysis for regulation of geminivirus/betasatellite-interacting genes. We identified nine novel modules with 52 hubs of highly connected genes in network topology within the co-expression network. Analysis of these hubs indicated the differential regulation of auxin stimulus and cellular localization pathways in response to CLCuD. We also analysed the differential regulation of geminivirus/betasatellite-interacting genes in Mac7. We further performed the functional validation of selected candidate genes via virus-induced gene silencing (VIGS). Finally, we evaluated the genomic context of resistance responsive genes and found that these genes are not specific to A or D sub-genomes of G. hirsutum. These results have important implications in understanding CLCuD resistance mechanism and developing a durable resistance in cultivated cotton.
Collapse
Affiliation(s)
- Syed Shan‐e‐Ali Zaidi
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
- Boyce Thompson InstituteIthacaNYUSA
- Plant Genetics LabTERRA Teaching and Research CenterGembloux Agro-Bio TechUniversity of LiègeGemblouxBelgium
| | - Rubab Zahra Naqvi
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
- Boyce Thompson InstituteIthacaNYUSA
| | - Muhammad Asif
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
| | | | - Sara Shakir
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
- Boyce Thompson InstituteIthacaNYUSA
- Plant Genetics LabTERRA Teaching and Research CenterGembloux Agro-Bio TechUniversity of LiègeGemblouxBelgium
| | - Muhammad Shafiq
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
- Present address:
Department of BiotechnologyUniversity of OkaraOkaraPakistan
| | - Abdul Manan Khan
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
| | - Imran Amin
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
| | - Bharat Mishra
- Department of BiologyUniversity of Alabama at BirminghamBirminghamALUSA
| | - M. Shahid Mukhtar
- Department of BiologyUniversity of Alabama at BirminghamBirminghamALUSA
| | - Brian E. Scheffler
- Genomics and Bioinformatics Research UnitUnited States Department of Agriculture‐Agricultural Research Service (USDA‐ARS)StonevilleMSUSA
| | - Jodi A. Scheffler
- Crop Genetics Research UnitUnited States Department of Agriculture‐Agricultural Research Service (USDA‐ARS)StonevilleMSUSA
| | | | - Shahid Mansoor
- National Institute for Biotechnology and Genetic EngineeringFaisalabadPakistan
| |
Collapse
|
6
|
Liang Y, Kelemen A. Dynamic modeling and network approaches for omics time course data: overview of computational approaches and applications. Brief Bioinform 2019; 19:1051-1068. [PMID: 28430854 DOI: 10.1093/bib/bbx036] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Indexed: 12/23/2022] Open
Abstract
Inferring networks and dynamics of genes, proteins, cells and other biological entities from high-throughput biological omics data is a central and challenging issue in computational and systems biology. This is essential for understanding the complexity of human health, disease susceptibility and pathogenesis for Predictive, Preventive, Personalized and Participatory (P4) system and precision medicine. The delineation of the possible interactions of all genes/proteins in a genome/proteome is a task for which conventional experimental techniques are ill suited. Urgently needed are rapid and inexpensive computational and statistical methods that can identify interacting candidate disease genes or drug targets out of thousands that can be further investigated or validated by experimentations. Moreover, identifying biological dynamic systems, and simultaneously estimating the important kinetic structural and functional parameters, which may not be experimentally accessible could be important directions for drug-disease-gene network studies. In this article, we present an overview and comparison of recent developments of dynamic modeling and network approaches for time-course omics data, and their applications to various biological systems, health conditions and disease statuses. Moreover, various data reduction and analytical schemes ranging from mathematical to computational to statistical methods are compared including their merits, drawbacks and limitations. The most recent software, associated web resources and other potentials for the compared methods are also presented and discussed in detail.
Collapse
Affiliation(s)
- Yulan Liang
- Department of Family and Community Health, University of Maryland, Baltimore, MD, USA
| | - Arpad Kelemen
- Department of Family and Community Health, University of Maryland, Baltimore, MD, USA
| |
Collapse
|
7
|
Shafi A, Nguyen T, Peyvandipour A, Nguyen H, Draghici S. A Multi-Cohort and Multi-Omics Meta-Analysis Framework to Identify Network-Based Gene Signatures. Front Genet 2019; 10:159. [PMID: 30941158 PMCID: PMC6434849 DOI: 10.3389/fgene.2019.00159] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Accepted: 02/14/2019] [Indexed: 12/20/2022] Open
Abstract
Although massive amounts of condition-specific molecular profiles are being accumulated in public repositories every day, meaningful interpretation of these data remains a major challenge. In an effort to identify the biomarkers that describe the key biological phenomena for a given condition, several approaches have been developed over the past few years. However, the majority of these approaches either (i) do not consider the known intermolecular interactions, or (ii) do not integrate molecular data of multiple types (e.g., genomics, transcriptomics, proteomics, epigenomics, etc.), and thus potentially fail to capture the true biological changes responsible for complex diseases (e.g., cancer). In addition, these approaches often ignore the heterogeneity and study bias present in independent molecular cohorts. In this manuscript, we propose a novel multi-cohort and multi-omics meta-analysis framework that overcomes all three limitations mentioned above in order to identify robust molecular subnetworks that capture the key dynamic nature of a given biological condition. Our framework integrates multiple independent gene expression studies, unmatched DNA methylation studies, and protein-protein interactions to identify methylation-driven subnetworks. We demonstrate the proposed framework by constructing subnetworks related to two complex diseases: glioblastoma and low-grade gliomas. We validate the identified subnetworks by showing their ability to predict patients' clinical outcome on multiple independent validation cohorts.
Collapse
Affiliation(s)
- Adib Shafi
- Department of Computer Science, Wayne State University, Detroit, MI, United States
| | - Tin Nguyen
- Department of Computer Science and Engineering, University of Nevada, Reno, NV, United States
| | - Azam Peyvandipour
- Department of Computer Science, Wayne State University, Detroit, MI, United States
| | - Hung Nguyen
- Department of Computer Science and Engineering, University of Nevada, Reno, NV, United States
| | - Sorin Draghici
- Department of Computer Science, Wayne State University, Detroit, MI, United States.,Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI, United States
| |
Collapse
|
8
|
Li X, Li W, Zeng M, Zheng R, Li M. Network-based methods for predicting essential genes or proteins: a survey. Brief Bioinform 2019; 21:566-583. [DOI: 10.1093/bib/bbz017] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 01/21/2019] [Accepted: 01/22/2019] [Indexed: 12/14/2022] Open
Abstract
Abstract
Genes that are thought to be critical for the survival of organisms or cells are called essential genes. The prediction of essential genes and their products (essential proteins) is of great value in exploring the mechanism of complex diseases, the study of the minimal required genome for living cells and the development of new drug targets. As laboratory methods are often complicated, costly and time-consuming, a great many of computational methods have been proposed to identify essential genes/proteins from the perspective of the network level with the in-depth understanding of network biology and the rapid development of biotechnologies. Through analyzing the topological characteristics of essential genes/proteins in protein–protein interaction networks (PINs), integrating biological information and considering the dynamic features of PINs, network-based methods have been proved to be effective in the identification of essential genes/proteins. In this paper, we survey the advanced methods for network-based prediction of essential genes/proteins and present the challenges and directions for future research.
Collapse
Affiliation(s)
- Xingyi Li
- School of Computer Science and Engineering, Central South University, Changsha, Hunan, China
| | - Wenkai Li
- School of Computer Science and Engineering, Central South University, Changsha, Hunan, China
| | - Min Zeng
- School of Computer Science and Engineering, Central South University, Changsha, Hunan, China
| | - Ruiqing Zheng
- School of Computer Science and Engineering, Central South University, Changsha, Hunan, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha, Hunan, China
| |
Collapse
|
9
|
Naqvi RZ, Zaidi SSEA, Mukhtar MS, Amin I, Mishra B, Strickler S, Mueller LA, Asif M, Mansoor S. Transcriptomic analysis of cultivated cotton Gossypium hirsutum provides insights into host responses upon whitefly-mediated transmission of cotton leaf curl disease. PLoS One 2019; 14:e0210011. [PMID: 30730891 PMCID: PMC6366760 DOI: 10.1371/journal.pone.0210011] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2018] [Accepted: 12/14/2018] [Indexed: 11/18/2022] Open
Abstract
Cotton is a commercial and economically important crop that generates billions of dollars in annual revenue worldwide. However, cotton yield is affected by a sap-sucking insect Bemisia tabaci (whitefly), and whitefly-borne cotton leaf curl disease (CLCuD). The causative agent of devastating CLCuD is led by the viruses belonging to the genus Begomovirus (family Geminiviridae), collectively called cotton leaf curl viruses. Unfortunately, the extensively cultivated cotton (Gossypium hirsutum) species are highly susceptible and vulnerable to CLCuD. Yet, the concomitant influence of whitefly and CLCuD on the susceptible G. hirsutum transcriptome has not been interpreted. In the present study we have employed an RNA Sequencing (RNA-Seq) transcriptomics approach to explore the differential gene expression in susceptible G. hirsutum variety upon infection with viruliferous whiteflies. Comparative RNA-Seq of control and CLCuD infected plants was done using Illumina HiSeq 2500. This study yielded 468 differentially expressed genes (DEGs). Among them, we identified 220 up and 248 downregulated DEGs involved in disease responses and pathogen defense. We selected ten genes for downstream RT-qPCR analyses on two cultivars, Karishma and MNH 786 that are susceptible to CLCuD. We observed a similar expression pattern of these genes in both susceptible cultivars that was also consistent with our transcriptome data further implying a wider application of our global transcription study on host susceptibility to CLCuD. We next performed weighted gene co-expression network analysis that revealed six modules. This analysis also identified highly co-expressed genes as well as 55 hub genes that co-express with ≥ 50 genes. Intriguingly, most of these hub genes are shown to be downregulated and enriched in cellular processes. Under-expression of such highly co-expressed genes suggests their roles in favoring the virus and enhancing plant susceptibility to CLCuD. We also discuss the potential mechanisms governing the establishment of disease susceptibility. Overall, our study provides a comprehensive differential gene expression analysis of G. hirsutum under whitefly-mediated CLCuD infection. This vital study will advance the understanding of simultaneous effect of whitefly and virus on their host and aid in identifying important G. hirsutum genes which intricate in its susceptibility to CLCuD.
Collapse
Affiliation(s)
- Rubab Zahra Naqvi
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Punjab, Pakistan
- Pakistan Institute of Engineering & Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- Boyce Thompson Institute, Cornell University, Ithaca, NY, United States of America
| | - Syed Shan-e-Ali Zaidi
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Punjab, Pakistan
- Pakistan Institute of Engineering & Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- Boyce Thompson Institute, Cornell University, Ithaca, NY, United States of America
| | - M. Shahid Mukhtar
- Department of Biology, University of Alabama at Birmingham, Birmingham, AL, United States of America
| | - Imran Amin
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Punjab, Pakistan
| | - Bharat Mishra
- Department of Biology, University of Alabama at Birmingham, Birmingham, AL, United States of America
| | - Susan Strickler
- Boyce Thompson Institute, Cornell University, Ithaca, NY, United States of America
| | - Lukas A. Mueller
- Boyce Thompson Institute, Cornell University, Ithaca, NY, United States of America
| | - Muhammad Asif
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Punjab, Pakistan
| | - Shahid Mansoor
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Faisalabad, Punjab, Pakistan
- * E-mail:
| |
Collapse
|
10
|
Ashraf MI, Ong SK, Mujawar S, Pawar S, More P, Paul S, Lahiri C. A side-effect free method for identifying cancer drug targets. Sci Rep 2018; 8:6669. [PMID: 29703908 PMCID: PMC5923273 DOI: 10.1038/s41598-018-25042-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Accepted: 04/13/2018] [Indexed: 12/20/2022] Open
Abstract
Identifying effective drug targets, with little or no side effects, remains an ever challenging task. A potential pitfall of failing to uncover the correct drug targets, due to side effect of pleiotropic genes, might lead the potential drugs to be illicit and withdrawn. Simplifying disease complexity, for the investigation of the mechanistic aspects and identification of effective drug targets, have been done through several approaches of protein interactome analysis. Of these, centrality measures have always gained importance in identifying candidate drug targets. Here, we put forward an integrated method of analysing a complex network of cancer and depict the importance of k-core, functional connectivity and centrality (KFC) for identifying effective drug targets. Essentially, we have extracted the proteins involved in the pathways leading to cancer from the pathway databases which enlist real experimental datasets. The interactions between these proteins were mapped to build an interactome. Integrative analyses of the interactome enabled us to unearth plausible reasons for drugs being rendered withdrawn, thereby giving future scope to pharmaceutical industries to potentially avoid them (e.g. ESR1, HDAC2, F2, PLG, PPARA, RXRA, etc). Based upon our KFC criteria, we have shortlisted ten proteins (GRB2, FYN, PIK3R1, CBL, JAK2, LCK, LYN, SYK, JAK1 and SOCS3) as effective candidates for drug development.
Collapse
Affiliation(s)
- Md Izhar Ashraf
- The Institute of Mathematical Sciences, Chennai, 600113, India.,B.S. Abdur Rahman Crescent Institute of Science & Technology, Vandalur, Chennai, 600048, India
| | - Seng-Kai Ong
- Department of Biological Sciences, Sunway University, 47500, Petaling Jaya, Malaysia
| | - Shama Mujawar
- Department of Biological Sciences, Sunway University, 47500, Petaling Jaya, Malaysia
| | - Shrikant Pawar
- Department of Computer Science & Department of Biology, Georgia State University, Atlanta, GA, 30303, USA
| | - Pallavi More
- Department of Bioinformatics, University of Pune, Pune, Maharashtra, 411007, India
| | - Somnath Paul
- Department of Computer Science and Engineering, Birla Institute of Technology, Mesra, India
| | - Chandrajit Lahiri
- The Institute of Mathematical Sciences, Chennai, 600113, India. .,Department of Biological Sciences, Sunway University, 47500, Petaling Jaya, Malaysia.
| |
Collapse
|
11
|
Dopazo J, Erten C. Graph-theoretical comparison of normal and tumor networks in identifying BRCA genes. BMC SYSTEMS BIOLOGY 2017; 11:110. [PMID: 29166896 PMCID: PMC5700672 DOI: 10.1186/s12918-017-0495-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 11/13/2017] [Indexed: 12/18/2022]
Abstract
BACKGROUND Identification of driver genes related to certain types of cancer is an important research topic. Several systems biology approaches have been suggested, in particular for the identification of breast cancer (BRCA) related genes. Such approaches usually rely on differential gene expression and/or mutational landscape data. In some cases interaction network data is also integrated to identify cancer-related modules computationally. RESULTS We provide a framework for the comparative graph-theoretical analysis of networks integrating the relevant gene expression, mutations, and potein-protein interaction network data. The comparisons involve a graph-theoretical analysis of normal and tumor network pairs across all instances of a given set of breast cancer samples. The network measures under consideration are based on appropriate formulations of various centrality measures: betweenness, clustering coefficients, degree centrality, random walk distances, graph-theoretical distances, and Jaccard index centrality. CONCLUSIONS Among all the studied centrality-based graph-theoretical properties, we show that a betweenness-based measure differentiates BRCA genes across all normal versus tumor network pairs, than the rest of the popular centrality-based measures. The AUROC and AUPR values of the gene lists ordered with respect to the measures under study as compared to NCBI BioSystems pathway and the COSMIC database of cancer genes are the largest with the betweenness-based differentiation, followed by the measure based on degree centrality. In order to test the robustness of the suggested measures in prioritizing cancer genes, we further tested the two most promising measures, those based on betweenness and degree centralities, on randomly rewired networks. We show that both measures are quite resilient to noise in the input interaction network. We also compared the same measures against a state-of-the-art alternative disease gene prioritization method, MUFFFINN. We show that both our graph-theoretical measures outperform MUFFINN prioritizations in terms of ROC and precions/recall analysis. Finally, we filter the ordered list of the best measure, the betweenness-based differentiation, via a maximum-weight independent set formulation and investigate the top 50 genes in regards to literature verification. We show that almost all genes in the list are verified by the breast cancer literature and three genes are presented as novel genes that may potentialy be BRCA-related but missing in literature.
Collapse
Affiliation(s)
- Joaquin Dopazo
- Clinical Bioinformatics Research Area, Fundación Progreso y Salud, Hospital Virgen del Rocío, Sevilla, Spain
| | - Cesim Erten
- Computer Engineering, Antalya Bilim University, Antalya, Turkey.
| |
Collapse
|
12
|
Naqvi RZ, Zaidi SSEA, Akhtar KP, Strickler S, Woldemariam M, Mishra B, Mukhtar MS, Scheffler BE, Scheffler JA, Jander G, Mueller LA, Asif M, Mansoor S. Transcriptomics reveals multiple resistance mechanisms against cotton leaf curl disease in a naturally immune cotton species, Gossypium arboreum. Sci Rep 2017; 7:15880. [PMID: 29162860 PMCID: PMC5698292 DOI: 10.1038/s41598-017-15963-9] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Accepted: 11/03/2017] [Indexed: 12/13/2022] Open
Abstract
Cotton leaf curl disease (CLCuD), caused by cotton leaf curl viruses (CLCuVs), is among the most devastating diseases in cotton. While the widely cultivated cotton species Gossypium hirsutum is generally susceptible, the diploid species G. arboreum is a natural source for resistance against CLCuD. However, the influence of CLCuD on the G. arboreum transcriptome and the interaction of CLCuD with G. arboreum remains to be elucidated. Here we have used an RNA-Seq based study to analyze differential gene expression in G. arboreum under CLCuD infestation. G. arboreum plants were infested by graft inoculation using a CLCuD infected scion of G. hirsutum. CLCuD infested asymptomatic and symptomatic plants were analyzed with RNA-seq using an Illumina HiSeq. 2500. Data analysis revealed 1062 differentially expressed genes (DEGs) in G. arboreum. We selected 17 genes for qPCR to validate RNA-Seq data. We identified several genes involved in disease resistance and pathogen defense. Furthermore, a weighted gene co-expression network was constructed from the RNA-Seq dataset that indicated 50 hub genes, most of which are involved in transport processes and might have a role in the defense response of G. arboreum against CLCuD. This fundamental study will improve the understanding of virus-host interaction and identification of important genes involved in G. arboreum tolerance against CLCuD.
Collapse
Affiliation(s)
- Rubab Zahra Naqvi
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Jhang Road, Faisalabad, Punjab, Pakistan
- Pakistan Institute of Engineering & Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- Boyce Thompson Institute, 533 Tower Road, Cornell University, Ithaca, NY, USA
| | - Syed Shan-E-Ali Zaidi
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Jhang Road, Faisalabad, Punjab, Pakistan
- Pakistan Institute of Engineering & Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
- Boyce Thompson Institute, 533 Tower Road, Cornell University, Ithaca, NY, USA
- AgroBioChem Department, Gembloux Agro-Bio Tech, University of Liège, 5030, Gembloux, Belgium
| | - Khalid Pervaiz Akhtar
- Nuclear Institute for Agriculture & Biology (NIAB), Jhang Road, Faisalabad, Punjab, Pakistan
| | - Susan Strickler
- Boyce Thompson Institute, 533 Tower Road, Cornell University, Ithaca, NY, USA
| | - Melkamu Woldemariam
- Boyce Thompson Institute, 533 Tower Road, Cornell University, Ithaca, NY, USA
| | - Bharat Mishra
- Department of Biology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - M Shahid Mukhtar
- Department of Biology, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Brian E Scheffler
- Genomics and Bioinformatics Research Unit (USDA-ARS), Stoneville, MS, USA
| | - Jodi A Scheffler
- Crop Genetics Research Unit, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), Stoneville, MS, USA
| | - Georg Jander
- Boyce Thompson Institute, 533 Tower Road, Cornell University, Ithaca, NY, USA
| | - Lukas A Mueller
- Boyce Thompson Institute, 533 Tower Road, Cornell University, Ithaca, NY, USA
| | - Muhammad Asif
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Jhang Road, Faisalabad, Punjab, Pakistan
| | - Shahid Mansoor
- Agricultural Biotechnology Division, National Institute for Biotechnology and Genetic Engineering (NIBGE), Jhang Road, Faisalabad, Punjab, Pakistan.
| |
Collapse
|
13
|
Kuenzi BM, Remsing Rix LL, Stewart PA, Fang B, Kinose F, Bryant AT, Boyle TA, Koomen JM, Haura EB, Rix U. Polypharmacology-based ceritinib repurposing using integrated functional proteomics. Nat Chem Biol 2017; 13:1222-1231. [PMID: 28991240 DOI: 10.1038/nchembio.2489] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Accepted: 08/30/2017] [Indexed: 12/11/2022]
Abstract
Targeted drugs are effective when they directly inhibit strong disease drivers, but only a small fraction of diseases feature defined actionable drivers. Alternatively, network-based approaches can uncover new therapeutic opportunities. Applying an integrated phenotypic screening, chemical and phosphoproteomics strategy, here we describe the anaplastic lymphoma kinase (ALK) inhibitor ceritinib as having activity across several ALK-negative lung cancer cell lines and identify new targets and network-wide signaling effects. Combining pharmacological inhibitors and RNA interference revealed a polypharmacology mechanism involving the noncanonical targets IGF1R, FAK1, RSK1 and RSK2. Mutating the downstream signaling hub YB1 protected cells from ceritinib. Consistent with YB1 signaling being known to cause taxol resistance, combination of ceritinib with paclitaxel displayed strong synergy, particularly in cells expressing high FAK autophosphorylation, which we show to be prevalent in lung cancer. Together, we present a systems chemical biology platform for elucidating multikinase inhibitor polypharmacology mechanisms, subsequent design of synergistic drug combinations, and identification of mechanistic biomarker candidates.
Collapse
Affiliation(s)
- Brent M Kuenzi
- Department of Drug Discovery, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA.,Cancer Biology PhD Program, University of South Florida, Tampa, Florida, USA
| | - Lily L Remsing Rix
- Department of Drug Discovery, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Paul A Stewart
- Department of Thoracic Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Bin Fang
- Proteomics Core Facility, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Fumi Kinose
- Department of Thoracic Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Annamarie T Bryant
- Department of Drug Discovery, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Theresa A Boyle
- Department of Thoracic Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - John M Koomen
- Department of Molecular Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Eric B Haura
- Department of Thoracic Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| | - Uwe Rix
- Department of Drug Discovery, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida, USA
| |
Collapse
|
14
|
Mishra B, Sun Y, Ahmed H, Liu X, Mukhtar MS. Global temporal dynamic landscape of pathogen-mediated subversion of Arabidopsis innate immunity. Sci Rep 2017; 7:7849. [PMID: 28798368 PMCID: PMC5552879 DOI: 10.1038/s41598-017-08073-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Accepted: 06/29/2017] [Indexed: 12/22/2022] Open
Abstract
The universal nature of networks’ structural and physical properties across diverse systems offers a better prospect to elucidate the interplay between a system and its environment. In the last decade, several large-scale transcriptome and interactome studies were conducted to understand the complex and dynamic nature of interactions between Arabidopsis and its bacterial pathogen, Pseudomonas syringae pv. tomato DC3000. We took advantage of these publicly available datasets and performed “-omics”-based integrative, and network topology analyses to decipher the transcriptional and protein-protein interaction activities of effector targets. We demonstrated that effector targets exhibit shorter distance to differentially expressed genes (DEGs) and possess increased information centrality. Intriguingly, effector targets are differentially expressed in a sequential manner and make for 1% of the total DEGs at any time point of infection with virulent or defense-inducing DC3000 strains. We revealed that DC3000 significantly alters the expression levels of 71% effector targets and their downstream physical interacting proteins in Arabidopsis interactome. Our integrative “-omics”-–based analyses identified dynamic complexes associated with MTI and disease susceptibility. Finally, we discovered five novel plant defense players using a systems biology-fueled top-to-bottom approach and demonstrated immune-related functions for them, further validating the power and resolution of our network analyses.
Collapse
Affiliation(s)
- Bharat Mishra
- Department of Biology, University of Alabama at Birmingham, Birmingham, USA
| | - Yali Sun
- Department of Biology, University of Alabama at Birmingham, Birmingham, USA
| | - Hadia Ahmed
- Department of Computer & Information Sciences, University of Alabama at Birmingham, Birmingham, USA
| | - Xiaoyu Liu
- Department of Biology, University of Alabama at Birmingham, Birmingham, USA
| | - M Shahid Mukhtar
- Department of Biology, University of Alabama at Birmingham, Birmingham, USA. .,Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham, USA.
| |
Collapse
|
15
|
Liang Y, Kelemen A. Computational dynamic approaches for temporal omics data with applications to systems medicine. BioData Min 2017. [PMID: 28638442 PMCID: PMC5473988 DOI: 10.1186/s13040-017-0140-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Modeling and predicting biological dynamic systems and simultaneously estimating the kinetic structural and functional parameters are extremely important in systems and computational biology. This is key for understanding the complexity of the human health, drug response, disease susceptibility and pathogenesis for systems medicine. Temporal omics data used to measure the dynamic biological systems are essentials to discover complex biological interactions and clinical mechanism and causations. However, the delineation of the possible associations and causalities of genes, proteins, metabolites, cells and other biological entities from high throughput time course omics data is challenging for which conventional experimental techniques are not suited in the big omics era. In this paper, we present various recently developed dynamic trajectory and causal network approaches for temporal omics data, which are extremely useful for those researchers who want to start working in this challenging research area. Moreover, applications to various biological systems, health conditions and disease status, and examples that summarize the state-of-the art performances depending on different specific mining tasks are presented. We critically discuss the merits, drawbacks and limitations of the approaches, and the associated main challenges for the years ahead. The most recent computing tools and software to analyze specific problem type, associated platform resources, and other potentials for the dynamic trajectory and interaction methods are also presented and discussed in detail.
Collapse
Affiliation(s)
- Yulan Liang
- Department of Family and Community Health, University of Maryland, Baltimore, MD 21201 USA
| | - Arpad Kelemen
- Department of Organizational Systems and Adult Health, University of Maryland, Baltimore, MD 21201 USA
| |
Collapse
|
16
|
Abstract
Background Identifying the genes associated to human diseases is crucial for disease diagnosis and drug design. Computational approaches, esp. the network-based approaches, have been recently developed to identify disease-related genes effectively from the existing biomedical networks. Meanwhile, the advance in biotechnology enables researchers to produce multi-omics data, enriching our understanding on human diseases, and revealing the complex relationships between genes and diseases. However, none of the existing computational approaches is able to integrate the huge amount of omics data into a weighted integrated network and utilize it to enhance disease related gene discovery. Results We propose a new network-based disease gene prediction method called SLN-SRW (Simplified Laplacian Normalization-Supervised Random Walk) to generate and model the edge weights of a new biomedical network that integrates biomedical data from heterogeneous sources, thus far enhancing the disease related gene discovery. Conclusions The experiment results show that SLN-SRW significantly improves the performance of disease gene prediction on both the real and the synthetic data sets. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3263-4) contains supplementary material, which is available to authorized users.
Collapse
|
17
|
Módos D, Bulusu KC, Fazekas D, Kubisch J, Brooks J, Marczell I, Szabó PM, Vellai T, Csermely P, Lenti K, Bender A, Korcsmáros T. Neighbours of cancer-related proteins have key influence on pathogenesis and could increase the drug target space for anticancer therapies. NPJ Syst Biol Appl 2017; 3:2. [PMID: 28603644 PMCID: PMC5460138 DOI: 10.1038/s41540-017-0003-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Even targeted chemotherapies against solid cancers show a moderate success increasing the need to novel targeting strategies. To address this problem, we designed a systems-level approach investigating the neighbourhood of mutated or differentially expressed cancer-related proteins in four major solid cancers (colon, breast, liver and lung). Using signalling and protein–protein interaction network resources integrated with mutational and expression datasets, we analysed the properties of the direct and indirect interactors (first and second neighbours) of cancer-related proteins, not found previously related to the given cancer type. We found that first neighbours have at least as high degree, betweenness centrality and clustering coefficient as cancer-related proteins themselves, indicating a previously unknown central network position. We identified a complementary strategy for mutated and differentially expressed proteins, where the affect of differentially expressed proteins having smaller network centrality is compensated with high centrality first neighbours. These first neighbours can be considered as key, so far hidden, components in cancer rewiring, with similar importance as mutated proteins. These observations strikingly suggest targeting first neighbours as a novel strategy for disrupting cancer-specific networks. Remarkably, our survey revealed 223 marketed drugs already targeting first neighbour proteins but applied mostly outside oncology, providing a potential list for drug repurposing against solid cancers. For the very central first neighbours, whose direct targeting would cause several side effects, we suggest a cancer-mimicking strategy by targeting their interactors (second neighbours of cancer-related proteins, having a central protein affecting position, similarly to the cancer-related proteins). Hence, we propose to include first neighbours to network medicine based approaches for (but not limited to) anticancer therapies. Cancer is considered a systems disease in which the interactors of cancer-related proteins have a key role, also as targets to fight cancer. New therapeutic approaches are needed to improve success rates and to identify suitable proteins as novel, alternative drug targets. We designed a computational approach, combining mutation and differential expression data with network information, to analyse the interactions of cancer-related proteins in colon, breast, liver and lung cancer. We found that first (direct) neighbours, not linked previously to the given cancer type, are similarly important as mutated proteins known to be involved in cancer development. We found 223 drugs already in the clinic targeting these proteins but not yet used against cancer as their oncology relevance was hidden so far. Our observations open up new strategies for target selection and anti-cancer drug discovery.
Collapse
Affiliation(s)
- Dezső Módos
- Department of Morphology and Physiology, Department of Health Science, Semmelweis University, Budapest, Hungary.,Department of Genetics, Eötvös Loránd University, Budapest, Hungary.,Earlham Institute, Norwich Research Park, Norwich, UK.,Gut Health and Food Safety Programme, Institute of Food Research, Norwich Research Park, Norwich, UK
| | - Krishna C Bulusu
- Centre for Molecular Informatics, University of Cambridge, Cambridge, UK
| | - Dávid Fazekas
- Department of Genetics, Eötvös Loránd University, Budapest, Hungary.,Gut Health and Food Safety Programme, Institute of Food Research, Norwich Research Park, Norwich, UK
| | - János Kubisch
- Department of Genetics, Eötvös Loránd University, Budapest, Hungary
| | - Johanne Brooks
- Gut Health and Food Safety Programme, Institute of Food Research, Norwich Research Park, Norwich, UK.,Department of Medicine and Health, University of East Anglia, Norwich, UK.,Department of Gastroenterology, Norfolk and Norwich University Hospitals, Norwich, UK
| | - István Marczell
- 2nd Department of Internal Medicine, Semmelweis University, Budapest, Hungary
| | - Péter M Szabó
- 2nd Department of Internal Medicine, Semmelweis University, Budapest, Hungary.,Biometric Research Branch, Division of Cancer Treatment and Diagnosis, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Tibor Vellai
- Department of Genetics, Eötvös Loránd University, Budapest, Hungary
| | - Péter Csermely
- Department of Medical Chemistry, Semmelweis University, Budapest, Hungary
| | - Katalin Lenti
- Department of Morphology and Physiology, Department of Health Science, Semmelweis University, Budapest, Hungary
| | - Andreas Bender
- Centre for Molecular Informatics, University of Cambridge, Cambridge, UK
| | - Tamás Korcsmáros
- Department of Genetics, Eötvös Loránd University, Budapest, Hungary.,Earlham Institute, Norwich Research Park, Norwich, UK.,Gut Health and Food Safety Programme, Institute of Food Research, Norwich Research Park, Norwich, UK
| |
Collapse
|
18
|
Karabekmez ME, Kirdar B. A novel topological centrality measure capturing biologically important proteins. MOLECULAR BIOSYSTEMS 2016; 12:666-73. [PMID: 26699451 DOI: 10.1039/c5mb00732a] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Topological centrality in protein interaction networks and its biological implications have widely been investigated in the past. In the present study, a novel metric of centrality-weighted sum of loads eigenvector centrality (WSL-EC)-based on graph spectra is defined and its performance in identifying topologically and biologically important nodes is comparatively investigated with common metrics of centrality in a human protein-protein interaction network. The metric can capture nodes from peripherals of the network differently from conventional eigenvector centrality. Different metrics were found to selectively identify hub sets that are significantly associated with different biological processes. The widely accepted metrics degree centrality, betweenness centrality, subgraph centrality and eigenvector centrality are subject to a bias towards super-hubs, whereas WSL-EC is not affected by the presence of super-hubs. WSL-EC outperforms other metrics of centrality in detecting biologically central nodes such as pathogen-interacting, cancer, ageing, HIV-1 or disease-related proteins and proteins involved in immune system processes and autoimmune diseases in the human interactome.
Collapse
Affiliation(s)
| | - Betul Kirdar
- Bogazici University, Department of Chemical Engineering, Istanbul, Turkey.
| |
Collapse
|
19
|
Sun J, Zhu K, Zheng W, Xu H. A comparative study of disease genes and drug targets in the human protein interactome. BMC Bioinformatics 2015; 16 Suppl 5:S1. [PMID: 25861037 PMCID: PMC4402590 DOI: 10.1186/1471-2105-16-s5-s1] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Disease genes cause or contribute genetically to the development of the most complex diseases. Drugs are the major approaches to treat the complex disease through interacting with their targets. Thus, drug targets are critical for treatment efficacy. However, the interrelationship between the disease genes and drug targets is not clear. RESULTS In this study, we comprehensively compared the network properties of disease genes and drug targets for five major disease categories (cancer, cardiovascular disease, immune system disease, metabolic disease, and nervous system disease). We first collected disease genes from genome-wide association studies (GWAS) for five disease categories and collected their corresponding drugs based on drugs' Anatomical Therapeutic Chemical (ATC) classification. Then, we obtained the drug targets for these five different disease categories. We found that, though the intersections between disease genes and drug targets were small, disease genes were significantly enriched in targets compared to their enrichment in human protein-coding genes. We further compared network properties of the proteins encoded by disease genes and drug targets in human protein-protein interaction networks (interactome). The results showed that the drug targets tended to have higher degree, higher betweenness, and lower clustering coefficient in cancer Furthermore, we observed a clear fraction increase of disease proteins or drug targets in the near neighborhood compared with the randomized genes. CONCLUSIONS The study presents the first comprehensive comparison of the disease genes and drug targets in the context of interactome. The results provide some foundational network characteristics for further designing computational strategies to predict novel drug targets and drug repurposing.
Collapse
|
20
|
Li M, Lu Y, Wang J, Wu FX, Pan Y. A Topology Potential-Based Method for Identifying Essential Proteins from PPI Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:372-383. [PMID: 26357224 DOI: 10.1109/tcbb.2014.2361350] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Essential proteins are indispensable for cellular life. It is of great significance to identify essential proteins that can help us understand the minimal requirements for cellular life and is also very important for drug design. However, identification of essential proteins based on experimental approaches are typically time-consuming and expensive. With the development of high-throughput technology in the post-genomic era, more and more protein-protein interaction data can be obtained, which make it possible to study essential proteins from the network level. There have been a series of computational approaches proposed for predicting essential proteins based on network topologies. Most of these topology based essential protein discovery methods were to use network centralities. In this paper, we investigate the essential proteins' topological characters from a completely new perspective. To our knowledge it is the first time that topology potential is used to identify essential proteins from a protein-protein interaction (PPI) network. The basic idea is that each protein in the network can be viewed as a material particle which creates a potential field around itself and the interaction of all proteins forms a topological field over the network. By defining and computing the value of each protein's topology potential, we can obtain a more precise ranking which reflects the importance of proteins from the PPI network. The experimental results show that topology potential-based methods TP and TP-NC outperform traditional topology measures: degree centrality (DC), betweenness centrality (BC), closeness centrality (CC), subgraph centrality (SC), eigenvector centrality (EC), information centrality (IC), and network centrality (NC) for predicting essential proteins. In addition, these centrality measures are improved on their performance for identifying essential proteins in biological network when controlled by topology potential.
Collapse
|
21
|
Ganegoda GU, Li M, Wang W, Feng Q. Heterogeneous Network Model to Infer Human Disease-Long Intergenic Non-Coding RNA Associations. IEEE Trans Nanobioscience 2015; 14:175-83. [DOI: 10.1109/tnb.2015.2391133] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
22
|
Bhattacharyya M, Chakrabarti S. Identification of important interacting proteins (IIPs) in Plasmodium falciparum using large-scale interaction network analysis and in-silico knock-out studies. Malar J 2015; 14:70. [PMID: 25879642 PMCID: PMC4333160 DOI: 10.1186/s12936-015-0562-1] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Accepted: 01/14/2015] [Indexed: 12/28/2022] Open
Abstract
Background Plasmodium falciparum causes the most severe form of malaria and affects 3.2 million people annually. Due to the increasing incidence of resistance to existing drugs, there is a growing need to discover new and more effective drugs against malaria. Despite the global importance of P. falciparum, vast majority of its proteins are uncharacterized experimentally. Application of newer approaches using several “omics” data has become successful for exploring the biological interactions underlying cellular processes. Till date not many system level study has been published using P. falciparum protein protein interaction. Hence, the purpose of this study is to develop a standardized pipeline for structural, functional, and topographical analysis of large scale protein protein interaction network (PPIN) in order to identify proteins important for network topology and integrity. Here, P. falciparum PPIN has been utilized as a model for better understanding of the molecular mechanisms of survival and pathogenesis of malaria parasite. Methods Various graph theoretical approaches were implemented to identify highly interacting hub and central proteins that are crucial for network integrity. Further, potential network perturbing proteins via an in-silico knock-out (KO) analysis to isolate important interacting proteins (IIPs), which in principle, can elicit significant impact on the global and local environments of the P. falciparum interaction network. Results 177 hubs and 132 central proteins were identified from the malarial (proteins: 1607; interactions: 4750) PPI networks. Using the in-silico knock-out exercise 131 and 99 global and local network perturbing proteins were also identified. Finally, 271 proteins from P. falciparum were shortlisted as important interacting proteins (IIPs), which not only play crucial role in intra-pathogen network integrity, stage specificity but also interact with various human proteins involved in multiple metabolic pathways within the host cell. These IIPs could be used as potential drug targets in malarial research. Conclusion Graph theoretical analysis of PPIN can be a very useful approach to identify proteins that are important for regulation of the interactions required for an organism’s survival. Important interacting proteins (IIPs) identified using P. falciparum PPIN provides a useful dataset containing probable candidates for future drug target analysis. Electronic supplementary material The online version of this article (doi:10.1186/s12936-015-0562-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Madhumita Bhattacharyya
- Structural Biology and Bioinformatics Division, Council of Scientific and Industrial Research-Indian Institute of Chemical Biology, Kolkata, 700032, West Bengal, India.
| | - Saikat Chakrabarti
- Structural Biology and Bioinformatics Division, Council of Scientific and Industrial Research-Indian Institute of Chemical Biology, Kolkata, 700032, West Bengal, India.
| |
Collapse
|
23
|
Prioritization of orphan disease-causing genes using topological feature and GO similarity between proteins in interaction networks. SCIENCE CHINA-LIFE SCIENCES 2014; 57:1064-71. [PMID: 25326068 DOI: 10.1007/s11427-014-4747-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2014] [Accepted: 07/15/2014] [Indexed: 12/22/2022]
Abstract
Identification of disease-causing genes among a large number of candidates is a fundamental challenge in human disease studies. However, it is still time-consuming and laborious to determine the real disease-causing genes by biological experiments. With the advances of the high-throughput techniques, a large number of protein-protein interactions have been produced. Therefore, to address this issue, several methods based on protein interaction network have been proposed. In this paper, we propose a shortest path-based algorithm, named SPranker, to prioritize disease-causing genes in protein interaction networks. Considering the fact that diseases with similar phenotypes are generally caused by functionally related genes, we further propose an improved algorithm SPGOranker by integrating the semantic similarity of GO annotations. SPGOranker not only considers the topological similarity between protein pairs in a protein interaction network but also takes their functional similarity into account. The proposed algorithms SPranker and SPGOranker were applied to 1598 known orphan disease-causing genes from 172 orphan diseases and compared with three state-of-the-art approaches, ICN, VS and RWR. The experimental results show that SPranker and SPGOranker outperform ICN, VS, and RWR for the prioritization of orphan disease-causing genes. Importantly, for the case study of severe combined immunodeficiency, SPranker and SPGOranker predict several novel causal genes.
Collapse
|
24
|
Kasavi C, Eraslan S, Arga KY, Oner ET, Kirdar B. A system based network approach to ethanol tolerance in Saccharomyces cerevisiae. BMC SYSTEMS BIOLOGY 2014; 8:90. [PMID: 25103914 PMCID: PMC4236716 DOI: 10.1186/s12918-014-0090-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2014] [Accepted: 07/15/2014] [Indexed: 01/23/2023]
Abstract
Background Saccharomyces cerevisiae has been widely used for bio-ethanol production and development of rational genetic engineering strategies leading both to the improvement of productivity and ethanol tolerance is very important for cost-effective bio-ethanol production. Studies on the identification of the genes that are up- or down-regulated in the presence of ethanol indicated that the genes may be involved to protect the cells against ethanol stress, but not necessarily required for ethanol tolerance. Results In the present study, a novel network based approach was developed to identify candidate genes involved in ethanol tolerance. Protein-protein interaction (PPI) network associated with ethanol tolerance (tETN) was reconstructed by integrating PPI data with Gene Ontology (GO) terms. Modular analysis of the constructed networks revealed genes with no previously reported experimental evidence related to ethanol tolerance and resulted in the identification of 17 genes with previously unknown biological functions. We have randomly selected four of these genes and deletion strains of two genes (YDR307W and YHL042W) were found to exhibit improved tolerance to ethanol when compared to wild type strain. The genome-wide transcriptomic response of yeast cells to the deletions of YDR307W and YHL042W in the absence of ethanol revealed that the deletion of YDR307W and YHL042W genes resulted in the transcriptional re-programming of the metabolism resulting from a mis-perception of the nutritional environment. Yeast cells perceived an excess amount of glucose and a deficiency of methionine or sulfur in the absence of YDR307W and YHL042W, respectively, possibly resulting from a defect in the nutritional sensing and signaling or transport mechanisms. Mutations leading to an increase in ribosome biogenesis were found to be important for the improvement of ethanol tolerance. Modulations of chronological life span were also identified to contribute to ethanol tolerance in yeast. Conclusions The system based network approach developed allows the identification of novel gene targets for improved ethanol tolerance and supports the highly complex nature of ethanol tolerance in yeast.
Collapse
Affiliation(s)
| | | | | | | | - Betul Kirdar
- Department of Chemical Engineering, Boğaziçi University, Istanbul, Turkey.
| |
Collapse
|
25
|
Identifying the gene signatures from gene-pathway bipartite network guarantees the robust model performance on predicting the cancer prognosis. BIOMED RESEARCH INTERNATIONAL 2014; 2014:424509. [PMID: 25126556 PMCID: PMC4122106 DOI: 10.1155/2014/424509] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Revised: 06/21/2014] [Accepted: 06/24/2014] [Indexed: 01/12/2023]
Abstract
For the purpose of improving the prediction of cancer prognosis in the clinical researches, various algorithms have been developed to construct the predictive models with the gene signatures detected by DNA microarrays. Due to the heterogeneity of the clinical samples, the list of differentially expressed genes (DEGs) generated by the statistical methods or the machine learning algorithms often involves a number of false positive genes, which are not associated with the phenotypic differences between the compared clinical conditions, and subsequently impacts the reliability of the predictive models. In this study, we proposed a strategy, which combined the statistical algorithm with the gene-pathway bipartite networks, to generate the reliable lists of cancer-related DEGs and constructed the models by using support vector machine for predicting the prognosis of three types of cancers, namely, breast cancer, acute myeloma leukemia, and glioblastoma. Our results demonstrated that, combined with the gene-pathway bipartite networks, our proposed strategy can efficiently generate the reliable cancer-related DEG lists for constructing the predictive models. In addition, the model performance in the swap analysis was similar to that in the original analysis, indicating the robustness of the models in predicting the cancer outcomes.
Collapse
|
26
|
Netotea S, Sundell D, Street NR, Hvidsten TR. ComPlEx: conservation and divergence of co-expression networks in A. thaliana, Populus and O. sativa. BMC Genomics 2014; 15:106. [PMID: 24498971 PMCID: PMC3925997 DOI: 10.1186/1471-2164-15-106] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Accepted: 01/29/2014] [Indexed: 01/16/2023] Open
Abstract
Background Divergence in gene regulation has emerged as a key mechanism underlying species differentiation. Comparative analysis of co-expression networks across species can reveal conservation and divergence in the regulation of genes. Results We inferred co-expression networks of A. thaliana, Populus spp. and O. sativa using state-of-the-art methods based on mutual information and context likelihood of relatedness, and conducted a comprehensive comparison of these networks across a range of co-expression thresholds. In addition to quantifying gene-gene link and network neighbourhood conservation, we also applied recent advancements in network analysis to do cross-species comparisons of network properties such as scale free characteristics and gene centrality as well as network motifs. We found that in all species the networks emerged as scale free only above a certain co-expression threshold, and that the high-centrality genes upholding this organization tended to be conserved. Network motifs, in particular the feed-forward loop, were found to be significantly enriched in specific functional subnetworks but where much less conserved across species than gene centrality. Although individual gene-gene co-expression had massively diverged, up to ~80% of the genes still had a significantly conserved network neighbourhood. For genes with multiple predicted orthologs, about half had one ortholog with conserved regulation and another ortholog with diverged or non-conserved regulation. Furthermore, the most sequence similar ortholog was not the one with the most conserved gene regulation in over half of the cases. Conclusions We have provided a comprehensive analysis of gene regulation evolution in plants and built a web tool for Comparative analysis of Plant co-Expression networks (ComPlEx, http://complex.plantgenie.org/). The tool can be particularly useful for identifying the ortholog with the most conserved regulation among several sequence-similar alternatives and can thus be of practical importance in e.g. finding candidate genes for perturbation experiments.
Collapse
Affiliation(s)
| | | | | | - Torgeir R Hvidsten
- Umeå Plant Science Center, Department of Plant Physiology, Umeå University, Umeå, Sweden.
| |
Collapse
|
27
|
Peng Q, Schork NJ. Utility of network integrity methods in therapeutic target identification. Front Genet 2014; 5:12. [PMID: 24550933 PMCID: PMC3909879 DOI: 10.3389/fgene.2014.00012] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2013] [Accepted: 01/13/2014] [Indexed: 01/05/2023] Open
Abstract
Analysis of the biological gene networks involved in a disease may lead to the identification of therapeutic targets. Such analysis requires exploring network properties, in particular the importance of individual network nodes (i.e., genes). There are many measures that consider the importance of nodes in a network and some may shed light on the biological significance and potential optimality of a gene or set of genes as therapeutic targets. This has been shown to be the case in cancer therapy. A dilemma exists, however, in finding the best therapeutic targets based on network analysis since the optimal targets should be nodes that are highly influential in, but not toxic to, the functioning of the entire network. In addition, cancer therapeutics targeting a single gene often result in relapse since compensatory, feedback and redundancy loops in the network may offset the activity associated with the targeted gene. Thus, multiple genes reflecting parallel functional cascades in a network should be targeted simultaneously, but require the identification of such targets. We propose a methodology that exploits centrality statistics characterizing the importance of nodes within a gene network that is constructed from the gene expression patterns in that network. We consider centrality measures based on both graph theory and spectral graph theory. We also consider the origins of a network topology, and show how different available representations yield different node importance results. We apply our techniques to tumor gene expression data and suggest that the identification of optimal therapeutic targets involving particular genes, pathways and sub-networks based on an analysis of the nodes in that network is possible and can facilitate individualized cancer treatments. The proposed methods also have the potential to identify candidate cancer therapeutic targets that are not thought to be oncogenes but nonetheless play important roles in the functioning of a cancer-related network or pathway.
Collapse
Affiliation(s)
- Qian Peng
- Department of Molecular and Experimental Medicine, The Scripps Research Institute La Jolla, CA, USA ; Scripps Genomic Medicine, The Scripps Translational Science Institute La Jolla, CA, USA
| | - Nicholas J Schork
- Department of Molecular and Experimental Medicine, The Scripps Research Institute La Jolla, CA, USA ; Scripps Genomic Medicine, The Scripps Translational Science Institute La Jolla, CA, USA
| |
Collapse
|
28
|
Fröhlich H. Including network knowledge into Cox regression models for biomarker signature discovery. Biom J 2014; 56:287-306. [DOI: 10.1002/bimj.201300035] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Revised: 10/10/2013] [Accepted: 10/25/2013] [Indexed: 12/14/2022]
Affiliation(s)
- Holger Fröhlich
- Rheinische Friedrich-Wilhelms-Universität Bonn; Bonn-Aachen International Center for IT, Algorithmic Bioinformatics; Dahlmannstr. 2 53113 Bonn Germany
| |
Collapse
|
29
|
Dimitrakopoulou K, Dimitrakopoulos GN, Sgarbas KN, Bezerianos A. Tamoxifen integromics and personalized medicine: dynamic modular transformations underpinning response to tamoxifen in breast cancer treatment. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2013; 18:15-33. [PMID: 24299457 DOI: 10.1089/omi.2013.0055] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Recent advances in pharmacogenomics technologies allow bold steps to be taken towards personalized medicine, more accurate health planning, and personalized drug development. In this framework, systems pharmacology network-based approaches offer an appealing way for integrating multi-omics data and set the basis for defining systems-level drug response biomarkers. On the road to individualized tamoxifen treatment in estrogen receptor-positive breast cancer patients, we examine the dynamics of the attendant pharmacological response mechanisms. By means of an "integromics" network approach, we assessed the tamoxifen effect through the way the high-order organization of interactome (i.e., the modules) is perturbed. To accomplish that, first we integrated the time series transcriptome data with the human protein interaction data, and second, an efficient module-detecting algorithm was applied onto the composite graphs. Our findings show that tamoxifen induces severe modular transformations on specific areas of the interactome. Our modular biomarkers in response to tamoxifen attest to the immunomodulatory role of tamoxifen, and further reveal that it deregulates cell cycle and apoptosis pathways, while coordinating the proteasome and basal transcription factors. To the best of our knowledge, this is the first report that informs the fields of personalized medicine and clinical pharmacology about the actual dynamic interactome response to tamoxifen administration.
Collapse
|
30
|
Pradhan MP, Nagulapalli K, Palakal MJ. Cliques for the identification of gene signatures for colorectal cancer across population. BMC SYSTEMS BIOLOGY 2012; 6 Suppl 3:S17. [PMID: 23282040 PMCID: PMC3524317 DOI: 10.1186/1752-0509-6-s3-s17] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Background Colorectal cancer (CRC) is one of the most commonly diagnosed cancers worldwide. Studies have correlated risk of CRC development with dietary habits and environmental conditions. Gene signatures for any disease can identify the key biological processes, which is especially useful in studying cancer development. Such processes can be used to evaluate potential drug targets. Though recognition of CRC gene-signatures across populations is crucial to better understanding potential novel treatment options for CRC, it remains a challenging task. Results We developed a topological and biological feature-based network approach for identifying the gene signatures across populations. In this work, we propose a novel approach of using cliques to understand the variability within population. Cliques are more conserved and co-expressed, therefore allowing identification and comparison of cliques across a population which can help researchers study gene variations. Our study was based on four publicly available expression datasets belonging to four different populations across the world. We identified cliques of various sizes (0 to 7) across the four population networks. Cliques of size seven were further analyzed across populations for their commonality and uniqueness. Forty-nine common cliques of size seven were identified. These cliques were further analyzed based on their connectivity profiles. We found associations between the cliques and their connectivity profiles across networks. With these clique connectivity profiles (CCPs), we were able to identify the divergence among the populations, important biological processes (cell cycle, signal transduction, and cell differentiation), and related gene pathways. Therefore the genes identified in these cliques and their connectivity profiles can be defined as the gene-signatures across populations. In this work we demonstrate the power and effectiveness of cliques to study CRC across populations. Conclusions We developed a new approach where cliques and their connectivity profiles helped elucidate the variation and similarity in CRC gene profiles across four populations with unique dietary habits.
Collapse
Affiliation(s)
- Meeta P Pradhan
- School of Informatics, Indiana University Purdue University Indianapolis, IN, USA
| | | | | |
Collapse
|
31
|
Zhang K, Liu Y, Yang JY, Arabnia HR, Niemierko A, Ghafoor A, Li W, Deng Y. From genes to networks: in systematic points of view. BMC SYSTEMS BIOLOGY 2011; 5 Suppl 3:I1. [PMID: 22784614 PMCID: PMC3287563 DOI: 10.1186/1752-0509-5-s3-i1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
We present a report of the BIOCOMP'10 - The 2010 International Conference on Bioinformatics & Computational Biology and other related work in the area of systems biology.
Collapse
|