1
|
Adinew GM, Messeha S, Taka E, Ahmed SA, Soliman KFA. The Role of Apoptotic Genes and Protein-Protein Interactions in Triple-negative Breast Cancer. Cancer Genomics Proteomics 2023; 20:247-272. [PMID: 37093683 PMCID: PMC10148064 DOI: 10.21873/cgp.20379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 02/09/2023] [Accepted: 02/19/2023] [Indexed: 04/25/2023] Open
Abstract
BACKGROUND/AIM Compared to other breast cancer types, triple-negative breast cancer (TNBC) has historically had few treatment alternatives. Therefore, exploring and pinpointing potentially implicated genes could be used for treating and managing TNBC. By doing this, we will provide essential data to comprehend how the genes are involved in the apoptotic pathways of the cancer cells to identify potential therapeutic targets. Analysis of a single genetic alteration may not reveal the pathogenicity driving TNBC due to the high genomic complexity and heterogeneity of TNBC. Therefore, searching through a large variety of gene interactions enabled the identification of molecular therapeutic genes. MATERIALS AND METHODS This study used integrated bioinformatics methods such as UALCAN, TNM plotter, PANTHER, GO-KEEG and PPIs to assess the gene expression, protein-protein interaction (PPI), and transcription factor interaction of apoptosis-regulated genes. RESULTS Compared to normal breast tissue, gene expressions of BNIP3, TNFRSF10B, MCL1, and CASP4 were downregulated in UALCAN. At the same time, BIK, AKT1, BAD, FADD, DIABLO, and CASP9 was down-regulated in bc-GeneExMiner v4.5 mRNA expression (BCGM) databases. Based on GO term enrichment analysis, the cellular process (GO:0009987), which has about 21 apoptosis-regulated genes, is the top category in the biological processes (BP), followed by biological regulation (GO:0065007). We identified 29 differentially regulated pathways, including the p53 pathway, angiogenesis, apoptosis signaling pathway, and the Alzheimer's disease presenilin pathway. We examined the PPIs between the genes that regulate apoptosis; CASP3 and CASP9 interact with FADD, MCL1, TNF, TNFRSRF10A, and TNFRSF10; additionally, CASP3 significantly forms PPIs with CASP9, DFFA, and TP53, and CASP9 with DIABLO. In the top 10 transcription factors, the androgen receptor (AR) interacts with five apoptosis-regulated genes (p<0.0001; q<0.01), followed by retinoic acid receptor alpha (RARA) (p<0.0001; q<0.01) and ring finger protein (RNF2) (p<0.0001; q<0.01). Overall, the gene expression profile, PPIs, and the apoptosis-TF interaction findings suggest that the 27 apoptosis-regulated genes might be used as promising targets in treating and managing TNBC. Furthermore, from a total of 27 key genes, CASP2, CASP3, DAPK1, TNF, TRAF2, and TRAF3 were significantly correlated with poor overall survival in TNBC (p-value <0.05); they could play important roles in the progression of TNBC and provide attractive therapeutic targets that may offer new candidate molecules for targeted therapy. CONCLUSION Our findings demonstrate that CASP2, CASP3, DAPK1, TNF, TRAF2, and TRAF3 were substantially associated with the overall survival rate (OS) difference of TNBC patients out of a total of 27 specific genes used in this study, which may play crucial roles in the development of TNBC and offer promising therapeutic interventions.
Collapse
Affiliation(s)
- Getinet M Adinew
- Division of Pharmaceutical Sciences, College of Pharmacy and Pharmaceutical Sciences, Institute of Public Health, Florida A&M University, Tallahassee, FL, U.S.A
| | - Samia Messeha
- Division of Pharmaceutical Sciences, College of Pharmacy and Pharmaceutical Sciences, Institute of Public Health, Florida A&M University, Tallahassee, FL, U.S.A
| | - Equar Taka
- Division of Pharmaceutical Sciences, College of Pharmacy and Pharmaceutical Sciences, Institute of Public Health, Florida A&M University, Tallahassee, FL, U.S.A
| | - Shade A Ahmed
- Division of Pharmaceutical Sciences, College of Pharmacy and Pharmaceutical Sciences, Institute of Public Health, Florida A&M University, Tallahassee, FL, U.S.A
| | - Karam F A Soliman
- Division of Pharmaceutical Sciences, College of Pharmacy and Pharmaceutical Sciences, Institute of Public Health, Florida A&M University, Tallahassee, FL, U.S.A.
| |
Collapse
|
2
|
Liu J, Zhu H, Qiu J. Locally Adjust Networks Based on Connectivity and Semantic Similarities for Disease Module Detection. Front Genet 2021; 12:726596. [PMID: 34759955 PMCID: PMC8575408 DOI: 10.3389/fgene.2021.726596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 09/22/2021] [Indexed: 11/13/2022] Open
Abstract
For studying the pathogenesis of complex diseases, it is important to identify the disease modules in the system level. Since the protein-protein interaction (PPI) networks contain a number of incomplete and incorrect interactome, most existing methods often lead to many disease proteins isolating from disease modules. In this paper, we propose an effective disease module identification method IDMCSS, where the used human PPI networks are obtained by adding some potential missing interactions from existing PPI networks, as well as removing some potential incorrect interactions. In IDMCSS, a network adjustment strategy is developed to add or remove links around disease proteins based on both topological and semantic information. Next, neighboring proteins of disease proteins are prioritized according to a suggested similarity between each of them and disease proteins, and the protein with the largest similarity with disease proteins is added into a candidate disease protein set one by one. The stopping criterion is set to the boundary of the disease proteins. Finally, the connected subnetwork having the largest number of disease proteins is selected as a disease module. Experimental results on asthma demonstrate the effectiveness of the method in comparison to existing algorithms for disease module identification. It is also shown that the proposed IDMCSS can obtain the disease modules having crucial biological processes of asthma and 12 targets for drug intervention can be predicted.
Collapse
Affiliation(s)
- Jia Liu
- State Key Laboratory of Media Convergence and Communication, Communication University of China, Beijing, China
| | - Huole Zhu
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, School of Artificial Intelligence, Anhui University, Hefei, China
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, School of Artificial Intelligence, Anhui University, Hefei, China
| | - Jianfeng Qiu
- Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, School of Artificial Intelligence, Anhui University, Hefei, China
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, School of Artificial Intelligence, Anhui University, Hefei, China
| |
Collapse
|
3
|
Thummadi NB, Vishnu E, Subbiah EV, Manimaran P. A graph centrality-based approach for candidate gene prediction for type 1 diabetes. Immunol Res 2021; 69:422-428. [PMID: 34297307 DOI: 10.1007/s12026-021-09217-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 07/15/2021] [Indexed: 10/20/2022]
Abstract
Type 1 diabetes mellitus (T1DM) or insulin-dependent diabetes is an autoimmune disease that may pose life-threatening situations to individuals. In most cases, cytotoxic T lymphocytes (CTLs) promotes killing of islets of Langerhans in the pancreas, which harbour insulin-producing beta cells. The trigger for autoimmune attack is still unclear; therefore, identifying and targeting candidate genes are imperative to hinder its deleterious effects. In the present study, we focused on identification of new candidate genes for T1DM. For our study, we exclusively selected immune-related genes as they play a crucial role in T1DM. We constructed and analysed a human immunome signalling network (directed network) to identify the new candidate genes through various graph centrality measures combining with Gene Ontology (GO). As a result, we identified 4 new candidate genes which may act as potential drug targets for T1DM. We further validated for their disease relevance through literature survey and pathway analysis and found that 3 out of 4 predicted genes mirrored their well-established roles as potential targets for T1DM.
Collapse
Affiliation(s)
- N B Thummadi
- Department of Animal Biology, University of Hyderabad, Gachibowli, Hyderabad, 500046, India
| | - E Vishnu
- School of Physics, University of Hyderabad, Gachibowli, Hyderabad, 500046, Telangana, India
| | - E V Subbiah
- Department of Sports Biosciences, Central University of Rajasthan, Kishangarh, Ajmer, 305817, India
| | - P Manimaran
- School of Physics, University of Hyderabad, Gachibowli, Hyderabad, 500046, Telangana, India.
| |
Collapse
|
4
|
Amanatidou AI, Nastou KC, Tsitsilonis OE, Iconomidou VA. Visualization and analysis of the interaction network of proteins associated with blood-cell targeting autoimmune diseases. Biochim Biophys Acta Mol Basis Dis 2020; 1866:165714. [DOI: 10.1016/j.bbadis.2020.165714] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Revised: 01/21/2020] [Accepted: 01/31/2020] [Indexed: 12/17/2022]
|
5
|
Oral lichen planus interactome reveals CXCR4 and CXCL12 as candidate therapeutic targets. Sci Rep 2020; 10:5454. [PMID: 32214134 PMCID: PMC7096434 DOI: 10.1038/s41598-020-62258-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2019] [Accepted: 03/12/2020] [Indexed: 01/03/2023] Open
Abstract
Today, we face difficulty in generating new hypotheses and understanding oral lichen planus due to the large amount of biomedical information available. In this research, we have used an integrated bioinformatics approach assimilating information from data mining, gene ontologies, protein–protein interaction and network analysis to predict candidate genes related to oral lichen planus. A detailed pathway analysis led us to propose two promising therapeutic targets: the stromal cell derived factor 1 (CXCL12) and the C-X-C type 4 chemokine receptor (CXCR4). We further validated our predictions and found that CXCR4 was upregulated in all oral lichen planus tissue samples. Our bioinformatics data cumulatively support the pathological role of chemokines and chemokine receptors in oral lichen planus. From a clinical perspective, we suggest a drug (plerixafor) and two therapeutic targets for future research.
Collapse
|
6
|
Malhotra AG, Singh S, Jha M, Pandey KM. A Parametric Targetability Evaluation Approach for Vitiligo Proteome Extracted through Integration of Gene Ontologies and Protein Interaction Topologies. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:1830-1842. [PMID: 29994537 DOI: 10.1109/tcbb.2018.2835459] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Vitiligo is a well-known skin disorder with complex etiology. Vitiligo pathogenesis is multifaceted with many ramifications. A computational systemic path was designed to first propose candidate disease proteins by merging properties from protein interaction networks and gene ontology terms. All in all, 109 proteins were identified and suggested to be involved in the onset of disease or its progression. Later, a composite approach was employed to prioritize vitiligo disease proteins by comparing and benchmarking the properties against standard target identification criteria. This includes sequence-based, structural, functional, essentiality, protein-protein interaction, vulnerability, secretability, assayability, and druggability information. The existing information was seamlessly integrated into efficient pipelines to propose a novel protocol for assessment of targetability of disease proteins. Using the online data resources and the scripting, an illustrative list of 68 potential drug targets was generated for vitiligo. While this list is broadly consistent with the research community's current interest in certain specific proteins, and suggests novel target candidates that may merit further study, it can still be modified to correspond to a user-specific environment, either by adjusting the weights for chosen criteria (i.e., a quantitative approach) or by changing the considered criteria (i.e., a qualitative approach).
Collapse
|
7
|
Li Q, Yang Z, Zhao Z, Luo L, Li Z, Wang L, Zhang Y, Lin H, Wang J, Zhang Y. HMNPPID-human malignant neoplasm protein-protein interaction database. Hum Genomics 2019; 13:44. [PMID: 31639057 PMCID: PMC6805303 DOI: 10.1186/s40246-019-0223-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Protein-protein interaction (PPI) information extraction from biomedical literature helps unveil the molecular mechanisms of biological processes. Especially, the PPIs associated with human malignant neoplasms can unveil the biology behind these neoplasms. However, such PPI database is not currently available. RESULTS In this work, a database of protein-protein interactions associated with 171 kinds of human malignant neoplasms named HMNPPID is constructed. In addition, a visualization program, named VisualPPI, is provided to facilitate the analysis of the PPI network for a specific neoplasm. CONCLUSIONS HMNPPID can hopefully become an important resource for the research on PPIs of human malignant neoplasms since it provides readily available data for healthcare professionals. Thus, they do not need to dig into a large amount of biomedical literatures any more, which may accelerate the researches on the PPIs of malignant neoplasms.
Collapse
Affiliation(s)
- Qingqing Li
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Zhihao Yang
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China.
| | - Zhehuan Zhao
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Ling Luo
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Zhiheng Li
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Lei Wang
- Beijing Institute of Health Administration and Medical Information, Beijing, 100850, China.
| | - Yin Zhang
- Beijing Institute of Health Administration and Medical Information, Beijing, 100850, China
| | - Hongfei Lin
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Jian Wang
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Yijia Zhang
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| |
Collapse
|
8
|
Cheng L, Zhao H, Wang P, Zhou W, Luo M, Li T, Han J, Liu S, Jiang Q. Computational Methods for Identifying Similar Diseases. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 18:590-604. [PMID: 31678735 PMCID: PMC6838934 DOI: 10.1016/j.omtn.2019.09.019] [Citation(s) in RCA: 75] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 09/11/2019] [Accepted: 09/12/2019] [Indexed: 02/01/2023]
Abstract
Although our knowledge of human diseases has increased dramatically, the molecular basis, phenotypic traits, and therapeutic targets of most diseases still remain unclear. An increasing number of studies have observed that similar diseases often are caused by similar molecules, can be diagnosed by similar markers or phenotypes, or can be cured by similar drugs. Thus, the identification of diseases similar to known ones has attracted considerable attention worldwide. To this end, the associations between diseases at the molecular, phenotypic, and taxonomic levels were used to measure the pairwise similarity in diseases. The corresponding performance assessment strategies for these methods involving the terms “category-based,” “simulated-patient-based,” and “benchmark-data-based” were thus further emphasized. Then, frequently used methods were evaluated using a benchmark-data-based strategy. To facilitate the assessment of disease similarity scores, researchers have designed dozens of tools that implement these methods for calculating disease similarity. Currently, disease similarity has been advantageous in predicting noncoding RNA (ncRNA) function and therapeutic drugs for diseases. In this article, we review disease similarity methods, evaluation strategies, tools, and their applications in the biomedical community. We further evaluate the performance of these methods and discuss the current limitations and future trends for calculating disease similarity.
Collapse
Affiliation(s)
- Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Hengqiang Zhao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Pingping Wang
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Wenyang Zhou
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Meng Luo
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Tianxin Li
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Junwei Han
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
| | - Shulin Liu
- Systemomics Center, College of Pharmacy, and Genomics Research Center (State-Province Key Laboratories of Biomedicine-Pharmaceutics of China), Harbin Medical University, Harbin, Heilongjiang, China; Department of Microbiology, Immunology and Infectious Diseases, University of Calgary, Calgary, AB, Canada.
| | - Qinghua Jiang
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China.
| |
Collapse
|
9
|
Identification of candidate disease genes in patients with common variable immunodeficiency. QUANTITATIVE BIOLOGY 2019. [DOI: 10.1007/s40484-019-0174-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
10
|
Prioritizing complex disease risk genes by integrating multiple data. Genomics 2019; 111:590-597. [DOI: 10.1016/j.ygeno.2018.03.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 03/07/2018] [Accepted: 03/18/2018] [Indexed: 01/18/2023]
|
11
|
Predicting disease-genes based on network information loss and protein complexes in heterogeneous network. Inf Sci (N Y) 2019. [DOI: 10.1016/j.ins.2018.12.008] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|
12
|
Teku GN, Vihinen M. Simulation of the Dynamics of Primary Immunodeficiencies in B Cells. Front Immunol 2018; 9:1785. [PMID: 30116248 PMCID: PMC6082931 DOI: 10.3389/fimmu.2018.01785] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2018] [Accepted: 07/19/2018] [Indexed: 12/20/2022] Open
Abstract
Primary immunodeficiencies (PIDs) are a group of over 300 hereditary, heterogeneous, and mainly rare disorders that affect the immune system. Various aspects of immune system and PID proteins and genes have been investigated and facilitate systems biological studies of effects of PIDs on B cell physiology and response. We reconstructed a B cell network model based on data for the core B cell receptor activation and response processes and performed semi-quantitative dynamic simulations for normal and B cell PID failure modes. The results for several knockout simulations correspond to previously reported molecular studies and reveal novel mechanisms for PIDs. The simulations for CD21, CD40, LYN, MS4A1, ORAI1, PLCG2, PTPRC, and STIM1 indicated profound changes to major transcription factor signaling and to the network. Significant effects were observed also in the BCL10, BLNK, BTK, loss-of-function CARD11, IKKB, MALT1, and NEMO, simulations whereas only minor effects were detected for PIDs that are caused by constitutively active proteins (PI3K, gain-of-function CARD11, KRAS, and NFKBIA). This study revealed the underlying dynamics of PID diseases, confirms previous observations, and identifies novel candidates for PID diagnostics and therapy.
Collapse
Affiliation(s)
| | - Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, Lund, Sweden
| |
Collapse
|
13
|
Anaya JM, Leon KJ, Rojas M, Rodriguez Y, Pacheco Y, Acosta-Ampudia Y, Monsalve DM, Ramirez-Santana C. Progress towards precision medicine for lupus: the role of genetic biomarkers. EXPERT REVIEW OF PRECISION MEDICINE AND DRUG DEVELOPMENT 2018. [DOI: 10.1080/23808993.2018.1448266] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- Juan-Manuel Anaya
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Kelly J. Leon
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Manuel Rojas
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Yhojan Rodriguez
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Yovana Pacheco
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Yeny Acosta-Ampudia
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Diana M. Monsalve
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Carolina Ramirez-Santana
- Center for Autoimmune Diseases Research (CREA), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| |
Collapse
|
14
|
|
15
|
Hu Y, Zhou M, Shi H, Ju H, Jiang Q, Cheng L. Measuring disease similarity and predicting disease-related ncRNAs by a novel method. BMC Med Genomics 2017; 10:71. [PMID: 29297338 PMCID: PMC5751624 DOI: 10.1186/s12920-017-0315-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Background Similar diseases are always caused by similar molecular origins, such as diasease-related protein-coding genes (PCGs). And the molecular associations reflect their similarity. Therefore, current methods for calculating disease similarity often utilized functional interactions of PCGs. Besides, the existing methods have neglected a fact that genes could also be associated in the gene functional network (GFN) based on intermediate nodes. Methods Here we presented a novel method, InfDisSim, to deduce the similarity of diseases. InfDisSim utilized the whole network based on random walk with damping to model the information flow. A benchmark set of similar disease pairs was employed to evaluate the performance of InfDisSim. Results The region beneath the receiver operating characteristic curve (AUC) was calculated to assess the performance. As a result, InfDisSim reaches a high AUC (0.9786) which indicates a very good performance. Furthermore, after calculating the disease similarity by the InfDisSim, we reconfirmed that similar diseases tend to have common therapeutic drugs (Pearson correlation γ2 = 0.1315, p = 2.2e-16). Finally, the disease similarity computed by infDisSim was employed to construct a miRNA similarity network (MSN) and lncRNA similarity network (LSN), which were further exploited to predict potential associations of lncRNA-disease pairs and miRNA-disease pairs, respectively. High AUC (0.9893, 0.9007) based on leave-one-out cross validation shows that the LSN and MSN is very appropriate for predicting novel disease-related lncRNAs and miRNAs, respectively. Conclusions The high AUC based on benchmark data indicates the method performs well. The method is valuable in the prediction of disease-related lncRNAs and miRNAs. Electronic supplementary material The online version of this article (doi: 10.1186/s12920-017-0315-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yang Hu
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China
| | - Meng Zhou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150001, China
| | - Hongbo Shi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150001, China
| | - Hong Ju
- Department of information engineering, Heilongjiang biological science and technology Career Academy, Harbin, 150001, China
| | - Qinghua Jiang
- School of Life Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China.
| | - Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150001, China.
| |
Collapse
|
16
|
Kim J, Bang C, Hwang H, Kim D, Park C, Park S. IMA: Identifying disease-related genes using MeSH terms and association rules. J Biomed Inform 2017; 76:110-123. [DOI: 10.1016/j.jbi.2017.11.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2017] [Revised: 10/29/2017] [Accepted: 11/13/2017] [Indexed: 01/19/2023]
|
17
|
Weishaupt H, Johansson P, Engström C, Nelander S, Silvestrov S, Swartling FJ. Loss of Conservation of Graph Centralities in Reverse-engineered Transcriptional Regulatory Networks. Methodol Comput Appl Probab 2017. [DOI: 10.1007/s11009-017-9554-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
|
18
|
Grewal N, Singh S, Chand T. Effect of Aggregation Operators on Network-Based Disease Gene Prioritization: A Case Study on Blood Disorders. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:1276-1287. [PMID: 29220322 DOI: 10.1109/tcbb.2016.2599155] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Owing to the innate noise in the biological data sources, a single source or a single measure do not suffice for an effective disease gene prioritization. So, the integration of multiple data sources or aggregation of multiple measures is the need of the hour. The aggregation operators combine multiple related data values to a single value such that the combined value has the effect of all the individual values. In this paper, an attempt has been made for applying the fuzzy aggregation on the network-based disease gene prioritization and investigate its effect under noise conditions. This study has been conducted for a set of 15 blood disorders by fusing four different network measures, computed from the protein interaction network, using a selected set of aggregation operators and ranking the genes on the basis of the aggregated value. The aggregation operator-based rankings have been compared with the "Random walk with restart" gene prioritization method. The impact of noise has also been investigated by adding varying proportions of noise to the seed set. The results reveal that for all the selected blood disorders, the Mean of Maximal operator has relatively outperformed the other aggregation operators for noisy as well as non-noisy data.
Collapse
|
19
|
Hu Y, Zhao L, Liu Z, Ju H, Shi H, Xu P, Wang Y, Cheng L. DisSetSim: an online system for calculating similarity between disease sets. J Biomed Semantics 2017; 8:28. [PMID: 29297411 PMCID: PMC5763469 DOI: 10.1186/s13326-017-0140-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Background Functional similarity between molecules results in similar phenotypes, such as diseases. Therefore, it is an effective way to reveal the function of molecules based on their induced diseases. However, the lack of a tool for obtaining the similarity score of pair-wise disease sets (SSDS) limits this type of application. Results Here, we introduce DisSetSim, an online system to solve this problem in this article. Five state-of-the-art methods involving Resnik’s, Lin’s, Wang’s, PSB, and SemFunSim methods were implemented to measure the similarity score of pair-wise diseases (SSD) first. And then “pair-wise-best pairs-average” (PWBPA) method was implemented to calculated the SSDS by the SSD. The system was applied for calculating the functional similarity of miRNAs based on their induced disease sets. The results were further used to predict potential disease-miRNA relationships. Conclusions The high area under the receiver operating characteristic curve AUC (0.9296) based on leave-one-out cross validation shows that the PWBPA method achieves a high true positive rate and a low false positive rate. The system can be accessed from http://www.bio-annotation.cn:8080/DisSetSim/.
Collapse
Affiliation(s)
- Yang Hu
- Harbin Institute of Technology, School of Life Science and Technology, Harbin, 150001, People's Republic of China
| | - Lingling Zhao
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China
| | - Zhiyan Liu
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China
| | - Hong Ju
- Department of information engineering, Heilongjiang Biological Science and Technology Career Academy, Harbin, 150001, People's Republic of China
| | - Hongbo Shi
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150001, People's Republic of China
| | - Peigang Xu
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China
| | - Yadong Wang
- Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China.
| | - Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, 150001, People's Republic of China.
| |
Collapse
|
20
|
Identification of oral cancer related candidate genes by integrating protein-protein interactions, gene ontology, pathway analysis and immunohistochemistry. Sci Rep 2017; 7:2472. [PMID: 28559546 PMCID: PMC5449392 DOI: 10.1038/s41598-017-02522-5] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Accepted: 04/10/2017] [Indexed: 12/12/2022] Open
Abstract
In the recent years, bioinformatics methods have been reported with a high degree of success for candidate gene identification. In this milieu, we have used an integrated bioinformatics approach assimilating information from gene ontologies (GO), protein–protein interaction (PPI) and network analysis to predict candidate genes related to oral squamous cell carcinoma (OSCC). A total of 40973 PPIs were considered for 4704 cancer-related genes to construct human cancer gene network (HCGN). The importance of each node was measured in HCGN by ten different centrality measures. We have shown that the top ranking genes are related to a significantly higher number of diseases as compared to other genes in HCGN. A total of 39 candidate oral cancer target genes were predicted by combining top ranked genes and the genes corresponding to significantly enriched oral cancer related GO terms. Initial verification using literature and available experimental data indicated that 29 genes were related with OSCC. A detailed pathway analysis led us to propose a role for the selected candidate genes in the invasion and metastasis in OSCC. We further validated our predictions using immunohistochemistry (IHC) and found that the gene FLNA was upregulated while the genes ARRB1 and HTT were downregulated in the OSCC tissue samples.
Collapse
|
21
|
Teku GN, Vihinen M. Simulation of the dynamics of primary immunodeficiencies in CD4+ T-cells. PLoS One 2017; 12:e0176500. [PMID: 28448599 PMCID: PMC5407609 DOI: 10.1371/journal.pone.0176500] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2017] [Accepted: 04/11/2017] [Indexed: 01/05/2023] Open
Abstract
Primary immunodeficiencies (PIDs) form a large and heterogeneous group of mainly rare disorders that affect the immune system. T-cell deficiencies account for about one-tenth of PIDs, most of them being monogenic. Apart from genetic and clinical information, lots of other data are available for PID proteins and genes, including functions and interactions. Thus, it is possible to perform systems biology studies on the effects of PIDs on T-cell physiology and response. To achieve this, we reconstructed a T-cell network model based on literature mining and TPPIN, a previously published core T-cell network, and performed semi-quantitative dynamic network simulations on both normal and T-cell PID failure modes. The results for several loss-of-function PID simulations correspond to results of previously reported molecular studies. The simulations for TCR PTPRC, LCK, ZAP70 and ITK indicate profound changes to numerous proteins in the network. Significant effects were observed also in the BCL10, CARD11, MALT1, NEMO, IKKB and MAP3K14 simulations. No major effects were observed for PIDs that are caused by constitutively active proteins. The T-cell model facilitates the understanding of the underlying dynamics of PID disease processes. The approach confirms previous knowledge about T-cell signaling network and indicates several new important proteins that may be of interest when developing novel diagnosis and therapies to treat immunological defects.
Collapse
Affiliation(s)
- Gabriel N. Teku
- Department of Experimental Medical Science, Lund University, Lund, Sweden
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, Sweden
- * E-mail:
| |
Collapse
|
22
|
Fijarczyk A, Dudek K, Babik W. Selective Landscapes in newt Immune Genes Inferred from Patterns of Nucleotide Variation. Genome Biol Evol 2016; 8:3417-3432. [PMID: 27702815 PMCID: PMC5203778 DOI: 10.1093/gbe/evw236] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Host–pathogen interactions may result in either directional selection or in pressure for the maintenance of polymorphism at the molecular level. Hence signatures of both positive and balancing selection are expected in immune genes. Because both overall selective pressure and specific targets may differ between species, large-scale population genomic studies are useful in detecting functionally important immune genes and comparing selective landscapes between taxa. Such studies are of particular interest in amphibians, a group threatened worldwide by emerging infectious diseases. Here, we present an analysis of polymorphism and divergence of 634 immune genes in two lineages of Lissotriton newts: L. montandoni and L. vulgaris graecus. Variation in newt immune genes has been shaped predominantly by widespread purifying selection and strong evolutionary constraint, implying long-term importance of these genes for functioning of the immune system. The two evolutionary lineages differ in the overall strength of purifying selection which can partially be explained by demographic history but may also signal differences in long-term pathogen pressure. The prevalent constraint notwithstanding, 23 putative targets of positive selection and 11 putative targets of balancing selection were identified. The latter were detected by composite tests involving the demographic model and further validated in independent population samples. Putative targets of balancing selection encode proteins which may interact closely with pathogens but include also regulators of immune response. The identified candidates will be useful for testing whether genes affected by balancing selection are more prone to interspecific introgression than other genes in the genome.
Collapse
Affiliation(s)
- Anna Fijarczyk
- Institute of Environmental Sciences, Jagiellonian University, Kraków, Poland
| | - Katarzyna Dudek
- Institute of Environmental Sciences, Jagiellonian University, Kraków, Poland
| | - Wieslaw Babik
- Institute of Environmental Sciences, Jagiellonian University, Kraków, Poland
| |
Collapse
|
23
|
Waks Z, Weissbrod O, Carmeli B, Norel R, Utro F, Goldschmidt Y. Driver gene classification reveals a substantial overrepresentation of tumor suppressors among very large chromatin-regulating proteins. Sci Rep 2016; 6:38988. [PMID: 28008934 PMCID: PMC5180091 DOI: 10.1038/srep38988] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 11/07/2016] [Indexed: 12/18/2022] Open
Abstract
Compiling a comprehensive list of cancer driver genes is imperative for oncology diagnostics and drug development. While driver genes are typically discovered by analysis of tumor genomes, infrequently mutated driver genes often evade detection due to limited sample sizes. Here, we address sample size limitations by integrating tumor genomics data with a wide spectrum of gene-specific properties to search for rare drivers, functionally classify them, and detect features characteristic of driver genes. We show that our approach, CAnceR geNe similarity-based Annotator and Finder (CARNAF), enables detection of potentially novel drivers that eluded over a dozen pan-cancer/multi-tumor type studies. In particular, feature analysis reveals a highly concentrated pool of known and putative tumor suppressors among the <1% of genes that encode very large, chromatin-regulating proteins. Thus, our study highlights the need for deeper characterization of very large, epigenetic regulators in the context of cancer causality.
Collapse
Affiliation(s)
- Zeev Waks
- Machine Learning for Healthcare and Life Sciences, IBM Research - Haifa, Mount Carmel Campus, Israel
| | - Omer Weissbrod
- Machine Learning for Healthcare and Life Sciences, IBM Research - Haifa, Mount Carmel Campus, Israel
| | - Boaz Carmeli
- Machine Learning for Healthcare and Life Sciences, IBM Research - Haifa, Mount Carmel Campus, Israel
| | - Raquel Norel
- Computational Biology Center, IBM T. J. Watson Research, Yorktown Heights, NY 10598, USA
| | - Filippo Utro
- Computational Biology Center, IBM T. J. Watson Research, Yorktown Heights, NY 10598, USA
| | - Yaara Goldschmidt
- Machine Learning for Healthcare and Life Sciences, IBM Research - Haifa, Mount Carmel Campus, Israel
| |
Collapse
|
24
|
Tian Z, Wang C, Guo M, Liu X, Teng Z. SGFSC: speeding the gene functional similarity calculation based on hash tables. BMC Bioinformatics 2016; 17:445. [PMID: 27814675 PMCID: PMC5096311 DOI: 10.1186/s12859-016-1294-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2016] [Accepted: 10/19/2016] [Indexed: 12/23/2022] Open
Abstract
Background In recent years, many measures of gene functional similarity have been proposed and widely used in all kinds of essential research. These methods are mainly divided into two categories: pairwise approaches and group-wise approaches. However, a common problem with these methods is their time consumption, especially when measuring the gene functional similarities of a large number of gene pairs. The problem of computational efficiency for pairwise approaches is even more prominent because they are dependent on the combination of semantic similarity. Therefore, the efficient measurement of gene functional similarity remains a challenging problem. Results To speed current gene functional similarity calculation methods, a novel two-step computing strategy is proposed: (1) establish a hash table for each method to store essential information obtained from the Gene Ontology (GO) graph and (2) measure gene functional similarity based on the corresponding hash table. There is no need to traverse the GO graph repeatedly for each method with the help of the hash table. The analysis of time complexity shows that the computational efficiency of these methods is significantly improved. We also implement a novel Speeding Gene Functional Similarity Calculation tool, namely SGFSC, which is bundled with seven typical measures using our proposed strategy. Further experiments show the great advantage of SGFSC in measuring gene functional similarity on the whole genomic scale. Conclusions The proposed strategy is successful in speeding current gene functional similarity calculation methods. SGFSC is an efficient tool that is freely available at http://nclab.hit.edu.cn/SGFSC. The source code of SGFSC can be downloaded from http://pan.baidu.com/s/1dFFmvpZ.
Collapse
Affiliation(s)
- Zhen Tian
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China
| | - Chunyu Wang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China
| | - Maozu Guo
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China.
| | - Xiaoyan Liu
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China
| | - Zhixia Teng
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China.,Department of Information Management and Information System, Northeast Forestry University, Harbin, 150001, People's Republic of China
| |
Collapse
|
25
|
Li P, Nie Y, Yu J. Fusing literature and full network data improves disease similarity computation. BMC Bioinformatics 2016; 17:326. [PMID: 27578323 PMCID: PMC5006367 DOI: 10.1186/s12859-016-1205-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2016] [Accepted: 08/24/2016] [Indexed: 01/01/2023] Open
Abstract
Background Identifying relatedness among diseases could help deepen understanding for the underlying pathogenic mechanisms of diseases, and facilitate drug repositioning projects. A number of methods for computing disease similarity had been developed; however, none of them were designed to utilize information of the entire protein interaction network, using instead only those interactions involving disease causing genes. Most of previously published methods required gene-disease association data, unfortunately, many diseases still have very few or no associated genes, which impeded broad adoption of those methods. In this study, we propose a new method (MedNetSim) for computing disease similarity by integrating medical literature and protein interaction network. MedNetSim consists of a network-based method (NetSim), which employs the entire protein interaction network, and a MEDLINE-based method (MedSim), which computes disease similarity by mining the biomedical literature. Results Among function-based methods, NetSim achieved the best performance. Its average AUC (area under the receiver operating characteristic curve) reached 95.2 %. MedSim, whose performance was even comparable to some function-based methods, acquired the highest average AUC in all semantic-based methods. Integration of MedSim and NetSim (MedNetSim) further improved the average AUC to 96.4 %. We further studied the effectiveness of different data sources. It was found that quality of protein interaction data was more important than its volume. On the contrary, higher volume of gene-disease association data was more beneficial, even with a lower reliability. Utilizing higher volume of disease-related gene data further improved the average AUC of MedNetSim and NetSim to 97.5 % and 96.7 %, respectively. Conclusions Integrating biomedical literature and protein interaction network can be an effective way to compute disease similarity. Lacking sufficient disease-related gene data, literature-based methods such as MedSim can be a great addition to function-based algorithms. It may be beneficial to steer more resources torward studying gene-disease associations and improving the quality of protein interaction data. Disease similarities can be computed using the proposed methods at http://www.digintelli.com:8000/. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1205-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ping Li
- State Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, 100190, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yaling Nie
- State Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, 100190, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jingkai Yu
- State Key Laboratory of Biochemical Engineering, Institute of Process Engineering, Chinese Academy of Sciences, Beijing, 100190, China.
| |
Collapse
|
26
|
UMADEVI S, PREMKUMAR K, VALARMATHI S, AYYASAMY PM, RAJAKUMAR S. IDENTIFICATION OF NOVEL GENES RELATED TO DIABETIC RETINOPATHY USING PROTEIN–PROTEIN INTERACTION NETWORK AND GENE ONTOLOGIES. J BIOL SYST 2016. [DOI: 10.1142/s0218339016500066] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Diabetic retinopathy is the most common cause of blindness, associated with many biochemical pathways mediated by several genes and proteins. Disease gene identification can be achieved through several approaches but still it is a challenging task. This study, aimed to find out the novel genes associated with diabetic retinopathy. In this study, all the well-known genes associated with diabetic retinopathy were collected from databases and the protein interaction partners were identified. The interacting candidate genes were chosen by chromosomal locations, sharing with disease genes. The protein–protein interaction network was constructed and the key nodes (genes) were identified by degree, betweenness centrality, closeness centrality and eccentricity centrality. Further, the ontological terms, molecular function, biological process and cellular components were related with that of the disease genes with p-value [Formula: see text]. The genes UBC, FOS, ITGB1, FOXA2, CCND1, FOSL1, RXRA and NCAM1 were identified as potential genes associated with diabetic retinopathy. The molecular functions of these genes include protein binding, receptor activity, receptor binding, oxidoreductase activity, protein kinase activity, serine-type peptidase activity and growth factor. Many of the identified genes were clinically related as evidence by the literature.
Collapse
Affiliation(s)
- S. UMADEVI
- Department of Biomedical Science, Bharathidasan University, Tiruchirappalli 620024, Tamil Nadu, India
| | - K. PREMKUMAR
- Department of Biomedical Science, Bharathidasan University, Tiruchirappalli 620024, Tamil Nadu, India
| | - S. VALARMATHI
- Department of Biochemistry, PSG College of Arts and Science, Coimbatore 641014, Tamil Nadu, India
| | - P. M. AYYASAMY
- Department of Microbiology, Periyar University, Salem 636011, Tamil Nadu, India
| | - S. RAJAKUMAR
- Department of Marine Biotechnology, Bharathidasan University, Tiruchirappalli 620024, Tamil Nadu, India
| |
Collapse
|
27
|
CancerNet: a database for decoding multilevel molecular interactions across diverse cancer types. Oncogenesis 2015; 4:e177. [PMID: 26690544 PMCID: PMC4688397 DOI: 10.1038/oncsis.2015.40] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Revised: 09/21/2015] [Accepted: 10/15/2015] [Indexed: 02/08/2023] Open
Abstract
Protein–protein interactions (PPIs) and microRNA (miRNA)–target interactions are important for deciphering the mechanisms of tumorigenesis. However, current PPI databases do not support cancer-specific analysis. Also, no available databases can be used to retrieve cancer-associated miRNA–target interactions. As the pathogenesis of human cancers is affected by several miRNAs rather than a single miRNA, it is needed to uncover miRNA synergism in a systems level. Here for each cancer type, we constructed a miRNA–miRNA functionally synergistic network based on the functions of miRNA targets and their topological features in that cancer PPI network. And for the first time, we report the cancer-specific database CancerNet (http://bis.zju.edu.cn/CancerNet), which contains information about PPIs, miRNA–target interactions and functionally synergistic miRNA–miRNA pairs across 33 human cancer types. In addition, PPI information across 33 main normal tissues and cell types are included. Flexible query methods are allowed to retrieve cancer molecular interactions. Network viewer can be used to visualize interactions that users are interested in. Enrichment analysis tool was designed to detect significantly overrepresented Gene Ontology categories of miRNA targets. Thus, CancerNet serves as a comprehensive platform for assessing the roles of proteins and miRNAs, as well as their interactions across human cancers.
Collapse
|
28
|
Predicting Abdominal Aortic Aneurysm Target Genes by Level-2 Protein-Protein Interaction. PLoS One 2015; 10:e0140888. [PMID: 26496478 PMCID: PMC4619739 DOI: 10.1371/journal.pone.0140888] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Accepted: 09/30/2015] [Indexed: 12/22/2022] Open
Abstract
Abdominal aortic aneurysm (AAA) is frequently lethal and has no effective pharmaceutical treatment, posing a great threat to human health. Previous bioinformatics studies of the mechanisms underlying AAA relied largely on the detection of direct protein-protein interactions (level-1 PPI) between the products of reported AAA-related genes. Thus, some proteins not suspected to be directly linked to previously reported genes of pivotal importance to AAA might have been missed. In this study, we constructed an indirect protein-protein interaction (level-2 PPI) network based on common interacting proteins encoded by known AAA-related genes and successfully predicted previously unreported AAA-related genes using this network. We used four methods to test and verify the performance of this level-2 PPI network: cross validation, human AAA mRNA chip array comparison, literature mining, and verification in a mouse CaPO4 AAA model. We confirmed that the new level-2 PPI network is superior to the original level-1 PPI network and proved that the top 100 candidate genes predicted by the level-2 PPI network shared similar GO functions and KEGG pathways compared with positive genes.
Collapse
|
29
|
Liu B, Jin M, Zeng P. Prioritization of candidate disease genes by combining topological similarity and semantic similarity. J Biomed Inform 2015; 57:1-5. [DOI: 10.1016/j.jbi.2015.07.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2014] [Revised: 07/01/2015] [Accepted: 07/06/2015] [Indexed: 10/23/2022]
|
30
|
Sun Y, Sheng Z, Ma C, Tang K, Zhu R, Wu Z, Shen R, Feng J, Wu D, Huang D, Huang D, Fei J, Liu Q, Cao Z. Combining genomic and network characteristics for extended capability in predicting synergistic drugs for cancer. Nat Commun 2015; 6:8481. [PMID: 26412466 PMCID: PMC4598846 DOI: 10.1038/ncomms9481] [Citation(s) in RCA: 87] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 08/27/2015] [Indexed: 12/12/2022] Open
Abstract
The identification of synergistic chemotherapeutic agents from a large pool of candidates is highly challenging. Here, we present a Ranking-system of Anti-Cancer Synergy (RACS) that combines features of targeting networks and transcriptomic profiles, and validate it on three types of cancer. Using data on human β-cell lymphoma from the Dialogue for Reverse Engineering Assessments and Methods consortium we show a probability concordance of 0.78 compared with 0.61 obtained with the previous best algorithm. We confirm 63.6% of our breast cancer predictions through experiment and literature, including four strong synergistic pairs. Further in vivo screening in a zebrafish MCF7 xenograft model confirms one prediction with strong synergy and low toxicity. Validation using A549 lung cancer cells shows similar results. Thus, RACS can significantly improve drug synergy prediction and markedly reduce the experimental prescreening of existing drugs for repurposing to cancer treatment, although the molecular mechanism underlying particular interactions remains unknown. Predicting combinations of chemotherapeutic drugs that act synergistically is challenging. Here the authors take a computational approach to predict synergistic pairs, validate novel pairs using several cancer cell lines, and assess toxicity in a zebrafish xenograft model.
Collapse
Affiliation(s)
- Yi Sun
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Zhen Sheng
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Chao Ma
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Kailin Tang
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Ruixin Zhu
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Zhuanbin Wu
- Shanghai Research Center for Model Organisms, Shanghai 200092, China
| | - Ruling Shen
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China.,Shanghai Research Center for Model Organisms, Shanghai 200092, China
| | - Jun Feng
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Dingfeng Wu
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Danyi Huang
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Dandan Huang
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Jian Fei
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Qi Liu
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Zhiwei Cao
- School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| |
Collapse
|
31
|
Ambedkar C, Reddi KK, Muppalaneni NB, Kalyani D. Application of centrality measures in the identification of critical genes in diabetes mellitus. Bioinformation 2015; 11:90-5. [PMID: 25848169 PMCID: PMC4369684 DOI: 10.6026/97320630011090] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2015] [Accepted: 01/31/2015] [Indexed: 01/05/2023] Open
Abstract
The connectivity of a protein and its structure is related to its functional properties. Many experimental approaches have been employed for the identification of Diabetes Mellitus (DM) associated candidate genes. Therefore, it is of interest to use var ious graph centrality measures integrated with the genes associated with the human Diabetes Mellitus network for the identification of potential targets. We used 2728 genes known to cause Diabetes Mellitus from Jensenlab (Novo Nordisk Foundation Center for Protein Research, Denmark) for this analysis. A protein-protein interaction network was further constructed using a tool Centralities in Biological Networks (CentiBiN) with 1020 nodes after eliminating the duplicates, parallel edges, self -loop edges and unknown Human Protein Reference Database (HPRD) IDS. We used fourteen centralities measures which are useful in identifying the structural characteristic of individuals in the network. The results of the centrality measures are highly correlated. Thus, we identified genes that are critically associated with DM. We further report the top ten genes of all fourteen centrality measures for further consideration as targets for DM.
Collapse
|
32
|
Chen H, Zhu Z, Zhu Y, Wang J, Mei Y, Cheng Y. Pathway mapping and development of disease-specific biomarkers: protein-based network biomarkers. J Cell Mol Med 2015; 19:297-314. [PMID: 25560835 PMCID: PMC4407592 DOI: 10.1111/jcmm.12447] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Accepted: 08/22/2014] [Indexed: 01/06/2023] Open
Abstract
It is known that a disease is rarely a consequence of an abnormality of a single gene, but reflects the interactions of various processes in a complex network. Annotated molecular networks offer new opportunities to understand diseases within a systems biology framework and provide an excellent substrate for network-based identification of biomarkers. The network biomarkers and dynamic network biomarkers (DNBs) represent new types of biomarkers with protein-protein or gene-gene interactions that can be monitored and evaluated at different stages and time-points during development of disease. Clinical bioinformatics as a new way to combine clinical measurements and signs with human tissue-generated bioinformatics is crucial to translate biomarkers into clinical application, validate the disease specificity, and understand the role of biomarkers in clinical settings. In this article, the recent advances and developments on network biomarkers and DNBs are comprehensively reviewed. How network biomarkers help a better understanding of molecular mechanism of diseases, the advantages and constraints of network biomarkers for clinical application, clinical bioinformatics as a bridge to the development of diseases-specific, stage-specific, severity-specific and therapy predictive biomarkers, and the potentials of network biomarkers are also discussed.
Collapse
Affiliation(s)
- Hao Chen
- Department of Cardiothoracic Surgery, Tongji Hospital, Tongji University, Shanghai, China
| | | | | | | | | | | |
Collapse
|
33
|
Thomas P, Durek P, Solt I, Klinger B, Witzel F, Schulthess P, Mayer Y, Tikk D, Blüthgen N, Leser U. Computer-assisted curation of a human regulatory core network from the biological literature. Bioinformatics 2014; 31:1258-66. [DOI: 10.1093/bioinformatics/btu795] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 11/26/2014] [Indexed: 12/20/2022] Open
|
34
|
Guo NL, Wan YW. Network-based identification of biomarkers coexpressed with multiple pathways. Cancer Inform 2014; 13:37-47. [PMID: 25392692 PMCID: PMC4218687 DOI: 10.4137/cin.s14054] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2014] [Revised: 06/25/2014] [Accepted: 06/29/2014] [Indexed: 02/07/2023] Open
Abstract
Unraveling complex molecular interactions and networks and incorporating clinical information in modeling will present a paradigm shift in molecular medicine. Embedding biological relevance via modeling molecular networks and pathways has become increasingly important for biomarker identification in cancer susceptibility and metastasis studies. Here, we give a comprehensive overview of computational methods used for biomarker identification, and provide a performance comparison of several network models used in studies of cancer susceptibility, disease progression, and prognostication. Specifically, we evaluated implication networks, Boolean networks, Bayesian networks, and Pearson’s correlation networks in constructing gene coexpression networks for identifying lung cancer diagnostic and prognostic biomarkers. The results show that implication networks, implemented in Genet package, identified sets of biomarkers that generated an accurate prediction of lung cancer risk and metastases; meanwhile, implication networks revealed more biologically relevant molecular interactions than Boolean networks, Bayesian networks, and Pearson’s correlation networks when evaluated with MSigDB database.
Collapse
Affiliation(s)
- Nancy Lan Guo
- Mary Babb Randolph Cancer Center/School of Public Health, West Virginia University, Morgantown, WV, USA
| | - Ying-Wooi Wan
- Mary Babb Randolph Cancer Center/School of Public Health, West Virginia University, Morgantown, WV, USA
| |
Collapse
|
35
|
Cheng L, Li J, Ju P, Peng J, Wang Y. SemFunSim: a new method for measuring disease similarity by integrating semantic and gene functional association. PLoS One 2014; 9:e99415. [PMID: 24932637 PMCID: PMC4059643 DOI: 10.1371/journal.pone.0099415] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2013] [Accepted: 05/14/2014] [Indexed: 01/20/2023] Open
Abstract
Background Measuring similarity between diseases plays an important role in disease-related molecular function research. Functional associations between disease-related genes and semantic associations between diseases are often used to identify pairs of similar diseases from different perspectives. Currently, it is still a challenge to exploit both of them to calculate disease similarity. Therefore, a new method (SemFunSim) that integrates semantic and functional association is proposed to address the issue. Methods SemFunSim is designed as follows. First of all, FunSim (Functional similarity) is proposed to calculate disease similarity using disease-related gene sets in a weighted network of human gene function. Next, SemSim (Semantic Similarity) is devised to calculate disease similarity using the relationship between two diseases from Disease Ontology. Finally, FunSim and SemSim are integrated to measure disease similarity. Results The high average AUC (area under the receiver operating characteristic curve) (96.37%) shows that SemFunSim achieves a high true positive rate and a low false positive rate. 79 of the top 100 pairs of similar diseases identified by SemFunSim are annotated in the Comparative Toxicogenomics Database (CTD) as being targeted by the same therapeutic compounds, while other methods we compared could identify 35 or less such pairs among the top 100. Moreover, when using our method on diseases without annotated compounds in CTD, we could confirm many of our predicted candidate compounds from literature. This indicates that SemFunSim is an effective method for drug repositioning.
Collapse
Affiliation(s)
- Liang Cheng
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Jie Li
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Peng Ju
- School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore
| | - Jiajie Peng
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| | - Yadong Wang
- Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China
| |
Collapse
|
36
|
Zhu C, Wu C, Aronow BJ, Jegga AG. Computational approaches for human disease gene prediction and ranking. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 799:69-84. [PMID: 24292962 DOI: 10.1007/978-1-4614-8778-4_4] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
While candidate gene association studies continue to be the most practical and frequently employed approach in disease gene investigation for complex disorders, selecting suitable genes to test is a challenge. There are several computational approaches available for selecting and prioritizing disease candidate genes. A majority of these tools are based on guilt-by-association principle where novel disease candidate genes are identified and prioritized based on either functional or topological similarity to known disease genes. In this chapter we review the prioritization criteria and the algorithms along with some use cases that demonstrate how these tools can be used for identifying and ranking human disease candidate genes.
Collapse
Affiliation(s)
- Cheng Zhu
- Department of Computer Science, College of Engineering and Applied Science, University of Cincinnati, Cincinnati, OH, USA
| | | | | | | |
Collapse
|
37
|
Abstract
Bioinformatics aids in the understanding of the biological processes of living beings and the genetic architecture of human diseases. The discovery of disease-related genes improves the diagnosis and therapy design for the disease. To save the cost and time involved in the experimental verification of the candidate genes, computational methods are employed for ranking the genes according to their likelihood of being associated with the disease. Only top-ranked genes are then verified experimentally. A variety of methods have been conceived by the researchers for the prioritization of the disease candidate genes, which differ in the data source being used or the scoring function used for ranking the genes. A review of various aspects of computational disease gene prioritization and its research issues is presented in this article. The aspects covered are gene prioritization process, data sources used, types of prioritization methods, and performance assessment methods. This article provides a brief overview and acts as a quick guide for disease gene prioritization.
Collapse
Affiliation(s)
- Nivit Gill
- 1 Punjabi University Regional Centre For IT and Management , Mohali, Punjab, India
| | | | | |
Collapse
|
38
|
Hindumathi V, Kranthi T, Rao SB, Manimaran P. The prediction of candidate genes for cervix related cancer through gene ontology and graph theoretical approach. MOLECULAR BIOSYSTEMS 2014; 10:1450-60. [PMID: 24647578 DOI: 10.1039/c4mb00004h] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
With rapidly changing technology, prediction of candidate genes has become an indispensable task in recent years mainly in the field of biological research. The empirical methods for candidate gene prioritization that succors to explore the potential pathway between genetic determinants and complex diseases are highly cumbersome and labor intensive. In such a scenario predicting potential targets for a disease state through in silico approaches are of researcher's interest. The prodigious availability of protein interaction data coupled with gene annotation renders an ease in the accurate determination of disease specific candidate genes. In our work we have prioritized the cervix related cancer candidate genes by employing Csaba Ortutay and his co-workers approach of identifying the candidate genes through graph theoretical centrality measures and gene ontology. With the advantage of the human protein interaction data, cervical cancer gene sets and the ontological terms, we were able to predict 15 novel candidates for cervical carcinogenesis. The disease relevance of the anticipated candidate genes was corroborated through a literature survey. Also the presence of the drugs for these candidates was detected through Therapeutic Target Database (TTD) and DrugMap Central (DMC) which affirms that they may be endowed as potential drug targets for cervical cancer.
Collapse
Affiliation(s)
- V Hindumathi
- C R Rao Advanced Institute of Mathematics, Statistics and Computer Science, University of Hyderabad Campus, Prof. C R Rao Road, Gachibowli, Hyderabad - 500046, India.
| | | | | | | |
Collapse
|
39
|
Abstract
A rare or orphan disorder is any disease that affects a small percentage of the population. Most genes and pathways underlying these disorders remain unknown. High-throughput techniques are frequently applied to detect disease candidate genes. The speed and affordability of sequencing following recent technological advances while advantageous are accompanied by the problem of data deluge. Furthermore, experimental validation of disease candidate genes is both time-consuming and expensive. Therefore, several computational approaches have been developed to identify the most promising candidates for follow-up studies. Based on the guilt by association principle, most of these approaches use prior knowledge about a disease of interest to discover and rank novel candidate genes. In this chapter, a brief overview of some of the in silico strategies for candidate gene prioritization is provided. To demonstrate their utility in rare disease research, a Web-based computational suite of tools that use integrated heterogeneous data sources for ranking disease candidate genes is used to demonstrate how to run typical queries using this system.
Collapse
Affiliation(s)
- Anil G Jegga
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, 3333 Burnet Avenue, ML 7024, Cincinnati, OH, 45229, USA,
| |
Collapse
|
40
|
Siddani BR, Pochineni LP, Palanisamy M. Candidate gene identification for systemic lupus erythematosus using network centrality measures and gene ontology. PLoS One 2013; 8:e81766. [PMID: 24312583 PMCID: PMC3847089 DOI: 10.1371/journal.pone.0081766] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 10/16/2013] [Indexed: 01/12/2023] Open
Abstract
Systemic lupus erythematosus (SLE) commonly accredited as “the great imitator” is a highly complex disease involving multiple gene susceptibility with non-specific symptoms. Many experimental and computational approaches have been used to investigate the disease related candidate genes. But the limited knowledge of gene function and disease correlation and also lack of complete functional details about the majority of genes in susceptible locus, encumbrances the identification of SLE related candidate genes. In this paper, we have studied the human immunome network (undirected) using various graph theoretical centrality measures integrated with the gene ontology terms to predict the new candidate genes. As a result, we have identified 8 candidate genes, which may act as potential targets for SLE disease. We have also carried out the same analysis by replacing the human immunome network with human immunome signaling network (directed) and as an outcome we have obtained 5 candidate genes as potential targets for SLE disease. From the comparison study, we have found these two approaches are complementary in nature.
Collapse
Affiliation(s)
- Bhaskara Rao Siddani
- C R Rao Advanced Institute of Mathematics, Statistics and Computer Science, Hyderabad, India
| | | | | |
Collapse
|
41
|
Prediction of novel genes associated with negative regulators of toll-like receptors-induced inflammation based on endotoxin tolerance. Inflammation 2013; 35:1889-99. [PMID: 22843012 DOI: 10.1007/s10753-012-9511-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Prior exposure of innate immune cells to lipopolysaccharide (LPS) has caused them to be refractory to further endotoxin stimulation, also termed endotoxin tolerance (ET). Bacterial LPS signals through Toll-like receptor (TLR) 4, which was thought to enable the innate immune system to deal with invasive pathogens and to restrain systemic inflammation efficiently. We established a robust model of ET and determined the level of TNF-α and IL-6 in cultured human monocytes. Then, microarray assay was applied to assess gene expression in this model. The results showed that 356 non-tolerizable genes were differentially expressed at a high level in tolerant monocytes. The genes selected were classified into several categories based on gene ontology (GO) and KEGG pathway database. And then literature annotations, protein-protein interaction (PPI) network, and functional consistency were applied to analyze the non-tolerizable genes. Finally, the microarray results were verified by quantitative real-time PCR of seven representative genes, including the two candidate genes, Spry2 and Smurf2, which were supposed to play a critical role in TLRs-induced inflammation based on literature retrieval. Our results would provide useful information for further analysis of regulating TLRs-induced inflammation, and would facilitate the study of associated mechanisms.
Collapse
|
42
|
Teng Z, Guo M, Liu X, Dai Q, Wang C, Xuan P. Measuring gene functional similarity based on group-wise comparison of GO terms. Bioinformatics 2013; 29:1424-32. [PMID: 23572412 DOI: 10.1093/bioinformatics/btt160] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
43
|
Hulsegge I, Woelders H, Smits M, Schokker D, Jiang L, Sørensen P. Prioritization of candidate genes for cattle reproductive traits, based on protein-protein interactions, gene expression, and text-mining. Physiol Genomics 2013; 45:400-6. [PMID: 23572538 DOI: 10.1152/physiolgenomics.00172.2012] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Reproduction is of significant economic importance in dairy cattle. Improved understanding of mechanisms that control estrous behavior and other reproduction traits could help in developing strategies to improve and/or monitor these traits. The objective of this study was to predict and rank genes and processes in brain areas and pituitary involved in reproductive traits in cattle using information derived from three different data sources: gene expression, protein-protein interactions, and literature. We identified 59, 89, 53, 23, and 71 genes in bovine amygdala, dorsal hypothalamus, hippocampus, pituitary, and ventral hypothalamus, respectively, potentially involved in processes underlying estrus and estrous behavior. Functional annotation of the candidate genes points to a number of tissue-specific processes of which the "neurotransmitter/ion channel/synapse" process in the amygdala, "steroid hormone receptor activity/ion binding" in the pituitary, "extracellular region" in the ventral hypothalamus, and "positive regulation of transcription/metabolic process" in the dorsal hypothalamus are most prominent. The regulation of the functional processes in the various tissues operate at different biological levels, including transcriptional, posttranscriptional, extracellular, and intercellular signaling levels.
Collapse
Affiliation(s)
- Ina Hulsegge
- Animal Breeding and Genomics Centre, Wageningen UR Livestock Research, Lelystad, The Netherlands
| | | | | | | | | | | |
Collapse
|
44
|
Vashisht S, Bagler G. An approach for the identification of targets specific to bone metastasis using cancer genes interactome and gene ontology analysis. PLoS One 2012; 7:e49401. [PMID: 23166660 PMCID: PMC3498148 DOI: 10.1371/journal.pone.0049401] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2012] [Accepted: 10/11/2012] [Indexed: 12/20/2022] Open
Abstract
Metastasis is one of the most enigmatic aspects of cancer pathogenesis and is a major cause of cancer-associated mortality. Secondary bone cancer (SBC) is a complex disease caused by metastasis of tumor cells from their primary site and is characterized by intricate interplay of molecular interactions. Identification of targets for multifactorial diseases such as SBC, the most frequent complication of breast and prostate cancers, is a challenge. Towards achieving our aim of identification of targets specific to SBC, we constructed a 'Cancer Genes Network', a representative protein interactome of cancer genes. Using graph theoretical methods, we obtained a set of key genes that are relevant for generic mechanisms of cancers and have a role in biological essentiality. We also compiled a curated dataset of 391 SBC genes from published literature which serves as a basis of ontological correlates of secondary bone cancer. Building on these results, we implement a strategy based on generic cancer genes, SBC genes and gene ontology enrichment method, to obtain a set of targets that are specific to bone metastasis. Through this study, we present an approach for probing one of the major complications in cancers, namely, metastasis. The results on genes that play generic roles in cancer phenotype, obtained by network analysis of 'Cancer Genes Network', have broader implications in understanding the role of molecular regulators in mechanisms of cancers. Specifically, our study provides a set of potential targets that are of ontological and regulatory relevance to secondary bone cancer.
Collapse
Affiliation(s)
- Shikha Vashisht
- Biotechnology Division, Institute of Himalayan Bioresource Technology, Council of Scientific and Industrial Research, Palampur, India
| | - Ganesh Bagler
- Biotechnology Division, Institute of Himalayan Bioresource Technology, Council of Scientific and Industrial Research, Palampur, India
| |
Collapse
|
45
|
Systems genetics in "-omics" era: current and future development. Theory Biosci 2012; 132:1-16. [PMID: 23138757 DOI: 10.1007/s12064-012-0168-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2012] [Accepted: 10/25/2012] [Indexed: 02/06/2023]
Abstract
The systems genetics is an emerging discipline that integrates high-throughput expression profiling technology and systems biology approaches for revealing the molecular mechanism of complex traits, and will improve our understanding of gene functions in the biochemical pathway and genetic interactions between biological molecules. With the rapid advances of microarray analysis technologies, bioinformatics is extensively used in the studies of gene functions, SNP-SNP genetic interactions, LD block-block interactions, miRNA-mRNA interactions, DNA-protein interactions, protein-protein interactions, and functional mapping for LD blocks. Based on bioinformatics panel, which can integrate "-omics" datasets to extract systems knowledge and useful information for explaining the molecular mechanism of complex traits, systems genetics is all about to enhance our understanding of biological processes. Systems biology has provided systems level recognition of various biological phenomena, and constructed the scientific background for the development of systems genetics. In addition, the next-generation sequencing technology and post-genome wide association studies empower the discovery of new gene and rare variants. The integration of different strategies will help to propose novel hypothesis and perfect the theoretical framework of systems genetics, which will make contribution to the future development of systems genetics, and open up a whole new area of genetics.
Collapse
|
46
|
Chen X, Jiang W, Wang Q, Huang T, Wang P, Li Y, Chen X, Lv Y, Li X. Systematically characterizing and prioritizing chemosensitivity related gene based on Gene Ontology and protein interaction network. BMC Med Genomics 2012; 5:43. [PMID: 23031817 PMCID: PMC3532125 DOI: 10.1186/1755-8794-5-43] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2011] [Accepted: 08/27/2012] [Indexed: 11/30/2022] Open
Abstract
Background The identification of genes that predict in vitro cellular chemosensitivity of cancer cells is of great importance. Chemosensitivity related genes (CRGs) have been widely utilized to guide clinical and cancer chemotherapy decisions. In addition, CRGs potentially share functional characteristics and network features in protein interaction networks (PPIN). Methods In this study, we proposed a method to identify CRGs based on Gene Ontology (GO) and PPIN. Firstly, we documented 150 pairs of drug-CCRG (curated chemosensitivity related gene) from 492 published papers. Secondly, we characterized CCRGs from the perspective of GO and PPIN. Thirdly, we prioritized CRGs based on CCRGs’ GO and network characteristics. Lastly, we evaluated the performance of the proposed method. Results We found that CCRG enriched GO terms were most often related to chemosensitivity and exhibited higher similarity scores compared to randomly selected genes. Moreover, CCRGs played key roles in maintaining the connectivity and controlling the information flow of PPINs. We then prioritized CRGs using CCRG enriched GO terms and CCRG network characteristics in order to obtain a database of predicted drug-CRGs that included 53 CRGs, 32 of which have been reported to affect susceptibility to drugs. Our proposed method identifies a greater number of drug-CCRGs, and drug-CCRGs are much more significantly enriched in predicted drug-CRGs, compared to a method based on the correlation of gene expression and drug activity. The mean area under ROC curve (AUC) for our method is 65.2%, whereas that for the traditional method is 55.2%. Conclusions Our method not only identifies CRGs with expression patterns strongly correlated with drug activity, but also identifies CRGs in which expression is weakly correlated with drug activity. This study provides the framework for the identification of signatures that predict in vitro cellular chemosensitivity and offers a valuable database for pharmacogenomics research.
Collapse
Affiliation(s)
- Xin Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Tang H, Zhong F, Xie H. A quick guide to biomolecular network studies: construction, analysis, applications, and resources. Biochem Biophys Res Commun 2012; 424:7-11. [PMID: 22732414 DOI: 10.1016/j.bbrc.2012.06.085] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2012] [Accepted: 06/18/2012] [Indexed: 10/28/2022]
Abstract
Over the past decade, a rapid increase in network data including signaling, transcription regulation, metabolic reaction, protein-protein interaction and genetic interaction has been observed. Many biology issues have been investigated by analyzing these diverse networks, providing new insights into biology. Networks also play an important role in disease studies including disease gene screening and clinical diagnosis. Large amounts of databases and software have been developed to facilitate the storage, exchange, integration, and analysis of network data and network analysis is becoming a routine procedure for biologists to infer biological information. In this review, several main aspects of network studies are discussed, including network construction, analysis, application, and resources.
Collapse
Affiliation(s)
- Hailin Tang
- College of Mechanical & Electronic Engineering and Automatization, National University of Defense Technology, Changsha 410073, China
| | | | | |
Collapse
|
48
|
Doncheva NT, Kacprowski T, Albrecht M. Recent approaches to the prioritization of candidate disease genes. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2012; 4:429-42. [PMID: 22689539 DOI: 10.1002/wsbm.1177] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Many efforts are still devoted to the discovery of genes involved with specific phenotypes, in particular, diseases. High-throughput techniques are thus applied frequently to detect dozens or even hundreds of candidate genes. However, the experimental validation of many candidates is often an expensive and time-consuming task. Therefore, a great variety of computational approaches has been developed to support the identification of the most promising candidates for follow-up studies. The biomedical knowledge already available about the disease of interest and related genes is commonly exploited to find new gene-disease associations and to prioritize candidates. In this review, we highlight recent methodological advances in this research field of candidate gene prioritization. We focus on approaches that use network information and integrate heterogeneous data sources. Furthermore, we discuss current benchmarking procedures for evaluating and comparing different prioritization methods.
Collapse
|
49
|
Chase NM, Verbsky JW, Routes JM. Newborn screening for SCID: three years of experience. Ann N Y Acad Sci 2012; 1238:99-105. [PMID: 22129057 DOI: 10.1111/j.1749-6632.2011.06241.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Over the past three years, newborn blood screening (NBS) for severe T cell lymphopenia/severe combined immunodeficiency (sTCL/SCID) using the T cell receptor excision circle (TREC) assay has revolutionized the early detection of infants with primary immunodeficiencies (PIDs) associated with T cell lymphopenia. Nonetheless, despite the comprehensive NBS protocols developed by each state, additional issues unique to screening for sTCL/SCID have surfaced, including variability in the performance of the TREC assay, diagnostic and treatment algorithms, definition of sTCL/SCID, and approach to the discovery of new genetic variants. Although NBS using the TREC assay has been highly successful, new and difficult challenges have emerged that need to be addressed to enhance our knowledge of the causes of sTCL/SCID and to optimize the detection and outcomes of affected infants.
Collapse
Affiliation(s)
- Nicole M Chase
- Division of Allergy/Clinical Immunology, Department of Pediatrics, Medical College of Wisconsin, Milwaukee, 53226, USA
| | | | | |
Collapse
|
50
|
Wang IM, Stone DJ, Nickle D, Loboda A, Puig O, Roberts C. Systems biology approach for new target and biomarker identification. Curr Top Microbiol Immunol 2012; 363:169-99. [PMID: 22903568 DOI: 10.1007/82_2012_252] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The pharmaceutical industry is spending increasingly large amounts of money on the discovery and development of novel medicines, but this investment is not adequately paying off in an increased rate of newly approved drugs by the FDA. The post-genomic era has provided a wealth of novel approaches for generating large, high-dimensional genetic and transcriptomic data sets from large cohorts of preclinical species as well as normal and diseased individuals. This systems biology approach to understanding disease-related biology is revolutionizing our understanding of the cellular pathways and gene networks underlying the onset of disease, and the mechanisms of pharmacological treatments that ameliorate disease phenotypes. In this article, we review a number of approaches being used by pharmaceutical and biotechnology companies, e.g., high-throughput DNA genotyping, sequencing, and genome-wide gene expression profiling, to enable drug discovery and development through the identification of new drug targets and biomarkers of disease progression, drug pharmacodynamics, and predictive markers for selecting the patients most likely to respond to therapy.
Collapse
Affiliation(s)
- I-Ming Wang
- Informatics and Analysis, Merck Research Laboratory, West Point, PA 19486, USA.
| | | | | | | | | | | |
Collapse
|