1
|
Zhapa-Camacho F, Tang Z, Kulmanov M, Hoehndorf R. Predicting protein functions using positive-unlabeled ranking with ontology-based priors. Bioinformatics 2024; 40:i401-i409. [PMID: 38940168 PMCID: PMC11211813 DOI: 10.1093/bioinformatics/btae237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
Automated protein function prediction is a crucial and widely studied problem in bioinformatics. Computationally, protein function is a multilabel classification problem where only positive samples are defined and there is a large number of unlabeled annotations. Most existing methods rely on the assumption that the unlabeled set of protein function annotations are negatives, inducing the false negative issue, where potential positive samples are trained as negatives. We introduce a novel approach named PU-GO, wherein we address function prediction as a positive-unlabeled ranking problem. We apply empirical risk minimization, i.e. we minimize the classification risk of a classifier where class priors are obtained from the Gene Ontology hierarchical structure. We show that our approach is more robust than other state-of-the-art methods on similarity-based and time-based benchmark datasets. AVAILABILITY AND IMPLEMENTATION Data and code are available at https://github.com/bio-ontology-research-group/PU-GO.
Collapse
Affiliation(s)
- Fernando Zhapa-Camacho
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
- Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
| | - Zhenwei Tang
- Department of Computer Science, University of Toronto, Toronto, ON M5S 1A1, Canada
| | - Maxat Kulmanov
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
- Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
- SDAIA-KAUST Center of Excellence in Data Science and Artificial Intelligence, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
| | - Robert Hoehndorf
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
- Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
- SDAIA-KAUST Center of Excellence in Data Science and Artificial Intelligence, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Saudi Arabia
| |
Collapse
|
2
|
Wimalagunasekara SS, Weeraman JWJK, Tirimanne S, Fernando PC. Protein-protein interaction (PPI) network analysis reveals important hub proteins and sub-network modules for root development in rice (Oryza sativa). J Genet Eng Biotechnol 2023; 21:69. [PMID: 37246172 DOI: 10.1186/s43141-023-00515-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 05/06/2023] [Indexed: 05/30/2023]
Abstract
BACKGROUND The root system is vital to plant growth and survival. Therefore, genetic improvement of the root system is beneficial for developing stress-tolerant and improved plant varieties. This requires the identification of proteins that significantly contribute to root development. Analyzing protein-protein interaction (PPI) networks is vastly beneficial in studying developmental phenotypes, such as root development, because a phenotype is an outcome of several interacting proteins. PPI networks can be analyzed to identify modules and get a global understanding of important proteins governing the phenotypes. PPI network analysis for root development in rice has not been performed before and has the potential to yield new findings to improve stress tolerance. RESULTS Here, the network module for root development was extracted from the global Oryza sativa PPI network retrieved from the STRING database. Novel protein candidates were predicted, and hub proteins and sub-modules were identified from the extracted module. The validation of the predictions yielded 75 novel candidate proteins, 6 sub-modules, 20 intramodular hubs, and 2 intermodular hubs. CONCLUSIONS These results show how the PPI network module is organized for root development and can be used for future wet-lab studies for producing improved rice varieties.
Collapse
Affiliation(s)
| | - Janith W J K Weeraman
- Department of Plant Sciences, Faculty of Science, University of Colombo, Colombo, Sri Lanka.
| | - Shamala Tirimanne
- Department of Plant Sciences, Faculty of Science, University of Colombo, Colombo, Sri Lanka
| | - Pasan C Fernando
- Department of Plant Sciences, Faculty of Science, University of Colombo, Colombo, Sri Lanka
| |
Collapse
|
3
|
Zhang X, Hong S, Yu C, Shen X, Sun F, Yang J. Comparative analysis between high -grade serous ovarian cancer and healthy ovarian tissues using single-cell RNA sequencing. Front Oncol 2023; 13:1148628. [PMID: 37124501 PMCID: PMC10140397 DOI: 10.3389/fonc.2023.1148628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 03/31/2023] [Indexed: 05/02/2023] Open
Abstract
Introduction High-grade serous ovarian cancer (HGSOC) is the most common histological subtype of ovarian cancer, and is associated with high mortality rates. Methods In this study, we analyzed specific cell subpopulations and compared different gene functions between healthy ovarian and ovarian cancer cells using single-cell RNA sequencing (ScRNA-seq). We delved deeper into the differences between healthy ovarian and ovarian cancer cells at different levels, and performed specific analysis on endothelial cells. Results We obtained scRNA-seq data of 6867 and 17056 cells from healthy ovarian samples and ovarian cancer samples, respectively. The transcriptional profiles of the groups differed at various stages of ovarian cell development. A detailed comparison of the cell cycle, and cell communication of different groups, revealed significant differences between healthy ovarian and ovarian cancer cells. We also found that apoptosis-related genes, URI1, PAK2, PARP1, CLU and TIMP3, were highly expressed, while immune-related genes, UBB, RPL11, CAV1, NUPR1 and Hsp90ab1, were lowly expressed in ovarian cancer cells. The results of the ScRNA-seq were verified using qPCR. Discussion Our findings revealed differences in function, gene expression and cell interaction patterns between ovarian cancer and healthy ovarian cell populations. These findings provide key insights on further research into the treatment of ovarian cancer.
Collapse
Affiliation(s)
- Xiao Zhang
- Department of Obstetrics and Gynecology, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Key Laboratory of Reproductive Dysfunction Management of Zhejiang Province, Hangzhou, China
| | - Shihao Hong
- Department of Obstetrics and Gynecology, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Key Laboratory of Reproductive Dysfunction Management of Zhejiang Province, Hangzhou, China
| | - Chengying Yu
- Department of Obstetrics and Gynecology, Longyou People’s Hospital, Quzhou, China
| | | | - Fangying Sun
- Department of Obstetrics and Gynecology, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Key Laboratory of Reproductive Dysfunction Management of Zhejiang Province, Hangzhou, China
| | - Jianhua Yang
- Department of Obstetrics and Gynecology, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Key Laboratory of Reproductive Dysfunction Management of Zhejiang Province, Hangzhou, China
- *Correspondence: Jianhua Yang,
| |
Collapse
|
4
|
Cavalcante JDS, de Almeida CAS, Clasen MA, da Silva EL, de Barros LC, Marinho AD, Rossini BC, Marino CL, Carvalho PC, Jorge RJB, Dos Santos LD. A fingerprint of plasma proteome alteration after local tissue damage induced by Bothrops leucurus snake venom in mice. J Proteomics 2022; 253:104464. [PMID: 34954398 DOI: 10.1016/j.jprot.2021.104464] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 11/30/2021] [Accepted: 12/19/2021] [Indexed: 12/21/2022]
Abstract
Bothrops spp. is responsible for about 70% of snakebites in Brazil, causing a diverse and complex pathophysiological condition. Bothrops leucurus is the main species of medical relevance found in the Atlantic coast in the Brazilian Northeast region. The pathophysiological effects involved B. leucurus snakebite as well as the organism's reaction in response to this envenoming, it has not been explored yet. Thus, edema was induced in mice paw using 1.2, 2.5, and 5.0 μg of B. leucurus venom, the percentage of edema was measured 30 min after injection and the blood plasma was collected and analyzed by shotgun proteomic strategy. We identified 80 common plasma proteins with differential abundance among the experimental groups and we can understand the early aspects of this snake envenomation, regardless of the suggestive severity of an ophidian accident. The results showed B. leucurus venom triggers a thromboinflammation scenario where family's proteins of the Serpins, Apolipoproteins, Complement factors and Component subunits, Cathepsins, Kinases, Oxidoreductases, Proteases inhibitors, Proteases, Collagens, Growth factors are related to inflammation, complement and coagulation systems, modulators platelets and neutrophils, lipid and retinoid metabolism, oxidative stress and tissue repair. Our findings set precedents for future studies in the area of early diagnosis and/or treatment of snakebites. SIGNIFICANCE: The physiopathological effects that the snake venoms can cause have been investigated through classical and reductionist tools, which allowed, so far, the identification of action mechanisms of individual components associated with specific tissue damage. The currently incomplete limitations of this knowledge must be expanded through new approaches, such as proteomics, which may represent a big leap in understanding the venom-modulated pathological process. The exploration of the complete protein set that suffer modifications by the simultaneous action of multiple toxins, provides a map of the establishment of physiopathological phenotypes, which favors the identification of multiple toxin targets, that may or may not act in synergy, as well as favoring the discovery of biomarkers and therapeutic targets for manifestations that are not neutralized by the antivenom.
Collapse
Affiliation(s)
- Joeliton Dos Santos Cavalcante
- Graduate Program in Tropical Diseases, Botucatu Medical School (FMB), São Paulo State University (UNESP), Botucatu, SP, Brazil
| | | | - Milan Avila Clasen
- Laboratory for Structural and Computational Proteomics, ICC, Oswaldo Cruz Foundation (FIOCRUZ), Curitiba, PR, Brazil
| | - Emerson Lucena da Silva
- Drug Research and Development Center, Federal University of Ceará (UFC), Fortaleza, CE, Brazil; Department of Physiology and Pharmacology, School of Medicine, Federal University of Ceará (UFC), Fortaleza, CE, Brazil
| | - Luciana Curtolo de Barros
- Center for the Study of Venoms and Venomous Animals (CEVAP), São Paulo State University (UNESP), Botucatu, SP, Brazil
| | - Aline Diogo Marinho
- Drug Research and Development Center, Federal University of Ceará (UFC), Fortaleza, CE, Brazil; Department of Physiology and Pharmacology, School of Medicine, Federal University of Ceará (UFC), Fortaleza, CE, Brazil
| | - Bruno Cesar Rossini
- Biotechnology Institute (IBTEC), São Paulo State University (UNESP), Botucatu, SP, Brazil; Department of Chemical and Biological Sciences, São Paulo State University (UNESP), Botucatu, SP, Brazil
| | - Celso Luís Marino
- Biotechnology Institute (IBTEC), São Paulo State University (UNESP), Botucatu, SP, Brazil; Department of Chemical and Biological Sciences, São Paulo State University (UNESP), Botucatu, SP, Brazil
| | - Paulo Costa Carvalho
- Laboratory for Structural and Computational Proteomics, ICC, Oswaldo Cruz Foundation (FIOCRUZ), Curitiba, PR, Brazil
| | - Roberta Jeane Bezerra Jorge
- Drug Research and Development Center, Federal University of Ceará (UFC), Fortaleza, CE, Brazil; Department of Physiology and Pharmacology, School of Medicine, Federal University of Ceará (UFC), Fortaleza, CE, Brazil
| | - Lucilene Delazari Dos Santos
- Graduate Program in Tropical Diseases, Botucatu Medical School (FMB), São Paulo State University (UNESP), Botucatu, SP, Brazil; Biotechnology Institute (IBTEC), São Paulo State University (UNESP), Botucatu, SP, Brazil.
| |
Collapse
|
5
|
Whole-genome sequencing reveals novel ethnicity-specific rare variants associated with Alzheimer's disease. Mol Psychiatry 2022; 27:2554-2562. [PMID: 35264725 PMCID: PMC9135624 DOI: 10.1038/s41380-022-01483-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 01/21/2022] [Accepted: 02/08/2022] [Indexed: 12/03/2022]
Abstract
Alzheimer's disease (AD) is the most common multifactorial neurodegenerative disease among elderly people. Genome-wide association studies (GWAS) have been highly successful in identifying genetic risk factors. However, GWAS investigate common variants, which tend to have small effect sizes, and rare variants with potentially larger phenotypic effects have not been sufficiently investigated. Whole-genome sequencing (WGS) enables us to detect those rare variants. Here, we performed rare-variant association studies by using WGS data from 140 individuals with probable AD and 798 cognitively normal elder controls (CN), as well as single-nucleotide polymorphism genotyping data from an independent large Japanese AD cohort of 1604 AD and 1235 CN subjects. We identified two rare variants as candidates for AD association: a missense variant in OR51G1 (rs146006146, c.815 G > A, p.R272H) and a stop-gain variant in MLKL (rs763812068, c.142 C > T, p.Q48X). Subsequent in vitro functional analysis revealed that the MLKL stop-gain variant can contribute to increases not only in abnormal cells that should die by programmed cell death but do not, but also in the ratio of Aβ42 to Aβ40. We further detected AD candidate genes through gene-based association tests of rare variants; a network-based meta-analysis using these candidates identified four functionally important hub genes (NCOR2, PLEC, DMD, and NEDD4). Our findings will contribute to the understanding of AD and provide novel insights into its pathogenic mechanisms that can be used in future studies.
Collapse
|
6
|
Exploring the conservation of Alzheimer-related pathways between H. sapiens and C. elegans: a network alignment approach. Sci Rep 2021; 11:4572. [PMID: 33633188 PMCID: PMC7907373 DOI: 10.1038/s41598-021-83892-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 01/11/2021] [Indexed: 12/13/2022] Open
Abstract
Alzheimer disease (AD) is a neurodegenerative disorder with an –as of yet– unclear etiology and pathogenesis. Research to unveil disease processes underlying AD often relies on the use of neurodegenerative disease model organisms, such as Caenorhabditis elegans. This study sought to identify biological pathways implicated in AD that are conserved in Homo sapiens and C. elegans. Protein–protein interaction networks were assembled for amyloid precursor protein (APP) and Tau in H. sapiens—two proteins whose aggregation is a hallmark in AD—and their orthologs APL-1 and PTL-1 for C. elegans. Global network alignment was used to compare these networks and determine similar, likely conserved, network regions. This comparison revealed that two prominent pathways, the APP-processing and the Tau-phosphorylation pathways, are highly conserved in both organisms. While the majority of interactions between proteins in those pathways are known to be associated with AD in human, they remain unexamined in C. elegans, signifying the need for their further investigation. In this work, we have highlighted conserved interactions related to AD in humans and have identified specific proteins that can act as targets for experimental studies in C. elegans, aiming to uncover the underlying mechanisms of AD.
Collapse
|
7
|
Statistical Physics for Medical Diagnostics: Learning, Inference, and Optimization Algorithms. Diagnostics (Basel) 2020; 10:diagnostics10110972. [PMID: 33228143 PMCID: PMC7699346 DOI: 10.3390/diagnostics10110972] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 11/16/2020] [Accepted: 11/17/2020] [Indexed: 02/03/2023] Open
Abstract
It is widely believed that cooperation between clinicians and machines may address many of the decisional fragilities intrinsic to current medical practice. However, the realization of this potential will require more precise definitions of disease states as well as their dynamics and interactions. A careful probabilistic examination of symptoms and signs, including the molecular profiles of the relevant biochemical networks, will often be required for building an unbiased and efficient diagnostic approach. Analogous problems have been studied for years by physicists extracting macroscopic states of various physical systems by examining microscopic elements and their interactions. These valuable experiences are now being extended to the medical field. From this perspective, we discuss how recent developments in statistical physics, machine learning and inference algorithms are coming together to improve current medical diagnostic approaches.
Collapse
|
8
|
Ramezanpour A, Mashaghi A. Disease evolution in reaction networks: Implications for a diagnostic problem. PLoS Comput Biol 2020; 16:e1007889. [PMID: 32497038 PMCID: PMC7272006 DOI: 10.1371/journal.pcbi.1007889] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 04/20/2020] [Indexed: 12/30/2022] Open
Abstract
We study the time evolution of symptoms (signs) with some defects in the dynamics of a reaction network as a (microscopic) model for the progress of disease phenotypes. To this end, we take a large population of reaction networks and follow the stochastic dynamics of the system to see how the development of defects affects the macroscopic states of the signs probability distribution. We start from some plausible definitions for the healthy and disease states along with a dynamical model for the emergence of diseases by a reverse simulated annealing algorithm. The healthy state is defined as a state of maximum objective function, which here is the sum of mutual information between a subset of signal variables and the subset of assigned response variables. A disease phenotype is defined with two parameters controlling the rate of mutations in reactions and the rate of accepting mutations that reduce the objective function. The model can provide the time dependence of the sign probabilities given a disease phenotype. This allows us to obtain the accuracy of diagnosis as a function of time by using a probabilistic model of signs and diseases. The trade-off between the diagnosis accuracy (increasing in time) and the objective function (decreasing in time) can be used to suggest an optimal time for medical intervention. Our model would be useful in particular for a dynamical (history-based) diagnostic problem, to estimate the likelihood of a disease hypothesis given the temporal evolution of the signs. Here, we use concepts from statistical physics and reaction network dynamics to introduce a measure to quantify the tradeoff between the accuracy of diagnosis and an early diagnosis. This measure is used to suggest an optimal time for medical intervention depending on the number of observed signs (medical tests). We present a stochastic model using a reverse simulated annealing algorithm for numerical simulation of disease evolution. The model can provide the time dependence of the sign probabilities given a disease phenotype. This in turn allows us to anticipate the accuracy of diagnosis as a function of time by using a probabilistic model of signs and diseases.
Collapse
Affiliation(s)
- Abolfazl Ramezanpour
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands
- Physics Department, College of Sciences, Shiraz University, Shiraz, Iran
| | - Alireza Mashaghi
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands
- * E-mail:
| |
Collapse
|
9
|
Afiqah-Aleng N, Altaf-Ul-Amin M, Kanaya S, Mohamed-Hussein ZA. Graph cluster approach in identifying novel proteins and significant pathways involved in polycystic ovary syndrome. Reprod Biomed Online 2019; 40:319-330. [PMID: 32001161 DOI: 10.1016/j.rbmo.2019.11.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 11/07/2019] [Accepted: 11/25/2019] [Indexed: 12/18/2022]
Abstract
RESEARCH QUESTION Polycystic ovary syndrome (PCOS) is a complex endocrine disorder with diverse clinical implications, such as infertility, metabolic disorders, cardiovascular diseases and psychological problems among others. The heterogeneity of conditions found in PCOS contribute to its various phenotypes, leading to difficulties in identifying proteins involved in this abnormality. Several studies, however, have shown the feasibility in identifying molecular evidence underlying other diseases using graph cluster analysis. Therefore, is it possible to identify proteins and pathways related to PCOS using the same approach? METHODS Known PCOS-related proteins (PCOSrp) from PCOSBase and DisGeNET were integrated with protein-protein interactions (PPI) information from Human Integrated Protein-Protein Interaction reference to construct a PCOS PPI network. The network was clustered with DPClusO algorithm to generate clusters, which were evaluated using Fisher's exact test. Pathway enrichment analysis using gProfileR was conducted to identify significant pathways. RESULTS The statistical significance of the identified clusters has successfully predicted 138 novel PCOSrp with 61.5% reliability and, based on Cronbach's alpha, this prediction is acceptable. Androgen signalling pathway and leptin signalling pathway were among the significant PCOS-related pathways corroborating the information obtained from the clinical observation, where androgen signalling pathway is responsible in producing male hormones in women with PCOS, whereas leptin signalling pathway is involved in insulin sensitivity. CONCLUSIONS These results show that graph cluster analysis can provide additional insight into the pathobiology of PCOS, as the pathways identified as statistically significant correspond to earlier biological studies. Therefore, integrative analysis can reveal unknown mechanisms, which may enable the development of accurate diagnosis and effective treatment in PCOS.
Collapse
Affiliation(s)
- Nor Afiqah-Aleng
- Centre for Bioinformatics Research, Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia; Institute of Marine Biotechnology, Universiti Malaysia Terengganu (UMT), 21030 Kuala Nerus, Terengganu, Malaysia
| | - M Altaf-Ul-Amin
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara 630-0192, Japan
| | - Shigehiko Kanaya
- Graduate School of Science and Technology & NAIST Data Science Center, Nara Institute of Science and Technology, Nara 630-0192, Japan
| | - Zeti-Azura Mohamed-Hussein
- Centre for Bioinformatics Research, Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia; Centre for Frontier Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia.
| |
Collapse
|
10
|
Yang Y, Wang X, Ju W, Sun L, Zhang H. Genetic and Expression Analysis of COPI Genes and Alzheimer's Disease Susceptibility. Front Genet 2019; 10:866. [PMID: 31608112 PMCID: PMC6761859 DOI: 10.3389/fgene.2019.00866] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 08/19/2019] [Indexed: 11/13/2022] Open
Abstract
Alzheimer's disease (AD) is the most common neurodegenerative disease in the elderly and the leading cause of dementia in humans. Evidence shows that cellular trafficking and recycling machineries are associated with AD risk. A recent study found that the coat protein complex I (COPI)-dependent trafficking in vivo could significantly reduce amyloid plaques in the cortex and hippocampus of neurological in the AD mouse models and identified 12 single-nucleotide polymorphisms in COPI genes to be significantly associated with increased AD risk using 6,795 samples. Here, we used a large-scale GWAS dataset to investigate the potential association between the COPI genes and AD susceptibility by both SNP and gene-based tests. The results showed that only rs9898218 was associated with AD risk with P = 0.017. We further conducted an expression quantitative trait loci (eQTLs) analysis and found that rs9898218 G allele was associated with increased COPZ2 expression in cerebellar cortex with P = 0.0184. Importantly, the eQTLs analysis in whole blood further indicated that 11 of these 12 genetic variants could significantly regulate the expression of COPI genes. Hence, these findings may contribute to understand the association between COPI genes and AD susceptibility.
Collapse
Affiliation(s)
- Yu Yang
- Department of Neurology and Neuroscience Center, First Hospital of Jilin University, Changchun, China
| | - Xu Wang
- Department of Neurology and Neuroscience Center, First Hospital of Jilin University, Changchun, China
| | - Weina Ju
- Department of Neurology and Neuroscience Center, First Hospital of Jilin University, Changchun, China
| | - Li Sun
- Department of Neurology and Neuroscience Center, First Hospital of Jilin University, Changchun, China
| | - Haining Zhang
- Department of Neurology and Neuroscience Center, First Hospital of Jilin University, Changchun, China
| |
Collapse
|
11
|
Saha S, Sengupta K, Chatterjee P, Basu S, Nasipuri M. Analysis of protein targets in pathogen-host interaction in infectious diseases: a case study on Plasmodium falciparum and Homo sapiens interaction network. Brief Funct Genomics 2019; 17:441-450. [PMID: 29028886 DOI: 10.1093/bfgp/elx024] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Infection and disease progression is the outcome of protein interactions between pathogen and host. Pathogen, the role player of Infection, is becoming a severe threat to life as because of its adaptability toward drugs and evolutionary dynamism in nature. Identifying protein targets by analyzing protein interactions between host and pathogen is the key point. Proteins with higher degree and possessing some topologically significant graph theoretical measures are found to be drug targets. On the other hand, exceptional nodes may be involved in infection mechanism because of some pathway process and biologically unknown factors. In this article, we attempt to investigate characteristics of host-pathogen protein interactions by presenting a comprehensive review of computational approaches applied on different infectious diseases. As an illustration, we have analyzed a case study on infectious disease malaria, with its causative agent Plasmodium falciparum acting as 'Bait' and host, Homo sapiens/human acting as 'Prey'. In this pathogen-host interaction network based on some interconnectivity and centrality properties, proteins are viewed as central, peripheral, hub and non-hub nodes and their significance on infection process. Besides, it is observed that because of sparseness of the pathogen and host interaction network, there may be some topologically unimportant but biologically significant proteins, which can also act as Bait/Prey. So, functional similarity or gene ontology mapping can help us in this case to identify these proteins.
Collapse
Affiliation(s)
- Sovan Saha
- Department of Computer Science and Engineering at Dr Sudhir Chandra Sur Degree Engineering College, India
| | - Kaustav Sengupta
- Department of Computer Science and Engineering, Jadavpur University, India
| | - Piyali Chatterjee
- Department of Computer Science and Engineering, Netaji Subhash Engineering College, Garia, India
| | - Subhadip Basu
- Department of Computer Science and Engineering, Jadavpur University, India
| | - Mita Nasipuri
- Department of Computer Science and Engineering, Jadavpur University, India
| |
Collapse
|
12
|
A network pharmacology approach to investigate the pharmacological effect of curcumin and capsaicin targets in cancer angiogenesis by module-based PPI network analysis. ACTA ACUST UNITED AC 2019. [DOI: 10.1007/s42485-019-00012-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
13
|
Das AB. Disease association of human tumor suppressor genes. Mol Genet Genomics 2019; 294:931-940. [PMID: 30945018 DOI: 10.1007/s00438-019-01557-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2019] [Accepted: 03/27/2019] [Indexed: 02/06/2023]
Abstract
The multifactorial disease, cancer, frequently emerges due to perturbations in tumor suppressor genes (TSGs). However, a growing number of noncanonical target genes of TSGs and the highly interconnected nature of the human interactome reveal that the functions of TSGs are not limited to cancer-specific events. The various functions of TSGs lead to the assumption that cancer is linked with other human disorders. Therefore, a disease-gene association network of TSGs (TSDN) was constructed by integrating protein-protein interaction networks of TSGs (TSN) with Morbid Map in Online Mendelian Inheritance in Man. The TSDN revealed links between TSGs and 22 different human disorders including cancer and indicated disease-disease associations. In addition, high-density functional protein clusters in the TSN showed cohesive and overlapping disease-TSG associations, which proved the prevalent role of TSGs in various human diseases beyond cancer. The presence of overlapping disease-gene modules and disease-disease associations via the TSN demonstrated that other diseases can serve as possible roots of the life-threatening disease cancer. Therefore, a disease association map of TSGs could be a promising tool for exploring intricate relationships between cancer and other diseases for the early prediction of cancer and the understanding of disease etiology.
Collapse
Affiliation(s)
- Asim Bikas Das
- Department of Biotechnology, National Institute of Technology Warangal, Warangal, Telangana, 506004, India.
| |
Collapse
|
14
|
Stoney R, Robertson DL, Nenadic G, Schwartz JM. Mapping biological process relationships and disease perturbations within a pathway network. NPJ Syst Biol Appl 2018; 4:22. [PMID: 29900005 PMCID: PMC5995814 DOI: 10.1038/s41540-018-0055-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2017] [Revised: 04/17/2018] [Accepted: 04/24/2018] [Indexed: 01/07/2023] Open
Abstract
Molecular interaction networks are routinely used to map the organization of cellular function. Edges represent interactions between genes, proteins, or metabolites. However, in living cells, molecular interactions are dynamic, necessitating context-dependent models. Contextual information can be integrated into molecular interaction networks through the inclusion of additional molecular data, but there are concerns about completeness and relevance of this data. We developed an approach for representing the organization of human cellular processes using pathways as the nodes in a network. Pathways represent spatial and temporal sets of context-dependent interactions, generating a high-level network when linked together, which incorporates contextual information without the need for molecular interaction data. Analysis of the pathway network revealed linked communities representing functional relationships, comparable to those found in molecular networks, including metabolism, signaling, immunity, and the cell cycle. We mapped a range of diseases onto this network and find that pathways associated with diseases tend to be functionally connected, highlighting the perturbed functions that result in disease phenotypes. We demonstrated that disease pathways cluster within the network. We then examined the distribution of cancer pathways and showed that cancer pathways tend to localize within the signaling, DNA processes and immune modules, although some cancer-associated nodes are found in other network regions. Altogether, we generated a high-confidence functional network, which avoids some of the shortcomings faced by conventional molecular models. Our representation provides an intuitive functional interpretation of cellular organization, which relies only on high-quality pathway and Gene Ontology data. The network is available at https://data.mendeley.com/datasets/3pbwkxjxg9/1.
Collapse
Affiliation(s)
- Ruth Stoney
- School of Computer Science, University of Manchester, M13 9PT, Manchester, UK
| | - David L Robertson
- MRC-University of Glasgow Centre for Virus Research, Garscube Campus, Glasgow, G61 1QH UK
| | - Goran Nenadic
- School of Computer Science, University of Manchester, M13 9PT, Manchester, UK
| | - Jean-Marc Schwartz
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester, M13 9PT UK
| |
Collapse
|
15
|
Mashaghi A, Ramezanpour A. Statistical physics of medical diagnostics: Study of a probabilistic model. Phys Rev E 2018; 97:032118. [PMID: 29776109 DOI: 10.1103/physreve.97.032118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2017] [Indexed: 01/04/2023]
Abstract
We study a diagnostic strategy which is based on the anticipation of the diagnostic process by simulation of the dynamical process starting from the initial findings. We show that such a strategy could result in more accurate diagnoses compared to a strategy that is solely based on the direct implications of the initial observations. We demonstrate this by employing the mean-field approximation of statistical physics to compute the posterior disease probabilities for a given subset of observed signs (symptoms) in a probabilistic model of signs and diseases. A Monte Carlo optimization algorithm is then used to maximize an objective function of the sequence of observations, which favors the more decisive observations resulting in more polarized disease probabilities. We see how the observed signs change the nature of the macroscopic (Gibbs) states of the sign and disease probability distributions. The structure of these macroscopic states in the configuration space of the variables affects the quality of any approximate inference algorithm (so the diagnostic performance) which tries to estimate the sign-disease marginal probabilities. In particular, we find that the simulation (or extrapolation) of the diagnostic process is helpful when the disease landscape is not trivial and the system undergoes a phase transition to an ordered phase.
Collapse
Affiliation(s)
- Alireza Mashaghi
- Leiden Academic Centre for Drug Research, Faculty of Mathematics and Natural Sciences, Leiden University, Leiden, The Netherlands.,Harvard Medical School, Harvard University, Boston, Massachusetts, USA
| | - Abolfazl Ramezanpour
- Leiden Academic Centre for Drug Research, Faculty of Mathematics and Natural Sciences, Leiden University, Leiden, The Netherlands.,Department of Physics, University of Neyshabur, Neyshabur, Iran
| |
Collapse
|
16
|
Disease gene classification with metagraph representations. Methods 2017; 131:83-92. [DOI: 10.1016/j.ymeth.2017.06.036] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2017] [Revised: 06/23/2017] [Accepted: 06/30/2017] [Indexed: 12/28/2022] Open
|
17
|
Chen L, Mukerjee G, Dorfman R, Moghadas SM. Disease Risk Assessment Using a Voronoi-Based Network Analysis of Genes and Variants Scores. Front Genet 2017; 8:29. [PMID: 28326099 PMCID: PMC5339255 DOI: 10.3389/fgene.2017.00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 02/22/2017] [Indexed: 11/18/2022] Open
Abstract
Much effort has been devoted to assess disease risk based on large-scale protein-protein network and genotype-phenotype associations. However, the challenge of risk prediction for complex diseases remains unaddressed. Here, we propose a framework to quantify the risk based on a Voronoi tessellation network analysis, taking into account the disease association scores of both genes and variants. By integrating ClinVar, SNPnexus, and DISEASES databases, we introduce a gene-variant map that is based on the pairwise disease-associated gene-variant scores. This map is clustered using Voronoi tessellation and network analysis with a threshold obtained from fitting the background Voronoi cell density distribution. We define the relative risk of disease that is inferred from the scores of the data points within the related clusters on the gene-variant map. We identify autoimmune-associated clusters that may interact at the system-level. The proposed framework can be used to determine the clusters that are specific to a subtype or contribute to multiple subtypes of complex diseases.
Collapse
Affiliation(s)
- Lin Chen
- Agent-Based Modelling Laboratory, York University Toronto, ON, Canada
| | | | | | - Seyed M Moghadas
- Agent-Based Modelling Laboratory, York University Toronto, ON, Canada
| |
Collapse
|