1
|
Joseph S, Patil K, Rahate N, Shah J, Mukherjee S, Mahale SD. Integrated data driven analysis identifies potential candidate genes associated with PCOS. Comput Biol Chem 2024; 113:108191. [PMID: 39243549 DOI: 10.1016/j.compbiolchem.2024.108191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 07/16/2024] [Accepted: 08/28/2024] [Indexed: 09/09/2024]
Abstract
Polycystic ovary syndrome (PCOS) is one of the most common anovulatory disorder observed in women presenting with infertility. Several high and low throughput studies on PCOS have led to accumulation of vast amount of information on PCOS. Despite the availability of several resources which index the advances in PCOS, information on its etiology still remains inadequate. Analysis of the existing information using an integrated evidence based approach may aid identification of novel potential candidate genes with a role in PCOS pathophysiology. This work focuses on integrating existing information on PCOS from literature and gene expression studies and evaluating the application of gene prioritization and network analysis to predict missing novel candidates. Further, it assesses the utility of evidence-based scoring to rank genes for their association with PCOS. The results of this study led to identification of ∼2000 plausible candidate genes associated with PCOS. Insilico validation of these identified candidates confirmed the role of 938 genes in PCOS. Further, experimental validation was carried out for four of the potential candidate genes, a high-scoring (PROS1), two mid-scoring (C1QA and KNG1), and a low-scoring gene (VTN) involved in the complement and coagulation pathway by comparing protein levels in follicular fluid in women with PCOS and healthy controls. While the expression of PROS1, C1QA, and KNG1 was found to be significantly downregulated in women with PCOS, the expression of VTN was found to be unchanged in PCOS. The findings of this study reiterate the utility of employing insilico approaches to identify and prioritize the most promising candidate genes in diseases with a complex pathophysiology like PCOS. Further, the study also helps in gaining clearer insights into the molecular mechanisms associated with the manifestation of the PCOS phenotype by contributing to the existing repertoire of genes associated with PCOS.
Collapse
Affiliation(s)
- Shaini Joseph
- Genetic Research Center, ICMR-National Institute for Research in Reproductive and Child Health, J.M. Street, Parel, Mumbai 400012, India
| | - Krutika Patil
- Department of Molecular Endocrinology, ICMR-National Institute for Research in Reproductive and Child Health, J.M. Street, Parel, Mumbai 400012, India
| | - Niharika Rahate
- Genetic Research Center, ICMR-National Institute for Research in Reproductive and Child Health, J.M. Street, Parel, Mumbai 400012, India
| | - Jatin Shah
- Mumbai Fertility Clinic & IVF Centre, Kamala Polyclinic and Nursing Home, Mumbai 400026, India
| | - Srabani Mukherjee
- Department of Molecular Endocrinology, ICMR-National Institute for Research in Reproductive and Child Health, J.M. Street, Parel, Mumbai 400012, India.
| | - Smita D Mahale
- ICMR-National Institute for Research in Reproductive and Child Health, J.M. Street, Parel, Mumbai 400012, India.
| |
Collapse
|
2
|
Balachandran S, Prada-Medina CA, Mensah MA, Kakar N, Nagel I, Pozojevic J, Audain E, Hitz MP, Kircher M, Sreenivasan VKA, Spielmann M. STIGMA: Single-cell tissue-specific gene prioritization using machine learning. Am J Hum Genet 2024; 111:338-349. [PMID: 38228144 PMCID: PMC10870135 DOI: 10.1016/j.ajhg.2023.12.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 12/01/2023] [Accepted: 12/07/2023] [Indexed: 01/18/2024] Open
Abstract
Clinical exome and genome sequencing have revolutionized the understanding of human disease genetics. Yet many genes remain functionally uncharacterized, complicating the establishment of causal disease links for genetic variants. While several scoring methods have been devised to prioritize these candidate genes, these methods fall short of capturing the expression heterogeneity across cell subpopulations within tissues. Here, we introduce single-cell tissue-specific gene prioritization using machine learning (STIGMA), an approach that leverages single-cell RNA-seq (scRNA-seq) data to prioritize candidate genes associated with rare congenital diseases. STIGMA prioritizes genes by learning the temporal dynamics of gene expression across cell types during healthy organogenesis. To assess the efficacy of our framework, we applied STIGMA to mouse limb and human fetal heart scRNA-seq datasets. In a cohort of individuals with congenital limb malformation, STIGMA prioritized 469 variants in 345 genes, with UBA2 as a notable example. For congenital heart defects, we detected 34 genes harboring nonsynonymous de novo variants (nsDNVs) in two or more individuals from a set of 7,958 individuals, including the ortholog of Prdm1, which is associated with hypoplastic left ventricle and hypoplastic aortic arch. Overall, our findings demonstrate that STIGMA effectively prioritizes tissue-specific candidate genes by utilizing single-cell transcriptome data. The ability to capture the heterogeneity of gene expression across cell populations makes STIGMA a powerful tool for the discovery of disease-associated genes and facilitates the identification of causal variants underlying human genetic disorders.
Collapse
Affiliation(s)
- Saranya Balachandran
- Institute of Human Genetics, University Hospital Schleswig-Holstein, University of Lübeck and Kiel University, Lübeck, Germany
| | - Cesar A Prada-Medina
- Human Molecular Genetics Group, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Martin A Mensah
- Institut für Medizinische Genetik und Humangenetik, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Augustenburger Platz 1, 13353 Berlin, Germany; BIH Charité Digital Clinician Scientist Program, BIH Biomedical Innovation Academy, Anna-Louisa-Karsch-Strasse 2, 10178 Berlin, Germany; RG Development & Disease, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany
| | - Naseebullah Kakar
- Institute of Human Genetics, University Hospital Schleswig-Holstein, University of Lübeck and Kiel University, Lübeck, Germany; Department of Biotechnology, BUITEMS, Quetta, Pakistan
| | - Inga Nagel
- Institute of Human Genetics, University Hospital Schleswig-Holstein, University of Lübeck and Kiel University, Lübeck, Germany
| | - Jelena Pozojevic
- Institute of Human Genetics, University Hospital Schleswig-Holstein, University of Lübeck and Kiel University, Lübeck, Germany
| | - Enrique Audain
- Institute of Medical Genetics, Carl von Ossietzky University, 26129 Oldenburg, Germany; DZHK e.V. (German Center for Cardiovascular Research), Partner Site Hamburg/Kiel/Lübeck; Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital of Schleswig-Holstein, 24105 Kiel, Germany
| | - Marc-Phillip Hitz
- Institute of Medical Genetics, Carl von Ossietzky University, 26129 Oldenburg, Germany; DZHK e.V. (German Center for Cardiovascular Research), Partner Site Hamburg/Kiel/Lübeck; Department of Congenital Heart Disease and Pediatric Cardiology, University Hospital of Schleswig-Holstein, 24105 Kiel, Germany
| | - Martin Kircher
- Institute of Human Genetics, University Hospital Schleswig-Holstein, University of Lübeck and Kiel University, Lübeck, Germany
| | - Varun K A Sreenivasan
- Institute of Human Genetics, University Hospital Schleswig-Holstein, University of Lübeck and Kiel University, Lübeck, Germany.
| | - Malte Spielmann
- Institute of Human Genetics, University Hospital Schleswig-Holstein, University of Lübeck and Kiel University, Lübeck, Germany; Human Molecular Genetics Group, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany; DZHK e.V. (German Center for Cardiovascular Research), Partner Site Hamburg/Kiel/Lübeck.
| |
Collapse
|
3
|
Nguyen T, Yue Z, Slominski R, Welner R, Zhang J, Chen JY. WINNER: A network biology tool for biomolecular characterization and prioritization. Front Big Data 2022; 5:1016606. [PMID: 36407327 PMCID: PMC9672476 DOI: 10.3389/fdata.2022.1016606] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Accepted: 10/14/2022] [Indexed: 12/09/2024] Open
Abstract
BACKGROUND AND CONTRIBUTION In network biology, molecular functions can be characterized by network-based inference, or "guilt-by-associations." PageRank-like tools have been applied in the study of biomolecular interaction networks to obtain further the relative significance of all molecules in the network. However, there is a great deal of inherent noise in widely accessible data sets for gene-to-gene associations or protein-protein interactions. How to develop robust tests to expand, filter, and rank molecular entities in disease-specific networks remains an ad hoc data analysis process. RESULTS We describe a new biomolecular characterization and prioritization tool called Weighted In-Network Node Expansion and Ranking (WINNER). It takes the input of any molecular interaction network data and generates an optionally expanded network with all the nodes ranked according to their relevance to one another in the network. To help users assess the robustness of results, WINNER provides two different types of statistics. The first type is a node-expansion p-value, which helps evaluate the statistical significance of adding "non-seed" molecules to the original biomolecular interaction network consisting of "seed" molecules and molecular interactions. The second type is a node-ranking p-value, which helps evaluate the relative statistical significance of the contribution of each node to the overall network architecture. We validated the robustness of WINNER in ranking top molecules by spiking noises in several network permutation experiments. We have found that node degree-preservation randomization of the gene network produced normally distributed ranking scores, which outperform those made with other gene network randomization techniques. Furthermore, we validated that a more significant proportion of the WINNER-ranked genes was associated with disease biology than existing methods such as PageRank. We demonstrated the performance of WINNER with a few case studies, including Alzheimer's disease, breast cancer, myocardial infarctions, and Triple negative breast cancer (TNBC). In all these case studies, the expanded and top-ranked genes identified by WINNER reveal disease biology more significantly than those identified by other gene prioritizing software tools, including Ingenuity Pathway Analysis (IPA) and DiAMOND. CONCLUSION WINNER ranking strongly correlates to other ranking methods when the network covers sufficient node and edge information, indicating a high network quality. WINNER users can use this new tool to robustly evaluate a list of candidate genes, proteins, or metabolites produced from high-throughput biology experiments, as long as there is available gene/protein/metabolic network information.
Collapse
Affiliation(s)
- Thanh Nguyen
- Informatics Institute in School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
- Department of Biomedical Engineering, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Zongliang Yue
- Informatics Institute in School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Radomir Slominski
- Informatics Institute in School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Robert Welner
- Comprehensive Arthritis, Musculoskeletal, Bone and Autoimmunity Center (CAMBAC), School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Jianyi Zhang
- Department of Biomedical Engineering, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Jake Y. Chen
- Informatics Institute in School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| |
Collapse
|
4
|
Azadifar S, Ahmadi A. A novel candidate disease gene prioritization method using deep graph convolutional networks and semi-supervised learning. BMC Bioinformatics 2022; 23:422. [PMID: 36241966 PMCID: PMC9563530 DOI: 10.1186/s12859-022-04954-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 09/20/2022] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Selecting and prioritizing candidate disease genes is necessary before conducting laboratory studies as identifying disease genes from a large number of candidate genes using laboratory methods, is a very costly and time-consuming task. There are many machine learning-based gene prioritization methods. These methods differ in various aspects including the feature vectors of genes, the used datasets with different structures, and the learning model. Creating a suitable feature vector for genes and an appropriate learning model on a variety of data with different and non-Euclidean structures, including graphs, as well as the lack of negative data are very important challenges of these methods. The use of graph neural networks has recently emerged in machine learning and other related fields, and they have demonstrated superior performance for a broad range of problems. METHODS In this study, a new semi-supervised learning method based on graph convolutional networks is presented using the novel constructing feature vector for each gene. In the proposed method, first, we construct three feature vectors for each gene using terms from the Gene Ontology (GO) database. Then, we train a graph convolution network on these vectors using protein-protein interaction (PPI) network data to identify disease candidate genes. Our model discovers hidden layer representations encoding in both local graph structure as well as features of nodes. This method is characterized by the simultaneous consideration of topological information of the biological network (e.g., PPI) and other sources of evidence. Finally, a validation has been done to demonstrate the efficiency of our method. RESULTS Several experiments are performed on 16 diseases to evaluate the proposed method's performance. The experiments demonstrate that our proposed method achieves the best results, in terms of precision, the area under the ROC curve (AUCs), and F1-score values, when compared with eight state-of-the-art network and machine learning-based disease gene prioritization methods. CONCLUSION This study shows that the proposed semi-supervised learning method appropriately classifies and ranks candidate disease genes using a graph convolutional network and an innovative method to create three feature vectors for genes based on the molecular function, cellular component, and biological process terms from GO data.
Collapse
Affiliation(s)
- Saeid Azadifar
- Faculty of Computer Engineering, K. N. Toosi University of Technology, Tehran, Iran
| | - Ali Ahmadi
- Faculty of Computer Engineering, K. N. Toosi University of Technology, Tehran, Iran
| |
Collapse
|
5
|
Guerra C, Joshi S, Lu Y, Palini F, Ferraro Petrillo U, Rossignac J. Rank-Similarity Measures for Comparing Gene Prioritizations: A Case Study in Autism. J Comput Biol 2020; 28:283-295. [PMID: 33103913 DOI: 10.1089/cmb.2020.0244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We discuss the challenge of comparing three gene prioritization methods: network propagation, integer linear programming rank aggregation (RA), and statistical RA. These methods are based on different biological categories and estimate disease-gene association. Previously proposed comparison schemes are based on three measures of performance: receiver operating curve, area under the curve, and median rank ratio. Although they may capture important aspects of gene prioritization performance, they may fail to capture important differences in the rankings of individual genes. We suggest that comparison schemes could be improved by also considering recently proposed measures of similarity between gene rankings. We tested this suggestion on comparison schemes for prioritizations of genes associated with autism that were obtained using brain- and tissue-specific data. Our results show the effectiveness of our measures of similarity in clustering brain regions based on their relevance to autism.
Collapse
Affiliation(s)
- Concettina Guerra
- Georgia Institute of Technology College of Computing, School of Interactive Computing, Atlanta, Georgia, USA
| | - Sarang Joshi
- Georgia Institute of Technology College of Computing, School of Interactive Computing, Atlanta, Georgia, USA
| | - Yinquan Lu
- Georgia Institute of Technology College of Computing, School of Interactive Computing, Atlanta, Georgia, USA
| | - Francesco Palini
- Dipartimento di Scienze Statistiche, Università di Roma-La Sapienza, Rome, Italy
| | | | - Jarek Rossignac
- Georgia Institute of Technology College of Computing, School of Interactive Computing, Atlanta, Georgia, USA
| |
Collapse
|
6
|
Cheng B, Qi X, Liang C, Zhang L, Ma M, Li P, Liu L, Cheng S, Yao Y, Chu X, Ye J, Wen Y, Jia Y, Zhang F. Integrative Genomic Enrichment Analysis Identified the Brain Regions and Development Stages Related to Anorexia Nervosa and Obsessive-Compulsive Disorder. Cereb Cortex 2020; 30:6481-6489. [PMID: 32770201 DOI: 10.1093/cercor/bhaa214] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 06/29/2020] [Accepted: 07/14/2020] [Indexed: 12/31/2022] Open
Abstract
Our aim is to explore the spatial and temporal features of anorexia nervosa (AN) and obsessive-compulsive disorder (OCD) considering different brain regions and development stages. The gene sets related to 16 brain regions and nine development stages were obtained from a brain spatial and temporal transcriptomic dataset. Using the genome-wide association study data, transcriptome-wide association study (TWAS) was conducted to identify the genes whose imputed expressions were associated with AN and OCD, respectively. The mRNA expression profiles were analyzed by GEO2R to obtain differentially expressed genes. Gene set enrichment analysis was conducted to detect the spatial and temporal features related to AN and OCD using the TWAS and mRNA expression analysis results. We observed multiple common association signals shared by TWAS and mRNA expression analysis of AN, such as the primary auditory cortex vs. cerebellar cortex in fetal development and earlier vs. later fetal development in the somatosensory cortex. For OCD, we also detected multiple common association signals, such as medial prefrontal cortex vs. amygdala in adulthood and fetal development vs. infancy in mediodorsal nucleus of thalamus. Our study provides novel clues for describing the spatial and temporal features of brain development in the pathogenesis of AN and OCD.
Collapse
Affiliation(s)
- Bolun Cheng
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Xin Qi
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Chujun Liang
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Lu Zhang
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Mei Ma
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Ping Li
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Li Liu
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Shiqiang Cheng
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Yao Yao
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Xiaomeng Chu
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Jing Ye
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Yan Wen
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Yumeng Jia
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| | - Feng Zhang
- Key Laboratory of Trace Elements and Endemic Diseases, Collaborative Innovation Center of Endemic Disease and Health Promotion for Silk Road Region, School of Public Health, Health Science Center, Xi'an Jiaotong University, Xi'an 710061, P.R. China
| |
Collapse
|
7
|
Jiang L, Xue C, Dai S, Chen S, Chen P, Sham PC, Wang H, Li M. DESE: estimating driver tissues by selective expression of genes associated with complex diseases or traits. Genome Biol 2019; 20:233. [PMID: 31694669 PMCID: PMC6836538 DOI: 10.1186/s13059-019-1801-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 08/25/2019] [Indexed: 02/08/2023] Open
Abstract
The driver tissues or cell types in which susceptibility genes initiate diseases remain elusive. We develop a unified framework to detect the causal tissues of complex diseases or traits according to selective expression of disease-associated genes in genome-wide association studies (GWASs). This framework consists of three components which run iteratively to produce a converged prioritization list of driver tissues. Additionally, this framework also outputs a list of prioritized genes as a byproduct. We apply the framework to six representative complex diseases or traits with GWAS summary statistics, which leads to the estimation of the lung as an associated tissue of rheumatoid arthritis.
Collapse
Affiliation(s)
- Lin Jiang
- Zhongshan School of Medicine, Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China.,Department of Pituitary Tumour Center, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China
| | - Chao Xue
- Zhongshan School of Medicine, Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China.,Key Laboratory of Tropical Disease Control (SYSU), Ministry of Education, Guangzhou, 510080, China
| | - Sheng Dai
- Zhongshan School of Medicine, Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Shangzhen Chen
- Zhongshan School of Medicine, Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Peikai Chen
- Department of Psychiatry, The Centre for Genomic Sciences, State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China
| | - Pak Chung Sham
- Department of Psychiatry, The Centre for Genomic Sciences, State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China
| | - Haijun Wang
- Department of Pituitary Tumour Center, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, China.
| | - Miaoxin Li
- Zhongshan School of Medicine, Center for Precision Medicine, Sun Yat-sen University, Guangzhou, 510080, China. .,Key Laboratory of Tropical Disease Control (SYSU), Ministry of Education, Guangzhou, 510080, China. .,Department of Psychiatry, The Centre for Genomic Sciences, State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China.
| |
Collapse
|
8
|
Pan X, Shen HB. Inferring Disease-Associated MicroRNAs Using Semi-supervised Multi-Label Graph Convolutional Networks. iScience 2019; 20:265-277. [PMID: 31605942 PMCID: PMC6817654 DOI: 10.1016/j.isci.2019.09.013] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 09/05/2019] [Accepted: 09/11/2019] [Indexed: 01/22/2023] Open
Abstract
MicroRNAs (miRNAs) play crucial roles in biological processes involved in diseases. The associations between diseases and protein-coding genes (PCGs) have been well investigated, and miRNAs interact with PCGs to trigger them to be functional. We present a computational method, DimiG, to infer miRNA-associated diseases using a semi-supervised Graph Convolutional Network model (GCN). DimiG uses a multi-label framework to integrate PCG-PCG interactions, PCG-miRNA interactions, PCG-disease associations, and tissue expression profiles. DimiG is trained on disease-PCG associations and an interaction network using a GCN, which is further used to score associations between diseases and miRNAs. We evaluate DimiG on a benchmark set from verified disease-miRNA associations. Our results demonstrate that DimiG outperforms the best unsupervised method and is comparable to two supervised methods. Three case studies of prostate cancer, lung cancer, and inflammatory bowel disease further demonstrate the efficacy of DimiG, where top miRNAs predicted by DimiG are supported by literature.
Collapse
Affiliation(s)
- Xiaoyong Pan
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, 200240 Shanghai, China; Department of Medical informatics, Erasmus Medical Center, 3015 CE Rotterdam, the Netherlands.
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, 200240 Shanghai, China.
| |
Collapse
|
9
|
Zolotareva O, Kleine M. A Survey of Gene Prioritization Tools for Mendelian and Complex Human Diseases. J Integr Bioinform 2019; 16:/j/jib.ahead-of-print/jib-2018-0069/jib-2018-0069.xml. [PMID: 31494632 PMCID: PMC7074139 DOI: 10.1515/jib-2018-0069] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Accepted: 07/12/2019] [Indexed: 12/16/2022] Open
Abstract
Modern high-throughput experiments provide us with numerous potential associations between genes and diseases. Experimental validation of all the discovered associations, let alone all the possible interactions between them, is time-consuming and expensive. To facilitate the discovery of causative genes, various approaches for prioritization of genes according to their relevance for a given disease have been developed. In this article, we explain the gene prioritization problem and provide an overview of computational tools for gene prioritization. Among about a hundred of published gene prioritization tools, we select and briefly describe 14 most up-to-date and user-friendly. Also, we discuss the advantages and disadvantages of existing tools, challenges of their validation, and the directions for future research.
Collapse
Affiliation(s)
- Olga Zolotareva
- Bielefeld University, Faculty of Technology and Center for Biotechnology, International Research Training Group "Computational Methods for the Analysis of the Diversity and Dynamics of Genomes" and Genome Informatics, Universitätsstraße 25, Bielefeld, Germany
| | - Maren Kleine
- Bielefeld University, Faculty of Technology, Bioinformatics/Medical Informatics Department, Universitätsstraße 25, Bielefeld, Germany
| |
Collapse
|
10
|
Loganathan J, Pandey R, Ambhore NS, Borowicz P, Sathish V. Laser-capture microdissection of murine lung for differential cellular RNA analysis. Cell Tissue Res 2019; 376:425-432. [PMID: 30710174 PMCID: PMC6534428 DOI: 10.1007/s00441-019-02995-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Accepted: 01/15/2019] [Indexed: 12/11/2022]
Abstract
The lung tissue contains a heterogeneous milieu of bronchioles, epithelial, airway smooth muscle (ASM), alveolar, and immune cell types. Healthy bronchiole comprises epithelial cells surrounded by ASM cells and helps in normal respiration. In contrast, airway remodeling, or plasticity, increases surrounding of bronchial epithelium during inflammation, especially in asthmatic condition. Given the profound functional difference between ASM, epithelial, and other cell types in the lung, it is imperative to separate and isolate different cell types of lungs for genomics, proteomics, and molecular analysis, which will improve the diagnostic and therapeutic approach to treat cell-specific lung disorders. Laser capture microdissection (LCM) is the technique generally used for the isolation of specific cell populations under direct visual inspection, which plays a crucial role to evaluate cell-specific effect in clinical and preclinical setup. However, maintenance of tissue RNA quality and integrity in LCM studies are very challenging tasks. It is obvious to believe that the major factor affecting the RNA quality is tissue-fixation method. The prime focus of this study was to address the RNA quality factors within the lung tissue using the different solvent system to fix tissue sample to obtain high-quality RNA. Paraformaldehyde and Carnoy's solutions were used for fixing the lung tissue and compared RNA integrity in LCM captured lung tissue samples. To further confirm the quality of RNA, we measured cellular marker genes in collected lung tissue samples from control and mixed allergen (MA)-induced asthmatic mouse model using qRT-PCR technique. RNA integrity number showed a significantly better quality of RNA in lung tissue samples fixed with Carnoy's solution compared to paraformaldehyde solution. Isolated RNA from MA-induced asthmatic murine lung epithelium, smooth muscle, and granulomatous foci using LCM showed a significant increase in remodeling gene expression compared to control which confirm the quality and integrity of isolated RNA. Overall, the study concludes tissue fixation solvent can alter the quality of RNA in the lung and the outcome of the results.
Collapse
Affiliation(s)
- Jagadish Loganathan
- Department of Pharmaceutical Sciences, School of Pharmacy, North Dakota State University, Sudro Hall, Room 203, Fargo, ND, 58108-6050, USA
| | - Roshni Pandey
- Department of Pharmaceutical Sciences, School of Pharmacy, North Dakota State University, Sudro Hall, Room 203, Fargo, ND, 58108-6050, USA
| | - Nilesh Sudhakar Ambhore
- Department of Pharmaceutical Sciences, School of Pharmacy, North Dakota State University, Sudro Hall, Room 203, Fargo, ND, 58108-6050, USA
| | - Pawel Borowicz
- Department of Animal Sciences, North Dakota State University, Fargo, ND, USA
| | - Venkatachalem Sathish
- Department of Pharmaceutical Sciences, School of Pharmacy, North Dakota State University, Sudro Hall, Room 203, Fargo, ND, 58108-6050, USA.
| |
Collapse
|
11
|
Pan X, Jensen LJ, Gorodkin J. Inferring disease-associated long non-coding RNAs using genome-wide tissue expression profiles. Bioinformatics 2018; 35:1494-1502. [DOI: 10.1093/bioinformatics/bty859] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 08/28/2018] [Accepted: 10/04/2018] [Indexed: 11/13/2022] Open
Affiliation(s)
- Xiaoyong Pan
- Department of Veterinary and Animal Sciences, Center for Non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg C, Denmark
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen N, Denmark
| | - Lars Juhl Jensen
- Disease Systems Biology Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen N, Denmark
| | - Jan Gorodkin
- Department of Veterinary and Animal Sciences, Center for Non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg C, Denmark
| |
Collapse
|
12
|
Abstract
Motivation Understanding functions of proteins in specific human tissues is essential for insights into disease diagnostics and therapeutics, yet prediction of tissue-specific cellular function remains a critical challenge for biomedicine. Results Here, we present OhmNet, a hierarchy-aware unsupervised node feature learning approach for multi-layer networks. We build a multi-layer network, where each layer represents molecular interactions in a different human tissue. OhmNet then automatically learns a mapping of proteins, represented as nodes, to a neural embedding-based low-dimensional space of features. OhmNet encourages sharing of similar features among proteins with similar network neighborhoods and among proteins activated in similar tissues. The algorithm generalizes prior work, which generally ignores relationships between tissues, by modeling tissue organization with a rich multiscale tissue hierarchy. We use OhmNet to study multicellular function in a multi-layer protein interaction network of 107 human tissues. In 48 tissues with known tissue-specific cellular functions, OhmNet provides more accurate predictions of cellular function than alternative approaches, and also generates more accurate hypotheses about tissue-specific protein actions. We show that taking into account the tissue hierarchy leads to improved predictive power. Remarkably, we also demonstrate that it is possible to leverage the tissue hierarchy in order to effectively transfer cellular functions to a functionally uncharacterized tissue. Overall, OhmNet moves from flat networks to multiscale models able to predict a range of phenotypes spanning cellular subsystems. Availability and implementation Source code and datasets are available at http://snap.stanford.edu/ohmnet.
Collapse
Affiliation(s)
- Marinka Zitnik
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Jure Leskovec
- Department of Computer Science, Stanford University, Stanford, CA, USA
| |
Collapse
|
13
|
|
14
|
Frasca M. Gene2DisCo: Gene to disease using disease commonalities. Artif Intell Med 2017; 82:34-46. [DOI: 10.1016/j.artmed.2017.08.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2017] [Revised: 07/24/2017] [Accepted: 08/13/2017] [Indexed: 01/10/2023]
|
15
|
Tian Z, Guo M, Wang C, Xing L, Wang L, Zhang Y. Constructing an integrated gene similarity network for the identification of disease genes. J Biomed Semantics 2017; 8:32. [PMID: 29297379 PMCID: PMC5763299 DOI: 10.1186/s13326-017-0141-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Discovering novel genes that are involved human diseases is a challenging task in biomedical research. In recent years, several computational approaches have been proposed to prioritize candidate disease genes. Most of these methods are mainly based on protein-protein interaction (PPI) networks. However, since these PPI networks contain false positives and only cover less half of known human genes, their reliability and coverage are very low. Therefore, it is highly necessary to fuse multiple genomic data to construct a credible gene similarity network and then infer disease genes on the whole genomic scale. RESULTS We proposed a novel method, named RWRB, to infer causal genes of interested diseases. First, we construct five individual gene (protein) similarity networks based on multiple genomic data of human genes. Then, an integrated gene similarity network (IGSN) is reconstructed based on similarity network fusion (SNF) method. Finally, we employee the random walk with restart algorithm on the phenotype-gene bilayer network, which combines phenotype similarity network, IGSN as well as phenotype-gene association network, to prioritize candidate disease genes. We investigate the effectiveness of RWRB through leave-one-out cross-validation methods in inferring phenotype-gene relationships. Results show that RWRB is more accurate than state-of-the-art methods on most evaluation metrics. Further analysis shows that the success of RWRB is benefited from IGSN which has a wider coverage and higher reliability comparing with current PPI networks. Moreover, we conduct a comprehensive case study for Alzheimer's disease and predict some novel disease genes that supported by literature. CONCLUSIONS RWRB is an effective and reliable algorithm in prioritizing candidate disease genes on the genomic scale. Software and supplementary information are available at http://nclab.hit.edu.cn/~tianzhen/RWRB/ .
Collapse
Affiliation(s)
- Zhen Tian
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Maozu Guo
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Chunyu Wang
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - LinLin Xing
- School of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001 People’s Republic of China
| | - Lei Wang
- Institute of Health Service and Medical Information Academy of Military Medical Sciences Beijing, Beijing, 100850 China
| | - Yin Zhang
- Institute of Health Service and Medical Information Academy of Military Medical Sciences Beijing, Beijing, 100850 China
| |
Collapse
|
16
|
Ramyachitra D, Nithya R. Construction of reliable heterogeneous network using protein sequence similarity for the prioritization of candidate disease genes. GENE REPORTS 2017. [DOI: 10.1016/j.genrep.2017.04.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
17
|
Feiglin A, Allen BK, Kohane IS, Kong SW. Comprehensive Analysis of Tissue-wide Gene Expression and Phenotype Data Reveals Tissues Affected in Rare Genetic Disorders. Cell Syst 2017; 5:140-148.e2. [PMID: 28822752 PMCID: PMC5928498 DOI: 10.1016/j.cels.2017.06.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2016] [Revised: 03/21/2017] [Accepted: 06/29/2017] [Indexed: 01/23/2023]
Abstract
Linking putatively pathogenic variants to the tissues they affect is necessary for determining the correct diagnostic workup and therapeutic regime in undiagnosed patients. Here, we explored how gene expression across healthy tissues can be used to infer this link. We integrated 6,665 tissue-wide transcriptomes with genetic disorder knowledge bases covering 3,397 diseases. Receiver-operating characteristics (ROC) analysis using expression levels in each tissue and across tissues indicated significant but modest associations between elevated expression and phenotype for most tissues (maximum area under ROC curve = 0.69). At extreme elevation, associations were marked. Upregulation of disease genes in affected tissues was pronounced for genes associated with autosomal dominant over recessive disorders. Pathways enriched for genes expressed and associated with phenotypes highlighted tissue functionality, including lipid metabolism in spleen and DNA repair in adipose tissue. These results suggest features useful for evaluating the likelihood of particular tissue manifestations in genetic disorders. The web address of an interactive platform integrating these data is provided.
Collapse
Affiliation(s)
- Ariel Feiglin
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | - Bryce K Allen
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA.
| | - Sek Won Kong
- Computational Health Informatics Program, Boston Children's Hospital, Boston, MA 02115, USA; Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
18
|
Freytag S, Burgess R, Oliver KL, Bahlo M. brain-coX: investigating and visualising gene co-expression in seven human brain transcriptomic datasets. Genome Med 2017; 9:55. [PMID: 28595657 PMCID: PMC5465565 DOI: 10.1186/s13073-017-0444-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Accepted: 05/26/2017] [Indexed: 12/17/2022] Open
Abstract
Background The pathogenesis of neurological and mental health disorders often involves multiple genes, complex interactions, as well as brain- and development-specific biological mechanisms. These characteristics make identification of disease genes for such disorders challenging, as conventional prioritisation tools are not specifically tailored to deal with the complexity of the human brain. Thus, we developed a novel web-application—brain-coX—that offers gene prioritisation with accompanying visualisations based on seven gene expression datasets in the post-mortem human brain, the largest such resource ever assembled. Results We tested whether our tool can correctly prioritise known genes from 37 brain-specific KEGG pathways and 17 psychiatric conditions. We achieved average sensitivity of nearly 50%, at the same time reaching a specificity of approximately 75%. We also compared brain-coX’s performance to that of its main competitors, Endeavour and ToppGene, focusing on the ability to discover novel associations. Using a subset of the curated SFARI autism gene collection we show that brain-coX’s prioritisations are most similar to SFARI’s own curated gene classifications. Conclusions brain-coX is the first prioritisation and visualisation web-tool targeted to the human brain and can be freely accessed via http://shiny.bioinf.wehi.edu.au/freytag.s/. Electronic supplementary material The online version of this article (doi:10.1186/s13073-017-0444-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Saskia Freytag
- Population Health and Immunity Divison, The Walter and Eliza Hall Institute of Medical Research, 1G Royale Parade, 3052, Parkville, Australia. .,Department of Medical Biology, University of Melbourne, 1G Royale Parade, 3052, Parkville, Australia.
| | - Rosemary Burgess
- Epilepsy Research Centre, Department of Medicine, Austin Health, University of Melbourne, 245 Burgundy Street, 3084, Heidelberg, Australia
| | - Karen L Oliver
- Population Health and Immunity Divison, The Walter and Eliza Hall Institute of Medical Research, 1G Royale Parade, 3052, Parkville, Australia.,Epilepsy Research Centre, Department of Medicine, Austin Health, University of Melbourne, 245 Burgundy Street, 3084, Heidelberg, Australia
| | - Melanie Bahlo
- Population Health and Immunity Divison, The Walter and Eliza Hall Institute of Medical Research, 1G Royale Parade, 3052, Parkville, Australia.,Department of Medical Biology, University of Melbourne, 1G Royale Parade, 3052, Parkville, Australia.,School of Mathematics and Statistics, University of Melbourne, 3010, Parkville, Australia
| |
Collapse
|
19
|
Bertoldi L, Forcato C, Vitulo N, Birolo G, De Pascale F, Feltrin E, Schiavon R, Anglani F, Negrisolo S, Zanetti A, D'Avanzo F, Tomanin R, Faulkner G, Vezzi A, Valle G. QueryOR: a comprehensive web platform for genetic variant analysis and prioritization. BMC Bioinformatics 2017; 18:225. [PMID: 28454514 PMCID: PMC5410040 DOI: 10.1186/s12859-017-1654-4] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 04/26/2017] [Indexed: 11/21/2022] Open
Abstract
Background Whole genome and exome sequencing are contributing to the extraordinary progress in the study of human genetic variants. In this fast developing field, appropriate and easily accessible tools are required to facilitate data analysis. Results Here we describe QueryOR, a web platform suitable for searching among known candidate genes as well as for finding novel gene-disease associations. QueryOR combines several innovative features that make it comprehensive, flexible and easy to use. Instead of being designed on specific datasets, it works on a general XML schema specifying formats and criteria of each data source. Thanks to this flexibility, new criteria can be easily added for future expansion. Currently, up to 70 user-selectable criteria are available, including a wide range of gene and variant features. Moreover, rather than progressively discarding variants taking one criterion at a time, the prioritization is achieved by a global positive selection process that considers all transcript isoforms, thus producing reliable results. QueryOR is easy to use and its intuitive interface allows to handle different kinds of inheritance as well as features related to sharing variants in different patients. QueryOR is suitable for investigating single patients, families or cohorts. Conclusions QueryOR is a comprehensive and flexible web platform eligible for an easy user-driven variant prioritization. It is freely available for academic institutions at http://queryor.cribi.unipd.it/. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1654-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Loris Bertoldi
- CRIBI Biotechnology Centre, University of Padua, Padua, Italy
| | - Claudio Forcato
- CRIBI Biotechnology Centre, University of Padua, Padua, Italy
| | - Nicola Vitulo
- CRIBI Biotechnology Centre, University of Padua, Padua, Italy.,Present address: Department of Biotechnology, University of Verona, Verona, Italy
| | - Giovanni Birolo
- CRIBI Biotechnology Centre, University of Padua, Padua, Italy
| | | | - Erika Feltrin
- CRIBI Biotechnology Centre, University of Padua, Padua, Italy
| | | | - Franca Anglani
- Department of Medicine, University of Padua, Padua, Italy
| | - Susanna Negrisolo
- Department of Women's and Children's Health, University of Padua, Padua, Italy
| | - Alessandra Zanetti
- Department of Women's and Children's Health, University of Padua, Padua, Italy
| | - Francesca D'Avanzo
- Department of Women's and Children's Health, University of Padua, Padua, Italy
| | - Rosella Tomanin
- Department of Women's and Children's Health, University of Padua, Padua, Italy
| | | | | | - Giorgio Valle
- CRIBI Biotechnology Centre, University of Padua, Padua, Italy. .,Department of Biology, University of Padua, Padua, Italy.
| |
Collapse
|
20
|
Masters SL, Lagou V, Jéru I, Baker PJ, Van Eyck L, Parry DA, Lawless D, De Nardo D, Garcia-Perez JE, Dagley LF, Holley CL, Dooley J, Moghaddas F, Pasciuto E, Jeandel PY, Sciot R, Lyras D, Webb AI, Nicholson SE, De Somer L, van Nieuwenhove E, Ruuth-Praz J, Copin B, Cochet E, Medlej-Hashim M, Megarbane A, Schroder K, Savic S, Goris A, Amselem S, Wouters C, Liston A. Familial autoinflammation with neutrophilic dermatosis reveals a regulatory mechanism of pyrin activation. Sci Transl Med 2016; 8:332ra45. [PMID: 27030597 DOI: 10.1126/scitranslmed.aaf1471] [Citation(s) in RCA: 210] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 03/03/2016] [Indexed: 12/16/2022]
Abstract
Pyrin responds to pathogen signals and loss of cellular homeostasis by forming an inflammasome complex that drives the cleavage and secretion of interleukin-1β (IL-1β). Mutations in the B30.2/SPRY domain cause pathogen-independent activation of pyrin and are responsible for the autoinflammatory disease familial Mediterranean fever (FMF). We studied a family with a dominantly inherited autoinflammatory disease, distinct from FMF, characterized by childhood-onset recurrent episodes of neutrophilic dermatosis, fever, elevated acute-phase reactants, arthralgia, and myalgia/myositis. The disease was caused by a mutation in MEFV, the gene encoding pyrin (S242R). The mutation results in the loss of a 14-3-3 binding motif at phosphorylated S242, which was not perturbed by FMF mutations in the B30.2/SPRY domain. However, loss of both S242 phosphorylation and 14-3-3 binding was observed for bacterial effectors that activate the pyrin inflammasome, such as Clostridium difficile toxin B (TcdB). The S242R mutation thus recapitulated the effect of pathogen sensing, triggering inflammasome activation and IL-1β production. Successful therapy targeting IL-1β has been initiated in one patient, resolving pyrin-associated autoinflammation with neutrophilic dermatosis. This disease provides evidence that a guard-like mechanism of pyrin regulation, originally identified for Nod-like receptors in plant innate immunity, also exists in humans.
Collapse
Affiliation(s)
- Seth L Masters
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia. Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia.
| | - Vasiliki Lagou
- Department of Neurosciences, KU Leuven, Leuven 3000, Belgium. Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. Translational Immunology Laboratory, VIB, Leuven 3000, Belgium
| | - Isabelle Jéru
- INSERM, UMR S933, Paris F-75012, France. Université Pierre et Marie Curie-Paris, UMR S933, Paris F-75012, France. Assistance Publique Hôpitaux de Paris, Hôpital Trousseau, Service de Génétique et d'Embryologie médicales, Paris F-75012, France
| | - Paul J Baker
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia. Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Lien Van Eyck
- Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. Translational Immunology Laboratory, VIB, Leuven 3000, Belgium
| | - David A Parry
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Crewe Road South, Edinburgh LS7 4SA, UK
| | - Dylan Lawless
- Leeds Institute of Biomedical and Clinical Sciences, University of Leeds, Wellcome Trust Brenner Building, Saint James's University Hospital, Leeds LS7 4SA, UK
| | - Dominic De Nardo
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia. Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Josselyn E Garcia-Perez
- Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. Translational Immunology Laboratory, VIB, Leuven 3000, Belgium
| | - Laura F Dagley
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia. Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia. Systems Biology and Personalised Medicine Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria 3052, Australia
| | - Caroline L Holley
- Institute for Molecular Bioscience (IMB) and IMB Centre for Inflammation and Disease Research, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - James Dooley
- Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. Translational Immunology Laboratory, VIB, Leuven 3000, Belgium
| | - Fiona Moghaddas
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia. Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Emanuela Pasciuto
- Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. Translational Immunology Laboratory, VIB, Leuven 3000, Belgium
| | - Pierre-Yves Jeandel
- Département de Médecine Interne, Hôpital Archet 1, Université Nice Sophia-Antipolis, 06202 Nice, France
| | - Raf Sciot
- Department of Pathology, KU Leuven, Leuven 3000, Belgium. University Hospitals Leuven, Leuven 3000, Belgium
| | - Dena Lyras
- Department of Microbiology, Monash University, Melbourne, Victoria 3800, Australia
| | - Andrew I Webb
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia. Systems Biology and Personalised Medicine Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria 3052, Australia
| | - Sandra E Nicholson
- Inflammation Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, 3052, Australia. Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia
| | | | - Erika van Nieuwenhove
- Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. Translational Immunology Laboratory, VIB, Leuven 3000, Belgium. University Hospitals Leuven, Leuven 3000, Belgium
| | - Julia Ruuth-Praz
- Université Pierre et Marie Curie-Paris, UMR S933, Paris F-75012, France. Assistance Publique Hôpitaux de Paris, Hôpital Trousseau, Service de Génétique et d'Embryologie médicales, Paris F-75012, France
| | - Bruno Copin
- Assistance Publique Hôpitaux de Paris, Hôpital Trousseau, Service de Génétique et d'Embryologie médicales, Paris F-75012, France
| | - Emmanuelle Cochet
- Assistance Publique Hôpitaux de Paris, Hôpital Trousseau, Service de Génétique et d'Embryologie médicales, Paris F-75012, France
| | - Myrna Medlej-Hashim
- Department of Life and Earth Sciences, Faculty of Sciences II, Lebanese University, Beirut 1102 2801, Lebanon
| | - Andre Megarbane
- Al-Jawhara Center, Arabian Gulf University, Manama 26671, Bahrain
| | - Kate Schroder
- Institute for Molecular Bioscience (IMB) and IMB Centre for Inflammation and Disease Research, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Sinisa Savic
- Department of Allergy and Clinical Immunology, Saint James's University Hospital, Leeds LS9 7TF, UK. National Institute for Health Research-Leeds Musculoskeletal Biomedical Research Unit and Leeds Institute of Rheumatic and Musculoskeletal Medicine, Wellcome Trust Brenner Building, Saint James's University Hospital, Beckett Street, Leeds LS9 7TF, UK
| | - An Goris
- Department of Neurosciences, KU Leuven, Leuven 3000, Belgium
| | - Serge Amselem
- INSERM, UMR S933, Paris F-75012, France. Université Pierre et Marie Curie-Paris, UMR S933, Paris F-75012, France. Assistance Publique Hôpitaux de Paris, Hôpital Trousseau, Service de Génétique et d'Embryologie médicales, Paris F-75012, France
| | - Carine Wouters
- Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. University Hospitals Leuven, Leuven 3000, Belgium.
| | - Adrian Liston
- Department of Microbiology and Immunology, KU Leuven, Leuven 3000, Belgium. Translational Immunology Laboratory, VIB, Leuven 3000, Belgium.
| |
Collapse
|
21
|
Cardozo T, Gupta P, Ni E, Young LM, Tivon D, Felsovalyi K. Data sources for in vivo molecular profiling of human phenotypes. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2016; 8:472-484. [PMID: 27599755 DOI: 10.1002/wsbm.1354] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2016] [Revised: 06/26/2016] [Accepted: 06/27/2016] [Indexed: 11/08/2022]
Abstract
Molecular profiling of human diseases has been approached at the genetic (DNA), expression (RNA), and proteomic (protein) levels. An important goal of these efforts is to map observed molecular patterns to specific, mechanistic organic entities, such as loci in the genome, individual RNA molecules or defined proteins or protein assemblies. Importantly, such maps have been historically approached in the more intuitive context of a theoretical individual cell, but diseases are better described in reality using an in vivo framework, namely a library of several tissue-specific maps. In this article, we review the existing data atlases that can be used for this purpose and identify critical gaps that could move the field forward from cellular to in vivo dimensions. WIREs Syst Biol Med 2016, 8:472-484. doi: 10.1002/wsbm.1354 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Timothy Cardozo
- Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY, USA.
| | - Priyanka Gupta
- Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY, USA.,GeneCentrix Inc., New York, NY, USA
| | - Eric Ni
- Department of Biochemistry and Molecular Pharmacology, NYU School of Medicine, New York, NY, USA.,GeneCentrix Inc., New York, NY, USA
| | - Lauren M Young
- Department of Pathology, NYU School of Medicine, New York, NY, USA
| | | | | |
Collapse
|
22
|
Jiang J, Li W, Liang B, Xie R, Chen B, Huang H, Li Y, He Y, Lv J, He W, Chen L. A Novel Prioritization Method in Identifying Recurrent Venous Thromboembolism-Related Genes. PLoS One 2016; 11:e0153006. [PMID: 27050193 PMCID: PMC4822849 DOI: 10.1371/journal.pone.0153006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2015] [Accepted: 03/21/2016] [Indexed: 12/13/2022] Open
Abstract
Identifying the genes involved in venous thromboembolism (VTE) recurrence is important not only for understanding the pathogenesis but also for discovering the therapeutic targets. We proposed a novel prioritization method called Function-Interaction-Pearson (FIP) by creating gene-disease similarity scores to prioritize candidate genes underling VTE. The scores were calculated by integrating and optimizing three types of resources including gene expression, gene ontology and protein-protein interaction. As a result, 124 out of top 200 prioritized candidate genes had been confirmed in literature, among which there were 34 antithrombotic drug targets. Compared with two well-known gene prioritization tools Endeavour and ToppNet, FIP was shown to have better performance. The approach provides a valuable alternative for drug targets discovery and disease therapy.
Collapse
Affiliation(s)
- Jing Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Wan Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Binhua Liang
- National Microbology Laboratory, Public Health Agency of Canada, Winnipeg, Manitoba, Canada
| | - Ruiqiang Xie
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Binbin Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Hao Huang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Yiran Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Yuehan He
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Junjie Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
| | - Weiming He
- Institute of Opto-electronics, Harbin Institute of Technology, Harbin, Hei Longjiang Province, China
- * E-mail: (LC); (WH)
| | - Lina Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Hei Longjiang Province, China, Postal code: 150081
- * E-mail: (LC); (WH)
| |
Collapse
|
23
|
Wells A, Kopp N, Xu X, O'Brien DR, Yang W, Nehorai A, Adair-Kirk TL, Kopan R, Dougherty JD. The anatomical distribution of genetic associations. Nucleic Acids Res 2015; 43:10804-20. [PMID: 26586807 PMCID: PMC4678833 DOI: 10.1093/nar/gkv1262] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2015] [Accepted: 11/04/2015] [Indexed: 01/13/2023] Open
Abstract
Deeper understanding of the anatomical intermediaries for disease and other complex genetic traits is essential to understanding mechanisms and developing new interventions. Existing ontology tools provide functional, curated annotations for many genes and can be used to develop mechanistic hypotheses; yet information about the spatial expression of genes may be equally useful in interpreting results and forming novel hypotheses for a trait. Therefore, we developed an approach for statistically testing the relationship between gene expression across the body and sets of candidate genes from across the genome. We validated this tool and tested its utility on three applications. First, we show that the expression of genes in associated loci from GWA studies implicates specific tissues for 57 out of 98 traits. Second, we tested the ability of the tool to identify novel relationships between gene expression and phenotypes. Specifically, we experimentally confirmed an underappreciated prediction highlighted by our tool: that white blood cell count--a quantitative trait of the immune system--is genetically modulated by genes expressed in the skin. Finally, using gene lists derived from exome sequencing data, we show that human genes under selective constraint are disproportionately expressed in nervous system tissues.
Collapse
Affiliation(s)
- Alan Wells
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Nathan Kopp
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA Department of Psychiatry, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Xiaoxiao Xu
- The Preston M. Green Department of Electrical and Systems Engineering, Washington University, St. Louis, MO 63130, USA
| | - David R O'Brien
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA Department of Psychiatry, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Wei Yang
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Arye Nehorai
- The Preston M. Green Department of Electrical and Systems Engineering, Washington University, St. Louis, MO 63130, USA
| | - Tracy L Adair-Kirk
- Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Raphael Kopan
- Division of Developmental Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
| | - J D Dougherty
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA Department of Psychiatry, Washington University School of Medicine, St. Louis, MO 63110, USA
| |
Collapse
|