Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Brunner HG, van Driel MA. From syndrome families to functional genomics. Nat Rev Genet 2004;5:545-51. [PMID: 15211356 DOI: 10.1038/nrg1383] [Citation(s) in RCA: 142] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Number

Cited by Other Article(s)

Luo S, Zhang X, Xiao X, Luo W, Yang Z, Tang S, Huang W. Exploring Potential Biomarkers and Molecular Mechanisms of Ischemic Cardiomyopathy and COVID-19 Comorbidity Based on Bioinformatics and Systems Biology. Int J Mol Sci 2023;24:ijms24076511. [PMID: 37047484 PMCID: PMC10094917 DOI: 10.3390/ijms24076511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Revised: 03/28/2023] [Accepted: 03/29/2023] [Indexed: 04/03/2023] Open

Bragina EY, Puzyrev VP. Genetic outline of the hermeneutics of the diseases connection phenomenon in human. Vavilovskii Zhurnal Genet Selektsii 2023;27:7-17. [PMID: 36923482 PMCID: PMC10009484 DOI: 10.18699/vjgb-23-03] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 12/25/2022] [Accepted: 12/26/2022] [Indexed: 03/11/2023] Open

Chen L, Yu YN, Liu J, Chen YY, Wang B, Qi YF, Guan S, Liu X, Li B, Zhang YY, Hu Y, Wang Z. Modular networks and genomic variation during progression from stable angina pectoris through ischemic cardiomyopathy to chronic heart failure. Mol Med 2022;28:140. [DOI: 10.1186/s10020-022-00569-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 11/04/2022] [Indexed: 11/28/2022] Open

Abstract Abstract Background Analyzing disease–disease relationships plays an important role for understanding etiology, disease classification, and drug repositioning. However, as cardiovascular diseases with causative links, the molecular relationship among stable angina pectoris (SAP), ischemic cardiomyopathy (ICM) and chronic heart failure (CHF) is not clear. Methods In this study, by integrating the multi-database data, we constructed paired disease progression modules (PDPMs) to identified relationship among SAP, ICM and CHF based on module reconstruction pairs (MRPs) of K-value calculation (a Euclidean distance optimization by integrating module topology parameters and their weights) methods. Finally, enrichment analysis, literature validation and structural variation (SV) were performed to verify the relationship between the three diseases in PDPMs. Results Total 16 PDPMs were found with K > 0.3777 among SAP, ICM and CHF, in which 6 pairs in SAP–ICM, 5 pairs for both ICM–CHF and SAP–CHF. SAP–ICM was the most closely related by having the smallest average K-value (K = 0.3899) while the maximum is SAP–CHF (K = 0.4006). According to the function of the validation gene, inflammatory response were through each stage of SAP–ICM–CHF, while SAP–ICM was uniquely involved in fibrosis, and genes were related in affecting the upstream of PI3K–Akt signaling pathway. 4 of the 11 genes (FLT1, KDR, ANGPT2 and PGF) in SAP–ICM–CHF related to angiogenesis in HIF-1 signaling pathway. Furthermore, we identified 62.96% SVs were protein deletion in SAP–ICM–CHF, and 53.85% SVs were defined as protein replication in SAP–ICM, while ICM–CHF genes were mainly affected by protein deletion. Conclusion The PDPMs analysis approach combined with genomic structural variation provides a new avenue for determining target associations contributing to disease progression and reveals that inflammation and angiogenesis may be important links among SAP, ICM and CHF progression. Collapse

Network-Based Methods for Approaching Human Pathologies from a Phenotypic Point of View. Genes (Basel) 2022;13:genes13061081. [PMID: 35741843 PMCID: PMC9222217 DOI: 10.3390/genes13061081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/10/2022] [Accepted: 06/14/2022] [Indexed: 01/27/2023] Open

Van De Weghe JC, Gomez A, Doherty D. The Joubert-Meckel-Nephronophthisis Spectrum of Ciliopathies. Annu Rev Genomics Hum Genet 2022;23:301-329. [PMID: 35655331 DOI: 10.1146/annurev-genom-121321-093528] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Tang X, Xiao Q, Yu K. Breast Cancer Candidate Gene Detection Through Integration of Subcellular Localization Data With Protein–Protein Interaction Networks. IEEE Trans Nanobioscience 2020;19:556-561. [DOI: 10.1109/tnb.2020.2990178] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Gamba A, Salmona M, Bazzoni G. Quantitative analysis of proteins which are members of the same protein complex but cause locus heterogeneity in disease. Sci Rep 2020;10:10423. [PMID: 32591566 PMCID: PMC7320193 DOI: 10.1038/s41598-020-66836-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 05/26/2020] [Indexed: 12/28/2022] Open

Abduljaleel Z, Athar M, Al-Allaf FA, Al-Dehlawi S, Vazquez JR. Association of functional variants and protein-to-protein physical interactions of human MutY homolog linked with familial adenomatous polyposis and colorectal cancer syndrome. Noncoding RNA Res 2020;4:155-173. [PMID: 32072083 PMCID: PMC7012779 DOI: 10.1016/j.ncrna.2019.11.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 09/26/2019] [Accepted: 11/19/2019] [Indexed: 11/26/2022] Open

Down-regulation of TUFM impairs host cell interaction and virulence by Paracoccidioides brasiliensis. Sci Rep 2019;9:17206. [PMID: 31748561 PMCID: PMC6868139 DOI: 10.1038/s41598-019-51540-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Accepted: 07/31/2019] [Indexed: 12/13/2022] Open

Knaus A, Kortüm F, Kleefstra T, Stray-Pedersen A, Đukić D, Murakami Y, Gerstner T, van Bokhoven H, Iqbal Z, Horn D, Kinoshita T, Hempel M, Krawitz PM. Mutations in PIGU Impair the Function of the GPI Transamidase Complex, Causing Severe Intellectual Disability, Epilepsy, and Brain Anomalies. Am J Hum Genet 2019;105:395-402. [PMID: 31353022 DOI: 10.1016/j.ajhg.2019.06.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Accepted: 06/07/2019] [Indexed: 12/11/2022] Open

Yu L, Gao L. Human Pathway-Based Disease Network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019;16:1240-1249. [PMID: 29990107 DOI: 10.1109/tcbb.2017.2774802] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Dozmorov MG. Disease classification: from phenotypic similarity to integrative genomics and beyond. Brief Bioinform 2019;20:1769-1780. [DOI: 10.1093/bib/bby049] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Revised: 05/01/2018] [Indexed: 02/06/2023] Open

The Discovery of a LEMD2-Associated Nuclear Envelopathy with Early Progeroid Appearance Suggests Advanced Applications for AI-Driven Facial Phenotyping. Am J Hum Genet 2019;104:749-757. [PMID: 30905398 DOI: 10.1016/j.ajhg.2019.02.021] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 02/16/2019] [Indexed: 12/16/2022] Open

Abstract

Over a relatively short period of time, the clinical geneticist's "toolbox" has been expanded by machine-learning algorithms for image analysis, which can be applied to the task of syndrome identification on the basis of facial photographs, but these technologies harbor potential beyond the recognition of established phenotypes. Here, we comprehensively characterized two individuals with a hitherto unknown genetic disorder caused by the same de novo mutation in LEMD2 (c.1436C>T;p.Ser479Phe), the gene which encodes the nuclear envelope protein LEM domain-containing protein 2 (LEMD2). Despite different ages and ethnic backgrounds, both individuals share a progeria-like facial phenotype and a distinct combination of physical and neurologic anomalies, such as growth retardation; hypoplastic jaws crowded with multiple supernumerary, yet unerupted, teeth; and cerebellar intention tremor. Immunofluorescence analyses of patient fibroblasts revealed mutation-induced disturbance of nuclear architecture, recapitulating previously published data in LEMD2-deficient cell lines, and additional experiments suggested mislocalization of mutant LEMD2 protein within the nuclear lamina. Computational analysis of facial features with two different deep neural networks showed phenotypic proximity to other nuclear envelopathies. One of the algorithms, when trained to recognize syndromic similarity (rather than specific syndromes) in an unsupervised approach, clustered both individuals closely together, providing hypothesis-free hints for a common genetic etiology. We show that a recurrent de novo mutation in LEMD2 causes a nuclear envelopathy whose prognosis in adolescence is relatively good in comparison to that of classical Hutchinson-Gilford progeria syndrome, and we suggest that the application of artificial intelligence to the analysis of patient images can facilitate the discovery of new genetic disorders.

Collapse

Zhu X, Shen X, Jiang X, Wei K, He T, Ma Y, Liu J, Hu X. Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data. BMC Bioinformatics 2018;19:505. [PMID: 30577738 PMCID: PMC6302369 DOI: 10.1186/s12859-018-2537-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open

Alzoubi D, Desouki AA, Lercher MJ. Alleles of a gene differ in pleiotropy, often mediated through currency metabolite production, in E. coli and yeast metabolic simulations. Sci Rep 2018;8:17252. [PMID: 30467356 PMCID: PMC6250661 DOI: 10.1038/s41598-018-35092-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Accepted: 10/22/2018] [Indexed: 11/09/2022] Open

Liu J, Li M, Luo XJ, Su B. Systems-level analysis of risk genes reveals the modular nature of schizophrenia. Schizophr Res 2018;201:261-269. [PMID: 29789256 DOI: 10.1016/j.schres.2018.05.015] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Revised: 05/10/2018] [Accepted: 05/12/2018] [Indexed: 12/31/2022]

Garcia-Vaquero ML, Gama-Carvalho M, Rivas JDL, Pinto FR. Searching the overlap between network modules with specific betweeness (S2B) and its application to cross-disease analysis. Sci Rep 2018;8:11555. [PMID: 30068933 PMCID: PMC6070533 DOI: 10.1038/s41598-018-29990-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 07/23/2018] [Indexed: 12/14/2022] Open

Lee JJY, Gottlieb MM, Lever J, Jones SJM, Blau N, van Karnebeek CDM, Wasserman WW. Text-based phenotypic profiles incorporating biochemical phenotypes of inborn errors of metabolism improve phenomics-based diagnosis. J Inherit Metab Dis 2018;41:555-562. [PMID: 29340838 PMCID: PMC5959948 DOI: 10.1007/s10545-017-0125-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Revised: 12/01/2017] [Accepted: 12/05/2017] [Indexed: 01/28/2023]

Ma C, Gu C, Huo Y, Li X, Luo XJ. The integrated landscape of causal genes and pathways in schizophrenia. Transl Psychiatry 2018;8:67. [PMID: 29540662 PMCID: PMC5851982 DOI: 10.1038/s41398-018-0114-x] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open

Abstract

Genome-wide association studies (GWAS) have identified more than 100 loci that show robust association with schizophrenia risk. However, due to the complexity of linkage disequilibrium and gene regulatory, it is challenging to pinpoint the causal genes at the risk loci and translate the genetic findings from GWAS into disease mechanism and clinical treatment. Here we systematically predicted the plausible candidate causal genes for schizophrenia at genome-wide level. We utilized different approaches and strategies to predict causal genes for schizophrenia, including Sherlock, SMR, DAPPLE, Prix Fixe, NetWAS, and DEPICT. By integrating the results from different prediction approaches, we identified six top candidates that represent promising causal genes for schizophrenia, including CNTN4, GATAD2A, GPM6A, MMP16, PSMA4, and TCF4. Besides, we also identified 35 additional high-confidence causal genes for schizophrenia. The identified causal genes showed distinct spatio-temporal expression patterns in developing and adult human brain. Cell-type-specific expression analysis indicated that the expression level of the predicted causal genes was significantly higher in neurons compared with oligodendrocytes and microglia (P < 0.05). We found that synaptic transmission-related genes were significantly enriched among the identified causal genes (P < 0.05), providing further support for the dysregulation of synaptic transmission in schizophrenia. Finally, we showed that the top six causal genes are dysregulated in schizophrenia cases compared with controls and knockdown of these genes impaired the proliferation of neuronal cells. Our study depicts the landscape of plausible schizophrenia causal genes for the first time. Further genetic and functional validation of these genes will provide mechanistic insights into schizophrenia pathogenesis and may facilitate to provide potential targets for future therapeutics and diagnostics.

Collapse

Yang CP, Li X, Wu Y, Shen Q, Zeng Y, Xiong Q, Wei M, Chen C, Liu J, Huo Y, Li K, Xue G, Yao YG, Zhang C, Li M, Chen Y, Luo XJ. Comprehensive integrative analyses identify GLT8D1 and CSNK2B as schizophrenia risk genes. Nat Commun 2018;9:838. [PMID: 29483533 PMCID: PMC5826945 DOI: 10.1038/s41467-018-03247-3] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Accepted: 01/29/2018] [Indexed: 01/01/2023] Open

Affiliation(s)

Cui-Ping Yang Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
Xiaoyan Li Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
Yong Wu Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
Qiushuo Shen Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650204, China
Yong Zeng Department of Psychiatry, The First Affiliated Hospital of Kunming Medical College, Kunming, Yunnan, 650031, China
Qiuxia Xiong Department of Psychiatry, The First Affiliated Hospital of Kunming Medical College, Kunming, Yunnan, 650031, China
Mengping Wei State Key Laboratory of Membrane Biology, PKU-IDG/McGovern Institute for Brain Research, School of Life Sciences, Peking University, Beijing, 100871, China
Chunhui Chen State Key Laboratory of Cognitive Neuroscience and Learning, and IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
Jiewei Liu Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
Yongxia Huo Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
Kaiqin Li Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
Gui Xue State Key Laboratory of Cognitive Neuroscience and Learning, and IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
Yong-Gang Yao Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.,CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, 200031, China
Chen Zhang State Key Laboratory of Membrane Biology, PKU-IDG/McGovern Institute for Brain Research, School of Life Sciences, Peking University, Beijing, 100871, China
Ming Li Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.,CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, 200031, China
Yongbin Chen Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnna, 650223, China.
Xiong-Jian Luo Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnna, 650223, China.

Collapse

White JJ, Mazzeu JF, Coban-Akdemir Z, Bayram Y, Bahrambeigi V, Hoischen A, van Bon BWM, Gezdirici A, Gulec EY, Ramond F, Touraine R, Thevenon J, Shinawi M, Beaver E, Heeley J, Hoover-Fong J, Durmaz CD, Karabulut HG, Marzioglu-Ozdemir E, Cayir A, Duz MB, Seven M, Price S, Ferreira BM, Vianna-Morgante AM, Ellard S, Parrish A, Stals K, Flores-Daboub J, Jhangiani SN, Gibbs RA, Brunner HG, Sutton VR, Lupski JR, Carvalho CMB. WNT Signaling Perturbations Underlie the Genetic Heterogeneity of Robinow Syndrome. Am J Hum Genet 2018;102:27-43. [PMID: 29276006 DOI: 10.1016/j.ajhg.2017.10.002] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Accepted: 10/06/2017] [Indexed: 12/12/2022] Open

Affiliation(s)

Janson J White Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA
Juliana F Mazzeu University of Brasilia, Brasilia 70910, Brazil; Robinow Syndrome Foundation, Anoka, MN 55303, USA
Zeynep Coban-Akdemir Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA
Yavuz Bayram Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA
Vahid Bahrambeigi Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Graduate Program in Diagnostic Genetics, School of Health Professions, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
Alexander Hoischen Department of Human Genetics, Radboud Institute of Molecular Life Sciences, Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands; Department of Internal Medicine and Radboud Center for Infectious Diseases (RCI), Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands
Bregje W M van Bon Department of Human Genetics, Radboud Institute of Molecular Life Sciences, Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands
Alper Gezdirici Department of Medical Genetics, Kanuni Sultan Suleyman Training and Research Hospital, Istanbul 34303, Turkey
Elif Yilmaz Gulec Department of Medical Genetics, Kanuni Sultan Suleyman Training and Research Hospital, Istanbul 34303, Turkey
Francis Ramond Service de Génétique, CHU-Hôpital Nord, 42000 Saint-Etienne, France
Renaud Touraine Service de Génétique, CHU-Hôpital Nord, 42000 Saint-Etienne, France
Julien Thevenon Inserm UMR 1231 GAD team, Genetics of Developmental Anomalies, Université de Bourgogne-Franche Comté, 21000 Dijon, France; FHU-TRANSLAD, Université de Bourgogne, 21000 CHU Dijon, France; Centre de génétique, Hôpital Couple-Enfant, CHU de Grenoble-Alpes, 38700 La Tronche, France
Marwan Shinawi Division of Genetics and Genomic Medicine, Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
Erin Beaver Mercy Clinic-Kids Genetics, Mercy Children's Hospital St. Louis, St. Louis, MO 63141, USA
Jennifer Heeley Mercy Clinic-Kids Genetics, Mercy Children's Hospital St. Louis, St. Louis, MO 63141, USA
Julie Hoover-Fong Greenberg Center for Skeletal Dysplasias, McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University, Baltimore, MD 21287, USA
Ceren D Durmaz Department of Medical Genetics, Ankara University School of Medicine, 06100 Ankara, Turkey
Halil Gurhan Karabulut Department of Medical Genetics, Ankara University School of Medicine, 06100 Ankara, Turkey
Ebru Marzioglu-Ozdemir Department of Medical Genetics, Erzurum Regional and Training Hospital, 25070 Erzurum, Turkey
Atilla Cayir Erzurum Training and Research Hospital, Department of Pediatric Endocrinology, 25070 Erzurum, Turkey
Mehmet B Duz Department of Medical Genetics, Cerrahpasa Medical School, Istanbul University, 34452 Istanbul, Turkey
Mehmet Seven Department of Medical Genetics, Cerrahpasa Medical School, Istanbul University, 34452 Istanbul, Turkey
Susan Price Oxford Centre for Genomic Medicine, Nuffield Orthopaedic Centre, Oxford OX3 7LD, UK
Barbara Merfort Ferreira University of Brasilia, Brasilia 70910, Brazil
Angela M Vianna-Morgante Department of Genetics and Evolutionary Biology, Institute of Biosciences, Sao Paulo - SP 05508-090, Brazil
Sian Ellard Department of Molecular Genetics, Royal Devon and Exeter NHS Foundation Trust, Exeter EX2 5DW, UK; Institute of Biomedical and Clinical Science, University of Exeter Medical School, Exeter EX1 2LU, UK
Andrew Parrish Department of Molecular Genetics, Royal Devon and Exeter NHS Foundation Trust, Exeter EX2 5DW, UK
Karen Stals Department of Molecular Genetics, Royal Devon and Exeter NHS Foundation Trust, Exeter EX2 5DW, UK
Josue Flores-Daboub Department of Pediatric Genetics, University of Utah School of Medicine, Salt Lake City, UT 84108, USA
Shalini N Jhangiani Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Richard A Gibbs Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
Han G Brunner Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands; Department of Clinical Genetics, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center, 6202 AZ Maastricht, the Netherlands
V Reid Sutton Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA
James R Lupski Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA
Claudia M B Carvalho Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA.

Collapse

Chen Y, Xu R. Context-sensitive network-based disease genetics prediction and its implications in drug discovery. Bioinformatics 2017;33:1031-1039. [PMID: 28062449 DOI: 10.1093/bioinformatics/btw737] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2016] [Accepted: 11/19/2016] [Indexed: 01/05/2023] Open

Abstract

Motivation

Disease phenotype networks play an important role in computational approaches to identifying new disease-gene associations. Current disease phenotype networks often model disease relationships based on pairwise similarities, therefore ignore the specific context on how two diseases are connected. In this study, we propose a new strategy to model disease associations using context-sensitive networks (CSNs). We developed a CSN-based phenome-driven approach for disease genetics prediction, and investigated the translational potential of the predicted genes in drug discovery.

Results

We constructed CSNs by directly connecting diseases with associated phenotypes. Here, we constructed two CSNs using different data sources; the two networks contain 26 790 and 13 822 nodes respectively. We integrated the CSNs with a genetic functional relationship network and predicted disease genes using a network-based ranking algorithm. For comparison, we built Similarity-Based disease Networks (SBN) using the same disease phenotype data. In a de novo cross validation for 3324 diseases, the CSN-based approach significantly increased the average rank from top 12.6 to top 8.8% for all tested genes comparing with the SBN-based approach ( p<e-22 ). The area under the receiver operating characteristic curve for the CSN approach was also significantly higher than the SBN approach (0.91 versus 0.87, p<e-3 ). In addition, we predicted genes for Parkinson's disease using CSNs, and demonstrated that the top-ranked genes are highly relevant to PD pathologenesis. We pin-pointed a top-ranked drug target gene for PD, and found its association with neurodegeneration supported by literature. In summary, CSNs lead to significantly improve the disease genetics prediction comparing with SBNs and provide leads for potential drug targets.

Availability and Implementation

nlp.case.edu/public/data/.

Contact

rxx@case.edu.

Collapse

Li YH, Zhang GG, Wang N. Systematic Characterization and Prediction of Human Hypertension Genes. Hypertension 2016;69:349-355. [PMID: 27895194 DOI: 10.1161/hypertensionaha.116.08573] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Revised: 10/19/2016] [Accepted: 11/09/2016] [Indexed: 01/25/2023]

Transcriptome Profiling in Rat Inbred Strains and Experimental Cross Reveals Discrepant Genetic Architecture of Genome-Wide Gene Expression. G3-GENES GENOMES GENETICS 2016;6:3671-3683. [PMID: 27646706 PMCID: PMC5100866 DOI: 10.1534/g3.116.033274] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

Chen Y, Xu R. Phenome-based gene discovery provides information about Parkinson's disease drug targets. BMC Genomics 2016;17 Suppl 5:493. [PMID: 27586503 PMCID: PMC5009520 DOI: 10.1186/s12864-016-2820-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open

Kortvely E, Ueffing M. Gene Structure of the 10q26 Locus: A Clue to Cracking the ARMS2/HTRA1 Riddle? ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016;854:23-9. [PMID: 26427389 DOI: 10.1007/978-3-319-17121-0_4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Chen Y, Li L, Zhang GQ, Xu R. Phenome-driven disease genetics prediction toward drug discovery. Bioinformatics 2015;31:i276-83. [PMID: 26072493 PMCID: PMC4542779 DOI: 10.1093/bioinformatics/btv245] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Abstract

MOTIVATION

Discerning genetic contributions to diseases not only enhances our understanding of disease mechanisms, but also leads to translational opportunities for drug discovery. Recent computational approaches incorporate disease phenotypic similarities to improve the prediction power of disease gene discovery. However, most current studies used only one data source of human disease phenotype. We present an innovative and generic strategy for combining multiple different data sources of human disease phenotype and predicting disease-associated genes from integrated phenotypic and genomic data.

RESULTS

To demonstrate our approach, we explored a new phenotype database from biomedical ontologies and constructed Disease Manifestation Network (DMN). We combined DMN with mimMiner, which was a widely used phenotype database in disease gene prediction studies. Our approach achieved significantly improved performance over a baseline method, which used only one phenotype data source. In the leave-one-out cross-validation and de novo gene prediction analysis, our approach achieved the area under the curves of 90.7% and 90.3%, which are significantly higher than 84.2% (P < e(-4)) and 81.3% (P < e(-12)) for the baseline approach. We further demonstrated that our predicted genes have the translational potential in drug discovery. We used Crohn's disease as an example and ranked the candidate drugs based on the rank of drug targets. Our gene prediction approach prioritized druggable genes that are likely to be associated with Crohn's disease pathogenesis, and our rank of candidate drugs successfully prioritized the Food and Drug Administration-approved drugs for Crohn's disease. We also found literature evidence to support a number of drugs among the top 200 candidates. In summary, we demonstrated that a novel strategy combining unique disease phenotype data with system approaches can lead to rapid drug discovery.

AVAILABILITY AND IMPLEMENTATION

nlp.

CASE

edu/public/data/DMN

Collapse

Network-assisted analysis of primary Sjögren's syndrome GWAS data in Han Chinese. Sci Rep 2015;5:18855. [PMID: 26686423 PMCID: PMC4685393 DOI: 10.1038/srep18855] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Accepted: 11/05/2015] [Indexed: 12/23/2022] Open

Chong J, Buckingham K, Jhangiani S, Boehm C, Sobreira N, Smith J, Harrell T, McMillin M, Wiszniewski W, Gambin T, Coban Akdemir Z, Doheny K, Scott A, Avramopoulos D, Chakravarti A, Hoover-Fong J, Mathews D, Witmer P, Ling H, Hetrick K, Watkins L, Patterson K, Reinier F, Blue E, Muzny D, Kircher M, Bilguvar K, López-Giráldez F, Sutton V, Tabor H, Leal S, Gunel M, Mane S, Gibbs R, Boerwinkle E, Hamosh A, Shendure J, Lupski J, Lifton R, Valle D, Nickerson D, Bamshad M, Bamshad MJ. The Genetic Basis of Mendelian Phenotypes: Discoveries, Challenges, and Opportunities. Am J Hum Genet 2015;97:199-215. [PMID: 26166479 DOI: 10.1016/j.ajhg.2015.06.009] [Citation(s) in RCA: 449] [Impact Index Per Article: 49.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2015] [Indexed: 01/06/2023] Open

Abstract

Discovering the genetic basis of a Mendelian phenotype establishes a causal link between genotype and phenotype, making possible carrier and population screening and direct diagnosis. Such discoveries also contribute to our knowledge of gene function, gene regulation, development, and biological mechanisms that can be used for developing new therapeutics. As of February 2015, 2,937 genes underlying 4,163 Mendelian phenotypes have been discovered, but the genes underlying ∼50% (i.e., 3,152) of all known Mendelian phenotypes are still unknown, and many more Mendelian conditions have yet to be recognized. This is a formidable gap in biomedical knowledge. Accordingly, in December 2011, the NIH established the Centers for Mendelian Genomics (CMGs) to provide the collaborative framework and infrastructure necessary for undertaking large-scale whole-exome sequencing and discovery of the genetic variants responsible for Mendelian phenotypes. In partnership with 529 investigators from 261 institutions in 36 countries, the CMGs assessed 18,863 samples from 8,838 families representing 579 known and 470 novel Mendelian phenotypes as of January 2015. This collaborative effort has identified 956 genes, including 375 not previously associated with human health, that underlie a Mendelian phenotype. These results provide insight into study design and analytical strategies, identify novel mechanisms of disease, and reveal the extensive clinical variability of Mendelian phenotypes. Discovering the gene underlying every Mendelian phenotype will require tackling challenges such as worldwide ascertainment and phenotypic characterization of families affected by Mendelian conditions, improvement in sequencing and analytical techniques, and pervasive sharing of phenotypic and genomic data among researchers, clinicians, and families.

Collapse

Le DH. A novel method for identifying disease associated protein complexes based on functional similarity protein complex networks. Algorithms Mol Biol 2015;10:14. [PMID: 25969691 PMCID: PMC4427953 DOI: 10.1186/s13015-015-0044-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 04/01/2015] [Indexed: 12/21/2022] Open

Abstract

Background

Protein complexes formed by non-covalent interaction among proteins play important roles in cellular functions. Computational and purification methods have been used to identify many protein complexes and their cellular functions. However, their roles in terms of causing disease have not been well discovered yet. There exist only a few studies for the identification of disease-associated protein complexes. However, they mostly utilize complicated heterogeneous networks which are constructed based on an out-of-date database of phenotype similarity network collected from literature. In addition, they only apply for diseases for which tissue-specific data exist.

Methods

In this study, we propose a method to identify novel disease-protein complex associations. First, we introduce a framework to construct functional similarity protein complex networks where two protein complexes are functionally connected by either shared protein elements, shared annotating GO terms or based on protein interactions between elements in each protein complex. Second, we propose a simple but effective neighborhood-based algorithm, which yields a local similarity measure, to rank disease candidate protein complexes.

Results

Comparing the predictive performance of our proposed algorithm with that of two state-of-the-art network propagation algorithms including one we used in our previous study, we found that it performed statistically significantly better than that of these two algorithms for all the constructed functional similarity protein complex networks. In addition, it ran about 32 times faster than these two algorithms. Moreover, our proposed method always achieved high performance in terms of AUC values irrespective of the ways to construct the functional similarity protein complex networks and the used algorithms. The performance of our method was also higher than that reported in some existing methods which were based on complicated heterogeneous networks. Finally, we also tested our method with prostate cancer and selected the top 100 highly ranked candidate protein complexes. Interestingly, 69 of them were evidenced since at least one of their protein elements are known to be associated with prostate cancer.

Conclusions

Our proposed method, including the framework to construct functional similarity protein complex networks and the neighborhood-based algorithm on these networks, could be used for identification of novel disease-protein complex associations.

Electronic supplementary material

The online version of this article (doi:10.1186/s13015-015-0044-6) contains supplementary material, which is available to authorized users.

Collapse

Understanding multicellular function and disease with human tissue-specific networks. Nat Genet 2015;47:569-76. [PMID: 25915600 PMCID: PMC4828725 DOI: 10.1038/ng.3259] [Citation(s) in RCA: 543] [Impact Index Per Article: 60.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2014] [Accepted: 03/06/2015] [Indexed: 12/17/2022]

RecRWR: a recursive random walk method for improved identification of diseases. BIOMED RESEARCH INTERNATIONAL 2015;2015:747156. [PMID: 25874227 PMCID: PMC4385608 DOI: 10.1155/2015/747156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/22/2014] [Revised: 10/17/2014] [Accepted: 10/31/2014] [Indexed: 12/02/2022]

Ghiassian SD, Menche J, Barabási AL. A DIseAse MOdule Detection (DIAMOnD) algorithm derived from a systematic analysis of connectivity patterns of disease proteins in the human interactome. PLoS Comput Biol 2015;11:e1004120. [PMID: 25853560 PMCID: PMC4390154 DOI: 10.1371/journal.pcbi.1004120] [Citation(s) in RCA: 216] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Accepted: 01/09/2015] [Indexed: 01/08/2023] Open

Abstract

The observation that disease associated proteins often interact with each other has fueled the development of network-based approaches to elucidate the molecular mechanisms of human disease. Such approaches build on the assumption that protein interaction networks can be viewed as maps in which diseases can be identified with localized perturbation within a certain neighborhood. The identification of these neighborhoods, or disease modules, is therefore a prerequisite of a detailed investigation of a particular pathophenotype. While numerous heuristic methods exist that successfully pinpoint disease associated modules, the basic underlying connectivity patterns remain largely unexplored. In this work we aim to fill this gap by analyzing the network properties of a comprehensive corpus of 70 complex diseases. We find that disease associated proteins do not reside within locally dense communities and instead identify connectivity significance as the most predictive quantity. This quantity inspires the design of a novel Disease Module Detection (DIAMOnD) algorithm to identify the full disease module around a set of known disease proteins. We study the performance of the algorithm using well-controlled synthetic data and systematically validate the identified neighborhoods for a large corpus of diseases.

Diseases are rarely the result of an abnormality in a single gene, but involve a whole cascade of interactions between several cellular processes. To disentangle these complex interactions it is necessary to study genotype-phenotype relationships in the context of protein-protein interaction networks. Our analysis of 70 diseases shows that disease proteins are not randomly scattered within these networks, but agglomerate in specific regions, suggesting the existence of specific disease modules for each disease. The identification of these modules is the first step towards elucidating the biological mechanisms of a disease or for a targeted search of drug targets. We present a systematic analysis of the connectivity patterns of disease proteins and determine the most predictive topological property for their identification. This allows us to rationally design a reliable and efficient Disease Module Detection algorithm (DIAMOnD).

Collapse

Keppler-Noreuil KM, Rios JJ, Parker VE, Semple RK, Lindhurst MJ, Sapp JC, Alomari A, Ezaki M, Dobyns W, Biesecker LG. PIK3CA-related overgrowth spectrum (PROS): diagnostic and testing eligibility criteria, differential diagnosis, and evaluation. Am J Med Genet A 2015;167A:287-95. [PMID: 25557259 PMCID: PMC4480633 DOI: 10.1002/ajmg.a.36836] [Citation(s) in RCA: 317] [Impact Index Per Article: 35.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Revised: 09/29/2014] [Accepted: 09/30/2014] [Indexed: 01/20/2023]

Abstract

Somatic activating mutations in the phosphatidylinositol-3-kinase/AKT/mTOR pathway underlie heterogeneous segmental overgrowth phenotypes. Because of the extreme differences among patients, we sought to characterize the phenotypic spectrum associated with different genotypes and mutation burdens, including a better understanding of associated complications and natural history. Historically, the clinical diagnoses in patients with PIK3CA activating mutations have included Fibroadipose hyperplasia or Overgrowth (FAO), Hemihyperplasia Multiple Lipomatosis (HHML), Congenital Lipomatous Overgrowth, Vascular Malformations, Epidermal Nevi, Scoliosis/Skeletal and Spinal (CLOVES) syndrome, macrodactyly, Fibroadipose Infiltrating Lipomatosis, and the related megalencephaly syndromes, Megalencephaly-Capillary Malformation (MCAP or M-CM) and Dysplastic Megalencephaly (DMEG). A workshop was convened at the National Institutes of Health (NIH) to discuss and develop a consensus document regarding diagnosis and treatment of patients with PIK3CA-associated somatic overgrowth disorders. Participants in the workshop included a group of researchers from several institutions who have been studying these disorders and have published their findings, as well as representatives from patient-advocacy and support groups. The umbrella term of "PIK3CA-Related Overgrowth Spectrum (PROS)" was agreed upon to encompass both the known and emerging clinical entities associated with somatic PIK3CA mutations including, macrodactyly, FAO, HHML, CLOVES, and related megalencephaly conditions. Key clinical diagnostic features and criteria for testing were proposed, and testing approaches summarized. Preliminary recommendations for a uniform approach to assessment of overgrowth and molecular diagnostic testing were determined. Future areas to address include the surgical management of overgrowth tissue and vascular anomalies, the optimal approach to thrombosis risk, and the testing of potential pharmacologic therapies.

Collapse

Xu W, Jiang X, Hu X, Li G. Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization. BMC Med Genomics 2014;7 Suppl 2:S1. [PMID: 25350393 PMCID: PMC4243097 DOI: 10.1186/1755-8794-7-s2-s1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few attempts have been made to visualize the phenotypic relationships between different human diseases from a machine learning perspective. The proposed research, it is anticipated, will visually assist researchers in quickly disambiguating symptoms which can confound the timely and accurate diagnosis of a disease.

METHODS

Our method is primarily based on multiple maps t-SNE (mm-tSNE), which is a probabilistic method for visualizing data points in multiple low dimensional spaces. We improved mm-tSNE by adding a Laplacian regularization term and subsequently provide an algorithm for optimizing the new objective function. The advantage of Laplacian regularization is that it adopts clustering structures of variables and provides more sparsity to the estimated parameters.

RESULTS

In order to further assess our modified mm-tSNE algorithm from a comparative standpoint, we reexamined two social network datasets used by the previous authors. Subsequently, we apply our method on phenotype dataset. In all these cases, our proposed method demonstrated better performance than the original version of mm-tSNE, as measured by the neighbourhood preservation ratio.

CONCLUSIONS

Phenotype grouping reflects the nature of human disease genetics. Thus, phenotype visualization may be complementary to investigate candidate genes for diseases as well as functional relations between genes and proteins. These relationships can be modelled by the modified mm-tSNE method. The modified mm-tSNE can be applied directly in other domain including social and biological datasets.

Collapse

Chen Y, Xu R. Mining cancer-specific disease comorbidities from a large observational health database. Cancer Inform 2014;13:37-44. [PMID: 25392682 PMCID: PMC4216041 DOI: 10.4137/cin.s13893] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2014] [Revised: 04/29/2014] [Accepted: 04/30/2014] [Indexed: 12/28/2022] Open

Chen Y, Zhang X, Zhang GQ, Xu R. Comparative analysis of a novel disease phenotype network based on clinical manifestations. J Biomed Inform 2014;53:113-20. [PMID: 25277758 DOI: 10.1016/j.jbi.2014.09.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2014] [Revised: 08/18/2014] [Accepted: 09/21/2014] [Indexed: 12/21/2022]

Garcia-Alonso L, Jiménez-Almazán J, Carbonell-Caballero J, Vela-Boza A, Santoyo-López J, Antiñolo G, Dopazo J. The role of the interactome in the maintenance of deleterious variability in human populations. Mol Syst Biol 2014;10:752. [PMID: 25261458 PMCID: PMC4299661 DOI: 10.15252/msb.20145222] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2014] [Revised: 08/23/2014] [Accepted: 08/28/2014] [Indexed: 12/25/2022] Open

Jiang L, Edwards SM, Thomsen B, Workman CT, Guldbrandtsen B, Sørensen P. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records. BMC Bioinformatics 2014;15:315. [PMID: 25253562 PMCID: PMC4181406 DOI: 10.1186/1471-2105-15-315] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2013] [Accepted: 09/17/2014] [Indexed: 12/12/2022] Open

Abstract

BACKGROUND

Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization.

RESULTS

We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance.

CONCLUSION

We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data from genome-wide association studies, and will help in the understanding of how the associated genetic variants influence disease or quantitative phenotypes.

Collapse

Honti F, Meader S, Webber C. Unbiased functional clustering of gene variants with a phenotypic-linkage network. PLoS Comput Biol 2014;10:e1003815. [PMID: 25166029 PMCID: PMC4148192 DOI: 10.1371/journal.pcbi.1003815] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 07/14/2014] [Indexed: 01/04/2023] Open

Abstract

Groupwise functional analysis of gene variants is becoming standard in next-generation sequencing studies. As the function of many genes is unknown and their classification to pathways is scant, functional associations between genes are often inferred from large-scale omics data. Such data types—including protein–protein interactions and gene co-expression networks—are used to examine the interrelations of the implicated genes. Statistical significance is assessed by comparing the interconnectedness of the mutated genes with that of random gene sets. However, interconnectedness can be affected by confounding bias, potentially resulting in false positive findings. We show that genes implicated through de novo sequence variants are biased in their coding-sequence length and longer genes tend to cluster together, which leads to exaggerated p-values in functional studies; we present here an integrative method that addresses these bias. To discern molecular pathways relevant to complex disease, we have inferred functional associations between human genes from diverse data types and assessed them with a novel phenotype-based method. Examining the functional association between de novo gene variants, we control for the heretofore unexplored confounding bias in coding-sequence length. We test different data types and networks and find that the disease-associated genes cluster more significantly in an integrated phenotypic-linkage network than in other gene networks. We present a tool of superior power to identify functional associations among genes mutated in the same disease even after accounting for significant sequencing study bias and demonstrate the suitability of this method to functionally cluster variant genes underlying polygenic disorders.

Plenty of gene variants have been associated with a disease, yet most of the heritability, along with the molecular basis, of common diseases remains unexplained. However, it is widely thought that the products of genes whose mutations are implicated in the same disease function together in the same biological pathways and it is the disruption of these pathways that underlies the disease. Such pathways are not well defined and their identification could help elucidate disease mechanisms. Consequently, groupwise functional analyses of gene variants to identify common disease-relevant pathways are becoming standard in next-generation sequencing studies, but we find that these analyses are confounded by coding-sequence length bias. We control for these bias and describe a phenotype-based approach which outperforms other methods in discerning functional associations among the disease-associated genes. We also demonstrate the suitability of this method to functionally dissect the gene variants underlying a complex disorder, the identified functional clusters offering insight into disease mechanisms.

Collapse

Pagnan NAB, Visinoni ÁF. Update on ectodermal dysplasias clinical classification. Am J Med Genet A 2014;164A:2415-23. [PMID: 25098893 DOI: 10.1002/ajmg.a.36616] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Accepted: 04/14/2014] [Indexed: 01/30/2023]

Human symptoms-disease network. Nat Commun 2014;5:4212. [PMID: 24967666 DOI: 10.1038/ncomms5212] [Citation(s) in RCA: 316] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2013] [Accepted: 05/27/2014] [Indexed: 12/19/2022] Open

Yang P, Li X, Chua HN, Kwoh CK, Ng SK. Ensemble positive unlabeled learning for disease gene identification. PLoS One 2014;9:e97079. [PMID: 24816822 PMCID: PMC4016241 DOI: 10.1371/journal.pone.0097079] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 04/14/2014] [Indexed: 11/24/2022] Open

Abstract

An increasing number of genes have been experimentally confirmed in recent years as causative genes to various human diseases. The newly available knowledge can be exploited by machine learning methods to discover additional unknown genes that are likely to be associated with diseases. In particular, positive unlabeled learning (PU learning) methods, which require only a positive training set P (confirmed disease genes) and an unlabeled set U (the unknown candidate genes) instead of a negative training set N, have been shown to be effective in uncovering new disease genes in the current scenario. Using only a single source of data for prediction can be susceptible to bias due to incompleteness and noise in the genomic data and a single machine learning predictor prone to bias caused by inherent limitations of individual methods. In this paper, we propose an effective PU learning framework that integrates multiple biological data sources and an ensemble of powerful machine learning classifiers for disease gene identification. Our proposed method integrates data from multiple biological sources for training PU learning classifiers. A novel ensemble-based PU learning method EPU is then used to integrate multiple PU learning classifiers to achieve accurate and robust disease gene predictions. Our evaluation experiments across six disease groups showed that EPU achieved significantly better results compared with various state-of-the-art prediction methods as well as ensemble learning classifiers. Through integrating multiple biological data sources for training and the outputs of an ensemble of PU learning classifiers for prediction, we are able to minimize the potential bias and errors in individual data sources and machine learning algorithms to achieve more accurate and robust disease gene predictions. In the future, our EPU method provides an effective framework to integrate the additional biological and computational resources for better disease gene predictions.

Collapse

Zhang SW, Shao DD, Zhang SY, Wang YB. Prioritization of candidate disease genes by enlarging the seed set and fusing information of the network topology and gene expression. MOLECULAR BIOSYSTEMS 2014;10:1400-8. [PMID: 24695957 DOI: 10.1039/c3mb70588a] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

Abstract

The identification of disease genes is very important not only to provide greater understanding of gene function and cellular mechanisms which drive human disease, but also to enhance human disease diagnosis and treatment. Recently, high-throughput techniques have been applied to detect dozens or even hundreds of candidate genes. However, experimental approaches to validate the many candidates are usually time-consuming, tedious and expensive, and sometimes lack reproducibility. Therefore, numerous theoretical and computational methods (e.g. network-based approaches) have been developed to prioritize candidate disease genes. Many network-based approaches implicitly utilize the observation that genes causing the same or similar diseases tend to correlate with each other in gene-protein relationship networks. Of these network approaches, the random walk with restart algorithm (RWR) is considered to be a state-of-the-art approach. To further improve the performance of RWR, we propose a novel method named ESFSC to identify disease-related genes, by enlarging the seed set according to the centrality of disease genes in a network and fusing information of the protein-protein interaction (PPI) network topological similarity and the gene expression correlation. The ESFSC algorithm restarts at all of the nodes in the seed set consisting of the known disease genes and their k-nearest neighbor nodes, then walks in the global network separately guided by the similarity transition matrix constructed with PPI network topological similarity properties and the correlational transition matrix constructed with the gene expression profiles. As a result, all the genes in the network are ranked by weighted fusing the above results of the RWR guided by two types of transition matrices. Comprehensive simulation results of the 10 diseases with 97 known disease genes collected from the Online Mendelian Inheritance in Man (OMIM) database show that ESFSC outperforms existing methods for prioritizing candidate disease genes. The top prediction results of Alzheimer's disease are consistent with previous literature reports.

Collapse

Gustafsson M, Edström M, Gawel D, Nestor CE, Wang H, Zhang H, Barrenäs F, Tojo J, Kockum I, Olsson T, Serra-Musach J, Bonifaci N, Pujana MA, Ernerudh J, Benson M. Integrated genomic and prospective clinical studies show the importance of modular pleiotropy for disease susceptibility, diagnosis and treatment. Genome Med 2014;6:17. [PMID: 24571673 PMCID: PMC4064311 DOI: 10.1186/gm534] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Accepted: 02/21/2014] [Indexed: 12/17/2022] Open

Affiliation(s)

Mika Gustafsson The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
Måns Edström Clinical and Experimental Medicine, Faculty of Health Sciences, Division of Clinical Immunology, Unit of Autoimmunity and Immune Regulation, Linköping University, 58185 Linköping, Sweden
Danuta Gawel The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
Colm E Nestor The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
Hui Wang The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
Huan Zhang The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
Fredrik Barrenäs The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
James Tojo Department of Clinical Neurosciences, Karolinska Institutet and Centrum for Molecular Medicine, 17177 Stockholm, Sweden
Ingrid Kockum Department of Clinical Neurosciences, Karolinska Institutet and Centrum for Molecular Medicine, 17177 Stockholm, Sweden
Tomas Olsson Department of Clinical Neurosciences, Karolinska Institutet and Centrum for Molecular Medicine, 17177 Stockholm, Sweden
Jordi Serra-Musach Cancer and Systems Biology Unit, Catalan Institute of Oncology, IDIBELL, L'Hospitalet del Llobregat, 08908 Barcelona, Spain
Núria Bonifaci Cancer and Systems Biology Unit, Catalan Institute of Oncology, IDIBELL, L'Hospitalet del Llobregat, 08908 Barcelona, Spain
Miguel Angel Pujana Cancer and Systems Biology Unit, Catalan Institute of Oncology, IDIBELL, L'Hospitalet del Llobregat, 08908 Barcelona, Spain
Jan Ernerudh Clinical and Experimental Medicine, Faculty of Health Sciences, Division of Clinical Immunology, Unit of Autoimmunity and Immune Regulation, Linköping University, 58185 Linköping, Sweden
Mikael Benson The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden

Collapse

Network Analysis of Human Disease Comorbidity Patterns Based on Large-Scale Data Mining. BIOINFORMATICS RESEARCH AND APPLICATIONS 2014. [DOI: 10.1007/978-3-319-08171-7_22] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Chen Y, Wu X, Jiang R. Integrating human omics data to prioritize candidate genes. BMC Med Genomics 2013;6:57. [PMID: 24344781 PMCID: PMC3878333 DOI: 10.1186/1755-8794-6-57] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2013] [Accepted: 12/12/2013] [Indexed: 01/07/2023] Open

Biesecker LG. Invited editorial comment-the human phenotype of germlinePIGAmutations. Am J Med Genet A 2013;164A:15-6. [DOI: 10.1002/ajmg.a.36213] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2013] [Accepted: 07/29/2013] [Indexed: 11/11/2022]

Multi-dimensional prioritization of dental caries candidate genes and its enriched dense network modules. PLoS One 2013;8:e76666. [PMID: 24146904 PMCID: PMC3795720 DOI: 10.1371/journal.pone.0076666] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2013] [Accepted: 08/31/2013] [Indexed: 01/14/2023] Open

Hennekam RC, Biesecker LG, Allanson JE, Hall JG, Opitz JM, Temple IK, Carey JC. Elements of morphology: General terms for congenital anomalies. Am J Med Genet A 2013;161A:2726-33. [DOI: 10.1002/ajmg.a.36249] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2013] [Accepted: 08/26/2013] [Indexed: 11/08/2022]