1
|
Luo S, Zhang X, Xiao X, Luo W, Yang Z, Tang S, Huang W. Exploring Potential Biomarkers and Molecular Mechanisms of Ischemic Cardiomyopathy and COVID-19 Comorbidity Based on Bioinformatics and Systems Biology. Int J Mol Sci 2023; 24:ijms24076511. [PMID: 37047484 PMCID: PMC10094917 DOI: 10.3390/ijms24076511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Revised: 03/28/2023] [Accepted: 03/29/2023] [Indexed: 04/03/2023] Open
Abstract
Cardiovascular complications combined with COVID-19 (SARS-CoV-2) lead to a poor prognosis in patients. The common pathogenesis of ischemic cardiomyopathy (ICM) and COVID-19 is still unclear. Here, we explored potential molecular mechanisms and biomarkers for ICM and COVID-19. Common differentially expressed genes (DEGs) of ICM (GSE5406) and COVID-19 (GSE164805) were identified using GEO2R. We performed enrichment and protein–protein interaction analyses and screened key genes. To confirm the diagnostic performance for these hub genes, we used external datasets (GSE116250 and GSE211979) and plotted ROC curves. Transcription factor and microRNA regulatory networks were constructed for the validated hub genes. Finally, drug prediction and molecular docking validation were performed using cMAP. We identified 81 common DEGs, many of which were enriched in terms of their relation to angiogenesis. Three DEGs were identified as key hub genes (HSP90AA1, HSPA9, and SRSF1) in the protein–protein interaction analysis. These hub genes had high diagnostic performance in the four datasets (AUC > 0.7). Mir-16-5p and KLF9 transcription factor co-regulated these hub genes. The drugs vindesine and ON-01910 showed good binding performance to the hub genes. We identified HSP90AA1, HSPA9, and SRSF1 as markers for the co-pathogenesis of ICM and COVID-19, and showed that co-pathogenesis of ICM and COVID-19 may be related to angiogenesis. Vindesine and ON-01910 were predicted as potential therapeutic agents. Our findings will contribute to a deeper understanding of the comorbidity of ICM with COVID-19.
Collapse
|
2
|
Bragina EY, Puzyrev VP. Genetic outline of the hermeneutics of the diseases connection phenomenon in human. Vavilovskii Zhurnal Genet Selektsii 2023; 27:7-17. [PMID: 36923482 PMCID: PMC10009484 DOI: 10.18699/vjgb-23-03] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 12/25/2022] [Accepted: 12/26/2022] [Indexed: 03/11/2023] Open
Abstract
The structure of diseases in humans is heterogeneous, which is manifested by various combinations of diseases, including comorbidities associated with a common pathogenetic mechanism, as well as diseases that rarely manifest together. Recently, there has been a growing interest in studying the patterns of development of not individual diseases, but entire families associated with common pathogenetic mechanisms and common genes involved in their development. Studies of this problem make it possible to isolate an essential genetic component that controls the formation of disease conglomerates in a complex way through functionally interacting modules of individual genes in gene networks. An analytical review of studies on the problems of various aspects of the combination of diseases is the purpose of this study. The review uses the metaphor of a hermeneutic circle to understand the structure of regular relationships between diseases, and provides a conceptual framework related to the study of multiple diseases in an individual. The existing terminology is considered in relation to them, including multimorbidity, polypathies, comorbidity, conglomerates, families, "second diseases", syntropy and others. Here we summarize the key results that are extremely useful, primarily for describing the genetic architecture of diseases of a multifactorial nature. Summaries of the research problem of the disease connection phenomenon allow us to approach the systematization and natural classification of diseases. From practical healthcare perspective, the description of the disease connection phenomenon is crucial for expanding the clinician's interpretive horizon and moving beyond narrow, disease-specific therapeutic decisions.
Collapse
Affiliation(s)
- E Yu Bragina
- Research Institute of Medical Genetics, Tomsk National Research Medical Center of the Russian Academy of Sciences, Tomsk, Russia
| | - V P Puzyrev
- Research Institute of Medical Genetics, Tomsk National Research Medical Center of the Russian Academy of Sciences, Tomsk, Russia Siberian State Medical University, Tomsk, Russia
| |
Collapse
|
3
|
Chen L, Yu YN, Liu J, Chen YY, Wang B, Qi YF, Guan S, Liu X, Li B, Zhang YY, Hu Y, Wang Z. Modular networks and genomic variation during progression from stable angina pectoris through ischemic cardiomyopathy to chronic heart failure. Mol Med 2022; 28:140. [DOI: 10.1186/s10020-022-00569-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 11/04/2022] [Indexed: 11/28/2022] Open
Abstract
Abstract
Background
Analyzing disease–disease relationships plays an important role for understanding etiology, disease classification, and drug repositioning. However, as cardiovascular diseases with causative links, the molecular relationship among stable angina pectoris (SAP), ischemic cardiomyopathy (ICM) and chronic heart failure (CHF) is not clear.
Methods
In this study, by integrating the multi-database data, we constructed paired disease progression modules (PDPMs) to identified relationship among SAP, ICM and CHF based on module reconstruction pairs (MRPs) of K-value calculation (a Euclidean distance optimization by integrating module topology parameters and their weights) methods. Finally, enrichment analysis, literature validation and structural variation (SV) were performed to verify the relationship between the three diseases in PDPMs.
Results
Total 16 PDPMs were found with K > 0.3777 among SAP, ICM and CHF, in which 6 pairs in SAP–ICM, 5 pairs for both ICM–CHF and SAP–CHF. SAP–ICM was the most closely related by having the smallest average K-value (K = 0.3899) while the maximum is SAP–CHF (K = 0.4006). According to the function of the validation gene, inflammatory response were through each stage of SAP–ICM–CHF, while SAP–ICM was uniquely involved in fibrosis, and genes were related in affecting the upstream of PI3K–Akt signaling pathway. 4 of the 11 genes (FLT1, KDR, ANGPT2 and PGF) in SAP–ICM–CHF related to angiogenesis in HIF-1 signaling pathway. Furthermore, we identified 62.96% SVs were protein deletion in SAP–ICM–CHF, and 53.85% SVs were defined as protein replication in SAP–ICM, while ICM–CHF genes were mainly affected by protein deletion.
Conclusion
The PDPMs analysis approach combined with genomic structural variation provides a new avenue for determining target associations contributing to disease progression and reveals that inflammation and angiogenesis may be important links among SAP, ICM and CHF progression.
Collapse
|
4
|
Network-Based Methods for Approaching Human Pathologies from a Phenotypic Point of View. Genes (Basel) 2022; 13:genes13061081. [PMID: 35741843 PMCID: PMC9222217 DOI: 10.3390/genes13061081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/10/2022] [Accepted: 06/14/2022] [Indexed: 01/27/2023] Open
Abstract
Network and systemic approaches to studying human pathologies are helping us to gain insight into the molecular mechanisms of and potential therapeutic interventions for human diseases, especially for complex diseases where large numbers of genes are involved. The complex human pathological landscape is traditionally partitioned into discrete “diseases”; however, that partition is sometimes problematic, as diseases are highly heterogeneous and can differ greatly from one patient to another. Moreover, for many pathological states, the set of symptoms (phenotypes) manifested by the patient is not enough to diagnose a particular disease. On the contrary, phenotypes, by definition, are directly observable and can be closer to the molecular basis of the pathology. These clinical phenotypes are also important for personalised medicine, as they can help stratify patients and design personalised interventions. For these reasons, network and systemic approaches to pathologies are gradually incorporating phenotypic information. This review covers the current landscape of phenotype-centred network approaches to study different aspects of human diseases.
Collapse
|
5
|
Van De Weghe JC, Gomez A, Doherty D. The Joubert-Meckel-Nephronophthisis Spectrum of Ciliopathies. Annu Rev Genomics Hum Genet 2022; 23:301-329. [PMID: 35655331 DOI: 10.1146/annurev-genom-121321-093528] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The Joubert syndrome (JS), Meckel syndrome (MKS), and nephronophthisis (NPH) ciliopathy spectrum could be the poster child for advances and challenges in Mendelian human genetics over the past half century. Progress in understanding these conditions illustrates many core concepts of human genetics. The JS phenotype alone is caused by pathogenic variants in more than 40 genes; remarkably, all of the associated proteins function in and around the primary cilium. Primary cilia are near-ubiquitous, microtubule-based organelles that play crucial roles in development and homeostasis. Protruding from the cell, these cellular antennae sense diverse signals and mediate Hedgehog and other critical signaling pathways. Ciliary dysfunction causes many human conditions termed ciliopathies, which range from multiple congenital malformations to adult-onset single-organ failure. Research on the genetics of the JS-MKS-NPH spectrum has spurred extensive functional work exploring the broadly important role of primary cilia in health and disease. This functional work promises to illuminate the mechanisms underlying JS-MKS-NPH in humans, identify therapeutic targets across genetic causes, and generate future precision treatments. Expected final online publication date for the Annual Review of Genomics and Human Genetics, Volume 23 is October 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
| | - Arianna Gomez
- Department of Pediatrics, University of Washington, Seattle, Washington, USA; .,Molecular Medicine and Mechanisms of Disease Program, Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, USA;
| | - Dan Doherty
- Department of Pediatrics, University of Washington, Seattle, Washington, USA; .,Center for Integrative Brain Research, Seattle Children's Research Institute, Seattle, Washington, USA;
| |
Collapse
|
6
|
Tang X, Xiao Q, Yu K. Breast Cancer Candidate Gene Detection Through Integration of Subcellular Localization Data With Protein–Protein Interaction Networks. IEEE Trans Nanobioscience 2020; 19:556-561. [DOI: 10.1109/tnb.2020.2990178] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
7
|
Gamba A, Salmona M, Bazzoni G. Quantitative analysis of proteins which are members of the same protein complex but cause locus heterogeneity in disease. Sci Rep 2020; 10:10423. [PMID: 32591566 PMCID: PMC7320193 DOI: 10.1038/s41598-020-66836-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 05/26/2020] [Indexed: 12/28/2022] Open
Abstract
It is still largely unknown how mutations in different genes cause similar diseases – a condition known as locus heterogeneity. A likely explanation is that the different proteins encoded by the locus heterogeneity genes participate in the same biological function and, specifically, that they belong to the same protein complex. Here we report that, in up to 30% of the instances of locus heterogeneity, the disease-causing proteins are indeed members of the same protein complex. Moreover, we observed that, in many instances, the diseases and protein complexes only partially intersect. Among the possible explanations, we surmised that some genes that encode proteins in the complex have not yet been reported as causing disease and are therefore candidate disease genes. Mutations of known human disease genes and murine orthologs of candidate disease genes that encode proteins in the same protein complex do in fact often cause similar phenotypes in humans and mice. Furthermore, we found that the disease-complex intersection is not only incomplete but also non-univocal, with many examples of one disease intersecting more than one protein complex or one protein complex intersecting more than one disease. These limits notwithstanding, this study shows that action on proteins in the same complex is a widespread pathogenic mechanism underlying numerous instances of locus heterogeneity.
Collapse
Affiliation(s)
- Alessio Gamba
- Department of Biochemistry and Molecular Pharmacology Istituto di Ricerche Farmacologiche Mario Negri IRCCS Via Mario Negri 2, I-20156, Milano, Italy
| | - Mario Salmona
- Department of Biochemistry and Molecular Pharmacology Istituto di Ricerche Farmacologiche Mario Negri IRCCS Via Mario Negri 2, I-20156, Milano, Italy
| | - Gianfranco Bazzoni
- Department of Biochemistry and Molecular Pharmacology Istituto di Ricerche Farmacologiche Mario Negri IRCCS Via Mario Negri 2, I-20156, Milano, Italy.
| |
Collapse
|
8
|
Abduljaleel Z, Athar M, Al-Allaf FA, Al-Dehlawi S, Vazquez JR. Association of functional variants and protein-to-protein physical interactions of human MutY homolog linked with familial adenomatous polyposis and colorectal cancer syndrome. Noncoding RNA Res 2020; 4:155-173. [PMID: 32072083 PMCID: PMC7012779 DOI: 10.1016/j.ncrna.2019.11.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 09/26/2019] [Accepted: 11/19/2019] [Indexed: 11/26/2022] Open
Abstract
The human gene MUTYH codes for a DNA glycosylase involved in the repair of oxidative DNA damage. Faulty MUTYH protein activity causes the accumulation of G→T transversions due to unrepaired 8-oxoG:A mismatches. MUTYH germ-line mutations in humans are linked with a recessive form of Familial Adenomatous Polyposis (FAP) and colorectal cancer predisposition. We studied the repair capacity of variants identified in MUTYH-associated polyposis (MAP) patients. MAP is inherited in an autosomal recessive type due to mutations in MUTYH (Y165C, G382D, P54S, A22V, Q63R, G45D, S136P and N43S), indicating that both copies of the gene become inactivated. However, the parents of an individual with an autosomal recessive condition may serve as carriers, each harboring one copy of the mutated gene without showing signs or symptoms of MAP. Six protein partners have been associated with MUTYH, four via direct physical interactions, namely, hMSH6, hPCNA, hRPA1, and hAPEX1. We examined, for the first time, specific interactions of these protein partners with MAP-associated MUTYH mutants using molecular dynamics simulations. The approach provided tools for exploration of the conformational energy landscape accessible to protein partners. The investigation also determined the impact before and after energy minimization of protein-protein interactions and binding affinities of MUTYH wild type and mutant forms, as well as the interactions with other proteins. Taken together, this study provided new insights into the role of MUTYH and its interacting proteins in MAP.
Collapse
Affiliation(s)
- Zainularifeen Abduljaleel
- Department of Medical Genetics, Faculty of Medicine, Umm Al-Qura University, P.O.Box: 715, Makkah 21955, Saudi Arabia.,Science and Technology Unit, Umm Al-Qura University, P.O. Box: 715, Makkah 21955, Saudi Arabia.,Bircham University, Av. Sierra, 2, 28691 Villanueva de la Canada, Madrid, Spain
| | - Mohammad Athar
- Department of Medical Genetics, Faculty of Medicine, Umm Al-Qura University, P.O.Box: 715, Makkah 21955, Saudi Arabia.,Science and Technology Unit, Umm Al-Qura University, P.O. Box: 715, Makkah 21955, Saudi Arabia
| | - Faisal A Al-Allaf
- Department of Medical Genetics, Faculty of Medicine, Umm Al-Qura University, P.O.Box: 715, Makkah 21955, Saudi Arabia.,Science and Technology Unit, Umm Al-Qura University, P.O. Box: 715, Makkah 21955, Saudi Arabia
| | - Saied Al-Dehlawi
- The Regional Laboratory, Ministry of Health (MOH), P.O. Box: 6251, Makkah, Saudi Arabia
| | - Jose R Vazquez
- Bircham University, Av. Sierra, 2, 28691 Villanueva de la Canada, Madrid, Spain
| |
Collapse
|
9
|
Down-regulation of TUFM impairs host cell interaction and virulence by Paracoccidioides brasiliensis. Sci Rep 2019; 9:17206. [PMID: 31748561 PMCID: PMC6868139 DOI: 10.1038/s41598-019-51540-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Accepted: 07/31/2019] [Indexed: 12/13/2022] Open
Abstract
The genus Paracoccidioides consist of dimorphic fungi geographically limited to the subtropical regions of Latin America, which are responsible for causing deep systemic mycosis in humans. However, the molecular mechanisms by which Paracoccidioides spp. causes the disease remain poorly understood. Paracoccidioides spp. harbor genes that encode proteins involved in host cell interaction and mitochondrial function, which together are required for pathogenicity and mediate virulence. Previously, we identified TufM (previously known as EF-Tu) in Paracoccidioides brasiliensis (PbTufM) and suggested that it may be involved in the pathogenicity of this fungus. In this study, we examined the effects of downregulating PbTUFM using a silenced strain with a 55% reduction in PbTUFM expression obtained by antisense-RNA (aRNA) technology. Silencing PbTUFM yielded phenotypic differences, such as altered translation elongation, respiratory defects, increased sensitivity of yeast cells to reactive oxygen stress, survival after macrophage phagocytosis, and reduced interaction with pneumocytes. These results were associated with reduced virulence in Galleria mellonella and murine infection models, emphasizing the importance of PbTufM in the full virulence of P. brasiliensis and its potential as a target for antifungal agents against paracoccidioidomycosis.
Collapse
|
10
|
Knaus A, Kortüm F, Kleefstra T, Stray-Pedersen A, Đukić D, Murakami Y, Gerstner T, van Bokhoven H, Iqbal Z, Horn D, Kinoshita T, Hempel M, Krawitz PM. Mutations in PIGU Impair the Function of the GPI Transamidase Complex, Causing Severe Intellectual Disability, Epilepsy, and Brain Anomalies. Am J Hum Genet 2019; 105:395-402. [PMID: 31353022 DOI: 10.1016/j.ajhg.2019.06.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Accepted: 06/07/2019] [Indexed: 12/11/2022] Open
Abstract
The glycosylphosphatidylinositol (GPI) anchor links over 150 proteins to the cell surface and is present on every cell type. Many of these proteins play crucial roles in neuronal development and function. Mutations in 18 of the 29 genes implicated in the biosynthesis of the GPI anchor have been identified as the cause of GPI biosynthesis deficiencies (GPIBDs) in humans. GPIBDs are associated with intellectual disability and seizures as their cardinal features. An essential component of the GPI transamidase complex is PIGU, along with PIGK, PIGS, PIGT, and GPAA1, all of which link GPI-anchored proteins (GPI-APs) onto the GPI anchor in the endoplasmic reticulum (ER). Here, we report two homozygous missense mutations (c.209T>A [p.Ile70Lys] and c.1149C>A [p.Asn383Lys]) in five individuals from three unrelated families. All individuals presented with global developmental delay, severe-to-profound intellectual disability, muscular hypotonia, seizures, brain anomalies, scoliosis, and mild facial dysmorphism. Using multicolor flow cytometry, we determined a characteristic profile for GPI transamidase deficiency. On granulocytes this profile consisted of reduced cell-surface expression of fluorescein-labeled proaerolysin (FLAER), CD16, and CD24, but not of CD55 and CD59; additionally, B cells showed an increased expression of free GPI anchors determined by T5 antibody. Moreover, computer-assisted facial analysis of different GPIBDs revealed a characteristic facial gestalt shared among individuals with mutations in PIGU and GPAA1. Our findings improve our understanding of the role of the GPI transamidase complex in the development of nervous and skeletal systems and expand the clinical spectrum of disorders belonging to the group of inherited GPI-anchor deficiencies.
Collapse
|
11
|
Yu L, Gao L. Human Pathway-Based Disease Network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:1240-1249. [PMID: 29990107 DOI: 10.1109/tcbb.2017.2774802] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Constructing disease-disease similarity network is important in elucidating the associations between the origin and molecular mechanism of diseases, and in researching disease function and medical research. In this paper, we use a high-quality protein interaction network and a collection of pathway databases to construct a Human Pathway-based Disease Network (HPDN) to explore the relationship between diseases and their intrinsic interactions. We find that the similarity of two diseases has a strong correlation with the number of their shared functional pathways and the interaction between their related gene sets. Comparing HPDN with disease networks based on genes and symptoms respectively, we find the three networks have high overlap rates. Additionally, HPDN can predict new disease-disease correlations, which are supported by Comparative Toxicogenomics Database (CTD) benchmark and large-scale biomedical literature database. The comprehensive, high-quality relations between diseases based on pathways can further be applied to study important matters in systems medicine, for instance, drug repurposing. Based on a dense subgraph in our network, we find two drugs, prednisone and folic acid, may have new indications, which will provide potential directions for the treatments of complex diseases.
Collapse
|
12
|
Dozmorov MG. Disease classification: from phenotypic similarity to integrative genomics and beyond. Brief Bioinform 2019; 20:1769-1780. [DOI: 10.1093/bib/bby049] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Revised: 05/01/2018] [Indexed: 02/06/2023] Open
Abstract
Abstract
A fundamental challenge of modern biomedical research is understanding how diseases that are similar on the phenotypic level are similar on the molecular level. Integration of various genomic data sets with the traditionally used phenotypic disease similarity revealed novel genetic and molecular mechanisms and blurred the distinction between monogenic (Mendelian) and complex diseases. Network-based medicine has emerged as a complementary approach for identifying disease-causing genes, genetic mediators, disruptions in the underlying cellular functions and for drug repositioning. The recent development of machine and deep learning methods allow for leveraging real-life information about diseases to refine genetic and phenotypic disease relationships. This review describes the historical development and recent methodological advancements for studying disease classification (nosology).
Collapse
Affiliation(s)
- Mikhail G Dozmorov
- Department of Biostatistics, Virginia Commonwealth University, 830 East Main Street, Richmond, VA, USA
| |
Collapse
|
13
|
The Discovery of a LEMD2-Associated Nuclear Envelopathy with Early Progeroid Appearance Suggests Advanced Applications for AI-Driven Facial Phenotyping. Am J Hum Genet 2019; 104:749-757. [PMID: 30905398 DOI: 10.1016/j.ajhg.2019.02.021] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 02/16/2019] [Indexed: 12/16/2022] Open
Abstract
Over a relatively short period of time, the clinical geneticist's "toolbox" has been expanded by machine-learning algorithms for image analysis, which can be applied to the task of syndrome identification on the basis of facial photographs, but these technologies harbor potential beyond the recognition of established phenotypes. Here, we comprehensively characterized two individuals with a hitherto unknown genetic disorder caused by the same de novo mutation in LEMD2 (c.1436C>T;p.Ser479Phe), the gene which encodes the nuclear envelope protein LEM domain-containing protein 2 (LEMD2). Despite different ages and ethnic backgrounds, both individuals share a progeria-like facial phenotype and a distinct combination of physical and neurologic anomalies, such as growth retardation; hypoplastic jaws crowded with multiple supernumerary, yet unerupted, teeth; and cerebellar intention tremor. Immunofluorescence analyses of patient fibroblasts revealed mutation-induced disturbance of nuclear architecture, recapitulating previously published data in LEMD2-deficient cell lines, and additional experiments suggested mislocalization of mutant LEMD2 protein within the nuclear lamina. Computational analysis of facial features with two different deep neural networks showed phenotypic proximity to other nuclear envelopathies. One of the algorithms, when trained to recognize syndromic similarity (rather than specific syndromes) in an unsupervised approach, clustered both individuals closely together, providing hypothesis-free hints for a common genetic etiology. We show that a recurrent de novo mutation in LEMD2 causes a nuclear envelopathy whose prognosis in adolescence is relatively good in comparison to that of classical Hutchinson-Gilford progeria syndrome, and we suggest that the application of artificial intelligence to the analysis of patient images can facilitate the discovery of new genetic disorders.
Collapse
|
14
|
Zhu X, Shen X, Jiang X, Wei K, He T, Ma Y, Liu J, Hu X. Nonlinear expression and visualization of nonmetric relationships in genetic diseases and microbiome data. BMC Bioinformatics 2018; 19:505. [PMID: 30577738 PMCID: PMC6302369 DOI: 10.1186/s12859-018-2537-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Background The traditional methods of visualizing high-dimensional data objects in low-dimensional metric spaces are subject to the basic limitations of metric space. These limitations result in multidimensional scaling that fails to faithfully represent non-metric similarity data. Results Multiple maps t-SNE (mm-tSNE) has drawn much attention due to the construction of multiple mappings in low-dimensional space to visualize the non-metric pairwise similarity to eliminate the limitations of a single metric map. mm-tSNE regularization combines the intrinsic geometry between data points in a high-dimensional space. The weight of data points on each map is used as the regularization parameter of the manifold, so the weights of similar data points on the same map are also as close as possible. However, these methods use standard momentum methods to calculate parameters of gradient at each iteration, which may lead to erroneous gradient search directions so that the target loss function fails to achieve a better local minimum. In this article, we use a Nesterov momentum method to learn the target loss function and correct each gradient update by looking back at the previous gradient in the candidate search direction. By using indirect second-order information, the algorithm obtains faster convergence than the original algorithm. To further evaluate our approach from a comparative perspective, we conducted experiments on several datasets including social network data, phenotype similarity data, and microbiomic data. Conclusions The experimental results show that the proposed method achieves better results than several versions of mm-tSNE based on three evaluation indicators including the neighborhood preservation ratio (NPR), error rate and time complexity.
Collapse
Affiliation(s)
- Xianchao Zhu
- School of Computer, Central China Normal University, Wuhan, China
| | - Xianjun Shen
- School of Computer, Central China Normal University, Wuhan, China.
| | - Xingpeng Jiang
- School of Computer, Central China Normal University, Wuhan, China
| | - Kaiping Wei
- School of Computer, Central China Normal University, Wuhan, China
| | - Tingting He
- School of Computer, Central China Normal University, Wuhan, China
| | - Yuanyuan Ma
- School of Computer, Central China Normal University, Wuhan, China
| | - Jiaqi Liu
- School of Computer, Central China Normal University, Wuhan, China
| | - Xiaohua Hu
- School of Computer, Central China Normal University, Wuhan, China.,College of Computing and Informatics, Drexel University, Philadelphia, PA, 19104, USA
| |
Collapse
|
15
|
Alzoubi D, Desouki AA, Lercher MJ. Alleles of a gene differ in pleiotropy, often mediated through currency metabolite production, in E. coli and yeast metabolic simulations. Sci Rep 2018; 8:17252. [PMID: 30467356 PMCID: PMC6250661 DOI: 10.1038/s41598-018-35092-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Accepted: 10/22/2018] [Indexed: 11/09/2022] Open
Abstract
A major obstacle to the mapping of genotype-phenotype relationships is pleiotropy, the tendency of mutations to affect seemingly unrelated traits. Pleiotropy has major implications for evolution, development, ageing, and disease. Except for disease data, pleiotropy is almost exclusively estimated from full gene knockouts. However, most deleterious alleles segregating in natural populations do not fully abolish gene function, and the degree to which a polymorphism reduces protein function may influence the number of traits it affects. Utilizing genome-scale metabolic models for Escherichia coli and Saccharomyces cerevisiae, we show that most fitness-reducing full gene knockouts of metabolic genes in these fast-growing microbes have pleiotropic effects, i.e., they compromise the production of multiple biomass components. Alleles of the same metabolic enzyme-encoding gene with increasingly reduced enzymatic function typically affect an increasing number of biomass components. This increasing pleiotropy is often mediated through effects on the generation of currency metabolites such as ATP or NADPH. We conclude that the physiological effects observed in full gene knockouts of metabolic genes will in most cases not be representative for alleles with only partially reduced enzyme capacity or expression level.
Collapse
Affiliation(s)
- Deya Alzoubi
- Institute for Computer Science and Department of Biology, Heinrich Heine University, Universitätsstraße 1, Düsseldorf, D-40221, Germany
| | - Abdelmoneim Amer Desouki
- Institute for Computer Science and Department of Biology, Heinrich Heine University, Universitätsstraße 1, Düsseldorf, D-40221, Germany
| | - Martin J Lercher
- Institute for Computer Science and Department of Biology, Heinrich Heine University, Universitätsstraße 1, Düsseldorf, D-40221, Germany.
| |
Collapse
|
16
|
Liu J, Li M, Luo XJ, Su B. Systems-level analysis of risk genes reveals the modular nature of schizophrenia. Schizophr Res 2018; 201:261-269. [PMID: 29789256 DOI: 10.1016/j.schres.2018.05.015] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Revised: 05/10/2018] [Accepted: 05/12/2018] [Indexed: 12/31/2022]
Abstract
Schizophrenia (SCZ) is a complex mental disorder with high heritability. Genetic studies (especially recent genome-wide association studies) have identified many risk genes for schizophrenia. However, the physical interactions among the proteins encoded by schizophrenia risk genes remain elusive and it is not known whether the identified risk genes converge on common molecular networks or pathways. Here we systematically investigated the network characteristics of schizophrenia risk genes using the high-confidence protein-protein interactions (PPI) from the human interactome. We found that schizophrenia risk genes encode a densely interconnected PPI network (P = 4.15 × 10-31). Compared with the background genes, the schizophrenia risk genes in the interactome have significantly higher degree (P = 5.39 × 10-11), closeness centrality (P = 7.56 × 10-11), betweeness centrality (P = 1.29 × 10-11), clustering coefficient (P = 2.22 × 10-2), and shorter average shortest path length (P = 7.56 × 10-11). Based on the densely interconnected PPI network, we identified 48 hub genes and 4 modules formed by highly interconnected schizophrenia genes. We showed that the proteins encoded by schizophrenia hub genes have significantly more direct physical interactions. Gene ontology (GO) analysis revealed that cell adhesion, cell cycle, immune system response, and GABR-receptor complex categories were enriched in the modules formed by highly interconnected schizophrenia risk genes. Our study reveals that schizophrenia risk genes encode a densely interconnected molecular network and demonstrates the modular nature of schizophrenia.
Collapse
Affiliation(s)
- Jiewei Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Ming Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Kunming, Yunnan, China
| | - Xiong-Jian Luo
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Kunming, Yunnan, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China.
| | - Bing Su
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China.
| |
Collapse
|
17
|
Garcia-Vaquero ML, Gama-Carvalho M, Rivas JDL, Pinto FR. Searching the overlap between network modules with specific betweeness (S2B) and its application to cross-disease analysis. Sci Rep 2018; 8:11555. [PMID: 30068933 PMCID: PMC6070533 DOI: 10.1038/s41598-018-29990-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 07/23/2018] [Indexed: 12/14/2022] Open
Abstract
Discovering disease-associated genes (DG) is strategic for understanding pathological mechanisms. DGs form modules in protein interaction networks and diseases with common phenotypes share more DGs or have more closely interacting DGs. This prompted the development of Specific Betweenness (S2B) to find genes associated with two related diseases. S2B prioritizes genes frequently and specifically present in shortest paths linking two disease modules. Top S2B scores identified genes in the overlap of artificial network modules more than 80% of the times, even with incomplete or noisy knowledge. Applied to Amyotrophic Lateral Sclerosis and Spinal Muscular Atrophy, S2B candidates were enriched in biological processes previously associated with motor neuron degeneration. Some S2B candidates closely interacted in network cliques, suggesting common molecular mechanisms for the two diseases. S2B is a valuable tool for DG prediction, bringing new insights into pathological mechanisms. More generally, S2B can be applied to infer the overlap between other types of network modules, such as functional modules or context-specific subnetworks. An R package implementing S2B is publicly available at https://github.com/frpinto/S2B .
Collapse
Affiliation(s)
- Marina L Garcia-Vaquero
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, C8 bdg, 1749-016, Lisboa, Portugal
| | - Margarida Gama-Carvalho
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, C8 bdg, 1749-016, Lisboa, Portugal
| | - Javier De Las Rivas
- Cancer Research Center (CiC-IBMCC, CSIC/USAL/IBSAL), Consejo Superior de Investigaciones Científicas (CSIC) and Universidad de Salamanca (USAL), Salamanca, Spain
| | - Francisco R Pinto
- University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, C8 bdg, 1749-016, Lisboa, Portugal.
| |
Collapse
|
18
|
Lee JJY, Gottlieb MM, Lever J, Jones SJM, Blau N, van Karnebeek CDM, Wasserman WW. Text-based phenotypic profiles incorporating biochemical phenotypes of inborn errors of metabolism improve phenomics-based diagnosis. J Inherit Metab Dis 2018; 41:555-562. [PMID: 29340838 PMCID: PMC5959948 DOI: 10.1007/s10545-017-0125-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Revised: 12/01/2017] [Accepted: 12/05/2017] [Indexed: 01/28/2023]
Abstract
Phenomics is the comprehensive study of phenotypes at every level of biology: from metabolites to organisms. With high throughput technologies increasing the scope of biological discoveries, the field of phenomics has been developing rapid and precise methods to collect, catalog, and analyze phenotypes. Such methods have allowed phenotypic data to be widely used in medical applications, from assisting clinical diagnoses to prioritizing genomic diagnoses. To channel the benefits of phenomics into the field of inborn errors of metabolism (IEM), we have recently launched IEMbase, an expert-curated knowledgebase of IEM and their disease-characterizing phenotypes. While our efforts with IEMbase have realized benefits, taking full advantage of phenomics requires a comprehensive curation of IEM phenotypes in core phenomics projects, which is dependent upon contributions from the IEM clinical and research community. Here, we assess the inclusion of IEM biochemical phenotypes in a core phenomics project, the Human Phenotype Ontology. We then demonstrate the utility of biochemical phenotypes using a text-based phenomics method to predict gene-disease relationships, showing that the prediction of IEM genes is significantly better using biochemical rather than clinical profiles. The findings herein provide a motivating goal for the IEM community to expand the computationally accessible descriptions of biochemical phenotypes associated with IEM in phenomics resources.
Collapse
Affiliation(s)
- Jessica J Y Lee
- Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Room 3109, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada
| | - Michael M Gottlieb
- Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Room 3109, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada
| | - Jake Lever
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, Canada
| | - Steven J M Jones
- Canada's Michael Smith Genome Sciences Centre, BC Cancer Agency, Vancouver, BC, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Nenad Blau
- Dietmar-Hopp Metabolic Center, Department of General Pediatrics, University Hospital, Heidelberg, Germany
| | - Clara D M van Karnebeek
- Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Room 3109, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada
- Department of Pediatrics, University of British Columbia, Vancouver, BC, Canada
- Departments of Pediatrics and Clinical Genetics, Emma Children's Hospital, Academic Medical Centre, Amsterdam, The Netherlands
| | - Wyeth W Wasserman
- Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, University of British Columbia, Room 3109, 950 West 28th Avenue, Vancouver, BC, V5Z 4H4, Canada.
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
19
|
Abstract
Genome-wide association studies (GWAS) have identified more than 100 loci that show robust association with schizophrenia risk. However, due to the complexity of linkage disequilibrium and gene regulatory, it is challenging to pinpoint the causal genes at the risk loci and translate the genetic findings from GWAS into disease mechanism and clinical treatment. Here we systematically predicted the plausible candidate causal genes for schizophrenia at genome-wide level. We utilized different approaches and strategies to predict causal genes for schizophrenia, including Sherlock, SMR, DAPPLE, Prix Fixe, NetWAS, and DEPICT. By integrating the results from different prediction approaches, we identified six top candidates that represent promising causal genes for schizophrenia, including CNTN4, GATAD2A, GPM6A, MMP16, PSMA4, and TCF4. Besides, we also identified 35 additional high-confidence causal genes for schizophrenia. The identified causal genes showed distinct spatio-temporal expression patterns in developing and adult human brain. Cell-type-specific expression analysis indicated that the expression level of the predicted causal genes was significantly higher in neurons compared with oligodendrocytes and microglia (P < 0.05). We found that synaptic transmission-related genes were significantly enriched among the identified causal genes (P < 0.05), providing further support for the dysregulation of synaptic transmission in schizophrenia. Finally, we showed that the top six causal genes are dysregulated in schizophrenia cases compared with controls and knockdown of these genes impaired the proliferation of neuronal cells. Our study depicts the landscape of plausible schizophrenia causal genes for the first time. Further genetic and functional validation of these genes will provide mechanistic insights into schizophrenia pathogenesis and may facilitate to provide potential targets for future therapeutics and diagnostics.
Collapse
Affiliation(s)
- Changguo Ma
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Chunjie Gu
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Yongxia Huo
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Xiaoyan Li
- 0000000119573309grid.9227.eKey Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223 China
| | - Xiong-Jian Luo
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.
| |
Collapse
|
20
|
Yang CP, Li X, Wu Y, Shen Q, Zeng Y, Xiong Q, Wei M, Chen C, Liu J, Huo Y, Li K, Xue G, Yao YG, Zhang C, Li M, Chen Y, Luo XJ. Comprehensive integrative analyses identify GLT8D1 and CSNK2B as schizophrenia risk genes. Nat Commun 2018; 9:838. [PMID: 29483533 PMCID: PMC5826945 DOI: 10.1038/s41467-018-03247-3] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Accepted: 01/29/2018] [Indexed: 01/01/2023] Open
Abstract
Recent genome-wide association studies (GWAS) have identified multiple risk loci that show strong associations with schizophrenia. However, pinpointing the potential causal genes at the reported loci remains a major challenge. Here we identify candidate causal genes for schizophrenia using an integrative genomic approach. Sherlock integrative analysis shows that ALMS1, GLT8D1, and CSNK2B are schizophrenia risk genes, which are validated using independent brain expression quantitative trait loci (eQTL) data and integrative analysis method (SMR). Consistently, gene expression analysis in schizophrenia cases and controls further supports the potential role of these three genes in the pathogenesis of schizophrenia. Finally, we show that GLT8D1 and CSNK2B knockdown promote the proliferation and inhibit the differentiation abilities of neural stem cells, and alter morphology and synaptic transmission of neurons. These convergent lines of evidence suggest that the ALMS1, CSNK2B, and GLT8D1 genes may be involved in pathophysiology of schizophrenia. More than 100 risk loci for schizophrenia have been identified by genome-wide association studies. Here, the authors apply an integrative genomic approach to prioritize risk genes and validate GLT8D1 and CSNK2B as candidate causal genes by in vitro studies in neural stem cells.
Collapse
Affiliation(s)
- Cui-Ping Yang
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
| | - Xiaoyan Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
| | - Yong Wu
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
| | - Qiushuo Shen
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, 650204, China
| | - Yong Zeng
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical College, Kunming, Yunnan, 650031, China
| | - Qiuxia Xiong
- Department of Psychiatry, The First Affiliated Hospital of Kunming Medical College, Kunming, Yunnan, 650031, China
| | - Mengping Wei
- State Key Laboratory of Membrane Biology, PKU-IDG/McGovern Institute for Brain Research, School of Life Sciences, Peking University, Beijing, 100871, China
| | - Chunhui Chen
- State Key Laboratory of Cognitive Neuroscience and Learning, and IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
| | - Jiewei Liu
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
| | - Yongxia Huo
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
| | - Kaiqin Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
| | - Gui Xue
- State Key Laboratory of Cognitive Neuroscience and Learning, and IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
| | - Yong-Gang Yao
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.,CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Chen Zhang
- State Key Laboratory of Membrane Biology, PKU-IDG/McGovern Institute for Brain Research, School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ming Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China.,CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Yongbin Chen
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnna, 650223, China.
| | - Xiong-Jian Luo
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China. .,Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, Yunnna, 650223, China.
| |
Collapse
|
21
|
White JJ, Mazzeu JF, Coban-Akdemir Z, Bayram Y, Bahrambeigi V, Hoischen A, van Bon BWM, Gezdirici A, Gulec EY, Ramond F, Touraine R, Thevenon J, Shinawi M, Beaver E, Heeley J, Hoover-Fong J, Durmaz CD, Karabulut HG, Marzioglu-Ozdemir E, Cayir A, Duz MB, Seven M, Price S, Ferreira BM, Vianna-Morgante AM, Ellard S, Parrish A, Stals K, Flores-Daboub J, Jhangiani SN, Gibbs RA, Brunner HG, Sutton VR, Lupski JR, Carvalho CMB. WNT Signaling Perturbations Underlie the Genetic Heterogeneity of Robinow Syndrome. Am J Hum Genet 2018; 102:27-43. [PMID: 29276006 DOI: 10.1016/j.ajhg.2017.10.002] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Accepted: 10/06/2017] [Indexed: 12/12/2022] Open
Abstract
Locus heterogeneity characterizes a variety of skeletal dysplasias often due to interacting or overlapping signaling pathways. Robinow syndrome is a skeletal disorder historically refractory to molecular diagnosis, potentially stemming from substantial genetic heterogeneity. All current known pathogenic variants reside in genes within the noncanonical Wnt signaling pathway including ROR2, WNT5A, and more recently, DVL1 and DVL3. However, ∼70% of autosomal-dominant Robinow syndrome cases remain molecularly unsolved. To investigate this missing heritability, we recruited 21 families with at least one family member clinically diagnosed with Robinow or Robinow-like phenotypes and performed genetic and genomic studies. In total, four families with variants in FZD2 were identified as well as three individuals from two families with biallelic variants in NXN that co-segregate with the phenotype. Importantly, both FZD2 and NXN are relevant protein partners in the WNT5A interactome, supporting their role in skeletal development. In addition to confirming that clustered -1 frameshifting variants in DVL1 and DVL3 are the main contributors to dominant Robinow syndrome, we also found likely pathogenic variants in candidate genes GPC4 and RAC3, both linked to the Wnt signaling pathway. These data support an initial hypothesis that Robinow syndrome results from perturbation of the Wnt/PCP pathway, suggest specific relevant domains of the proteins involved, and reveal key contributors in this signaling cascade during human embryonic development. Contrary to the view that non-allelic genetic heterogeneity hampers gene discovery, this study demonstrates the utility of rare disease genomic studies to parse gene function in human developmental pathways.
Collapse
Affiliation(s)
- Janson J White
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA
| | - Juliana F Mazzeu
- University of Brasilia, Brasilia 70910, Brazil; Robinow Syndrome Foundation, Anoka, MN 55303, USA
| | - Zeynep Coban-Akdemir
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA
| | - Yavuz Bayram
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA
| | - Vahid Bahrambeigi
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Graduate Program in Diagnostic Genetics, School of Health Professions, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Alexander Hoischen
- Department of Human Genetics, Radboud Institute of Molecular Life Sciences, Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands; Department of Internal Medicine and Radboud Center for Infectious Diseases (RCI), Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands
| | - Bregje W M van Bon
- Department of Human Genetics, Radboud Institute of Molecular Life Sciences, Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands
| | - Alper Gezdirici
- Department of Medical Genetics, Kanuni Sultan Suleyman Training and Research Hospital, Istanbul 34303, Turkey
| | - Elif Yilmaz Gulec
- Department of Medical Genetics, Kanuni Sultan Suleyman Training and Research Hospital, Istanbul 34303, Turkey
| | - Francis Ramond
- Service de Génétique, CHU-Hôpital Nord, 42000 Saint-Etienne, France
| | - Renaud Touraine
- Service de Génétique, CHU-Hôpital Nord, 42000 Saint-Etienne, France
| | - Julien Thevenon
- Inserm UMR 1231 GAD team, Genetics of Developmental Anomalies, Université de Bourgogne-Franche Comté, 21000 Dijon, France; FHU-TRANSLAD, Université de Bourgogne, 21000 CHU Dijon, France; Centre de génétique, Hôpital Couple-Enfant, CHU de Grenoble-Alpes, 38700 La Tronche, France
| | - Marwan Shinawi
- Division of Genetics and Genomic Medicine, Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Erin Beaver
- Mercy Clinic-Kids Genetics, Mercy Children's Hospital St. Louis, St. Louis, MO 63141, USA
| | - Jennifer Heeley
- Mercy Clinic-Kids Genetics, Mercy Children's Hospital St. Louis, St. Louis, MO 63141, USA
| | - Julie Hoover-Fong
- Greenberg Center for Skeletal Dysplasias, McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University, Baltimore, MD 21287, USA
| | - Ceren D Durmaz
- Department of Medical Genetics, Ankara University School of Medicine, 06100 Ankara, Turkey
| | - Halil Gurhan Karabulut
- Department of Medical Genetics, Ankara University School of Medicine, 06100 Ankara, Turkey
| | - Ebru Marzioglu-Ozdemir
- Department of Medical Genetics, Erzurum Regional and Training Hospital, 25070 Erzurum, Turkey
| | - Atilla Cayir
- Erzurum Training and Research Hospital, Department of Pediatric Endocrinology, 25070 Erzurum, Turkey
| | - Mehmet B Duz
- Department of Medical Genetics, Cerrahpasa Medical School, Istanbul University, 34452 Istanbul, Turkey
| | - Mehmet Seven
- Department of Medical Genetics, Cerrahpasa Medical School, Istanbul University, 34452 Istanbul, Turkey
| | - Susan Price
- Oxford Centre for Genomic Medicine, Nuffield Orthopaedic Centre, Oxford OX3 7LD, UK
| | | | - Angela M Vianna-Morgante
- Department of Genetics and Evolutionary Biology, Institute of Biosciences, Sao Paulo - SP 05508-090, Brazil
| | - Sian Ellard
- Department of Molecular Genetics, Royal Devon and Exeter NHS Foundation Trust, Exeter EX2 5DW, UK; Institute of Biomedical and Clinical Science, University of Exeter Medical School, Exeter EX1 2LU, UK
| | - Andrew Parrish
- Department of Molecular Genetics, Royal Devon and Exeter NHS Foundation Trust, Exeter EX2 5DW, UK
| | - Karen Stals
- Department of Molecular Genetics, Royal Devon and Exeter NHS Foundation Trust, Exeter EX2 5DW, UK
| | - Josue Flores-Daboub
- Department of Pediatric Genetics, University of Utah School of Medicine, Salt Lake City, UT 84108, USA
| | - Shalini N Jhangiani
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Richard A Gibbs
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Han G Brunner
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, 6500 HB Nijmegen, the Netherlands; Department of Clinical Genetics, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center, 6202 AZ Maastricht, the Netherlands
| | - V Reid Sutton
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA
| | - James R Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA; Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA; Texas Children's Hospital, Houston, TX 77030, USA
| | - Claudia M B Carvalho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX 77030, USA.
| |
Collapse
|
22
|
Chen Y, Xu R. Context-sensitive network-based disease genetics prediction and its implications in drug discovery. Bioinformatics 2017; 33:1031-1039. [PMID: 28062449 DOI: 10.1093/bioinformatics/btw737] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2016] [Accepted: 11/19/2016] [Indexed: 01/05/2023] Open
Abstract
Motivation Disease phenotype networks play an important role in computational approaches to identifying new disease-gene associations. Current disease phenotype networks often model disease relationships based on pairwise similarities, therefore ignore the specific context on how two diseases are connected. In this study, we propose a new strategy to model disease associations using context-sensitive networks (CSNs). We developed a CSN-based phenome-driven approach for disease genetics prediction, and investigated the translational potential of the predicted genes in drug discovery. Results We constructed CSNs by directly connecting diseases with associated phenotypes. Here, we constructed two CSNs using different data sources; the two networks contain 26 790 and 13 822 nodes respectively. We integrated the CSNs with a genetic functional relationship network and predicted disease genes using a network-based ranking algorithm. For comparison, we built Similarity-Based disease Networks (SBN) using the same disease phenotype data. In a de novo cross validation for 3324 diseases, the CSN-based approach significantly increased the average rank from top 12.6 to top 8.8% for all tested genes comparing with the SBN-based approach ( p<e-22 ). The area under the receiver operating characteristic curve for the CSN approach was also significantly higher than the SBN approach (0.91 versus 0.87, p<e-3 ). In addition, we predicted genes for Parkinson's disease using CSNs, and demonstrated that the top-ranked genes are highly relevant to PD pathologenesis. We pin-pointed a top-ranked drug target gene for PD, and found its association with neurodegeneration supported by literature. In summary, CSNs lead to significantly improve the disease genetics prediction comparing with SBNs and provide leads for potential drug targets. Availability and Implementation nlp.case.edu/public/data/. Contact rxx@case.edu.
Collapse
|
23
|
Li YH, Zhang GG, Wang N. Systematic Characterization and Prediction of Human Hypertension Genes. Hypertension 2016; 69:349-355. [PMID: 27895194 DOI: 10.1161/hypertensionaha.116.08573] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Revised: 10/19/2016] [Accepted: 11/09/2016] [Indexed: 01/25/2023]
Abstract
Hypertension is a major cardiovascular risk factor and accounts for a large part of cardiovascular mortality. In this work, we analyzed the properties of hypertension genes and found that when compared with genes not yet known to be involved in hypertension regulation, known hypertension genes display distinguishing features: (1) hypertension genes tend to be located at network center; (2) hypertension genes tend to interact with each other; and (3) hypertension genes tend to enrich in certain biological processes and show certain phenotypes. Based on these features, we developed a machine-learning algorithm to predict new hypertension genes. One hundred and seventy-seven candidates were predicted with a posterior probability >0.9. Evidence supporting 17 of the predictions has been found.
Collapse
Affiliation(s)
- Yan-Hui Li
- From the Institute of Cardiovascular Sciences and Key Laboratory of Molecular Cardiovascular Sciences, Ministry of Education, Peking University Health Science Center, Beijing, People's Republic of China (Y.-H.L., N.W.); Special Medical Ward (Geratology Department), First Hospital of Tsinghua University Beijing, People's Republic of China (G.-G.Z.); and The Advanced Institute for Medical Sciences, Dalian Medical University, China (N.W.).
| | - Gai-Gai Zhang
- From the Institute of Cardiovascular Sciences and Key Laboratory of Molecular Cardiovascular Sciences, Ministry of Education, Peking University Health Science Center, Beijing, People's Republic of China (Y.-H.L., N.W.); Special Medical Ward (Geratology Department), First Hospital of Tsinghua University Beijing, People's Republic of China (G.-G.Z.); and The Advanced Institute for Medical Sciences, Dalian Medical University, China (N.W.)
| | - Nanping Wang
- From the Institute of Cardiovascular Sciences and Key Laboratory of Molecular Cardiovascular Sciences, Ministry of Education, Peking University Health Science Center, Beijing, People's Republic of China (Y.-H.L., N.W.); Special Medical Ward (Geratology Department), First Hospital of Tsinghua University Beijing, People's Republic of China (G.-G.Z.); and The Advanced Institute for Medical Sciences, Dalian Medical University, China (N.W.).
| |
Collapse
|
24
|
Transcriptome Profiling in Rat Inbred Strains and Experimental Cross Reveals Discrepant Genetic Architecture of Genome-Wide Gene Expression. G3-GENES GENOMES GENETICS 2016; 6:3671-3683. [PMID: 27646706 PMCID: PMC5100866 DOI: 10.1534/g3.116.033274] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
To test the impact of genetic heterogeneity on cis- and trans-mediated mechanisms of gene expression regulation, we profiled the transcriptome of adipose tissue in 20 inbred congenic strains derived from diabetic Goto-Kakizaki (GK) rats and Brown-Norway (BN) controls, which contain well-defined blocks (1-183 Mb) of genetic polymorphisms, and in 123 genetically heterogeneous rats of an (GK × BN)F2 offspring. Within each congenic we identified 73-1351 differentially expressed genes (DEGs), only 7.7% of which mapped within the congenic blocks, and which may be regulated in cis The remainder localized outside the blocks, and therefore must be regulated in trans Most trans-regulated genes exhibited approximately twofold expression changes, consistent with monoallelic expression. Altered biological pathways were replicated between congenic strains sharing blocks of genetic polymorphisms, but polymorphisms at different loci also had redundant effects on transcription of common distant genes and pathways. We mapped 2735 expression quantitative trait loci (eQTL) in the F2 cross, including 26% predominantly cis-regulated genes, which validated DEGs in congenic strains. A hotspot of >300 eQTL in a 10 cM region of chromosome 1 was enriched in DEGs in a congenic strain. However, many DEGs among GK, BN and congenic strains did not replicate as eQTL in F2 hybrids, demonstrating distinct mechanisms of gene expression when alleles segregate in an outbred population or are fixed homozygous across the entire genome or in short genomic regions. Our analysis provides conceptual advances in our understanding of the complex architecture of genome expression and pathway regulation, and suggests a prominent impact of epistasis and monoallelic expression on gene transcription.
Collapse
|
25
|
Chen Y, Xu R. Phenome-based gene discovery provides information about Parkinson's disease drug targets. BMC Genomics 2016; 17 Suppl 5:493. [PMID: 27586503 PMCID: PMC5009520 DOI: 10.1186/s12864-016-2820-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Parkinson disease (PD) is a severe neurodegenerative disease without curative drugs. The highly complex and heterogeneous disease mechanisms are still unclear. Detecting novel PD associated genes not only contributes in revealing the disease pathogenesis, but also facilitates discovering new targets for drugs. METHODS We propose a phenome-based gene prediction strategy to identify disease-associated genes for PD. We integrated multiple disease phenotype networks, a gene functional relationship network, and known PD genes to predict novel candidate genes. Then we investigated the translational potential of the predicted genes in drug discovery. RESULTS In a cross validation analysis, the average rank for 15 known PD genes is within top 0.8 %. We also tested the algorithm with an independent validation set of 669 PD-associated genes detected by genome-wide association studies. The top ranked genes predicted by our approach are enriched for these validation genes. In addition, our approach prioritized the target genes for FDA-approved PD drugs and the drugs that have been tested for PD in clinical trials. Pathway analysis shows that the prioritized drug target genes are closely associated with PD pathogenesis. The result provides empirical evidence that our computational gene prediction approach identifies novel candidate genes for PD, and has the potential to lead to rapid drug discovery.
Collapse
Affiliation(s)
- Yang Chen
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
| | - Rong Xu
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA.
| |
Collapse
|
26
|
Kortvely E, Ueffing M. Gene Structure of the 10q26 Locus: A Clue to Cracking the ARMS2/HTRA1 Riddle? ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 854:23-9. [PMID: 26427389 DOI: 10.1007/978-3-319-17121-0_4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Age-related macular degeneration (AMD) is a sight-threatening disorder of the central retina. Being the leading cause of visual impairment in senior citizens, it represents a major public health issue in developed countries. Genetic studies of AMD identified two major susceptibility loci on chromosomes 1 and 10. The high-risk allele of the 10q26 locus encompasses three genes, PLEKHA1, ARMS2, and HTRA1 with high linkage disequilibrium and the individual contribution of the encoded proteins to disease etiology remains controversial. While PLEKHA1 and HTRA1 are highly conserved proteins, ARMS2 is only present in primates and can be detected by using RT-PCR. On the other hand, there is no unequivocal evidence for the existence of the encoded protein. However, it has been reported that risk haplotypes only affect the expression of ARMS2 (but not of HTRA1), making ARMS2 the best candidate for being the genuine AMD gene within this locus. Yet, homozygous carriers of a common haplotype carry a premature stop codon in the ARMS2 gene (R38X) and therefore lack ARMS2, but this variant is not associated with AMD. In this work we aimed at characterizing the diversity of transcripts originating from this locus, in order to find new hints on how to resolve this perplexing paradox. We found chimeric transcripts originating from the PLEKHA1 gene but ending in ARMS2. This finding may give a new explanation as to how variants in this locus contribute to AMD.
Collapse
Affiliation(s)
- Elod Kortvely
- Division of Experimental Ophthalmology, University of Tuebingen, Roentgenweg 11, 72076, Tuebingen, Germany.
| | - Marius Ueffing
- Division of Experimental Ophthalmology, University of Tuebingen, Roentgenweg 11, 72076, Tuebingen, Germany.
| |
Collapse
|
27
|
Abstract
MOTIVATION Discerning genetic contributions to diseases not only enhances our understanding of disease mechanisms, but also leads to translational opportunities for drug discovery. Recent computational approaches incorporate disease phenotypic similarities to improve the prediction power of disease gene discovery. However, most current studies used only one data source of human disease phenotype. We present an innovative and generic strategy for combining multiple different data sources of human disease phenotype and predicting disease-associated genes from integrated phenotypic and genomic data. RESULTS To demonstrate our approach, we explored a new phenotype database from biomedical ontologies and constructed Disease Manifestation Network (DMN). We combined DMN with mimMiner, which was a widely used phenotype database in disease gene prediction studies. Our approach achieved significantly improved performance over a baseline method, which used only one phenotype data source. In the leave-one-out cross-validation and de novo gene prediction analysis, our approach achieved the area under the curves of 90.7% and 90.3%, which are significantly higher than 84.2% (P < e(-4)) and 81.3% (P < e(-12)) for the baseline approach. We further demonstrated that our predicted genes have the translational potential in drug discovery. We used Crohn's disease as an example and ranked the candidate drugs based on the rank of drug targets. Our gene prediction approach prioritized druggable genes that are likely to be associated with Crohn's disease pathogenesis, and our rank of candidate drugs successfully prioritized the Food and Drug Administration-approved drugs for Crohn's disease. We also found literature evidence to support a number of drugs among the top 200 candidates. In summary, we demonstrated that a novel strategy combining unique disease phenotype data with system approaches can lead to rapid drug discovery. AVAILABILITY AND IMPLEMENTATION nlp. CASE edu/public/data/DMN
Collapse
Affiliation(s)
- Yang Chen
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Li Li
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Guo-Qiang Zhang
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Rong Xu
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| |
Collapse
|
28
|
Network-assisted analysis of primary Sjögren's syndrome GWAS data in Han Chinese. Sci Rep 2015; 5:18855. [PMID: 26686423 PMCID: PMC4685393 DOI: 10.1038/srep18855] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Accepted: 11/05/2015] [Indexed: 12/23/2022] Open
Abstract
Primary Sjögren's syndrome (pSS) is a complex autoimmune disorder. So far, genetic research in pSS has lagged far behind and the underlying biological mechanism is unclear. Further exploring existing genome-wide association study (GWAS) data is urgently expected to uncover disease-related gene combination patterns. Herein, we conducted a network-based analysis by integrating pSS GWAS in Han Chinese with a protein-protein interactions network to identify pSS candidate genes. After module detection and evaluation, 8 dense modules covering 40 genes were obtained for further functional annotation. Additional 31 MHC genes with significant gene-level P-values (sigMHC-gene) were also remained. The combined module genes and sigMHC-genes, a total of 71 genes, were denoted as pSS candidate genes. Of these pSS candidates, 14 genes had been reported to be associated with any of pSS, RA, and SLE, including STAT4, GTF2I, HLA-DPB1, HLA-DRB1, PTTG1, HLA-DQB1, MBL2, TAP2, CFLAR, NFKBIE, HLA-DRA, APOM, HLA-DQA2 and NOTCH4. This is the first report of the network-assisted analysis for pSS GWAS data to explore combined gene patterns associated with pSS. Our study suggests that network-assisted analysis is a useful approach to gaining further insights into the biology of associated genes and providing important clues for future research into pSS etiology.
Collapse
|
29
|
Chong J, Buckingham K, Jhangiani S, Boehm C, Sobreira N, Smith J, Harrell T, McMillin M, Wiszniewski W, Gambin T, Coban Akdemir Z, Doheny K, Scott A, Avramopoulos D, Chakravarti A, Hoover-Fong J, Mathews D, Witmer P, Ling H, Hetrick K, Watkins L, Patterson K, Reinier F, Blue E, Muzny D, Kircher M, Bilguvar K, López-Giráldez F, Sutton V, Tabor H, Leal S, Gunel M, Mane S, Gibbs R, Boerwinkle E, Hamosh A, Shendure J, Lupski J, Lifton R, Valle D, Nickerson D, Bamshad M, Bamshad MJ. The Genetic Basis of Mendelian Phenotypes: Discoveries, Challenges, and Opportunities. Am J Hum Genet 2015; 97:199-215. [PMID: 26166479 DOI: 10.1016/j.ajhg.2015.06.009] [Citation(s) in RCA: 449] [Impact Index Per Article: 49.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2015] [Indexed: 01/06/2023] Open
Abstract
Discovering the genetic basis of a Mendelian phenotype establishes a causal link between genotype and phenotype, making possible carrier and population screening and direct diagnosis. Such discoveries also contribute to our knowledge of gene function, gene regulation, development, and biological mechanisms that can be used for developing new therapeutics. As of February 2015, 2,937 genes underlying 4,163 Mendelian phenotypes have been discovered, but the genes underlying ∼50% (i.e., 3,152) of all known Mendelian phenotypes are still unknown, and many more Mendelian conditions have yet to be recognized. This is a formidable gap in biomedical knowledge. Accordingly, in December 2011, the NIH established the Centers for Mendelian Genomics (CMGs) to provide the collaborative framework and infrastructure necessary for undertaking large-scale whole-exome sequencing and discovery of the genetic variants responsible for Mendelian phenotypes. In partnership with 529 investigators from 261 institutions in 36 countries, the CMGs assessed 18,863 samples from 8,838 families representing 579 known and 470 novel Mendelian phenotypes as of January 2015. This collaborative effort has identified 956 genes, including 375 not previously associated with human health, that underlie a Mendelian phenotype. These results provide insight into study design and analytical strategies, identify novel mechanisms of disease, and reveal the extensive clinical variability of Mendelian phenotypes. Discovering the gene underlying every Mendelian phenotype will require tackling challenges such as worldwide ascertainment and phenotypic characterization of families affected by Mendelian conditions, improvement in sequencing and analytical techniques, and pervasive sharing of phenotypic and genomic data among researchers, clinicians, and families.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Michael J Bamshad
- Department of Pediatrics, University of Washington, Seattle, WA 98195, USA; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Division of Genetic Medicine, Seattle Children's Hospital, Seattle, WA 98105, USA.
| |
Collapse
|
30
|
Le DH. A novel method for identifying disease associated protein complexes based on functional similarity protein complex networks. Algorithms Mol Biol 2015; 10:14. [PMID: 25969691 PMCID: PMC4427953 DOI: 10.1186/s13015-015-0044-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 04/01/2015] [Indexed: 12/21/2022] Open
Abstract
Background Protein complexes formed by non-covalent interaction among proteins play important roles in cellular functions. Computational and purification methods have been used to identify many protein complexes and their cellular functions. However, their roles in terms of causing disease have not been well discovered yet. There exist only a few studies for the identification of disease-associated protein complexes. However, they mostly utilize complicated heterogeneous networks which are constructed based on an out-of-date database of phenotype similarity network collected from literature. In addition, they only apply for diseases for which tissue-specific data exist. Methods In this study, we propose a method to identify novel disease-protein complex associations. First, we introduce a framework to construct functional similarity protein complex networks where two protein complexes are functionally connected by either shared protein elements, shared annotating GO terms or based on protein interactions between elements in each protein complex. Second, we propose a simple but effective neighborhood-based algorithm, which yields a local similarity measure, to rank disease candidate protein complexes. Results Comparing the predictive performance of our proposed algorithm with that of two state-of-the-art network propagation algorithms including one we used in our previous study, we found that it performed statistically significantly better than that of these two algorithms for all the constructed functional similarity protein complex networks. In addition, it ran about 32 times faster than these two algorithms. Moreover, our proposed method always achieved high performance in terms of AUC values irrespective of the ways to construct the functional similarity protein complex networks and the used algorithms. The performance of our method was also higher than that reported in some existing methods which were based on complicated heterogeneous networks. Finally, we also tested our method with prostate cancer and selected the top 100 highly ranked candidate protein complexes. Interestingly, 69 of them were evidenced since at least one of their protein elements are known to be associated with prostate cancer. Conclusions Our proposed method, including the framework to construct functional similarity protein complex networks and the neighborhood-based algorithm on these networks, could be used for identification of novel disease-protein complex associations. Electronic supplementary material The online version of this article (doi:10.1186/s13015-015-0044-6) contains supplementary material, which is available to authorized users.
Collapse
|
31
|
Understanding multicellular function and disease with human tissue-specific networks. Nat Genet 2015; 47:569-76. [PMID: 25915600 PMCID: PMC4828725 DOI: 10.1038/ng.3259] [Citation(s) in RCA: 543] [Impact Index Per Article: 60.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2014] [Accepted: 03/06/2015] [Indexed: 12/17/2022]
Abstract
Tissue and cell-type identity lie at the core of human physiology and disease. Understanding the genetic underpinnings of complex tissues and individual cell lineages is crucial for developing improved diagnostics and therapeutics. We present genome-wide functional interaction networks for 144 human tissues and cell types developed using a data-driven Bayesian methodology that integrates thousands of diverse experiments spanning tissue and disease states. Tissue-specific networks predict lineage-specific responses to perturbation, reveal genes’ changing functional roles across tissues, and illuminate disease-disease relationships. We introduce NetWAS, which combines genes with nominally significant GWAS p-values and tissue-specific networks to identify disease-gene associations more accurately than GWAS alone. Our webserver, GIANT, provides an interface to human tissue networks through multi-gene queries, network visualization, analysis tools including NetWAS, and downloadable networks. GIANT enables systematic exploration of the landscape of interacting genes that shape specialized cellular functions across more than one hundred human tissues and cell types.
Collapse
|
32
|
RecRWR: a recursive random walk method for improved identification of diseases. BIOMED RESEARCH INTERNATIONAL 2015; 2015:747156. [PMID: 25874227 PMCID: PMC4385608 DOI: 10.1155/2015/747156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/22/2014] [Revised: 10/17/2014] [Accepted: 10/31/2014] [Indexed: 12/02/2022]
Abstract
High-throughput methods such as next-generation sequencing or DNA microarrays lack precision, as they return hundreds of genes for a single disease profile. Several computational methods applied to physical interaction of protein networks have been successfully used in identification of the best disease candidates for each expression profile. An open problem for these methods is the ability to combine and take advantage of the wealth of biomedical data publicly available.
We propose an enhanced method to improve selection of the best disease targets for a multilayer biomedical network that integrates PPI data annotated with stable knowledge from OMIM diseases and GO biological processes. We present a comprehensive validation that demonstrates the advantage of the proposed approach, Recursive Random Walk with Restarts (RecRWR). The obtained results outline the superiority of the proposed approach, RecRWR, in identifying disease candidates, especially with high levels of biological noise and benefiting from all data available.
Collapse
|
33
|
Ghiassian SD, Menche J, Barabási AL. A DIseAse MOdule Detection (DIAMOnD) algorithm derived from a systematic analysis of connectivity patterns of disease proteins in the human interactome. PLoS Comput Biol 2015; 11:e1004120. [PMID: 25853560 PMCID: PMC4390154 DOI: 10.1371/journal.pcbi.1004120] [Citation(s) in RCA: 216] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Accepted: 01/09/2015] [Indexed: 01/08/2023] Open
Abstract
The observation that disease associated proteins often interact with each other has fueled the development of network-based approaches to elucidate the molecular mechanisms of human disease. Such approaches build on the assumption that protein interaction networks can be viewed as maps in which diseases can be identified with localized perturbation within a certain neighborhood. The identification of these neighborhoods, or disease modules, is therefore a prerequisite of a detailed investigation of a particular pathophenotype. While numerous heuristic methods exist that successfully pinpoint disease associated modules, the basic underlying connectivity patterns remain largely unexplored. In this work we aim to fill this gap by analyzing the network properties of a comprehensive corpus of 70 complex diseases. We find that disease associated proteins do not reside within locally dense communities and instead identify connectivity significance as the most predictive quantity. This quantity inspires the design of a novel Disease Module Detection (DIAMOnD) algorithm to identify the full disease module around a set of known disease proteins. We study the performance of the algorithm using well-controlled synthetic data and systematically validate the identified neighborhoods for a large corpus of diseases. Diseases are rarely the result of an abnormality in a single gene, but involve a whole cascade of interactions between several cellular processes. To disentangle these complex interactions it is necessary to study genotype-phenotype relationships in the context of protein-protein interaction networks. Our analysis of 70 diseases shows that disease proteins are not randomly scattered within these networks, but agglomerate in specific regions, suggesting the existence of specific disease modules for each disease. The identification of these modules is the first step towards elucidating the biological mechanisms of a disease or for a targeted search of drug targets. We present a systematic analysis of the connectivity patterns of disease proteins and determine the most predictive topological property for their identification. This allows us to rationally design a reliable and efficient Disease Module Detection algorithm (DIAMOnD).
Collapse
Affiliation(s)
- Susan Dina Ghiassian
- Center for Complex Networks Research and Department of Physics, Northeastern University, Boston, Massachusetts, United States of America
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
| | - Jörg Menche
- Center for Complex Networks Research and Department of Physics, Northeastern University, Boston, Massachusetts, United States of America
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Center for Network Science, Central European University, Budapest, Hungary
| | - Albert-László Barabási
- Center for Complex Networks Research and Department of Physics, Northeastern University, Boston, Massachusetts, United States of America
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America
- Center for Network Science, Central European University, Budapest, Hungary
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
34
|
Keppler-Noreuil KM, Rios JJ, Parker VE, Semple RK, Lindhurst MJ, Sapp JC, Alomari A, Ezaki M, Dobyns W, Biesecker LG. PIK3CA-related overgrowth spectrum (PROS): diagnostic and testing eligibility criteria, differential diagnosis, and evaluation. Am J Med Genet A 2015; 167A:287-95. [PMID: 25557259 PMCID: PMC4480633 DOI: 10.1002/ajmg.a.36836] [Citation(s) in RCA: 317] [Impact Index Per Article: 35.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Revised: 09/29/2014] [Accepted: 09/30/2014] [Indexed: 01/20/2023]
Abstract
Somatic activating mutations in the phosphatidylinositol-3-kinase/AKT/mTOR pathway underlie heterogeneous segmental overgrowth phenotypes. Because of the extreme differences among patients, we sought to characterize the phenotypic spectrum associated with different genotypes and mutation burdens, including a better understanding of associated complications and natural history. Historically, the clinical diagnoses in patients with PIK3CA activating mutations have included Fibroadipose hyperplasia or Overgrowth (FAO), Hemihyperplasia Multiple Lipomatosis (HHML), Congenital Lipomatous Overgrowth, Vascular Malformations, Epidermal Nevi, Scoliosis/Skeletal and Spinal (CLOVES) syndrome, macrodactyly, Fibroadipose Infiltrating Lipomatosis, and the related megalencephaly syndromes, Megalencephaly-Capillary Malformation (MCAP or M-CM) and Dysplastic Megalencephaly (DMEG). A workshop was convened at the National Institutes of Health (NIH) to discuss and develop a consensus document regarding diagnosis and treatment of patients with PIK3CA-associated somatic overgrowth disorders. Participants in the workshop included a group of researchers from several institutions who have been studying these disorders and have published their findings, as well as representatives from patient-advocacy and support groups. The umbrella term of "PIK3CA-Related Overgrowth Spectrum (PROS)" was agreed upon to encompass both the known and emerging clinical entities associated with somatic PIK3CA mutations including, macrodactyly, FAO, HHML, CLOVES, and related megalencephaly conditions. Key clinical diagnostic features and criteria for testing were proposed, and testing approaches summarized. Preliminary recommendations for a uniform approach to assessment of overgrowth and molecular diagnostic testing were determined. Future areas to address include the surgical management of overgrowth tissue and vascular anomalies, the optimal approach to thrombosis risk, and the testing of potential pharmacologic therapies.
Collapse
Affiliation(s)
- Kim M. Keppler-Noreuil
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jonathan J. Rios
- Sarah M. and Charles E. Seay Center for Musculoskeletal Research, Texas Scottish Rite Hospital for Children, Dallas, Texas, 75219 USA
- Department of Pediatrics, UT Southwestern Medical Center, Dallas, TX, 75390 USA
- Eugene McDermott Center for Human Growth and Development, Dallas, TX, 75390 USA
- Department of Orthopaedic Surgery, UT Southwestern Medical Center, Dallas, TX, 75390 USA
| | - Victoria E.R. Parker
- The University of Cambridge Metabolic Research Laboratories, Institute of Metabolic Science, Cambridge, UK
| | - Robert K. Semple
- The University of Cambridge Metabolic Research Laboratories, Institute of Metabolic Science, Cambridge, UK
| | - Marjorie J. Lindhurst
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Julie C. Sapp
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ahmad Alomari
- Division of Vascular and Interventional Radiology, Boston Children’s Hospital and Harvard Medical School, Boston, MA
| | - Marybeth Ezaki
- Department of Orthopaedic Surgery, UT Southwestern Medical Center, Dallas, TX, 75390 USA
| | | | - Leslie G. Biesecker
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
35
|
Xu W, Jiang X, Hu X, Li G. Visualization of genetic disease-phenotype similarities by multiple maps t-SNE with Laplacian regularization. BMC Med Genomics 2014; 7 Suppl 2:S1. [PMID: 25350393 PMCID: PMC4243097 DOI: 10.1186/1755-8794-7-s2-s1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND From a phenotypic standpoint, certain types of diseases may prove to be difficult to accurately diagnose, due to specific combinations of confounding symptoms. Referred to as phenotypic overlap, these sets of disease-related symptoms suggest shared pathophysiological mechanisms. Few attempts have been made to visualize the phenotypic relationships between different human diseases from a machine learning perspective. The proposed research, it is anticipated, will visually assist researchers in quickly disambiguating symptoms which can confound the timely and accurate diagnosis of a disease. METHODS Our method is primarily based on multiple maps t-SNE (mm-tSNE), which is a probabilistic method for visualizing data points in multiple low dimensional spaces. We improved mm-tSNE by adding a Laplacian regularization term and subsequently provide an algorithm for optimizing the new objective function. The advantage of Laplacian regularization is that it adopts clustering structures of variables and provides more sparsity to the estimated parameters. RESULTS In order to further assess our modified mm-tSNE algorithm from a comparative standpoint, we reexamined two social network datasets used by the previous authors. Subsequently, we apply our method on phenotype dataset. In all these cases, our proposed method demonstrated better performance than the original version of mm-tSNE, as measured by the neighbourhood preservation ratio. CONCLUSIONS Phenotype grouping reflects the nature of human disease genetics. Thus, phenotype visualization may be complementary to investigate candidate genes for diseases as well as functional relations between genes and proteins. These relationships can be modelled by the modified mm-tSNE method. The modified mm-tSNE can be applied directly in other domain including social and biological datasets.
Collapse
|
36
|
Chen Y, Xu R. Mining cancer-specific disease comorbidities from a large observational health database. Cancer Inform 2014; 13:37-44. [PMID: 25392682 PMCID: PMC4216041 DOI: 10.4137/cin.s13893] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2014] [Revised: 04/29/2014] [Accepted: 04/30/2014] [Indexed: 12/28/2022] Open
Abstract
Cancer comorbidities often reflect the complex pathogenesis of cancers and provide valuable clues to discover the underlying genetic mechanisms of cancers. In this study, we systematically mine and analyze cancer-specific comorbidity from the FDA Adverse Event Reporting System. We stratified 3,354,043 patients based on age and gender, and developed a network-based approach to extract comorbidity patterns from each patient group. We compared the comorbidity patterns among different patient groups and investigated the effect of age and gender on cancer comorbidity patterns. The results demonstrated that the comorbidity relationships between cancers and non-cancer diseases largely depend on age and gender. A few exceptions are depression, anxiety, and metabolic syndrome, whose comorbidity relationships with cancers are relatively stable among all patients. Literature evidences demonstrate that these stable cancer comorbidities reflect the pathogenesis of cancers. We applied our comorbidity mining approach on colorectal cancer and detected its comorbid associations with metabolic syndrome components, diabetes, and osteoporosis. Our results not only confirmed known cancer comorbidities but also generated novel hypotheses, which can illuminate the common pathophysiology between cancers and their co-occurring diseases.
Collapse
Affiliation(s)
- Yang Chen
- Division of Medical Informatics, Case Western Reserve University, Cleveland, OH, USA
| | - Rong Xu
- Division of Medical Informatics, Case Western Reserve University, Cleveland, OH, USA
| |
Collapse
|
37
|
Chen Y, Zhang X, Zhang GQ, Xu R. Comparative analysis of a novel disease phenotype network based on clinical manifestations. J Biomed Inform 2014; 53:113-20. [PMID: 25277758 DOI: 10.1016/j.jbi.2014.09.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2014] [Revised: 08/18/2014] [Accepted: 09/21/2014] [Indexed: 12/21/2022]
Abstract
Systems approaches to analyzing disease phenotype networks in combination with protein functional interaction networks have great potential in illuminating disease pathophysiological mechanisms. While many genetic networks are readily available, disease phenotype networks remain largely incomplete. In this study, we built a large-scale Disease Manifestation Network (DMN) from 50,543 highly accurate disease-manifestation semantic relationships in the United Medical Language System (UMLS). Our new phenotype network contains 2305 nodes and 373,527 weighted edges to represent the disease phenotypic similarities. We first compared DMN with the networks representing genetic relationships among diseases, and demonstrated that the phenotype clustering in DMN reflects common disease genetics. Then we compared DMN with a widely-used disease phenotype network in previous gene discovery studies, called mimMiner, which was extracted from the textual descriptions in Online Mendelian Inheritance in Man (OMIM). We demonstrated that DMN contains different knowledge from the existing phenotype data source. Finally, a case study on Marfan syndrome further proved that DMN contains useful information and can provide leads to discover unknown disease causes. Integrating DMN in systems approaches with mimMiner and other data offers the opportunities to predict novel disease genetics. We made DMN publicly available at nlp/case.edu/public/data/DMN.
Collapse
Affiliation(s)
- Yang Chen
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH 44106, United States; Division of Medical Informatics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Xiang Zhang
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Guo-Qiang Zhang
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH 44106, United States; Division of Medical Informatics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Rong Xu
- Division of Medical Informatics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, United States.
| |
Collapse
|
38
|
Garcia-Alonso L, Jiménez-Almazán J, Carbonell-Caballero J, Vela-Boza A, Santoyo-López J, Antiñolo G, Dopazo J. The role of the interactome in the maintenance of deleterious variability in human populations. Mol Syst Biol 2014; 10:752. [PMID: 25261458 PMCID: PMC4299661 DOI: 10.15252/msb.20145222] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2014] [Revised: 08/23/2014] [Accepted: 08/28/2014] [Indexed: 12/25/2022] Open
Abstract
Recent genomic projects have revealed the existence of an unexpectedly large amount of deleterious variability in the human genome. Several hypotheses have been proposed to explain such an apparently high mutational load. However, the mechanisms by which deleterious mutations in some genes cause a pathological effect but are apparently innocuous in other genes remain largely unknown. This study searched for deleterious variants in the 1,000 genomes populations, as well as in a newly sequenced population of 252 healthy Spanish individuals. In addition, variants causative of monogenic diseases and somatic variants from 41 chronic lymphocytic leukaemia patients were analysed. The deleterious variants found were analysed in the context of the interactome to understand the role of network topology in the maintenance of the observed mutational load. Our results suggest that one of the mechanisms whereby the effect of these deleterious variants on the phenotype is suppressed could be related to the configuration of the protein interaction network. Most of the deleterious variants observed in healthy individuals are concentrated in peripheral regions of the interactome, in combinations that preserve their connectivity, and have a marginal effect on interactome integrity. On the contrary, likely pathogenic cancer somatic deleterious variants tend to occur in internal regions of the interactome, often with associated structural consequences. Finally, variants causative of monogenic diseases seem to occupy an intermediate position. Our observations suggest that the real pathological potential of a variant might be more a systems property rather than an intrinsic property of individual proteins.
Collapse
Affiliation(s)
- Luz Garcia-Alonso
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Jorge Jiménez-Almazán
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, Spain
| | - Jose Carbonell-Caballero
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Alicia Vela-Boza
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain
| | - Javier Santoyo-López
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain
| | - Guillermo Antiñolo
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville, University Hospital Virgen del Rocio/Consejo Superior de Investigaciones Científicas/University of Seville, Seville, Spain Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Seville, Spain
| | - Joaquin Dopazo
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, Spain Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain Functional Genomics Node, (INB) at CIPF, Valencia, Spain
| |
Collapse
|
39
|
Jiang L, Edwards SM, Thomsen B, Workman CT, Guldbrandtsen B, Sørensen P. A random set scoring model for prioritization of disease candidate genes using protein complexes and data-mining of GeneRIF, OMIM and PubMed records. BMC Bioinformatics 2014; 15:315. [PMID: 25253562 PMCID: PMC4181406 DOI: 10.1186/1471-2105-15-315] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2013] [Accepted: 09/17/2014] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Prioritizing genetic variants is a challenge because disease susceptibility loci are often located in genes of unknown function or the relationship with the corresponding phenotype is unclear. A global data-mining exercise on the biomedical literature can establish the phenotypic profile of genes with respect to their connection to disease phenotypes. The importance of protein-protein interaction networks in the genetic heterogeneity of common diseases or complex traits is becoming increasingly recognized. Thus, the development of a network-based approach combined with phenotypic profiling would be useful for disease gene prioritization. RESULTS We developed a random-set scoring model and implemented it to quantify phenotype relevance in a network-based disease gene-prioritization approach. We validated our approach based on different gene phenotypic profiles, which were generated from PubMed abstracts, OMIM, and GeneRIF records. We also investigated the validity of several vocabulary filters and different likelihood thresholds for predicted protein-protein interactions in terms of their effect on the network-based gene-prioritization approach, which relies on text-mining of the phenotype data. Our method demonstrated good precision and sensitivity compared with those of two alternative complex-based prioritization approaches. We then conducted a global ranking of all human genes according to their relevance to a range of human diseases. The resulting accurate ranking of known causal genes supported the reliability of our approach. Moreover, these data suggest many promising novel candidate genes for human disorders that have a complex mode of inheritance. CONCLUSION We have implemented and validated a network-based approach to prioritize genes for human diseases based on their phenotypic profile. We have devised a powerful and transparent tool to identify and rank candidate genes. Our global gene prioritization provides a unique resource for the biological interpretation of data from genome-wide association studies, and will help in the understanding of how the associated genetic variants influence disease or quantitative phenotypes.
Collapse
Affiliation(s)
- Li Jiang
- Department of Molecular Biology and Genetics, Aarhus University, DK-8830 Tjele, Denmark.
| | | | | | | | | | | |
Collapse
|
40
|
Honti F, Meader S, Webber C. Unbiased functional clustering of gene variants with a phenotypic-linkage network. PLoS Comput Biol 2014; 10:e1003815. [PMID: 25166029 PMCID: PMC4148192 DOI: 10.1371/journal.pcbi.1003815] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 07/14/2014] [Indexed: 01/04/2023] Open
Abstract
Groupwise functional analysis of gene variants is becoming standard in next-generation sequencing studies. As the function of many genes is unknown and their classification to pathways is scant, functional associations between genes are often inferred from large-scale omics data. Such data types—including protein–protein interactions and gene co-expression networks—are used to examine the interrelations of the implicated genes. Statistical significance is assessed by comparing the interconnectedness of the mutated genes with that of random gene sets. However, interconnectedness can be affected by confounding bias, potentially resulting in false positive findings. We show that genes implicated through de novo sequence variants are biased in their coding-sequence length and longer genes tend to cluster together, which leads to exaggerated p-values in functional studies; we present here an integrative method that addresses these bias. To discern molecular pathways relevant to complex disease, we have inferred functional associations between human genes from diverse data types and assessed them with a novel phenotype-based method. Examining the functional association between de novo gene variants, we control for the heretofore unexplored confounding bias in coding-sequence length. We test different data types and networks and find that the disease-associated genes cluster more significantly in an integrated phenotypic-linkage network than in other gene networks. We present a tool of superior power to identify functional associations among genes mutated in the same disease even after accounting for significant sequencing study bias and demonstrate the suitability of this method to functionally cluster variant genes underlying polygenic disorders. Plenty of gene variants have been associated with a disease, yet most of the heritability, along with the molecular basis, of common diseases remains unexplained. However, it is widely thought that the products of genes whose mutations are implicated in the same disease function together in the same biological pathways and it is the disruption of these pathways that underlies the disease. Such pathways are not well defined and their identification could help elucidate disease mechanisms. Consequently, groupwise functional analyses of gene variants to identify common disease-relevant pathways are becoming standard in next-generation sequencing studies, but we find that these analyses are confounded by coding-sequence length bias. We control for these bias and describe a phenotype-based approach which outperforms other methods in discerning functional associations among the disease-associated genes. We also demonstrate the suitability of this method to functionally dissect the gene variants underlying a complex disorder, the identified functional clusters offering insight into disease mechanisms.
Collapse
Affiliation(s)
- Frantisek Honti
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Stephen Meader
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Caleb Webber
- MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
- * E-mail:
| |
Collapse
|
41
|
Pagnan NAB, Visinoni ÁF. Update on ectodermal dysplasias clinical classification. Am J Med Genet A 2014; 164A:2415-23. [PMID: 25098893 DOI: 10.1002/ajmg.a.36616] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Accepted: 04/14/2014] [Indexed: 01/30/2023]
Abstract
Monogenic genetic disorders constitute a very large group of rare conditions, each of which is defined by a characteristic combination of phenotypic features. Their enormous clinical variability and their etiological heterogeneity may result in difficulties for the establishment of a syndromic diagnosis. In this context, classifications were proposed for different nosological groups, including ectodermal dysplasias. Freire-Maia proposed a clinical based classification, but nowadays the need of connecting clinical and molecular data on EDs demands a re-evaluation of the knowledge and the formulation of a new classification approach. The aim of this article is to provide an update of an article published in 2009 in this Journal. In order to check for new articles and information on ectodermal dysplasias, we have consulted the OMIM, PUBMED, and Science Direct online databases.
Collapse
|
42
|
Human symptoms-disease network. Nat Commun 2014; 5:4212. [PMID: 24967666 DOI: 10.1038/ncomms5212] [Citation(s) in RCA: 316] [Impact Index Per Article: 31.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2013] [Accepted: 05/27/2014] [Indexed: 12/19/2022] Open
Abstract
In the post-genomic era, the elucidation of the relationship between the molecular origins of diseases and their resulting phenotypes is a crucial task for medical research. Here, we use a large-scale biomedical literature database to construct a symptom-based human disease network and investigate the connection between clinical manifestations of diseases and their underlying molecular interactions. We find that the symptom-based similarity of two diseases correlates strongly with the number of shared genetic associations and the extent to which their associated proteins interact. Moreover, the diversity of the clinical manifestations of a disease can be related to the connectivity patterns of the underlying protein interaction network. The comprehensive, high-quality map of disease-symptom relations can further be used as a resource helping to address important questions in the field of systems medicine, for example, the identification of unexpected associations between diseases, disease etiology research or drug design.
Collapse
|
43
|
Yang P, Li X, Chua HN, Kwoh CK, Ng SK. Ensemble positive unlabeled learning for disease gene identification. PLoS One 2014; 9:e97079. [PMID: 24816822 PMCID: PMC4016241 DOI: 10.1371/journal.pone.0097079] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 04/14/2014] [Indexed: 11/24/2022] Open
Abstract
An increasing number of genes have been experimentally confirmed in recent years as causative genes to various human diseases. The newly available knowledge can be exploited by machine learning methods to discover additional unknown genes that are likely to be associated with diseases. In particular, positive unlabeled learning (PU learning) methods, which require only a positive training set P (confirmed disease genes) and an unlabeled set U (the unknown candidate genes) instead of a negative training set N, have been shown to be effective in uncovering new disease genes in the current scenario. Using only a single source of data for prediction can be susceptible to bias due to incompleteness and noise in the genomic data and a single machine learning predictor prone to bias caused by inherent limitations of individual methods. In this paper, we propose an effective PU learning framework that integrates multiple biological data sources and an ensemble of powerful machine learning classifiers for disease gene identification. Our proposed method integrates data from multiple biological sources for training PU learning classifiers. A novel ensemble-based PU learning method EPU is then used to integrate multiple PU learning classifiers to achieve accurate and robust disease gene predictions. Our evaluation experiments across six disease groups showed that EPU achieved significantly better results compared with various state-of-the-art prediction methods as well as ensemble learning classifiers. Through integrating multiple biological data sources for training and the outputs of an ensemble of PU learning classifiers for prediction, we are able to minimize the potential bias and errors in individual data sources and machine learning algorithms to achieve more accurate and robust disease gene predictions. In the future, our EPU method provides an effective framework to integrate the additional biological and computational resources for better disease gene predictions.
Collapse
Affiliation(s)
- Peng Yang
- Data Analytics Department, Institute for Infocomm Research (I2R), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- * E-mail: (PY); (XL)
| | - Xiaoli Li
- Data Analytics Department, Institute for Infocomm Research (I2R), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
- * E-mail: (PY); (XL)
| | - Hon-Nian Chua
- Data Analytics Department, Institute for Infocomm Research (I2R), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Chee-Keong Kwoh
- Bioinformatics Research Centre, School of Computer Engineering, Nanyang Technological University, Singapore, Singapore
| | - See-Kiong Ng
- Data Analytics Department, Institute for Infocomm Research (I2R), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| |
Collapse
|
44
|
Zhang SW, Shao DD, Zhang SY, Wang YB. Prioritization of candidate disease genes by enlarging the seed set and fusing information of the network topology and gene expression. MOLECULAR BIOSYSTEMS 2014; 10:1400-8. [PMID: 24695957 DOI: 10.1039/c3mb70588a] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The identification of disease genes is very important not only to provide greater understanding of gene function and cellular mechanisms which drive human disease, but also to enhance human disease diagnosis and treatment. Recently, high-throughput techniques have been applied to detect dozens or even hundreds of candidate genes. However, experimental approaches to validate the many candidates are usually time-consuming, tedious and expensive, and sometimes lack reproducibility. Therefore, numerous theoretical and computational methods (e.g. network-based approaches) have been developed to prioritize candidate disease genes. Many network-based approaches implicitly utilize the observation that genes causing the same or similar diseases tend to correlate with each other in gene-protein relationship networks. Of these network approaches, the random walk with restart algorithm (RWR) is considered to be a state-of-the-art approach. To further improve the performance of RWR, we propose a novel method named ESFSC to identify disease-related genes, by enlarging the seed set according to the centrality of disease genes in a network and fusing information of the protein-protein interaction (PPI) network topological similarity and the gene expression correlation. The ESFSC algorithm restarts at all of the nodes in the seed set consisting of the known disease genes and their k-nearest neighbor nodes, then walks in the global network separately guided by the similarity transition matrix constructed with PPI network topological similarity properties and the correlational transition matrix constructed with the gene expression profiles. As a result, all the genes in the network are ranked by weighted fusing the above results of the RWR guided by two types of transition matrices. Comprehensive simulation results of the 10 diseases with 97 known disease genes collected from the Online Mendelian Inheritance in Man (OMIM) database show that ESFSC outperforms existing methods for prioritizing candidate disease genes. The top prediction results of Alzheimer's disease are consistent with previous literature reports.
Collapse
Affiliation(s)
- Shao-Wu Zhang
- College of Automation, Northwestern Polytechnical University, 710072, Xi'an, China.
| | | | | | | |
Collapse
|
45
|
Gustafsson M, Edström M, Gawel D, Nestor CE, Wang H, Zhang H, Barrenäs F, Tojo J, Kockum I, Olsson T, Serra-Musach J, Bonifaci N, Pujana MA, Ernerudh J, Benson M. Integrated genomic and prospective clinical studies show the importance of modular pleiotropy for disease susceptibility, diagnosis and treatment. Genome Med 2014; 6:17. [PMID: 24571673 PMCID: PMC4064311 DOI: 10.1186/gm534] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Accepted: 02/21/2014] [Indexed: 12/17/2022] Open
Abstract
Background Translational research typically aims to identify and functionally validate individual, disease-specific genes. However, reaching this aim is complicated by the involvement of thousands of genes in common diseases, and that many of those genes are pleiotropic, that is, shared by several diseases. Methods We integrated genomic meta-analyses with prospective clinical studies to systematically investigate the pathogenic, diagnostic and therapeutic roles of pleiotropic genes. In a novel approach, we first used pathway analysis of all published genome-wide association studies (GWAS) to find a cell type common to many diseases. Results The analysis showed over-representation of the T helper cell differentiation pathway, which is expressed in T cells. This led us to focus on expression profiling of CD4+ T cells from highly diverse inflammatory and malignant diseases. We found that pleiotropic genes were highly interconnected and formed a pleiotropic module, which was enriched for inflammatory, metabolic and proliferative pathways. The general relevance of this module was supported by highly significant enrichment of genetic variants identified by all GWAS and cancer studies, as well as known diagnostic and therapeutic targets. Prospective clinical studies of multiple sclerosis and allergy showed the importance of both pleiotropic and disease specific modules for clinical stratification. Conclusions In summary, this translational genomics study identified a pleiotropic module, which has key pathogenic, diagnostic and therapeutic roles.
Collapse
Affiliation(s)
- Mika Gustafsson
- The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
| | - Måns Edström
- Clinical and Experimental Medicine, Faculty of Health Sciences, Division of Clinical Immunology, Unit of Autoimmunity and Immune Regulation, Linköping University, 58185 Linköping, Sweden
| | - Danuta Gawel
- The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
| | - Colm E Nestor
- The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
| | - Hui Wang
- The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
| | - Huan Zhang
- The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
| | - Fredrik Barrenäs
- The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
| | - James Tojo
- Department of Clinical Neurosciences, Karolinska Institutet and Centrum for Molecular Medicine, 17177 Stockholm, Sweden
| | - Ingrid Kockum
- Department of Clinical Neurosciences, Karolinska Institutet and Centrum for Molecular Medicine, 17177 Stockholm, Sweden
| | - Tomas Olsson
- Department of Clinical Neurosciences, Karolinska Institutet and Centrum for Molecular Medicine, 17177 Stockholm, Sweden
| | - Jordi Serra-Musach
- Cancer and Systems Biology Unit, Catalan Institute of Oncology, IDIBELL, L'Hospitalet del Llobregat, 08908 Barcelona, Spain
| | - Núria Bonifaci
- Cancer and Systems Biology Unit, Catalan Institute of Oncology, IDIBELL, L'Hospitalet del Llobregat, 08908 Barcelona, Spain
| | - Miguel Angel Pujana
- Cancer and Systems Biology Unit, Catalan Institute of Oncology, IDIBELL, L'Hospitalet del Llobregat, 08908 Barcelona, Spain
| | - Jan Ernerudh
- Clinical and Experimental Medicine, Faculty of Health Sciences, Division of Clinical Immunology, Unit of Autoimmunity and Immune Regulation, Linköping University, 58185 Linköping, Sweden
| | - Mikael Benson
- The Centre for Individualised Medicine, Department of Clinical and Experimental Medicine, Linköping University, 58185 Linköping, Sweden
| |
Collapse
|
46
|
Network Analysis of Human Disease Comorbidity Patterns Based on Large-Scale Data Mining. BIOINFORMATICS RESEARCH AND APPLICATIONS 2014. [DOI: 10.1007/978-3-319-08171-7_22] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
47
|
Chen Y, Wu X, Jiang R. Integrating human omics data to prioritize candidate genes. BMC Med Genomics 2013; 6:57. [PMID: 24344781 PMCID: PMC3878333 DOI: 10.1186/1755-8794-6-57] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2013] [Accepted: 12/12/2013] [Indexed: 01/07/2023] Open
Abstract
Background The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods. Methods With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown. Results We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes. Conclusion The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases.
Collapse
Affiliation(s)
| | | | - Rui Jiang
- Department of Automation, MOE Key Laboratory of Bioinformatics; Bioinformatics Division and Center for Synthetic & Systems Biology, TNLIST, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
48
|
Biesecker LG. Invited editorial comment-the human phenotype of germlinePIGAmutations. Am J Med Genet A 2013; 164A:15-6. [DOI: 10.1002/ajmg.a.36213] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2013] [Accepted: 07/29/2013] [Indexed: 11/11/2022]
|
49
|
Multi-dimensional prioritization of dental caries candidate genes and its enriched dense network modules. PLoS One 2013; 8:e76666. [PMID: 24146904 PMCID: PMC3795720 DOI: 10.1371/journal.pone.0076666] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2013] [Accepted: 08/31/2013] [Indexed: 01/14/2023] Open
Abstract
A number of genetic studies have suggested numerous susceptibility genes for dental caries over the past decade with few definite conclusions. The rapid accumulation of relevant information, along with the complex architecture of the disease, provides a challenging but also unique opportunity to review and integrate the heterogeneous data for follow-up validation and exploration. In this study, we collected and curated candidate genes from four major categories: association studies, linkage scans, gene expression analyses, and literature mining. Candidate genes were prioritized according to the magnitude of evidence related to dental caries. We then searched for dense modules enriched with the prioritized candidate genes through their protein-protein interactions (PPIs). We identified 23 modules comprising of 53 genes. Functional analyses of these 53 genes revealed three major clusters: cytokine network relevant genes, matrix metalloproteinases (MMPs) family, and transforming growth factor-beta (TGF-β) family, all of which have been previously implicated to play important roles in tooth development and carious lesions. Through our extensive data collection and an integrative application of gene prioritization and PPI network analyses, we built a dental caries-specific sub-network for the first time. Our study provided insights into the molecular mechanisms underlying dental caries. The framework we proposed in this work can be applied to other complex diseases.
Collapse
|
50
|
Hennekam RC, Biesecker LG, Allanson JE, Hall JG, Opitz JM, Temple IK, Carey JC. Elements of morphology: General terms for congenital anomalies. Am J Med Genet A 2013; 161A:2726-33. [DOI: 10.1002/ajmg.a.36249] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2013] [Accepted: 08/26/2013] [Indexed: 11/08/2022]
Affiliation(s)
- Raoul C. Hennekam
- Departments of Pediatrics and Clinical Genetics, Academic Medical Center; University of Amsterdam; Amsterdam Netherlands
| | - Leslie G. Biesecker
- National Human Genome Research Institute; National Institutes of Health; Bethesda Maryland
| | - Judith E. Allanson
- Department of Genetics; Children's Hospital of Eastern Ontario; Ottawa Canada
| | - Judith G. Hall
- Departments of Medical Genetics and Pediatrics; University of British Columbia and BC Children's Hospital; Vancouver British Columbia Canada
| | - John M. Opitz
- Division of Medical Genetics, Human Genetics; Pathology; Obstetrics and Gynecology; University of Utah; Salt Lake City Utah
| | - I Karen Temple
- Faculty of Medicine, University of Southampton and Wessex Clinical Genetics Service; University Hospital Southampton; Southampton United Kingdom
| | - John C. Carey
- Division of Medical Genetics, Human Genetics; Pathology; Obstetrics and Gynecology; University of Utah; Salt Lake City Utah
| | | |
Collapse
|