Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Kumar S, Dudley JT, Filipski A, Liu L. Phylomedicine: an evolutionary telescope to explore and diagnose the universe of disease mutations. Trends Genet 2011;27:377-86. [PMID: 21764165 PMCID: PMC3272884 DOI: 10.1016/j.tig.2011.06.004] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2011] [Revised: 06/10/2011] [Accepted: 06/13/2011] [Indexed: 12/30/2022]

For:	Kumar S, Dudley JT, Filipski A, Liu L. Phylomedicine: an evolutionary telescope to explore and diagnose the universe of disease mutations. Trends Genet 2011;27:377-86. [PMID: 21764165 PMCID: PMC3272884 DOI: 10.1016/j.tig.2011.06.004] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2011] [Revised: 06/10/2011] [Accepted: 06/13/2011] [Indexed: 12/30/2022]

Number

Cited by Other Article(s)

Campitelli P, Ross D, Swint-Kruse L, Ozkan SB. Dynamics-based protein network features accurately discriminate neutral and rheostat positions. Biophys J 2024:S0006-3495(24)00625-8. [PMID: 39277794 DOI: 10.1016/j.bpj.2024.09.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 07/03/2024] [Accepted: 09/11/2024] [Indexed: 09/17/2024] Open

Abstract

In some proteins, a unique class of nonconserved positions is characterized by their ability to generate diverse functional outcomes through single amino acid substitutions. Due to their ability to tune protein function, accurately identifying such "rheostat" positions is crucial for protein design, for understanding the impact of mutations observed in humans, and for predicting the evolution of pathogen drug resistance. However, identifying rheostat positions has been challenging, due-in part-to the absence of a clear structural relationship with binding sites. In this study, experimental data from our previous study of the Escherichia coli lactose repressor protein (LacI) was used to identify rheostat positions for which mutations tune in vivo EC50 for the allosteric ligand "IPTG." We next used the rheostat assignments to test the hypothesis that rheostat positions have unique dynamic features that will enable their identification. To that end, we integrated all-atom molecular dynamics simulations with perturbation residue response analysis. Results first revealed distinct dynamic behavior in IPTG-bound LacI compared with apo LacI, which was consistent with IPTG's role as an allosteric inducer. Next, we used a variety of dynamic features to build a classification model that discriminates experimentally characterized rheostat positions in LacI from positions with other types of substitution outcomes. In parallel, we built a second classifier model based on the 3D structural "static" network features of LacI. In comparative studies, the dynamic model better identified rheostat positions that were >8 Å from the binding site. In summary, our study provides insights into the dynamic characteristics of rheostat positions and suggests that models built on dynamic features may be useful for predicting the locations of rheostat positions in a wide range of proteins.

Collapse

Chen H, Shu J, Maley CC, Liu L. A Mouse-Specific Model to Detect Genes under Selection in Tumors. Cancers (Basel) 2023;15:5156. [PMID: 37958330 PMCID: PMC10647215 DOI: 10.3390/cancers15215156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 10/16/2023] [Accepted: 10/18/2023] [Indexed: 11/15/2023] Open

Liska O, Boross G, Rocabert C, Szappanos B, Tengölics R, Papp B. Principles of metabolome conservation in animals. Proc Natl Acad Sci U S A 2023;120:e2302147120. [PMID: 37603743 PMCID: PMC10468614 DOI: 10.1073/pnas.2302147120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 07/16/2023] [Indexed: 08/23/2023] Open

Affiliation(s)

Orsolya Liska Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary Doctoral School of Biology, University of Szeged, 6726Szeged, Hungary
Gábor Boross National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary Department of Biology, Stanford University, Stanford, City of Palo Alto, CA94305-5020
Charles Rocabert National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary Inria, 78150Rocquencourt, 69100Villeurbanne, France Organismal and Evolutionary Biology Research Programme, University of Helsinki, 00014Helsinki, Finland Institute for Computational Cell Biology, Heinrich-Heine Universität, 40225Düsseldorf, Germany
Balázs Szappanos Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary Department of Biotechnology, University of Szeged, 6726Szeged, Hungary
Roland Tengölics Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary Metabolomics Lab, Core facilities, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
Balázs Papp Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary National Laboratory for Health Security, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary

Collapse

Manzoor H, Zahid H, Emerling CA, Kumar KR, Hussain HMJ, Seo GH, Wajid M, Naz S. A biallelic variant of DCAF13 implicated in a neuromuscular disorder in humans. Eur J Hum Genet 2023;31:629-637. [PMID: 36797467 PMCID: PMC10250411 DOI: 10.1038/s41431-023-01319-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 02/05/2023] [Accepted: 02/09/2023] [Indexed: 02/18/2023] Open

Ose NJ, Butler BM, Kumar A, Kazan IC, Sanderford M, Kumar S, Ozkan SB. Dynamic coupling of residues within proteins as a mechanistic foundation of many enigmatic pathogenic missense variants. PLoS Comput Biol 2022;18:e1010006. [PMID: 35389981 PMCID: PMC9017885 DOI: 10.1371/journal.pcbi.1010006] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 04/19/2022] [Accepted: 03/09/2022] [Indexed: 01/07/2023] Open

Murphy WJ, Foley NM, Bredemeyer KR, Gatesy J, Springer MS. Phylogenomics and the Genetic Architecture of the Placental Mammal Radiation. Annu Rev Anim Biosci 2020;9:29-53. [PMID: 33228377 DOI: 10.1146/annurev-animal-061220-023149] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Allostery and Epistasis: Emergent Properties of Anisotropic Networks. ENTROPY 2020;22:e22060667. [PMID: 33286439 PMCID: PMC7517209 DOI: 10.3390/e22060667] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 06/02/2020] [Accepted: 06/08/2020] [Indexed: 11/17/2022]

Guan X, Runger G, Liu L. Dynamic incorporation of prior knowledge from multiple domains in biomarker discovery. BMC Bioinformatics 2020;21:77. [PMID: 32164534 PMCID: PMC7068914 DOI: 10.1186/s12859-020-3344-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Abstract

Background

In biomarker discovery, applying domain knowledge is an effective approach to eliminating false positive features, prioritizing functionally impactful markers and facilitating the interpretation of predictive signatures. Several computational methods have been developed that formulate the knowledge-based biomarker discovery as a feature selection problem guided by prior information. These methods often require that prior information is encoded as a single score and the algorithms are optimized for biological knowledge of a specific type. However, in practice, domain knowledge from diverse resources can provide complementary information. But no current methods can integrate heterogeneous prior information for biomarker discovery. To address this problem, we developed the Know-GRRF (know-guided regularized random forest) method that enables dynamic incorporation of domain knowledge from multiple disciplines to guide feature selection.

Results

Know-GRRF embeds domain knowledge in a regularized random forest framework. It combines prior information from multiple domains in a linear model to derive a composite score, which, together with other tuning parameters, controls the regularization of the random forests model. Know-GRRF concurrently optimizes the weight given to each type of domain knowledge and other tuning parameters to minimize the AIC of out-of-bag predictions. The objective is to select a compact feature subset that has a high discriminative power and strong functional relevance to the biological phenotype.

Via rigorous simulations, we show that Know-GRRF guided by multiple-domain prior information outperforms feature selection methods guided by single-domain prior information or no prior information. We then applied Known-GRRF to a real-world study to identify prognostic biomarkers of prostate cancers. We evaluated the combination of cancer-related gene annotations, evolutionary conservation and pre-computed statistical scores as the prior knowledge to assemble a panel of biomarkers. We discovered a compact set of biomarkers with significant improvements on prediction accuracies.

Conclusions

Know-GRRF is a powerful novel method to incorporate knowledge from multiple domains for feature selection. It has a broad range of applications in biomarker discoveries. We implemented this method and released a KnowGRRF package in the R/CRAN archive.

Collapse

Campitelli P, Modi T, Kumar S, Ozkan SB. The Role of Conformational Dynamics and Allostery in Modulating Protein Evolution. Annu Rev Biophys 2020;49:267-288. [PMID: 32075411 DOI: 10.1146/annurev-biophys-052118-115517] [Citation(s) in RCA: 89] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Wong KC, Yan S, Lin Q, Li X, Peng C. Deleterious Non-Synonymous Single Nucleotide Polymorphism Predictions on Human Transcription Factors. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:327-333. [PMID: 30475727 DOI: 10.1109/tcbb.2018.2882548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Sharma V, Hiller M. Losses of human disease-associated genes in placental mammals. NAR Genom Bioinform 2019;2:lqz012. [PMID: 33575564 PMCID: PMC7671337 DOI: 10.1093/nargab/lqz012] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 08/24/2019] [Accepted: 10/08/2019] [Indexed: 02/07/2023] Open

Alves LQ, Alves J, Ribeiro R, Ruivo R, Castro F. The dopamine receptor D₅ gene shows signs of independent erosion in toothed and baleen whales. PeerJ 2019;7:e7758. [PMID: 31616587 PMCID: PMC6791347 DOI: 10.7717/peerj.7758] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Accepted: 08/26/2019] [Indexed: 12/30/2022] Open

Chong CS, Kunze M, Hochreiter B, Krenn M, Berger J, Maurer-Stroh S. Rare Human Missense Variants can affect the Function of Disease-Relevant Proteins by Loss and Gain of Peroxisomal Targeting Motifs. Int J Mol Sci 2019;20:E4609. [PMID: 31533369 PMCID: PMC6770196 DOI: 10.3390/ijms20184609] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 09/06/2019] [Accepted: 09/14/2019] [Indexed: 12/30/2022] Open

Butler BM, Kazan IC, Kumar A, Ozkan SB. Coevolving residues inform protein dynamics profiles and disease susceptibility of nSNVs. PLoS Comput Biol 2018;14:e1006626. [PMID: 30496278 PMCID: PMC6289467 DOI: 10.1371/journal.pcbi.1006626] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Revised: 12/11/2018] [Accepted: 11/09/2018] [Indexed: 11/18/2022] Open

Abstract

The conformational dynamics of proteins is rarely used in methodologies used to predict the impact of genetic mutations due to the paucity of three-dimensional protein structures as compared to the vast number of available sequences. Until now a three-dimensional (3D) structure has been required to predict the conformational dynamics of a protein. We introduce an approach that estimates the conformational dynamics of a protein, without relying on structural information. This de novo approach utilizes coevolving residues identified from a multiple sequence alignment (MSA) using Potts models. These coevolving residues are used as contacts in a Gaussian network model (GNM) to obtain protein dynamics. B-factors calculated using sequence-based GNM (Seq-GNM) are in agreement with crystallographic B-factors as well as theoretical B-factors from the original GNM that utilizes the 3D structure. Moreover, we demonstrate the ability of the calculated B-factors from the Seq-GNM approach to discriminate genomic variants according to their phenotypes for a wide range of proteins. These results suggest that protein dynamics can be approximated based on sequence information alone, making it possible to assess the phenotypes of nSNVs in cases where a 3D structure is unknown. We hope this work will promote the use of dynamics information in genetic disease prediction at scale by circumventing the need for 3D structures.

Proteins are dynamic machines that undergo atomic fluctuations, side chain rotations, and collective domain movements that are required for biological function. There is, therefore, a need for quantitative metrics that capture the dynamic fluctuations per position to understand the critical role of protein dynamics in shaping biological functions. A limiting factor in incorporating structural dynamics information in the classification of non-synonymous single nucleotide variants (nSNVs) is the limited number of known 3D structures compared to the vast number of available sequences. We have developed a new sequence-based GNM method, termed Seq-GNM, which uses co-evolving amino acid positions based on the multiple sequence alignment of a given query sequence to estimate the thermal motions of C-alpha atoms. In this paper, we have demonstrated that the predicted thermal motions using Seq-GNM are in reasonable agreement with experimental B-factors as well as B-factors computed using 3D crystal structures. We also provide evidence that B-factors predicted by Seq-GNM are capable of distinguishing between disease-associated and neutral nSNVs.

Collapse

Lopez JV, Kamel B, Medina M, Collins T, Baums IB. Multiple Facets of Marine Invertebrate Conservation Genomics. Annu Rev Anim Biosci 2018;7:473-497. [PMID: 30485758 DOI: 10.1146/annurev-animal-020518-115034] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Earth BioGenome Project: Sequencing life for the future of life. Proc Natl Acad Sci U S A 2018;115:4325-4333. [PMID: 29686065 DOI: 10.1073/pnas.1720115115] [Citation(s) in RCA: 431] [Impact Index Per Article: 71.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Kumar S, Patel R. Neutral Theory, Disease Mutations, and Personal Exomes. Mol Biol Evol 2018;35:1297-1303. [PMID: 29688514 PMCID: PMC5967454 DOI: 10.1093/molbev/msy085] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Scholte LLS, Pascoal-Xavier MA, Nahum LA. Helminths and Cancers From the Evolutionary Perspective. Front Med (Lausanne) 2018;5:90. [PMID: 29713629 PMCID: PMC5911458 DOI: 10.3389/fmed.2018.00090] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Accepted: 03/22/2018] [Indexed: 01/20/2023] Open

Klink GV, Golovin AV, Bazykin GA. Substitutions into amino acids that are pathogenic in human mitochondrial proteins are more frequent in lineages closely related to human than in distant lineages. PeerJ 2017;5:e4143. [PMID: 29250469 PMCID: PMC5731343 DOI: 10.7717/peerj.4143] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Accepted: 11/16/2017] [Indexed: 11/23/2022] Open

Emerling CA, Widjaja AD, Nguyen NN, Springer MS. Their loss is our gain: regressive evolution in vertebrates provides genomic models for uncovering human disease loci. J Med Genet 2017;54:787-794. [PMID: 28814606 DOI: 10.1136/jmedgenet-2017-104837] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2017] [Revised: 07/07/2017] [Accepted: 07/10/2017] [Indexed: 12/20/2022]

Gasse B, Prasad M, Delgado S, Huckert M, Kawczynski M, Garret-Bernardin A, Lopez-Cazaux S, Bailleul-Forestier I, Manière MC, Stoetzel C, Bloch-Zupan A, Sire JY. Evolutionary Analysis Predicts Sensitive Positions of MMP20 and Validates Newly- and Previously-Identified MMP20 Mutations Causing Amelogenesis Imperfecta. Front Physiol 2017;8:398. [PMID: 28659819 PMCID: PMC5469888 DOI: 10.3389/fphys.2017.00398] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 05/26/2017] [Indexed: 12/21/2022] Open

Affiliation(s)

Barbara Gasse Institut de Biologie Paris-Seine, UMR 7138-Evolution Paris-Seine, Sorbonne Universités, Université Pierre et Marie CurieParis, France
Megana Prasad Laboratoire de Génétique Médicale, Institut National de la Santé et de la Recherche Médicale UMRS_1112, Institut de Génétique Médicale d'Alsace, FMTS, Université de StrasbourgStrasbourg, France
Sidney Delgado Institut de Biologie Paris-Seine, UMR 7138-Evolution Paris-Seine, Sorbonne Universités, Université Pierre et Marie CurieParis, France
Mathilde Huckert Laboratoire de Génétique Médicale, Institut National de la Santé et de la Recherche Médicale UMRS_1112, Institut de Génétique Médicale d'Alsace, FMTS, Université de StrasbourgStrasbourg, France.,Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France
Marzena Kawczynski Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Pôle de Médecine et Chirurgie Bucco-Dentaires, Centre de Référence des Manifestations Odontologiques des Maladies Rares, O-Rares, Hôpitaux Universitaires de StrasbourgStrasbourg, France
Annelyse Garret-Bernardin Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Unit of Dentistry, IRCCS, Bambino Gesù Children's HospitalRome, Italy
Serena Lopez-Cazaux Faculté de Chirurgie Dentaire, Département d'Odontologie Pédiatrique, Centre de Compétences Maladies Rares, CHU Hôtel Dieu, Service d'odontologie Conservatrice et PédiatriqueNantes, France
Isabelle Bailleul-Forestier Faculté de Chirurgie Dentaire, CHU de Toulouse, Centre de Compétences Maladies Rares, Odontologie Pédiatrique, Université Paul SabatierToulouse, France
Marie-Cécile Manière Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Pôle de Médecine et Chirurgie Bucco-Dentaires, Centre de Référence des Manifestations Odontologiques des Maladies Rares, O-Rares, Hôpitaux Universitaires de StrasbourgStrasbourg, France
Corinne Stoetzel Laboratoire de Génétique Médicale, Institut National de la Santé et de la Recherche Médicale UMRS_1112, Institut de Génétique Médicale d'Alsace, FMTS, Université de StrasbourgStrasbourg, France
Agnès Bloch-Zupan Faculté de Chirurgie Dentaire, Université de StrasbourgStrasbourg, France.,Pôle de Médecine et Chirurgie Bucco-Dentaires, Centre de Référence des Manifestations Odontologiques des Maladies Rares, O-Rares, Hôpitaux Universitaires de StrasbourgStrasbourg, France.,Centre Européen de Recherche en Biologie et en Médecine, Centre National de la Recherche Scientifique UMR7104, Institut National de la Santé et de la Recherche Médicale U964, Institut de Génétique et de Biologie Moléculaire and Cellulaire, Université de StrasbourgIllkirch, France.,Institut d'Etudes Avancées, Université de Strasbourg, USIASStrasbourg, France.,Eastman Dental Institute, University College LondonLondon, United Kingdom
Jean-Yves Sire Institut de Biologie Paris-Seine, UMR 7138-Evolution Paris-Seine, Sorbonne Universités, Université Pierre et Marie CurieParis, France

Collapse

Tollis M, Schiffman JD, Boddy AM. Evolution of cancer suppression as revealed by mammalian comparative genomics. Curr Opin Genet Dev 2017;42:40-47. [DOI: 10.1016/j.gde.2016.12.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Revised: 12/19/2016] [Accepted: 12/21/2016] [Indexed: 02/05/2023]

Spataro N, Rodríguez JA, Navarro A, Bosch E. Properties of human disease genes and the role of genes linked to Mendelian disorders in complex disease aetiology. Hum Mol Genet 2017;26:489-500. [PMID: 28053046 PMCID: PMC5409085 DOI: 10.1093/hmg/ddw405] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Revised: 11/10/2016] [Accepted: 11/23/2016] [Indexed: 01/19/2023] Open

Liu L, Chang Y, Yang T, Noren DP, Long B, Kornblau S, Qutub A, Ye J. Evolution-informed modeling improves outcome prediction for cancers. Evol Appl 2016;10:68-76. [PMID: 28035236 PMCID: PMC5192825 DOI: 10.1111/eva.12417] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 08/17/2016] [Indexed: 12/19/2022] Open

Karim S, NourEldin HF, Abusamra H, Salem N, Alhathli E, Dudley J, Sanderford M, Scheinfeldt LB, Chaudhary AG, Al-Qahtani MH, Kumar S. e-GRASP: an integrated evolutionary and GRASP resource for exploring disease associations. BMC Genomics 2016;17:770. [PMID: 27766955 PMCID: PMC5073857 DOI: 10.1186/s12864-016-3088-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2023] Open

Abstract

Background

Genome-wide association studies (GWAS) have become a mainstay of biological research concerned with discovering genetic variation linked to phenotypic traits and diseases. Both discrete and continuous traits can be analyzed in GWAS to discover associations between single nucleotide polymorphisms (SNPs) and traits of interest. Associations are typically determined by estimating the significance of the statistical relationship between genetic loci and the given trait. However, the prioritization of bona fide, reproducible genetic associations from GWAS results remains a central challenge in identifying genomic loci underlying common complex diseases. Evolutionary-aware meta-analysis of the growing GWAS literature is one way to address this challenge and to advance from association to causation in the discovery of genotype-phenotype relationships.

Description

We have created an evolutionary GWAS resource to enable in-depth query and exploration of published GWAS results. This resource uses the publically available GWAS results annotated in the GRASP2 database. The GRASP2 database includes results from 2082 studies, 177 broad phenotype categories, and ~8.87 million SNP-phenotype associations. For each SNP in e-GRASP, we present information from the GRASP2 database for convenience as well as evolutionary information (e.g., rate and timespan). Users can, therefore, identify not only SNPs with highly significant phenotype-association P-values, but also SNPs that are highly replicated and/or occur at evolutionarily conserved sites that are likely to be functionally important. Additionally, we provide an evolutionary-adjusted SNP association ranking (E-rank) that uses cross-species evolutionary conservation scores and population allele frequencies to transform P-values in an effort to enhance the discovery of SNPs with a greater probability of biologically meaningful disease associations.

Conclusion

By adding an evolutionary dimension to the GWAS results available in the GRASP2 database, our e-GRASP resource will enable a more effective exploration of SNPs not only by the statistical significance of trait associations, but also by the number of studies in which associations have been replicated, and the evolutionary context of the associated mutations. Therefore, e-GRASP will be a valuable resource for aiding researchers in the identification of bona fide, reproducible genetic associations from GWAS results. This resource is freely available at http://www.mypeg.info/egrasp.

Collapse

Genomic insights into ayurvedic and western approaches to personalized medicine. J Genet 2016;95:209-28. [DOI: 10.1007/s12041-015-0607-9] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Comparative sequence analyses of rhodopsin and RPE65 reveal patterns of selective constraint across hereditary retinal disease mutations. Vis Neurosci 2016;33:e002. [DOI: 10.1017/s0952523815000322] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Abstract AbstractRetinitis pigmentosa (RP) comprises several heritable diseases that involve photoreceptor, and ultimately retinal, degeneration. Currently, mutations in over 50 genes have known links to RP. Despite advances in clinical characterization, molecular characterization of RP remains challenging due to the heterogeneous nature of causal genes, mutations, and clinical phenotypes. In this study, we compiled large datasets of two important visual genes associated with RP: rhodopsin, which initiates the phototransduction cascade, and the retinoid isomerase RPE65, which regenerates the visual cycle. We used a comparative evolutionary approach to investigate the relationship between interspecific sequence variation and pathogenic mutations that lead to degenerative retinal disease. Using codon-based likelihood methods, we estimated evolutionary rates (dN/dS) across both genes in a phylogenetic context to investigate differences between pathogenic and nonpathogenic amino acid sites. In both genes, disease-associated sites showed significantly lower evolutionary rates compared to nondisease sites, and were more likely to occur in functionally critical areas of the proteins. The nature of the dataset (e.g., vertebrate or mammalian sequences), as well as selection of pathogenic sites, affected the differences observed between pathogenic and nonpathogenic sites. Our results illustrate that these methods can serve as an intermediate step in understanding protein structure and function in a clinical context, particularly in predicting the relative pathogenicity (i.e., functional impact) of point mutations and their downstream phenotypic effects. Extensions of this approach may also contribute to current methods for predicting the deleterious effects of candidate mutations and to the identification of protein regions under strong constraint where we expect pathogenic mutations to occur. Collapse

Kumar A, Butler BM, Kumar S, Ozkan SB. Integration of structural dynamics and molecular evolution via protein interaction networks: a new era in genomic medicine. Curr Opin Struct Biol 2015;35:135-42. [PMID: 26684487 PMCID: PMC4856467 DOI: 10.1016/j.sbi.2015.11.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Revised: 11/03/2015] [Accepted: 11/05/2015] [Indexed: 01/08/2023]

Miura S, Tate S, Kumar S. Using Disease-Associated Coding Sequence Variation to Investigate Functional Compensation by Human Paralogous Proteins. Evol Bioinform Online 2015;11:245-51. [PMID: 26604664 PMCID: PMC4631161 DOI: 10.4137/ebo.s30594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2015] [Revised: 09/14/2015] [Accepted: 09/18/2015] [Indexed: 11/09/2022] Open

Kumar A, Glembo TJ, Ozkan SB. The Role of Conformational Dynamics and Allostery in the Disease Development of Human Ferritin. Biophys J 2015;109:1273-81. [PMID: 26255589 PMCID: PMC4576160 DOI: 10.1016/j.bpj.2015.06.060] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2015] [Revised: 06/18/2015] [Accepted: 06/30/2015] [Indexed: 12/26/2022] Open

Abstract

Determining the three-dimensional structure of myoglobin, the first solved structure of a protein, fundamentally changed the way protein function was understood. Even more revolutionary was the information that came afterward: protein dynamics play a critical role in biological functions. Therefore, understanding conformational dynamics is crucial to obtaining a more complete picture of protein evolution. We recently analyzed the evolution of different protein families including green fluorescent proteins (GFPs), β-lactamase inhibitors, and nuclear receptors, and we observed that the alteration of conformational dynamics through allosteric regulation leads to functional changes. Moreover, proteome-wide conformational dynamics analysis of more than 100 human proteins showed that mutations occurring at rigid residue positions are more susceptible to disease than flexible residue positions. These studies suggest that disease-associated mutations may impair dynamic allosteric regulations, leading to loss of function. Thus, in this study, we analyzed the conformational dynamics of the wild-type light chain subunit of human ferritin protein along with the neutral and disease forms. We first performed replica exchange molecular dynamics simulations of wild-type and mutants to obtain equilibrated dynamics and then used perturbation response scanning (PRS), where we introduced a random Brownian kick to a position and computed the fluctuation response of the chain using linear response theory. Using this approach, we computed the dynamic flexibility index (DFI) for each position in the chain for the wild-type and the mutants. DFI quantifies the resilience of a position to a perturbation and provides a flexibility/rigidity measurement for a given position in the chain. The DFI analysis reveals that neutral variants and the wild-type exhibit similar flexibility profiles in which experimentally determined functionally critical sites act as hinges in controlling the overall motion. However, disease mutations alter the conformational dynamic profile, making hinges more loose (i.e., softening the hinges), thus impairing the allosterically regulated dynamics.

Collapse

Cheng F, Liu C, Lin CC, Zhao J, Jia P, Li WH, Zhao Z. A Gene Gravity Model for the Evolution of Cancer Genomes: A Study of 3,000 Cancer Genomes across 9 Cancer Types. PLoS Comput Biol 2015;11:e1004497. [PMID: 26352260 PMCID: PMC4564226 DOI: 10.1371/journal.pcbi.1004497] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2015] [Accepted: 08/11/2015] [Indexed: 12/14/2022] Open

Evolutionary analysis of selective constraints identifies ameloblastin (AMBN) as a potential candidate for amelogenesis imperfecta. BMC Evol Biol 2015. [PMID: 26223266 PMCID: PMC4518657 DOI: 10.1186/s12862-015-0431-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Abstract

Background

Ameloblastin (AMBN) is a phosphorylated, proline/glutamine-rich protein secreted during enamel formation. Previous studies have revealed that this enamel matrix protein was present early in vertebrate evolution and certainly plays important roles during enamel formation although its precise functions remain unclear. We performed evolutionary analyses of AMBN in order to (i) identify residues and motifs important for the protein function, (ii) predict mutations responsible for genetic diseases, and (iii) understand its molecular evolution in mammals.

Results

In silico searches retrieved 56 complete sequences in public databases that were aligned and analyzed computationally. We showed that AMBN is globally evolving under moderate purifying selection in mammals and contains a strong phylogenetic signal. In addition, our analyses revealed codons evolving under significant positive selection. Evidence for positive selection acting on AMBN was observed in catarrhine primates and the aye-aye. We also found that (i) an additional translation initiation site was recruited in the ancestral placental AMBN, (ii) a short exon was duplicated several times in various species including catarrhine primates, and (iii) several polyadenylation sites are present.

Conclusions

AMBN possesses many positions, which have been subjected to strong selective pressure for 200 million years. These positions correspond to several cleavage sites and hydroxylated, O-glycosylated, and phosphorylated residues. We predict that these conserved positions would be potentially responsible for enamel disorder if substituted. Some motifs that were previously identified as potentially important functionally were confirmed, and we found two, highly conserved, new motifs, the function of which should be tested in the near future. This study illustrates the power of evolutionary analyses for characterizing the functional constraints acting on proteins with yet uncharacterized structure.

Electronic supplementary material

The online version of this article (doi:10.1186/s12862-015-0431-0) contains supplementary material, which is available to authorized users.

Collapse

Webb AE, Gerek ZN, Morgan CC, Walsh TA, Loscher CE, Edwards SV, O'Connell MJ. Adaptive Evolution as a Predictor of Species-Specific Innate Immune Response. Mol Biol Evol 2015;32:1717-29. [PMID: 25758009 PMCID: PMC4476151 DOI: 10.1093/molbev/msv051] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Butler BM, Gerek ZN, Kumar S, Ozkan SB. Conformational dynamics of nonsynonymous variants at protein interfaces reveals disease association. Proteins 2015;83:428-35. [PMID: 25546381 DOI: 10.1002/prot.24748] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2014] [Revised: 11/20/2014] [Accepted: 12/10/2014] [Indexed: 12/12/2022]

Carroll CJ, Brilhante V, Suomalainen A. Next-generation sequencing for mitochondrial disorders. Br J Pharmacol 2014;171:1837-53. [PMID: 24138576 DOI: 10.1111/bph.12469] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2013] [Revised: 10/03/2013] [Accepted: 10/13/2013] [Indexed: 12/30/2022] Open

Whole-genome sequencing of the snub-nosed monkey provides insights into folivory and evolutionary history. Nat Genet 2014;46:1303-10. [DOI: 10.1038/ng.3137] [Citation(s) in RCA: 137] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2014] [Accepted: 10/09/2014] [Indexed: 11/08/2022]

Silvent J, Gasse B, Mornet E, Sire JY. Molecular evolution of the tissue-nonspecific alkaline phosphatase allows prediction and validation of missense mutations responsible for hypophosphatasia. J Biol Chem 2014;289:24168-79. [PMID: 25023282 DOI: 10.1074/jbc.m114.576843] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open

Cheng F, Jia P, Wang Q, Lin CC, Li WH, Zhao Z. Studying tumorigenesis through network evolution and somatic mutational perturbations in the cancer interactome. Mol Biol Evol 2014;31:2156-69. [PMID: 24881052 DOI: 10.1093/molbev/msu167] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Vona B, Hofrichter MAH, Neuner C, Schröder J, Gehrig A, Hennermann JB, Kraus F, Shehata-Dieler W, Klopocki E, Nanda I, Haaf T. DFNB16 is a frequent cause of congenital hearing impairment: implementation of STRC mutation analysis in routine diagnostics. Clin Genet 2014;87:49-55. [PMID: 26011646 PMCID: PMC4302246 DOI: 10.1111/cge.12332] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2013] [Revised: 11/26/2013] [Accepted: 12/12/2013] [Indexed: 11/29/2022]

Phylogenetic Gaussian process model for the inference of functionally important regions in protein tertiary structures. PLoS Comput Biol 2014;10:e1003429. [PMID: 24453956 PMCID: PMC3894161 DOI: 10.1371/journal.pcbi.1003429] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2013] [Accepted: 11/22/2013] [Indexed: 11/30/2022] Open

Abstract

A critical question in biology is the identification of functionally important amino acid sites in proteins. Because functionally important sites are under stronger purifying selection, site-specific substitution rates tend to be lower than usual at these sites. A large number of phylogenetic models have been developed to estimate site-specific substitution rates in proteins and the extraordinarily low substitution rates have been used as evidence of function. Most of the existing tools, e.g. Rate4Site, assume that site-specific substitution rates are independent across sites. However, site-specific substitution rates may be strongly correlated in the protein tertiary structure, since functionally important sites tend to be clustered together to form functional patches. We have developed a new model, GP4Rate, which incorporates the Gaussian process model with the standard phylogenetic model to identify slowly evolved regions in protein tertiary structures. GP4Rate uses the Gaussian process to define a nonparametric prior distribution of site-specific substitution rates, which naturally captures the spatial correlation of substitution rates. Simulations suggest that GP4Rate can potentially estimate site-specific substitution rates with a much higher accuracy than Rate4Site and tends to report slowly evolved regions rather than individual sites. In addition, GP4Rate can estimate the strength of the spatial correlation of substitution rates from the data. By applying GP4Rate to a set of mammalian B7-1 genes, we found a highly conserved region which coincides with experimental evidence. GP4Rate may be a useful tool for the in silico prediction of functionally important regions in the proteins with known structures.

To understand how a protein functions, a critical step is to know which regions in its protein tertiary structure may be functionally important. Functionally important protein regions are typically more conserved than other regions because mutations in these regions are more likely to be deleterious. A number of phylogenetic models have been developed to identify conserved sites or regions in proteins by comparing protein sequences from multiple species. However, most of these methods treat amino acid sites independently and do not consider the spatial clustering of conserved sites in the protein tertiary structure. Therefore, their power of identifying functional protein regions is limited. We develop a new statistical model, GP4Rate, which combines the information from the protein sequences and the protein tertiary structure to infer conserved regions. We demonstrate that GP4Rate outperforms Rate4Site, the most widely used phylogenetic software for inferring functional amino acid sites, via simulations with a case study of B7-1 genes. GP4Rate is a potentially useful tool for guiding mutagenesis experiments or providing insights on the relationship between protein structures and functions.

Collapse

Stecher G, Liu L, Sanderford M, Peterson D, Tamura K, Kumar S. MEGA-MD: molecular evolutionary genetics analysis software with mutational diagnosis of amino acid variation. Bioinformatics 2014;30:1305-7. [PMID: 24413669 DOI: 10.1093/bioinformatics/btu018] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Preeprem T, Gibson G. An association-adjusted consensus deleterious scheme to classify homozygous Mis-sense mutations for personal genome interpretation. BioData Min 2013;6:24. [PMID: 24365473 PMCID: PMC3892026 DOI: 10.1186/1756-0381-6-24] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2013] [Accepted: 12/17/2013] [Indexed: 11/22/2022] Open

Abstract

BACKGROUND

Personal genome analysis is now being considered for evaluation of disease risk in healthy individuals, utilizing both rare and common variants. Multiple scores have been developed to predict the deleteriousness of amino acid substitutions, using information on the allele frequencies, level of evolutionary conservation, and averaged structural evidence. However, agreement among these scores is limited and they likely over-estimate the fraction of the genome that is deleterious.

METHOD

This study proposes an integrative approach to identify a subset of homozygous non-synonymous single nucleotide polymorphisms (nsSNPs). An 8-level classification scheme is constructed from the presence/absence of deleterious predictions combined with evidence of association with disease or complex traits. Detailed literature searches and structural validations are then performed for a subset of homozygous 826 mis-sense mutations in 575 proteins found in the genomes of 12 healthy adults.

RESULTS

Implementation of the Association-Adjusted Consensus Deleterious Scheme (AACDS) classifies 11% of all predicted highly deleterious homozygous variants as most likely to influence disease risk. The number of such variants per genome ranges from 0 to 8 with no significant difference between African and Caucasian Americans. Detailed analysis of mutations affecting the APOE, MTMR2, THSB1, CHIA, αMyHC, and AMY2A proteins shows how the protein structure is likely to be disrupted, even though the associated phenotypes have not been documented in the corresponding individuals.

CONCLUSIONS

The classification system for homozygous nsSNPs provides an opportunity to systematically rank nsSNPs based on suggestive evidence from annotations and sequence-based predictions. The ranking scheme, in-depth literature searches, and structural validations of highly prioritized mis-sense mutations compliment traditional sequence-based approaches and should have particular utility for the development of individualized health profiles. An online tool reporting the AACDS score for any variant is provided at the authors' website.

Collapse

Gemovic B, Perovic V, Glisic S, Veljkovic N. Feature-based classification of amino acid substitutions outside conserved functional protein domains. ScientificWorldJournal 2013;2013:948617. [PMID: 24348198 PMCID: PMC3855963 DOI: 10.1155/2013/948617] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 09/24/2013] [Indexed: 01/01/2023] Open

Christodoulou K, Wiskin AE, Gibson J, Tapper W, Willis C, Afzal NA, Upstill-Goddard R, Holloway JW, Simpson MA, Beattie RM, Collins A, Ennis S. Next generation exome sequencing of paediatric inflammatory bowel disease patients identifies rare and novel variants in candidate genes. Gut 2013;62:977-84. [PMID: 22543157 PMCID: PMC3686259 DOI: 10.1136/gutjnl-2011-301833] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Abstract

BACKGROUND

Multiple genes have been implicated by association studies in altering inflammatory bowel disease (IBD) predisposition. Paediatric patients often manifest more extensive disease and a particularly severe disease course. It is likely that genetic predisposition plays a more substantial role in this group.

OBJECTIVE

To identify the spectrum of rare and novel variation in known IBD susceptibility genes using exome sequencing analysis in eight individual cases of childhood onset severe disease.

DESIGN

DNA samples from the eight patients underwent targeted exome capture and sequencing. Data were processed through an analytical pipeline to align sequence reads, conduct quality checks, and identify and annotate variants where patient sequence differed from the reference sequence. For each patient, the entire complement of rare variation within strongly associated candidate genes was catalogued.

RESULTS

Across the panel of 169 known IBD susceptibility genes, approximately 300 variants in 104 genes were found. Excluding splicing and HLA-class variants, 58 variants across 39 of these genes were classified as rare, with an alternative allele frequency of <5%, of which 17 were novel. Only two patients with early onset Crohn's disease exhibited rare deleterious variations within NOD2: the previously described R702W variant was the sole NOD2 variant in one patient, while the second patient also carried the L1007 frameshift insertion. Both patients harboured other potentially damaging mutations in the GSDMB, ERAP2 and SEC16A genes. The two patients severely affected with ulcerative colitis exhibited a distinct profile: both carried potentially detrimental variation in the BACH2 and IL10 genes not seen in other patients.

CONCLUSION

For each of the eight individuals studied, all non-synonymous, truncating and frameshift mutations across all known IBD genes were identified. A unique profile of rare and potentially damaging variants was evident for each patient with this complex disease.

Collapse

Affiliation(s)

Katja Christodoulou Genetic Epidemiology and Genomic Informatics Group, Human Genetics & Genomic Medicine, Faculty of Medicine, University of Southampton, Duthie Building (Mailpoint 808), University Hospital Southampton NHS Foundation Trust, Southampton, UK
Anthony E Wiskin NIHR Biomedical Research Unit (Nutrition, Diet & Lifestyle), University Hospital Southampton NHS Foundation Trust, Mailpoint 218, Southampton General Hospital, Tremona Road, Southampton, UK
Jane Gibson Genetic Epidemiology and Genomic Informatics Group, Human Genetics & Genomic Medicine, Faculty of Medicine, University of Southampton, Duthie Building (Mailpoint 808), University Hospital Southampton NHS Foundation Trust, Southampton, UK
William Tapper Genetic Epidemiology and Genomic Informatics Group, Human Genetics & Genomic Medicine, Faculty of Medicine, University of Southampton, Duthie Building (Mailpoint 808), University Hospital Southampton NHS Foundation Trust, Southampton, UK
Claire Willis NIHR Biomedical Research Unit (Nutrition, Diet & Lifestyle), University Hospital Southampton NHS Foundation Trust, Mailpoint 218, Southampton General Hospital, Tremona Road, Southampton, UK
Nadeem A Afzal Paediatric Medical Unit, University Hospital Southampton NHS Foundation Trust, Southampton General Hospital, Tremona Road, Southampton, UK
Rosanna Upstill-Goddard Genetic Epidemiology and Genomic Informatics Group, Human Genetics & Genomic Medicine, Faculty of Medicine, University of Southampton, Duthie Building (Mailpoint 808), University Hospital Southampton NHS Foundation Trust, Southampton, UK
John W Holloway Human Genetics & Genomic Medicine, Human Genetics, Faculty of Medicine, University of Southampton Duthie Building (Mailpoint 808), University Hospital Southampton NHS Foundation Trust, Southampton, SO16 6YD, UK
Michael A Simpson Division of Genetics and Molecular Medicine, King's College London School of Medicine, Guy's Hospital, London, UK
R Mark Beattie Paediatric Medical Unit, University Hospital Southampton NHS Foundation Trust, Southampton General Hospital, Tremona Road, Southampton, UK
Andrew Collins Genetic Epidemiology and Genomic Informatics Group, Human Genetics & Genomic Medicine, Faculty of Medicine, University of Southampton, Duthie Building (Mailpoint 808), University Hospital Southampton NHS Foundation Trust, Southampton, UK
Sarah Ennis Genetic Epidemiology and Genomic Informatics Group, Human Genetics & Genomic Medicine, Faculty of Medicine, University of Southampton, Duthie Building (Mailpoint 808), University Hospital Southampton NHS Foundation Trust, Southampton, UK

Collapse

Kotaru AR, Shameer K, Sundaramurthy P, Joshi RC. An improved hypergeometric probability method for identification of functionally linked proteins using phylogenetic profiles. Bioinformation 2013;9:368-74. [PMID: 23750082 PMCID: PMC3669790 DOI: 10.6026/97320630009368] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Accepted: 03/06/2013] [Indexed: 12/04/2022] Open

Effect of genetic regions on the correlation between single point mutation variability and morbidity. Comput Biol Med 2013;43:594-9. [DOI: 10.1016/j.compbiomed.2013.01.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2011] [Revised: 07/27/2012] [Accepted: 01/19/2013] [Indexed: 11/19/2022]

Kirwan JD, Bekaert M, Commins JM, Davies KTJ, Rossiter SJ, Teeling EC. A phylomedicine approach to understanding the evolution of auditory sensory perception and disease in mammals. Evol Appl 2013;6:412-22. [PMID: 23745134 PMCID: PMC3673470 DOI: 10.1111/eva.12047] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2012] [Accepted: 12/21/2012] [Indexed: 01/31/2023] Open

Liu L, Kumar S. Evolutionary balancing is critical for correctly forecasting disease-associated amino acid variants. Mol Biol Evol 2013;30:1252-7. [PMID: 23462317 DOI: 10.1093/molbev/mst037] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

Kindt ASD, Navarro P, Semple CAM, Haley CS. The genomic signature of trait-associated variants. BMC Genomics 2013;14:108. [PMID: 23418889 PMCID: PMC3600003 DOI: 10.1186/1471-2164-14-108] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2012] [Accepted: 02/11/2013] [Indexed: 02/06/2023] Open

Abstract

BACKGROUND

Genome-wide association studies have identified thousands of SNP variants associated with hundreds of phenotypes. For most associations the causal variants and the molecular mechanisms underlying pathogenesis remain unknown. Exploration of the underlying functional annotations of trait-associated loci has thrown some light on their potential roles in pathogenesis. However, there are some shortcomings of the methods used to date, which may undermine efforts to prioritize variants for further analyses. Here, we introduce and apply novel methods to rigorously identify annotation classes showing enrichment or depletion of trait-associated variants taking into account the underlying associations due to co-location of different functional annotations and linkage disequilibrium.

RESULTS

We assessed enrichment and depletion of variants in publicly available annotation classes such as genic regions, regulatory features, measures of conservation, and patterns of histone modifications. We used logistic regression to build a multivariate model that identified the most influential functional annotations for trait-association status of genome-wide significant variants. SNPs associated with all of the enriched annotations were 8 times more likely to be trait-associated variants than SNPs annotated with none of them. Annotations associated with chromatin state together with prior knowledge of the existence of a local expression QTL (eQTL) were the most important factors in the final logistic regression model. Surprisingly, despite the widespread use of evolutionary conservation to prioritize variants for study we find only modest enrichment of trait-associated SNPs in conserved regions.

CONCLUSION

We established odds ratios of functional annotations that are more likely to contain significantly trait-associated SNPs, for the purpose of prioritizing GWAS hits for further studies. Additionally, we estimated the relative and combined influence of the different genomic annotations, which may facilitate future prioritization methods by adding substantial information.

Collapse

Nevin Gerek Z, Kumar S, Banu Ozkan S. Structural dynamics flexibility informs function and evolution at a proteome scale. Evol Appl 2013;6:423-33. [PMID: 23745135 PMCID: PMC3673471 DOI: 10.1111/eva.12052] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2012] [Accepted: 01/13/2013] [Indexed: 01/04/2023] Open