101
|
Martinelli S, Torreri P, Tinti M, Stella L, Bocchinfuso G, Flex E, Grottesi A, Ceccarini M, Palleschi A, Cesareni G, Castagnoli L, Petrucci TC, Gelb BD, Tartaglia M. Diverse driving forces underlie the invariant occurrence of the T42A, E139D, I282V and T468M SHP2 amino acid substitutions causing Noonan and LEOPARD syndromes. Hum Mol Genet 2008; 17:2018-29. [PMID: 18372317 PMCID: PMC2900904 DOI: 10.1093/hmg/ddn099] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2008] [Accepted: 03/25/2008] [Indexed: 01/22/2023] Open
Abstract
Missense PTPN11 mutations cause Noonan and LEOPARD syndromes (NS and LS), two developmental disorders with pleiomorphic phenotypes. PTPN11 encodes SHP2, an SH2 domain-containing protein tyrosine phosphatase functioning as a signal transducer. Generally, different substitutions of a particular amino acid residue are observed in these diseases, indicating that the crucial factor is the residue being replaced. For a few codons, only one substitution is observed, suggesting the possibility of specific roles for the residue introduced. We analyzed the biochemical behavior and ligand-binding properties of all possible substitutions arising from single-base changes affecting codons 42, 139, 279, 282 and 468 to investigate the mechanisms underlying the invariant occurrence of the T42A, E139D and I282V substitutions in NS and the Y279C and T468M changes in LS. Our data demonstrate that the isoleucine-to-valine change at codon 282 is the only substitution at that position perturbing the stability of SHP2's closed conformation without impairing catalysis, while the threonine-to-alanine change at codon 42, but not other substitutions of that residue, promotes increased phosphopeptide-binding affinity. The recognition specificity of the C-SH2 domain bearing the E139D substitution differed substantially from its wild-type counterpart acquiring binding properties similar to those observed for the N-SH2 domain, revealing a novel mechanism of SHP2's functional dysregulation. Finally, while functional selection does not seem to occur for the substitutions at codons 279 and 468, we point to deamination of the methylated cytosine at nucleotide 1403 as the driving factor leading to the high prevalence of the T468M change in LS.
Collapse
Affiliation(s)
- Simone Martinelli
- Dipartimento di Biologia Cellulare e Neuroscienze, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
| | - Paola Torreri
- Dipartimento di Biologia Cellulare e Neuroscienze, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
| | - Michele Tinti
- Dipartimento di Biologia, Università di Roma ‘Tor Vergata’, Via della Ricerca Scientifica s.n.c., 00133 Rome, Italy
| | - Lorenzo Stella
- Dipartimento di Scienze e Tecnologie Chimiche, Università di Roma ‘Tor Vergata’, Via della Ricerca Scientifica s.n.c, 00133 Rome, Italy
| | - Gianfranco Bocchinfuso
- Dipartimento di Scienze e Tecnologie Chimiche, Università di Roma ‘Tor Vergata’, Via della Ricerca Scientifica s.n.c, 00133 Rome, Italy
| | - Elisabetta Flex
- Dipartimento di Biologia Cellulare e Neuroscienze, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
| | - Alessandro Grottesi
- Consortium for the Application of Super-Computing for Universities and Research (CASPUR), Via dei Tizii 6, 00185 Rome, Italy
| | - Marina Ceccarini
- Dipartimento di Biologia Cellulare e Neuroscienze, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
| | - Antonio Palleschi
- Dipartimento di Scienze e Tecnologie Chimiche, Università di Roma ‘Tor Vergata’, Via della Ricerca Scientifica s.n.c, 00133 Rome, Italy
| | - Gianni Cesareni
- Dipartimento di Biologia, Università di Roma ‘Tor Vergata’, Via della Ricerca Scientifica s.n.c., 00133 Rome, Italy
| | - Luisa Castagnoli
- Dipartimento di Biologia, Università di Roma ‘Tor Vergata’, Via della Ricerca Scientifica s.n.c., 00133 Rome, Italy
| | - Tamara C. Petrucci
- Dipartimento di Biologia Cellulare e Neuroscienze, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
| | | | - Marco Tartaglia
- Dipartimento di Biologia Cellulare e Neuroscienze, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy
| |
Collapse
|
102
|
Jackson HA, Accili EA. Evolutionary analyses of KCNQ1 and HERG voltage-gated potassium channel sequences reveal location-specific susceptibility and augmented chemical severities of arrhythmogenic mutations. BMC Evol Biol 2008; 8:188. [PMID: 18590565 PMCID: PMC2483723 DOI: 10.1186/1471-2148-8-188] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2008] [Accepted: 06/30/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mutations in HERG and KCNQ1 potassium channels have been associated with Long QT syndrome and atrial fibrillation, and more recently with sudden infant death syndrome and sudden unexplained death. In other proteins, disease-associated amino acid mutations have been analyzed according to the chemical severity of the changes and the locations of the altered amino acids according to their conservation over metazoan evolution. Here, we present the first such analysis of arrhythmia-associated mutations (AAMs) in the HERG and KCNQ1 potassium channels. RESULTS Using evolutionary analyses, AAMs in HERG and KCNQ1 were preferentially found at evolutionarily conserved sites and unevenly distributed among functionally conserved domains. Non-synonymous single nucleotide polymorphisms (nsSNPs) are under-represented at evolutionarily conserved sites in HERG, but distribute randomly in KCNQ1. AAMs are chemically more severe, according to Grantham's Scale, than changes observed in evolution and their severity correlates with the expected chemical severity of the involved codon. Expected chemical severity of a given amino acid also correlates with its relative contribution to arrhythmias. At evolutionarily variable sites, the chemical severity of the changes is also correlated with the expected chemical severity of the involved codon. CONCLUSION Unlike nsSNPs, AAMs preferentially locate to evolutionarily conserved, and functionally important, sites and regions within HERG and KCNQ1, and are chemically more severe than changes which occur in evolution. Expected chemical severity may contribute to the overrepresentation of certain residues in AAMs, as well as to evolutionary change.
Collapse
Affiliation(s)
- Heather A Jackson
- Department of Cellular and Physiological Sciences, Faculty of Medicine, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada.
| | | |
Collapse
|
103
|
Cooper DN, Stenson PD, Chuzhanova NA. The Human Gene Mutation Database (HGMD) and its exploitation in the study of mutational mechanisms. ACTA ACUST UNITED AC 2008; Chapter 1:Unit 1.13. [PMID: 18428754 DOI: 10.1002/0471250953.bi0113s12] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The Human Gene Mutation Database (HGMD) constitutes a comprehensive core collection of data on germ-line mutations in nuclear genes underlying or associated with human inherited disease (http://www.hgmd.org). Data cataloged include single base-pair substitutions in coding, regulatory, and splicing-relevant regions, microdeletions and microinsertions, indels, and triplet repeat expansions, as well as gross gene deletions, insertions, duplications, and complex rearrangements. Each mutation is entered into HGMD only once, in order to avoid confusion between recurrent and identical-by-descent lesions. By June 2005, the database contained in excess of 53,000 different lesions detected in 2029 different nuclear genes, with new entries currently accumulating at a rate in excess of 5000 per annum. HGMD includes cDNA reference sequences, now provided for more than 90% of the listed genes, splice junction data, disease-associated and functional polymorphisms, and links to data present in publicly available online locus-specific mutation databases.
Collapse
|
104
|
Meurs KM, Mealey KL. Evaluation of the flanking nucleotide sequences of sarcomeric hypertrophic cardiomyopathy substitution mutations. Mutat Res 2008; 642:86-9. [PMID: 18539302 DOI: 10.1016/j.mrfmmm.2008.04.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2008] [Revised: 04/15/2008] [Accepted: 04/16/2008] [Indexed: 11/28/2022]
Abstract
Hypertrophic cardiomyopathy (HCM) is a familial myocardial disease with a prevalence of 1 in 500. More than 400 causative mutations have been identified in 13 sarcomeric and myofilament related genes, 350 of these are substitution mutations within eight sarcomeric genes. Within a population, examples of recurring identical disease causing mutations that appear to have arisen independently have been noted as well as those that appear to have been inherited from a common ancestor. The large number of novel HCM mutations could suggest a mechanism of increased mutability within the sarcomeric genes. The objective of this study was to evaluate the most commonly reported HCM genes, beta myosin heavy chain (MYH7), myosin binding protein C, troponin I, troponin T, cardiac regulatory myosin light chain, cardiac essential myosin light chain, alpha tropomyosin and cardiac alpha-actin for sequence patterns surrounding the substitution mutations that may suggest a mechanism of increased mutability. The mutations as well as the 10 flanking nucleotides were evaluated for frequency of di-, tri- and tetranucleotides containing the mutation as well as for the presence of certain tri- and tetranculeotide motifs. The most common substitutions were guanine (G) to adenine (A) and cytosine (C) to thymidine (T). The CG dinucleotide had a significantly higher relative mutability than any other dinucleotide (p<0.05). The relative mutability of each possible trinucleotide and tetranucleotide sequence containing the mutation was calculated; none were at a statistically higher frequency than the others. The large number of G to A and C to T mutations as well as the relative mutability of CG may suggest that deamination of methylated CpG is an important mechanism for mutation development in at least some of these cardiac genes.
Collapse
Affiliation(s)
- Kathryn M Meurs
- Department of Veterinary Clinical Sciences, Washington State University College of Veterinary Medicine, Grimes Street, Pullman, WA 99164, United States.
| | | |
Collapse
|
105
|
Nilsen H, Hayes B, Berg PR, Roseth A, Sundsaasen KK, Nilsen K, Lien S. Construction of a dense SNP map for bovine chromosome 6 to assist the assembly of the bovine genome sequence. Anim Genet 2008; 39:97-104. [PMID: 18307581 DOI: 10.1111/j.1365-2052.2007.01686.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
A linkage map was constructed for bovine chromosome 6 (BTA6), using 399 single nucleotide polymorphisms (SNPs) detected primarily from PCR-resequencing. The efficiency of SNP detection was highly dependent on the source of sequence information chosen for primer design (BAC-end sequences, introns or promoters). The SNPs were used to build a linkage map comprising 104 cM on BTA6. The SNP order in the linkage map corresponded very well with radiation hybrid (RH) maps available for BTA6 as well as with expected positions in the human comparative map, but diverged significantly from the current assembly of the bovine genome (Btau_3.1). When performing linkage analysis with the marker order suggested from the Btau_3.1 we observed an expansion of the genetic map from 104 cM to 137 cM, strongly suggesting a reordering of scaffolds in the current version of the bovine genome assembly. The extent of LD on BTA6 was evaluated by calculating the average r(2) for SNP pairs separated by given distances. The decline of LD was rapid with distance, such that r(2) was 0.1 at 100 kb. Our results indicate that linkage mapping will be a valuable source of information for correcting errors in the current bovine assembly. These errors were sufficiently frequent to be of concern for the accuracy of mapping QTL with panels of SNPs whose positions are based on the current assembly.
Collapse
Affiliation(s)
- H Nilsen
- Department of Animal and Aquacultural Sciences, The Norwegian University of Life Sciences, N-1432 Aas, Norway
| | | | | | | | | | | | | |
Collapse
|
106
|
Orten DJ, Fischer SM, Sorensen JL, Radhakrishna U, Cremers CW, Marres HA, Van Camp G, Welch KO, Smith RJ, Kimberling WJ. Branchio-oto-renal syndrome (BOR): novel mutations in theEYA1gene, and a review of the mutational genetics of BOR. Hum Mutat 2008; 29:537-44. [DOI: 10.1002/humu.20691] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
107
|
Roca X, Olson AJ, Rao AR, Enerly E, Kristensen VN, Børresen-Dale AL, Andresen BS, Krainer AR, Sachidanandam R. Features of 5'-splice-site efficiency derived from disease-causing mutations and comparative genomics. Genome Res 2007; 18:77-87. [PMID: 18032726 DOI: 10.1101/gr.6859308] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Many human diseases, including Fanconi anemia, hemophilia B, neurofibromatosis, and phenylketonuria, can be caused by 5'-splice-site (5'ss) mutations that are not predicted to disrupt splicing, according to position weight matrices. By using comparative genomics, we identify pairwise dependencies between 5'ss nucleotides as a conserved feature of the entire set of 5'ss. These dependencies are also conserved in human-mouse pairs of orthologous 5'ss. Many disease-associated 5'ss mutations disrupt these dependencies, as can some human SNPs that appear to alter splicing. The consistency of the evidence signifies the relevance of this approach and suggests that 5'ss SNPs play a role in complex diseases.
Collapse
Affiliation(s)
- Xavier Roca
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
108
|
Guan X, Madabushi A, Chang DY, Fitzgerald ME, Shi G, Drohat AC, Lu AL. The human checkpoint sensor Rad9-Rad1-Hus1 interacts with and stimulates DNA repair enzyme TDG glycosylase. Nucleic Acids Res 2007; 35:6207-18. [PMID: 17855402 PMCID: PMC2094074 DOI: 10.1093/nar/gkm678] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Human (h) DNA repair enzyme thymine DNA glycosylase (hTDG) is a key DNA glycosylase in the base excision repair (BER) pathway that repairs deaminated cytosines and 5-methyl-cytosines. The cell cycle checkpoint protein Rad9–Rad1–Hus1 (the 9-1-1 complex) is the surveillance machinery involved in the preservation of genome stability. In this study, we show that hTDG interacts with hRad9, hRad1 and hHus1 as individual proteins and as a complex. The hHus1 interacting domain is mapped to residues 67–110 of hTDG, and Val74 of hTDG plays an important role in the TDG–Hus1 interaction. In contrast to the core domain of hTDG (residues 110–308), hTDG(67–308) removes U and T from U/G and T/G mispairs, respectively, with similar rates as native hTDG. Human TDG activity is significantly stimulated by hHus1, hRad1, hRad9 separately, and by the 9-1-1 complex. Interestingly, the interaction between hRad9 and hTDG, as detected by co-immunoprecipitation (Co-IP), is enhanced following N-methyl-N′-nitro-N-nitrosoguanidine (MNNG) treatment. A significant fraction of the hTDG nuclear foci co-localize with hRad9 foci in cells treated with methylating agents. Thus, the 9-1-1 complex at the lesion sites serves as both a damage sensor to activate checkpoint control and a component of the BER.
Collapse
Affiliation(s)
| | | | | | | | | | | | - A-Lien Lu
- *To whom correspondence should be addressed. +1 410 706 4356+1 410 706 1787
| |
Collapse
|
109
|
Cheung LWT, Lee YF, Ng TW, Ching WK, Khoo US, Ng MKP, Wong AST. CpG/CpNpG motifs in the coding region are preferred sites for mutagenesis in the breast cancer susceptibility genes. FEBS Lett 2007; 581:4668-74. [PMID: 17826769 DOI: 10.1016/j.febslet.2007.08.061] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2007] [Revised: 08/21/2007] [Accepted: 08/26/2007] [Indexed: 10/22/2022]
Abstract
The range of BRCA1/BRCA2 gene mutations is diverse and the mechanism accounting for this heterogeneity is obscure. To gain insight into the endogenous mutational mechanisms involved, we evaluated the association of specific sequences (i.e. CpG/CpNpG motifs, homonucleotides, short repeats) and mutations within the genes. We classified 1337 published mutations in BRCA1 (1765 BRCA2 mutations) for each specific sequence, and employed computer simulation combined with mathematical calculations to estimate the true underlying tendency of mutation occurrence. Interestingly, we found no mutational bias to homonucleotides and repeats in deletions/insertions and substitutions but striking bias to CpG/CpNpG in substitutions in both genes. This suggests that methylation-dependent DNA alterations would be a major mechanism for mutagenesis.
Collapse
Affiliation(s)
- Lydia W T Cheung
- School of Biological Sciences, University of Hong Kong, Hong Kong
| | | | | | | | | | | | | |
Collapse
|
110
|
Khan S, Vihinen M. Spectrum of disease-causing mutations in protein secondary structures. BMC STRUCTURAL BIOLOGY 2007; 7:56. [PMID: 17727703 PMCID: PMC1995201 DOI: 10.1186/1472-6807-7-56] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/28/2006] [Accepted: 08/29/2007] [Indexed: 11/10/2022]
Abstract
BACKGROUND Most genetic disorders are linked to missense mutations as even minor changes in the size or properties of an amino acid can alter or prevent the function of the protein. Further, the effect of a mutation is also dependent on the sequence and structure context of the alteration. RESULTS We investigated the spectrum of disease-causing missense mutations in secondary structure elements in proteins with numerous known mutations and for which an experimentally defined three-dimensional structure is available. We obtained a comprehensive map of the differences in mutation frequencies, location and contact energies, and the changes in residue volume and charge - both in the mutated (original) amino acids and in the mutant amino acids in the different secondary structure types. We collected information for 44 different proteins involved in a large number of diseases. The studied proteins contained a total of 2413 mutations of which 1935 (80%) appeared in secondary structures. Differences in mutation patterns between secondary structures and whole proteins were generally not statistically significant whereas within the secondary structural elements numerous highly significant features were observed. CONCLUSION Numerous trends in mutated and mutant amino acids are apparent. Among the original residues, arginine clearly has the highest relative mutability. The overall relative mutability among mutant residues is highest for cysteine and tryptophan. The mutability values are higher for mutated residues than for mutant residues. Arginine and glycine are among the most mutated residues in all secondary structures whereas the other amino acids have large variations in mutability between structure types. Statistical analysis was used to reveal trends in different secondary structural elements, residue types as well as for the charge and volume changes.
Collapse
Affiliation(s)
- Sofia Khan
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
| | - Mauno Vihinen
- Institute of Medical Technology, FI-33014 University of Tampere, Finland
- Research Unit, Tampere University Hospital, FI-33520 Tampere, Finland
| |
Collapse
|
111
|
Zhang W, Bouffard GG, Wallace SS, Bond JP. Estimation of DNA sequence context-dependent mutation rates using primate genomic sequences. J Mol Evol 2007; 65:207-14. [PMID: 17676366 DOI: 10.1007/s00239-007-9000-5] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2004] [Accepted: 11/30/2006] [Indexed: 10/23/2022]
Abstract
It is understood that DNA and amino acid substitution rates are highly sequence context-dependent, e.g., C --> T substitutions in vertebrates may occur much more frequently at CpG sites and that cysteine substitution rates may depend on support of the context for participation in a disulfide bond. Furthermore, many applications rely on quantitative models of nucleotide or amino acid substitution, including phylogenetic inference and identification of amino acid sequence positions involved in functional specificity. We describe quantification of the context dependence of nucleotide substitution rates using baboon, chimpanzee, and human genomic sequence data generated by the NISC Comparative Sequencing Program. Relative mutation rates are reported for the 96 classes of mutations of the form 5' alphabetagamma 3' --> 5' alphadeltagamma 3', where alpha, beta, gamma, and delta are nucleotides and beta not equal delta, based on maximum likelihood calculations. Our results confirm that C --> T substitutions are enhanced at CpG sites compared with other transitions, relatively independent of the identity of the preceding nucleotide. While, as expected, transitions generally occur more frequently than transversions, we find that the most frequent transversions involve the C at CpG sites (CpG transversions) and that their rate is comparable to the rate of transitions at non-CpG sites. A four-class model of the rates of context-dependent evolution of primate DNA sequences, CpG transitions > non-CpG transitions approximately CpG transversions > non-CpG transversions, captures qualitative features of the mutation spectrum. We find that despite qualitative similarity of mutation rates among different genomic regions, there are statistically significant differences.
Collapse
Affiliation(s)
- Wei Zhang
- Department of Medicine, University of Chicago, 515 CLSC, Chicago, IL 60637, USA
| | | | | | | |
Collapse
|
112
|
Wu H, Ma BG, Zhao JT, Zhang HY. How similar are amino acid mutations in human genetic diseases and evolution. Biochem Biophys Res Commun 2007; 362:233-7. [PMID: 17681277 DOI: 10.1016/j.bbrc.2007.07.141] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2007] [Accepted: 07/24/2007] [Indexed: 10/23/2022]
Abstract
Accumulating evidence indicates that some deleterious mutations responsible for genetic diseases may offer benefits for human to prevent other diseases. Therefore, human genetic diseases and evolution were tentatively regarded as the two sides of the same coin, which stimulated our interest to explore how similar are amino acid mutations in human genetic diseases and evolution. Through a large-scale analysis on amino acid mutation patterns of genetic diseases and evolution of Hominidae (Homo sapiens and Pan troglodytes), it was found that there exist significant correlations between two mutation patterns. Besides, there also exist some evident differences between both mutations, especially those associated with four amino acids C, G, R, and L. These findings are of significance to understanding the subtle connections between human genetic diseases and evolution.
Collapse
Affiliation(s)
- Hao Wu
- Shandong Provincial Research Center for Bioinformatic Engineering and Technique, Center for Advanced Study, Shandong University of Technology, Zibo 255049, PR China
| | | | | | | |
Collapse
|
113
|
Buratti E, Chivers M, Královičová J, Romano M, Baralle M, Krainer AR, Vořechovský I. Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization. Nucleic Acids Res 2007; 35:4250-63. [PMID: 17576681 PMCID: PMC1934990 DOI: 10.1093/nar/gkm402] [Citation(s) in RCA: 151] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Despite a growing number of splicing mutations found in hereditary diseases, utilization of aberrant splice sites and their effects on gene expression remain challenging to predict. We compiled sequences of 346 aberrant 5′splice sites (5′ss) that were activated by mutations in 166 human disease genes. Mutations within the 5′ss consensus accounted for 254 cryptic 5′ss and mutations elsewhere activated 92 de novo 5′ss. Point mutations leading to cryptic 5′ss activation were most common in the first intron nucleotide, followed by the fifth nucleotide. Substitutions at position +5 were exclusively G>A transitions, which was largely attributable to high mutability rates of C/G>T/A. However, the frequency of point mutations at position +5 was significantly higher than that observed in the Human Gene Mutation Database, suggesting that alterations of this position are particularly prone to aberrant splicing, possibly due to a requirement for sequential interactions with U1 and U6 snRNAs. Cryptic 5′ss were best predicted by computational algorithms that accommodate nucleotide dependencies and not by weight-matrix models. Discrimination of intronic 5′ss from their authentic counterparts was less effective than for exonic sites, as the former were intrinsically stronger than the latter. Computational prediction of exonic de novo 5′ss was poor, suggesting that their activation critically depends on exonic splicing enhancers or silencers. The authentic counterparts of aberrant 5′ss were significantly weaker than the average human 5′ss. The development of an online database of aberrant 5′ss will be useful for studying basic mechanisms of splice-site selection, identifying splicing mutations and optimizing splice-site prediction algorithms.
Collapse
Affiliation(s)
- Emanuele Buratti
- International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy, University of Southampton School of Medicine, Division of Human Genetics, Southampton SO16 6YD, UK and Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Martin Chivers
- International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy, University of Southampton School of Medicine, Division of Human Genetics, Southampton SO16 6YD, UK and Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Jana Královičová
- International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy, University of Southampton School of Medicine, Division of Human Genetics, Southampton SO16 6YD, UK and Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Maurizio Romano
- International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy, University of Southampton School of Medicine, Division of Human Genetics, Southampton SO16 6YD, UK and Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Marco Baralle
- International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy, University of Southampton School of Medicine, Division of Human Genetics, Southampton SO16 6YD, UK and Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Adrian R. Krainer
- International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy, University of Southampton School of Medicine, Division of Human Genetics, Southampton SO16 6YD, UK and Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
| | - Igor Vořechovský
- International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy, University of Southampton School of Medicine, Division of Human Genetics, Southampton SO16 6YD, UK and Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA
- *To whom correspondence should be addressed. +44 2380 796425+44 2380 794264
| |
Collapse
|
114
|
Zheng T, Ichiba T, Morton BR. Assessing substitution variation across sites in grass chloroplast DNA. J Mol Evol 2007; 64:605-13. [PMID: 17541677 DOI: 10.1007/s00239-006-0076-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2006] [Accepted: 02/28/2007] [Indexed: 11/24/2022]
Abstract
We assess the similarity of base substitution processes, described by empirically derived 4 x 4 matrices, using chi-square homogeneity tests. Such significance analyses allow us to assess variation in sequence evolution across sites and we apply them to matrices derived from noncoding sites in different contexts in grass chloroplast DNA. We show that there is statistically significant variation in rates and patterns of mutation among noncoding sites in different contexts and then demonstrate a similar and significant influence of context on substitutions at fourfold degenerate sites of coding regions from grass chloroplast DNA. These results show that context has the same general effect on substitution bias in coding and noncoding DNA: the A+T content of flanking bases is correlated with rate of substitution, transition bias, and GC --> AT pressure, while the number of flanking pyrimidines on a single strand is correlated with a mutational bias, or skew, toward pyrimidines. Despite the similarity in general trends, however, when we compare coding and noncoding matrices we find that there is a statistically significant difference between them even when we control for context. Most noticeably, fourfold degenerate sites in coding sequences are undergoing substitution at a higher rate and there are also significant differences in the relationship between pyrimidines skew and the number of flanking pyrimidines. Possible reasons for the differences between coding and noncoding sites are discussed. Furthermore, our analysis illustrates a simple statistical way for comparing substitution processes across sites allowing us to better study variation in evolutionary processes across a genome.
Collapse
Affiliation(s)
- Tian Zheng
- Department of Statistics, Columbia University, New York, NY 10027, USA
| | | | | |
Collapse
|
115
|
Kryukov GV, Pennacchio LA, Sunyaev SR. Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. Am J Hum Genet 2007; 80:727-39. [PMID: 17357078 PMCID: PMC1852724 DOI: 10.1086/513473] [Citation(s) in RCA: 444] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2006] [Accepted: 01/30/2007] [Indexed: 12/31/2022] Open
Abstract
The accumulation of mildly deleterious missense mutations in individual human genomes has been proposed to be a genetic basis for complex diseases. The plausibility of this hypothesis depends on quantitative estimates of the prevalence of mildly deleterious de novo mutations and polymorphic variants in humans and on the intensity of selective pressure against them. We combined analysis of mutations causing human Mendelian diseases, of human-chimpanzee divergence, and of systematic data on human genetic variation and found that ~20% of new missense mutations in humans result in a loss of function, whereas ~27% are effectively neutral. Thus, the remaining 53% of new missense mutations have mildly deleterious effects. These mutations give rise to many low-frequency deleterious allelic variants in the human population, as is evident from a new data set of 37 genes sequenced in >1,500 individual human chromosomes. Surprisingly, up to 70% of low-frequency missense alleles are mildly deleterious and are associated with a heterozygous fitness loss in the range 0.001-0.003. Thus, the low allele frequency of an amino acid variant can, by itself, serve as a predictor of its functional significance. Several recent studies have reported a significant excess of rare missense variants in candidate genes or pathways in individuals with extreme values of quantitative phenotypes. These studies would be unlikely to yield results if most rare variants were neutral or if rare variants were not a significant contributor to the genetic component of phenotypic inheritance. Our results provide a justification for these types of candidate-gene (pathway) association studies and imply that mutation-selection balance may be a feasible evolutionary mechanism underlying some common diseases.
Collapse
Affiliation(s)
- Gregory V Kryukov
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02125, USA
| | | | | |
Collapse
|
116
|
Barbaric I, Wells S, Russ A, Dear TN. Spectrum of ENU-induced mutations in phenotype-driven and gene-driven screens in the mouse. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS 2007; 48:124-42. [PMID: 17295309 DOI: 10.1002/em.20286] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
N-ethyl-N-nitrosourea (ENU) mutagenesis in mice has become a standard tool for (i) increasing the pool of mutants in many areas of biology, (ii) identifying novel genes involved in physiological processes and disease, and (iii) in assisting in assigning functions to genes. ENU is assumed to cause random mutations throughout the mouse genome, but this presumption has never been analyzed. This is a crucial point, especially for large-scale mutagenesis, as a bias would reflect a constraint on identifying possible genetic targets. There is a significant body of published data now available from both phenotype-driven and gene-driven ENU mutagenesis screens in the mouse that can be used to reveal the effectiveness and limitations of an ENU mutagenesis approach. Analysis of the published data is presented in this paper. As expected for a randomly acting mutagen, ENU-induced mutations identified in phenotype-driven screens were in genes that had higher coding sequence length and higher exon number than the average for the mouse genome. Unexpectedly, the data showed that ENU-induced mutations were more likely to be found in genes that had a higher G + C content and neighboring base analysis revealed that the identified ENU mutations were more often directly flanked by G or C nucleotides. ENU mutations from phenotype-driven and gene-driven screens were dominantly A:T to T:A transversions or A:T to G:C transitions. Knowledge of the spectrum of mutations that ENU elicits will assist in positional cloning of ENU-induced mutations by allowing prioritization of candidate genes based on some of their inherent features.
Collapse
Affiliation(s)
- Ivana Barbaric
- Department of Biomedical Science, University of Sheffield, Sheffield, United Kingdom
| | | | | | | |
Collapse
|
117
|
Seo D, Jiang C, Zhao Z. A novel statistical method to estimate the effective SNP size in vertebrate genomes and categorized genomic regions. BMC Genomics 2006; 7:329. [PMID: 17196097 PMCID: PMC1769377 DOI: 10.1186/1471-2164-7-329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2006] [Accepted: 12/29/2006] [Indexed: 11/29/2022] Open
Abstract
Background The local environment of single nucleotide polymorphisms (SNPs) contains abundant genetic information for the study of mechanisms of mutation, genome evolution, and causes of diseases. Recent studies revealed that neighboring-nucleotide biases on SNPs were strong and the genome-wide bias patterns could be represented by a small subset of the total SNPs. It remains unsolved for the estimation of the effective SNP size, the number of SNPs that are sufficient to represent the bias patterns observed from the whole SNP data. Results To estimate the effective SNP size, we developed a novel statistical method, SNPKS, which considers both the statistical and biological significances. SNPKS consists of two major steps: to obtain an initial effective size by the Kolmogorov-Smirnov test (KS test) and to find an intermediate effective size by interval evaluation. The SNPKS algorithm was implemented in computer programs and applied to the real SNP data. The effective SNP size was estimated to be 38,200, 39,300, 38,000, and 38,700 in the human, chimpanzee, dog, and mouse genomes, respectively, and 39,100, 39,600, 39,200, and 42,200 in human intergenic, genic, intronic, and CpG island regions, respectively. Conclusion SNPKS is the first statistical method to estimate the effective SNP size. It runs efficiently and greatly outperforms the algorithm implemented in SNPNB. The application of SNPKS to the real SNP data revealed the similar small effective SNP size (38,000 – 42,200) in the human, chimpanzee, dog, and mouse genomes as well as in human genomic regions. The findings suggest strong influence of genetic factors across vertebrate genomes.
Collapse
|
118
|
Baser ME. The distribution of constitutional and somatic mutations in the neurofibromatosis 2 gene. Hum Mutat 2006; 27:297-306. [PMID: 16521120 DOI: 10.1002/humu.20317] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Constitutional heterozygous inactivating mutations in the neurofibromatosis 2 (NF2) tumor suppressor gene cause the autosomal dominant disease NF2, and biallelic inactivating somatic NF2 mutations are found in a high proportion of unilateral sporadic vestibular schwannoma (USVS) and sporadic meningioma. We surveyed the distributions of constitutional NF2 mutations in 823 NF2 families, 278 somatic NF2 mutations in USVS, and 208 somatic NF2 mutations in sporadic meningioma. Based on the available NF2 mutation data, the most dominant influence on the spectra of mutations in exons 1-15 are C>T transitions that change arginine codons (CGA) to stop codons (TGA) due to spontaneous deamination of methylcytosine to thymine in CpG dinucleotides. The paucity of reported mutations in exon 9 and the absence of reported mutations in exons 16 and 17 may be related to structure-function relationships in the NF2 protein.
Collapse
Affiliation(s)
- Michael E Baser
- Academic Unit of Medical Genetics, St. Mary's Hospital, Manchester, United Kingdom
| |
Collapse
|
119
|
Christodoulou J, Craig HJ, Walker DC, Weaving LS, Pearson CE, McInnes RR. Deletion hotspot in the argininosuccinate lyase gene: association with topoisomerase II and DNA polymerase alpha sites. Hum Mutat 2006; 27:1065-71. [PMID: 16941645 DOI: 10.1002/humu.20352] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Molecular analysis of argininosuccinate lyase (ASAL) deficiency has led to the identification of a deletion hotspot in the ASL gene. Six individuals with ASAL deficiency had alleles that led to a complete absence of exon 13 from the ASL mRNA; each had a partial deletion of exon 13 in the genomic DNA. In all six patients, the deletions begin 18 bp upstream of the 3' end of exon 13. In four cases, the deletions were 13 bp in length, and ended within exon 13, whereas in two other patients the deletions were 25 bp and extended into intron 13. The sequence at which these deletions begin overlaps both a putative topoisomerase II recognition site and a DNA polymerase alpha mutation/frameshift site. Moreover, the topoisomerase II cut site is situated precisely at the beginning of the deletions, which are flanked by small (2- and 3-bp) direct repeats. We note that a similar concurrence of these two putative enzyme sites can be found in a number of other deletion sites in the human genome, most notably the DeltaF508 deletion in the CFTR gene. These findings suggest that the joint presence of these two enzyme sites represents a DNA sequence context that may favor the occurrence of small deletions.
Collapse
Affiliation(s)
- John Christodoulou
- Program in Genetics and Genomic Biology, Research Institute, Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada.
| | | | | | | | | | | |
Collapse
|
120
|
Kleinjung T, Langguth B, Fischer B, Hajak G, Eichhammer P, Sand P. Systematic Screening of the Serotonin Receptor 1A (5-HT1A) Gene in Chronic Tinnitus. J Otol 2006. [DOI: 10.1016/s1672-2930(06)50018-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
|
121
|
Zhao H, Li Q, Li J, Zeng C, Hu S, Yu J. The study of neighboring nucleotide composition and transition/transversion bias. ACTA ACUST UNITED AC 2006; 49:395-402. [PMID: 16989286 DOI: 10.1007/s11427-006-2002-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Base substitution is one of the raw fuels that produce genetic variation and drive evolution. Recent studies have shown that the genome components affect mutation patterns to some extent. In order to infer the correlation between the Transition/Transversion ratio (Ts/Tv) and the number of immediately adjacent A and T nucleotides, we investigated 3611007 Oryza sativa SNPs (including 45462 coding SNPs, and 242811 intronic SNPs) and 32019 Arabidopsis SNPs. The results show that Ts/Tv is negatively correlated with the number of immediately adjacent A and T in O. sativa and Arabidopsis. We further calculated AT2 (the number of SNPs whose immediately adjacent nucleotides are either A or T) and AT0 (the number of SNPs whose immediately adjacent nucleotides are either C or G) for all 6 types of SNPs. C/G SNP of O. sativa and Arabidopsis has the highest AT2/AT0, which denotes C/G SNP may be influenced by the adjacent A and T nucleotides mostly. For SNPs in O. sativa, the neighboring effect of A and T nucleotides is limited to 2 nucleotides on both sides; for SNPs in Arabidopsis, the effect extends no more than 4 nucleotides on both sides.
Collapse
Affiliation(s)
- Hui Zhao
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100030, China
| | | | | | | | | | | |
Collapse
|
122
|
Subramanian S, Kumar S. Higher intensity of purifying selection on >90% of the human genes revealed by the intrinsic replacement mutation rates. Mol Biol Evol 2006; 23:2283-7. [PMID: 16982819 PMCID: PMC3072915 DOI: 10.1093/molbev/msl123] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
For over 3 decades, the rate of replacement mutations has been assumed to be equal to, and estimated from, the rate of "strictly" neutral sequence divergence in noncoding regions and in silent-codon positions where mutations do not alter the amino acid encoded. This assumption is fundamental to estimating the fraction of harmful protein mutations and to identifying adaptive evolution at individual codons and proteins. We show that the assumption is not justifiable because a much larger fraction of codon positions is involved in hypermutable CpG dinucleotides as compared with the introns, leading to a higher expected replacement mutation rate per site in a vast majority of the genes. Consideration of this difference reveals a higher intensity of purifying natural selection than previously inferred in human genes. We also show that a much smaller number of genes are expected to be evolving with positive selection than that predicted using sequence divergence at intron and silent positions in the human genome. These patterns indicate the need for using new approaches for estimating rates of amino acid-altering mutations in order to find positively selected genes and codons in genomes that contain hypermutable CpG's.
Collapse
Affiliation(s)
- Sankar Subramanian
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University
- School of Life Sciences, Arizona State University
| | - Sudhir Kumar
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University
- School of Life Sciences, Arizona State University
| |
Collapse
|
123
|
Fernandes-Rosa FL, de Castro M, Latronico AC, Sippell WG, Riepe FG, Antonini SR. Recurrence of the R947X mutation in unrelated families with autosomal dominant pseudohypoaldosteronism type 1: evidence for a mutational hot spot in the mineralocorticoid receptor gene. J Clin Endocrinol Metab 2006; 91:3671-5. [PMID: 16757525 DOI: 10.1210/jc.2006-0605] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
Abstract
BACKGROUND The renal form of pseudohypoaldosteronism type 1 (PHA1) is a rare disease characterized by congenital mineralocorticoid resistance of the kidney. Twenty-two different loss-of-function mutations in the mineralocorticoid receptor gene have been described in families with PHA1. These mutations were not recurrent and resulted in a large phenotypic variability. OBJECTIVE The objective of this study is to analyze the recurrence of an inactivating mutation in the mineralocorticoid receptor gene in unrelated families with autosomal dominant PHA1. PATIENTS Seventeen members from three unrelated families with autosomal dominant PHA1 were studied, including 11 affected patients with variable clinical manifestations. Fifty healthy subjects were used as controls. METHODS Genomic DNA was extracted, and the entire coding region of the mineralocorticoid receptor gene was submitted to automatic sequencing. Four dinucleotide microsatellite markers spanning a region of 3.2 cM in the human mineralocorticoid receptor gene locus, and two intragenic polymorphisms were used for haplotype analysis. RESULTS A heterozygous point mutation at codon 947 (c.2839C>T) changing arginine to stop codon (R947X) was found in the three families. Different haplotypes segregated with the R947X mutation in each family, demonstrating the absence of a founder effect for this mutation. CONCLUSION Codon 947 of the mineralocorticoid receptor is the first mutational hot spot for autosomal dominant PHA1.
Collapse
Affiliation(s)
- Fabio L Fernandes-Rosa
- Department of Pediatrics, School of Medicine of Ribeirão Preto, Avenida Bandeirantes, 3900-Ribeirão Preto, 14049-900 Sao Paulo, Brazil
| | | | | | | | | | | |
Collapse
|
124
|
Qu HQ, Lawrence SG, Guo F, Majewski J, Polychronakos C. Strand bias in complementary single-nucleotide polymorphisms of transcribed human sequences: evidence for functional effects of synonymous polymorphisms. BMC Genomics 2006; 7:213. [PMID: 16916449 PMCID: PMC1559705 DOI: 10.1186/1471-2164-7-213] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2006] [Accepted: 08/17/2006] [Indexed: 11/25/2022] Open
Abstract
Background Complementary single-nucleotide polymorphisms (SNPs) may not be distributed equally between two DNA strands if the strands are functionally distinct, such as in transcribed genes. In introns, an excess of A↔G over the complementary C↔T substitutions had previously been found and attributed to transcription-coupled repair (TCR), demonstrating the valuable functional clues that can be obtained by studying such asymmetry. Here we studied asymmetry of human synonymous SNPs (sSNPs) in the fourfold degenerate (FFD) sites as compared to intronic SNPs (iSNPs). Results The identities of the ancestral bases and the direction of mutations were inferred from human-chimpanzee genomic alignment. After correction for background nucleotide composition, excess of A→G over the complementary T→C polymorphisms, which was observed previously and can be explained by TCR, was confirmed in FFD SNPs and iSNPs. However, when SNPs were separately examined according to whether they mapped to a CpG dinucleotide or not, an excess of C→T over G→A polymorphisms was found in non-CpG site FFD SNPs but was absent from iSNPs and CpG site FFD SNPs. Conclusion The genome-wide discrepancy of human FFD SNPs provides novel evidence for widespread selective pressure due to functional effects of sSNPs. The similar asymmetry pattern of FFD SNPs and iSNPs that map to a CpG can be explained by transcription-coupled mechanisms, including TCR and transcription-coupled mutation. Because of the hypermutability of CpG sites, more CpG site FFD SNPs are relatively younger and have confronted less selection effect than non-CpG FFD SNPs, which can explain the asymmetric discrepancy of CpG site FFD SNPs vs. non-CpG site FFD SNPs.
Collapse
Affiliation(s)
- Hui-Qi Qu
- Endocrine Genetics Laboratory, The McGill University Health Center (Montreal Children's Hospital), Montréal, Québec, Canada
| | - Steve G Lawrence
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| | - Fan Guo
- Endocrine Genetics Laboratory, The McGill University Health Center (Montreal Children's Hospital), Montréal, Québec, Canada
| | - Jacek Majewski
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| | - Constantin Polychronakos
- Endocrine Genetics Laboratory, The McGill University Health Center (Montreal Children's Hospital), Montréal, Québec, Canada
- Department of Pediatrics, The McGill University Health Center (Montreal Children's Hospital), 2300 Tupper, Montréal, Québec H3H 1P3, Canada
| |
Collapse
|
125
|
Jiang C, Zhao Z. Mutational spectrum in the recent human genome inferred by single nucleotide polymorphisms. Genomics 2006; 88:527-34. [PMID: 16860534 DOI: 10.1016/j.ygeno.2006.06.003] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2006] [Revised: 06/01/2006] [Accepted: 06/06/2006] [Indexed: 01/09/2023]
Abstract
So far, there is no genome-wide estimation of the mutational spectrum in humans. In this study, we systematically examined the directionality of the point mutations and maintenance of GC content in the human genome using approximately 1.8 million high-quality human single nucleotide polymorphisms and their ancestral sequences in chimpanzees. The frequency of C-->T (G-->A) changes was the highest among all mutation types and the frequency of each type of transition was approximately fourfold that of each type of transversion. In intergenic regions, when the GC content increased, the frequency of changes from G or C increased. In exons, the frequency of G:C-->A:T was the highest among the genomic categories and contributed mainly by the frequent mutations at the CpG sites. In contrast, mutations at the CpG sites, or CpG-->TpG/CpA mutations, occurred less frequently in the CpG islands relative to intergenic regions with similar GC content. Our results suggest that the GC content is overall not in equilibrium in the human genome, with a trend toward shifting the human genome to be AT rich and shifting the GC content of a region to approach the genome average. Our results, which differ from previous estimates based on limited loci or on the rodent lineage, provide the first representative and reliable mutational spectrum in the recent human genome and categorized genomic regions.
Collapse
Affiliation(s)
- Cizhong Jiang
- Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA 23298-0126, USA
| | | |
Collapse
|
126
|
Tomatsu S, Montaño AM, Nishioka T, Gutierrez MA, Peña OM, Tranda Firescu GG, Lopez P, Yamaguchi S, Noguchi A, Orii T. Mutation and polymorphism spectrum of the GALNS gene in mucopolysaccharidosis IVA (Morquio A). Hum Mutat 2006; 26:500-12. [PMID: 16287098 DOI: 10.1002/humu.20257] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Mucopolysaccharidosis IVA (MPS IVA; Morquio A disease) is an autosomal-recessive disorder caused by a deficiency of lysosomal N-acetylgalactosamine-6-sulfate sulfatase (GALNS; E.C.3.1.6.4). GALNS is required to degrade glycosaminoglycans, keratan sulfate (KS), and chondroitin-6-sulfate. Accumulation of undegraded substrates in lysosomes of the affected tissues leads to a systemic bone dysplasia. We summarize information on 148 unique mutations determined to date in the GALNS gene, including 26 novel mutations (19 missense, four small deletions, one splice-site, and two insertions). This heterogeneity in GALNS gene mutations accounts for an extensive clinical variability within MPS IVA. Seven polymorphisms that cause an amino acid change, and nine silent variants in the coding region are also described. Of the analyzed mutant alleles, missense mutations accounted for 78.4%; small deletions, 9.2%; nonsense mutation, 5.0%; large deletion, 2.4%; and insertions, 1.6%. Transitional mutations at CpG dinucleotides accounted for 26.4% of all the described mutations. The importance of the relationship between methylation status and distribution of transitional mutations at CpG sites at the GALNS gene locus was elucidated. The three most frequent mutations (over 5% of all mutations) were represented by missense mutations (p.R386C, p.G301C, and p.I113F). A genotype/phenotype correlation was defined in some mutations. Missense mutations associated with a certain phenotype were studied for their effects on enzyme activity and stability, the levels of blood and urine KS, the location of mutations with regard to the tertiary structure, and the loci of the altered amino acid residues among sulfatase proteins.
Collapse
Affiliation(s)
- Shunji Tomatsu
- Department of Pediatrics, Pediatric Research Institute, Saint Louis University, St. Louis, Missouri 63110-2586, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
127
|
Abstract
5-Methylcytosine in DNA is genetically unstable. Methylated CpG (mCpG) sequences frequently undergo mutation resulting in a general depletion of this dinucleotide sequence in mammalian genomes. In human genetic disease- and cancer-relevant genes, mCpG sequences are mutational hotspots. It is an almost universally accepted dogma that these mutations are caused by random deamination of 5-methylcytosines. However, it is plausible that mCpG transitions are not caused simply by spontaneous deamination of 5-methylcytosine in double-stranded DNA but by other processes including, for example, mCpG-specific base modification by endogenous or exogenous mutagens or, alternatively, by secondary factors operating at mCpG sequences and promoting deamination. We also discuss that mCpG sequences are favored targets for specific exogenous mutagens and carcinogens. When adjacent to another pyrimidine, 5-methylcytosine preferentially undergoes sunlight-induced pyrimidine dimer formation. Certain polycyclic aromatic hydrocarbons form guanine adducts and induce G to T transversion mutations with high selectivity at mCpG sequences.
Collapse
Affiliation(s)
- G P Pfeifer
- Division of Biology, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA.
| |
Collapse
|
128
|
Zhao H, Li QZ, Zeng CQ, Yang HM, Yu J. Neighboring-nucleotide effects on the mutation patterns of the rice genome. GENOMICS PROTEOMICS & BIOINFORMATICS 2006; 3:158-68. [PMID: 16487081 PMCID: PMC5172528 DOI: 10.1016/s1672-0229(05)03021-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
DNA composition dynamics across genomes of diverse taxonomy is a major subject of genome analyses. DNA composition changes are characteristics of both replication and repair machineries. We investigated 3,611,007 single nucleotide polymorphisms (SNPs) generated by comparing two sequenced rice genomes from distant inbred lines (subspecies), including those from 242,811 introns and 45,462 protein-coding sequences (CDSs). Neighboring-nucleotide effects (NNEs) of these SNPs are diverse, depending on structural content-based classifications (genome-wide, intronic, and CDS) and sequence context-based categories (A/C, A/G, A/T, C/G, C/T, and G/T substitutions) of the analyzed SNPs. Strong and evident NNEs and nucleotide proportion biases surrounding the analyzed SNPs were observed in 1−3 bp sequences on both sides of an SNP. Strong biases were observed around neighboring nucleotides of protein-coding SNPs, which exhibit a periodicity of three in nucleotide content, constrained by a combined effect of codon-related rules and DNA repair mechanisms. Unlike a previous finding in the human genome, we found negative correlation between GC contents of chromosomes and the magnitude of corresponding bias of nucleotide C at −1 site and G at +1 site. These results will further our understanding of the mutation mechanism in rice as well as its evolutionary implications.
Collapse
Affiliation(s)
- Hui Zhao
- Beijing Genomics Institute, Chinese Academy of Sciences (CAS), Beijing 101300, China
- Graduate School, CAS, Beijing 100039, China
| | - Qi-Zhai Li
- Beijing Genomics Institute, Chinese Academy of Sciences (CAS), Beijing 101300, China
- Academy of Mathematics and Systems Science, CAS, Beijing 100080, China
- Graduate School, CAS, Beijing 100039, China
| | - Chang-Qing Zeng
- Beijing Genomics Institute, Chinese Academy of Sciences (CAS), Beijing 101300, China
| | - Huan-Ming Yang
- Beijing Genomics Institute, Chinese Academy of Sciences (CAS), Beijing 101300, China
- Corresponding authors.
| | - Jun Yu
- Beijing Genomics Institute, Chinese Academy of Sciences (CAS), Beijing 101300, China
- Corresponding authors.
| |
Collapse
|
129
|
Chen JM, Chuzhanova N, Stenson PD, Férec C, Cooper DN. Meta-analysis of gross insertions causing human genetic disease: novel mutational mechanisms and the role of replication slippage. Hum Mutat 2006; 25:207-21. [PMID: 15643617 DOI: 10.1002/humu.20133] [Citation(s) in RCA: 116] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Although gross insertions (>20 bp) comprise <1% of disease-causing mutations, they nevertheless represent an important category of pathological lesion. In an attempt to study these insertions in a systematic way, 158 gross insertions ranging in size between 21 bp and approximately 10 kb were identified using the Human Gene Mutation Database (www.hgmd.org). A careful meta-analytical study revealed extensive diversity in terms of the nature of the inserted DNA sequence and has provided new insights into the underlying mutational mechanisms. Some 70% of gross insertions were found to represent sequence duplications of different types (tandem, partial tandem, or complex). Although most of the tandem duplications were explicable by simple replication slippage, the three complex duplications appear to result from multiple slippage events. Some 11% of gross insertions were attributable to nonpolyglutamine repeat expansions (including octapeptide repeat expansions in the prion protein gene [PRNP] and polyalanine tract expansions) and evidence is presented to support the contention that these mutations are also caused by replication slippage rather than by unequal crossing over. Some 17% of gross insertions, all >or=276 bp in length, were found to be due to LINE-1 (L1) retrotransposition involving different types of element (L1 trans-driven Alu, L1 direct, and L1 trans-driven SVA). A second example of pathological mitochondrial-nuclear sequence transfer was identified in the USH1C gene but appears to arise via a novel mechanism, trans-replication slippage. Finally, evidence for another novel mechanism of human genetic disease, involving the possible capture of DNA oligonucleotides, is presented in the context of a 26-bp insertion into the ERCC6 gene.
Collapse
Affiliation(s)
- Jian-Min Chen
- INSERM (Institut National de la Santé et de la Recherche Médicale) U613-Génétique Moléculaire et Génétique Epidémiologique, Université de Bretagne Occidentale, Centre Hospitalier Universitaire, Brest, France.
| | | | | | | | | |
Collapse
|
130
|
Wong WKP, Morse JH, Knowles JA. Evolutionary conservation and mutational spectrum of BMPR2 gene. Gene 2006; 368:84-93. [PMID: 16361068 DOI: 10.1016/j.gene.2005.10.025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2005] [Revised: 09/27/2005] [Accepted: 10/14/2005] [Indexed: 10/25/2022]
Abstract
A variety of mutations in the bone morphogenetic protein receptor type 2 (BMPR2) have been identified in patients with pulmonary arterial hypertension. In this study, using our BMPR2 mutation database and BMPR-II protein sequences from eight distantly related species, we defined the relationship among evolutionary conservation, mutation frequency and mutation distribution. As a whole, BMPR2 is evolving slower than the average for mammalian protein-encoding genes. As expected, the kinase domain is evolving more slowly than the extracellular ligand-binding and C-terminal domains. A detailed map of evolutionary conservation shows that there are repeating peaks and valleys within the C-terminal domain, representing higher and lower evolutionary conservation. We observed a strong correlation between evolutionary conservation and the distribution of mutations along the gene. All except two, of the nineteen missense mutations occur in absolutely conserved amino acids among the vertebrate homologs. In addition, we identified six mutational hotspots (P<0.05) by comparing the observed distribution of mutations to the pattern expected from a random multinomial distribution. Furthermore, analysis of the sequence environment surrounding the mutations revealed a specific pattern of mutagenesis. Over 22% of all single base-paired substitutions and 30% of all deletions and insertions are situated within tandem or non-tandem direct repeats of at least 5-bp and may be explained by slipped-mispairing model of mutagenesis. Also, over 59% of single base-paired substitutions versus 20% of deletions and insertions are located in perfect palindromic sequences that could produce "hairpin-loop" secondary structures with relatively high thermodynamic stability under physiological conditions. In addition, 3.7% of single base-paired substitutions versus 30% of deletions and insertions are located either within or in close proximity to the Krawczak and Cooper consensus sequence (TG A/G A/G G/T A/C). Further study of the mechanism of mutagenesis in BMPR2 may help identify other potentially mutable sites and differentiate between deleterious mutations and harmless polymorphic variants.
Collapse
Affiliation(s)
- Wai K P Wong
- Department of Medicine, Columbia University/New York State Psychiatric Institute, 1051 Riverside Drive, Unit 28, Room 5917, New York, NY 10032, USA.
| | | | | |
Collapse
|
131
|
Abstract
Are there analogous sequence positions in families of related proteins where disease-linked mutations occur with unusually high frequency? We attempt to answer this question by examining sequence alignments for G-protein coupled receptors (GPCRs) and voltage-gated potassium channels that have a significant number of missense mutations linked to some form of human disease. When the disease-linked mutations are mapped onto the sequences for each family, there are a large number of aligned sites at which disease-linked mutations occur in more than one protein. The statistical significance of the aligned sites is judged by analysis of artificially-generated random datasets. There are a modest number of aligned sites that are statistically significant-we refer to these as "phenotologous" sequence positions. Phenotologous sites represent aligned positions at which mutations linked to disease phenotypes occur with high frequency within a family of proteins. The most interesting of these sites are those which are not conserved-such sites are apparently critical in defining structural or functional differences between related proteins. Phenotology may be used to make experimentally testable predictions regarding medical genetics, the molecular basis of disease, and protein structure-function relationships.
Collapse
Affiliation(s)
- Jeffrey K Myers
- Department of Biochemistry and Center for Structural Biology, Vanderbilt University Medical Center, Nashville, Tennessee 37232-8725, USA
| | | | | |
Collapse
|
132
|
Sinnett D, Beaulieu P, Bélanger H, Lefebvre JF, Langlois S, Théberge MC, Drouin S, Zotti C, Hudson TJ, Labuda D. Detection and characterization of DNA variants in the promoter regions of hundreds of human disease candidate genes. Genomics 2006; 87:704-10. [PMID: 16500075 DOI: 10.1016/j.ygeno.2006.01.001] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2005] [Revised: 12/21/2005] [Accepted: 01/02/2006] [Indexed: 11/20/2022]
Abstract
Understanding genetic variation might reveal the cause of individual susceptibility to a variety of complex diseases such as asthma, diabetes, and cancer. Current efforts to identify functional DNA variants have essentially been oriented toward single nucleotide polymorphisms (SNPs) found in coding regions of candidate genes since they have direct impact on the structure and function of the affected proteins. Abnormal expression of finely regulated genes could also lead to disequilibria in different metabolic pathways and/or biological processes. Thus investigation of SNPs in the promoter regions (pSNPs) of genes should improve our knowledge of the etiology of complex diseases. Unfortunately, little is known about the nature and the prevalence of pSNPs. We have analyzed 197 genes targeting the promoter region, arbitrarily defined as a 2-kb genomic segment upstream of the transcription initiation site, by screening by dHPLC for the presence of SNPs in a worldwide panel of 40 individuals. As a result 1838 pSNPs were detected, 75% of which modify (by either gain or loss) putative binding sites of known transcription factors. We also examined the distribution of these pSNPs among features such as conserved regions, repeats, and dinucleotides as well as Gene Ontology terms. This report supports the functional relevance of several of the pSNPs investigated and suggests a putative impact on disease susceptibility.
Collapse
Affiliation(s)
- Daniel Sinnett
- Division of Hematology-Oncology, Research Center, Sainte-Justine Hospital, 3175 Chemin de la Côte-Sainte-Catherine, Montreal, Canada QC H3T 1C5.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
133
|
Abstract
Cytosine methylation is a common form of post-replicative DNA modification seen in both bacteria and eukaryotes. Modified cytosines have long been known to act as hotspots for mutations due to the high rate of spontaneous deamination of this base to thymine, resulting in a G/T mismatch. This will be fixed as a C-->T transition after replication if not repaired by the base excision repair (BER) pathway or specific repair enzymes dedicated to this purpose. This hypermutability has led to depletion of the target dinucleotide CpG outside of special CpG islands in mammals, which are normally unmethylated. We review the importance of C-->T transitions at non-island CpGs in human disease: When these occur in the germline, they are a common cause of inherited diseases such as epidermolysis bullosa and mucopolysaccharidosis, while in the soma they are frequently found in the genes for tumor suppressors such as p53 and the retinoblastoma protein, causing cancer. We also examine the specific repair enzymes involved, namely the endonuclease Vsr in Escherichia coli and two members of the uracil DNA glycosylase (UDG) superfamily in mammals, TDG and MBD4. Repair brings its own problems, since it will require remethylation of the replacement cytosine, presumably coupling repair to methylation by either the maintenance methylase Dnmt1 or a de novo enzyme such as Dnmt3a. Uncoupling of methylation from repair may be one way to remove methylation from DNA. We also look at the possible role of specific cytosine deaminases such as Aid and Apobec in accelerating deamination of methylcytosine and consequent DNA demethylation.
Collapse
Affiliation(s)
- C P Walsh
- Centre for Molecular Biosciences, School of Biomedical Sciences, University of Ulster, Northern Ireland
| | | |
Collapse
|
134
|
Hahn Y, Lee B. Human-specific nonsense mutations identified by genome sequence comparisons. Hum Genet 2006; 119:169-78. [PMID: 16395595 DOI: 10.1007/s00439-005-0125-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2005] [Accepted: 12/10/2005] [Indexed: 11/29/2022]
Abstract
The comparative study of the human and chimpanzee genomes may shed light on the genetic ingredients for the evolution of the unique traits of humans. Here, we present a simple procedure to identify human-specific nonsense mutations that might have arisen since the human-chimpanzee divergence. The procedure involves collecting orthologous sequences in which a stop codon of the human sequence is aligned to a non-stop codon in the chimpanzee sequence and verifying that the latter is ancestral by finding homologs in other species without a stop codon. Using this procedure, we identify nine genes (CML2, FLJ14640, MT1L, NPPA, PDE3B, SERPINA13, TAP2, UIP1, and ZNF277) that would produce human-specific truncated proteins resulting in a loss or modification of the function. The premature terminations of CML2, MT1L, and SERPINA13 genes appear to abolish the original function of the encoded protein because the mutation removes a major part of the known active site in each case. The other six mutated genes are either known or presumed to produce functionally modified proteins. The mutations of five genes (CML2, FLJ14640, MT1L, NPPA, TAP2) are known or predicted to be polymorphic in humans. In these cases, the stop codon alleles are more prevalent than the ancestral allele, suggesting that the mutant alleles are approaching fixation since their emergence during the human evolution. The findings support the notion that functional modification or inactivation of genes by nonsense mutation is a part of the process of adaptive evolution and acquisition of species-specific features.
Collapse
Affiliation(s)
- Yoonsoo Hahn
- Laboratory of Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Building 37, MSC 4264, 37 Convent Drive Room 5120A, Bethesda, MD 20892-4264, USA
| | | |
Collapse
|
135
|
Bjornsson HT, Ellingsen LM, Jonsson JJ. Transposon-derived repeats in the human genome and 5-methylcytosine-associated mutations in adjacent genes. Gene 2006; 370:43-50. [PMID: 16446059 DOI: 10.1016/j.gene.2005.11.011] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2005] [Revised: 09/19/2005] [Accepted: 11/04/2005] [Indexed: 10/25/2022]
Abstract
Transposon-derived repeats (TDR) represent approximately 50% of the human genome. A transposon suppression system has been proposed to explain why transposon-derived repeats (TDR) seldom cause mutations in humans. If this system is based on DNA methylation, a correlation might exist between amount of TDR adjacent to genes and frequency of coding sequence mutations due to m5C deaminations. To test this hypothesis we selected 385 genes based on availability of accurate information on their genome structure and mutation patterns (at least 10 mutations described in the Human Gene Mutation Database (HGMD)). The CENSOR program was used to estimate amount and class of TDR for the gene region and an arbitrarily selected 1 KB from each end. We assumed all C --> T transitions to be possible 5-methylcytosine-associated mutations (MAM) and calculated the number and proportion of MAM in the 385 genes. If there is a strong correlation between methylation of certain CpX dinuclecotides and TDR we might be able to detect it despite limitations of available data for this analysis. We found statistically significant correlations between: i) TDR and number of MAM in genes (r = 0.118, p = 0.02), ii) SINE-TDR and proportion of CpG --> TpG (r = 0.11, p = 0.03); limited to MIR elements only (r = 0.14, p = 0.006), and iii) LINE-TDR and proportion of CpT --> TpT (r = 0.166, p = 0.04). The group of genes with no TDR had a statistically significant lower proportion of MAM (184/479, 0.38 vs. 6466/14524, 0.46; p = 0.009) with differences noted for CpA --> TpA (35/479, 0.073 vs. 1380/11474; p = 0.003). In addition, CpT --> TpT were least common in genes with no TDR (8/479, 0.017), intermediate in genes with TDR in genomic sequence but not mRNA (337/11474, 0.029) and most common in genes with TDR within mature mRNA (121/3050, 0.040; p for trend = 0.003). Our data suggest that TDR adjacent to genes may sometimes influence methylation of cytosines in coding sequences to a degree that it affects mutation patterns. These observations should be followed up with further database analysis and biochemical studies.
Collapse
Affiliation(s)
- Hans T Bjornsson
- Department of Biochemistry and Molecular Biology, Faculty of Medicine, University of Iceland, Iceland
| | | | | |
Collapse
|
136
|
Schwartz IVD, Lima LC, Tylee K, Sobrinho RPO, Norato DYJ, Duarte AR, Besley G, Burin MG, Matte U, Giugliani R, Leistner-Segal S. Further cases of “neighbor” mutations in mucopolysaccharidosis type II. Am J Med Genet A 2006; 140:1684-6. [PMID: 16770800 DOI: 10.1002/ajmg.a.31317] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Ida V D Schwartz
- Department of Genetics and Postgraduation Program in Genetics and Molecular Biology, Federal University of Rio Grande do Sul, Porto Alegre, Rio Grande do Sul, Brazil.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
137
|
Zhao Z, Zhang F. Sequence context analysis in the mouse genome: Single nucleotide polymorphisms and CpG island sequences. Genomics 2006; 87:68-74. [PMID: 16316740 DOI: 10.1016/j.ygeno.2005.09.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2005] [Revised: 09/16/2005] [Accepted: 09/20/2005] [Indexed: 11/29/2022]
Abstract
A genome-wide view of sequence mutability in mice is still limited, although biologists usually assume the same scenario for mice as for humans. In this study, we examined the sequence context in the local environment of 482,528 mouse single nucleotide polymorphisms (SNPs). We found that CpG-containing short sequences, in general, had more representation in the local sequences of SNPs compared to the genome sequences. The extent of this overrepresentation was stronger in mice than in humans, which is inconsistent with previous observations of the weaker neighboring-nucleotide biases on mouse SNPs. To exclude the CpG effect, we compared the distribution patterns of short sequences among the six categories of SNPs. The results revealed an even stronger pattern in the CpG-containing group for C/G substitution compared to for A/G or C/T substitutions. We next performed the first genome-wide sequence context analysis of SNPs in the mouse CpG islands. SNPs occurring at CpG sites were 3.14-fold less prevalent than expected, suggesting the suppression of methylation-dependent deamination in the CpG islands. The extent of this suppression was less in mice than in humans. Finally, compared with humans, the observations of a greater deficit of CpG dinucleotides, a stronger overrepresentation of CpG-containing n-mers surrounding the polymorphic sites, and a higher SNP/genome ratio of CpG dinucleotides in the mouse genome support the "loss of CpG islands" model in the mouse lineage.
Collapse
Affiliation(s)
- Zhongming Zhao
- Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA 23298, USA.
| | | |
Collapse
|
138
|
Zhao Z, Zhang F. Sequence context analysis of 8.2 million single nucleotide polymorphisms in the human genome. Gene 2005; 366:316-24. [PMID: 16314054 DOI: 10.1016/j.gene.2005.08.024] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2005] [Revised: 08/29/2005] [Accepted: 08/30/2005] [Indexed: 12/29/2022]
Abstract
We analyzed n-mers (n=3-8) in the local environment of 8,249,446 human SNPs and compared their distribution with that in the genome reference sequences. The results revealed that the short sequences, which contained at least one CpG dinucleotide, occurred more frequently in the local SNP sequences than in the genome sequences. To exclude the hypermutability effect of the methylated CpG dinucleotides on the sequence context of SNPs, we examined the distribution patterns for each of the six categories of substitution. We observed the similar pattern (i.e., CpG-containing n-mers vs. non-CpG-containing n-mers) in SNP categories A/G, C/T and C/G but the opposite pattern in category A/T. We next identified 34,928 putative CpG islands in the human genome and located 133,591 SNPs within these islands. In the CpG islands, CpG SNPs were 3.92-fold less prevalent relative to the presence of CpG dinucleotides. Conversely, in the human genome, the frequency of CpG dinucleotides at the polymorphic sites was 6.09 times that in the genome reference sequences. These results support the previous views of mutational suppression at the CpG sites in the CpG islands and hypermutability of the methylated CpG dinucleotides that are prevalent in the non-CpG island sequences in the human genome. Our study represents a comprehensive investigation of the sequence context of SNPs in the human genome and in human CpG islands.
Collapse
Affiliation(s)
- Zhongming Zhao
- Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA 23298, USA.
| | | |
Collapse
|
139
|
Abstract
The identification of mutations leading to human genetic diseases has grown into an intensive research field during the last few years. Through novel DNA analysis progress, it is now possible to determine the mutational spectrum for a given genetic disease and international databases are now available online. Genetic diagnosis of hereditary diseases has become an essential tool in genetic counselling and prenatal diagnosis. The knowledge of the deleterious mutation type and the molecular associated mechanism is fundamental in order to devise the optimal molecular diagnosis strategy. This review aims to present the various mutation categories involved in genetic diseases (single base-pair substitutions, small deletions or insertions, dynamic mutations, gross DNA lesions...) and to summarize our current knowledge about the main molecular mechanisms responsible for these mutations. Their deleterious consequences on gene expression, including transcription and transcript maturation, and protein loss or gain of function are also discussed in this review.
Collapse
Affiliation(s)
- Nadine Hanna
- Laboratoire de Génétique Moléculaire EA 3618, Université René Descartes Paris 5, Faculté des Sciences Pharmaceutiques et Biologiques, 4, avenue de l'Observatoire, 75006 Paris, France
| | | | | | | |
Collapse
|
140
|
Valverde JR, Alonso J, Palacios I, Pestaña Á. RB1 gene mutation up-date, a meta-analysis based on 932 reported mutations available in a searchable database. BMC Genet 2005; 6:53. [PMID: 16269091 PMCID: PMC1298292 DOI: 10.1186/1471-2156-6-53] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2005] [Accepted: 11/04/2005] [Indexed: 11/16/2022] Open
Abstract
Background Retinoblastoma, a prototype of hereditary cancer, is the most common intraocular tumour in children and potential cause of blindness from therapeutic eye ablation, second tumours in germ line carrier's survivors, and even death when left untreated. The molecular scanning of RB1 in search of germ line mutations lead to the publication of more than 900 mutations whose knowledge is important for genetic counselling and the characterization of phenotypic-genotypic relationships. Results A searchable database (RBGMdb) has been constructed with 932 published RB1 mutations. The spectrum of these mutations has been analyzed with the following results: 1) the retinoblastoma protein is frequently inactivated by deletions and nonsense mutations while missense mutations are the main inactivating event in most genetic diseases. 2) Near 40% of RB1 gene mutations are recurrent and gather in sixteen hot points, including twelve nonsense, two missense and three splicing mutations. The remainder mutations are scattered along RB1, being most frequent in exons 9, 10, 14, 17, 18, 20, and 23. 3) The analysis of RB1 mutations by country of origin of the patients identifies two groups in which the incidence of nonsense and splicing mutations show differences extremely significant, and suggest the involvement of predisposing ethnic backgrounds. 4) A significant association between late age at diagnosis and splicing mutations in bilateral retinoblastoma patients suggests the occurrence of a delayed-onset genotype. 5) Most of the reported mutations in low-penetrance families fall in three groups: a) Mutations in regulatory sequences at the promoter resulting in low expression of a normal Rb; b) Missense and in-frame deletions affecting non-essential sequence motifs which result in a partial inactivation of Rb functions; c) Splicing mutations leading to the reduction of normal mRNA splicing or to alternative splicing involving either true oncogenic or defective (weak) alleles. Conclusion The analysis of RB1 gene mutations logged in the RBGMdb has shown relevant phenotype-genotype relationships and provided working hypothesis to ascertain mechanisms linking certain mutations to ethnicity, delayed onset of the disease and low-penetrance. Gene profiling of tumors will help to clarify the genetic background linked to ethnicity and variable expressivity or delayed onset phenotypes.
Collapse
Affiliation(s)
- José R Valverde
- Servicio de Informática. Centro Nacional de Biotecnología, CSIC. Campus de Cantoblanco. 28049-Madrid, Spain
| | - Javier Alonso
- Oncolab. Deparatamento de Biología Molecular y Celular del Cáncer. Instituto de Investigaciones Biomédicas "A. Sols", CSIC-UAM. 28029 Madrid, Spain
| | - Itziar Palacios
- Oncolab. Deparatamento de Biología Molecular y Celular del Cáncer. Instituto de Investigaciones Biomédicas "A. Sols", CSIC-UAM. 28029 Madrid, Spain
| | - Ángel Pestaña
- Oncolab. Deparatamento de Biología Molecular y Celular del Cáncer. Instituto de Investigaciones Biomédicas "A. Sols", CSIC-UAM. 28029 Madrid, Spain
| |
Collapse
|
141
|
Rudd MF, Williams RD, Webb EL, Schmidt S, Sellick GS, Houlston RS. The Predicted Impact of Coding Single Nucleotide Polymorphisms Database. Cancer Epidemiol Biomarkers Prev 2005; 14:2598-604. [PMID: 16284384 DOI: 10.1158/1055-9965.epi-05-0469] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Nonsynonymous single nucleotide polymorphisms (nsSNP) have the potential to affect the structure or function of expressed proteins and are, therefore, likely to represent modifiers of inherited susceptibility. We have classified and catalogued the predicted functionality of nsSNPs in genes relevant to the biology of cancer to facilitate sequence-based association studies. Candidate genes were identified using targeted search terms and pathways to interrogate the Gene Ontology Consortium database, Kyoto Encyclopedia of Genes and Genomes database, Iobion's Interaction Explorer PathwayAssist Program, National Center for Biotechnology Information Entrez Gene database, and CancerGene database. A total of 9,537 validated nsSNPs located within annotated genes were retrieved from National Center for Biotechnology Information dbSNP Build 123. Filtering this list and linking it to 7,080 candidate genes yielded 3,666 validated nsSNPs with minor allele frequencies > or =0.01 in Caucasian populations. The functional effect of nsSNPs in genes with a single mRNA transcript was predicted using three computational tools-Grantham matrix, Polymorphism Phenotyping, and Sorting Intolerant from Tolerant algorithms. The resultant pool of 3,009 fully annotated nsSNPs is accessible from the Predicted Impact of Coding SNPs database at http://www.icr.ac.uk/cancgen/molgen/MolPopGen_PICS_database.htm. Predicted Impact of Coding SNPs is an ongoing project that will continue to curate and release data on the putative functionality of coding SNPs.
Collapse
Affiliation(s)
- Matthew F Rudd
- Section of Cancer Genetics, Institute of Cancer Research, Sutton, Surrey SM2 5NG, United Kingdom
| | | | | | | | | | | |
Collapse
|
142
|
Bjornsson HT, Cui H, Gius D, Fallin MD, Feinberg AP. The new field of epigenomics: implications for cancer and other common disease research. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2005; 69:447-56. [PMID: 16117680 PMCID: PMC5434869 DOI: 10.1101/sqb.2004.69.447] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Affiliation(s)
- H T Bjornsson
- Predoctoral Program in Human Genetics and Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | | | | | | | | |
Collapse
|
143
|
Morton BR, Bi IV, McMullen MD, Gaut BS. Variation in mutation dynamics across the maize genome as a function of regional and flanking base composition. Genetics 2005; 172:569-77. [PMID: 16219784 PMCID: PMC1456184 DOI: 10.1534/genetics.105.049916] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We examine variation in mutation dynamics across a single genome (Zea mays ssp. mays) in relation to regional and flanking base composition using a data set of 10,472 SNPs generated by resequencing 1776 transcribed regions. We report several relationships between flanking base composition and mutation pattern. The A + T content of the two sites immediately flanking the mutation site is correlated with rate, transition bias, and GC --> AT pressure. We also observe a significant CpG effect, or increase in transition rate at CpG sites. At the regional level we find that the strength of the CpG effect is correlated with regional A + T content, ranging from a 1.7-fold increase in transition rate in relatively G + C-rich regions to a 2.6-fold increase in A + T-rich regions. We also observe a relationship between locus A + T content and GC --> AT pressure. This regional effect is in opposition to the influence of the two immediate neighbors in that GC --> AT pressure increases with increasing locus A + T content but decreases with increasing flanking base A + T content and may represent a relationship between genome location and mutation bias. The data indicate multiple context effects on mutations, resulting in significant variation in mutation dynamics across the genome.
Collapse
Affiliation(s)
- Brian R Morton
- Department of Biological Sciences, Barnard College, Columbia University, New York, New York 10027, USA.
| | | | | | | |
Collapse
|
144
|
Alharbi KK, Aldahmesh MA, Spanakis E, Haddad L, Whittall RA, Chen XH, Rassoulian H, Smith MJ, Sillibourne J, Ball NJ, Graham NJ, Briggs PJ, Simpson IA, Phillips DIW, Lawlor DA, Ye S, Humphries SE, Cooper C, Smith GD, Ebrahim S, Eccles DM, Day INM. Mutation scanning by meltMADGE: validations using BRCA1 and LDLR, and demonstration of the potential to identify severe, moderate, silent, rare, and paucimorphic mutations in the general population. Genome Res 2005; 15:967-77. [PMID: 15998910 PMCID: PMC1172041 DOI: 10.1101/gr.3313405] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
We have developed a mutation-scanning approach suitable for whole population screening for unknown mutations. The method, meltMADGE, combines thermal ramp electrophoresis with MADGE to achieve suitable cost efficiency and throughput. The sensitivity was tested in blind trials using 54 amplicons representing the BRCA1 coding region and a panel of 94 unrelated family breast cancer risk consultands previously screened in a clinical diagnostic laboratory. All 10 common polymorphisms, 15/15 previously identified disease-causing mutations, and three previously untested single base changes were identified. Assays of LDLR exons 3 and 8 were validated in 460 familial hypercholesteremics and detected 8/9 known variants. We then applied the exon 3 assay in several DNA banks representing approximately 8000 subjects with known cholesterol values and applied both assays in one DNA bank (n = 3600). In exon 3 we identified one previously reported moderate mutation, P84S (n = 1), also associated with moderate hypercholesteremia in this subject; an unreported silent variant, N76N (n = 1); and known severe hypercholesteremia splice mutation 313+1G-->A (n = 2). Around exon 8 we identified a paucimorphism (n = 35) at the splice site 1061-8T-->C (known to be in complete linkage disequilibrium with T705I) and unreported sequence variants 1186+11G-->A (n = 1) and D335N G-->A (n = 1). The cholesterol value for D335N was on the 96.2 percentile and for T705I, 2/35 carriers were above the 99th percentile. Thus, variants with predicted severe, moderate, and no effect were identified at the population level. In contrast with case collections, CpG mutations predominated. MeltMADGE will enable definition of the full population spectrum of rare, paucimorphic, severe, moderate (forme fruste), and silent mutations and effects.
Collapse
Affiliation(s)
- Khalid K Alharbi
- Human Genetics Division, School of Medicine, University of Southampton, Southampton University Hospitals NHS Trust, Southampton SO16 6YD, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
145
|
Kaminsky ZA, Popendikyte V, Assadzadeh A, Petronis A. Search for somatic DNA variation in the brain: investigation of the serotonin 2A receptor gene. Mamm Genome 2005; 16:587-93. [PMID: 16180140 DOI: 10.1007/s00335-005-0040-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2005] [Accepted: 05/05/2005] [Indexed: 01/05/2023]
Abstract
Somatic DNA variation represents one of the most interesting but also one of the least investigated genetic phenomena. In addition to the classical case of DNA hypermutability at the V(D)J region, there is an increasing body of experimental evidence suggesting that genes other than immunoglobulin in tissues other than lymphocytes also exhibit nonuniformity of DNA sequence, which opens new opportunities for explaining various features of multicellular organisms. Identification of somatic DNA mutability, however, is not a trivial task and numerous confounding factors have to be taken into account. In this work we investigated putative DNA variation in the serotonin 2A receptor gene (HTR2A). A series of real-time PCR-based experiments was performed on DNA samples (n = 8) from human brain and peripheral leukocytes. Amplification of the target DNA sequences was carefully matched to that of the control plasmid containing the insert of HTR2A. Sequencing of nearly 500 clones containing a total of 150,000 nucleotides did not show any evidence for somatic DNA variation in the brain and peripheral leukocytes. It is argued in this article that although intraindividual DNA mutability may be a more common phenomenon than is generally accepted, some of the earlier claims of genetic nonidentity on the brain cells may be premature.
Collapse
Affiliation(s)
- Zachary A Kaminsky
- The Krembil Family Epigenetics Laboratory, Centre for Addiction and Mental Health, 250 College Street, Toronto, Ontario, M5T 1R8, Canada
| | | | | | | |
Collapse
|
146
|
Nekrutenko A, Wadhawan S, Goetting-Minesky P, Makova KD. Oscillating evolution of a mammalian locus with overlapping reading frames: an XLalphas/ALEX relay. PLoS Genet 2005; 1:e18. [PMID: 16110341 PMCID: PMC1186735 DOI: 10.1371/journal.pgen.0010018] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2005] [Accepted: 06/23/2005] [Indexed: 11/29/2022] Open
Abstract
XLαs and ALEX are structurally unrelated mammalian proteins translated from alternative overlapping reading frames of a single transcript. Not only are they encoded by the same locus, but a specific XLαs/ALEX interaction is essential for G-protein signaling in neuroendocrine cells. A disruption of this interaction leads to abnormal human phenotypes, including mental retardation and growth deficiency. The region of overlap between the two reading frames evolves at a remarkable speed: the divergence between human and mouse ALEX polypeptides makes them virtually unalignable. To trace the evolution of this puzzling locus, we sequenced it in apes, Old World monkeys, and a New World monkey. We show that the overlap between the two reading frames and the physical interaction between the two proteins force the locus to evolve in an unprecedented way. Namely, to maintain two overlapping protein-coding regions the locus is forced to have high GC content, which significantly elevates its intrinsic evolutionary rate. However, the two encoded proteins cannot afford to change too quickly relative to each other as this may impair their interaction and lead to severe physiological consequences. As a result XLαs and ALEX evolve in an oscillating fashion constantly balancing the rates of amino acid replacements. This is the first example of a rapidly evolving locus encoding interacting proteins via overlapping reading frames, with a possible link to the origin of species-specific neurological differences. One of the possible ways to achieve tight co-expression of two proteins is to encode them within a single mRNA. The GNAS1 gene in mammals does just that: it encodes two interacting signaling polypeptides within a single transcript using nested reading frames shifted one nucleotide relative to each other. The exceptionally high GC content of the region where the two reading frames overlap diminishes the probability of encountering stop codons but makes the locus highly mutable. To preserve their ability to interact functionally with each other despite the high mutation rate, the two polypeptides appear to evolve in an oscillating fashion, trying to maintain approximately equal rates of amino acid substitutions. This unexpected observation provides new insights into the evolution of mostly overlooked overlapping coding regions in eukaryotic genomes.
Collapse
Affiliation(s)
- Anton Nekrutenko
- Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania, USA.
| | | | | | | |
Collapse
|
147
|
Abstract
The comparative analysis of protein sequences depends crucially on measures of amino acid similarity or distance. Many such measures exist, yet it is not known how well these measures reflect the operational exchangeability of amino acids in proteins, since most are derived by methods that confound a variety of effects, including effects of mutation. In pursuit of a pure measure of exchangeability, we present (1) a compilation of data on the effects of 9671 amino acid exchanges engineered and assayed in a set of 12 proteins; (2) a statistical procedure to combine results from diverse assays of exchange effects; (3) a matrix of "experimental exchangeability" values EX(ij) derived from applying this procedure to the compiled data; and (4) a set of three tests designed to evaluate the power of an exchangeability measure to (i) predict the effects of amino acid exchanges in the laboratory, (ii) account for the disease-causing potential of missense mutations in the human population, and (iii) model the probability of fixation of missense mutations in evolution. EX not only captures useful information on exchangeability while remaining free of other effects, but also outperforms all measures tested except for the best-performing alignment scoring matrix, which is comparable in performance.
Collapse
Affiliation(s)
- Lev Y Yampolsky
- Department of Biological Sciences, East Tennessee State University, Johnson City, Tennessee 37614-1710, USA
| | | |
Collapse
|
148
|
Zhang F, Zhao Z. The influence of neighboring-nucleotide composition on single nucleotide polymorphisms (SNPs) in the mouse genome and its comparison with human SNPs. Genomics 2005; 84:785-95. [PMID: 15475257 DOI: 10.1016/j.ygeno.2004.06.015] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2004] [Accepted: 06/28/2004] [Indexed: 11/23/2022]
Abstract
We analyzed the neighboring-nucleotide composition of 433,192 biallelic substitutions, representing the largest public collection of SNPs across the mouse genome. Large neighboring-nucleotide biases relative to the genome- or chromosome-specific average were observed at the immediate adjacent sites and small biases extended farther from the substitution site. For all substitutions, the biases for A, C, G, and T were 0.21, 2.63, 0.71, and -3.55%, respectively, on the immediate adjacent 5' site and -3.67, 0.75, 2.69, and 0.23%, respectively, on the immediate adjacent 3' side. Further examination of the six categories of substitution revealed that the neighboring-nucleotide patterns for transitions were strongly influenced by the hypermutability of dinucleotide CpG and the neighboring effects on transversions were complex. Probability of a transversion increased with increasing A + T content of the two immediate adjacent sites, which was similarly observed in the human and Arabidopsis genomes. Overall, the bias patterns for the neighboring nucleotides in the mouse and human genomes were essentially the same; however, the extent of the biases was notably less in mice. Our results provide the first comprehensive view of the neighboring-nucleotide effects in the mouse genome and are important for understanding the mutational mechanisms and sequence evolution in the mammalian genomes.
Collapse
Affiliation(s)
- Fengkai Zhang
- Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, PO Box 980126, Richmond, VA 23298-0126, USA
| | | |
Collapse
|
149
|
Heit JA, Petterson TM, Owen WG, Burke JP, DE Andrade M, Melton LJ. Thrombomodulin gene polymorphisms or haplotypes as potential risk factors for venous thromboembolism: a population-based case-control study. J Thromb Haemost 2005; 3:710-7. [PMID: 15842356 DOI: 10.1111/j.1538-7836.2005.01187.x] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Dysfunction of the protein C anticoagulant system is associated with venous thromboembolism (VTE) and thrombomodulin (TM) is a critical cofactor within the protein C system. The aim of this study was to test the hypotheses that polymorphisms or haplotypes within the TM gene are common risk factors for VTE. We screened the TM putative promoter, exon and 3'-untranslated region for sequence variations in a random sample (n = 266) of consecutive idiopathic, objectively confirmed non-Olmsted County VTE patients referred to the Mayo Clinic. We then genotyped a sample of Olmsted County, MN residents with a first lifetime, objectively confirmed VTE in the 25-year period, 1966-90 (n = 223), and a sample of Olmsted County residents without VTE (n = 237) for polymorphisms either discovered in the screening population or previously published, and tested for an association of VTE with TM genotype or haplotypes using unconditional logistic regression and generalized linear models, respectively. We also genotyped these Olmsted County cases and controls at 20 'null' genetic maker loci and tested for population admixture. Nine novel and three previously described mutations were identified in the screening population. Mutations within the TM promoter, EGF(1-5), serine/threonine-rich, transmembrane, and cytoplasm regions were absent or uncommon. TM845G-->A (Ala25Thr; lectin region), TM2136T-->C (Ala455Val; EGF(6) region), TM2470C deletion (3'-untranslated region), and 4363A-->G (3'-flanking region) were more common, but were not associated with VTE by genotype or haplotype. Null genetic marker allele frequencies did not differ significantly among cases and controls. We conclude that polymorphisms or haplotypes within the TM gene are not common risk factors for incident VTE.
Collapse
Affiliation(s)
- John A Heit
- Division of Cardiovascular Diseases, Mayo Clinic and Foundation, Rochester, MI, USA.
| | | | | | | | | | | |
Collapse
|
150
|
Saifi GM, Szigeti K, Wiszniewski W, Shy ME, Krajewski K, Hausmanowa-Petrusewicz I, Kochanski A, Reeser S, Mancias P, Butler I, Lupski JR. SIMPLEmutations in Charcot-Marie-Tooth disease and the potential role of its protein product in protein degradation. Hum Mutat 2005; 25:372-83. [PMID: 15776429 DOI: 10.1002/humu.20153] [Citation(s) in RCA: 75] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Charcot-Marie-Tooth (CMT) disease is a clinically and genetically heterogeneous group of inherited peripheral neuropathies characterized by progressive weakness and atrophy of distal limb muscles. Recently, SIMPLE/LITAF was shown to be responsible for an autosomal dominant demyelinating form of CMT linked to 16p (CMT1C). Although two transcripts encoding different proteins (SIMPLE and LITAF) have been reported from the same gene, we could not confirm the existence of LITAF. Here we show that the LITAF transcript appears to result from a DNA sequencing error. We screened the SIMPLE gene for mutations in a cohort of 192 patients with CMT or related neuropathies, each of whom tested negative for other known genetic causes of CMT. In 16 unrelated CMT families we identified nine different nucleotide variations in SIMPLE that were not detected in control chromosomes. SIMPLE mutations can occur de novo, associated with sporadic CMT1 and may convey both demyelinating and axonal forms. Bioinformatics analyses and other observations of SIMPLE suggest that 1) it could be a member of the RING finger motif-containing subfamily of E3 ubiquitin ligases that are associated with the ubiquitin-mediated proteasome processing pathway, 2) it could interact through its PPXY motifs with a WW domain containing protein, for instance with NEDD4, an E3 ubiquitin ligase, and 3) it could interact through the PSAP motif with TSG10, a protein associated with endosomal multivesicular protein sorting. Since both SIMPLE and Hrs are endosomal proteins and have both PPXY and P(S/T)AP motifs, we hypothesize that SIMPLE, like Hrs, is potentially a clathrin adaptor aiding in the retention of ubiquitinated proteins on to the endosomes. Thus the potential E3 ubiquitin ligase activity of SIMPLE, alteration in its interactions with NEDD4 or TSG101, or changes in its properties as a clathrin coat adaptor may underlie the pathogenesis of Charcot-Marie-Tooth disease.
Collapse
Affiliation(s)
- Gulam Mustafa Saifi
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|