1
|
Lebel M, Monnat RJ. Werner syndrome (WRN) gene variants and their association with altered function and age-associated diseases. Ageing Res Rev 2018; 41:82-97. [PMID: 29146545 DOI: 10.1016/j.arr.2017.11.003] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2017] [Revised: 11/07/2017] [Accepted: 11/09/2017] [Indexed: 01/14/2023]
Abstract
Werner syndrome (WS) is a heritable autosomal recessive human disorder characterized by the premature onset of several age-associated pathologies including cancer. The protein defective in WS patients, WRN, is encoded by a member of the human RECQ gene family that contains both a DNA exonuclease and a helicase domain. WRN has been shown to participate in several DNA metabolic pathways including DNA replication, recombination and repair, as well as telomere maintenance and transcription modulation. Here we review base pair-level genetic variation that has been documented in WRN, with an emphasis on non-synonymous coding single nucleotide polymorphisms (SNPs) and their associations with anthropomorphic features, longevity and disease risk. These associations have been challenging to identify, as many reported WRN SNP associations appear to be further conditioned upon ethnic, age, gender or other environmental co-variables. The WRN variant phenotypic associations identified to date are intriguing, and several are of clear clinical import. Consequently, it will be important to extend these initial associations and to identify the mechanisms and conditions under which specific WRN variants may compromise WRN function to drive cellular and organismal phenotypes as well as disease risk.
Collapse
Affiliation(s)
- Michel Lebel
- Centre de recherche du CHU de Québec, Pavillon CHUL Université Laval, Faculté de Médecine, Québec City, Québec, G1V 4G2, Canada.
| | - Raymond J Monnat
- Departments of Pathology and Genome Sciences, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
2
|
KIMMEL MAREK. WHY MATHEMATICS IS NEEDED TO UNDERSTAND COMPLEX GENETICS DISEASES. J BIOL SYST 2012. [DOI: 10.1142/s0218339002000688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
We discuss mathematical approaches to population genetics and evolutionary theory in the context of complex genetic disease. Mechanisms, which we discuss, include gene-environment interaction in lung cancer as well as classical mechanisms of stabilization of genetic disease such as overdominance, antagonistic pleiotropy and recurring mutations. Specific modeling approaches discussed include: (1) Mathematical model of the evolution of disease chromosome applied to mapping of a disease gene. (2) Iterated Galton–Watson branching process applied to modeling of trinucleotide expansion in triplet-repeat diseases. (3) Application of Ewens' sampling formula to analysis of Single Nucleotide Polymorphism haplotypes at disease-related genes. The aim of this paper is not to present an exhaustive review, but rather to advocate mathematical modeling approaches in a field of current interest.
Collapse
Affiliation(s)
- MAREK KIMMEL
- Department of Statistics, Rice University, 6100 Main Street, MS 138, Houston, TX 77005, USA
| |
Collapse
|
3
|
Kerstens HHD, Crooijmans RPMA, Veenendaal A, Dibbits BW, Chin-A-Woeng TFC, den Dunnen JT, Groenen MAM. Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey. BMC Genomics 2009; 10:479. [PMID: 19835600 PMCID: PMC2772860 DOI: 10.1186/1471-2164-10-479] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2009] [Accepted: 10/16/2009] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND The development of second generation sequencing methods has enabled large scale DNA variation studies at moderate cost. For the high throughput discovery of single nucleotide polymorphisms (SNPs) in species lacking a sequenced reference genome, we set-up an analysis pipeline based on a short read de novo sequence assembler and a program designed to identify variation within short reads. To illustrate the potential of this technique, we present the results obtained with a randomly sheared, enzymatically generated, 2-3 kbp genome fraction of six pooled Meleagris gallopavo (turkey) individuals. RESULTS A total of 100 million 36 bp reads were generated, representing approximately 5-6% (approximately 62 Mbp) of the turkey genome, with an estimated sequence depth of 58. Reads consisting of bases called with less than 1% error probability were selected and assembled into contigs. Subsequently, high throughput discovery of nucleotide variation was performed using sequences with more than 90% reliability by using the assembled contigs that were 50 bp or longer as the reference sequence. We identified more than 7,500 SNPs with a high probability of representing true nucleotide variation in turkeys. Increasing the reference genome by adding publicly available turkey BAC-end sequences increased the number of SNPs to over 11,000. A comparison with the sequenced chicken genome indicated that the assembled turkey contigs were distributed uniformly across the turkey genome. Genotyping of a representative sample of 340 SNPs resulted in a SNP conversion rate of 95%. The correlation of the minor allele count (MAC) and observed minor allele frequency (MAF) for the validated SNPs was 0.69. CONCLUSION We provide an efficient and cost-effective approach for the identification of thousands of high quality SNPs in species currently lacking a sequenced genome and applied this to turkey. The methodology addresses a random fraction of the genome, resulting in an even distribution of SNPs across the targeted genome.
Collapse
Affiliation(s)
- Hindrik H D Kerstens
- Animal Breeding and Genomics Center, Wageningen University, Marijkeweg 40, Wageningen, 6709 PG, the Netherlands.
| | | | | | | | | | | | | |
Collapse
|
4
|
Kerstens HH, Kollers S, Kommadath A, Del Rosario M, Dibbits B, Kinders SM, Crooijmans RP, Groenen MA. Mining for single nucleotide polymorphisms in pig genome sequence data. BMC Genomics 2009; 10:4. [PMID: 19126189 PMCID: PMC2637895 DOI: 10.1186/1471-2164-10-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2008] [Accepted: 01/06/2009] [Indexed: 11/30/2022] Open
Abstract
Background Single nucleotide polymorphisms (SNPs) are ideal genetic markers due to their high abundance and the highly automated way in which SNPs are detected and SNP assays are performed. The number of SNPs identified in the pig thus far is still limited. Results A total of 4.8 million whole genome shotgun sequences obtained from the NCBI trace-repository with center name "SDJVP", and project name "Sino-Danish Pig Genome Project" were analysed for the presence of SNPs. Available BAC and BAC-end sequences and their naming and mapping information, all obtained from SangerInstitute FTP site, served as a rough assembly of a reference genome. In 1.2 Gb of pig genome sequence, we identified 98,151 SNPs in which one of the sequences in the alignment represented the polymorphism and 6,374 SNPs in which two sequences represent an identical polymorphism. To benchmark the SNP identification method, 163 SNPs, in which the polymorphism was represented twice in the sequence alignment, were selected and tested on a panel of three purebred boar lines and wild boar. Of these 163 in silico identified SNPs, 134 were shown to be polymorphic in our animal panel. Conclusion This SNP identification method, which mines for SNPs in publicly available porcine shotgun sequences repositories, provides thousands of high quality SNPs. Benchmarking in an animal panel showed that more than 80% of the predicted SNPs represented true genetic variation.
Collapse
Affiliation(s)
- Hindrik Hd Kerstens
- Animal Breeding and Genetics Group, Wageningen University, Wageningen, the Netherlands.
| | | | | | | | | | | | | | | |
Collapse
|
5
|
Innocenti F, Mirkov S, Nagasubramanian R, Ramírez J, Liu W, Bleibel WK, Shukla SJ, Hennessy K, Rosner GL, Cook E, Eileen Dolan M, Ratain MJ. The Werner's syndrome 4330T>C (Cys1367Arg) gene variant does not affect the in vitro cytotoxicity of topoisomerase inhibitors and platinum compounds. Cancer Chemother Pharmacol 2008; 63:881-7. [PMID: 18677484 DOI: 10.1007/s00280-008-0793-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2008] [Accepted: 07/06/2008] [Indexed: 02/01/2023]
Abstract
PURPOSE Werner's syndrome (WS) is a recessive disorder of premature onset of processes associated with aging. Defective DNA repair has been reported after exposure of cells isolated from WS patients to DNA-damaging agents. The germline 4330T>C (Cys1367Arg) variant in the WS gene (WRN) has been associated with protection from age-related diseases, suggesting it has a functional role. We studied whether the 4330T>C variant confers altered drug sensitivity in vitro. METHODS 4330T>C was genotyped in 372 human lymphoblastoid cell lines (LCLs) from unrelated healthy Caucasian individuals using a TaqMan-based method. The study was powered to detect the effect of the 4330T>C genotypes after exposure to camptothecin (based upon preliminary data). The effect of the 4330T>C variant on the cytotoxicity of etoposide, carboplatin, cisplatin and daunorubicin was also tested. WRN expression in 57 LCLs was measured by microarray. RESULTS No significant difference between the IC50 of the cells was observed among genotypes (P = 0.46) after exposure to camptothecin. No association was also observed for etoposide, carboplatin, cisplatin, and daunorubicin (ANOVA, P > 0.05). WRN expression also did not vary across genotypes (ANOVA, P = 0.37). CONCLUSION These results suggest that this nonsynonymous variant has relatively normal function at the cellular level.
Collapse
Affiliation(s)
- Federico Innocenti
- Department of Medicine, The University of Chicago, 5841 South Maryland Avenue, MC 2115, Chicago, IL 60637, USA.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Anderson LM. Environmental genotoxicants/carcinogens and childhood cancer: Bridgeable gaps in scientific knowledge. MUTATION RESEARCH-GENETIC TOXICOLOGY AND ENVIRONMENTAL MUTAGENESIS 2006; 608:136-56. [PMID: 16829162 DOI: 10.1016/j.mrgentox.2006.02.016] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2005] [Accepted: 02/13/2006] [Indexed: 10/24/2022]
Abstract
Cancer in children is a major concern in many countries. An important question is whether these childhood cancers are caused by something, or are just tragic random events. Causation of at least some children's cancers is suggested by direct and indirect evidence, including epidemiological data, and animal studies that predict early life sensitivity of humans to carcinogenic effects. Candidate risk factors include genotoxic agents (chemicals and radiation), but also diet/nutrition, and infectious agents/immune responses. With regard to likelihood of risks posed by genotoxicants, there are pros and cons. The biological properties of fetuses and infants are consistent with sensitivity to preneoplastic genotoxic damage. Recent studies of genetic polymorphisms in carcinogen-metabolizing enzymes confirm a role for chemicals. On the other hand, in numerous epidemiological studies, associations between childhood cancers and exposure to genotoxicants, including tobacco smoke, have been weak and hard to reproduce. Possibly, sensitive genetic or ontogenetic subpopulations, and/or co-exposure situations need to be discovered to allow identification of susceptible individuals and their risk factors. Among the critical knowledge gaps needing to be bridged to aid in this effort include detailed tissue and cellular ontogeny of carcinogen metabolism and DNA repair enzymes, and associations of polymorphisms in DNA repair enzymes with childhood cancers. Perinatal bioassays in animals of specific environmental candidates, for example, benzene, could help guide epidemiology. Genetically engineered animal models could be useful for identification of chemical effects on specific genes. Investigations of interactions between factors may be key to understanding risk. Finally, fathers and newborn infants should receive more attention as especially sensitive targets.
Collapse
Affiliation(s)
- Lucy M Anderson
- Laboratory of Comparative Carcinogenesis, National Cancer Institute at Frederick, Frederick, MD 21702, USA.
| |
Collapse
|
7
|
Bessenyei B, Márka M, Urbán L, Zeher M, Semsei I. Single nucleotide polymorphisms: aging and diseases. Biogerontology 2005; 5:291-303. [PMID: 15547317 DOI: 10.1007/s10522-004-2567-y] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Differences of more than 3 million nucleotides can bee seen comparing the genomes of two individuals as a result of single nucleotide polymorphism (SNP). More and more SNPs can be identified and it seems that these alterations are behind of several biological phenomena. Personal differences in these nucleotides result for example in elevated disease susceptibilities, that is, certain nucleotides are more frequent in patients suffering from different diseases comparing to the healthy population. SNPs may cause substantial alterations in the cells, e.g. the enzyme activity of the respective gene changes, but in other cases the effects of the SNPs are not so pronounced. Later results indicate that SNPs can be rendered to individuals living a longer life than the average. Perhaps these results will not directly lead to the lengthening of the maximal life span; however, genes that play an important role in the aging process could be identified. In this respect SNPs are important factors in determining the information level of the cells of individuals which determines the maximal life span (I. Semsei On the nature of aging. Mech. Ageing Dev . 2000; 117: 93-108), in turn SNP is one of the factors that determine the aging process. Since there are certain age-related diseases, the discovery and the description of the SNPs as a function of age and diseases may result in a better understanding of the common roots of aging and those diseases.
Collapse
Affiliation(s)
- B Bessenyei
- Medical and Health Science Center, 3rd Department of Medicine, Molecular Biology Research Laboratory, University Medical School, Moricz Zs 22, H-4004 Debrecen, Hungary
| | | | | | | | | |
Collapse
|
8
|
Brown AM, Gordon D, Lee H, Caudy M, Haroutunian V, Blass JP. Substantial linkage disequilibrium across the dihydrolipoyl succinyltransferase gene region without Alzheimer's disease association. Neurochem Res 2004; 29:629-35. [PMID: 15038610 DOI: 10.1023/b:nere.0000014833.54481.1d] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Association of the candidate gene DLST with late-onset Alzheimer's disease (LOAD) risk has been suggested on the basis of case-control studies. This gene, located on chromosome 14q24.3, encodes a subunit of a mitochondrial component known to be defective in AD, the alpha-ketoglutarate dehydrogenase complex. Positive reports have correlated different DLST alleles with LOAD, whereas other groups have failed to find any significant association. We therefore reexamined the association of DLST and LOAD in a more ethnically homogeneous series using three additional single nucleotide polymorphisms (SNP) located within or closely flanking either end of the DLST gene. Pairwise analysis of these SNPs indicated there was strong linkage disequilibrium across the DLST locus. Analysis of complex genotypes or haplotypes based upon all five SNP loci failed to identify a LOAD risk allele, suggesting that further studies of DLST in relation to AD are not warranted.
Collapse
Affiliation(s)
- Abraham M Brown
- Dementia Research Service, Burke Medical Research Institute, 785 Mamaroneck Avenue, White Plains, New York 10605, USA.
| | | | | | | | | | | |
Collapse
|
9
|
Polanski A, Kimmel M. New Explicit Expressions for Relative Frequencies of Single-Nucleotide Polymorphisms With Application to Statistical Inference on Population Growth. Genetics 2003; 165:427-36. [PMID: 14504247 PMCID: PMC1462751 DOI: 10.1093/genetics/165.1.427] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
We present new methodology for calculating sampling distributions of single-nucleotide polymorphism (SNP) frequencies in populations with time-varying size. Our approach is based on deriving analytical expressions for frequencies of SNPs. Analytical expressions allow for computations that are faster and more accurate than Monte Carlo simulations. In contrast to other articles showing analytical formulas for frequencies of SNPs, we derive expressions that contain coefficients that do not explode when the genealogy size increases. We also provide analytical formulas to describe the way in which the ascertainment procedure modifies SNP distributions. Using our methods, we study the power to test the hypothesis of exponential population expansion vs. the hypothesis of evolution with constant population size. We also analyze some of the available SNP data and we compare our results of demographic parameters estimation to those obtained in previous studies in population genetics. The analyzed data seem consistent with the hypothesis of past population growth of modern humans. The analysis of the data also shows a very strong sensitivity of estimated demographic parameters to changes of the model of the ascertainment procedure.
Collapse
Affiliation(s)
- A Polanski
- Department of Statistics, Rice University, Houston, Texas 77005, USA
| | | |
Collapse
|
10
|
Kim H, Schmidt CJ, Decker KS, Emara MG. A double-screening method to identify reliable candidate non-synonymous SNPs from chicken EST data. Anim Genet 2003; 34:249-54. [PMID: 12873212 DOI: 10.1046/j.1365-2052.2003.01003.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Discovery of non-synonymous single nucleotide polymorphisms (nsSNP), which cause amino acid substitutions, is important because they are more likely to alter protein function than synonymous SNPs (sSNP) or those SNPs that do not result in amino acid changes. By changing the coding sequences, nsSNP may play a role in heritable differences between individual organisms. In the chicken and many other vertebrates, the main obstacle for identifying nsSNP is that there is insufficient protein and mRNA sequence information for self-species referencing and thus, determination of the correct reading frame for expressed sequence tags (ESTs) is difficult. Therefore, in order to estimate the correct reading frame at nsSNP in chicken ESTs, a double-screening approach was designed using self- or cross-species protein referencing, in addition to the ESTScan coding region estimation programme. Starting with 23 427 chicken ESTs, 1210 potential SNPs were discovered using a phred/phrap/polyphred/consed pipeline process and among these, 108 candidate nsSNP were identified with the double screening method. A searchable SNP database (chicksnps) for the candidate chicken SNPs, including both nsSNPs and sSNPs is available at http://chicksnps.afs.udel.edu. The chicken SNP data described in this paper have been submitted to the data base SNP under National Center for Biotechnology Information assay ID ss4387050-ss4388259.
Collapse
Affiliation(s)
- H Kim
- Department of Animal and Food Sciences, University of Delaware, Newark, DE 19716, USA
| | | | | | | |
Collapse
|
11
|
Nebert DW. Proposal for an allele nomenclature system based on the evolutionary divergence of haplotypes. Hum Mutat 2002; 20:463-72. [PMID: 12442271 DOI: 10.1002/humu.10143] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The classical view of what constitutes an "allele" has been challenged by recent findings of a great deal of human genetic variability, i.e., we can expect, on average, one variant site every 100-250 bases of our haploid genome. The haplotype is defined as "the patterns of co-occurrence of variant sites on the same chromosome" (and therefore within each particular gene). Sufficient evidence exists for the divergence of haplotypes during evolution of Homo sapiens sapiens, and the total number of haplotypes per gene will reflect the amount of time any particular ethnic group has existed on the planet, e.g., greatest in Africans, fewer in East Asians, and still fewer in Caucasians. If the average gene spans 30 kb, we can expect approximately 170 polymorphic variant sites per gene in the world population. We do not see 2(170) haplotypes, however; we might find only 10 to 200 haplotypes (depending on the gene's size and degree of conservation of the gene product). This finite number allows for a reasonable haplotype nomenclature system for each gene, based on evolutionary divergence. For polymorphic variants (i.e., frequency > or = 0.01), I propose using Arabic numerals for the major clades (e.g., *1, *2, em leader *20, *21), capital letters for sublineages (e.g., *2A, *2B, *2C), and Arabic numerals for sub-sublineages (e.g., *22G12, *22G13); additional subcategories may be added, in an alternating number/letter/number/letter sequence, depending on the complexity of present-day haplotypes of a particular gene. Web sites with a web master and external advisory committee should be set up for each gene superfamily, family, or individual gene (depending on complexity), and an international haplotype nomenclature committee, perhaps comprised of several dozen of these web masters, should oversee haplotype nomenclature for the entire human genome. The higher heterozygosity and multiallelic nature makes haplotypes more informative than biallelic SNPs. Ultimately, our knowledge of haplotype patterns, rather than single variant sites, of perhaps several hundred genes will likely be helpful in finding associations between genotype and any multiplex phenotype (e.g., complex diseases including cancer, and/or toxicity of pharmaceutical agents or environmental pollutants).
Collapse
Affiliation(s)
- Daniel W Nebert
- Center for Environmental Genetics, Department of Environmental Health, University of Cincinnati Medical Center, Cincinnati, Ohio 45267-0056, USA.
| |
Collapse
|
12
|
Abstract
The AH receptor (AHR) mediates toxicity of 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) as well as induction of three cytochrome P450 enzymes and certain Phase II enzymes. In laboratory animals, genetic variations in the AHR lead to substantial differences in sensitivity to biochemical and toxic effects of TCDD and related compounds. Relatively few polymorphisms have been discovered in the human AHR gene; these occur predominantly in exon 10, a region that encodes a major portion of the transactivation domain of the receptor that is responsible for regulating expression of other genes. In human populations there is a wide range of variation in responses regulated by the AHR for example, induction of CYP1A1. Some variation in human responsiveness likely is due to genetically based variations in AHR structure. Thus far, however, only one pair of polymorphisms, those at codons 517 and 570, has been shown to have a clear cut and strong effect on the phenotype of an AHR-mediated response. The search continues for polymorphisms that alter AHR function because this receptor is a central factor in determining responses to important environmental contaminants and also plays a physiologic role in early development in mammals.
Collapse
Affiliation(s)
- Patricia A Harper
- Division of Clinical Pharmacology, Research Institute, The Hospital for Sick Children, 555 University Avenue, Toronto, ON, Canada M5G 1X8.
| | | | | | | |
Collapse
|