1
|
Díaz-Gay M, Alexandrov LB. Unraveling the genomic landscape of colorectal cancer through mutational signatures. Adv Cancer Res 2021; 151:385-424. [PMID: 34148618 DOI: 10.1016/bs.acr.2021.03.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Colorectal cancer, along with most other cancer types, is driven by somatic mutations. Characteristic patterns of somatic mutations, known as mutational signatures, arise as a result of the activities of different mutational processes. Mutational signatures have diverse origins, including exogenous and endogenous sources. In the case of colorectal cancer, the analysis of mutational signatures has elucidated specific signatures for classically associated DNA repair deficiencies, namely mismatch repair (leading to microsatellite instability), base excision repair (due to MUTYH or NTHL1 mutations), and polymerase proofreading (due to POLE and POLD1 exonuclease domain mutations). Additional signatures also play a role in colorectal cancer, including those related to normal aging and those associated with gut microbiota, as well as a number of signatures with unknown etiologies. This chapter provides an overview of the current knowledge of mutational signatures, with a focus on colorectal cancer and on the recently reported signatures in physiologically normal and inflammatory bowel disease-affected somatic colon tissues.
Collapse
Affiliation(s)
- Marcos Díaz-Gay
- Department of Cellular and Molecular Medicine, UC San Diego, La Jolla, CA, United States; Department of Bioengineering, UC San Diego, La Jolla, CA, United States; Moores Cancer Center, UC San Diego, La Jolla, CA, United States
| | - Ludmil B Alexandrov
- Department of Cellular and Molecular Medicine, UC San Diego, La Jolla, CA, United States; Department of Bioengineering, UC San Diego, La Jolla, CA, United States; Moores Cancer Center, UC San Diego, La Jolla, CA, United States.
| |
Collapse
|
2
|
Alexandrov LB, Kim J, Haradhvala NJ, Huang MN, Tian Ng AW, Wu Y, Boot A, Covington KR, Gordenin DA, Bergstrom EN, Islam SMA, Lopez-Bigas N, Klimczak LJ, McPherson JR, Morganella S, Sabarinathan R, Wheeler DA, Mustonen V, Getz G, Rozen SG, Stratton MR. The repertoire of mutational signatures in human cancer. Nature 2020; 578:94-101. [PMID: 32025018 PMCID: PMC7054213 DOI: 10.1038/s41586-020-1943-3] [Citation(s) in RCA: 2019] [Impact Index Per Article: 403.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Accepted: 11/18/2019] [Indexed: 01/27/2023]
Abstract
Somatic mutations in cancer genomes are caused by multiple mutational processes, each of which generates a characteristic mutational signature1. Here, as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium2 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we characterized mutational signatures using 84,729,690 somatic mutations from 4,645 whole-genome and 19,184 exome sequences that encompass most types of cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The substantial size of our dataset, compared with previous analyses3-15, enabled the discovery of new signatures, the separation of overlapping signatures and the decomposition of signatures into components that may represent associated-but distinct-DNA damage, repair and/or replication mechanisms. By estimating the contribution of each signature to the mutational catalogues of individual cancer genomes, we revealed associations of signatures to exogenous or endogenous exposures, as well as to defective DNA-maintenance processes. However, many signatures are of unknown cause. This analysis provides a systematic perspective on the repertoire of mutational processes that contribute to the development of human cancer.
Collapse
Affiliation(s)
- Ludmil B Alexandrov
- Department of Cellular and Molecular Medicine, Department of Bioengineering, Moores Cancer Center, University of California, San Diego, CA, USA
| | - Jaegil Kim
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nicholas J Haradhvala
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA
| | - Mi Ni Huang
- Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore
- Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore
| | - Alvin Wei Tian Ng
- Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore
- Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore
| | - Yang Wu
- Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore
- Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore
| | - Arnoud Boot
- Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore
- Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore
| | - Kyle R Covington
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Dan L. Duncan Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Dmitry A Gordenin
- Genome Integrity and Structural Biology Laboratory, National Institute of Environmental Health Sciences (NIEHS), Durham, NC, USA
| | - Erik N Bergstrom
- Department of Cellular and Molecular Medicine, Department of Bioengineering, Moores Cancer Center, University of California, San Diego, CA, USA
| | - S M Ashiqul Islam
- Department of Cellular and Molecular Medicine, Department of Bioengineering, Moores Cancer Center, University of California, San Diego, CA, USA
| | - Nuria Lopez-Bigas
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| | - Leszek J Klimczak
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences (NIEHS), Durham, NC, USA
| | - John R McPherson
- Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore
- Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore
| | | | - Radhakrishnan Sabarinathan
- Research Program on Biomedical Informatics, Universitat Pompeu Fabra, Barcelona, Spain
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore, India
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - David A Wheeler
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Ville Mustonen
- Department of Computer Science, University of Helsinki, Helsinki, Finland
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | - Gad Getz
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Cancer Research, Massachusetts General Hospital, Boston, MA, USA
- Department of Pathology, Massachusetts General Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Steven G Rozen
- Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, Singapore, Singapore.
- Centre for Computational Biology, Duke-NUS Medical School, Singapore, Singapore.
- SingHealth, Duke-NUS Institute of Precision Medicine, National Heart Centre Singapore, Singapore, Singapore.
| | | |
Collapse
|
3
|
Chen JM, Férec C, Cooper DN. Patterns and mutational signatures of tandem base substitutions causing human inherited disease. Hum Mutat 2013; 34:1119-30. [PMID: 23606422 DOI: 10.1002/humu.22341] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2013] [Accepted: 04/09/2013] [Indexed: 12/31/2022]
Abstract
Tandem base substitutions (TBSs) are multiple mutations that comprise two or more contiguous nucleotide substitutions without any net gain or loss of bases. They have recently become recognized as a distinct category of human genomic variant. However, their role in causing human inherited disease so far has not been studied methodically. Here, using data from the Human Gene Mutation Database (http://www.hgmd.org), we identified 477 events to be TBSs (doublets, 448; triplets, 16; and quadruplets to octuplets, 13). A comprehensive sequence pattern and context analysis implied the likely fundamental importance of translesion synthesis (TLS) DNA polymerases in generating these diverse TBSs but revealed that TLS polymerases may operate differently in generating TBSs of ≤ 3 bases (bypass of endogenous DNA lesions) than those of ≥ 4 bases (serial replication slippage). Moreover, GC was found to be the most frequently affected dinucleotide with GC/GC>AA/TT being the most frequent double TBS. Comparison with cancer genome mutational spectra allowed us to conclude that human germline TBSs arise predominantly through the action of endogenous mechanisms of mutagenesis rather than through exposure to exogenous mutagens. Finally, the rates of double and triple TBSs were estimated to be 0.2-1.2 × 10(-10) and 0.8-4.8 × 10(-12) per base per generation, respectively.
Collapse
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale, Brest, France.
| | | | | |
Collapse
|
4
|
Chen JM, Férec C, Cooper DN. Transient hypermutability, chromothripsis and replication-based mechanisms in the generation of concurrent clustered mutations. Mutat Res 2012; 750:52-9. [PMID: 22100908 DOI: 10.1016/j.mrrev.2011.10.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2011] [Revised: 10/25/2011] [Accepted: 10/27/2011] [Indexed: 12/21/2022]
Abstract
Clustered mutations may be broadly defined as the presence of two or more mutations within a spatially localized genomic region on a single chromosome. Known instances vary in terms of both the number and type of the component mutations, ranging from two closely spaced point mutations to tens or even hundreds of genomic rearrangements. Although clustered mutations can represent the observable net result of independent lesions sequentially acquired over multiple cell cycles, they can also be generated in a simultaneous or quasi-simultaneous manner within a single cell cycle. This review focuses on those mechanisms known to underlie the latter type. Both gene conversion and transient hypermutability are capable of generating closely spaced multiple mutations. However, a recently described phenomenon in human cancer cells, known as 'chromothripsis', has provided convincing evidence that tens to hundreds of genomic rearrangements can sometimes be generated simultaneously via a single catastrophic event. The distinctive genomic features observed in the derivative chromosomes, together with the highly characteristic junction sequences, point to non-homologous end joining (NHEJ) as being the likely underlying mutational mechanism. By contrast, replication-based mechanisms such as microhomology-mediated break-induced replication (MMBIR) which involves serial replication slippage or serial template switching probably account for those complex genomic rearrangements that comprise multiple duplications and/or triplications.
Collapse
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale (INSERM), U613, Brest, France.
| | | | | |
Collapse
|
5
|
Chen JM, Férec C, Cooper DN. Closely spaced multiple mutations as potential signatures of transient hypermutability in human genes. Hum Mutat 2009; 30:1435-48. [PMID: 19685533 DOI: 10.1002/humu.21088] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Data from diverse organisms suggests that transient hypermutability is a general mutational mechanism with the potential to generate multiple synchronous mutations, a phenomenon probably best exemplified by closely spaced multiple mutations (CSMMs). Here we have attempted to extend the concept of transient hypermutability from somatic cells to the germline, using human inherited disease-causing multiple mutations as a model system. Employing stringent criteria for data inclusion, we have retrospectively identified numerous potential examples of pathogenic CSMMs that exhibit marked similarities to the CSMMs reported in other systems. These examples include (1) eight multiple mutations, each comprising three or more components within a sequence tract of <100 bp; (2) three possible instances of "mutation showers"; and (3) numerous highly informative "homocoordinate" mutations. Using the proportion of CpG substitution as a crude indicator of the relative likelihood of transient hypermutability, we present evidence to suggest that CSMMs comprising at least one pair of mutations separated by < or =100 bp may constitute signatures of transient hypermutability in human genes. Although this analysis extends the generality of the concept of transient hypermutability and provides new insights into what may be considered a novel mechanism of mutagenesis underlying human inherited disease, it has raised serious concerns regarding current practices in mutation screening.
Collapse
Affiliation(s)
- Jian-Min Chen
- Institut National de la Santé et de la Recherche Médicale, U613, Brest, France.
| | | | | |
Collapse
|
6
|
Chen Z, Feng J, Buzin CH, Liu Q, Weiss L, Kernstine K, Somlo G, Sommer SS. Analysis of cancer mutation signatures in blood by a novel ultra-sensitive assay: monitoring of therapy or recurrence in non-metastatic breast cancer. PLoS One 2009; 4:e7220. [PMID: 19789704 PMCID: PMC2749210 DOI: 10.1371/journal.pone.0007220] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2008] [Accepted: 04/20/2009] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Tumor DNA has been shown to be present both in circulating tumor cells in blood and as fragments in the plasma of metastatic cancer patients. The identification of ultra-rare tumor-specific mutations in blood would be the ultimate marker to measure efficacy of cancer therapy and/or early recurrence. Herein we present a method for detecting microinsertions/deletions/indels (MIDIs) at ultra-high analytical selectivity. MIDIs comprise about 15% of mutations. METHODS AND FINDINGS We describe MIDI-Activated Pyrophosphorolysis (MAP), a method of ultra-high analytical selectivity for detecting MIDIs. The high analytical selectivity of MAP is putatively due to serial coupling of two rare events: heteroduplex slippage and mis-pyrophosphorolysis. MAP generally has an analytical selectivity of one mutant molecule per >1 billion wild type molecules and an analytical sensitivity of one mutant molecule per reaction. The analytical selectivity of MAP is about 100,000-fold better than that of our previously described method of Pyrophosphorolysis Activated Polymerization-Allele specific amplification (PAP-A) for detecting MIDIs. The utility of this method is illustrated in two ways. 1) We demonstrate that two EGFR deletions commonly found in lung cancers are not present in tissue from four normal human lungs (10(7) copies of gDNA each) or in blood samples from 10 healthy individuals (10(7) copies of gDNA each). This is inconsistent, at least at an analytical sensitivity of 10(-7), with the hypotheses of (a) hypermutation or (b) strong selection of these growth factor-mutated cells during normal lung development leads to accumulation of pre-neoplastic cells with these EGFR mutations, which sometimes can lead to lung cancer in late adulthood. Moreover, MAP was used for large scale, high throughput "gene pool" analysis. No germline or early embryonic somatic mosaic mutation was detected (at a frequency of >0.3%) for the 15/18 bp EGFR deletion mutations in 6,400 individuals, suggesting that early embryonic EGFR somatic mutation is very rare, inconsistent with hypermutation or strong selection of these deletions in the embryo. 2) The second illustration of MAP utility is in personalized monitoring of therapy and early recurrence in cancer. Tumor-specific p53 mutations identified at diagnosis in the plasma of six patients with stage II and III breast cancer were undetectable after therapy in four women, consistent with clinical remission, and continued to be detected after treatment in two others, reflecting tumor progression. CONCLUSIONS MAP has an analytical selectivity of one part per billion for detection of MIDIs and an analytical sensitivity of one molecule. MAP provides a general tool for monitoring ultra-rare mutations in tissues and blood. As an example, we show that the personalized cancer signature in six out of six patients with non-metastatic breast cancer can be detected and that levels over time are correlated with the clinical course of disease.
Collapse
Affiliation(s)
- Zhenbin Chen
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California, United States of America
| | - Jinong Feng
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California, United States of America
| | - Carolyn H. Buzin
- Department of Molecular Diagnosis, City of Hope National Medical Center, Duarte, California, United States of America
- MEDomics, LLC, Azusa, California, United States of America
| | - Qiang Liu
- Department of Molecular Diagnosis, City of Hope National Medical Center, Duarte, California, United States of America
| | - Lawrence Weiss
- Department of Anatomic Pathology, City of Hope National Medical Center, Duarte, California, United States of America
| | - Kemp Kernstine
- Division of Surgery, City of Hope National Medical Center, Duarte, California, United States of America
| | - George Somlo
- Department of Medical Oncology, City of Hope National Medical Center, Duarte, California, United States of America
| | - Steve S. Sommer
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California, United States of America
- Department of Molecular Diagnosis, City of Hope National Medical Center, Duarte, California, United States of America
- MEDomics, LLC, Azusa, California, United States of America
| |
Collapse
|
7
|
Chen Z, Feng J, Buzin CH, Sommer SS. Epidemiology of doublet/multiplet mutations in lung cancers: evidence that a subset arises by chronocoordinate events. PLoS One 2008; 3:e3714. [PMID: 19005564 PMCID: PMC2579325 DOI: 10.1371/journal.pone.0003714] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2008] [Accepted: 10/10/2008] [Indexed: 01/01/2023] Open
Abstract
Background Evidence strongly suggests that spontaneous doublet mutations in normal mouse tissues generally arise from chronocoordinate events. These chronocoordinate mutations sometimes reflect “mutation showers”, which are multiple chronocoordinate mutations spanning many kilobases. However, little is known about mutagenesis of doublet and multiplet mutations (domuplets) in human cancer. Lung cancer accounts for about 25% of all cancer deaths. Herein, we analyze the epidemiology of domuplets in the EGFR and TP53 genes in lung cancer. The EGFR gene is an oncogene in which doublets are generally driver plus driver mutations, while the TP53 gene is a tumor suppressor gene with a more typical situation in which doublets derive from a driver and passenger mutation. Methodology/Principal Findings EGFR mutations identified by sequencing were collected from 66 published papers and our updated EGFR mutation database (www.egfr.org). TP53 mutations were collected from IARC version 12 (www-p53.iarc.fr). For EGFR and TP53 doublets, no clearly significant differences in race, ethnicity, gender and smoking status were observed. Doublets in the EGFR and TP53 genes in human lung cancer are elevated about eight- and three-fold, respectively, relative to spontaneous doublets in mouse (6% and 2.3% versus 0.7%). Conclusions/Significance Although no one characteristic is definitive, the aggregate properties of doublet and multiplet mutations in lung cancer are consistent with a subset derived from chronocoordinate events in the EGFR gene: i) the eight frameshift doublets (present in 0.5% of all patients with EGFR mutations) are clustered and produce a net in-frame change; ii) about 32% of doublets are very closely spaced (≤30 nt); and iii) multiplets contain two or more closely spaced mutations. TP53 mutations in lung cancer are very closely spaced (≤30 nt) in 33% of doublets, and multiplets generally contain two or more very closely spaced mutations. Work in model systems is necessary to confirm the significance of chronocoordinate events in lung and other cancers.
Collapse
Affiliation(s)
- Zhenbin Chen
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California, United States of America
| | - Jinong Feng
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California, United States of America
| | - Carolyn H. Buzin
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California, United States of America
| | - Steve S. Sommer
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California, United States of America
- * E-mail:
| |
Collapse
|
8
|
Scaringe WA, Li K, Gu D, Gonzalez KD, Chen Z, Hill KA, Sommer SS. Somatic microindels in human cancer: the insertions are highly error-prone and derive from nearby but not adjacent sense and antisense templates. Hum Mol Genet 2008; 17:2910-8. [PMID: 18632684 DOI: 10.1093/hmg/ddn190] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Somatic microindels (microdeletions with microinsertions) have been studied in normal mouse tissues using the Big Blue lacI transgenic mutation detection system. Here we analyze microindels in human cancers using an endogenous and transcribed gene, the TP53 gene. Microindel frequency, the enhancement of 1-2 microindels and other features are generally similar to that observed in the non-transcribed lacI gene in normal mouse tissues. The current larger sample of somatic microindels reveals recurroids: mutations in which deletions are identical and the co-localized insertion is similar. The data reveal that the inserted sequences derive from nearby but not adjacent sequences in contrast to the slippage that characterizes the great majority of pure microinsertions. The microindel inserted sequences derive from a template on the sense or antisense strand with similar frequency. The estimated error rate of the insertion process of 13% per bp is by far the largest reported in vivo, with the possible exception of somatic hypermutation in the immunoglobulin gene. The data constrain possible mechanisms of microindels and raise the question of whether microindels are 'scars' from the bypass of large DNA adducts by a translesional polymerase, e.g. the 'Tarzan model' presented herein.
Collapse
Affiliation(s)
- William A Scaringe
- Department of Molecular Genetics, City of Hope National Medical Center, 1500 E. Duarte Rd, Duarte, CA 91010, USA
| | | | | | | | | | | | | |
Collapse
|
9
|
Chen Z, Feng J, Saldivar JS, Gu D, Bockholt A, Sommer SS. EGFR somatic doublets in lung cancer are frequent and generally arise from a pair of driver mutations uncommonly seen as singlet mutations: one-third of doublets occur at five pairs of amino acids. Oncogene 2008; 27:4336-43. [PMID: 18372921 DOI: 10.1038/onc.2008.71] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Doublet mutations in cancer are not well studied. We find that allelic somatic doublet mutations are present at high frequency in the epidermal growth factor receptor (EGFR) tyrosine kinase (TK) domain in lung cancers. When doublets from the literature are added, a total of 96 doublets are available for analysis. The frequency of doublets overall is 6%, which is sevenfold greater than that observed in normal tissue in mouse. All characterized doublets are allelic, and silent mutations occur rarely. About half of all doublets contain one or two of 12 distinct missense mutations at five amino acids: E709, G719, S768, T790 and L861. The mutations at these five amino acids are seldom reported as singlets. Moreover, when the common L858 target is included, more than one-third of EGFR doublets are one of five specific missense pairs: G719/E709, G719/S768, G719/L861, L858/E709 and L858/T790. Structure suggests function: The data imply that most EGFR doublets are NOT consistent with a 'driver and passenger' mutation mechanism. EGFR doublets are highly skewed relative to singlets, consistent with functional selection of two individually suboptimal mutations that, in combination, have enhanced oncogenic potential.
Collapse
Affiliation(s)
- Z Chen
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, CA 91010-3000, USA
| | | | | | | | | | | |
Collapse
|
10
|
Gu D, Scaringe WA, Li K, Saldivar JS, Hill KA, Chen Z, Gonzalez KD, Sommer SS. Database of somatic mutations in EGFR with analyses revealing indel hotspots but no smoking-associated signature. Hum Mutat 2007; 28:760-70. [PMID: 17429853 DOI: 10.1002/humu.20512] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
We created an Epidermal Growth Factor Receptor (EGFR) Mutation Database (http://www.cityofhope.org/cmdl/egfr_db) that curates a convenient compilation of somatic EGFR mutations in non-small-cell lung cancer (NSCLC) and associated epidemiological and methodological data, including response to the tyrosine kinase inhibitors Gefitinib and Erlotinib. Herein, we analyze 809 mutations collected from 26 publications. Four super hotspots account for 70% of reported mutations while two-thirds of 131 unique mutations have been reported only once and account for only 11% of reported mutations. Consistent with strong biological selection for gain of function, the reported mutations are virtually all missense substitutions or in-frame microdeletions, microinsertions, or microindels (colocalized insertion and deletion with a net gain or loss of 1-50 nucleotides). Microdeletions and microindels are common in a region of exon 19. Microindels, which account for 8% of mutations, have smaller inserted sequences (95% are 1 to 5 bp) and are elevated 16-fold relative to mouse somatic microindels and to human germline microindels. Microdeletions/microindels are significantly more frequent in responders to Gefitinib or Erlotinib (P = 0.003). In addition, EGFR mutations in smokers do not carry signatures of mutagens in cigarette smoke. Otherwise, the mutation pattern does not differ significantly with respect to gender, age, or tumor histology. The EGFR Mutation Database is a central resource of EGFR sequence variant data for clinicians, geneticists, and other researchers. Authors are encouraged to submit new publications with EGFR sequence variants to be included in the database or to provide direct submissions via The WayStation submission and publication process (http://www.centralmutations.org).
Collapse
Affiliation(s)
- Dongqing Gu
- Clinical Molecular Diagnostic Laboratory (CMDL), Department of Molecular Diagnosis, City of Hope National Medical Center, City of Hope, Duarte, California 91010-0269, USA
| | | | | | | | | | | | | | | |
Collapse
|
11
|
Wang J, Gonzalez KD, Scaringe WA, Tsai K, Liu N, Gu D, Li W, Hill KA, Sommer SS. Evidence for mutation showers. Proc Natl Acad Sci U S A 2007; 104:8403-8. [PMID: 17485671 PMCID: PMC1895962 DOI: 10.1073/pnas.0610902104] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Mutants in the Big Blue transgenic mouse system show spontaneous clustered multiple mutations with unexpectedly high frequency, consistent with chronocoordinate events. We tested the prediction that the multiple mutations seen within the lacI mutation target sometimes occur in the context of chronocoordinate multiple mutations spanning multiple kilobases (mutation showers). Additional sequencing of mutants was performed in regions immediately flanking the lacI region (total of 10.7 kb). Nineteen additional mutations were found outside the lacI region ("ectomutations") from 10 mutants containing two or more lacI mutations, whereas only one ectomutation was found in 130 mutants with a single mutation (P < 0.0001). The mutation showers had an average of approximately one mutation per 3 kb. Four mutants showed closely spaced double mutations in the new sequence, and analysis of the spacing between these mutations revealed significant clustering (P = 0.0098). To determine the extent of the mutation showers, regions (8.5 kb total) remote from the lacI region (approximately 16-17 kb away) were sequenced. Only two additional ectomutations were found in these remote regions, consistent with mutation showers that generally do not extend more than approximately 30 kb. We conclude that mutation showers exist and that they constitute at least 0.2% and possibly 1% or more of mutational events observed in this system. The existence of mutation showers has implications for oncogenesis and evolution, raising the possibilities of "cancer in an instant" and "introns as sponges to reduce the deleterious impact of mutation showers."
Collapse
Affiliation(s)
| | | | - William A. Scaringe
- *Department of Molecular Genetics and
- Bioinformatics Group, Department of Molecular Genetics, City of Hope National Medical Center, Duarte, CA 91010; and
| | | | - Ning Liu
- *Department of Molecular Genetics and
| | - Dongqing Gu
- *Department of Molecular Genetics and
- Bioinformatics Group, Department of Molecular Genetics, City of Hope National Medical Center, Duarte, CA 91010; and
| | - Wenyan Li
- *Department of Molecular Genetics and
| | - Kathleen A. Hill
- *Department of Molecular Genetics and
- Department of Biology, University of Western Ontario, London, ON, CanadaN6A 5B7
| | - Steve S. Sommer
- *Department of Molecular Genetics and
- To whom correspondence should be addressed at:
Beckman Research Institute/City of Hope National Medical Center, 1500 East Duarte Road, Duarte, CA 91010-0269. E-mail:
| |
Collapse
|
12
|
Gonzalez KD, Hill KA, Li K, Li W, Scaringe WA, Wang JC, Gu D, Sommer SS. Somatic microindels: analysis in mouse soma and comparison with the human germline. Hum Mutat 2007; 28:69-80. [PMID: 16977595 DOI: 10.1002/humu.20416] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Microindels, defined as mutations that result in a colocalized microinsertion and microdeletion with a net gain or loss of between 1 and 50 nucleotides, may be an important contributor to cancer. We report the first comprehensive analysis of somatic microindels. Our large database of mutations in the lacI transgene of Big Blue((R)) mice contains 0.5% microindels, 2.8% pure microinsertions, and 11.5% pure microdeletions. There appears to be no age, gender, or tissue-type specificity in the frequency of microindels. Of the independent somatic mutations that result in a net in-frame insertion or deletion, microindels are responsible for 13% of protein expansions and 6% of protein contractions. These in-frame microindels may play a crucial role in oncogenesis and evolution via "protein tinkering" (i.e., modest expansion or contraction of proteins). Four characteristics suggest that microindels are caused by unique mechanisms, not just simple combinations of the same mechanisms that cause pure microinsertions and pure microdeletions. First, microinsertions and microdeletions commonly occur at hotspots, but none of the 30 microindels are recurrent. Second, the sizes of the deletions and insertions in microindels are larger and more varied than in pure microdeletions and pure microinsertions. Third, microinsertions overwhelmingly repeat the adjacent base (97%) while the insertions in microindels do so only infrequently (17%). Fourth, analysis of the sequence contexts of microindels is consistent with unique mechanisms including recruitment of translesion DNA synthesis polymerases. The mouse somatic microindels have characteristics similar to those of human germline microindels, consistent with similar causative mechanisms in mouse and human, and in soma and germline.
Collapse
Affiliation(s)
- Kelly D Gonzalez
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California
| | | | | | | | | | | | | | | |
Collapse
|
13
|
Hill KA, Gonzalez KD, Scaringe WA, Wang JC, Sommer SS. Preferential occurrence of 1-2 microindels. Hum Mutat 2006; 27:55-61. [PMID: 16281281 DOI: 10.1002/humu.20260] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Microindels are unique, infrequent mutations that result in inserted and deleted sequences of different sizes (between one and 50 nucleotides) at the same nucleotide position. Little is known about the mutational mechanisms that are responsible for these mutations. From our database of 6,016 independent somatic mutational events in the lacI gene in Big Blue mice, we assembled the 30 microindels (0.5%) for analysis. Microindels with one nucleotide inserted and two nucleotides deleted (1-2 microindels) accounted for seven (23%) of the microindels observed, with the remaining microindels distributed among 21 other combinations of insertion and deletion sizes. A preferential occurrence of 1-2 microindels (20%) was also observed in human germline transmitted mutations in the Human Gene Mutation Database (HGMD). An examination of the sequence flanking the mouse 1-2 microindels did not reveal obvious site specificity or associated secondary structure. A detailed examination of 1-2 microindels did not reveal the features typical of pure microinsertion and microdeletion events, but rather suggested a unique mutational mechanism. The 1 bp insertion in 1-2 microinsertions, and pure 1 bp insertions show distinct features. The mechanism for 1-2 microindels is not obviously a simple combination of pure microinsertion and microdeletion events. The dramatic enhancement of 1-2 microindels requires explanation. We speculate that certain error-prone polymerases may be responsible for the preferential occurrence of 1-2 microindels in both somatic tissues and germ cells. It is estimated that a human adult carries roughly 400 billion somatic 1-2 microindels with the potential to predispose to cancer.
Collapse
Affiliation(s)
- Kathleen A Hill
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California 91010-0269, USA
| | | | | | | | | |
Collapse
|
14
|
Ono T, Ikehata H, Uehara Y, Komura JI. The Maintenance of Genome Integrity is Tissue-Specific. Genes Environ 2006. [DOI: 10.3123/jemsge.28.16] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
15
|
Hill KA, Halangoda A, Heinmoeller PW, Gonzalez K, Chitaphan C, Longmate J, Scaringe WA, Wang JC, Sommer SS. Tissue-specific time courses of spontaneous mutation frequency and deviations in mutation pattern are observed in middle to late adulthood in Big Blue mice. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS 2005; 45:442-454. [PMID: 15690342 DOI: 10.1002/em.20119] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
To better define the time course of spontaneous mutation frequency in middle to late adulthood of the mouse, measurements were made at 10, 14, 17, 23, 25, and 30 months of age in samples of adipose tissue, liver, cerebellum (90% neurons), and the male germline (95% germ cells). A total of 46 million plaque-forming units (pfus) were screened at the six time points and 1,450 circular blue plaques were harvested and sequenced. These data improve resolution and confirm the previously observed occurrence of at least two tissue-specific profiles of spontaneous mutation frequency (elevation with age in adipose tissue and liver, and constancy with age in neurons and male germ cells), a low mutation frequency in the male germline, and a mutation pattern unchanged with age within a tissue. These findings appear to extend to very old age (30 months). Additional findings include interanimal variation in spontaneous mutation frequency is larger in adipose tissues and liver compared with neurons and male germ cells, and subtle but significant differences in the mutation pattern among tissues, consistent with a minor effect of tissue-specific metabolism. The presumptive unaltered balance of DNA damage and repair with age in the male germline has evolutionary consequences. It is of particular interest given the controversy over whether or not increasing germline mutation frequency with paternal age underlies the reports associating older males with a higher incidence of some types of genetic disease. These most detailed measurements available to date regarding the time course of spontaneous mutation frequency and pattern in individual tissues help to constrain hypotheses regarding the role of mutational mechanisms in DNA repair and aging.
Collapse
Affiliation(s)
- Kathleen A Hill
- Department of Molecular Genetics, City of Hope National Medical Center, Duarte, California 91010, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Abstract
A PubMed search for the term "oxidative stress" yields over 29,000 articles published on the subject over the past 10 years; more than 2000 of these articles also include the term "aging" in their title or abstract. Many theories of aging predict causal roles for oxidative stress in the myriad of pathological changes that occur as a function of age, including an increasing propensity to develop cancer. A possible link between aging and cancer is the induction and accumulation of somatic mutations caused by oxidative stress. This Review focuses on small mutational events that are induced by oxidative stress and the role of mismatch repair (MMR) in preventing their formation. It also discusses a possible inhibitory effect of oxidative stress on MMR. We speculate that a synergistic interaction between oxidative damage to DNA and reduced MMR levels will, in part, account for an accumulation of small mutational events, and hence cancer, with aging.
Collapse
Affiliation(s)
- Amy M Skinner
- Center for Research on Occupational and Environmental Toxicology, Oregon Health & Science University, Portland, OR 97239, USA
| | | |
Collapse
|
17
|
Ono T, Ikehata H, Pithani VP, Uehara Y, Chen Y, Kinouchi Y, Shimosegawa T, Hosoi Y. Spontaneous mutations in digestive tract of old mice show tissue-specific patterns of genomic instability. Cancer Res 2004; 64:6919-23. [PMID: 15466182 DOI: 10.1158/0008-5472.can-04-1476] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In an attempt to evaluate the possible role of mutations in the age-dependent increase of tumor incidence, we studied the mutational burden that accumulates in the aging process in different parts of the digestive tract in mice. The mutations were monitored in lacZ genes integrated in the mouse genome. The digestive tract was divided into the esophagus, stomach, proximal, medial, and distal part of the small intestine, and the colon. Epithelial tissues were separated from these tissues with the exception of the esophagus, in which case the whole tissue was examined. At a young age, the mutant frequencies as well as the molecular nature of the mutations were similar among the tissues examined. In old age, on the other hand, mutant frequencies were elevated to different degrees among the tissues; they were high in the small intestine and colon, intermediate in the stomach, and low in the esophagus. The molecular characteristics of the mutations also revealed distinct tissue-specificity; there were elevated rates of a small deletion mutation in the esophagus, G:C to T:A transversion in the proximal small intestine, and multiple mutations in the distal small intestine and colon. The results indicate that different parts of the digestive tract suffer from different kinds of mutational stress in the aging process. The nature of the multiple mutations suggests the presence of a mutator phenotype based on an imbalance in deoxyribonucleotide pools.
Collapse
Affiliation(s)
- Tetsuya Ono
- Division of Genome and Radiation Biology, Graduate School of Medicine, Tohoku University, Sendai, Japan.
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Hill KA, Wang J, Farwell KD, Scaringe WA, Sommer SS. Spontaneous multiple mutations show both proximal spacing consistent with chronocoordinate events and alterations with p53-deficiency. Mutat Res 2004; 554:223-40. [PMID: 15450421 DOI: 10.1016/j.mrfmmm.2004.05.005] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2003] [Revised: 05/05/2004] [Accepted: 05/07/2004] [Indexed: 11/25/2022]
Abstract
Analysis of spontaneous multiple mutations in normal and tumor cells may constrain hypotheses about the mechanisms responsible for multiple mutations and provide insight into the mutator phenotype. In a previous study, spontaneous doublets in Big Blue mice were dramatically more frequent than expected by chance and exhibited a mutation pattern similar to that observed for single mutations [Mutat. Res. 452 (2000) 219]. The spacing between mutations in doublets was generally closer than expected by chance and the distribution of mutation spacing fit an exponential, albeit with substantial scatter. We now analyze 2658 additional mutants and confirm that doublets are enhanced dramatically relative to chance expectation. The spacing, frequency and pattern of spontaneous doublets and multiplets (domuplets) are examined as a function of age, tissue type, p53-deficiency and neoplasia in the new and combined data. The new and combined data confirm that the distribution of the spacing between mutations in doublets is non-random with the mutations more closely spaced than expected by chance (P < 0.0005; combined data), consistent with temporally coordinate (chronocoordinate) events. An exponential provides an excellent fit to the distribution (R2 = 0.98) and estimates that half of doublets have mutations separated by 120 nucleotides or less (the "half-life of mutation spacing"). We make several novel observations: (i) singlets and doublets show similar overall increases in frequency with age (ii) doublet frequency may be lower in the male germline, consistent with the generally reduced mutation frequency in the male germline (iii) doublet frequencies are elevated in somatic tissues of p53-deficient mice (Li-Fraumini cancer syndrome model; P = 0.005) and (iv) doublets and singlets in tumors from p53-deficient mice have a different mutation pattern (P = 0.007). The observations are consistent with chronocoordinate occurrence of spontaneous doublets and multiplets due to a transient error-prone condition and do not suggest a major role for the recently discovered Y family of error-prone polymerases. The enhancement of doublets in p53-deficient mice may contribute to cancer risk.
Collapse
Affiliation(s)
- Kathleen A Hill
- Department of Molecular Genetics and Molecular Diagnosis, Beckman Research Institute, City of Hope National Medical Center, Duarte, CA, USA
| | | | | | | | | |
Collapse
|
19
|
Curry BJ, Roman SD, Wallace CA, Scott R, Miriami E, Aitken RJ. Identification and characterization of a novel splice variant of mouse and rat cytochrome b5/cytochrome b5 reductase. Genomics 2004; 83:425-38. [PMID: 14962668 DOI: 10.1016/j.ygeno.2003.08.020] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2003] [Accepted: 08/22/2003] [Indexed: 10/27/2022]
Abstract
Cytochrome b5/cytochrome b5 reductase (cb5/cb5r) is a cytosolic fusion protein between the hemoprotein cytochrome b5 and the flavoprotein cytochrome b5 reductase. We describe the identification and characterization of a novel splice variant of cb5/cb5r in the mouse and rat and show that expression of the variant is conserved in both species but is not expressed in human tissue. Characterization of the exon structure of cb5/cb5r indicated that the variant was due to the deletion of the whole of exon 12, thus the variant was named cb5/cb5rdelta12. Exon 12 codes for the flavin-adenine dinucleotide binding domain of cb5/cb5r. Expression analysis revealed the transcript of cb5/cb5rdelta12 in mouse and rat testis, brain, and skeletal muscle and also in the male germ line. We postulate that cb5/cb5rdelta12 may function in a dominant negative fashion, limiting the amount of damage caused by the production of reactive oxygen species by cb5/cb5r.
Collapse
Affiliation(s)
- Benjamin J Curry
- ARC Centre of Excellence for Biotechnology and Development, Discipline of Biological Sciences, School of Environmental and Life Sciences, University of Newcastle, University Drive, Callaghan, NSW 2308, Australia
| | | | | | | | | | | |
Collapse
|
20
|
Hill KA, Buettner VL, Halangoda A, Kunishige M, Moore SR, Longmate J, Scaringe WA, Sommer SS. Spontaneous mutation in Big Blue mice from fetus to old age: tissue-specific time courses of mutation frequency but similar mutation types. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS 2004; 43:110-120. [PMID: 14991751 DOI: 10.1002/em.20004] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Transgenic mouse mutation detection systems permit rapid determination of the frequency and type of mutations allowing direct examination of mutational markers for aging, neurodegeneration, and cancer. The Big Blue transgenic mouse mutation detection system was used to determine the frequency and nature of spontaneous mutations versus age in multiple tissue types. Nuclear DNA was extracted from whole fetus at 13.5 days postcoitus (dpc) and from six tissues postbirth (cerebellum, forebrain, thymus, liver, adipose tissue, and male germline) of Big Blue transgenic mice at four ages: 10 days and at 3, 10, and 25 months postbirth. Forty million total plaque-forming units (pfu) were screened. The time course of mutation frequency with age had a significantly different shape in different tissues (P < 10(-6)). By 13.5 dpc, the whole fetus mutation frequency had already started increasing from the theoretical zero at conception to a value that was about one-half the mid-adulthood (3-10 months) average. From 10 days to 3 months, mutation frequency increased significantly in liver (P = 0.007) and showed an increasing trend in cerebellum, forebrain, and thymus. From 3 to 10 months, there was no significant change in mutation frequency in any tissue examined. From 10 to 25 months, the mutation frequency increased significantly in liver (P < 10(-6)) and adipose tissue (P = 0.002), but not in the other tissues examined (cerebellum, forebrain, and male germline). It is of interest that the mutation frequency in the male germline is consistently the lowest, remaining essentially unchanged in old age. The spectrum of mutation types was unaltered with age, tissue type and gender, although, as previously reported, tandem GG-->TT mutations are tissue specific and show significant increases with age and certain hotspots (Buettner VL et al. [1999]: Environ Mol Mutagen 33:320-324; Hill KA et al. [2003]: Mutat Res 534:173-186). The spectrum of mutation types was generally the same for all tissue types, despite the tissue-specific increases in mutation frequency with age. These data provide a useful reference for future studies of endogenous and exogenous mutagenesis.
Collapse
Affiliation(s)
- Kathleen A Hill
- Department of Molecular Genetics, Beckman Research Institute/City of Hope, Duarte, California 91010-0269, USA
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Aitken RJ, Baker MA, Sawyer D. Oxidative stress in the male germ line and its role in the aetiology of male infertility and genetic disease. Reprod Biomed Online 2003; 7:65-70. [PMID: 12930576 DOI: 10.1016/s1472-6483(10)61730-0] [Citation(s) in RCA: 181] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The human male is characterized by extremely poor semen quality as reflected in the number, morphology and motility of the spermatozoa and a high incidence of nuclear and mitochondrial DNA damage. As a consequence of these factors, defective sperm function is thought to be a major contributor to the aetiology of human infertility, as well as childhood diseases including dominant genetic mutations such as achondroplasia and cancer. Factors associated with the origin of poor semen quality include: (i) a lack of selection pressure for high fecundity genes in developed countries, (ii) an evolutionary lineage associated with the deterioration of several male fertility genes in humans and their close ancestors, (iii) genetic factors including, but not limited to, Y-chromosome deletions (iv) paternal age and (v) environmental factors. A model is proposed whereby factors such as ageing or environmental toxicants initiate DNA strand breakage in the spermatozoa of affected males, eventually leading to a mutation in the embryo. This hypothesis stresses the importance of discovering the identity of those environmental factors that are capable of damaging DNA integrity in the male germ line. Such information could make an important contribution to understanding of the origins of both male infertility and a variety of pathological conditions that affect humans, including cancer and dominant genetic disease.
Collapse
Affiliation(s)
- R John Aitken
- ARC Centre of Excellence in Biotechnology and Development, Discipline of Biological Sciences, School of Environmental and Life Sciences, University of Newcastle, NSW 2308, Australia.
| | | | | |
Collapse
|