1
|
Steele EJ, Franklin A, Lindley RA. Somatic mutation patterns at Ig and Non-Ig Loci. DNA Repair (Amst) 2024; 133:103607. [PMID: 38056368 DOI: 10.1016/j.dnarep.2023.103607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 11/17/2023] [Accepted: 11/21/2023] [Indexed: 12/08/2023]
Abstract
The reverse transcriptase (RT) model of immunoglobulin (Ig) somatic hypermutation (SHM) has received insufficient scientific attention. This is understandable given that DNA deamination mediated by activation-induced deaminase (AID), the initiating step of Ig SHM, has dominated experiments since 2002. We summarise some key history of the RT Ig SHM model dating to 1987. For example, it is now established that DNA polymerase η, the sole DNA repair polymerase involved in post-replication short-patch repair, is an efficient cellular RT. This implies that it is potentially able to initiate target site reverse transcription by RNA-directed DNA repair at AID-induced lesions. Recently, DNA polymerase θ has also been shown to be an efficient cellular RT. Since DNA polymerase θ plays no significant role in Ig SHM, it could serve a similar RNA-dependent DNA polymerase role as DNA polymerase η at non-Ig loci in the putative RNA-templated nucleotide excision repair of bulky adducts and other mutagenic lesions on the transcribed strand. A major yet still poorly recognised consequence of the proposed RT process in Ig SHM is the generation of significant and characteristic strand-biased mutation signatures at both deoxyadenosine/deoxythymidine and deoxyguanosine/deoxycytidine base pairs. In this historical perspective, we highlight how diagnostic strand-biased mutation signatures are detected in vivo during SHM at both Ig loci in germinal centre B lymphocytes and non-Ig loci in cancer genomes. These strand-biased signatures have been significantly obscured by technical issues created by improper use of the polymerase chain reaction technique. A heightened awareness of this fact should contribute to better data interpretation and somatic mutation pattern recognition both at Ig and non-Ig loci.
Collapse
Affiliation(s)
- Edward J Steele
- Melville Analytics Pty Ltd, 2/102 Duke St, Kangaroo Point, Brisbane 4169, Qld, Australia.
| | - Andrew Franklin
- Novartis Pharmaceuticals UK Limited, The WestWorks Building, White City Place, 195 Wood Lane, W12 7FQ London, United Kingdom
| | - Robyn A Lindley
- GMDxgenomics, Suite 201, 697 Burke Rd, Camberwell, Melbourne 3124, Vic, Australia; Department of Clinical Pathology, Faculty of Medicine, Dentistry & Health Sciences, University of Melbourne, Vic, Australia
| |
Collapse
|
2
|
Verbiest M, Maksimov M, Jin Y, Anisimova M, Gymrek M, Bilgin Sonay T. Mutation and selection processes regulating short tandem repeats give rise to genetic and phenotypic diversity across species. J Evol Biol 2023; 36:321-336. [PMID: 36289560 PMCID: PMC9990875 DOI: 10.1111/jeb.14106] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 06/29/2022] [Accepted: 08/01/2022] [Indexed: 02/03/2023]
Abstract
Short tandem repeats (STRs) are units of 1-6 bp that repeat in a tandem fashion in DNA. Along with single nucleotide polymorphisms and large structural variations, they are among the major genomic variants underlying genetic, and likely phenotypic, divergence. STRs experience mutation rates that are orders of magnitude higher than other well-studied genotypic variants. Frequent copy number changes result in a wide range of alleles, and provide unique opportunities for modulating complex phenotypes through variation in repeat length. While classical studies have identified key roles of individual STR loci, the advent of improved sequencing technology, high-quality genome assemblies for diverse species, and bioinformatics methods for genome-wide STR analysis now enable more systematic study of STR variation across wide evolutionary ranges. In this review, we explore mutation and selection processes that affect STR copy number evolution, and how these processes give rise to varying STR patterns both within and across species. Finally, we review recent examples of functional and adaptive changes linked to STRs.
Collapse
Affiliation(s)
- Max Verbiest
- Institute of Computational Life Sciences, School of Life Sciences and Facility ManagementZürich University of Applied SciencesWädenswilSwitzerland
- Department of Molecular Life SciencesUniversity of ZurichZurichSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
| | - Mikhail Maksimov
- Department of Computer Science & EngineeringUniversity of California San DiegoLa JollaCaliforniaUSA
- Department of MedicineUniversity of California San DiegoLa JollaCaliforniaUSA
| | - Ye Jin
- Department of MedicineUniversity of California San DiegoLa JollaCaliforniaUSA
- Department of BioengineeringUniversity of California San DiegoLa JollaCaliforniaUSA
| | - Maria Anisimova
- Institute of Computational Life Sciences, School of Life Sciences and Facility ManagementZürich University of Applied SciencesWädenswilSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
| | - Melissa Gymrek
- Department of Computer Science & EngineeringUniversity of California San DiegoLa JollaCaliforniaUSA
- Department of MedicineUniversity of California San DiegoLa JollaCaliforniaUSA
| | - Tugce Bilgin Sonay
- Institute of Ecology, Evolution and Environmental BiologyColumbia UniversityNew YorkNew YorkUSA
| |
Collapse
|
3
|
Adenovirus DNA Polymerase Loses Fidelity on a Stretch of Eleven Homocytidines during Pre-GMP Vaccine Preparation. Vaccines (Basel) 2022; 10:vaccines10060960. [PMID: 35746566 PMCID: PMC9227658 DOI: 10.3390/vaccines10060960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 05/09/2022] [Accepted: 06/07/2022] [Indexed: 11/21/2022] Open
Abstract
In this study, we invented and construct novel candidate HIV-1 vaccines. Through genetic and protein engineering, we unknowingly constructed an HIV-1-derived transgene with a homopolymeric run of 11 cytidines, which was inserted into an adenovirus vaccine vector. Here, we describe the virus rescue, three rounds of clonal purification and preparation of good manufacturing practise (GMP) starting material assessed for genetic stability in five additional virus passages. Throughout these steps, quality control assays indicated the presence of the transgene in the virus genome, expression of the correct transgene product and immunogenicity in mice. However, DNA sequencing of the transgene revealed additional cytidines inserted into the original 11-cytidine region, and the GMP manufacture had to be aborted. Subsequent analyses indicated that as little as 1/25th of the virus dose used for confirmation of protein expression (106 cells at a multiplicity of infection of 10) and murine immunogenicity (108 infectious units per animal) met the quality acceptance criteria. Similar frameshifts in the expressed proteins were reproduced in a one-reaction in vitro transcription/translation employing phage T7 polymerase and E. coli ribosomes. Thus, the most likely mechanism for addition of extra cytidines into the ChAdOx1.tHIVconsv6 genome is that the adenovirus DNA polymerase lost its fidelity on a stretch of 11 cytidines, which informs future adenovirus vaccine designs.
Collapse
|
4
|
Franklin A, Steele EJ. RNA-directed DNA repair and antibody somatic hypermutation. Trends Genet 2021; 38:426-436. [PMID: 34740453 DOI: 10.1016/j.tig.2021.10.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 10/03/2021] [Accepted: 10/04/2021] [Indexed: 10/19/2022]
Abstract
Somatic hypermutation at antibody loci affects both deoxyadenosine-deoxythymidine (A/T) and deoxycytidine-deoxyguanosine (C/G) pairs. Deamination of C to deoxyuridine (U) by activation-induced deaminase (AID) explains how mutation at C/G pairs is potentiated. Mutation at A/T pairs is triggered during the initial stages of repair of AID-generated U lesions and occurs through an as yet unknown mechanism in which polymerase η has a major role. Recent evidence confirms that human polymerase η can act as a reverse transcriptase. Here, we compare the popular suggestion of mutation at A/T pairs through nucleotide mispairing (owing to polymerase error) during short-patch repair synthesis with the alternative proposal of mutation at A/T pairs through RNA editing and RNA-directed DNA repair.
Collapse
Affiliation(s)
- Andrew Franklin
- Novartis Pharma AG, Novartis Campus, 4056, Basel, Switzerland.
| | | |
Collapse
|
5
|
APOBECs orchestrate genomic and epigenomic editing across health and disease. Trends Genet 2021; 37:1028-1043. [PMID: 34353635 DOI: 10.1016/j.tig.2021.07.003] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 07/03/2021] [Accepted: 07/05/2021] [Indexed: 12/17/2022]
Abstract
APOBEC proteins can deaminate cytosine residues in DNA and RNA. This can lead to somatic mutations, DNA breaks, RNA modifications, or DNA demethylation in a selective manner. APOBECs function in various cellular compartments and recognize different nucleic acid motifs and structures. They orchestrate a wide array of genomic and epigenomic modifications, thereby affecting various cellular functions positively or negatively, including immune editing, viral and retroelement restriction, DNA damage responses, DNA demethylation, gene expression, and tissue homeostasis. Furthermore, the cumulative increase in genomic and epigenomic editing with aging could also, at least in part, be attributed to APOBEC function. We synthesize our cumulative understanding of APOBEC activity in a unifying overview and discuss their genomic and epigenomic impact in physiological, pathological, and technological contexts.
Collapse
|
6
|
Mamrot J, Hall NE, Lindley RA. Predicting clinical outcomes using cancer progression associated signatures. Oncotarget 2021; 12:845-858. [PMID: 33889305 PMCID: PMC8057277 DOI: 10.18632/oncotarget.27934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 03/22/2021] [Indexed: 12/09/2022] Open
Abstract
Somatic mutation signatures are an informative facet of cancer aetiology, however they are rarely useful for predicting patient outcome. The aim of this study is to evaluate the utility of a panel of 142 mutation-signature–associated metrics (P142) for predicting cancer progression in patients from a ‘TCGA PanCancer Atlas’ cohort. The P142 metrics are comprised of AID/APOBEC and ADAR deaminase associated SNVs analyzed for codon context, strand bias, and transitions/transversions. TCGA tumor-normal mutation data was obtained for 10,437 patients, representing 31 of the most prevalent forms of cancer. Stratified random sampling was used to split patients into training, tuning and validation cohorts for each cancer type. Cancer specific machine learning (XGBoost) models were built using the output from the P142 panel to predict patient Progression Free Survival (PFS) status as either “High PFS” or “Low PFS”. Predictive performance of each model was evaluated using the validation cohort. Models accurately predicted PFS status for several cancer types, including adrenocortical carcinoma, glioma, mesothelioma, and sarcoma. In conclusion, the P142 panel of metrics successfully predicted cancer progression status in patients with some, but not all cancer types analyzed. These results pave the way for future studies on cancer progression associated signatures.
Collapse
Affiliation(s)
- Jared Mamrot
- GMDx Group Ltd, Melbourne, Victoria, Australia.,Department of Obstetrics and Gynaecology, Monash University, Clayton, VIC, Australia
| | | | - Robyn A Lindley
- GMDx Group Ltd, Melbourne, Victoria, Australia.,Department of Clinical Pathology, The Victorian Comprehensive Cancer Centre, Faculty of Medicine, Dentistry & Health Sciences, University of Melbourne, VIC, Australia
| |
Collapse
|
7
|
Zamai L. Unveiling Human Non-Random Genome Editing Mechanisms Activated in Response to Chronic Environmental Changes: I. Where Might These Mechanisms Come from and What Might They Have Led To? Cells 2020; 9:E2362. [PMID: 33121045 PMCID: PMC7693803 DOI: 10.3390/cells9112362] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 10/18/2020] [Accepted: 10/19/2020] [Indexed: 12/13/2022] Open
Abstract
This article challenges the notion of the randomness of mutations in eukaryotic cells by unveiling stress-induced human non-random genome editing mechanisms. To account for the existence of such mechanisms, I have developed molecular concepts of the cell environment and cell environmental stressors and, making use of a large quantity of published data, hypothesised the origin of some crucial biological leaps along the evolutionary path of life on Earth under the pressure of natural selection, in particular, (1) virus-cell mating as a primordial form of sexual recombination and symbiosis; (2) Lamarckian CRISPR-Cas systems; (3) eukaryotic gene development; (4) antiviral activity of retrotransposon-guided mutagenic enzymes; and finally, (5) the exaptation of antiviral mutagenic mechanisms to stress-induced genome editing mechanisms directed at "hyper-transcribed" endogenous genes. Genes transcribed at their maximum rate (hyper-transcribed), yet still unable to meet new chronic environmental demands generated by "pollution", are inadequate and generate more and more intronic retrotransposon transcripts. In this scenario, RNA-guided mutagenic enzymes (e.g., Apolipoprotein B mRNA editing catalytic polypeptide-like enzymes, APOBECs), which have been shown to bind to retrotransposon RNA-repetitive sequences, would be surgically targeted by intronic retrotransposons on opened chromatin regions of the same "hyper-transcribed" genes. RNA-guided mutagenic enzymes may therefore "Lamarkianly" generate single nucleotide polymorphisms (SNP) and gene copy number variations (CNV), as well as transposon transposition and chromosomal translocations in the restricted areas of hyper-functional and inadequate genes, leaving intact the rest of the genome. CNV and SNP of hyper-transcribed genes may allow cells to surgically explore a new fitness scenario, which increases their adaptability to stressful environmental conditions. Like the mechanisms of immunoglobulin somatic hypermutation, non-random genome editing mechanisms may generate several cell mutants, and those codifying for the most environmentally adequate proteins would have a survival advantage and would therefore be Darwinianly selected. Non-random genome editing mechanisms represent tools of evolvability leading to organismal adaptation including transgenerational non-Mendelian gene transmission or to death of environmentally inadequate genomes. They are a link between environmental changes and biological novelty and plasticity, finally providing a molecular basis to reconcile gene-centred and "ecological" views of evolution.
Collapse
Affiliation(s)
- Loris Zamai
- Department of Biomolecular Sciences, University of Urbino Carlo Bo, 61029 Urbino, Italy; ; Tel./Fax: +39-0722-304-319
- National Institute for Nuclear Physics (INFN)-Gran Sasso National Laboratory (LNGS), 67100 Assergi, L’Aquila, Italy
| |
Collapse
|
8
|
Steele EJ, Gorczynski RM, Lindley RA, Liu Y, Temple R, Tokoro G, Wickramasinghe DT, Wickramasinghe NC. The efficient Lamarckian spread of life in the cosmos. ADVANCES IN GENETICS 2020; 106:21-43. [PMID: 33081924 PMCID: PMC7340397 DOI: 10.1016/bs.adgen.2020.03.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In this Chapter we discuss the various mechanisms that are available for the possible transfer of cosmic microbial living systems from one cosmic habitat to another. With the 100 or so habitable planets that are now known to exist in our galaxy alone transfers of cometary dust carrying life including fragments of icy planetoids/asteroids would be expected to occur on a routine basis. It is thus easy to view the galaxy as a single connected "biosphere" of which our planet Earth is a minor component. The Hoyle-Wickramasinghe Panspermia paradigm provides a cogent biological rationale for the actual widespread existence of Lamarckian modes of inheritance in terrestrial systems (which we review here). Thus the Panspermia paradigm provides the raison d'etre for Lamarckian Inheritance. Under a terrestrially confined neoDarwinian viewpoint such an association may have been thought spurious in the past. Our aim here is to outline the main evidence for rapid terrestrial-based Lamarckian-based evolutionary hypermutation processes dependent on reverse transcription-coupled mechanisms among others. Such rapid adaptation mechanisms would be consistent with the effective cosmic spread of living systems. For example, a viable, or cryo-preserved, living system traveling through space in a protective matrix will of necessity need to adapt rapidly and proliferate on landing in a new cosmic niche. Lamarckian mechanisms thus come to the fore and supersede the slow (blind and random) genetic processes expected under neoDarwinian Earth centred theories.
Collapse
Affiliation(s)
- Edward J Steele
- C.Y.O'Connor ERADE Village Foundation, Piara Waters, Perth, WA, Australia; Centre for Astrobiology, University of Ruhuna, Matara, Sri Lanka; Melville Analytics Pty Ltd, Melbourne, VIC, Australia.
| | | | - Robyn A Lindley
- Department of Clinical Pathology, Faculty of Medicine, Dentistry & Health Sciences, University of Melbourne, Melbourne, VIC, Australia; GMDx Group Ltd, Melbourne, VIC, Australia
| | - Yongsheng Liu
- Henan Collaborative Innovation Center of Modern Biological Breeding, Henan Institute of Science and Technology, Xinxiang, China
| | - Robert Temple
- The History of Chinese Science and Culture Foundation, Conway Hall, London, United Kingdom
| | - Gensuke Tokoro
- Institute for the Study of Panspermia and Astroeconomics, Gifu, Japan
| | - Dayal T Wickramasinghe
- College of Physical and Mathematical Sciences, Australian National University, Canberra, ACT, Australia
| | - N Chandra Wickramasinghe
- Centre for Astrobiology, University of Ruhuna, Matara, Sri Lanka; Institute for the Study of Panspermia and Astroeconomics, Gifu, Japan; University of Buckingham, Buckingham, United Kingdom; National Institute of Fundamental Studies, Kandy, Sri Lanka.
| |
Collapse
|
9
|
Blood and saliva-derived exomes from healthy Caucasian subjects do not display overt evidence of somatic mosaicism. Mutat Res 2020; 821:111705. [PMID: 32569906 DOI: 10.1016/j.mrfmmm.2020.111705] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2020] [Revised: 04/07/2020] [Accepted: 04/27/2020] [Indexed: 02/07/2023]
Abstract
Somatic mosaicism is a normal occurrence during development in the tissues and organs. As part of establishing a "healthy population "(HP) background or base-line, we investigated whether such mosaicism can be routinely detected in the circulating DNA secured from a rigorously designed healthy human liquid biopsy clinical trial (saliva, blood). We deployed next generation (NG) whole exome sequencing (WES) at median exome coverage rates of 97.2 % (-to-30x) and 70.0 % (-to-100x). We found that somatic mosaicism is not detectable by such standard bulk WES sequencing assays in saliva and blood DNA in 24 normal healthy Caucasians of both sexes from 18 to 60 years of age. We conclude that for circulating DNA using standard WES no novel somatic mutational variants can be detected in protein-coding regions of normal healthy subjects. This implies that the extent within normal tissues of somatic mosaicism must be at a lower level, below the detection threshold, for these circulating DNA WES read depths.
Collapse
|