1
Zhang X, Hu X, Zhang T, Yang L, Liu C, Xu N, Wang H, Sun W. PLM_Sol: predicting protein solubility by benchmarking multiple protein language models with the updated Escherichia coli protein solubility dataset. Brief Bioinform 2024; 25:bbae404. [PMID: 39179250 PMCID: PMC11343611 DOI: 10.1093/bib/bbae404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2024] [Revised: 07/19/2024] [Accepted: 08/07/2024] [Indexed: 08/26/2024] Open
Abstract
Protein solubility plays a crucial role in various biotechnological, industrial, and biomedical applications. As sequencing and gene synthesis costs fall, high-throughput experimental screening coupled with tailored bioinformatic prediction is increasingly used to develop novel functional enzymes of interest (EOI). High protein solubility rates are essential in this process, and accurate prediction of solubility is a challenging task. As deep learning technology continues to evolve, attention-based protein language models (PLMs) can extract intrinsic information from protein sequences to a greater extent. Leveraging these models along with the increasing availability of protein solubility data inferred from structural databases such as the Protein Data Bank holds great potential to enhance the prediction of protein solubility. In this study, we curated an Updated Escherichia coli protein Solubility DataSet (UESolDS) and employed a combination of multiple PLMs and classification layers to predict protein solubility. The resulting best-performing model, named Protein Language Model-based protein Solubility prediction model (PLM_Sol), demonstrated significant improvements over previously reported models, achieving a 6.4% increase in accuracy, a 9.0% increase in F1_score, and an 11.1% increase in Matthews correlation coefficient score on the independent test set. Moreover, additional evaluation using our in-house synthesized protein resource as test data, encompassing diverse types of enzymes, also showed good performance of PLM_Sol. Overall, PLM_Sol exhibited consistent and promising performance across both the independent test set and the experimental set, making it well suited for facilitating large-scale EOI studies. PLM_Sol is available as a standalone program and as an easy-to-use model at https://zenodo.org/doi/10.5281/zenodo.10675340.
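The three metrics named in this abstract (accuracy, F1 score, and Matthews correlation coefficient) can all be derived from a binary confusion matrix. The following is a minimal sketch, assuming "soluble" is treated as the positive class; the function name `solubility_metrics` and the counts are hypothetical illustration values, not code or results from the paper.

```python
import math


def solubility_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Compute accuracy, F1 and MCC from binary confusion-matrix counts.

    Hypothetical helper for illustration; 'soluble' is assumed to be
    the positive class.
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total

    # Precision and recall feed the F1 score (their harmonic mean).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)

    # MCC uses all four cells, so it stays informative on imbalanced data.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {"accuracy": accuracy, "f1": f1, "mcc": mcc}


# Made-up counts for 100 test proteins (not data from the study).
metrics = solubility_metrics(tp=40, fp=10, tn=35, fn=15)
print(metrics)
```

Reporting MCC alongside accuracy is a common choice for solubility datasets because the soluble and insoluble classes are often unevenly represented.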
Affiliation(s)
- Xuechun Zhang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- University of Chinese Academy of Sciences, No. 1 Yanqihu East Rd, Huairou District, Beijing 101408, China
- Xiaoxuan Hu
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- University of Chinese Academy of Sciences, No. 1 Yanqihu East Rd, Huairou District, Beijing 101408, China
- Tongtong Zhang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- University of Chinese Academy of Sciences, No. 1 Yanqihu East Rd, Huairou District, Beijing 101408, China
- Ling Yang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- University of Chinese Academy of Sciences, No. 1 Yanqihu East Rd, Huairou District, Beijing 101408, China
- Chunhong Liu
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- University of Chinese Academy of Sciences, No. 1 Yanqihu East Rd, Huairou District, Beijing 101408, China
- Ning Xu
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- University of Chinese Academy of Sciences, No. 1 Yanqihu East Rd, Huairou District, Beijing 101408, China
- Haoyi Wang
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- University of Chinese Academy of Sciences, No. 1 Yanqihu East Rd, Huairou District, Beijing 101408, China
- Beijing Institute for Stem Cell and Regenerative Medicine, A 3 Datun Road, Chaoyang District, Beijing 100100, China
- Wen Sun
- Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Beijing Institute for Stem Cell and Regenerative Medicine, A 3 Datun Road, Chaoyang District, Beijing 100100, China
2
Spínola-Amilibia M, Araújo-Bazán L, de la Gándara Á, Berger JM, Arias-Palomo E. IS21 family transposase cleaved donor complex traps two right-handed superhelical crossings. Nat Commun 2023; 14:2335. [PMID: 37087515 PMCID: PMC10122671 DOI: 10.1038/s41467-023-38071-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/23/2022] [Accepted: 04/14/2023] [Indexed: 04/24/2023] Open
Abstract
Transposases are ubiquitous enzymes that catalyze DNA rearrangement events with broad impacts on gene expression, genome evolution, and the spread of drug-resistance in bacteria. Here, we use biochemical and structural approaches to define the molecular determinants by which IstA, a transposase present in the widespread IS21 family of mobile elements, catalyzes efficient DNA transposition. Solution studies show that IstA engages the transposon terminal sequences to form a high-molecular weight complex and promote DNA integration. A 3.4 Å resolution structure of the transposase bound to transposon ends corroborates our biochemical findings and reveals that IstA self-assembles into a highly intertwined tetramer that synapses two supercoiled terminal inverted repeats. The three-dimensional organization of the IstA•DNA cleaved donor complex reveals remarkable similarities with retroviral integrases and classic transposase systems, such as Tn7 and bacteriophage Mu, and provides insights into IS21 transposition.
Affiliation(s)
- Mercedes Spínola-Amilibia
- Department of Structural & Chemical Biology, Centro de Investigaciones Biológicas Margarita Salas, CSIC, Madrid, 28040, Spain
- Lidia Araújo-Bazán
- Department of Structural & Chemical Biology, Centro de Investigaciones Biológicas Margarita Salas, CSIC, Madrid, 28040, Spain
- Álvaro de la Gándara
- Department of Structural & Chemical Biology, Centro de Investigaciones Biológicas Margarita Salas, CSIC, Madrid, 28040, Spain
- James M Berger
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD, 21205, USA
- Ernesto Arias-Palomo
- Department of Structural & Chemical Biology, Centro de Investigaciones Biológicas Margarita Salas, CSIC, Madrid, 28040, Spain.
3
Intracellular common gardens reveal niche differentiation in transposable element community during bacterial adaptive evolution. The ISME Journal 2023; 17:297-308. [PMID: 36434281 PMCID: PMC9860058 DOI: 10.1038/s41396-022-01344-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/26/2022] [Revised: 11/08/2022] [Accepted: 11/10/2022] [Indexed: 11/26/2022]
Abstract
The distribution and abundance of transposable elements across the tree of life have significantly shaped the evolution of cellular organisms, but the underlying mechanisms shaping these ecological patterns remain elusive. Here we establish a "common garden" approach to study causal ecological interactions between a xenogeneic conditional lethal sacB gene and the community of transposable insertion sequences (ISs) in a multipartite prokaryote genome. Xenogeneic sacB of low, medium, or high GC content was individually inserted into three replicons of a model bacterium Sinorhizobium fredii, and exhibited replicon- and GC-dependent variation in genetic stability. This variation was largely attributable to multidimensional niche differentiation for IS community members. The transposition efficiency of major active ISs depended on the nucleoid-associated xenogeneic silencer MucR. Experimentally eliminating insertion activity of specific ISs by deleting MucR strongly demonstrated a dominant role of niche differentiation among ISs. This intracellular common garden approach in the experimental evolution context allows not only for evaluating genetic stability of natural and synthetic xenogeneic genes of different sequence signatures in host cells but also for tracking and testing causal relationships in unifying ecological principles in genome ecology.
4
Characterization of the specific DNA-binding properties of Tnp26, the transposase of insertion sequence IS26. J Biol Chem 2021; 297:101165. [PMID: 34487761 PMCID: PMC8477213 DOI: 10.1016/j.jbc.2021.101165] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/19/2021] [Revised: 08/31/2021] [Accepted: 09/01/2021] [Indexed: 11/21/2022] Open
Abstract
The bacterial insertion sequence (IS) IS26 mobilizes and disseminates antibiotic resistance genes. It differs from bacterial IS that have been studied to date as it exclusively forms cointegrates via either a copy-in (replicative) or a recently discovered targeted conservative mode. To investigate how the Tnp26 transposase recognizes the 14-bp terminal inverted repeats (TIRs) that bound the IS, amino acids in two domains in the N-terminal (amino acids M1-P56) region were replaced. These changes substantially reduced cointegration in both modes. Tnp26 was purified as a maltose-binding fusion protein and shown to bind specifically to dsDNA fragments that included an IS26 TIR. However, Tnp26 with an R49A or a W50A substitution in helix 3 of a predicted trihelical helix-turn-helix domain (amino acids I13-R53) or an F4A or F9A substitution replacing the conserved amino acids in a unique disordered N-terminal domain (amino acids M1-D12) did not bind. The N-terminal M1-P56 fragment also bound to the TIR but only at substantially higher concentrations, indicating that other parts of Tnp26 enhance the binding affinity. The binding site was confined to the internal part of the TIR, and a G to T nucleotide substitution in the TGT at positions 6 to 8 of the TIR that is conserved in most IS26 family members abolished binding of both Tnp26 (M1-M234) and Tnp26 M1-P56 fragment. These findings indicate that the helix-turn-helix and disordered domains of Tnp26 play a role in Tnp26-TIR complex formation. Both domains are conserved in all members of the IS26 family.
5
Yakovenko I, Agronin J, Smith LC, Oren M. Guardian of the Genome: An Alternative RAG/Transib Co-Evolution Hypothesis for the Origin of V(D)J Recombination. Front Immunol 2021; 12:709165. [PMID: 34394111 PMCID: PMC8355894 DOI: 10.3389/fimmu.2021.709165] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/13/2021] [Accepted: 07/05/2021] [Indexed: 11/13/2022] Open
Abstract
The appearance of adaptive immunity in jawed vertebrates is termed the immunological 'Big Bang' because of the short evolutionary time over which it developed. Underlying it is the recombination activating gene (RAG)-based V(D)J recombination system, which initiates the sequence diversification of the immunoglobulins and lymphocyte antigen receptors. It was convincingly argued that the RAG1 and RAG2 genes originated from a single transposon. The current dogma postulates that the V(D)J recombination system was established by the split of a primordial vertebrate immune receptor gene into V and J segments by a RAG1/2 transposon, in parallel with the domestication of the same transposable element in a separate genomic locus as the RAG recombinase. Here, based on a new interpretation of previously published data, we propose an alternative evolutionary hypothesis suggesting that two different elements, a RAG1/2 transposase and a Transib transposon invader with RSS-like terminal inverted repeats, co-evolved to work together, resulting in a functional recombination process. This hypothesis offers an alternative understanding of the acquisition of recombinase function by RAGs and the origin of the V(D)J system.
Affiliation(s)
- Iryna Yakovenko
- Department of Molecular Biology, Ariel University, Ariel, Israel
- Jacob Agronin
- Department of Biological Sciences, George Washington University, Washington, DC, United States
- L. Courtney Smith
- Department of Biological Sciences, George Washington University, Washington, DC, United States
- Matan Oren
- Department of Molecular Biology, Ariel University, Ariel, Israel
6
Kolenko P, Svoboda J, Černý J, Charnavets T, Schneider B. Structural variability of CG-rich DNA 18-mers accommodating double T-T mismatches. Acta Crystallogr D Struct Biol 2020; 76:1233-1243. [PMID: 33263329 PMCID: PMC7709200 DOI: 10.1107/s2059798320014151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/18/2020] [Accepted: 10/23/2020] [Indexed: 11/26/2022] Open
Abstract
Solution and crystal data are reported for DNA 18-mers with sequences related to those of bacterial noncoding single-stranded DNA segments called repetitive extragenic palindromes (REPs). Solution CD and melting data showed that the CG-rich, near-palindromic REPs from various bacterial species exhibit dynamic temperature-dependent and concentration-dependent equilibria, including architectures compatible with not only hairpins, which are expected to be biologically relevant, but also antiparallel duplexes and bimolecular tetraplexes. Three 18-mer oligonucleotides named Hpar-18 (PDB entry 6rou), Chom-18 (PDB entry 6ros) and its brominated variant Chom-18Br (PDB entry 6ror) crystallized as isomorphic right-handed A-like duplexes. The low-resolution crystal structures were solved with the help of experimental phases for Chom-18Br. The center of the duplexes is formed by two successive T-T noncanonical base pairs (mismatches). They do not deform the double-helical geometry. The presence of T-T mismatches prompted an analysis of the geometries of these and other noncanonical pairs in other DNA crystals in terms of their fit to the experimental electron densities (RSCC) and their geometric fit to the NtC (dinucleotide conformational) classes (https://dnatco.datmos.org/). Throughout this work, knowledge of the NtC classes was used to refine and validate the crystal structures, and to analyze the mismatches.
Affiliation(s)
- Petr Kolenko
- Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University in Prague, Brehova 7, 11519 Prague 1, Czech Republic
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Prumyslova 595, 252 50 Vestec, Czech Republic
- Jakub Svoboda
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Prumyslova 595, 252 50 Vestec, Czech Republic
- Jiří Černý
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Prumyslova 595, 252 50 Vestec, Czech Republic
- Tatsiana Charnavets
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Prumyslova 595, 252 50 Vestec, Czech Republic
- Bohdan Schneider
- Institute of Biotechnology of the Czech Academy of Sciences, BIOCEV, Prumyslova 595, 252 50 Vestec, Czech Republic
7
Structural Insights on Retroviral DNA Integration: Learning from Foamy Viruses. Viruses 2019; 11:v11090770. [PMID: 31443391 PMCID: PMC6784120 DOI: 10.3390/v11090770] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 08/06/2019] [Revised: 08/19/2019] [Accepted: 08/20/2019] [Indexed: 12/28/2022] Open
Abstract
Foamy viruses (FV) are retroviruses belonging to the Spumaretrovirinae subfamily. They are non-pathogenic viruses endemic in several mammalian hosts such as non-human primates, felines, bovines, and equines. Retroviral DNA integration is a mandatory step and constitutes a prime target for antiretroviral therapy. This activity, conserved among retroviruses and long terminal repeat (LTR) retrotransposons, involves a viral nucleoprotein complex called the intasome. In the last decade, a plethora of structural insights on retroviral DNA integration arose from the study of FV. Here, we review the biochemistry and the structural features of the FV integration apparatus and discuss the mechanism of action of strand transfer inhibitors.
8
Singer CM, Joy D, Jacobs DJ, Nesmelova IV. Rigidity and flexibility characteristics of DD[E/D]-transposases Mos1 and Sleeping Beauty. Proteins 2018; 87:313-325. [PMID: 30582767 DOI: 10.1002/prot.25653] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 07/30/2018] [Revised: 12/06/2018] [Accepted: 12/19/2018] [Indexed: 11/05/2022]
Abstract
DD[E/D]-transposases catalyze the multistep reaction of cut-and-paste DNA transposition. Structurally, several DD[E/D]-transposases have been characterized, revealing a multi-domain structure with the catalytic domain possessing the RNase H-like structural motif that brings three catalytic residues (D, D, and E or D) into close proximity for the catalysis. However, the dynamic behavior of DD[E/D]-transposases during transposition remains poorly understood. Here, we analyze the rigidity and flexibility characteristics of two representative DD[E/D]-transposases Mos1 and Sleeping Beauty (SB) using the minimal distance constraint model (mDCM). We find that the catalytic domain of both transposases is globally rigid, with the notable exception of the clamp loop being flexible in the DNA-unbound form. Within this globally rigid structure, the central β-sheet of the RNase H-like motif is much less rigid in comparison to its surrounding α-helices, forming a cage-like structure. The comparison of the original SB transposase to its hyperactive version SB100X reveals the region where the change in flexibility/rigidity correlates with increased activity. This region is found to be within the RNase H-like structural motif and comprise the loop leading from beta-strand B3 to helix H1, helices H1 and H2, which are located on the same side of the central beta-sheet, and the loop between helix H3 and beta-strand B5. We further identify the RKEN214-217DAVQ mutations of the set of hyperactive mutations within the catalytic domain of SB transposase to be the driving factor that induces change in residue-pair rigidity correlations within SB transposase. Given that a signature RNase H-like structural motif is found in DD[E/D]-transposases and, more broadly, in a large superfamily of polynucleotidyl transferases, our results are relevant to these proteins as well.
Affiliation(s)
- Christopher M Singer
- Department of Physics and Optical Science, University of North Carolina, Charlotte, North Carolina
- Diana Joy
- Department of Physics and Optical Science, University of North Carolina, Charlotte, North Carolina
- Donald J Jacobs
- Department of Physics and Optical Science, University of North Carolina, Charlotte, North Carolina
- Center for Biomedical Engineering, University of North Carolina, Charlotte, North Carolina
- Irina V Nesmelova
- Department of Physics and Optical Science, University of North Carolina, Charlotte, North Carolina
- Center for Biomedical Engineering, University of North Carolina, Charlotte, North Carolina
9
Konnova TA, Singer CM, Nesmelova IV. NMR solution structure of the RED subdomain of the Sleeping Beauty transposase. Protein Sci 2017; 26:1171-1181. [PMID: 28345263 DOI: 10.1002/pro.3167] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 01/31/2017] [Accepted: 03/22/2017] [Indexed: 12/22/2022]
Abstract
DNA transposons can be employed for stable gene transfer in vertebrates. The Sleeping Beauty (SB) DNA transposon has recently been adapted for human application and is being evaluated in clinical trials; however, its molecular mechanism is not clear. SB transposition is catalyzed by the transposase enzyme, which is a multi-domain protein containing the catalytic and the DNA-binding domains. The DNA-binding domain of the SB transposase contains two structurally independent subdomains, PAI and RED. Recently, the structures of the catalytic domain and the PAI subdomain have been determined; however, no structural information on the RED subdomain and its interactions with DNA has been available. Here, we used NMR spectroscopy to determine the solution structure of the RED subdomain and characterize its interactions with the transposon DNA.
Affiliation(s)
- Tatiana A Konnova
- Department of Physics and Optical Science, University of North Carolina, Charlotte, North Carolina, 28223
- Christopher M Singer
- Department of Physics and Optical Science, University of North Carolina, Charlotte, North Carolina, 28223
- Irina V Nesmelova
- Department of Physics and Optical Science, University of North Carolina, Charlotte, North Carolina, 28223
- Center for Biomedical Engineering and Science, University of North Carolina, Charlotte, North Carolina, 28223
10
Abstract
DNA transposons are defined segments of DNA that are able to move from one genomic location to another. Movement is facilitated by one or more proteins, called the transposase, typically encoded by the mobile element itself. Here, we first provide an overview of the classification of such mobile elements in a variety of organisms. From a mechanistic perspective, we have focused on one particular group of DNA transposons that encode a transposase with a DD(E/D) catalytic domain that is topologically similar to RNase H. For these, a number of three-dimensional structures of transpososomes (transposase-nucleic acid complexes) are available, and we use these to describe the basics of their mechanisms. The DD(E/D) group, in addition to being the largest and most common among all DNA transposases, is the one whose members have been used for a wide variety of genomic applications. Therefore, a second focus of the article is to provide a nonexhaustive overview of transposon applications. Although several non-transposon-based approaches to site-directed genome modifications have emerged in the past decade, transposon-based applications are highly relevant when integration specificity is not sought. In fact, for many applications, the almost-perfect randomness and high frequency of integration make transposon-based approaches indispensable.
Affiliation(s)
- Alison B. Hickman
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
- Fred Dyda
- Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, United States
11
Abstract
The integration of a DNA copy of the viral RNA genome into host chromatin is the defining step of retroviral replication. This enzymatic process is catalyzed by the virus-encoded integrase protein, which is conserved among retroviruses and LTR-retrotransposons. Retroviral integration proceeds via two integrase activities: 3'-processing of the viral DNA ends, followed by the strand transfer of the processed ends into host cell chromosomal DNA. Herein we review the molecular mechanism of retroviral DNA integration, with an emphasis on reaction chemistries and architectures of the nucleoprotein complexes involved. We additionally discuss the latest advances on anti-integrase drug development for the treatment of AIDS and the utility of integrating retroviral vectors in gene therapy applications.
Affiliation(s)
- Paul Lesbats
- Clare Hall Laboratories, The Francis Crick Institute, Blanche Lane, South Mimms, EN6 3LD, U.K.
- Alan N Engelman
- Department of Cancer Immunology and Virology, Dana-Farber Cancer Institute and Department of Medicine, Harvard Medical School, 450 Brookline Avenue, Boston, Massachusetts 02215, United States
- Peter Cherepanov
- Clare Hall Laboratories, The Francis Crick Institute, Blanche Lane, South Mimms, EN6 3LD, U.K.
- Imperial College London, St-Mary's Campus, Norfolk Place, London, W2 1PG, U.K.
12
Abstract
IS911 has provided a powerful model for studying the transposition of members of a large class of transposable element: the IS3 family of bacterial Insertion Sequences (IS). These transpose by a Copy-out-Paste-in mechanism in which a double-strand IS circle transposition intermediate is generated from the donor site by replication and proceeds to integrate into a suitable double strand DNA target. This is perhaps one of the most common transposition mechanisms known to date. Copy-out-Paste-in transposition has been adopted by members of at least eight large IS families. This chapter details the different steps of the Copy-out-Paste-in mechanism involved in IS911 transposition. At a more biological level it also describes various aspects of regulation of the transposition process. These include transposase production by programmed translational frameshifting, transposase expression from the circular intermediate using a specialized promoter assembled at the circle junction and binding of the nascent transposase while it remains attached to the ribosome during translation (co-translational binding). This co-translational binding of the transposase to neighboring IS ends provides an explanation for the longstanding observation that transposases show a cis-preference for their activities.
13
Yutin N, Shevchenko S, Kapitonov V, Krupovic M, Koonin EV. A novel group of diverse Polinton-like viruses discovered by metagenome analysis. BMC Biol 2015; 13:95. [PMID: 26560305 PMCID: PMC4642659 DOI: 10.1186/s12915-015-0207-4] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Received: 08/14/2015] [Accepted: 10/28/2015] [Indexed: 01/08/2023] Open
Abstract
Background: The rapidly growing metagenomic databases provide increasing opportunities for computational discovery of new groups of organisms. Identification of new viruses is particularly straightforward given the comparatively small size of viral genomes, although fast evolution of viruses complicates the analysis of novel sequences. Here we report the metagenomic discovery of a distinct group of diverse viruses that are distantly related to the eukaryotic virus-like transposons of the Polinton superfamily.
Results: The sequence of the putative major capsid protein (MCP) of the unusual linear virophage associated with Phaeocystis globosa virus (PgVV) was used as a bait to identify potential related viruses in metagenomic databases. Assembly of the contigs encoding the PgVV MCP homologs followed by comprehensive sequence analysis of the proteins encoded in these contigs resulted in the identification of a large group of Polinton-like viruses (PLV) that resemble Polintons (polintoviruses) and virophages in genome size, and share with them a conserved minimal morphogenetic module that consists of major and minor capsid proteins and the packaging ATPase. With a single exception, the PLV lack the retrovirus-type integrase that is encoded in the genomes of all Polintons and the Mavirus group of virophages. However, some PLV encode a newly identified tyrosine recombinase-integrase that is common in bacteria and bacteriophages and is also found in the Organic Lake virophage group. Although several PLV genomes and individual genes are integrated into algal genomes, it appears likely that most of the PLV are viruses. Given the absence of protease and retrovirus-type integrase, the PLV could resemble the ancestral polintoviruses that evolved from bacterial tectiviruses. Apart from the conserved minimal morphogenetic module, the PLV widely differ in their genome complements but share a gene network with Polintons and virophages, suggestive of multiple gene exchanges within a shared gene pool.
Conclusions: The discovery of PLV substantially expands the emerging class of eukaryotic viruses and transposons that also includes Polintons and virophages. This class of selfish elements is extremely widespread and might have been a hotbed of eukaryotic virus, transposon and plasmid evolution. New families of these elements are expected to be discovered.
Electronic supplementary material: The online version of this article (doi:10.1186/s12915-015-0207-4) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Natalya Yutin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
- Sofiya Shevchenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Vladimir Kapitonov
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Mart Krupovic
- Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Institut Pasteur, Paris, France
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
14
|
Henssen AG, Henaff E, Jiang E, Eisenberg AR, Carson JR, Villasante CM, Ray M, Still E, Burns M, Gandara J, Feschotte C, Mason CE, Kentsis A. Genomic DNA transposition induced by human PGBD5. eLife 2015; 4. [PMID: 26406119 PMCID: PMC4625184 DOI: 10.7554/elife.10565] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Received: 08/03/2015] [Accepted: 09/23/2015] [Indexed: 11/13/2022] Open
Abstract
Transposons are mobile genetic elements that are found in nearly all organisms, including humans. Mobilization of DNA transposons by transposase enzymes can cause genomic rearrangements, but our knowledge of human genes derived from transposases is limited. In this study, we find that the protein encoded by human PGBD5, the most evolutionarily conserved transposable element-derived gene in vertebrates, can induce stereotypical cut-and-paste DNA transposition in human cells. Genomic integration activity of PGBD5 requires distinct aspartic acid residues in its transposase domain, and specific DNA sequences containing inverted terminal repeats with similarity to piggyBac transposons. DNA transposition catalyzed by PGBD5 in human cells occurs genome-wide, with precise transposon excision and preference for insertion at TTAA sites. The apparent conservation of DNA transposition activity by PGBD5 suggests that genomic remodeling contributes to its biological function.
Affiliation(s)
- Anton G Henssen
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
- Elizabeth Henaff
  - Institute for Computational Biomedicine, Weill Cornell Medical College, New York, United States
- Eileen Jiang
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
- Amy R Eisenberg
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
- Julianne R Carson
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
- Camila M Villasante
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
- Mondira Ray
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
- Eric Still
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
- Melissa Burns
  - Boston Children's Hospital, Harvard Medical School, Boston, United States
- Jorge Gandara
  - Institute for Computational Biomedicine, Weill Cornell Medical College, New York, United States
- Cedric Feschotte
  - Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, United States
- Christopher E Mason
  - Institute for Computational Biomedicine, Weill Cornell Medical College, New York, United States
- Alex Kentsis
  - Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States; Department of Pediatrics, Memorial Sloan Kettering Cancer Center, New York, United States; Weill Cornell Medical College, Cornell University, New York, United States
|
15
|
Transposase interaction with the β sliding clamp: effects on insertion sequence proliferation and transposition rate. Sci Rep 2015; 5:13329. [PMID: 26306550 PMCID: PMC4549789 DOI: 10.1038/srep13329] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Received: 05/28/2015] [Accepted: 07/23/2015] [Indexed: 01/05/2023] Open
Abstract
Insertion sequences (ISs) are ubiquitous and abundant mobile genetic elements in prokaryotic genomes. ISs often encode only one protein, the transposase, which catalyzes their transposition. Recent studies have shown that transposases of many different IS families interact with the β sliding clamp, a DNA replication factor of the host. However, it was unclear to what extent this interaction limits or favors the ability of ISs to colonize a chromosome from a phylogenetically-distant organism, or if the strength of this interaction affects the transposition rate. Here we describe the proliferation of a member of the IS1634 family in Acidiphilium over ~600 generations of cultured growth. We demonstrate that the purified transposase binds to the β sliding clamp of Acidiphilium, Leptospirillum and E. coli. Further, we also demonstrate that the Acidiphilium IS1634 transposase binds to the archaeal sliding clamp (PCNA) from Methanosarcina, and that the transposase encoded by Methanosarcina IS1634 binds to Acidiphilium β. Finally, we demonstrate that increasing the strength of the interaction between β and transposase results in a higher transposition rate in vivo. Our results suggest that the interaction could determine the potential of ISs to be mobilized in bacterial populations and also their ability to proliferate within chromosomes.
|
16
|
Arias-Palomo E, Berger JM. An Atypical AAA+ ATPase Assembly Controls Efficient Transposition through DNA Remodeling and Transposase Recruitment. Cell 2015; 162:860-71. [PMID: 26276634 PMCID: PMC4537775 DOI: 10.1016/j.cell.2015.07.037] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Received: 02/14/2015] [Revised: 04/21/2015] [Accepted: 06/24/2015] [Indexed: 01/27/2023]
Abstract
Transposons are ubiquitous genetic elements that drive genome rearrangements, evolution, and the spread of infectious disease and drug-resistance. Many transposons, such as Mu, Tn7, and IS21, require regulatory AAA+ ATPases for function. We use X-ray crystallography and cryo-electron microscopy to show that the ATPase subunit of IS21, IstB, assembles into a clamshell-shaped decamer that sandwiches DNA between two helical pentamers of ATP-associated AAA+ domains, sharply bending the duplex into a 180° U-turn. Biochemical studies corroborate key features of the structure and further show that the IS21 transposase, IstA, recognizes the IstB•DNA complex and promotes its disassembly by stimulating ATP hydrolysis. Collectively, these studies reveal a distinct manner of higher-order assembly and client engagement by a AAA+ ATPase and suggest a mechanistic model where IstB binding and subsequent DNA bending primes a selected insertion site for efficient transposition.
Affiliation(s)
- Ernesto Arias-Palomo
  - Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
- James M Berger
  - Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA
|
17
|
Insertion Sequence IS26 Reorganizes Plasmids in Clinically Isolated Multidrug-Resistant Bacteria by Replicative Transposition. mBio 2015; 6:e00762. [PMID: 26060276 PMCID: PMC4471558 DOI: 10.1128/mbio.00762-15] [Citation(s) in RCA: 227] [Impact Index Per Article: 25.2] [Indexed: 11/25/2022] Open
Abstract
Carbapenemase-producing Enterobacteriaceae (CPE), which are resistant to most or all known antibiotics, constitute a global threat to public health. Transposable elements are often associated with antibiotic resistance determinants, suggesting a role in the emergence of resistance. One insertion sequence, IS26, is frequently associated with resistance determinants, but its role remains unclear. We have analyzed the genomic contexts of 70 IS26 copies in several clinical and surveillance CPE isolates from the National Institutes of Health Clinical Center. We used target site duplications and their patterns as guides and found that a large fraction of plasmid reorganizations result from IS26 replicative transpositions, including replicon fusions, DNA inversions, and deletions. Replicative transposition could also be inferred for transposon Tn4401, which harbors the carbapenemase blaKPC gene. Thus, replicative transposition is important in the ongoing reorganization of plasmids carrying multidrug-resistant determinants, an observation that carries substantial clinical and epidemiological implications for understanding how such extreme drug resistance phenotypes evolve. Although IS26 is frequently reported to reside in resistance plasmids of clinical isolates, the characteristic hallmark of transposition, target site duplication (TSD), is generally not observed, raising questions about the mode of transposition for IS26. The previous observation of cointegrate formation during transposition implies that IS26 transposes via a replicative mechanism. The other possible outcome of replicative transposition is DNA inversion or deletion, when transposition occurs intramolecularly, and this would also generate a specific TSD pattern that might also serve as supporting evidence for the transposition mechanism. 
The numerous examples we present here demonstrate that replicative transposition, used by many mobile elements (including IS26 and Tn4401), is prevalent in the plasmids of clinical isolates and results in significant plasmid reorganization. This study also provides a method to trace the evolution of resistance plasmids based on TSD patterns.
|
18
|
Dyda F, Hickman AB. Mechanism of spacer integration links the CRISPR/Cas system to transposition as a form of mobile DNA. Mob DNA 2015; 6:9. [PMID: 27408625 PMCID: PMC4940900 DOI: 10.1186/s13100-015-0039-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 04/07/2015] [Accepted: 04/16/2015] [Indexed: 11/12/2022] Open
Abstract
It has recently become clear that many bacterial and archaeal species possess adaptive immune systems. These are typified by multiple copies of DNA sequences known as clustered regularly interspaced short palindromic repeats (CRISPRs). These CRISPR repeats are the sites at which short spacers containing sequences of previously encountered foreign DNA are integrated, and the spacers serve as the molecular memory of previous invaders. In vivo work has demonstrated that two CRISPR-associated proteins - Cas1 and Cas2 - are required for spacer integration, but the mechanism by which this is accomplished remained unclear. Here we review a recent paper describing the in vitro reconstitution of CRISPR spacer integration using purified Cas1 and Cas2 and place the results in context of similar DNA transposition reactions and the crystal structure of the Cas1/Cas2 complex.
Affiliation(s)
- Fred Dyda
  - Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 5 Center Dr., Bethesda, MD 20892, USA
- Alison B Hickman
  - Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 5 Center Dr., Bethesda, MD 20892, USA
|
19
|
Majumdar S, Rio DC. P Transposable Elements in Drosophila and other Eukaryotic Organisms. Microbiol Spectr 2015; 3:MDNA3-0004-2014. [PMID: 26104714 PMCID: PMC4399808 DOI: 10.1128/microbiolspec.mdna3-0004-2014] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Received: 02/28/2014] [Indexed: 11/20/2022] Open
Abstract
P transposable elements were discovered in Drosophila as the causative agents of a syndrome of genetic traits called hybrid dysgenesis. Hybrid dysgenesis exhibits a unique pattern of maternal inheritance linked to the germline-specific small RNA piwi-interacting (piRNA) pathway. The use of P transposable elements as vectors for gene transfer and as genetic tools revolutionized the field of Drosophila molecular genetics. P element transposons have served as a useful model to investigate mechanisms of cut-and-paste transposition in eukaryotes. Biochemical studies have revealed new and unexpected insights into how eukaryotic DNA-based transposons are mobilized. For example, the P element transposase makes unusual 17nt-3' extended double-strand DNA breaks at the transposon termini and uses guanosine triphosphate (GTP) as a cofactor to promote synapsis of the two transposon ends early in the transposition pathway. The N-terminal DNA binding domain of the P element transposase, called a THAP domain, contains a C2CH zinc-coordinating motif and is the founding member of a large family of animal-specific site-specific DNA binding proteins. Over the past decade genome sequencing efforts have revealed the presence of P element-like transposable elements or P element transposase-like genes (called THAP9) in many eukaryotic genomes, including vertebrates, such as primates including humans, zebrafish and Xenopus, as well as the human parasite Trichomonas vaginalis, the sea squirt Ciona, sea urchin and hydra. Surprisingly, the human and zebrafish P element transposase-related THAP9 genes promote transposition of the Drosophila P element transposon DNA in human and Drosophila cells, indicating that the THAP9 genes encode active P element "transposase" proteins.
Affiliation(s)
- Donald C. Rio
  - Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720-3204
|
20
|
Abstract
DNA transposases use a limited repertoire of structurally and mechanistically distinct nuclease domains to catalyze the DNA strand breaking and rejoining reactions that comprise DNA transposition. Here, we review the mechanisms of the four known types of transposition reactions catalyzed by (1) RNase H-like transposases (also known as DD(E/D) enzymes); (2) HUH single-stranded DNA transposases; (3) serine transposases; and (4) tyrosine transposases. The large body of accumulated biochemical and structural data, particularly for the RNase H-like transposases, has revealed not only the distinguishing features of each transposon family, but also some emerging themes that appear conserved across all families. The more-recently characterized single-stranded DNA transposases provide insight into how an ancient HUH domain fold has been adapted for transposition to accomplish excision and then site-specific integration. The serine and tyrosine transposases are structurally and mechanistically related to their cousins, the serine and tyrosine site-specific recombinases, but have to date been less intensively studied. These types of enzymes are particularly intriguing as in the context of site-specific recombination they require strict homology between recombining sites, yet for transposition can catalyze the joining of transposon ends to form an excised circle and then integration into a genomic site with much relaxed sequence specificity.
Affiliation(s)
- Alison B Hickman
  - Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 5 Center Dr., Bethesda, MD 20892, USA
- Fred Dyda
  - Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 5 Center Dr., Bethesda, MD 20892, USA
|
21
|
Gómez MJ, Díaz-Maldonado H, González-Tortuero E, López de Saro FJ. Chromosomal replication dynamics and interaction with the β sliding clamp determine orientation of bacterial transposable elements. Genome Biol Evol 2014; 6:727-40. [PMID: 24614824 PMCID: PMC3971601 DOI: 10.1093/gbe/evu052] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Indexed: 12/03/2022] Open
Abstract
Insertion sequences (ISs) are small transposable elements widespread in bacterial genomes, where they play an essential role in chromosome evolution by stimulating recombination and genetic flow. Despite their ubiquity, it is unclear how ISs interact with the host. Here, we report a survey of the orientation patterns of ISs in bacterial chromosomes with the objective of gaining insight into the interplay between ISs and host chromosomal functions. We find that a significant fraction of IS families present a consistent and family-specific orientation bias with respect to chromosomal DNA replication, especially in Firmicutes. Additionally, we find that the transposases of up to nine different IS families with different transposition pathways interact with the β sliding clamp, an essential replication factor, suggesting that this is a widespread mechanism of interaction with the host. Although we find evidence that the interaction with the β sliding clamp is common to all bacterial phyla, it also could explain the observed strong orientation bias found in Firmicutes, because in this group β is asymmetrically distributed during synthesis of the leading or lagging strands. Besides the interaction with the β sliding clamp, other asymmetries also play a role in the biased orientation of some IS families. The utilization of the highly conserved replication sliding clamps suggests a mechanism for host regulation of IS proliferation and also a universal platform for IS dispersal and transmission within bacterial populations and among phylogenetically distant species.
Affiliation(s)
- Manuel J Gómez
  - Department of Molecular Evolution, Centro de Astrobiología (INTA-CSIC), Madrid, Spain
|
22
|
The Tn7 transposition regulator TnsC interacts with the transposase subunit TnsB and target selector TnsD. Proc Natl Acad Sci U S A 2014; 111:E2858-65. [PMID: 24982178 DOI: 10.1073/pnas.1409869111] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Indexed: 11/18/2022] Open
Abstract
The excision of transposon Tn7 from a donor site and its insertion into its preferred target site, attachment site attTn7, is mediated by four Tn7-encoded transposition proteins: TnsA, TnsB, TnsC, and TnsD. Transposition requires the assembly of a nucleoprotein complex containing all four Tns proteins and the DNA substrates, the donor site containing Tn7, and the preferred target site attTn7. TnsA and TnsB together form the heteromeric Tn7 transposase, and TnsD is a target-selecting protein that binds specifically to attTn7. TnsC is the key regulator of transposition, interacting with both the TnsAB transposase and TnsD-attTn7. We show here that TnsC interacts directly with TnsB, and identify the specific region of TnsC involved in the TnsB-TnsC interaction during transposition. We also show that a TnsC mutant defective in interaction with TnsB is defective for Tn7 transposition both in vitro and in vivo. Tn7 displays cis-acting target immunity, which blocks Tn7 insertion into a target DNA that already contains Tn7. We provide evidence that the direct TnsB-TnsC interaction that we have identified also mediates cis-acting Tn7 target immunity. We also show that TnsC interacts directly with the target selector protein TnsD.
|
23
|
Siguier P, Gourbeyre E, Chandler M. Bacterial insertion sequences: their genomic impact and diversity. FEMS Microbiol Rev 2014; 38:865-91. [PMID: 24499397 PMCID: PMC7190074 DOI: 10.1111/1574-6976.12067] [Citation(s) in RCA: 394] [Impact Index Per Article: 39.4] [Received: 10/30/2013] [Revised: 01/19/2014] [Accepted: 01/22/2014] [Indexed: 01/06/2023] Open
Abstract
Insertion sequences (ISs), arguably the smallest and most numerous autonomous transposable elements (TEs), are important players in shaping their host genomes. This review focuses on prokaryotic ISs. We discuss IS distribution and impact on genome evolution. We also examine their effects on gene expression, especially their role in activating neighbouring genes, a phenomenon of particular importance in the recent upsurge of bacterial antibiotic resistance. We explain how ISs are identified and classified into families by a combination of characteristics including their transposases (Tpases), their overall genetic organisation and the accessory genes which some ISs carry. We then describe the organisation of autonomous and nonautonomous IS‐related elements. This is used to illustrate the growing recognition that the boundaries between different types of mobile element are becoming increasingly difficult to define as more are being identified. We review the known Tpase types, their different catalytic activities used in cleaving and rejoining DNA strands during transposition, their organisation into functional domains and the role of this in regulation. Finally, we consider examples of prokaryotic IS domestication. In a more speculative section, we discuss the necessity of constructing more quantitative dynamic models to fully appreciate the continuing impact of TEs on prokaryotic populations.
Affiliation(s)
- Patricia Siguier
  - Laboratoire de Microbiologie et Génétique Moléculaires, Unité Mixte de Recherche 5100, Centre National de Recherche Scientifique, Toulouse Cedex, France
|
24
|
Lineage-specific expansions of TET/JBP genes and a new class of DNA transposons shape fungal genomic and epigenetic landscapes. Proc Natl Acad Sci U S A 2014; 111:1676-83. [PMID: 24398522 DOI: 10.1073/pnas.1321818111] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Indexed: 01/01/2023] Open
Abstract
TET/JBP dioxygenases oxidize methylpyrimidines in nucleic acids and are implicated in generation of epigenetic marks and potential intermediates for DNA demethylation. We show that TET/JBP genes are lineage-specifically expanded in all major clades of basidiomycete fungi, with the majority of copies predicted to encode catalytically active proteins. This pattern differs starkly from the situation in most other organisms that possess just a single or a few copies of the TET/JBP family. In most basidiomycetes, TET/JBP genes are frequently linked to a unique class of transposons, KDZ (Kyakuja, Dileera, and Zisupton) and appear to have dispersed across chromosomes along with them. Several of these elements typically encode additional proteins, including a divergent version of the HMG domain. Analysis of their transposases shows that they contain a previously uncharacterized version of the RNase H fold with multiple distinctive Zn-chelating motifs and a unique insert, which are predicted to play roles in structural stabilization and target sequence recognition, respectively. We reconstruct the complex evolutionary history of TET/JBPs and associated transposons as involving multiple rounds of expansion with concomitant lineage sorting and loss, along with several capture events of TET/JBP genes by different transposon clades. On a few occasions, these TET/JBP genes were also laterally transferred to certain Ascomycota, Glomeromycota, Viridiplantae, and Amoebozoa. One such is an inactive version, calnexin-independence factor 1 (Cif1), from Schizosaccharomyces pombe, which has been implicated in inducing an epigenetically transmitted prion state. We argue that this unique transposon-TET/JBP association is likely to play important roles in speciation during evolution and epigenetic regulation.
|
25
|
Vogt A, Mochizuki K. A domesticated PiggyBac transposase interacts with heterochromatin and catalyzes reproducible DNA elimination in Tetrahymena. PLoS Genet 2013; 9:e1004032. [PMID: 24348275 PMCID: PMC3861120 DOI: 10.1371/journal.pgen.1004032] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Received: 06/03/2013] [Accepted: 10/31/2013] [Indexed: 12/20/2022] Open
Abstract
The somatic genome of the ciliated protist Tetrahymena undergoes DNA elimination of defined sequences called internal eliminated sequences (IESs), which account for ~30% of the germline genome. During DNA elimination, IES regions are heterochromatinized and assembled into heterochromatin bodies in the developing somatic nucleus. The domesticated piggyBac transposase Tpb2p is essential for the formation of heterochromatin bodies and DNA elimination. In this study, we demonstrate that the activities of Tpb2p involved in forming heterochromatin bodies and executing DNA elimination are genetically separable. The cysteine-rich domain of Tpb2p, which interacts with the heterochromatin-specific histone modifications, is necessary for both heterochromatin body formation and DNA elimination, whereas the endonuclease activity of Tpb2p is only necessary for DNA elimination. Furthermore, we demonstrate that the endonuclease activity of Tpb2p in vitro and the endonuclease activity that executes DNA elimination in vivo have similar substrate sequence preferences. These results strongly indicate that Tpb2p is the endonuclease that directly catalyzes the excision of IESs and that the boundaries of IESs are at least partially determined by the combination of Tpb2p-heterochromatin interaction and relaxed sequence preference of the endonuclease activity of Tpb2p.
Affiliation(s)
- Alexander Vogt
  - Institute of Molecular Biotechnology of the Austrian Academy of Sciences (IMBA), Vienna, Austria
- Kazufumi Mochizuki
  - Institute of Molecular Biotechnology of the Austrian Academy of Sciences (IMBA), Vienna, Austria
|
26
|
Boocock MR, Rice PA. A proposed mechanism for IS607-family serine transposases. Mob DNA 2013; 4:24. [PMID: 24195768 PMCID: PMC4058570 DOI: 10.1186/1759-8753-4-24] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Received: 06/22/2013] [Accepted: 10/07/2013] [Indexed: 01/26/2023] Open
Abstract
Background: The transposases encoded by the IS607 family of mobile elements are unusual serine recombinases with an inverted domain order and minimal specificity for target DNA.
Results: Structural genomics groups have determined three crystal structures of the catalytic domains of IS607 family transposases. The dimers formed by these catalytic domains are very different from those seen for other serine recombinases and include interactions that usually only occur upon formation of a synaptic tetramer.
Conclusions: Based on these structures, we propose a model for how IS607-family transposases could form a synaptic tetramer. The model suggests that, unlike other serine recombinases, these enzymes carry out sequence-specific DNA binding and catalysis in trans: the DNA binding and catalytic domains of each subunit are proposed to interact with different DNA duplexes. The model also suggests an explanation for the minimal target DNA specificity.
Affiliation(s)
- Phoebe A Rice
  - Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL 60637, USA
|
27
|
Abstract
The transposon piggyBac is being used increasingly for genetic studies. Here, we describe modified versions of piggyBac transposase that have potentially wide-ranging applications, such as reversible transgenesis and modified targeting of insertions. piggyBac is distinguished by its ability to excise precisely, restoring the donor site to its pretransposon state. This characteristic makes piggyBac useful for reversible transgenesis, a potentially valuable feature when generating induced pluripotent stem cells without permanent alterations to genomic sequence. To avoid further genome modification following piggyBac excision by reintegration, we generated an excision competent/integration defective (Exc(+)Int(-)) transposase. Our findings also suggest the position of a target DNA-transposase interaction. Another goal of genome engineering is to develop reagents that can guide transgenes to preferred genomic regions. Others have shown that piggyBac transposase can be active when fused to a heterologous DNA-binding domain. An Exc(+)Int(-) transposase, the intrinsic targeting of which is defective, might also be a useful intermediate in generating a transposase whose integration activity could be rescued and redirected by fusion to a site-specific DNA-binding domain. We show that fusion to two designed zinc finger proteins rescued the Int(-) phenotype. Successful guided transgene integration into genomic DNA would have broad applications to gene therapy and molecular genetics. Thus, an Exc(+)Int(-) transposase is a potentially useful reagent for genome engineering and provides insight into the mechanism of transposase-target DNA interaction.
|