1
|
Miklík D, Grim J, Elleder D, Hejnar J. Unraveling the palindromic and nonpalindromic motifs of retroviral integration site sequences by statistical mixture models. Genome Res 2023; 33:1395-1408. [PMID: 37463751 PMCID: PMC10547254 DOI: 10.1101/gr.277694.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 07/12/2023] [Indexed: 07/20/2023]
Abstract
A weak palindromic nucleotide motif is the hallmark of retroviral integration site alignments. Given that the majority of target sequences are not palindromic, the current model explains the symmetry by an overlap of the nonpalindromic motif present on one of the half-sites of the sequences. Here, we show that the implementation of multicomponent mixture models allows for different interpretations consistent with the existence of both palindromic and nonpalindromic submotifs in the sets of integration site sequences. We further show that the weak palindromic motifs result from freely combined site-specific submotifs restricted to only a few positions proximal to the site of integration. The submotifs are formed by either palindrome-forming nucleotide preference or nucleotide exclusion. Using the mixture models, we also identify HIV-1-favored palindromic sequences in Alu repeats serving as local hotspots for integration. The application of the novel statistical approach provides deeper insight into the selection of retroviral integration sites and may prove to be a valuable tool in the analysis of any type of DNA motifs.
Collapse
Affiliation(s)
- Dalibor Miklík
- Laboratory of Viral and Cellular Genetics, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague 4, 142 20, Czech Republic
| | - Jiří Grim
- Pattern Recognition Department, Institute of Information Theory and Automation of the Czech Academy of Sciences, Prague 8, 182 08, Czech Republic
| | - Daniel Elleder
- Laboratory of Viral and Cellular Genetics, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague 4, 142 20, Czech Republic
| | - Jiří Hejnar
- Laboratory of Viral and Cellular Genetics, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague 4, 142 20, Czech Republic;
| |
Collapse
|
2
|
HIV-1 Preintegration Complex Preferentially Integrates the Viral DNA into Nucleosomes Containing Trimethylated Histone 3-Lysine 36 Modification and Flanking Linker DNA. J Virol 2022; 96:e0101122. [PMID: 36094316 PMCID: PMC9517705 DOI: 10.1128/jvi.01011-22] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
HIV-1 DNA is preferentially integrated into chromosomal hot spots by the preintegration complex (PIC). To understand the mechanism, we measured the DNA integration activity of PICs-extracted from infected cells-and intasomes, biochemically assembled PIC substructures using a number of relevant target substrates. We observed that PIC-mediated integration into human chromatin is preferred compared to genomic DNA. Surprisingly, nucleosomes lacking histone modifications were not preferred integration compared to the analogous naked DNA. Nucleosomes containing the trimethylated histone 3 lysine 36 (H3K36me3), an epigenetic mark linked to active transcription, significantly stimulated integration, but the levels remained lower than the naked DNA. Notably, H3K36me3-modified nucleosomes with linker DNA optimally supported integration mediated by the PIC but not by the intasome. Interestingly, optimal intasome-mediated integration required the cellular cofactor LEDGF. Unexpectedly, LEDGF minimally affected PIC-mediated integration into naked DNA but blocked integration into nucleosomes. The block for the PIC-mediated integration was significantly relieved by H3K36me3 modification. Mapping the integration sites in the preferred substrates revealed that specific features of the nucleosome-bound DNA are preferred for integration, whereas integration into naked DNA was random. Finally, biochemical and genetic studies demonstrate that DNA condensation by the H1 protein dramatically reduces integration, providing further evidence that features inherent to the open chromatin are preferred for HIV-1 integration. Collectively, these results identify the optimal target substrate for HIV-1 integration, report a mechanistic link between H3K36me3 and integration preference, and importantly, reveal distinct mechanisms utilized by the PIC for integration compared to the intasomes. IMPORTANCE HIV-1 infection is dependent on integration of the viral DNA into the host chromosomes. The preintegration complex (PIC) containing the viral DNA, the virally encoded integrase (IN) enzyme, and other viral/host factors carries out HIV-1 integration. HIV-1 integration is not dependent on the target DNA sequence, and yet the viral DNA is selectively inserted into specific "hot spots" of human chromosomes. A growing body of literature indicates that structural features of the human chromatin are important for integration targeting. However, the mechanisms that guide the PIC and enable insertion of the PIC-associated viral DNA into specific hot spots of the human chromosomes are not fully understood. In this study, we describe a biochemical mechanism for the preference of the HIV-1 DNA integration into open chromatin. Furthermore, our study defines a direct role for the histone epigenetic mark H3K36me3 in HIV-1 integration preference and identify an optimal substrate for HIV-1 PIC-mediated viral DNA integration.
Collapse
|
3
|
Hejnar J, Ruml T. The Current View of Retroviruses as Seen from the Shoulders of a Giant. Viruses 2019; 11:v11090828. [PMID: 31491994 PMCID: PMC6784152 DOI: 10.3390/v11090828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Accepted: 09/03/2019] [Indexed: 11/16/2022] Open
Abstract
It has now been more than two years since we said our last goodbye to Jan Svoboda (14 [...].
Collapse
Affiliation(s)
- Jiří Hejnar
- Department of Viral and Cellular Genetics, Institute of Molecular Genetics of the Czech Academy of Sciences, Videnska 1083, CZ-14220 Prague, Czech Republic.
| | - Tomáš Ruml
- Department of Biochemistry and Microbiology, University of Chemistry and Technology, CZ-166 28 Prague, Czech Republic.
| |
Collapse
|
4
|
D Urbano V, De Crignis E, Re MC. Host Restriction Factors and Human Immunodeficiency Virus (HIV-1): A Dynamic Interplay Involving All Phases of the Viral Life Cycle. Curr HIV Res 2019; 16:184-207. [PMID: 30117396 DOI: 10.2174/1570162x16666180817115830] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2018] [Revised: 07/31/2018] [Accepted: 08/09/2018] [Indexed: 02/08/2023]
Abstract
Mammalian cells have evolved several mechanisms to prevent or block lentiviral infection and spread. Among the innate immune mechanisms, the signaling cascade triggered by type I interferon (IFN) plays a pivotal role in limiting the burden of HIV-1. In the presence of IFN, human cells upregulate the expression of a number of genes, referred to as IFN-stimulated genes (ISGs), many of them acting as antiviral restriction factors (RFs). RFs are dominant proteins that target different essential steps of the viral cycle, thereby providing an early line of defense against the virus. The identification and characterization of RFs have provided unique insights into the molecular biology of HIV-1, further revealing the complex host-pathogen interplay that characterizes the infection. The presence of RFs drove viral evolution, forcing the virus to develop specific proteins to counteract their activity. The knowledge of the mechanisms that prevent viral infection and their viral counterparts may offer new insights to improve current antiviral strategies. This review provides an overview of the RFs targeting HIV-1 replication and the mechanisms that regulate their expression as well as their impact on viral replication and the clinical course of the disease.
Collapse
Affiliation(s)
- Vanessa D Urbano
- Retrovirus Laboratory, Operative Unit of Clinical Microbiology, S. Orsola-Malpighi University Hospital, Bologna, Italy
| | - Elisa De Crignis
- Retrovirus Laboratory, Operative Unit of Clinical Microbiology, S. Orsola-Malpighi University Hospital, Bologna, Italy
| | - Maria Carla Re
- Retrovirus Laboratory, Operative Unit of Clinical Microbiology, S. Orsola-Malpighi University Hospital, Bologna, Italy
| |
Collapse
|
5
|
Proviruses with Long-Term Stable Expression Accumulate in Transcriptionally Active Chromatin Close to the Gene Regulatory Elements: Comparison of ASLV-, HIV- and MLV-Derived Vectors. Viruses 2018. [PMID: 29517993 PMCID: PMC5869509 DOI: 10.3390/v10030116] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Individual groups of retroviruses and retroviral vectors differ in their integration site preference and interaction with the host genome. Hence, immediately after infection genome-wide distribution of integrated proviruses is non-random. During long-term in vitro or persistent in vivo infection, the genomic position and chromatin environment of the provirus affects its transcriptional activity. Thus, a selection of long-term stably expressed proviruses and elimination of proviruses, which have been gradually silenced by epigenetic mechanisms, helps in the identification of genomic compartments permissive for proviral transcription. We compare here the extent and time course of provirus silencing in single cell clones of the K562 human myeloid lymphoblastoma cell line that have been infected with retroviral reporter vectors derived from avian sarcoma/leukosis virus (ASLV), human immunodeficiency virus type 1 (HIV) and murine leukaemia virus (MLV). While MLV proviruses remain transcriptionally active, ASLV proviruses are prone to rapid silencing. The HIV provirus displays gradual silencing only after an extended time period in culture. The analysis of integration sites of long-term stably expressed proviruses shows a strong bias for some genomic features-especially integration close to the transcription start sites of active transcription units. Furthermore, complex analysis of histone modifications enriched at the site of integration points to the accumulation of proviruses of all three groups in gene regulatory segments, particularly close to the enhancer loci. We conclude that the proximity to active regulatory chromatin segments correlates with stable provirus expression in various retroviral species.
Collapse
|
6
|
Šenigl F, Miklík D, Auxt M, Hejnar J. Accumulation of long-term transcriptionally active integrated retroviral vectors in active promoters and enhancers. Nucleic Acids Res 2018; 45:12752-12765. [PMID: 29244184 PMCID: PMC5727404 DOI: 10.1093/nar/gkx889] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Accepted: 09/26/2017] [Indexed: 01/13/2023] Open
Abstract
Most retroviruses preferentially integrate into certain genomic locations and, as a result, their genome-wide integration patterns are non-random. We investigate the epigenetic landscape of integrated retroviral vectors and correlate it with the long-term stability of proviral transcription. Retroviral vectors derived from the avian sarcoma/leukosis virus expressing the GFP reporter were used to transduce the human myeloid lymphoblastoma cell line K562. Because of efficient silencing of avian retrovirus in mammalian cells, only ∼3% of established clones displayed stable proviral expression. We analyzed the vector integration sites in non-selected cells and in clones selected for the GFP expression. This selection led to overrepresentation of proviruses integrated in active transcription units, with particular accumulation in promoter-proximal areas. In parallel, we investigated the integration of vectors equipped with an anti-silencing CpG island core sequence. Such modification increased the frequency of stably expressing proviruses by one order. The modified vectors are also overrepresented in active transcription units, but stably expressed in distal parts of transcriptional units further away from promoters with marked accumulation in enhancers. These results suggest that integrated retroviruses subject to gradual epigenetic silencing during long-term cultivation. Among most genomic compartments, however, active promoters and enhancers protect the adjacent retroviruses from transcriptional silencing.
Collapse
Affiliation(s)
- Filip Šenigl
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Videnska 1083, CZ-14220 Prague 4, Czech Republic
| | - Dalibor Miklík
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Videnska 1083, CZ-14220 Prague 4, Czech Republic
| | - Miroslav Auxt
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Videnska 1083, CZ-14220 Prague 4, Czech Republic
| | - Jirí Hejnar
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Videnska 1083, CZ-14220 Prague 4, Czech Republic
| |
Collapse
|
7
|
The Multifaceted Contributions of Chromatin to HIV-1 Integration, Transcription, and Latency. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2016; 328:197-252. [PMID: 28069134 DOI: 10.1016/bs.ircmb.2016.08.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The capacity of the human immunodeficiency virus (HIV-1) to establish latent infections constitutes a major barrier to the development of a cure for HIV-1. In latent infection, replication competent HIV-1 provirus is integrated within the host genome but remains silent, masking the infected cells from the activity of the host immune response. Despite the progress in elucidating the molecular players that regulate HIV-1 gene expression, the mechanisms driving the establishment and maintenance of latency are still not fully understood. Transcription from the HIV-1 genome occurs in the context of chromatin and is subjected to the same regulatory mechanisms that drive cellular gene expression. Much like in eukaryotic genes, the nucleosomal landscape of the HIV-1 promoter and its position within genomic chromatin are determinants of its transcriptional activity. Understanding the multilayered chromatin-mediated mechanisms that underpin HIV-1 integration and expression is of utmost importance for the development of therapeutic strategies aimed at reducing the pool of latently infected cells. In this review, we discuss the impact of chromatin structure on viral integration, transcriptional regulation and latency, and the host factors that influence HIV-1 replication by regulating chromatin organization. Finally, we describe therapeutic strategies under development to target the chromatin-HIV-1 interplay.
Collapse
|
8
|
Integration of HIV in the Human Genome: Which Sites Are Preferential? A Genetic and Statistical Assessment. Int J Genomics 2016; 2016:2168590. [PMID: 27294106 PMCID: PMC4880676 DOI: 10.1155/2016/2168590] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Accepted: 04/24/2016] [Indexed: 12/17/2022] Open
Abstract
Chromosomal fragile sites (FSs) are loci where gaps and breaks may occur and are preferential integration targets for some viruses, for example, Hepatitis B, Epstein-Barr virus, HPV16, HPV18, and MLV vectors. However, the integration of the human immunodeficiency virus (HIV) in Giemsa bands and in FSs is not yet completely clear. This study aimed to assess the integration preferences of HIV in FSs and in Giemsa bands using an in silico study. HIV integration positions from Jurkat cells were used and two nonparametric tests were applied to compare HIV integration in dark versus light bands and in FS versus non-FS (NFSs). The results show that light bands are preferential targets for integration of HIV-1 in Jurkat cells and also that it integrates with equal intensity in FSs and in NFSs. The data indicates that HIV displays different preferences for FSs compared to other viruses. The aim was to develop and apply an approach to predict the conditions and constraints of HIV insertion in the human genome which seems to adequately complement empirical data.
Collapse
|
9
|
Abstract
The persistence of human immunodeficiency virus type 1 (HIV-1) in latent reservoirs is a major barrier to HIV cure. Reservoir establishment depends on low viral expression that may be related to provirus integration sites (IS). In vitro, in cell lines and primary T cells, latency is associated with specific IS through reduced viral expression mediated by transcriptional interference by host cellular promoters, reverse orientation, and the presence of specific epigenetic modifiers. In primary T cell models of latency, specific IS are associated with intracellular viral antigen expression that is not directly related to cell activation. In contrast, in patient CD4+ T cells, there is enrichment for IS in genes controlling cell cycle and survival and in some clonally expanded T cell subpopulations. Multiple insertion sites within some specific genes may suggest that integrated HIV can increase the host’s T cell survival.
Collapse
Affiliation(s)
- Simin D. Rezaei
- Faculty of Medicine, Dentistry and Health Sciences, Department of Microbiology and Immunology, Doherty Institute for Infection and Immunity, The University of Melbourne, 4th Floor, 786-798 Elizabeth St, Melbourne, 3010 Australia
| | - Paul U. Cameron
- Faculty of Medicine, Dentistry and Health Sciences, Department of Microbiology and Immunology, Doherty Institute for Infection and Immunity, The University of Melbourne, 4th Floor, 786-798 Elizabeth St, Melbourne, 3010 Australia
- Infectious Diseases Unit, Department of Infectious Diseases, Alfred Hospital, 85 Commercial Rd, Melbourne, Victoria 3004 Australia
| |
Collapse
|
10
|
Cell cycle status of CD34(+) hemopoietic stem cells determines lentiviral integration in actively transcribed and development-related genes. Mol Ther 2014; 23:683-96. [PMID: 25523760 DOI: 10.1038/mt.2014.246] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2013] [Accepted: 12/09/2014] [Indexed: 01/03/2023] Open
Abstract
Gene therapy utilizing lentiviral-vectors (LVs) is postulated as a dynamic therapeutic alternative for monogenic diseases. However, retroviral gene transfer may cause insertional mutagenesis. Although, such risks had been originally estimated as extremely low, several reports of leukemias or clonal dominance, have led to a re-evaluation of the mechanisms operating in insertional mutagenesis. Therefore, unraveling the mechanism of retroviral integration is mandatory toward safer gene therapy applications. In the present study, we undertook an experimental approach which enabled direct correlation of the cell cycle stage of the target cell with the integration profile of LVs. CD34(+) cells arrested at different stages of cell cycle, were transduced with a GFP-LV. LAM-PCR was employed for integration site detection, followed by microarray analysis to correlate transcribed genes with integration sites. The results indicate that ~10% of integration events occurred in actively transcribed genes and that the cell cycle stage of target cells affects integration pattern. Specifically, use of thymine promoted a safer profile, since it significantly reduced integration within cell cycle-related genes, while we observed increased possibility for integration into genes related to development, and decreased possibility for integration within cell cycle and cancer-related genes, when transduction occurs during mitosis.
Collapse
|
11
|
Šenigl F, Auxt M, Hejnar J. Transcriptional provirus silencing as a crosstalk of de novo DNA methylation and epigenomic features at the integration site. Nucleic Acids Res 2012; 40:5298-312. [PMID: 22379139 PMCID: PMC3384319 DOI: 10.1093/nar/gks197] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Revised: 02/12/2012] [Accepted: 02/13/2012] [Indexed: 12/03/2022] Open
Abstract
The autonomous transcription of integrated retroviruses strongly depends on genetic and epigenetic effects of the chromatin at the site of integration. These effects are mostly suppressive and proviral activity can be finally silenced by mechanisms, such as DNA methylation and histone modifications. To address the role of the integration site at the whole-genome-scale, we performed clonal analysis of provirus silencing with an avian leucosis/sarcoma virus-based reporter vector and correlated the transcriptional silencing with the epigenomic landscape of respective integrations. We demonstrate efficient provirus silencing in human HCT116 cell line, which is strongly but not absolutely dependent on the de novo DNA methyltransferase activity, particularly of Dnmt3b. Proviruses integrated close to the transcription start sites of active genes into the regions enriched in H3K4 trimethylation display long-term stability of expression and are resistant to the transcriptional silencing after over-expression of Dnmt3a or Dnmt3b. In contrast, proviruses in the intergenic regions tend to spontaneous transcriptional silencing even in Dnmt3a(-/-) Dnmt3b(-/-) cells. The silencing of proviruses within genes is accompanied with DNA methylation of long terminal repeats, whereas silencing in intergenic regions is DNA methylation-independent. These findings indicate that the epigenomic features of integration sites are crucial for their permissivity to the proviral expression.
Collapse
Affiliation(s)
- Filip Šenigl
- Department of Cellular and Viral Genetics, Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Vídeňská 1083, 14220 Prague, Czech Republic
| | | | - Jiří Hejnar
- Department of Cellular and Viral Genetics, Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Vídeňská 1083, 14220 Prague, Czech Republic
| |
Collapse
|
12
|
Polymorphic integrations of an endogenous gammaretrovirus in the mule deer genome. J Virol 2011; 86:2787-96. [PMID: 22190723 DOI: 10.1128/jvi.06859-11] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Endogenous retroviruses constitute a significant genomic fraction in all mammalian species. Typically they are evolutionarily old and fixed in the host species population. Here we report on a novel endogenous gammaretrovirus (CrERVγ; for cervid endogenous gammaretrovirus) in the mule deer (Odocoileus hemionus) that is insertionally polymorphic among individuals from the same geographical location, suggesting that it has a more recent evolutionary origin. Using PCR-based methods, we identified seven CrERVγ proviruses and demonstrated that they show various levels of insertional polymorphism in mule deer individuals. One CrERVγ provirus was detected in all mule deer sampled but was absent from white-tailed deer, indicating that this virus originally integrated after the split of the two species, which occurred approximately one million years ago. There are, on average, 100 CrERVγ copies in the mule deer genome based on quantitative PCR analysis. A CrERVγ provirus was sequenced and contained intact open reading frames (ORFs) for three virus genes. Transcripts were identified covering the entire provirus. CrERVγ forms a distinct branch of the gammaretrovirus phylogeny, with the closest relatives of CrERVγ being endogenous gammaretroviruses from sheep and pig. We demonstrated that white-tailed deer (Odocoileus virginianus) and elk (Cervus canadensis) DNA contain proviruses that are closely related to mule deer CrERVγ in a conserved region of pol; more distantly related sequences can be identified in the genome of another member of the Cervidae, the muntjac (Muntiacus muntjak). The discovery of a novel transcriptionally active and insertionally polymorphic retrovirus in mammals could provide a useful model system to study the dynamic interaction between the host genome and an invading retrovirus.
Collapse
|
13
|
Knyazhanskaya ES, Kondrashina OV, Gottikh MB. Approaches to site-directed DNA integration based on transposases and retroviral integrases. Mol Biol 2011. [DOI: 10.1134/s0026893311060069] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
14
|
Costantini M, Auletta F, Bernardi G. The distributions of "new" and "old" Alu sequences in the human genome: the solution of a "mystery". Mol Biol Evol 2011; 29:421-7. [PMID: 22057813 DOI: 10.1093/molbev/msr242] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The distribution in the human genome of the largest family of mobile elements, the Alu sequences, has been investigated for the past 30 years, and the vast majority of Alu sequences were shown to have the highest density in GC-rich isochores. Ten years ago, it was discovered, however, that the small "youngest" (most recently transposed) Alu families had a strikingly different distribution compared with the "old" families. This raised the question as to how this change took place in evolution. We solved what was considered to be a "mystery" by 1) revisiting our previous results on the integration and stability of retroviral sequences, and 2) assessing the densities of acceptor sites TTTT/AA in isochore families. We could conclude 1) that the open state of chromatin structure plays a crucial role in allowing not only the initial integration of retroviral sequences but also that of the youngest Alu sequences, and 2) that the distribution of old Alus can be explained as due to Alu sequences being unstable in the GC-poor isochores but stable in the compositionally matching GC-rich isochores, again in line with what happens in the case of retroviral sequences.
Collapse
Affiliation(s)
- Maria Costantini
- Laboratory of Cellular and Developmental Biology, Stazione Zoologica Anton Dohrn, Naples, Italy
| | | | | |
Collapse
|
15
|
Proviruses selected for high and stable expression of transduced genes accumulate in broadly transcribed genome areas. J Virol 2010; 84:4204-11. [PMID: 20147411 DOI: 10.1128/jvi.02511-09] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Retroviruses and retrovirus-derived vectors integrate nonrandomly into the genomes of host cells with specific preferences for transcribed genes, gene-rich regions, and CpG islands. However, the genomic features that influence the transcriptional activities of integrated retroviruses or retroviral vectors are poorly understood. We report here the cloning and characterization of avian sarcoma virus integration sites from chicken tumors. Growing progressively, dependent on high and stable expression of the transduced v-src oncogene, these tumors represent clonal expansions of cells bearing transcriptionally active replication-defective proviruses. Therefore, integration sites in our study distinguished genomic loci favorable for the expression of integrated retroviruses and gene transfer vectors. Analysis of integration sites from avian sarcoma virus-induced tumors showed strikingly nonrandom distribution, with proviruses found prevalently within or close to transcription units, particularly in genes broadly expressed in multiple tissues but not in tissue-specifically expressed genes. We infer that proviruses integrated in these genomic areas efficiently avoid transcriptional silencing and remain active for a long time during the growth of tumors. Defining the differences between unselected retroviral integration sites and sites selected for long-terminal-repeat-driven gene expression is relevant for retrovirus-mediated gene transfer and has ramifications for gene therapy.
Collapse
|
16
|
Costantini M, Bernardi G. Mapping insertions, deletions and SNPs on Venter's chromosomes. PLoS One 2009; 4:e5972. [PMID: 19543403 PMCID: PMC2696090 DOI: 10.1371/journal.pone.0005972] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2009] [Accepted: 05/19/2009] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND The very recent availability of fully sequenced individual human genomes is a major revolution in biology which is certainly going to provide new insights into genetic diseases and genomic rearrangements. RESULTS We mapped the insertions, deletions and SNPs (single nucleotide polymorphisms) that are present in Craig Venter's genome, more precisely on chromosomes 17 to 22, and compared them with the human reference genome hg17. Our results show that insertions and deletions are almost absent in L1 and generally scarce in L2 isochore families (GC-poor L1+L2 isochores represent slightly over half of the human genome), whereas they increase in GC-rich isochores, largely paralleling the densities of genes, retroviral integrations and Alu sequences. The distributions of insertions/deletions are in striking contrast with those of SNPs which exhibit almost the same density across all isochore families with, however, a trend for lower concentrations in gene-rich regions. CONCLUSIONS Our study strongly suggests that the distribution of insertions/deletions is due to the structure of chromatin which is mostly open in gene-rich, GC-rich isochores, and largely closed in gene-poor, GC-poor isochores. The different distributions of insertions/deletions and SNPs are clearly related to the two different responsible mechanisms, namely recombination and point mutations.
Collapse
Affiliation(s)
- Maria Costantini
- Stazione Zoologica Anton Dohrn, Naples, Italy
- * E-mail: (MC); (GB)
| | - Giorgio Bernardi
- Stazione Zoologica Anton Dohrn, Naples, Italy
- * E-mail: (MC); (GB)
| |
Collapse
|
17
|
Wellensiek BP, Ramakrishnan R, Sundaravaradan V, Mehta R, Harris DT, Ahmad N. Differential HIV-1 integration targets more actively transcribed host genes in neonatal than adult blood mononuclear cells. Virology 2008; 385:28-38. [PMID: 19100594 DOI: 10.1016/j.virol.2008.10.052] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2008] [Revised: 08/30/2008] [Accepted: 10/31/2008] [Indexed: 11/30/2022]
Abstract
We have recently shown an increased HIV-1 replication and gene expression in neonatal (cord) blood mononuclear cells compared with adult cells, which could be due to HIV-1 integration as it targets active host genes. Here we have characterized 468 HIV-1 integration sites within cord and adult blood T-lymphocytes and monocyte-derived macrophages (MDM) from five donors. Several functional classes of genes were identified by gene ontology to be over represented, including genes for cellular components, maintenance of intracellular environment, enzyme regulation, cellular metabolism, catalytic activity and cation transport. Numerous potential transcription factor binding sites at the sites of integration were identified. Furthermore, the genes at the site of integration, transcription factors which potentially bind upstream of the HIV-1 promoter and factors that assist HIV-1 integration were found to be expressed at higher levels in cord than adult cells. Taken together, these results suggest HIV-1 integration occurred in a more actively transcribed genes in neonatal cells compared with adult cells, which may help explain a higher level of HIV-1 gene expression and replication in neonatal compared with adult cells.
Collapse
|
18
|
The core element of a CpG island protects avian sarcoma and leukosis virus-derived vectors from transcriptional silencing. J Virol 2008; 82:7818-27. [PMID: 18550662 DOI: 10.1128/jvi.00419-08] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Unmethylated CpG islands are known to keep adjacent promoters transcriptionally active. In the CpG island adjacent to the adenosine phosphoribosyltransferase gene, the protection against transcriptional silencing can be attributed to the short CpG-rich core element containing Sp1 binding sites. We report here the insertion of this CpG island core element, IE, into the long terminal repeat of a retroviral vector derived from Rous sarcoma virus, which normally suffers from progressive transcriptional silencing in mammalian cells. IE insertion into a specific position between enhancer and promoter sequences led to efficient protection of the integrated vector from silencing and gradual CpG methylation in rodent and human cells. Individual cell clones with IE-modified reporter vectors display high levels of reporter expression for a sustained period and without substantial variegation in the cell culture. The presence of Sp1 binding sites is important for the protective effect of IE, but at least some part of the entire antisilencing capacity is maintained in IE with mutated Sp1 sites. We suggest that this strategy of antisilencing protection by the CpG island core element may prove generally useful in retroviral vectors.
Collapse
|
19
|
Di Filippo M, Bernardi G. Mapping DNase-I hypersensitive sites on human isochores. Gene 2008; 419:62-5. [PMID: 18436395 DOI: 10.1016/j.gene.2008.02.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2007] [Revised: 02/04/2008] [Accepted: 02/05/2008] [Indexed: 10/22/2022]
Abstract
Mapping DNase-I hypersensitive sites (HS) was used in the past to identify regulatory elements of specific genes. More recently, thousands of HS were identified in the human genome by using high-throughput methods. These approaches showed a general enrichment of HS near or within known genes, within CpG islands, within human-mouse conserved regions and in GC-rich regions of the genome. Here we show that HS: (i) are characterized by a much higher GC level (approximately 56%) than the average GC level of the human genome (approximately 41%); (ii) are overwhelmingly located in the GC-richest compartment of the genome, which is predominantly associated with an open chromatin structure; (iii) and are slightly more and slightly less frequent than genes, respectively, in the gene-rich and in the gene-poor isochore families.
Collapse
Affiliation(s)
- Miriam Di Filippo
- Laboratory of Molecular Evolution, Stazione Zoologica Anton Dohrn, 80121 Naples, Italy
| | | |
Collapse
|
20
|
Arhondakis S, Clay O, Bernardi G. GC level and expression of human coding sequences. Biochem Biophys Res Commun 2008; 367:542-5. [PMID: 18177737 DOI: 10.1016/j.bbrc.2007.12.155] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2007] [Accepted: 12/21/2007] [Indexed: 11/29/2022]
Abstract
Several groups have addressed the issue of the influence of GC on expression levels in mammalian genes. In general, GC-rich genes appeared to be more expressed than GC-poor ones. Recently, expression levels of GC(3)-rich and GC(3)-poor versions of genes (GC(3) is the third codon position GC), inserted in vector plasmids, were compared in order to eliminate differences associated with their genomic context. Transfection experiments showed that GC(3)-rich genes were expressed more efficiently than their GC(3)-poor counterparts, indicating that GC(3) dramatically and intrinsically boosts expression efficiency. Here we show that, while the protocols used eliminated the original genomic context, they replaced it with the plasmid contexts whose compositional properties affected the results.
Collapse
Affiliation(s)
- Stilianos Arhondakis
- Laboratory of Molecular Evolution, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy
| | | | | |
Collapse
|
21
|
Kumar PP, Mehta S, Purbey PK, Notani D, Jayani RS, Purohit HJ, Raje DV, Ravi DS, Bhonde RR, Mitra D, Galande S. SATB1-binding sequences and Alu-like motifs define a unique chromatin context in the vicinity of human immunodeficiency virus type 1 integration sites. J Virol 2007; 81:5617-27. [PMID: 17376900 PMCID: PMC1900249 DOI: 10.1128/jvi.01405-06] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2006] [Accepted: 03/07/2007] [Indexed: 02/07/2023] Open
Abstract
Retroviral integration has recently been shown to be nonrandom, favoring transcriptionally active regions of chromatin. However, the mechanism for integration site selection by retroviruses is not clear. We show here the occurrence of Alu-like motifs in the sequences flanking the reported viral integration sites that are significantly different from those obtained from the randomly picked sequences from the human genome, suggesting that unique primary sequence features exist in the genomic regions targeted by human immunodeficiency virus type 1 (HIV-1). Additionally, these sequences were preferentially bound by SATB1, the T lineage-restricted chromatin organizer, in vitro and in vivo. Alu repeats make up nearly 10% of the human genome and have been implicated in the regulation of transcription. To specifically isolate sequences flanking the viral integration sites and also harboring both Alu-like repeats and SATB1-binding sites, we combined chromatin immunoprecipitation with sequential PCRs. The cloned sequences flanking HIV-1 integration sites were specifically immunoprecipitated and amplified from the pool of anti-SATB1-immunoprecipitated genomic DNA fragments isolated from HIV-1 NL4.3-infected Jurkat T-cell chromatin. Moreover, many of these sequences were preferentially partitioned in the DNA associated tightly with the nuclear matrix and not in the chromatin loops. Strikingly, many of these regions were disfavored for integration when SATB1 was silenced, providing unequivocal evidence for its role in HIV-1 integration site selection. We propose that definitive sequence features such as the Alu-like motifs and SATB1-binding sites provide a unique chromatin context in vivo which is preferentially targeted by the HIV-1 integration machinery.
Collapse
Affiliation(s)
- Pavan P Kumar
- National Centre for Cell Science, NCCS Complex, Ganeshkhind, Pune 411007, India
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Pajer P, Pecenka V, Králová J, Karafiát V, Průková D, Zemanová Z, Kodet R, Dvorák M. Identification of potential human oncogenes by mapping the common viral integration sites in avian nephroblastoma. Cancer Res 2006; 66:78-86. [PMID: 16397219 DOI: 10.1158/0008-5472.can-05-1728] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Gene deregulation is a frequent cause of malignant transformation. Alteration of the gene structure and/or expression leading to cellular transformation and tumor growth can be experimentally achieved by insertion of the retroviral genome into the host DNA. Retrovirus-containing host loci found repeatedly in clonal tumors are called common viral integration sites (cVIS). cVIS are located in genes or chromosomal regions whose alterations participate in cellular transformation. Here, we present the chicken model for the identification of oncogenes and tumor suppressor genes in solid tumors by mapping the cVIS. Using the combination of inverse PCR and long terminal repeat-rapid amplification of cDNA ends technique, we have analyzed 93 myeloblastosis-associated virus type 2-induced clonal nephroblastoma tumors in detail, and mapped >500 independent retroviral integration sites. Eighteen genomic loci were hit repeatedly and thus classified as cVIS, five of these genomic loci have previously been shown to be involved in malignant transformation of different human cell types. The expression levels of selected genes and their human orthologues have been assayed in chicken and selected human renal tumor samples, and their possible correlation with tumor development, has been suggested. We have found that genes associated with cVIS are frequently, but not in all cases, deregulated at the mRNA level as a result of proviral integration. Furthermore, the deregulation of their human orthologues has been observed in the samples of human pediatric renal tumors. Thus, the avian nephroblastoma is a valid source of cancer-associated genes. Moreover, the results bring deeper insight into the molecular background of tumorigenesis in distant species.
Collapse
Affiliation(s)
- Petr Pajer
- Institute of Molecular Genetics AS CR, Prague, Czech Republic
| | | | | | | | | | | | | | | |
Collapse
|
23
|
Lewinski MK, Bushman FD. Retroviral DNA integration--mechanism and consequences. ADVANCES IN GENETICS 2005; 55:147-81. [PMID: 16291214 DOI: 10.1016/s0065-2660(05)55005-3] [Citation(s) in RCA: 107] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Integration of retroviral cDNA into the host cell chromosome is an essential step in its replication. This process is catalyzed by the retroviral integrase protein, which is conserved among retroviruses and retrotransposons. Integrase binds viral and host DNA in a complex, called the preintegration complex (PIC), with other viral and cellular proteins. While the PIC is capable of directing integration of the viral DNA into any chromosomal location, different retroviruses have clear preferences for integration in or near particular chromosomal features. The determinants of integration site selection are under investigation but may include retrovirus-specific interactions between integrase and tethering factors bound to the host cell chromosomes. Research into the mechanisms of retroviral integration site selection has shed light on the phenomena of insertional mutagenesis and viral latency.
Collapse
Affiliation(s)
- Mary K Lewinski
- Infectious Disease Laboratory, The Salk Institute for Biological Studies, La Jolla, CA 92186, USA
| | | |
Collapse
|
24
|
Hematti P, Hong BK, Ferguson C, Adler R, Hanawa H, Sellers S, Holt IE, Eckfeldt CE, Sharma Y, Schmidt M, von Kalle C, Persons DA, Billings EM, Verfaillie CM, Nienhuis AW, Wolfsberg TG, Dunbar CE, Calmels B. Distinct genomic integration of MLV and SIV vectors in primate hematopoietic stem and progenitor cells. PLoS Biol 2004; 2:e423. [PMID: 15550989 PMCID: PMC529319 DOI: 10.1371/journal.pbio.0020423] [Citation(s) in RCA: 223] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2004] [Accepted: 10/04/2004] [Indexed: 12/18/2022] Open
Abstract
Murine leukemia virus (MLV)-derived vectors are widely used for hematopoietic stem cell (HSC) gene transfer, but lentiviral vectors such as the simian immunodeficiency virus (SIV) may allow higher efficiency transfer and better expression. Recent studies in cell lines have challenged the notion that retroviruses and retroviral vectors integrate randomly into their host genome. Medical applications using these vectors are aimed at HSCs, and thus large-scale comprehensive analysis of MLV and SIV integration in long-term repopulating HSCs is crucial to help develop improved integrating vectors. We studied integration sites in HSCs of rhesus monkeys that had been transplanted 6 mo to 6 y prior with MLV- or SIV-transduced CD34+ cells. Unique MLV (491) and SIV (501) insertions were compared to a set of in silico-generated random integration sites. While MLV integrants were located predominantly around transcription start sites, SIV integrants strongly favored transcription units and gene-dense regions of the genome. These integration patterns suggest different mechanisms for integration as well as distinct safety implications for MLV versus SIV vectors. A primate model of gene transfer into hematopoietic stem cells demonstrated MLV integration around transcription start sites whereas SIV integrated into gene-dense regions, indicating distinct safety implications for each
Collapse
Affiliation(s)
- Peiman Hematti
- 1Hematology Branch, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Bum-Kee Hong
- 1Hematology Branch, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Cole Ferguson
- 1Hematology Branch, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Rima Adler
- 1Hematology Branch, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Hideki Hanawa
- 2Experimental Hematology Division, Department of Hematology/Oncology, St. Jude Children's Research HospitalMemphis, TennesseeUnited States of America
| | - Stephanie Sellers
- 1Hematology Branch, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Ingeborg E Holt
- 3Genome Technology Branch, National Human Genome Research Institute, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Craig E Eckfeldt
- 4Stem Cell Institute, University of MinnesotaMinneapolis, MinnesotaUnited States of America
| | - Yugal Sharma
- 5Bioinformatics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Manfred Schmidt
- 6Department of Internal Medicine, University of Freiburg, Freiburg, Germany
| | - Christof von Kalle
- 7Division of Experimental Hematology, Children's Hospital Research FoundationCincinnati, OhioUnited States of America
| | - Derek A Persons
- 2Experimental Hematology Division, Department of Hematology/Oncology, St. Jude Children's Research HospitalMemphis, TennesseeUnited States of America
| | - Eric M Billings
- 5Bioinformatics Core Facility, National Heart, Lung, and Blood Institute, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Catherine M Verfaillie
- 4Stem Cell Institute, University of MinnesotaMinneapolis, MinnesotaUnited States of America
| | - Arthur W Nienhuis
- 2Experimental Hematology Division, Department of Hematology/Oncology, St. Jude Children's Research HospitalMemphis, TennesseeUnited States of America
| | - Tyra G Wolfsberg
- 3Genome Technology Branch, National Human Genome Research Institute, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Cynthia E Dunbar
- 1Hematology Branch, National Institutes of HealthBethesda, MarylandUnited States of America
| | - Boris Calmels
- 1Hematology Branch, National Institutes of HealthBethesda, MarylandUnited States of America
| |
Collapse
|
25
|
|
26
|
Borisenko LG, Rynditch AV, Bernardi G. Distribution and expression of chicken endogenous retroviruses in the host genome. ACTA ACUST UNITED AC 2004. [DOI: 10.7124/bc.000690] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- L. G. Borisenko
- Institute of Molecular Biology and Genetics, National Academy of Sciences of Ukraine
| | - A. V. Rynditch
- Institute of Molecular Biology and Genetics, National Academy of Sciences of Ukraine
| | - G. Bernardi
- Laboratorio di Evoluzione Molecolare, Stazione Zoologica Anton Dohrn
| |
Collapse
|
27
|
Hejnar J, Elleder D, Hájková P, Walter J, Blazková J, Svoboda J. Demethylation of host-cell DNA at the site of avian retrovirus integration. Biochem Biophys Res Commun 2003; 311:641-8. [PMID: 14623319 DOI: 10.1016/j.bbrc.2003.10.035] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The transcriptional activity of an integrated retroviral copy strongly depends on the adjacent host-cell DNA at the site of integration. Transcribed DNA loci as well as cis-acting sequences like enhancers or CpG islands usually permit expression of nearby integrated proviruses. In contrast, proviruses residing close to cellular silencers tend to transcriptional silencing and CpG methylation. Little is known, however, about the influence of provirus integration on the target sequence in the host genome. Here, we report interesting features of a simplified Rous sarcoma virus integrated into a non-transcribed hypermethylated DNA sequence in the Syrian hamster genome. After integration, CpG methylation of this sequence has been lost almost completely and hypomethylated DNA permits proviral transcription and hamster cell transformation by the proviral v-src oncogene. This, however, is not a stable state, and non-transformed revertants bearing transcriptionally silenced proviruses segregate with a high rate. The provirus silencing is followed by DNA methylation of both provirus regulatory regions and adjacent cellular sequences. This CpG methylation is very dense and resistant to the demethylation effects of 5-aza-2(')-deoxycytidine and/or trichostatin A. Our description exemplifies the capacity of retroviruses/retroviral vectors to overcome, at least transiently, negative position effects of DNA methylation at the site of integration.
Collapse
Affiliation(s)
- Jirí Hejnar
- Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Flemingovo námestí 2, 16637 6, Prague, Czech Republic.
| | | | | | | | | | | |
Collapse
|
28
|
Chromatin insulators and position effects. ACTA ACUST UNITED AC 2003. [DOI: 10.1016/s0167-7306(03)38023-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
29
|
|