1
|
Xu D, Tang L, Zhou J, Wang F, Cao H, Huang Y, Kapranov P. Evidence for widespread existence of functional novel and non-canonical human transcripts. BMC Biol 2023; 21:271. [PMID: 38001496 PMCID: PMC10675921 DOI: 10.1186/s12915-023-01753-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 10/31/2023] [Indexed: 11/26/2023] Open
Abstract
BACKGROUND Fraction of functional sequence in the human genome remains a key unresolved question in Biology and the subject of vigorous debate. While a plethora of studies have connected a significant fraction of human DNA to various biochemical processes, the classical definition of function requires evidence of effects on cellular or organismal fitness that such studies do not provide. Although multiple high-throughput reverse genetics screens have been developed to address this issue, they are limited to annotated genomic elements and suffer from non-specific effects, arguing for a strong need to develop additional functional genomics approaches. RESULTS In this work, we established a high-throughput lentivirus-based insertional mutagenesis strategy as a forward genetics screen tool in aneuploid cells. Application of this approach to human cell lines in multiple phenotypic screens suggested the presence of many yet uncharacterized functional elements in the human genome, represented at least in part by novel exons of known and novel genes. The novel transcripts containing these exons can be massively, up to thousands-fold, induced by specific stresses, and at least some can represent bi-cistronic protein-coding mRNAs. CONCLUSIONS Altogether, these results argue that many unannotated and non-canonical human transcripts, including those that appear as aberrant splice products, have biological relevance under specific biological conditions.
Collapse
Affiliation(s)
- Dongyang Xu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Lu Tang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Junjun Zhou
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Fang Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Huifen Cao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Yu Huang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, 361102, China.
| |
Collapse
|
2
|
Simple innovative adaptor to improve genome walking with convenient PCR. J Genet Eng Biotechnol 2020; 18:64. [PMID: 33083895 PMCID: PMC7575660 DOI: 10.1186/s43141-020-00082-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Accepted: 09/30/2020] [Indexed: 11/10/2022]
Abstract
BACKGROUND Various polymerase chain reaction (PCR)-based methods have been applied for the development of genome walking (GW) technique. These methods which could be based on the application of restriction enzymes or primers have various efficiencies to identify the unknown nucleotide sequences. The present study was conducted to design a new innovative double-strand adaptor using MAP30 gene sequence of Momordica charantia plant as a model to improve genome walking with convenient PCR. RESULTS The adaptor was designed using multiple restriction sites of Hind III, BamH I, EcoR I, and Bgl II enzymes with no restriction site in a known sequence of the MAP30 gene. In addition, no modification was required to add phosphate, amine, or other groups to the adaptor, since restriction enzyme digestion of double-strand adaptor provided the 5' phosphate group. Here, preparation of the phosphate group in the genomic DNA of the plant digestion with restriction enzymes was performed followed by ligation with digested adaptor containing 5' phosphate group. CONCLUSION PCR was done to amplify the unknown sequence using MAP30 gene-specific primer and adaptor primer. Results confirmed the ability of the technique for successful identification of the sequence. Consequently, a newly designed adaptor in the developed technique reduced the time and cost of the method compared to the conventional genome walking; also, cloning and culturing of bacterial steps could be eliminated.
Collapse
|
3
|
Chang EC, Liu H, West JA, Zhou X, Dakhova O, Wheeler DA, Heslop HE, Brenner MK, Dotti G. Clonal Dynamics In Vivo of Virus Integration Sites of T Cells Expressing a Safety Switch. Mol Ther 2015; 24:736-45. [PMID: 26639404 DOI: 10.1038/mt.2015.217] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Accepted: 11/23/2015] [Indexed: 12/13/2022] Open
Abstract
Safety switches are becoming relevant for the clinical translation of T-cell-based immunotherapies. In patients receiving an allogeneic hematopoietic stem cell transplant, the inducible caspase-9 gene (iC9) safety switch expressed by donor-derived T lymphocytes efficiently controls acute graft versus host disease (GvHD). However, in vivo elimination of iC9-T cells by the chemical inducer of dimerization (CID) that activates the iC9 protein is incomplete. To study this effect, we characterized the clonal diversity and dynamics of vector insertion sites (VIS) in iC9-T cells pre- and post-CID administration in four patients who developed GvHD. We identified 3,203 VIS among four patients and followed their in vivo clonal dynamics up to 161 days post-CID. VIS were categorized by their proximity to host genome elements, gene associations, and cis-modulatory relationship to mapped promoters. We found that VIS are preferentially located near open chromatin and promoter regions; furthermore, there was no evidence for selection bias among VIS surviving the CID treatment. The majority of iC9-T cells with high normalized VIS copy number at the time of GvHD onset were eliminated by CID, while iC9-T cells detectable post-CID generally have low normalized VIS copy number. We propose that suboptimal iC9 transgene expression is responsible for the incomplete elimination of iC9-T cells and illustrate here by simple model how cis-modulatory influences of local genome context and T-cell receptor activation status at time of CID treatment contribute to stochastic sparing of iC9-T cells.
Collapse
Affiliation(s)
- Edmund C Chang
- Center for Cell and Gene Therapy, Baylor College of Medicine, Houston, Texas, USA.,Houston Methodist Hospital, Houston, Texas, USA
| | - Hao Liu
- Biostatistics Shared Resources, Baylor College of Medicine, Houston, Texas, USA
| | - John A West
- Lineberger Cancer Center, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Xiaoou Zhou
- Center for Cell and Gene Therapy, Baylor College of Medicine, Houston, Texas, USA.,Houston Methodist Hospital, Houston, Texas, USA
| | - Olga Dakhova
- Center for Cell and Gene Therapy, Baylor College of Medicine, Houston, Texas, USA.,Houston Methodist Hospital, Houston, Texas, USA
| | - David A Wheeler
- Human Sequencing Center, Baylor College of Medicine, Houston, Texas, USA
| | - Helen E Heslop
- Center for Cell and Gene Therapy, Baylor College of Medicine, Houston, Texas, USA.,Houston Methodist Hospital, Houston, Texas, USA.,Department of Pediatrics, Texas Children's Hospital, Baylor College of Medicine, Houston, Texas, USA.,Department of Medicine, Baylor College of Medicine, Houston, Texas, USA
| | - Malcolm K Brenner
- Center for Cell and Gene Therapy, Baylor College of Medicine, Houston, Texas, USA.,Houston Methodist Hospital, Houston, Texas, USA.,Department of Pediatrics, Texas Children's Hospital, Baylor College of Medicine, Houston, Texas, USA.,Department of Medicine, Baylor College of Medicine, Houston, Texas, USA
| | - Gianpietro Dotti
- Center for Cell and Gene Therapy, Baylor College of Medicine, Houston, Texas, USA.,Houston Methodist Hospital, Houston, Texas, USA.,Lineberger Cancer Center, University of North Carolina, Chapel Hill, North Carolina, USA.,Department of Medicine, Baylor College of Medicine, Houston, Texas, USA.,Department of Pathology and Immunology, Baylor College of Medicine, Houston, Texas, USA
| |
Collapse
|
4
|
Henssen AG, Henaff E, Jiang E, Eisenberg AR, Carson JR, Villasante CM, Ray M, Still E, Burns M, Gandara J, Feschotte C, Mason CE, Kentsis A. Genomic DNA transposition induced by human PGBD5. eLife 2015; 4. [PMID: 26406119 PMCID: PMC4625184 DOI: 10.7554/elife.10565] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Accepted: 09/23/2015] [Indexed: 11/13/2022] Open
Abstract
Transposons are mobile genetic elements that are found in nearly all organisms, including humans. Mobilization of DNA transposons by transposase enzymes can cause genomic rearrangements, but our knowledge of human genes derived from transposases is limited. In this study, we find that the protein encoded by human PGBD5, the most evolutionarily conserved transposable element-derived gene in vertebrates, can induce stereotypical cut-and-paste DNA transposition in human cells. Genomic integration activity of PGBD5 requires distinct aspartic acid residues in its transposase domain, and specific DNA sequences containing inverted terminal repeats with similarity to piggyBac transposons. DNA transposition catalyzed by PGBD5 in human cells occurs genome-wide, with precise transposon excision and preference for insertion at TTAA sites. The apparent conservation of DNA transposition activity by PGBD5 suggests that genomic remodeling contributes to its biological function.
Collapse
Affiliation(s)
- Anton G Henssen
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Elizabeth Henaff
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, United States
| | - Eileen Jiang
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Amy R Eisenberg
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Julianne R Carson
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Camila M Villasante
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Mondira Ray
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Eric Still
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Melissa Burns
- Boston Children's Hospital, Harvard Medical School, Boston, United States
| | - Jorge Gandara
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, United States
| | - Cedric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, United States
| | - Christopher E Mason
- Institute for Computational Biomedicine, Weill Cornell Medical College, New York, United States
| | - Alex Kentsis
- Molecular Pharmacology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, United States.,Department of Pediatrics, Memorial Sloan Kaettering Cancer Center, New York, United States.,Weill Cornell Medical College, Cornell University, New York, United States
| |
Collapse
|
5
|
Deakin CT, Deakin JJ, Ginn SL, Young P, Humphreys D, Suter CM, Alexander IE, Hallwirth CV. Impact of next-generation sequencing error on analysis of barcoded plasmid libraries of known complexity and sequence. Nucleic Acids Res 2014; 42:e129. [PMID: 25013183 PMCID: PMC4176369 DOI: 10.1093/nar/gku607] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Barcoded vectors are promising tools for investigating clonal diversity and dynamics in hematopoietic gene therapy. Analysis of clones marked with barcoded vectors requires accurate identification of potentially large numbers of individually rare barcodes, when the exact number, sequence identity and abundance are unknown. This is an inherently challenging application, and the feasibility of using contemporary next-generation sequencing technologies is unresolved. To explore this potential application empirically, without prior assumptions, we sequenced barcode libraries of known complexity. Libraries containing 1, 10 and 100 Sanger-sequenced barcodes were sequenced using an Illumina platform, with a 100-barcode library also sequenced using a SOLiD platform. Libraries containing 1 and 10 barcodes were distinguished from false barcodes generated by sequencing error by a several log-fold difference in abundance. In 100-barcode libraries, however, expected and false barcodes overlapped and could not be resolved by bioinformatic filtering and clustering strategies. In independent sequencing runs multiple false-positive barcodes appeared to be represented at higher abundance than known barcodes, despite their confirmed absence from the original library. Such errors, which potentially impact barcoding studies in an application-dependent manner, are consistent with the existence of both stochastic and systematic error, the mechanism of which is yet to be fully resolved.
Collapse
Affiliation(s)
- Claire T Deakin
- Gene Therapy Research Unit, Children's Medical Research Institute and The Children's Hospital at Westmead, Westmead, New South Wales 2145, Australia
| | - Jeffrey J Deakin
- Gene Therapy Research Unit, Children's Medical Research Institute and The Children's Hospital at Westmead, Westmead, New South Wales 2145, Australia
| | - Samantha L Ginn
- Gene Therapy Research Unit, Children's Medical Research Institute and The Children's Hospital at Westmead, Westmead, New South Wales 2145, Australia
| | - Paul Young
- Molecular Genetics Division, Victor Chang Cardiac Research Institute, Sydney, Darlinghurst, New South Wales 2010, Australia
| | - David Humphreys
- Molecular Genetics Division, Victor Chang Cardiac Research Institute, Sydney, Darlinghurst, New South Wales 2010, Australia
| | - Catherine M Suter
- Molecular Genetics Division, Victor Chang Cardiac Research Institute, Sydney, Darlinghurst, New South Wales 2010, Australia Faculty of Medicine, University of New South Wales, Kensington, New South Wales 2052, Australia
| | - Ian E Alexander
- Gene Therapy Research Unit, Children's Medical Research Institute and The Children's Hospital at Westmead, Westmead, New South Wales 2145, Australia Discipline of Paediatrics and Child Health, The Children's Hospital at Westmead Clinical School, The University of Sydney, Westmead, New South Wales 2145, Australia
| | - Claus V Hallwirth
- Gene Therapy Research Unit, Children's Medical Research Institute and The Children's Hospital at Westmead, Westmead, New South Wales 2145, Australia
| |
Collapse
|
6
|
Wu C, Jares A, Winkler T, Xie J, Metais JY, Dunbar CE. High efficiency restriction enzyme-free linear amplification-mediated polymerase chain reaction approach for tracking lentiviral integration sites does not abrogate retrieval bias. Hum Gene Ther 2012; 24:38-47. [PMID: 22992116 DOI: 10.1089/hum.2012.082] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Retroviral vectors are an efficient and widely employed means of introducing an exogenous expression cassette into target cells. These vectors have been shown to integrate semi-randomly into the cellular genome, and can be associated with genotoxicity due to impact on expression of proximate genes. Therefore, efficient and accurate integration site analysis, while quantifying contributions of individual vector-containing clones, is desirable. Linear amplification-mediated polymerase chain reaction (LAM-PCR) is a widely used technique for identifying integrated proviral and host genomic DNA junctions. However, LAM-PCR is subject to selection bias inherent in the reliance of the assay on the presence of a restriction enzyme-cutting site adjacent to a retrievable integration site, and it is further limited by an inability to discriminate prior to sequencing between the flanking genomic DNA of interest and uninformative internal vector DNA. We report a modified restriction enzyme-free LAM-PCR (Re-free LAM-PCR) approach that is less time and labor intensive compared to conventional LAM-PCR, but in contrast to some other nonrestrictive methods, compares in efficiency and sensitivity, excludes retrieval of uninformative internal vector sequences, and allows retrieval of integration sites unbiased by the presence of nearby restriction sites. However, we report that Re-free LAM-PCR remains inaccurate for quantitation of the relative contributions of individual integration site-containing clones in a polyclonal setting, suggesting that bias in LAM-PCR retrieval of integration sites is not wholly explained by restriction enzyme-related factors.
Collapse
Affiliation(s)
- Chuanfeng Wu
- Hematology Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | | | |
Collapse
|
7
|
Wu C, Dunbar CE. Stem cell gene therapy: the risks of insertional mutagenesis and approaches to minimize genotoxicity. Front Med 2011; 5:356-71. [PMID: 22198747 DOI: 10.1007/s11684-011-0159-1] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2011] [Accepted: 09/08/2011] [Indexed: 12/15/2022]
Abstract
Virus-based vectors are widely used in hematopoietic stem cell (HSC) gene therapy, and have the ability to integrate permanently into genomic DNA, thus driving long-term expression of corrective genes in all hematopoietic lineages. To date, HSC gene therapy has been successfully employed in the clinic for improving clinical outcomes in small numbers of patients with X-linked severe combined immunodeficiency (SCID-X1), adenosine deaminase deficiency (ADA-SCID), adrenoleukodystrophy (ALD), thalassemia, chronic granulomatous disease (CGD), and Wiskott-Aldrich syndrome (WAS). However, adverse events were observed during some of these HSC gene therapy clinical trials, linked to insertional activation of proto-oncogenes by integrated proviral vectors leading to clonal expansion and eventual development of leukemia. Numerous studies have been performed to understand the molecular basis of vector-mediated genotoxicity, with the aim of developing safer vectors and lower-risk gene therapy protocols. This review will summarize current information on the mechanisms of insertional mutagenesis in hematopoietic stem and progenitor cells due to integrating gene transfer vectors, discuss the available assays for predicting genotoxicity and mapping vector integration sites, and introduce newly-developed approaches for minimizing genotoxicity as a way to further move HSC gene therapy forward into broader clinical application.
Collapse
Affiliation(s)
- Chuanfeng Wu
- Hematology Branch, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | |
Collapse
|
8
|
Abstract
Genome walking is a molecular procedure for the direct identification of nucleotide sequences from purified genomes. The only requirement is the availability of a known nucleotide sequence from which to start. Several genome walking methods have been developed in the last 20 years, with continuous improvements added to the first basic strategies, including the recent coupling with next generation sequencing technologies. This review focuses on the use of genome walking strategies in several aspects of the study of eukaryotic genomes. In a first part, the analysis of the numerous strategies available is reported. The technical aspects involved in genome walking are particularly intriguing, also because they represent the synthesis of the talent, the fantasy and the intelligence of several scientists. Applications in which genome walking can be employed are systematically examined in the second part of the review, showing the large potentiality of this technique, including not only the simple identification of nucleotide sequences but also the analysis of large collections of mutants obtained from the insertion of DNA of viral origin, transposons and transfer DNA (T-DNA) constructs. The enormous amount of data obtained indicates that genome walking, with its large range of applicability, multiplicity of strategies and recent developments, will continue to have much to offer for the rapid identification of unknown sequences in several fields of genomic research.
Collapse
Affiliation(s)
- Claudia Leoni
- Department of Biochemistry and Molecular Biology, University of Bari, Bari, Italy
| | | | | | | | | |
Collapse
|
9
|
Kustikova O, Brugman M, Baum C. The genomic risk of somatic gene therapy. Semin Cancer Biol 2010; 20:269-78. [DOI: 10.1016/j.semcancer.2010.06.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2010] [Revised: 06/02/2010] [Accepted: 06/24/2010] [Indexed: 01/08/2023]
|
10
|
Long-term outcome of EBV-specific T-cell infusions to prevent or treat EBV-related lymphoproliferative disease in transplant recipients. Blood 2009; 115:925-35. [PMID: 19880495 DOI: 10.1182/blood-2009-08-239186] [Citation(s) in RCA: 616] [Impact Index Per Article: 41.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
T-cell immunotherapy that takes advantage of Epstein-Barr virus (EBV)-stimulated immunity has the potential to fill an important niche in targeted therapy for EBV-related cancers. To address questions of long-term efficacy, safety, and practicality, we studied 114 patients who had received infusions of EBV-specific cytotoxic T lymphocytes (CTLs) at 3 different centers to prevent or treat EBV(+) lymphoproliferative disease (LPD) arising after hematopoietic stem cell transplantation. Toxicity was minimal, consisting mainly of localized swelling at sites of responsive disease. None of the 101 patients who received CTL prophylaxis developed EBV(+) LPD, whereas 11 of 13 patients treated with CTLs for biopsy-proven or probable LPD achieved sustained complete remissions. The gene-marking component of this study enabled us to demonstrate the persistence of functional CTLs for up to 9 years. A preliminary analysis indicated that a patient-specific CTL line can be manufactured, tested, and infused for $6095, a cost that compares favorably with other modalities used in the treatment of LPD. We conclude that the CTL lines described here provide safe and effective prophylaxis or treatment for lymphoproliferative disease in transplantation recipients, and the manufacturing methodology is robust and can be transferred readily from one institution to another without loss of reproducibility.
Collapse
|