1
Ylagan M, Xu Q, Kowalski J. TTSBBC: triplex target site biomarkers and barcodes in cancer. Nucleic Acids Res 2024; 52:W547-W555. [PMID: 38661214 PMCID: PMC11223863 DOI: 10.1093/nar/gkae312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2024] [Revised: 03/28/2024] [Accepted: 04/10/2024] [Indexed: 04/26/2024]
Abstract
The technology of triplex-forming oligonucleotides (TFOs) provides an approach to manipulate genes at the DNA level. TFOs bind to specific sites on genomic DNA, creating a unique intermolecular triple-helix DNA structure through Hoogsteen hydrogen bonding. This targeting by TFOs is site-specific and the locations TFOs bind are referred to as TFO target sites (TTS). Triplexes have been observed to selectively influence gene expression, homologous recombination, mutations, protein binding, and DNA damage. These sites typically feature a poly-purine sequence in duplex DNA, and the characteristics of these TTS sequences greatly influence the formation of the triplex. We introduce TTSBBC, a novel analysis and visualization platform designed to explore features of TTS sequences to enable users to design and validate TTSs. The web server can be freely accessed at https://kowalski-labapps.dellmed.utexas.edu/TTSBBC/.
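As a rough illustration of the poly-purine feature mentioned in this abstract, the Python sketch below scans a DNA string for runs of purines (A/G). It is not the method used by TTSBBC: the function name `find_polypurine_tracts` and the `min_len=15` default are assumptions chosen only for illustration. A pyrimidine (C/T) run on the given strand is reported as a candidate on the complementary strand, since its complement is a purine run.

```python
import re


def find_polypurine_tracts(seq, min_len=15):
    """Return sorted (start, end, strand, tract) tuples for candidate tracts.

    A "+" hit is a run of purines (A/G) on the given strand. A "-" hit is a
    run of pyrimidines (C/T), i.e. a purine run on the complementary strand.
    Coordinates are 0-based, end-exclusive. min_len is a hypothetical cutoff.
    """
    seq = seq.upper()
    hits = []
    for pattern, strand in ((r"[AG]+", "+"), (r"[CT]+", "-")):
        for match in re.finditer(pattern, seq):
            if match.end() - match.start() >= min_len:
                hits.append((match.start(), match.end(), strand, match.group()))
    return sorted(hits)


if __name__ == "__main__":
    example = "TC" + "AGGAGAAGGGAGAAGA" + "CT"
    for start, end, strand, tract in find_polypurine_tracts(example, min_len=10):
        print(start, end, strand, tract)
```

A real TTS search would also have to weigh mismatches, interruptions, and sequence composition, which this sketch deliberately ignores.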
Affiliation(s)
- Maya Ylagan
- Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX 78712, USA
- Qi Xu
- Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX 78712, USA
- Jeanne Kowalski
- Department of Oncology, Dell Medical School, The University of Texas at Austin, Austin, TX 78712, USA
2
Tao S, Hou Y, Diao L, Hu Y, Xu W, Xie S, Xiao Z. Long noncoding RNA study: Genome-wide approaches. Genes Dis 2023; 10:2491-2510. [PMID: 37554208 PMCID: PMC10404890 DOI: 10.1016/j.gendis.2022.10.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 06/11/2022] [Revised: 10/09/2022] [Accepted: 10/23/2022] [Indexed: 11/30/2022]
Abstract
Long noncoding RNAs (lncRNAs) have been confirmed to play a crucial role in various biological processes across several species. Though many efforts have been devoted to the expansion of the lncRNAs landscape, much about lncRNAs is still unknown due to their great complexity. The development of high-throughput technologies and the constantly improved bioinformatic methods have resulted in a rapid expansion of lncRNA research and relevant databases. In this review, we introduced genome-wide research of lncRNAs in three parts: (i) novel lncRNA identification by high-throughput sequencing and computational pipelines; (ii) functional characterization of lncRNAs by expression atlas profiling, genome-scale screening, and the research of cancer-related lncRNAs; (iii) mechanism research by large-scale experimental technologies and computational analysis. Besides, primary experimental methods and bioinformatic pipelines related to these three parts are summarized. This review aimed to provide a comprehensive and systemic overview of lncRNA genome-wide research strategies and indicate a genome-wide lncRNA research system.
Affiliation(s)
- Shuang Tao
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Yarui Hou
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Liting Diao
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Yanxia Hu
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Wanyi Xu
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Shujuan Xie
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Institute of Vaccine, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
- Zhendong Xiao
- The Biotherapy Center, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong 510630, China
3
Kulkarni V, Jayakumar S, Mohan M, Kulkarni S. Aid or Antagonize: Nuclear Long Noncoding RNAs Regulate Host Responses and Outcomes of Viral Infections. Cells 2023; 12:987. [PMID: 37048060 PMCID: PMC10093752 DOI: 10.3390/cells12070987] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 10/06/2022] [Revised: 03/12/2023] [Accepted: 03/15/2023] [Indexed: 04/14/2023]
Abstract
Long noncoding RNAs (lncRNAs) are transcripts measuring >200 bp in length and devoid of protein-coding potential. LncRNAs exceed the number of protein-coding mRNAs and regulate cellular, developmental, and immune pathways through diverse molecular mechanisms. In recent years, lncRNAs have emerged as epigenetic regulators with prominent roles in health and disease. Many lncRNAs, either host or virus-encoded, have been implicated in critical cellular defense processes, such as cytokine and antiviral gene expression, the regulation of cell signaling pathways, and the activation of transcription factors. In addition, cellular and viral lncRNAs regulate virus gene expression. Viral infections and associated immune responses alter the expression of host lncRNAs regulating immune responses, host metabolism, and viral replication. The influence of lncRNAs on the pathogenesis and outcomes of viral infections is being widely explored because virus-induced lncRNAs can serve as diagnostic and therapeutic targets. Future studies should focus on thoroughly characterizing lncRNA expressions in virus-infected primary cells, investigating their role in disease prognosis, and developing biologically relevant animal or organoid models to determine their suitability for specific therapeutic targeting. Many cellular and viral lncRNAs localize in the nucleus and epigenetically modulate viral transcription, latency, and host responses to infection. In this review, we provide an overview of the role of nuclear lncRNAs in the pathogenesis and outcomes of viral infections, such as the Influenza A virus, Sendai Virus, Respiratory Syncytial Virus, Hepatitis C virus, Human Immunodeficiency Virus, and Herpes Simplex Virus. We also address significant advances and barriers in characterizing lncRNA function and explore the potential of lncRNAs as therapeutic targets.
Affiliation(s)
- Viraj Kulkarni
- Disease Intervention and Prevention Program, Texas Biomedical Research Institute, San Antonio, TX 78227, USA
- Sahana Jayakumar
- Host-Pathogen Interaction Program, Texas Biomedical Research Institute, San Antonio, TX 78227, USA; (S.J.); (M.M.)
- Mahesh Mohan
- Host-Pathogen Interaction Program, Texas Biomedical Research Institute, San Antonio, TX 78227, USA; (S.J.); (M.M.)
- Smita Kulkarni
- Host-Pathogen Interaction Program, Texas Biomedical Research Institute, San Antonio, TX 78227, USA; (S.J.); (M.M.)
4
Bekkouche I, Shishonin AY, Vetcher AA. Recent Development in Biomedical Applications of Oligonucleotides with Triplex-Forming Ability. Polymers (Basel) 2023; 15:858. [PMID: 36850142 PMCID: PMC9964087 DOI: 10.3390/polym15040858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/25/2022] [Revised: 01/31/2023] [Accepted: 02/02/2023] [Indexed: 02/12/2023]
Abstract
A DNA structure, known as triple-stranded DNA, is made up of three oligonucleotide chains that wind around one another to form a triple helix (TFO). Hoogsteen base pairing describes how triple-stranded DNA may be built at certain conditions by the attachment of the third strand to an RNA, PNA, or DNA, which might all be employed as oligonucleotide chains. In each of these situations, the oligonucleotides can be employed as an anchor, in conjunction with a specific bioactive chemical, or as a messenger that enables switching between transcription and replication through the triplex-forming zone. These data are also considered since various illnesses have been linked to the expansion of triplex-prone sequences. In light of metabolic acidosis and associated symptoms, some consideration is given to the impact of several low-molecular-weight compounds, including pH on triplex production in vivo. The review is focused on the development of biomedical oligonucleotides with triplexes.
Affiliation(s)
- Incherah Bekkouche
- Nanotechnology Scientific and Educational Center, Institute of Biochemical Technology and Nanotechnology, Peoples’ Friendship University of Russia (RUDN), Miklukho-Maklaya Str. 6, Moscow 117198, Russia
- Alexander Y. Shishonin
- Complementary and Integrative Health Clinic of Dr. Shishonin, 5, Yasnogorskaya Str., Moscow 117588, Russia
- Alexandre A. Vetcher
- Nanotechnology Scientific and Educational Center, Institute of Biochemical Technology and Nanotechnology, Peoples’ Friendship University of Russia (RUDN), Miklukho-Maklaya Str. 6, Moscow 117198, Russia
- Complementary and Integrative Health Clinic of Dr. Shishonin, 5, Yasnogorskaya Str., Moscow 117588, Russia
5
Sun L, Cao B, Liu Y, Shi P, Zheng Y, Wang B, Zhang Q. TripDesign: A DNA Triplex Design Approach Based on Interaction Forces. J Phys Chem B 2022; 126:8708-8719. [PMID: 36260921 DOI: 10.1021/acs.jpcb.2c05611] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Indexed: 01/11/2023]
Abstract
A DNA triplex has the advantages of improved nanostructure stability and pH environment responsiveness compared with single-stranded and double-stranded nucleic acids. However, sequence stability and low design efficiency hinder the application of DNA triplexes. Therefore, a DNA triplex design approach (TripDesign) based on interaction forces is proposed. First, we present the stacking force constraint, torsional stress constraint, and G-quadruplex motif constraint and then use an improved memetic algorithm to design triplex sequences under combinatorial constraints. Finally, to quantify the process of triplex formation, we also explore the minimum length of the triplex-forming oligos (TFOs) required to form the triplex and the factors that produce depletion in cyclic pH-jump experiments. The experimental results show that the sequences produced by TripDesign have high stability and reversibility, and the proposed approach achieves efficient and automatic sequence design. In addition, this study characterizes multiple basic parameters of DNA triplex formation and promotes the wider application of DNA triplexes in nanotechnology.
Affiliation(s)
- Lijun Sun
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
- Ben Cao
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
- Yuan Liu
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
- Peijun Shi
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
- Yanfen Zheng
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
- Bin Wang
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
- Qiang Zhang
- The Key Laboratory of Advanced Design and Intelligent Computing, Ministry of Education, School of Software Engineering, Dalian University, Dalian 116622, China
6
Abstract
Most of the transcribed human genome codes for noncoding RNAs (ncRNAs), and long noncoding RNAs (lncRNAs) make up the lion's share of the human ncRNA space. Despite growing interest in lncRNAs, because there are so many of them, and because of their tissue specialization and, often, lower abundance, their catalog remains incomplete and there are multiple ongoing efforts to improve it. Consequently, the number of human lncRNA genes may be lower than 10,000 or higher than 200,000. A key open challenge for lncRNA research, now that so many lncRNA species have been identified, is the characterization of lncRNA function and the interpretation of the roles of genetic and epigenetic alterations at their loci. After all, the most important human genes to catalog and study are those that contribute to important cellular functions, that affect development or cell differentiation, and whose dysregulation may play a role in the genesis and progression of human diseases. Multiple efforts have used screens based on RNA-mediated interference (RNAi), antisense oligonucleotide (ASO), and CRISPR screens to identify the consequences of lncRNA dysregulation and predict lncRNA function in select contexts, but these approaches have unresolved scalability and accuracy challenges. Instead, as was the case for better-studied ncRNAs in the past, researchers often focus on characterizing lncRNA interactions and investigating their effects on genes and pathways with known functions. Here, we focus most of our review on computational methods to identify lncRNA interactions and to predict the effects of their alterations and dysregulation on human disease pathways.
7
Hao A, Wang Y, Stovall DB, Wang Y, Sui G. Emerging Roles of LncRNAs in the EZH2-regulated Oncogenic Network. Int J Biol Sci 2021; 17:3268-3280. [PMID: 34512145 PMCID: PMC8416728 DOI: 10.7150/ijbs.63488] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 06/23/2021] [Accepted: 07/16/2021] [Indexed: 12/15/2022]
Abstract
Cancer is a life-threatening disease, but cancer therapies based on epigenetic mechanisms have made great progress. Enhancer of zeste homolog 2 (EZH2) is the key catalytic component of Polycomb repressive complex 2 (PRC2) that mediates the tri-methylation of lysine 27 on histone 3 (H3K27me3), a well-recognized marker of transcriptional repression. Mounting evidence indicates that EZH2 is elevated in various cancers and associates with poor prognosis. In addition, many studies revealed that EZH2 is also involved in transcriptional repression dependent or independent of PRC2. Meanwhile, long non-coding RNAs (lncRNAs) have been reported to regulate numerous and diverse signaling pathways in oncogenesis. In this review, we firstly discuss functional interactions between EZH2 and lncRNAs that determine PRC2-dependent and -independent roles of EZH2. Secondly, we summarize the lncRNAs regulating EZH2 expression at transcription, post-transcription and post-translation levels. Thirdly, we review several oncogenic pathways cooperatively regulated by lncRNAs and EZH2, including the Wnt/β-catenin and p53 pathways. In conclusion, lncRNAs play a key role in the EZH2-regulated oncogenic network with many fertile directions to be explored.
Affiliation(s)
- Aixin Hao
- Key Laboratory of Saline-alkali Vegetation Ecology Restoration, Ministry of Education, College of Life Science, Northeast Forestry University, Harbin 150040, China
- Yunxuan Wang
- Department of Medical Oncology, Harbin Medical University Cancer Hospital, Harbin 150081, China
- Daniel B Stovall
- College of Arts and Sciences, Winthrop University, Rock Hill, SC 29733, USA
- Yu Wang
- Key Laboratory of Saline-alkali Vegetation Ecology Restoration, Ministry of Education, College of Life Science, Northeast Forestry University, Harbin 150040, China
- Guangchao Sui
- Key Laboratory of Saline-alkali Vegetation Ecology Restoration, Ministry of Education, College of Life Science, Northeast Forestry University, Harbin 150040, China
8
RNA:DNA triple helices: from peculiar structures to pervasive chromatin regulators. Essays Biochem 2021; 65:731-740. [PMID: 33835128 DOI: 10.1042/ebc20200089] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/07/2020] [Revised: 03/10/2021] [Accepted: 03/23/2021] [Indexed: 11/17/2022]
Abstract
The genomes of complex eukaryotes largely contain non-protein-coding DNA, which is pervasively transcribed into a plethora of non-coding RNAs (ncRNAs). The functional importance of many of these ncRNAs has been investigated in the last two decades, revealing their crucial and multifaceted roles in chromatin regulation. A common mode of action of ncRNAs is the recruitment of chromatin modifiers to specific regions in the genome. Whereas many ncRNA-protein interactions have been characterised in detail, binding of ncRNAs to their DNA target sites is much less understood. Recently developed RNA-centric methods have mapped the genome-wide distribution of ncRNAs, however, how ncRNAs achieve locus-specificity remains mainly unresolved. In terms of direct RNA-DNA interactions, two kinds of triple-stranded structures can be formed: R-loops consisting of an RNA:DNA hybrid and a looped out DNA strand, and RNA:DNA triple helices (triplexes), in which the RNA binds to the major groove of the DNA double helix by sequence-specific Hoogsteen base pairing. In this essay, we will review the current knowledge about RNA:DNA triplexes, summarising triplex formation rules, detection methods, and ncRNAs reported to engage in triplexes. While the functional characterisation of RNA:DNA triplexes is still anecdotal, recent advances in high-throughput and computational analyses indicate their widespread distribution in the genome. Thus, we are witnessing a paradigm shift in the appreciation of RNA:DNA triplexes, away from exotic structures towards a prominent mode of ncRNA-chromatin interactions.
9
Pabis K. Triplex and other DNA motifs show motif-specific associations with mitochondrial DNA deletions and species lifespan. Mech Ageing Dev 2021; 194:111429. [PMID: 33422563 DOI: 10.1016/j.mad.2021.111429] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 09/06/2020] [Revised: 01/02/2021] [Accepted: 01/03/2021] [Indexed: 11/20/2022]
Abstract
The "theory of resistant biomolecules" posits that long-lived species show resistance to molecular damage at the level of their biomolecules. Here, we test this hypothesis in the context of mitochondrial DNA (mtDNA) as it implies that predicted mutagenic DNA motifs should be inversely correlated with species maximum lifespan (MLS). First, we confirmed that guanine-quadruplex and direct repeat (DR) motifs are mutagenic, as they associate with mtDNA deletions in the human major arc of mtDNA, while also adding mirror repeat (MR) and intramolecular triplex motifs to a growing list of potentially mutagenic features. What is more, triplex motifs showed disease-specific associations with deletions and an apparent interaction with guanine-quadruplex motifs. Surprisingly, even though DR, MR and guanine-quadruplex motifs were associated with mtDNA deletions, their correlation with MLS was explained by the biased base composition of mtDNA. Only triplex motifs negatively correlated with MLS even after adjusting for body mass, phylogeny, mtDNA base composition and effective number of codons. Taken together, our work highlights the importance of base composition for the comparative biogerontology of mtDNA and suggests that future research on mitochondrial triplex motifs is warranted.
Affiliation(s)
- Kamil Pabis
- Georg August University of Göttingen, Göttingen, Germany.
10
Jalali S, Singh A, Scaria V, Maiti S. Genome-Wide Computational Analysis and Validation of Potential Long Noncoding RNA-Mediated DNA-DNA-RNA Triplexes in the Human Genome. Methods Mol Biol 2021; 2254:61-71. [PMID: 33326070 DOI: 10.1007/978-1-0716-1158-6_5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 12/13/2022]
Abstract
Long noncoding RNAs are well studied for their regulatory actions through interaction with DNA regulating biological roles of DNA, RNA, or protein. However, direct binding of lncRNA with DNA is rarely demonstrated in experiments. The present protocol explains genome wide computational strategies to choose lncRNAs that can bind directly to the chromatin by forming highly stable DNA-DNA-RNA triplexes. The chapter also focuses on biophysical methods that can be used to validate the computationally derived lncRNA-gene targets in vitro.
Affiliation(s)
- Saakshi Jalali
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
- Reliance Technology Group, Reliance Industries Limited, Navi Mumbai, India
- Amrita Singh
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
- Vinod Scaria
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), CSIR IGIB South Campus, Delhi, India
- Souvik Maiti
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), CSIR IGIB South Campus, Delhi, India
11
Alam T, Al-Absi HRH, Schmeier S. Deep Learning in LncRNAome: Contribution, Challenges, and Perspectives. Noncoding RNA 2020; 6:E47. [PMID: 33266128 PMCID: PMC7711891 DOI: 10.3390/ncrna6040047] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 07/25/2020] [Revised: 10/27/2020] [Accepted: 11/06/2020] [Indexed: 12/11/2022]
Abstract
Long non-coding RNAs (lncRNA), the pervasively transcribed part of the mammalian genome, have played a significant role in changing our protein-centric view of genomes. The abundance of lncRNAs and their diverse roles across cell types have opened numerous avenues for the research community regarding lncRNAome. To discover and understand lncRNAome, many sophisticated computational techniques have been leveraged. Recently, deep learning (DL)-based modeling techniques have been successfully used in genomics due to their capacity to handle large amounts of data and produce relatively better results than traditional machine learning (ML) models. DL-based modeling techniques have now become a choice for many modeling tasks in the field of lncRNAome as well. In this review article, we summarized the contribution of DL-based methods in nine different lncRNAome research areas. We also outlined DL-based techniques leveraged in lncRNAome, highlighting the challenges computational scientists face while developing DL-based models for lncRNAome. To the best of our knowledge, this is the first review article that summarizes the role of DL-based techniques in multiple areas of lncRNAome.
Affiliation(s)
- Tanvir Alam
- College of Science and Engineering, Hamad Bin Khalifa University, Doha 34110, Qatar
- Hamada R. H. Al-Absi
- College of Science and Engineering, Hamad Bin Khalifa University, Doha 34110, Qatar
- Sebastian Schmeier
- School of Natural and Computational Sciences, Massey University, Auckland 0632, New Zealand
12
Zhang Y, Long Y, Kwoh CK. Deep learning based DNA:RNA triplex forming potential prediction. BMC Bioinformatics 2020; 21:522. [PMID: 33183242 PMCID: PMC7663897 DOI: 10.1186/s12859-020-03864-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Received: 07/10/2020] [Accepted: 11/09/2020] [Indexed: 01/14/2023]
Abstract
BACKGROUND: Long non-coding RNAs (lncRNAs) can exert functions by forming triplexes with DNA. Current methods for predicting triplex formation mainly rely on mathematical statistics based on base-pairing rules. However, these methods have two main limitations: (1) they identify a large number of triplex-forming lncRNAs, but the limited number of experimentally verified triplex-forming lncRNAs indicates that not all of them may form triplexes in practice, and (2) their predictions only consider the theoretical relationship while lacking features from experimentally verified data. RESULTS: In this work, we develop an integrated program named TriplexFPP (Triplex Forming Potential Prediction), which is the first machine learning model in DNA:RNA triplex prediction. TriplexFPP predicts the most likely triplex-forming lncRNAs and DNA sites based on the experimentally verified data, where the high-level features are learned by convolutional neural networks. In fivefold cross validation, the average areas under the ROC and PRC curves for the redundancy-removed triplex-forming lncRNA dataset (threshold 0.8) are 0.9649 and 0.9996, and these two values for triplex DNA site prediction are 0.8705 and 0.9671, respectively. In addition, we briefly summarize the cis and trans targeting of triplex lncRNAs. CONCLUSIONS: TriplexFPP is able to predict the most likely triplex-forming lncRNAs from all the lncRNAs with computationally defined triplex-forming capacities, as well as the potential of a DNA site to become a triplex. It may provide insights into the exploration of lncRNA functions.
Affiliation(s)
- Yu Zhang
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, 639798, Singapore
- Yahui Long
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410000, China
- Chee Keong Kwoh
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, 639798, Singapore
13
Towards a comprehensive pipeline to identify and functionally annotate long noncoding RNA (lncRNA). Comput Biol Med 2020; 127:104028. [PMID: 33126123 DOI: 10.1016/j.compbiomed.2020.104028] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 08/04/2020] [Revised: 09/28/2020] [Accepted: 09/29/2020] [Indexed: 12/20/2022]
Abstract
Long noncoding RNAs (lncRNAs) are implicated in various genetic diseases and cancer, attributed to their critical role in gene regulation. They are a divergent group of RNAs and are easily differentiated from other types with unique characteristics, functions, and mechanisms of action. In this review, we provide a list of some of the prominent data repositories containing lncRNAs, their interactome, and predicted and validated disease associations. Next, we discuss various wet-lab experiments formulated to obtain the data for these repositories. We also provide a critical review of in silico methods available for the identification purpose and suggest techniques to further improve their performance. The bulk of the methods currently focus on distinguishing lncRNA transcripts from the coding ones. Functional annotation of these transcripts still remains a grey area and more efforts are needed in that space. Finally, we provide details of current progress, discuss impediments, and illustrate a roadmap for developing a generalized computational pipeline for comprehensive annotation of lncRNAs, which is essential to accelerate research in this area.
14
Kazimierczyk M, Kasprowicz MK, Kasprzyk ME, Wrzesinski J. Human Long Noncoding RNA Interactome: Detection, Characterization and Function. Int J Mol Sci 2020; 21:E1027. [PMID: 32033158 PMCID: PMC7037361 DOI: 10.3390/ijms21031027] [Citation(s) in RCA: 113] [Impact Index Per Article: 28.3] [Received: 01/03/2020] [Revised: 01/31/2020] [Accepted: 02/02/2020] [Indexed: 01/17/2023]
Abstract
The application of a new generation of sequencing techniques has revealed that most of the genome has already been transcribed. However, only a small part of the genome codes proteins. The rest of the genome "dark matter" belongs to divergent groups of non-coding RNA (ncRNA), that is not translated into proteins. There are two groups of ncRNAs, which include small and long non-coding RNAs (sncRNA and lncRNA respectively). Over the last decade, there has been an increased interest in lncRNAs and their interaction with cellular components. In this review, we presented the newest information about the human lncRNA interactome. The term lncRNA interactome refers to cellular biomolecules, such as nucleic acids, proteins, and peptides that interact with lncRNA. The lncRNA interactome was characterized in the last decade, however, understanding what role the biomolecules associated with lncRNA play and the nature of these interactions will allow us to better understand lncRNA's biological functions in the cell. We also describe a set of methods currently used for the detection of lncRNA interactome components and the analysis of their interactions. We think that such a holistic and integrated analysis of the lncRNA interactome will help to better understand its potential role in the development of organisms and cancers.
Affiliation(s)
- Jan Wrzesinski
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznań, Poland (M.K.K.); (M.E.K.)
15
Antonov IV, Mazurov E, Borodovsky M, Medvedeva YA. Prediction of lncRNAs and their interactions with nucleic acids: benchmarking bioinformatics tools. Brief Bioinform 2019; 20:551-564. [PMID: 29697742 DOI: 10.1093/bib/bby032] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Received: 11/04/2017] [Revised: 03/26/2018] [Indexed: 01/22/2023]
Abstract
The genomes of mammalian species are pervasively transcribed, producing as many noncoding as protein-coding RNAs. There is a growing body of evidence supporting their functional role. Long noncoding RNA (lncRNA) can bind both nucleic acids and proteins through several mechanisms. A reliable computational prediction of the most probable mechanism of lncRNA interaction can facilitate experimental validation of its function. In this study, we benchmarked computational tools capable of discriminating lncRNA from mRNA and predicting lncRNA interactions with other nucleic acids. We assessed the performance of 9 tools for distinguishing protein-coding from noncoding RNAs, as well as 19 tools for prediction of RNA-RNA and RNA-DNA interactions. Our conclusions about the considered tools were based on their performance at the entire genome/transcriptome level, as it is the most common task nowadays. We found that FEELnc and CPAT distinguish between coding and noncoding mammalian transcripts most accurately. ASSA, RIBlast and LASTAL, as well as Triplexator, turned out to be the best predictors of RNA-RNA and RNA-DNA interactions, respectively. We showed that normalizing the predicted interaction strength to transcript length and GC content may improve the accuracy of inferring RNA interactions. Yet all the current tools have difficulty making accurate predictions of short-trans RNA-RNA interactions, stretches of sparse contacts. Overall, there is still room for improvement in each category, especially for predictions of RNA interactions.
Affiliation(s)
- Ivan V Antonov
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Science, Moscow, Russian Federation; Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation
| | | | - Mark Borodovsky
- Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation
| | - Yulia A Medvedeva
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Science, Moscow, Russian Federation; Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, Russian Federation; Department of Computational Biology, Vavilov Institute of General Genetics, Russian Academy of Science, Moscow, Russian Federation
| |
|
16
|
Abstract
Gene maps, or annotations, enable us to navigate the functional landscape of our genome. They are a resource upon which virtually all studies depend, from single-gene to genome-wide scales and from basic molecular biology to medical genetics. Yet present-day annotations suffer from trade-offs between quality and size, with serious but often unappreciated consequences for downstream studies. This is particularly true for long non-coding RNAs (lncRNAs), which are poorly characterized compared to protein-coding genes. Long-read sequencing technologies promise to improve current annotations, paving the way towards a complete annotation of lncRNAs expressed throughout a human lifetime.
|
17
|
Long Noncoding RNA MEG3 Is an Epigenetic Determinant of Oncogenic Signaling in Functional Pancreatic Neuroendocrine Tumor Cells. Mol Cell Biol 2017; 37:MCB.00278-17. [PMID: 28847847 DOI: 10.1128/mcb.00278-17] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Received: 05/23/2017] [Accepted: 08/22/2017] [Indexed: 12/26/2022] Open
Abstract
The long noncoding RNA (lncRNA) MEG3 is significantly downregulated in pancreatic neuroendocrine tumors (PNETs). MEG3 loss corresponds with aberrant upregulation of the oncogenic hepatocyte growth factor (HGF) receptor c-MET in PNETs. Meg3 overexpression in a mouse insulin-secreting PNET cell line, MIN6, downregulates c-Met expression. However, the molecular mechanism by which MEG3 regulates c-MET is not known. Using chromatin isolation by RNA purification and sequencing (ChIRP-Seq), we identified Meg3 binding to unique genomic regions in and around the c-Met gene. In the absence of Meg3, these c-Met regions displayed distinctive enhancer-signature histone modifications. Furthermore, Meg3 relied on functional enhancer of zeste homolog 2 (EZH2), a component of polycomb repressive complex 2 (PRC2), to inhibit c-Met expression. Another mechanism of lncRNA-mediated regulation of gene expression utilized triplex-forming GA-GT rich sequences. Transfection of such motifs from Meg3 RNA, termed triplex-forming oligonucleotides (TFOs), in MIN6 cells suppressed c-Met expression and enhanced cell proliferation, perhaps by modulating other targets. This study comprehensively establishes epigenetic mechanisms underlying Meg3 control of c-Met and the oncogenic consequences of Meg3 loss or c-Met gain. These findings have clinical relevance for targeting c-MET in PNETs. There is also the potential for pancreatic islet β-cell expansion through c-MET regulation to ameliorate β-cell loss in diabetes.
|
18
|
Jalali S, Singh A, Maiti S, Scaria V. Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome. J Transl Med 2017; 15:186. [PMID: 28865451 PMCID: PMC7670996 DOI: 10.1186/s12967-017-1282-9] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Received: 12/31/2016] [Accepted: 08/18/2017] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND: Only a handful of long noncoding RNAs have been functionally characterized. They are known to modulate regulation by interacting with other biomolecules in the cell: DNA, RNA and protein. Though there have been detailed investigations of lncRNA-miRNA and lncRNA-protein interactions, the interaction of lncRNAs with DNA has not been studied extensively. In the present study, we explore whether lncRNAs could modulate genomic regulation by interacting with DNA through the formation of highly stable DNA:DNA:RNA triplexes. METHODS: We computationally screened 23,898 lncRNA transcripts, as annotated by GENCODE, across the human genome for potential triplex forming sequence stretches (PTS). The PTS frequencies were compared across 5'UTR, CDS, 3'UTR, introns, promoter and 1000 bases downstream of the transcription termination sites. These regions were annotated by mapping to experimental regulatory regions, classes of repeat regions and transcription factors. Using biophysical methods, we validated a few putative triplex-mediated interactions in which the lncRNA-gene pair interaction is via a pyrimidine triplex motif. RESULTS: We identified 2,004,034 PTS sites that are enriched in promoter and intronic regions across the human genome. Additional analysis of the association of PTS with core promoter elements revealed a systematic paucity of PTS in all regulatory regions, except TF binding sites. A total of 25 transcription factors were found to be associated with PTS. Using an interaction network, we showed that a subset of the triplex forming lncRNAs have a positive association with gene promoters. We also demonstrated an in vitro interaction of one lncRNA candidate with its predicted gene target promoter regions. CONCLUSIONS: Our analysis shows that PTS are enriched in gene promoters and largely associated with simple repeats. The current study suggests a major role of a subset of lncRNAs in mediating chromatin organization modulation through CTCF and NSRF proteins.
Affiliation(s)
- Saakshi Jalali
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Mathura Road, Delhi, 110020 India
- Academy of Scientific and Innovative Research (AcSIR), CSIR IGIB South Campus, Mathura Road, Delhi, 110020 India
| | - Amrita Singh
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Mathura Road, Delhi, 110020 India
- Academy of Scientific and Innovative Research (AcSIR), CSIR IGIB South Campus, Mathura Road, Delhi, 110020 India
| | - Souvik Maiti
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Mathura Road, Delhi, 110020 India
| | - Vinod Scaria
- CSIR Institute of Genomics and Integrative Biology (CSIR-IGIB), Mathura Road, Delhi, 110020 India
- Academy of Scientific and Innovative Research (AcSIR), CSIR IGIB South Campus, Mathura Road, Delhi, 110020 India
| |
|
19
|
p53 Specifically Binds Triplex DNA In Vitro and in Cells. PLoS One 2016; 11:e0167439. [PMID: 27907175 PMCID: PMC5131957 DOI: 10.1371/journal.pone.0167439] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Received: 07/23/2016] [Accepted: 11/14/2016] [Indexed: 11/30/2022] Open
Abstract
Triplex DNA is implicated in a wide range of biological activities, including regulation of gene expression and genomic instability leading to cancer. The tumor suppressor p53 is a central regulator of cell fate in response to different types of insult. Sequence- and structure-specific modes of DNA recognition are core attributes of the p53 protein. The focus of this work is the structure-specific binding of p53 to DNA containing triplex-forming sequences in vitro and in cells and the effect on p53-driven transcription. This is the first study of the binding of full-length p53 and its deletion variants to both intermolecular and intramolecular T.A.T triplexes. We demonstrate that the interaction of p53 with an intermolecular T.A.T triplex is comparable to its recognition of the CTG-hairpin non-B DNA structure. Using deletion mutants, we determined the C-terminal DNA binding domain of p53 to be crucial for triplex recognition. Furthermore, strong p53 recognition of intramolecular T.A.T triplexes (H-DNA), stabilized by negative superhelicity in plasmid DNA, was detected by competition and immunoprecipitation experiments, and visualized by AFM. Moreover, chromatin immunoprecipitation revealed p53 binding to a T.A.T-forming sequence in vivo. Enhanced reporter transactivation by p53 on insertion of a triplex-forming sequence into a plasmid with a p53 consensus sequence was observed by luciferase reporter assays. An in silico scan of human regulatory regions for the simultaneous presence of both the consensus sequence and T.A.T motifs identified a set of candidate p53 target genes, and p53-dependent activation of several of them (ABCG5, ENOX1, INSR, MCC, NFAT5) was confirmed by RT-qPCR. Our results show that T.A.T triplexes comprise a new class of p53 binding sites targeted by p53 in a DNA structure-dependent mode in vitro and in cells. The contribution of p53 DNA structure-dependent binding to the regulation of transcription is discussed.
|
20
|
Signal B, Gloss BS, Dinger ME. Computational Approaches for Functional Prediction and Characterisation of Long Noncoding RNAs. Trends Genet 2016; 32:620-637. [PMID: 27592414 DOI: 10.1016/j.tig.2016.08.004] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Received: 07/04/2016] [Revised: 08/03/2016] [Accepted: 08/04/2016] [Indexed: 02/09/2023]
Abstract
Although a considerable portion of eukaryotic genomes is transcribed as long noncoding RNAs (lncRNAs), the vast majority are functionally uncharacterised. The rapidly expanding catalogue of mechanistically investigated lncRNAs has provided evidence for distinct functional subclasses, which are now ripe for exploitation as a general model to predict functions for uncharacterised lncRNAs. By utilising publicly-available genome-wide datasets and computational methods, we present several developed and emerging in silico approaches to characterise and predict the functions of lncRNAs. We propose that the application of these techniques provides valuable functional and mechanistic insight into lncRNAs, and is a crucial step for informing subsequent functional studies.
Affiliation(s)
- Bethany Signal
- Garvan Institute of Medical Research, Sydney, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, Australia
| | - Brian S Gloss
- Garvan Institute of Medical Research, Sydney, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, Australia
| | - Marcel E Dinger
- Garvan Institute of Medical Research, Sydney, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, Australia.
| |
|
21
|
Goldsmith G, Rathinavelan T, Yathindra N. Selective Preference of Parallel DNA Triplexes Is Due to the Disruption of Hoogsteen Hydrogen Bonds Caused by the Severe Nonisostericity between the G*GC and T*AT Triplets. PLoS One 2016; 11:e0152102. [PMID: 27010368 PMCID: PMC4807104 DOI: 10.1371/journal.pone.0152102] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Received: 01/11/2016] [Accepted: 03/08/2016] [Indexed: 12/14/2022] Open
Abstract
Implications of DNA, RNA and RNA.DNA hybrid triplexes in diverse biological functions, diseases and therapeutic applications call for a thorough understanding of their structure-function relationships. Despite exhaustive studies, a mechanistic rationale for the discriminatory preference of parallel DNA triplexes with G*GC and T*AT triplets still remains elusive. Here, we show that the highest nonisostericity between the G*GC and T*AT triplets imposes extensive stereochemical rearrangements, contributing to context-dependent triplex destabilisation through selective disruption of the Hoogsteen scheme of hydrogen bonds. MD simulations of nineteen DNA triplexes with an assortment of sequence milieu reveal for the first time fresh insights into the nature and extent of destabilization from a single (non-overlapping), double (overlapping) and multiple pairs of nonisosteric base triplets (NIBTs). It is found that a solitary pair of NIBTs, feasible either at a G*GC/T*AT or T*AT/G*GC triplex junction, does not impinge significantly on triplex stability. But two overlapping pairs of NIBTs resulting from either a T*AT or a G*GC interruption disrupt the Hoogsteen pair to a noncanonical mismatch, destabilizing the triplex by ~10 to 14 kcal/mol, implying that their frequent incidence in multiples, especially in short sequences, could even hinder triplex formation. The results provide (i) an unambiguous and generalised mechanistic rationale for the discriminatory trait of parallel triplexes, including those studied experimentally, (ii) clarity for the prevalence of antiparallel triplexes and (iii) comprehensive perspectives on the sequence-dependent influence of nonisosteric base triplets useful in the rational design of TFOs against potential triplex target sites.
Affiliation(s)
- Gunaseelan Goldsmith
- Institute of Bioinformatics and Applied Biotechnology, Biotech Park, Electronics City Phase I, Bangalore, India
- Manipal University, Manipal, India
| | | | - Narayanarao Yathindra
- Institute of Bioinformatics and Applied Biotechnology, Biotech Park, Electronics City Phase I, Bangalore, India
| |
|
22
|
McFadden EJ, Hargrove AE. Biochemical Methods To Investigate lncRNA and the Influence of lncRNA:Protein Complexes on Chromatin. Biochemistry 2016; 55:1615-30. [PMID: 26859437 DOI: 10.1021/acs.biochem.5b01141] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Indexed: 02/07/2023]
Abstract
Long noncoding RNAs (lncRNAs), defined as nontranslated transcripts greater than 200 nucleotides in length, are often differentially expressed throughout developmental stages, tissue types, and disease states. The identification, visualization, and suppression/overexpression of these sequences have revealed impacts on a wide range of biological processes, including epigenetic regulation. Biochemical investigations on select systems have revealed striking insight into the biological roles of lncRNAs and lncRNA:protein complexes, which in turn prompt even more unanswered questions. To begin, multiple protein- and RNA-centric technologies have been employed to isolate lncRNA:protein and lncRNA:chromatin complexes. LncRNA interactions with the multi-subunit protein complex PRC2, which acts as a transcriptional silencer, represent some of the few cases where the binding affinity, selectivity, and activity of a lncRNA:protein complex have been investigated. At the same time, recent reports of full-length lncRNA secondary structures suggest the formation of complex structures with multiple independent folding domains and pave the way for more detailed structural investigations and predictions of lncRNA three-dimensional structure. This review will provide an overview of the methods and progress made to date as well as highlight new methods that promise to further inform the molecular recognition, specificity, and function of lncRNAs.
Affiliation(s)
- Emily J McFadden
- Department of Biochemistry, Duke University Medical Center, Durham, North Carolina 27710, United States
| | - Amanda E Hargrove
- Department of Biochemistry, Duke University Medical Center, Durham, North Carolina 27710, United States; Department of Chemistry, Duke University, 124 Science Drive, Durham, North Carolina 27708, United States
| |
|
23
|
Jenjaroenpun P, Chew CS, Yong TP, Choowongkomon K, Thammasorn W, Kuznetsov VA. The TTSMI database: a catalog of triplex target DNA sites associated with genes and regulatory elements in the human genome. Nucleic Acids Res 2014; 43:D110-6. [PMID: 25324314 PMCID: PMC4384029 DOI: 10.1093/nar/gku970] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Indexed: 12/19/2022] Open
Abstract
A triplex target DNA site (TTS), a stretch of DNA that is composed of polypurines, is able to form a triple-helix (triplex) structure with triplex-forming oligonucleotides (TFOs) and is able to influence the site-specific modulation of gene expression and/or the modification of genomic DNA. The co-localization of a genomic TTS with gene regulatory signals and functional genome structures suggests that TFOs could potentially be exploited in antigene strategies for the therapy of cancers and other genetic diseases. Here, we present the TTS Mapping and Integration (TTSMI; http://ttsmi.bii.a-star.edu.sg) database, which provides a catalog of unique TTS locations in the human genome and tools for analyzing the co-localization of TTSs with genomic regulatory sequences and signals that were identified using next-generation sequencing techniques and/or predicted by computational models. TTSMI was designed as a user-friendly tool that facilitates (i) fast searching/filtering of TTSs using several search terms and criteria associated with sequence stability and specificity, (ii) interactive filtering of TTSs that co-localize with gene regulatory signals and non-B DNA structures, (iii) exploration of dynamic combinations of the biological signals of specific TTSs and (iv) visualization of a TTS simultaneously with diverse annotation tracks via the UCSC genome browser.
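As a rough illustration of the "stretch of DNA that is composed of polypurines" idea, the following minimal Python sketch lists polypurine runs on one strand of a sequence. It is not the TTSMI pipeline: the function name `find_polypurine_tracts` and the `min_len` default are assumptions made for this example, and the database's actual stability and specificity criteria are not described in the abstract.

```python
import re


def find_polypurine_tracts(seq, min_len=15):
    """Return (start, end, tract) for each run of A/G of at least min_len bases.

    Hypothetical helper for illustration only. Coordinates are 0-based and
    end-exclusive; only the given strand is scanned, so tracts on the opposite
    strand would need a second pass over the reverse complement.
    """
    pattern = re.compile(r"[AG]{%d,}" % min_len)
    return [(m.start(), m.end(), m.group()) for m in pattern.finditer(seq.upper())]


if __name__ == "__main__":
    example = "ttcAGGAGAGGAAGGAGAtcc"
    for start, end, tract in find_polypurine_tracts(example, min_len=10):
        print(start, end, tract)
```

A real screen would also have to score candidate sites for uniqueness in the genome, which this sketch does not attempt.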
Affiliation(s)
- Piroon Jenjaroenpun
- Department of Genome and Gene Expression Data Analysis, Bioinformatics Institute, 138671, Singapore; Interdisciplinary Graduate Program in Genetic Engineering, Graduate School, Kasetsart University, Bangkean, Bangkok 10900, Thailand
| | - Chee Siang Chew
- Open source Computing and Technology Innovation, Bioinformatics Institute, 138671, Singapore
| | - Tai Pang Yong
- Open source Computing and Technology Innovation, Bioinformatics Institute, 138671, Singapore
| | - Kiattawee Choowongkomon
- Department of Biochemistry, Faculty of Science, Kasetsart University, 50 Ngam Wong Wan Rd, Chatuchak, Bangkok 10900, Thailand
| | - Wimada Thammasorn
- Bioinformatics and Systems Biology Program, King Mongkut's University of Technology Thonburi (Bang Khun Thian Campus), 49 Soi Thian Thale 25, Bang Khun Thian Chai Thale Rd, Tha Kham, Bangkok 10150, Thailand
| | - Vladimir A Kuznetsov
- Department of Genome and Gene Expression Data Analysis, Bioinformatics Institute, 138671, Singapore
| |
|
24
|
Abstract
A number of technologies, including CRISPR/Cas, transcription activator-like effector nucleases and zinc-finger nucleases, allow the user to target a chosen locus for genome editing or regulatory interference. Specificity, however, is a major problem, and the targeted locus must be chosen with care to avoid inadvertently affecting other loci ('off-targets') in the genome. To address this we have created 'Genome Target Scan' (GT-Scan), a flexible web-based tool that ranks all potential targets in a user-selected region of a genome in terms of how many off-targets they have. GT-Scan gives the user flexibility to define the desired characteristics of targets and off-targets via a simple 'target rule', and its interactive output allows detailed inspection of each of the most promising candidate targets. GT-Scan can be used to identify optimal targets for CRISPR/Cas systems, but its flexibility gives it potential to be adapted to other genome-targeting technologies as well. AVAILABILITY AND IMPLEMENTATION: GT-Scan can be run via the web at: http://gt-scan.braembl.org.au.
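The ranking idea in this abstract (order candidate targets in a region by how many off-targets they have) can be sketched in a few lines of Python. This is a brute-force toy under stated assumptions, not GT-Scan's algorithm or its 'target rule' syntax: the function names, the fixed candidate length `k`, the forward-strand-only scan and the use of a plain mismatch count are all choices made for this example.

```python
def hamming(a, b):
    """Number of mismatched positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def rank_targets(genome, region_start, region_end, k=20, max_mismatches=2):
    """Rank k-mers inside genome[region_start:region_end] by off-target count.

    Hypothetical helper for illustration only. An off-target is any other
    forward-strand position in the genome whose k-mer differs from the
    candidate by at most max_mismatches bases. Returns a list of
    (start, sequence, off_target_count), fewest off-targets first.
    """
    last_start = len(genome) - k
    ranked = []
    for start in range(region_start, region_end - k + 1):
        candidate = genome[start:start + k]
        off_targets = sum(
            1
            for pos in range(last_start + 1)
            if pos != start and hamming(candidate, genome[pos:pos + k]) <= max_mismatches
        )
        ranked.append((start, candidate, off_targets))
    # Sort by off-target count, breaking ties by position in the region.
    return sorted(ranked, key=lambda item: (item[2], item[0]))


if __name__ == "__main__":
    toy_genome = "AAAACCCCAAAAGGGG"
    for start, seq, count in rank_targets(toy_genome, 0, 8, k=4, max_mismatches=0):
        print(start, seq, count)
```

The quadratic scan is only practical for toy inputs; a genome-scale tool would need an index of some kind, which is outside what the abstract describes.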
Affiliation(s)
- Aidan O'Brien
- Genomics and Computational Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Qld. 4072, Australia
| | - Timothy L Bailey
- Genomics and Computational Biology, Institute for Molecular Bioscience, The University of Queensland, Brisbane, Qld. 4072, Australia
| |
|