1
|
Bartas M, Brázda V, Pečinka P. Special Issue "Bioinformatics of Unusual DNA and RNA Structures". Int J Mol Sci 2024; 25:5226. [PMID: 38791265 PMCID: PMC11121459 DOI: 10.3390/ijms25105226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 04/29/2024] [Accepted: 05/06/2024] [Indexed: 05/26/2024] Open
Abstract
Nucleic acids are not only static carriers of genetic information but also play vital roles in controlling cellular lifecycles through their fascinating structural diversity [...].
Collapse
Affiliation(s)
- Martin Bartas
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic;
| | - Václav Brázda
- Institute of Biophysics, Czech Academy of Sciences, Královopolská 135, 612 00 Brno, Czech Republic;
| | - Petr Pečinka
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic;
| |
Collapse
|
2
|
Yang CC, Wang ZY, Cheng CM. Insights into Superinfection Immunity Regulation of Xanthomonas axonopodis Filamentous Bacteriophage cf. Curr Microbiol 2023; 81:42. [PMID: 38112972 DOI: 10.1007/s00284-023-03539-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 10/26/2023] [Indexed: 12/21/2023]
Abstract
Filamentous bacteriophage cf infects Xanthomonas axonopodis pv. citri, a serious plant pathogen which causes citrus canker. To understand the immunity regulation of bacteria infected with bacteriophage cf, we applied DNA shuffling to mutate the cf intergenic region. One of the immunity mutants, cf-m3 (NCBI Taxonomy ID: 3050368) expressed a 106-109 fold greater superinfection ability compared with wild type cf. Nine mutations were identified on the cf-m3 phage, four of which were located within the coding region of an open reading frame (ORF165) for a hypothetical repressor, PT, and five located upstream of the PT coding region. A set of phages with mutations to the predicted PT protein or the upstream coding region were generated. All showed similarly low superinfection efficiency to wild type cf and no superinfection ability on cf lysogens. The results indicate that rather than superinfection inhibition, the PT protein and the un-transcribed cis element function individually as positive regulators of cf superinfection immunity. Greater superinfection ability depends on the simultaneous presence of both elements. This work yields further insight into the possible control of citrus canker disease through phages that overcome host superinfection immunity.
Collapse
Affiliation(s)
- Chia-Chin Yang
- Department of Medical Research, National Taiwan University Hospital, Taipei, Taiwan
| | - Zih-Yun Wang
- Department of Biomedical Sciences and Engineering, Tzu-Chi University, 701 Chung Yang Road Section 3, Hualien, 970, Taiwan
| | - Ching-Ming Cheng
- Department of Biomedical Sciences and Engineering, Tzu-Chi University, 701 Chung Yang Road Section 3, Hualien, 970, Taiwan.
| |
Collapse
|
3
|
Song K, Brochu HN, Zhang Q, Williams JD, Iyer LK. An In Silico Analysis of PCR-Based Monkeypox Virus Detection Assays: A Case Study for Ongoing Clinical Surveillance. Viruses 2023; 15:2327. [PMID: 38140568 PMCID: PMC10747849 DOI: 10.3390/v15122327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2023] [Revised: 11/17/2023] [Accepted: 11/21/2023] [Indexed: 12/24/2023] Open
Abstract
The 2022 global Mpox outbreak swiftly introduced unforeseen diversity in the monkeypox virus (MPXV) population, resulting in numerous Clade IIb sublineages. This propagation of new MPXV mutations warrants the thorough re-investigation of previously recommended or validated primers designed to target MPXV genomes. In this study, we explored 18 PCR primer sets and examined their binding specificity against 5210 MPXV genomes, representing all the established MPXV lineages. Our results indicated that only five primer sets resulted in almost all perfect matches against the targeted MPXV lineages, and the remaining primer sets all contained 1-2 mismatches against almost all the MPXV lineages. We further investigated the mismatched primer-genome pairs and discovered that some of the primers overlapped with poorly sequenced and assembled regions of the MPXV genomes, which are consistent across multiple lineages. However, we identified 173 99% genome-wide conserved regions across all 5210 MPXV genomes, representing 30 lineages/clades with at least 80% lineage-specific consensus for future primer development and primer binding evaluation. This exercise is crucial to ensure that the current detection schemes are robust and serve as a framework for primer evaluation in clinical testing development for other infectious diseases.
Collapse
Affiliation(s)
- Kuncheng Song
- Center of Excellence for Bioinformatics, Data Science and AI, Laboratory Corporation of America Holdings (Labcorp), Burlington, NC 27215, USA; (K.S.); (H.N.B.); (Q.Z.)
| | - Hayden N. Brochu
- Center of Excellence for Bioinformatics, Data Science and AI, Laboratory Corporation of America Holdings (Labcorp), Burlington, NC 27215, USA; (K.S.); (H.N.B.); (Q.Z.)
| | - Qimin Zhang
- Center of Excellence for Bioinformatics, Data Science and AI, Laboratory Corporation of America Holdings (Labcorp), Burlington, NC 27215, USA; (K.S.); (H.N.B.); (Q.Z.)
| | - Jonathan D. Williams
- Labcorp Research and Development, Laboratory Corporation of America Holdings (Labcorp), Burlington, NC 27215, USA;
| | - Lakshmanan K. Iyer
- Center of Excellence for Bioinformatics, Data Science and AI, Laboratory Corporation of America Holdings (Labcorp), Burlington, NC 27215, USA; (K.S.); (H.N.B.); (Q.Z.)
| |
Collapse
|
4
|
Yella VR, Vanaja A. Computational analysis on the dissemination of non-B DNA structural motifs in promoter regions of 1180 cellular genomes. Biochimie 2023; 214:101-111. [PMID: 37311475 DOI: 10.1016/j.biochi.2023.06.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 05/05/2023] [Accepted: 06/05/2023] [Indexed: 06/15/2023]
Abstract
The promoter regions of gene regulation are under evolutionary constraints and earlier studies uncovered that they are characterized by enrichment of functional non-B DNA structural signatures like curved DNA, cruciform DNA, G-quadruplex, triple-helical DNA, slipped DNA structures, and Z-DNA. However, these studies are restricted to a few model organisms, single non-B DNA motif types, or whole genomic sequences, and their comparative accumulation in promoter regions of different domains of life has not been reported comprehensively. In this study, for the first time, we investigated the preponderance of non-B DNA-prone motifs in promoter regions in 1180 genomes belonging to 28 taxonomic groups using the non-B DNA Motif Search Tool (nBMST). The trends suggest that they are predominant in promoters compared to the upstream and downstream regions of all three domains of life and variably linked to taxonomic groups. Cruciform DNA motif is the most abundant form of non-B DNA, spanning from archaea to lower eukaryotes. Curved DNA motifs are prominent in host-associated bacteria, and suppressed in mammals. Triplex-DNA and slipped DNA structure repeats are discretely dispersed in all lineages. G-quadruplex motifs are significantly enriched in mammals. We also observed that the unique enrichment of non-B DNA in promoters is strongly linked to genome GC, size, evolutionary time divergence, and ecological adaptations. Overall, our work systematically reports the unique non-B DNA structural landscape of cellular organisms from the perspective of the cis-regulatory code of genomes.
Collapse
Affiliation(s)
- Venkata Rajesh Yella
- Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Guntur, 522302, Andhra Pradesh, India.
| | - Akkinepally Vanaja
- Department of Biotechnology, Koneru Lakshmaiah Education Foundation, Guntur, 522302, Andhra Pradesh, India; KL College of Pharmacy, Koneru Lakshmaiah Education Foundation, Guntur, 522302, Andhra Pradesh, India
| |
Collapse
|
5
|
Porubiaková O, Havlík J, Indu, Šedý M, Přepechalová V, Bartas M, Bidula S, Šťastný J, Fojta M, Brázda V. Variability of Inverted Repeats in All Available Genomes of Bacteria. Microbiol Spectr 2023; 11:e0164823. [PMID: 37358458 PMCID: PMC10434271 DOI: 10.1128/spectrum.01648-23] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 06/03/2023] [Indexed: 06/27/2023] Open
Abstract
Noncanonical secondary structures in nucleic acids have been studied intensively in recent years. Important biological roles of cruciform structures formed by inverted repeats (IRs) have been demonstrated in diverse organisms, including humans. Using Palindrome analyser, we analyzed IRs in all accessible bacterial genome sequences to determine their frequencies, lengths, and localizations. IR sequences were identified in all species, but their frequencies differed significantly across various evolutionary groups. We detected 242,373,717 IRs in all 1,565 bacterial genomes. The highest mean IR frequency was detected in the Tenericutes (61.89 IRs/kbp) and the lowest mean frequency was found in the Alphaproteobacteria (27.08 IRs/kbp). IRs were abundant near genes and around regulatory, tRNA, transfer-messenger RNA (tmRNA), and rRNA regions, pointing to the importance of IRs in such basic cellular processes as genome maintenance, DNA replication, and transcription. Moreover, we found that organisms with high IR frequencies were more likely to be endosymbiotic, antibiotic producing, or pathogenic. On the other hand, those with low IR frequencies were far more likely to be thermophilic. This first comprehensive analysis of IRs in all available bacterial genomes demonstrates their genomic ubiquity, nonrandom distribution, and enrichment in genomic regulatory regions. IMPORTANCE Our manuscript reports for the first time a complete analysis of inverted repeats in all fully sequenced bacterial genomes. Thanks to the availability of unique computational resources, we were able to statistically evaluate the presence and localization of these important regulatory sequences in bacterial genomes. This work revealed a strong abundance of these sequences in regulatory regions and provides researchers with a valuable tool for their manipulation.
Collapse
Affiliation(s)
- Otília Porubiaková
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Jan Havlík
- Mendel University in Brno, Brno, Czech Republic
| | - Indu
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Michal Šedý
- Brno University of Technology, Faculty of Chemistry, Brno, Czech Republic
| | - Veronika Přepechalová
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
- Brno University of Technology, Faculty of Chemistry, Brno, Czech Republic
| | - Martin Bartas
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Stefan Bidula
- School of Pharmacy, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Jiří Šťastný
- Mendel University in Brno, Brno, Czech Republic
- Brno University of Technology, Faculty of Mechanical Engineering, Brno, Czech Republic
| | - Miroslav Fojta
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
- Brno University of Technology, Faculty of Chemistry, Brno, Czech Republic
| |
Collapse
|
6
|
Getz LJ, Brown JM, Sobot L, Chow A, Mahendrarajah J, Thomas N. Attenuation of a DNA cruciform by a conserved regulator directs T3SS1 mediated virulence in Vibrio parahaemolyticus. Nucleic Acids Res 2023; 51:6156-6171. [PMID: 37158250 PMCID: PMC10325908 DOI: 10.1093/nar/gkad370] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 04/23/2023] [Accepted: 04/27/2023] [Indexed: 05/10/2023] Open
Abstract
Pathogenic Vibrio species account for 3-5 million annual life-threatening human infections. Virulence is driven by bacterial hemolysin and toxin gene expression often positively regulated by the winged helix-turn-helix (wHTH) HlyU transcriptional regulator family and silenced by histone-like nucleoid structural protein (H-NS). In the case of Vibrio parahaemolyticus, HlyU is required for virulence gene expression associated with type 3 Secretion System-1 (T3SS1) although its mechanism of action is not understood. Here, we provide evidence for DNA cruciform attenuation mediated by HlyU binding to support concomitant virulence gene expression. Genetic and biochemical experiments revealed that upon HlyU mediated DNA cruciform attenuation, an intergenic cryptic promoter became accessible allowing for exsA mRNA expression and initiation of an ExsA autoactivation feedback loop at a separate ExsA-dependent promoter. Using a heterologous E. coli expression system, we reconstituted the dual promoter elements which revealed that HlyU binding and DNA cruciform attenuation were strictly required to initiate the ExsA autoactivation loop. The data indicate that HlyU acts to attenuate a transcriptional repressive DNA cruciform to support T3SS1 virulence gene expression and reveals a non-canonical extricating gene regulation mechanism in pathogenic Vibrio species.
Collapse
Affiliation(s)
- Landon J Getz
- Department of Microbiology and Immunology, Faculty of Medicine, Dalhousie University. Halifax, NS, Canada
| | - Justin M Brown
- Department of Microbiology and Immunology, Faculty of Medicine, Dalhousie University. Halifax, NS, Canada
| | - Lauren Sobot
- Department of Microbiology and Immunology, Faculty of Medicine, Dalhousie University. Halifax, NS, Canada
| | - Alexandra Chow
- Department of Microbiology and Immunology, Faculty of Medicine, Dalhousie University. Halifax, NS, Canada
| | - Jastina Mahendrarajah
- Department of Microbiology and Immunology, Faculty of Medicine, Dalhousie University. Halifax, NS, Canada
| | - Nikhil A Thomas
- Department of Microbiology and Immunology, Faculty of Medicine, Dalhousie University. Halifax, NS, Canada
- Department of Medicine, Faculty of Medicine, Dalhousie University. Halifax, NS, Canada
| |
Collapse
|
7
|
Kwok ACM, Leung SK, Wong JTY. DNA:RNA Hybrids Are Major Dinoflagellate Minicircle Molecular Types. Int J Mol Sci 2023; 24:ijms24119651. [PMID: 37298602 DOI: 10.3390/ijms24119651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 05/30/2023] [Accepted: 05/31/2023] [Indexed: 06/12/2023] Open
Abstract
Peridinin-containing dinoflagellate plastomes are predominantly encoded in nuclear genomes, with less than 20 essential chloroplast proteins carried on "minicircles". Each minicircle generally carries one gene and a short non-coding region (NCR) with a median length of approximately 400-1000 bp. We report here differential nuclease sensitivity and two-dimensional southern blot patterns, suggesting that dsDNA minicircles are in fact the minor forms, with substantial DNA:RNA hybrids (DRHs). Additionally, we observed large molecular weight intermediates, cell-lysate-dependent NCR secondary structures, multiple bidirectional predicted ssDNA structures, and different southern blot patterns when probed with different NCR fragments. In silico analysis suggested the existence of substantial secondary structures with inverted repeats (IR) and palindrome structures within the initial ~650 bp of the NCR sequences, in accordance with conversion event(s) outcomes with PCR. Based on these findings, we propose a new transcription-templating-translation model, which is associated with cross-hopping shift intermediates. Since dinoflagellate chloroplasts are cytosolic and lack nuclear envelope breakdown, the dynamic DRH minicircle transport could have contributed to the spatial-temporal dynamics required for photosystem repair. This represents a paradigm shift from the previous understanding of "minicircle DNAs" to a "working plastome", which will have significant implications for its molecular functionality and evolution.
Collapse
Affiliation(s)
- Alvin Chun Man Kwok
- Division of Life Science, The Hong Kong University of Science and Technology, Clearwater Bay, Kowloon, Hong Kong, China
| | - Siu Kai Leung
- Division of Life Science, The Hong Kong University of Science and Technology, Clearwater Bay, Kowloon, Hong Kong, China
| | - Joseph Tin Yum Wong
- Division of Life Science, The Hong Kong University of Science and Technology, Clearwater Bay, Kowloon, Hong Kong, China
| |
Collapse
|
8
|
Bastos CAC, Afreixo V, Rodrigues JMOS, Pinho AJ. Concentration of inverted repeats along human DNA. J Integr Bioinform 2023; 20:jib-2022-0052. [PMID: 37486620 PMCID: PMC10561070 DOI: 10.1515/jib-2022-0052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 02/27/2023] [Indexed: 07/25/2023] Open
Abstract
This work aims to describe the observed enrichment of inverted repeats in the human genome; and to identify and describe, with detailed length profiles, the regions with significant and relevant enriched occurrence of inverted repeats. The enrichment is assessed and tested with a recently proposed measure (z-scores based measure). We simulate a genome using an order 7 Markov model trained with the data from the real genome. The simulated genome is used to establish the critical values which are used as decision thresholds to identify the regions with significant enriched concentrations. Several human genome regions are highly enriched in the occurrence of inverted repeats. This is observed in all the human chromosomes. The distribution of inverted repeat lengths varies along the genome. The majority of the regions with severely exaggerated enrichment contain mainly short length inverted repeats. There are also regions with regular peaks along the inverted repeats lengths distribution (periodic regularities) and other regions with exaggerated enrichment for long lengths (less frequent). However, adjacent regions tend to have similar distributions.
Collapse
Affiliation(s)
- Carlos A. C. Bastos
- DETI – Department of Electronics, Telecommunications and Informatics, IEETA – Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193Aveiro, Portugal
- LASI – Intelligent Systems Associate Laboratory, Aveiro, Portugal
| | - Vera Afreixo
- CIDMA – Center for Research and Development in Mathematics and Applications, DMAT – Department of Mathematics, University of Aveiro, 3810-193Aveiro, Portugal
| | - João M. O. S. Rodrigues
- DETI – Department of Electronics, Telecommunications and Informatics, IEETA – Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193Aveiro, Portugal
- LASI – Intelligent Systems Associate Laboratory, Aveiro, Portugal
| | - Armando J. Pinho
- DETI – Department of Electronics, Telecommunications and Informatics, IEETA – Institute of Electronics and Informatics Engineering of Aveiro, University of Aveiro, 3810-193Aveiro, Portugal
- LASI – Intelligent Systems Associate Laboratory, Aveiro, Portugal
| |
Collapse
|
9
|
Valková N, Kratochvilová L, Martinková L, Brázda V. Dual mode of IFI16 binding to supercoiled and linear DNA: A closer insight. Biochem Biophys Res Commun 2023; 667:89-94. [PMID: 37209567 DOI: 10.1016/j.bbrc.2023.05.049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 05/14/2023] [Indexed: 05/22/2023]
Abstract
IFI16 (Interferon inducible protein 16) is a DNA sensor responsible for innate immune response stimulation and a direct viral restriction by modulating gene expression and replication. Many IFI16-DNA binding properties were described - length-dependent and sequence-independent binding, oligomerization of IFI16 upon recognition, sliding on the DNA, and preference for supercoiled DNA. However, the question of the role of IFI16-DNA binding in distinct IFI16 functions remains unclear. Here we demonstrate two modes of IFI16 binding to DNA using atomic force microscopy and electrophoretic mobility shift assays. In our study, we show that IFI16 can bind to DNA in the form of globular complexes or oligomers depending on DNA topology and molar ratios. The stability of the complexes is different in higher salt concentrations. In addition, we observed no preferential binding with the HIN-A or HIN-B domains to supercoiled DNA, revealing the importance of the whole protein for this specificity. These results provide more profound insight into IFI16-DNA interactions and may be important in answering the question of self- and non-self-DNA binding by the IFI16 protein and potentially could shed light on the role of DNA binding in distinct IFI16 functions.
Collapse
Affiliation(s)
- Natália Valková
- Institute of Biophysics, Academy of Sciences of the Czech Republic, 612 65, Brno, Czech Republic; Department of Experimental Biology, Faculty of Science, Masaryk University, Kamenice 5, 625 00, Brno, Czech Republic
| | - Libuše Kratochvilová
- Department of Food Chemistry and Biotechnology, Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00, Brno, Czech Republic
| | - Lucia Martinková
- RECAMO, Masaryk Memorial Cancer Institute, Zluty kopec 7, 656 53, Brno, Czech Republic
| | - Václav Brázda
- Institute of Biophysics, Academy of Sciences of the Czech Republic, 612 65, Brno, Czech Republic.
| |
Collapse
|
10
|
Genomic Analysis of Non-B Nucleic Acids Structures in SARS-CoV-2: Potential Key Roles for These Structures in Mutability, Translation, and Replication? Genes (Basel) 2023; 14:genes14010157. [PMID: 36672896 PMCID: PMC9859294 DOI: 10.3390/genes14010157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 01/01/2023] [Accepted: 01/04/2023] [Indexed: 01/09/2023] Open
Abstract
Non-B nucleic acids structures have arisen as key contributors to genetic variation in SARS-CoV-2. Herein, we investigated the presence of defining spike protein mutations falling within inverted repeats (IRs) for 18 SARS-CoV-2 variants, discussed the potential roles of G-quadruplexes (G4s) in SARS-CoV-2 biology, and identified potential pseudoknots within the SARS-CoV-2 genome. Surprisingly, there was a large variation in the number of defining spike protein mutations arising within IRs between variants and these were more likely to occur in the stem region of the predicted hairpin stem-loop secondary structure. Notably, mutations implicated in ACE2 binding and propagation (e.g., ΔH69/V70, N501Y, and D614G) were likely to occur within IRs, whilst mutations involved in antibody neutralization and reduced vaccine efficacy (e.g., T19R, ΔE156, ΔF157, R158G, and G446S) were rarely found within IRs. We also predicted that RNA pseudoknots could predominantly be found within, or next to, 29 mutations found in the SARS-CoV-2 spike protein. Finally, the Omicron variants BA.2, BA.4, BA.5, BA.2.12.1, and BA.2.75 appear to have lost two of the predicted G4-forming sequences found in other variants. These were found in nsp2 and the sequence complementary to the conserved stem-loop II-like motif (S2M) in the 3' untranslated region (UTR). Taken together, non-B nucleic acids structures likely play an integral role in SARS-CoV-2 evolution and genetic diversity.
Collapse
|
11
|
Dobrovolná M, Brázda V, Warner EF, Bidula S. Inverted repeats in the monkeypox virus genome are hot spots for mutation. J Med Virol 2023; 95:e28322. [PMID: 36400742 PMCID: PMC10100261 DOI: 10.1002/jmv.28322] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 11/15/2022] [Accepted: 11/16/2022] [Indexed: 11/21/2022]
Abstract
The current monkeypox virus (MPXV) strain differs from the strain arising in 2018 by 50+ single nucleotide polymorphisms (SNPs) and is mutating much faster than expected. The cytidine deaminase apolipoprotein B messenger RNA editing enzyme, catalytic subunit B (APOBEC3) was hypothesized to be driving this increased mutation. APOBEC has recently been identified to preferentially mutate cruciform DNA secondary structures formed by inverted repeats (IRs). IRs were recently identified as hot spots for mutation in severe acute respiratory syndrome coronavirus 2, and we aimed to identify whether IRs were also hot spots for mutation within MPXV genomes. We found that MPXV genomes were replete with IR sequences. Of the 50+ SNPs identified in the 2022 outbreak strain, 63.9% of these were found to have arisen within IR regions in the 2018 reference strain (MT903344.1). Notably, IR sequences found in the 2018 reference strain were significantly lost over time, with an average of 32.5% of these sequences being conserved in the 2022 MPXV genomes. This evidence was highly indicative that mutations were arising within IRs. This data provides further support to the hypothesis that APOBEC may be driving MPXV mutation and highlights the necessity for greater surveillance of IRs of MPXV genomes to detect new mutations.
Collapse
Affiliation(s)
- Michaela Dobrovolná
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic.,Faculty of Chemistry, Brno University of Technology, Brno, Czech Republic
| | - Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic.,Faculty of Chemistry, Brno University of Technology, Brno, Czech Republic
| | - Emily F Warner
- School of Pharmacy, University of East Anglia, Norwich Research Park, Norwich, UK
| | - Stefan Bidula
- School of Pharmacy, University of East Anglia, Norwich Research Park, Norwich, UK
| |
Collapse
|
12
|
Goswami P, Šislerová L, Dobrovolná M, Havlík J, Šťastný J, Brázda V. Interaction of C-terminal p53 isoforms depends strongly upon DNA sequence and topology. Biochimie 2022; 208:93-99. [PMID: 36549455 DOI: 10.1016/j.biochi.2022.12.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 12/14/2022] [Accepted: 12/16/2022] [Indexed: 12/24/2022]
Abstract
The p53 protein is a key tumor suppressor and the most commonly mutated and down-regulated protein in human tumors. It functions mainly through interaction with DNA, and p53 acts as a transcription factor that recognizes the so-called p53 target sites on the promoters of various genes. P53 has been shown to exist as many isoforms, including three C-terminal isoforms that are produced by alternative splicing. Because the C-terminal domain is responsible for sequence-nonspecific binding and regulation of p53 binding, we have analyzed DNA recognition by these C-terminal isoforms. Using atomic force microscopy, we show for the first time that all C-terminal isoforms recognize superhelical DNA. It is particularly noteworthy that a sequence-specific p53 consensus binding site is bound by p53α and β isoforms with similar affinities, whilst p53α shows higher binding to a quadruplex sequence than both p53β and p53γ, and p53γ loses preferential binding to both the consensus binding sequence and the quadruplex-forming sequence. These results show the important role of the variable p53 C-terminal amino acid sequences for DNA recognition.
Collapse
Affiliation(s)
- Pratik Goswami
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00, Brno, Czech Republic
| | - Lucie Šislerová
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00, Brno, Czech Republic; Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00, Brno, Czech Republic
| | - Michaela Dobrovolná
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00, Brno, Czech Republic; Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00, Brno, Czech Republic
| | - Jan Havlík
- Department of Informatics, Mendel University in Brno, Zemědělská 1, 613 00, Brno, Czech Republic
| | - Jiří Šťastný
- Department of Informatics, Mendel University in Brno, Zemědělská 1, 613 00, Brno, Czech Republic; Faculty of Mechanical Engineering, Brno University of Technology, Technická 2, 616 69, Brno, Czech Republic
| | - Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 00, Brno, Czech Republic; Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00, Brno, Czech Republic.
| |
Collapse
|
13
|
Shi X, Teng H, Sun Z. An updated overview of experimental and computational approaches to identify non-canonical DNA/RNA structures with emphasis on G-quadruplexes and R-loops. Brief Bioinform 2022; 23:bbac441. [PMID: 36208174 PMCID: PMC9677470 DOI: 10.1093/bib/bbac441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Revised: 08/22/2022] [Accepted: 09/13/2022] [Indexed: 12/14/2022] Open
Abstract
Multiple types of non-canonical nucleic acid structures play essential roles in DNA recombination and replication, transcription, and genomic instability and have been associated with several human diseases. Thus, an increasing number of experimental and bioinformatics methods have been developed to identify these structures. To date, most reviews have focused on the features of non-canonical DNA/RNA structure formation, experimental approaches to mapping these structures, and the association of these structures with diseases. In addition, two reviews of computational algorithms for the prediction of non-canonical nucleic acid structures have been published. One of these reviews focused only on computational approaches for G4 detection until 2020. The other mainly summarized the computational tools for predicting cruciform, H-DNA and Z-DNA, in which the algorithms discussed were published before 2012. Since then, several experimental and computational methods have been developed. However, a systematic review including the conformation, sequencing mapping methods and computational prediction strategies for these structures has not yet been published. The purpose of this review is to provide an updated overview of conformation, current sequencing technologies and computational identification methods for non-canonical nucleic acid structures, as well as their strengths and weaknesses. We expect that this review will aid in understanding how these structures are characterised and how they contribute to related biological processes and diseases.
Collapse
Affiliation(s)
- Xiaohui Shi
- Key Laboratory of Clinical Laboratory Diagnosis and Translational Research of Zhejiang Province, The first Affiliated Hospital of WMU; Beijing Institutes of Life Science, Chinese Academy of Sciences; University of Chinese Academy of Sciences, Ouhai District, Wenzhou 325000, China
| | - Huajing Teng
- Department of Radiation Oncology, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education) at Peking University Cancer Hospital and Institute, Ouhai District, Wenzhou 325000, China
| | - Zhongsheng Sun
- Key Laboratory of Clinical Laboratory Diagnosis and Translational Research of Zhejiang Province, The first Affiliated Hospital of WMU; Beijing Institutes of Life Science, Chinese Academy of Sciences; CAS Center for Excellence in Biotic Interactions and State Key Laboratory of Integrated Management of Pest Insects and Rodents, University of Chinese Academy of Sciences; Institute of Genomic Medicine, Wenzhou Medical University; IBMC-BGI Center, the Cancer Hospital of the University of Chinese Academy of Sciences (Zhejiang Cancer Hospital); Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences, Ouhai District, Wenzhou 325000, China
| |
Collapse
|
14
|
Dobrovolná M, Bohálová N, Peška V, Wang J, Luo Y, Bartas M, Volná A, Mergny JL, Brázda V. The Newly Sequenced Genome of Pisum sativum Is Replete with Potential G-Quadruplex-Forming Sequences-Implications for Evolution and Biological Regulation. Int J Mol Sci 2022; 23:8482. [PMID: 35955617 PMCID: PMC9369095 DOI: 10.3390/ijms23158482] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 07/25/2022] [Accepted: 07/28/2022] [Indexed: 11/20/2022] Open
Abstract
G-quadruplexes (G4s) have been long considered rare and physiologically unimportant in vitro curiosities, but recent methodological advances have proved their presence and functions in vivo. Moreover, in addition to their functional relevance in bacteria and animals, including humans, their importance has been recently demonstrated in evolutionarily distinct plant species. In this study, we analyzed the genome of Pisum sativum (garden pea, or the so-called green pea), a unique member of the Fabaceae family. Our results showed that this genome contained putative G4 sequences (PQSs). Interestingly, these PQSs were located nonrandomly in the nuclear genome. We also found PQSs in mitochondrial (mt) and chloroplast (cp) DNA, and we experimentally confirmed G4 formation for sequences found in these two organelles. The frequency of PQSs for nuclear DNA was 0.42 PQSs per thousand base pairs (kbp), in the same range as for cpDNA (0.53/kbp), but significantly lower than what was found for mitochondrial DNA (1.58/kbp). In the nuclear genome, PQSs were mainly associated with regulatory regions, including 5'UTRs, and upstream of the rRNA region. In contrast to genomic DNA, PQSs were located around RNA genes in cpDNA and mtDNA. Interestingly, PQSs were also associated with specific transposable elements such as TIR and LTR and around them, pointing to their role in their spreading in nuclear DNA. The nonrandom localization of PQSs uncovered their evolutionary and functional significance in the Pisum sativum genome.
Collapse
Affiliation(s)
- Michaela Dobrovolná
- Institute of Biophysics of the Czech Academy of Sciences, 612 65 Brno, Czech Republic; (M.D.); (N.B.); (V.P.)
- Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00 Brno, Czech Republic
| | - Natália Bohálová
- Institute of Biophysics of the Czech Academy of Sciences, 612 65 Brno, Czech Republic; (M.D.); (N.B.); (V.P.)
- Department of Experimental Biology, Faculty of Science, Masaryk University, 611 37 Brno, Czech Republic
| | - Vratislav Peška
- Institute of Biophysics of the Czech Academy of Sciences, 612 65 Brno, Czech Republic; (M.D.); (N.B.); (V.P.)
| | - Jiawei Wang
- Laboratoire d’Optique et Biosciences (LOB), Ecole Polytechnique, CNRS, INSERM, Institut Polytechnique de Paris, CEDEX, 91128 Palaiseau, France; (J.W.); (Y.L.)
| | - Yu Luo
- Laboratoire d’Optique et Biosciences (LOB), Ecole Polytechnique, CNRS, INSERM, Institut Polytechnique de Paris, CEDEX, 91128 Palaiseau, France; (J.W.); (Y.L.)
- CNRS UMR9187, INSERM U1196, Université Paris-Saclay, CEDEX, 91405 Orsay, France
| | - Martin Bartas
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic;
| | - Adriana Volná
- Department of Physics, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic;
| | - Jean-Louis Mergny
- Institute of Biophysics of the Czech Academy of Sciences, 612 65 Brno, Czech Republic; (M.D.); (N.B.); (V.P.)
- Laboratoire d’Optique et Biosciences (LOB), Ecole Polytechnique, CNRS, INSERM, Institut Polytechnique de Paris, CEDEX, 91128 Palaiseau, France; (J.W.); (Y.L.)
| | - Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, 612 65 Brno, Czech Republic; (M.D.); (N.B.); (V.P.)
- Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00 Brno, Czech Republic
| |
Collapse
|
15
|
Rakpenthai A, Apodiakou A, Whitcomb SJ, Hoefgen R. In silico analysis of cis-elements and identification of transcription factors putatively involved in the regulation of the OAS cluster genes SDI1 and SDI2. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:1286-1304. [PMID: 35315155 DOI: 10.1111/tpj.15735] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 02/09/2022] [Accepted: 03/01/2022] [Indexed: 06/14/2023]
Abstract
Arabidopsis thaliana sulfur deficiency-induced 1 and sulfur deficiency-induced 2 (SDI1 and SDI2) are involved in partitioning sulfur among metabolite pools during sulfur deficiency, and their transcript levels strongly increase in this condition. However, little is currently known about the cis- and trans-factors that regulate SDI expression. We aimed at identifying DNA sequence elements (cis-elements) and transcription factors (TFs) involved in regulating expression of the SDI genes. We performed in silico analysis of their promoter sequences cataloging known cis-elements and identifying conserved sequence motifs. We screened by yeast-one-hybrid an arrayed library of Arabidopsis TFs for binding to the SDI1 and SDI2 promoters. In total, 14 candidate TFs were identified. Direct association between particular cis-elements in the proximal SDI promoter regions and specific TFs was established via electrophoretic mobility shift assays: sulfur limitation 1 (SLIM1) was shown to bind SURE cis-element(s), the basic domain/leucine zipper (bZIP) core cis-element was shown to be important for HY5-homolog (HYH) binding, and G-box binding factor 1 (GBF1) was shown to bind the E box. Functional analysis of GBF1 and HYH using mutant and over-expressing lines indicated that these TFs promote a higher transcript level of SDI1 in vivo. Additionally, we performed a meta-analysis of expression changes of the 14 TF candidates in a variety of conditions that alter SDI expression. The presented results expand our understanding of sulfur pool regulation by SDI genes.
Collapse
Affiliation(s)
- Apidet Rakpenthai
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| | - Anastasia Apodiakou
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| | - Sarah J Whitcomb
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| | - Rainer Hoefgen
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| |
Collapse
|
16
|
Bowater RP, Bohálová N, Brázda V. Interaction of Proteins with Inverted Repeats and Cruciform Structures in Nucleic Acids. Int J Mol Sci 2022; 23:ijms23116171. [PMID: 35682854 PMCID: PMC9180970 DOI: 10.3390/ijms23116171] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 05/26/2022] [Accepted: 05/30/2022] [Indexed: 01/27/2023] Open
Abstract
Cruciforms occur when inverted repeat sequences in double-stranded DNA adopt intra-strand hairpins on opposing strands. Biophysical and molecular studies of these structures confirm their characterization as four-way junctions and have demonstrated that several factors influence their stability, including overall chromatin structure and DNA supercoiling. Here, we review our understanding of processes that influence the formation and stability of cruciforms in genomes, covering the range of sequences shown to have biological significance. It is challenging to accurately sequence repetitive DNA sequences, but recent advances in sequencing methods have deepened understanding about the amounts of inverted repeats in genomes from all forms of life. We highlight that, in the majority of genomes, inverted repeats are present in higher numbers than is expected from a random occurrence. It is, therefore, becoming clear that inverted repeats play important roles in regulating many aspects of DNA metabolism, including replication, gene expression, and recombination. Cruciforms are targets for many architectural and regulatory proteins, including topoisomerases, p53, Rif1, and others. Notably, some of these proteins can induce the formation of cruciform structures when they bind to DNA. Inverted repeat sequences also influence the evolution of genomes, and growing evidence highlights their significance in several human diseases, suggesting that the inverted repeat sequences and/or DNA cruciforms could be useful therapeutic targets in some cases.
Collapse
Affiliation(s)
- Richard P. Bowater
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK;
| | - Natália Bohálová
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, 61265 Brno, Czech Republic;
- Department of Experimental Biology, Faculty of Science, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic
| | - Václav Brázda
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, 61265 Brno, Czech Republic;
- Correspondence:
| |
Collapse
|
17
|
Pánek T, Barcytė D, Treitli SC, Záhonová K, Sokol M, Ševčíková T, Zadrobílková E, Jaške K, Yubuki N, Čepička I, Eliáš M. A new lineage of non-photosynthetic green algae with extreme organellar genomes. BMC Biol 2022; 20:66. [PMID: 35296310 PMCID: PMC8928634 DOI: 10.1186/s12915-022-01263-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Accepted: 02/22/2022] [Indexed: 12/27/2022] Open
Abstract
Background The plastid genomes of the green algal order Chlamydomonadales tend to expand their non-coding regions, but this phenomenon is poorly understood. Here we shed new light on organellar genome evolution in Chlamydomonadales by studying a previously unknown non-photosynthetic lineage. We established cultures of two new Polytoma-like flagellates, defined their basic characteristics and phylogenetic position, and obtained complete organellar genome sequences and a transcriptome assembly for one of them. Results We discovered a novel deeply diverged chlamydomonadalean lineage that has no close photosynthetic relatives and represents an independent case of photosynthesis loss. To accommodate these organisms, we establish the new genus Leontynka, with two species (L. pallida and L. elongata) distinguishable through both their morphological and molecular characteristics. Notable features of the colourless plastid of L. pallida deduced from the plastid genome (plastome) sequence and transcriptome assembly include the retention of ATP synthase, thylakoid-associated proteins, the carotenoid biosynthesis pathway, and a plastoquinone-based electron transport chain, the latter two modules having an obvious functional link to the eyespot present in Leontynka. Most strikingly, the ~362 kbp plastome of L. pallida is by far the largest among the non-photosynthetic eukaryotes investigated to date due to an extreme proliferation of sequence repeats. These repeats are also present in coding sequences, with one repeat type found in the exons of 11 out of 34 protein-coding genes, with up to 36 copies per gene, thus affecting the encoded proteins. The mitochondrial genome of L. pallida is likewise exceptionally large, with its >104 kbp surpassed only by the mitogenome of Haematococcus lacustris among all members of Chlamydomonadales hitherto studied. It is also bloated with repeats, though entirely different from those in the L. pallida plastome, which contrasts with the situation in H. lacustris where both the organellar genomes have accumulated related repeats. Furthermore, the L. pallida mitogenome exhibits an extremely high GC content in both coding and non-coding regions and, strikingly, a high number of predicted G-quadruplexes. Conclusions With its unprecedented combination of plastid and mitochondrial genome characteristics, Leontynka pushes the frontiers of organellar genome diversity and is an interesting model for studying organellar genome evolution. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01263-w.
Collapse
Affiliation(s)
- Tomáš Pánek
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 701 00, Ostrava, Czech Republic.,Department of Zoology, Faculty of Science, Charles University, 128 43, Prague, Czech Republic
| | - Dovilė Barcytė
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 701 00, Ostrava, Czech Republic
| | - Sebastian C Treitli
- Department of Parasitology, Faculty of Science, Charles University, BIOCEV, 252 42, Vestec, Czech Republic
| | - Kristína Záhonová
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 701 00, Ostrava, Czech Republic
| | - Martin Sokol
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 701 00, Ostrava, Czech Republic
| | - Tereza Ševčíková
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 701 00, Ostrava, Czech Republic
| | - Eliška Zadrobílková
- Department of Zoology, Faculty of Science, Charles University, 128 43, Prague, Czech Republic
| | - Karin Jaške
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 701 00, Ostrava, Czech Republic
| | - Naoji Yubuki
- Department of Zoology, Faculty of Science, Charles University, 128 43, Prague, Czech Republic.,Bioimaging Facility, University of British Columbia, Vancouver, V6T 1Z4, Canada
| | - Ivan Čepička
- Department of Zoology, Faculty of Science, Charles University, 128 43, Prague, Czech Republic
| | - Marek Eliáš
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, 701 00, Ostrava, Czech Republic.
| |
Collapse
|
18
|
Víglaský V. Hidden Information Revealed Using the Orthogonal System of Nucleic Acids. Int J Mol Sci 2022; 23:ijms23031804. [PMID: 35163723 PMCID: PMC8836696 DOI: 10.3390/ijms23031804] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 01/31/2022] [Accepted: 02/02/2022] [Indexed: 11/25/2022] Open
Abstract
In this study, the organization of genetic information in nucleic acids is defined using a novel orthogonal representation. Clearly defined base pairing in DNA allows the linear base chain and sequence to be mathematically transformed into an orthogonal representation where the G–C and A–T pairs are displayed in different planes that are perpendicular to each other. This form of base allocation enables the evaluation of any nucleic acid and predicts the likelihood of a particular region to form non-canonical motifs. The G4Hunter algorithm is currently a popular method of identifying G-quadruplex forming sequences in nucleic acids, and offers promising scores despite its lack of a substantial rational basis. The orthogonal representation described here is an effort to address this incongruity. In addition, the orthogonal display facilitates the search for other sequences that are capable of adopting non-canonical motifs, such as direct and palindromic repeats. The technique can also be used for various RNAs, including any aptamers. This powerful tool based on an orthogonal system offers considerable potential for a wide range of applications.
Collapse
Affiliation(s)
- Viktor Víglaský
- Department of Biochemistry, Institute of Chemistry, Faculty of Sciences, Pavol Jozef Šafárik University, 04001 Košice, Slovakia
| |
Collapse
|
19
|
Skorupski J. Characterisation of the Complete Mitochondrial Genome of Critically Endangered Mustela lutreola (Carnivora: Mustelidae) and Its Phylogenetic and Conservation Implications. Genes (Basel) 2022; 13:genes13010125. [PMID: 35052465 PMCID: PMC8774856 DOI: 10.3390/genes13010125] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 12/28/2021] [Accepted: 01/06/2022] [Indexed: 02/07/2023] Open
Abstract
In this paper, a complete mitochondrial genome of the critically endangered European mink Mustela lutreola L., 1761 is reported. The mitogenome was 16,504 bp in length and encoded the typical 13 protein-coding genes, two ribosomal RNA genes and 22 transfer RNA genes, and harboured a putative control region. The A+T content of the entire genome was 60.06% (A > T > C > G), and the AT-skew and GC-skew were 0.093 and −0.308, respectively. The encoding-strand identity of genes and their order were consistent with a collinear gene order characteristic for vertebrate mitogenomes. The start codons of all protein-coding genes were the typical ATN. In eight cases, they were ended by complete stop codons, while five had incomplete termination codons (TA or T). All tRNAs had a typical cloverleaf secondary structure, except tRNASer(AGC) and tRNALys, which lacked the DHU stem and had reduced DHU loop, respectively. Both rRNAs were capable of folding into complex secondary structures, containing unmatched base pairs. Eighty-one single nucleotide variants (substitutions and indels) were identified. Comparative interspecies analyses confirmed the close phylogenetic relationship of the European mink to the so-called ferret group, clustering the European polecat, the steppe polecat and the black-footed ferret. The obtained results are expected to provide useful molecular data, informing and supporting effective conservation measures to save M. lutreola.
Collapse
Affiliation(s)
- Jakub Skorupski
- Institute of Marine and Environmental Sciences, University of Szczecin, Adama Mickiewicza 16 St., 70-383 Szczecin, Poland; ; Tel.: +48-91-444-16-85
- Polish Society for Conservation Genetics LUTREOLA, Maciejkowa 21 St., 71-784 Szczecin, Poland
- The European Mink Centre, 71-415 Szczecin, Poland
| |
Collapse
|
20
|
Bacolla A, Tainer JA. Robust Computational Approaches to Defining Insights on the Interface of DNA Repair with Replication and Transcription in Cancer. Methods Mol Biol 2022; 2444:1-13. [PMID: 35290628 PMCID: PMC9377048 DOI: 10.1007/978-1-0716-2063-2_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The massive amount of experimental DNA and RNA sequence information provides an encyclopedia for cell biology that requires computational tools for efficient interpretation. The ability to write and apply simple computing scripts propels the investigator beyond the boundaries of online analysis tools to more broadly interrogate laboratory experimental data and to integrate them with all available datasets to test and challenge hypotheses. Here we describe robust prototypic bash and C++ scripts with metrics and methods for validation that we have made publicly available to address the roles of non-B DNA-forming motifs in eliciting genetic instability and to query The Cancer Genome Atlas. Importantly, the methods presented provide practical data interpretation tools to examine fundamental relationships and to enable insights and correlations between alterations in gene expression patterns and patient outcome. The exemplary source codes described are simple and can be efficiently modified, elaborated, and applied to other relationships and areas of investigation.
Collapse
Affiliation(s)
- Albino Bacolla
- Departments of Cancer Biology and of Molecular and Cellular Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, TX, USA.
| | - John A Tainer
- Departments of Cancer Biology and of Molecular and Cellular Oncology, The University of Texas M.D. Anderson Cancer Center, Houston, TX, USA.
| |
Collapse
|
21
|
Yin C, Yau SST. Inverted repeats in coronavirus SARS-CoV-2 genome manifest the evolution events. J Theor Biol 2021; 530:110885. [PMID: 34478743 PMCID: PMC8406619 DOI: 10.1016/j.jtbi.2021.110885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 08/13/2021] [Accepted: 08/25/2021] [Indexed: 11/24/2022]
Abstract
The world faces a great unforeseen challenge through the COVID-19 pandemic caused by coronavirus SARS-CoV-2. The virus genome structure and evolution are positioned front and center for further understanding insights on vaccine development, monitoring of transmission trajectories, and prevention of zoonotic infections of new coronaviruses. Of particular interest are genomic elements Inverse Repeats (IRs), which maintain genome stability, regulate gene expressions, and are the targets of mutations. However, little research attention is given to the IR content analysis in the SARS-CoV-2 genome. In this study, we propose a geometric analysis method and using the method to investigate the distributions of IRs in SARS-CoV-2 and its related coronavirus genomes. The method represents each genomic IR sequence pair as a single point and constructs the geometric shape of the genome using the IRs. Thus, the IR shape can be considered as the signature of the genome. The genomes of different coronaviruses are then compared using the constructed IR shapes. The results demonstrate that SARS-CoV-2 genome, specifically, has an abundance of IRs, and the IRs in coronavirus genomes show an increase during evolution events.
Collapse
Affiliation(s)
- Changchuan Yin
- Department of Mathematics, Statistics, and Computer Science, The University of Illinois at Chicago, Chicago, IL 60607-7045, USA.
| | - Stephen S-T Yau
- Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
22
|
The minicircular and extremely heteroplasmic mitogenome of the holoparasitic plant Rhopalocnemis phalloides. Curr Biol 2021; 32:470-479.e5. [PMID: 34906352 DOI: 10.1016/j.cub.2021.11.053] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 10/25/2021] [Accepted: 11/22/2021] [Indexed: 12/14/2022]
Abstract
The plastid and nuclear genomes of parasitic plants exhibit deeply altered architectures,1-13 whereas the few examined mitogenomes range from deeply altered to conventional.14-20 To provide further insight on mitogenome evolution in parasitic plants, we report the highly modified mitogenome of Rhopalocnemis phalloides, a holoparasite in Balanophoraceae. Its mitogenome is uniquely arranged in 21 minicircular chromosomes that vary in size from 4,949 to 7,861 bp, with a total length of only 130,713 bp. All chromosomes share an identical 896 bp conserved region, with a large stem-loop that acts as the origin of replication, flanked on each side by hypervariable and semi-conserved regions. Similar minicircular structures with shared and unique regions have been observed in parasitic animals and free-living protists,21-24 suggesting convergent structural evolution. Southern blots confirm both the minicircular structure and the replication origin of the mitochondrial chromosomes. PacBio reads provide evidence for chromosome recombination and rolling-circle replication for the R. phalloides mitogenome. Despite its small size, the mitogenome harbors a typical set of genes and introns within the unique regions of each chromosome, yet introns are the smallest among seed plants and ferns. The mitogenome also exhibits extreme heteroplasmy, predominantly involving short indels and more complex variants, many of which cause potential loss-of-function mutations for some gene copies. All heteroplasmic variants are transcribed, and functional and nonfunctional protein-coding variants are spliced and RNA edited. Our findings offer a unique perspective into how mitogenomes of parasitic plants can be deeply altered and shed light on plant mitogenome replication.
Collapse
|
23
|
R-Loop Tracker: Web Access-Based Tool for R-Loop Detection and Analysis in Genomic DNA Sequences. Int J Mol Sci 2021; 22:ijms222312857. [PMID: 34884661 PMCID: PMC8657672 DOI: 10.3390/ijms222312857] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 11/19/2021] [Accepted: 11/25/2021] [Indexed: 12/02/2022] Open
Abstract
R-loops are common non-B nucleic acid structures formed by a three-stranded nucleic acid composed of an RNA–DNA hybrid and a displaced single-stranded DNA (ssDNA) loop. Because the aberrant R-loop formation leads to increased mutagenesis, hyper-recombination, rearrangements, and transcription-replication collisions, it is regarded as important in human diseases. Therefore, its prevalence and distribution in genomes are studied intensively. However, in silico tools for R-loop prediction are limited, and therefore, we have developed the R-loop tracker tool, which was implemented as a part of the DNA Analyser web server. This new tool is focused upon (1) prediction of R-loops in genomic DNA without length and sequence limitations; (2) integration of R-loop tracker results with other tools for nucleic acids analyses, including Genome Browser; (3) internal cross-evaluation of in silico results with experimental data, where available; (4) easy export and correlation analyses with other genome features and markers; and (5) enhanced visualization outputs. Our new R-loop tracker tool is freely accessible on the web pages of DNA Analyser tools, and its implementation on the web-based server allows effective analyses not only for DNA segments but also for full chromosomes and genomes.
Collapse
|
24
|
Brázda V, Bohálová N, Bowater RP. New telomere to telomere assembly of human chromosome 8 reveals a previous underestimation of G-quadruplex forming sequences and inverted repeats. Gene 2021; 810:146058. [PMID: 34737002 DOI: 10.1016/j.gene.2021.146058] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 10/14/2021] [Accepted: 10/29/2021] [Indexed: 11/04/2022]
Abstract
Taking advantage of evolving and improving sequencing methods, human chromosome 8 is now available as a gapless, end-to-end assembly. Thanks to advances in long-read sequencing technologies, its centromere, telomeres, duplicated gene families and repeat-rich regions are now fully sequenced. We were interested to assess if the new assembly altered our understanding of the potential impact of non-B DNA structures within this completed chromosome sequence. It has been shown that non-B secondary structures, such as G-quadruplexes, hairpins and cruciforms, have important regulatory functions and potential as targeted therapeutics. Therefore, we analysed the presence of putative G-quadruplex forming sequences and inverted repeats in the current human reference genome (GRCh38) and in the new end-to-end assembly of chromosome 8. The comparison revealed that the new assembly contains significantly more inverted repeats and G-quadruplex forming sequences compared to the current reference sequence. This observation can be explained by improved accuracy of the new sequencing methods, particularly in regions that contain extensive repeats of bases, as is preferred by many non-B DNA structures. These results show a significant underestimation of the prevalence of non-B DNA secondary structure in previous assembly versions of the human genome and point to their importance being not fully appreciated. We anticipate that similar observations will occur as the improved sequencing technologies fill in gaps across the genomes of humans and other organisms.
Collapse
Affiliation(s)
- Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, Brno 612 65, Czech Republic.
| | - Natália Bohálová
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, Brno 612 65, Czech Republic; Department of Experimental Biology, Faculty of Science, Masaryk University, Kamenice 5, Brno 62500, Czech Republic
| | - Richard P Bowater
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, United Kingdom.
| |
Collapse
|
25
|
Emrich-Mills TZ, Yates G, Barrett J, Girr P, Grouneva I, Lau CS, Walker CE, Kwok TK, Davey JW, Johnson MP, Mackinder LCM. A recombineering pipeline to clone large and complex genes in Chlamydomonas. THE PLANT CELL 2021; 33:1161-1181. [PMID: 33723601 PMCID: PMC8633747 DOI: 10.1093/plcell/koab024] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 01/18/2021] [Indexed: 05/10/2023]
Abstract
The ability to clone genes has greatly advanced cell and molecular biology research, enabling researchers to generate fluorescent protein fusions for localization and confirm genetic causation by mutant complementation. Most gene cloning is polymerase chain reaction (PCR)�or DNA synthesis-dependent, which can become costly and technically challenging as genes increase in size, particularly if they contain complex regions. This has been a long-standing challenge for the Chlamydomonas reinhardtii research community, as this alga has a high percentage of genes containing complex sequence structures. Here we overcame these challenges by developing a recombineering pipeline for the rapid parallel cloning of genes from a Chlamydomonas bacterial artificial chromosome collection. To generate fluorescent protein fusions for localization, we applied the pipeline at both batch and high-throughput scales to 203 genes related to the Chlamydomonas CO2 concentrating mechanism (CCM), with an overall cloning success rate of 77%. Cloning success was independent of gene size and complexity, with cloned genes as large as 23 kb. Localization of a subset of CCM targets confirmed previous mass spectrometry data, identified new pyrenoid components, and enabled complementation of mutants. We provide vectors and detailed protocols to facilitate easy adoption of this technology, which we envision will open up new possibilities in algal and plant research.
Collapse
Affiliation(s)
- Tom Z Emrich-Mills
- Department of Biology, University of York, York YO10 5DD, UK
- Department Molecular Biology and Biotechnology, University of Sheffield, Sheffield S10 2TN, UK
| | - Gary Yates
- Department of Biology, University of York, York YO10 5DD, UK
| | - James Barrett
- Department of Biology, University of York, York YO10 5DD, UK
| | - Philipp Girr
- Department of Biology, University of York, York YO10 5DD, UK
| | - Irina Grouneva
- Department of Biology, University of York, York YO10 5DD, UK
| | - Chun Sing Lau
- Department of Biology, University of York, York YO10 5DD, UK
| | | | - Tsz Kam Kwok
- Department of Biology, University of York, York YO10 5DD, UK
| | - John W Davey
- Department of Biology, University of York, York YO10 5DD, UK
| | - Matthew P Johnson
- Department Molecular Biology and Biotechnology, University of Sheffield, Sheffield S10 2TN, UK
| | - Luke C M Mackinder
- Department of Biology, University of York, York YO10 5DD, UK
- Author for correspondence: (L.C.M.M.)
| |
Collapse
|
26
|
Brázda V, Bartas M, Bowater RP. Evolution of Diverse Strategies for Promoter Regulation. Trends Genet 2021; 37:730-744. [PMID: 33931265 DOI: 10.1016/j.tig.2021.04.003] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 03/31/2021] [Accepted: 04/01/2021] [Indexed: 12/15/2022]
Abstract
DNA is fundamentally important for all cellular organisms due to its role as a store of hereditary genetic information. The precise and accurate regulation of gene transcription depends primarily on promoters, which vary significantly within and between genomes. Some promoters are rich in specific types of bases, while others have more varied, complex sequence characteristics. However, it is not only base sequence but also epigenetic modifications and altered DNA structure that regulate promoter activity. Significantly, many promoters across all organisms contain sequences that can form intrastrand hairpins (cruciforms) or four-stranded structures (G-quadruplex or i-motif). In this review we integrate recent studies on promoter regulation that highlight the importance of DNA structure in the evolutionary adaptation of promoter sequences.
Collapse
Affiliation(s)
- Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Martin Bartas
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic
| | - Richard P Bowater
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK.
| |
Collapse
|
27
|
Goswami P, Bartas M, Lexa M, Bohálová N, Volná A, Červeň J, Červeňová V, Pečinka P, Špunda V, Fojta M, Brázda V. SARS-CoV-2 hot-spot mutations are significantly enriched within inverted repeats and CpG island loci. Brief Bioinform 2021; 22:1338-1345. [PMID: 33341900 PMCID: PMC7799342 DOI: 10.1093/bib/bbaa385] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 11/16/2020] [Accepted: 11/27/2020] [Indexed: 12/18/2022] Open
Abstract
SARS-CoV-2 is an intensively investigated virus from the order Nidovirales (Coronaviridae family) that causes COVID-19 disease in humans. Through enormous scientific effort, thousands of viral strains have been sequenced to date, thereby creating a strong background for deep bioinformatics studies of the SARS-CoV-2 genome. In this study, we inspected high-frequency mutations of SARS-CoV-2 and carried out systematic analyses of their overlay with inverted repeat (IR) loci and CpG islands. The main conclusion of our study is that SARS-CoV-2 hot-spot mutations are significantly enriched within both IRs and CpG island loci. This points to their role in genomic instability and may predict further mutational drive of the SARS-CoV-2 genome. Moreover, CpG islands are strongly enriched upstream from viral ORFs and thus could play important roles in transcription and the viral life cycle. We hypothesize that hypermethylation of these loci will decrease the transcription of viral ORFs and could therefore limit the progression of the disease.
Collapse
Affiliation(s)
- Pratik Goswami
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Martin Bartas
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Matej Lexa
- Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Natália Bohálová
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic.,Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Adriana Volná
- Department of Physics, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Jiří Červeň
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Veronika Červeňová
- Department of Mathematics, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Petr Pečinka
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Vladimír Špunda
- Department of Physics, Faculty of Science, University of Ostrava, Ostrava, Czech Republic.,Global Change Research Institute of the Czech Academy of Sciences, Brno, Czech Republic
| | - Miroslav Fojta
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Václav Brázda
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| |
Collapse
|
28
|
Bartas M, Goswami P, Lexa M, Červeň J, Volná A, Fojta M, Brázda V, Pečinka P. Letter to the Editor: Significant mutation enrichment in inverted repeat sites of new SARS-CoV-2 strains. Brief Bioinform 2021; 22:6219140. [PMID: 33837760 PMCID: PMC8083281 DOI: 10.1093/bib/bbab129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Revised: 03/16/2021] [Accepted: 03/19/2021] [Indexed: 01/08/2023] Open
Abstract
In a recently published paper, we have found that SARS-CoV-2 hot-spot mutations are significantly associated with inverted repeat loci and CG dinucleotides. However, fast-spreading strains with new mutations (so-called mink farm mutations, England mutations and Japan mutations) have been recently described. We used the new datasets to check the positioning of mutation sites in genomes of the new SARS-CoV-2 strains. Using an open-access Palindrome analyzer tool, we found mutations in these new strains to be significantly enriched in inverted repeat loci.
Collapse
Affiliation(s)
- Martin Bartas
- Department of Biology, University of Ostrava, Czech Republic
| | - Pratik Goswami
- Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic. He is a student of the Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Matej Lexa
- Faculty of Informatics, Masaryk University, Brno, Czech Republic
| | - Jiří Červeň
- Department of Biology, University of Ostrava, Czech Republic
| | - Adriana Volná
- Department of Physics, University of Ostrava, Czech Republic
| | - Miroslav Fojta
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Václav Brázda
- Laboratory of Protein-DNA Interactions, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Petr Pečinka
- Molecular Biology group in the Department of Biology, University of Ostrava, Czech Republic
| |
Collapse
|
29
|
Tracing dsDNA Virus-Host Coevolution through Correlation of Their G-Quadruplex-Forming Sequences. Int J Mol Sci 2021; 22:ijms22073433. [PMID: 33810462 PMCID: PMC8036883 DOI: 10.3390/ijms22073433] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 03/17/2021] [Accepted: 03/23/2021] [Indexed: 12/12/2022] Open
Abstract
The importance of gene expression regulation in viruses based upon G-quadruplex may point to its potential utilization in therapeutic targeting. Here, we present analyses as to the occurrence of putative G-quadruplex-forming sequences (PQS) in all reference viral dsDNA genomes and evaluate their dependence on PQS occurrence in host organisms using the G4Hunter tool. PQS frequencies differ across host taxa without regard to GC content. The overlay of PQS with annotated regions reveals the localization of PQS in specific regions. While abundance in some, such as repeat regions, is shared by all groups, others are unique. There is abundance within introns of Eukaryota-infecting viruses, but depletion of PQS in introns of bacteria-infecting viruses. We reveal a significant positive correlation between PQS frequencies in dsDNA viruses and corresponding hosts from archaea, bacteria, and eukaryotes. A strong relationship between PQS in a virus and its host indicates their close coevolution and evolutionarily reciprocal mimicking of genome organization.
Collapse
|
30
|
Zell J, Rota Sperti F, Britton S, Monchaud D. DNA folds threaten genetic stability and can be leveraged for chemotherapy. RSC Chem Biol 2021; 2:47-76. [PMID: 35340894 PMCID: PMC8885165 DOI: 10.1039/d0cb00151a] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 09/20/2020] [Indexed: 12/22/2022] Open
Abstract
Damaging DNA is a current and efficient strategy to fight against cancer cell proliferation. Numerous mechanisms exist to counteract DNA damage, collectively referred to as the DNA damage response (DDR) and which are commonly dysregulated in cancer cells. Precise knowledge of these mechanisms is necessary to optimise chemotherapeutic DNA targeting. New research on DDR has uncovered a series of promising therapeutic targets, proteins and nucleic acids, with application notably via an approach referred to as combination therapy or combinatorial synthetic lethality. In this review, we summarise the cornerstone discoveries which gave way to the DNA being considered as an anticancer target, and the manipulation of DDR pathways as a valuable anticancer strategy. We describe in detail the DDR signalling and repair pathways activated in response to DNA damage. We then summarise the current understanding of non-B DNA folds, such as G-quadruplexes and DNA junctions, when they are formed and why they can offer a more specific therapeutic target compared to that of canonical B-DNA. Finally, we merge these subjects to depict the new and highly promising chemotherapeutic strategy which combines enhanced-specificity DNA damaging and DDR targeting agents. This review thus highlights how chemical biology has given rise to significant scientific advances thanks to resolutely multidisciplinary research efforts combining molecular and cell biology, chemistry and biophysics. We aim to provide the non-specialist reader a gateway into this exciting field and the specialist reader with a new perspective on the latest results achieved and strategies devised.
Collapse
Affiliation(s)
- Joanna Zell
- Institut de Chimie Moléculaire de l'Université de Bourgogne, ICMUB CNRS UMR 6302, UBFC Dijon France
| | - Francesco Rota Sperti
- Institut de Chimie Moléculaire de l'Université de Bourgogne, ICMUB CNRS UMR 6302, UBFC Dijon France
| | - Sébastien Britton
- Institut de Pharmacologie et de Biologie Structurale, IPBS, Université de Toulouse, CNRS, UPS Toulouse France
- Équipe Labellisée la Ligue Contre le Cancer 2018 Toulouse France
| | - David Monchaud
- Institut de Chimie Moléculaire de l'Université de Bourgogne, ICMUB CNRS UMR 6302, UBFC Dijon France
| |
Collapse
|
31
|
Karageorgiou C, Tarrío R, Rodríguez-Trelles F. The Cyclically Seasonal Drosophila subobscura Inversion O 7 Originated From Fragile Genomic Sites and Relocated Immunity and Metabolic Genes. Front Genet 2020; 11:565836. [PMID: 33193649 PMCID: PMC7584159 DOI: 10.3389/fgene.2020.565836] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 09/09/2020] [Indexed: 11/28/2022] Open
Abstract
Chromosome inversions are important contributors to standing genetic variation in Drosophila subobscura. Presently, the species is experiencing a rapid replacement of high-latitude by low-latitude inversions associated with global warming. Yet not all low-latitude inversions are correlated with the ongoing warming trend. This is particularly unexpected in the case of O7 because it shows a regular seasonal cycle that peaks in summer and rose with a heatwave. The inconsistent behavior of O7 across components of the ambient temperature suggests that is causally more complex than simply due to temperature alone. In order to understand the dynamics of O7, high-quality genomic data are needed to determine both the breakpoints and the genetic content. To fill this gap, here we generated a PacBio long read-based chromosome-scale genome assembly, from a highly homozygous line made isogenic for an O3 + 4 + 7 chromosome. Then we isolated the complete continuous sequence of O7 by conserved synteny analysis with the available reference genome. Main findings include the following: (i) the assembled O7 inversion stretches 9.936 Mb, containing > 1,000 annotated genes; (ii) O7 had a complex origin, involving multiple breaks associated with non-B DNA-forming motifs, formation of a microinversion, and ectopic repair in trans with the two homologous chromosomes; (iii) the O7 breakpoints carry a pre-inversion record of fragility, including a sequence insertion, and transposition with later inverted duplication of an Attacin immunity gene; and (iv) the O7 inversion relocated the major insulin signaling forkhead box subgroup O (foxo) gene in tight linkage with its antagonistic regulatory partner serine/threonine-protein kinase B (Akt1) and disrupted concerted evolution of the two inverted Attacin duplicates, reattaching them to dFOXO metabolic enhancers. Our findings suggest that O7 exerts antagonistic pleiotropic effects on reproduction and immunity, setting a framework to understand its relationship with climate change. Furthermore, they are relevant for fragility in genome rearrangement evolution and for current views on the contribution of breakage versus repair in shaping inversion-breakpoint junctions.
Collapse
Affiliation(s)
- Charikleia Karageorgiou
- Grup de Genòmica, Bioinformàtica i Biologia Evolutiva (GGBE), Departament de Genètica i de Microbiologia, Universitat Autonòma de Barcelona, Barcelona, Spain
| | - Rosa Tarrío
- Grup de Genòmica, Bioinformàtica i Biologia Evolutiva (GGBE), Departament de Genètica i de Microbiologia, Universitat Autonòma de Barcelona, Barcelona, Spain
| | - Francisco Rodríguez-Trelles
- Grup de Genòmica, Bioinformàtica i Biologia Evolutiva (GGBE), Departament de Genètica i de Microbiologia, Universitat Autonòma de Barcelona, Barcelona, Spain
| |
Collapse
|
32
|
Brázda V, Luo Y, Bartas M, Kaura P, Porubiaková O, Šťastný J, Pečinka P, Verga D, Da Cunha V, Takahashi TS, Forterre P, Myllykallio H, Fojta M, Mergny JL. G-Quadruplexes in the Archaea Domain. Biomolecules 2020; 10:biom10091349. [PMID: 32967357 PMCID: PMC7565180 DOI: 10.3390/biom10091349] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 09/16/2020] [Accepted: 09/18/2020] [Indexed: 11/26/2022] Open
Abstract
The importance of unusual DNA structures in the regulation of basic cellular processes is an emerging field of research. Amongst local non-B DNA structures, G-quadruplexes (G4s) have gained in popularity during the last decade, and their presence and functional relevance at the DNA and RNA level has been demonstrated in a number of viral, bacterial, and eukaryotic genomes, including humans. Here, we performed the first systematic search of G4-forming sequences in all archaeal genomes available in the NCBI database. In this article, we investigate the presence and locations of G-quadruplex forming sequences using the G4Hunter algorithm. G-quadruplex-prone sequences were identified in all archaeal species, with highly significant differences in frequency, from 0.037 to 15.31 potential quadruplex sequences per kb. While G4 forming sequences were extremely abundant in Hadesarchaea archeon (strikingly, more than 50% of the Hadesarchaea archaeon isolate WYZ-LMO6 genome is a potential part of a G4-motif), they were very rare in the Parvarchaeota phylum. The presence of G-quadruplex forming sequences does not follow a random distribution with an over-representation in non-coding RNA, suggesting possible roles for ncRNA regulation. These data illustrate the unique and non-random localization of G-quadruplexes in Archaea.
Collapse
Affiliation(s)
- Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Yu Luo
- Institut Curie, CNRS UMR9187, INSERM U1196, Universite Paris Saclay, 91400 Orsay, France
| | - Martin Bartas
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic
| | - Patrik Kaura
- Faculty of Mechanical Engineering, Brno University of Technology, Technicka 2896/2, 616 69 Brno, Czech Republic
| | - Otilia Porubiaková
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
- Faculty of Chemistry, Brno University of Technology, Purkyňova 464/118, 612 00 Brno, Czech Republic
| | - Jiří Šťastný
- Faculty of Mechanical Engineering, Brno University of Technology, Technicka 2896/2, 616 69 Brno, Czech Republic
- Mendel University in Brno, Zemědělská 1, 613 00 Brno, Czech Republic
| | - Petr Pečinka
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic
| | - Daniela Verga
- Institut Curie, CNRS UMR9187, INSERM U1196, Universite Paris Saclay, 91400 Orsay, France
| | - Violette Da Cunha
- Institut de Biologie Intégrative de la Cellule (I2BC), CNRS, Université Paris-Saclay, CEDEX, 91198 Gif-sur-Yvette, France
| | - Tomio S Takahashi
- Institut de Biologie Intégrative de la Cellule (I2BC), CNRS, Université Paris-Saclay, CEDEX, 91198 Gif-sur-Yvette, France
| | - Patrick Forterre
- Institut de Biologie Intégrative de la Cellule (I2BC), CNRS, Université Paris-Saclay, CEDEX, 91198 Gif-sur-Yvette, France
| | - Hannu Myllykallio
- Laboratoire d'Optique et Biosciences, Ecole Polytechnique, CNRS, INSERM, Institut Polytechnique de Paris, 91128 Palaiseau, France
| | - Miroslav Fojta
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Jean-Louis Mergny
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
- Laboratoire d'Optique et Biosciences, Ecole Polytechnique, CNRS, INSERM, Institut Polytechnique de Paris, 91128 Palaiseau, France
| |
Collapse
|
33
|
An J, Zheng W, Liang J, Xi Q, Chen R, Jia J, Lu X, Jakovlić I. Disrupted architecture and fast evolution of the mitochondrial genome of Argeia pugettensis (Isopoda): implications for speciation and fitness. BMC Genomics 2020; 21:607. [PMID: 32883208 PMCID: PMC7469299 DOI: 10.1186/s12864-020-07021-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 08/24/2020] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Argeia pugettensis is an isopod species that parasitizes other crustaceans. Its huge native geographic range spans the Pacific from China to California, but molecular data are available only for a handful of specimens from North-American populations. We sequenced and characterised the complete mitogenome of a specimen collected in the Yellow Sea. RESULTS It exhibited a barcode (cox1) similarity level of only 87-89% with North-American populations, which is unusually low for conspecifics. Its mitogenome is among the largest in isopods (≈16.5 Kbp), mostly due to a large duplicated palindromic genomic segment (2 Kbp) comprising three genes. However, it lost a segment comprising three genes, nad4L-trnP-nad6, and many genes exhibited highly divergent sequences in comparison to isopod orthologues, including numerous mutations, deletions and insertions. Phylogenetic and selection analyses corroborated that this is one of the handful of most rapidly evolving available isopod mitogenomes, and that it evolves under highly relaxed selection constraints (as opposed to positive selection). However, its nuclear 18S gene is highly conserved, which suggests that rapid evolution is limited to its mitochondrial genome. The cox1 sequence analysis indicates that elevated mitogenomic evolutionary rates are not shared by North-American conspecifics, which suggests a breakdown of cox1 barcoding in this species. CONCLUSIONS A highly architecturally disrupted mitogenome and decoupling of mitochondrial and nuclear rates would normally be expected to have strong negative impacts on the fitness of the organism, so the existence of this lineage is a puzzling evolutionary question. Additional studies are needed to assess the phylogenetic breadth of this disrupted mitochondrial architecture and its impact on fitness.
Collapse
Affiliation(s)
- Jianmei An
- School of Life Science, Shanxi Normal University, Linfen, 041000, PR China.
| | - Wanrui Zheng
- School of Life Science, Shanxi Normal University, Linfen, 041000, PR China
| | - Jielong Liang
- School of Life Science, Shanxi Normal University, Linfen, 041000, PR China
| | - Qianqian Xi
- School of Life Science, Shanxi Normal University, Linfen, 041000, PR China
| | - Ruru Chen
- School of Life Science, Shanxi Normal University, Linfen, 041000, PR China
| | - Junli Jia
- School of Life Science, Shanxi Normal University, Linfen, 041000, PR China
| | - Xia Lu
- School of Life Science, Shanxi Normal University, Linfen, 041000, PR China
| | - Ivan Jakovlić
- Bio-Transduction Lab, Wuhan, 430075, Hubei, PR China
| |
Collapse
|
34
|
Liu X, Wu X, Tan H, Xie B, Deng Y. Large inverted repeats identified by intra-specific comparison of mitochondrial genomes provide insights into the evolution of Agrocybe aegerita. Comput Struct Biotechnol J 2020; 18:2424-2437. [PMID: 33005305 PMCID: PMC7508693 DOI: 10.1016/j.csbj.2020.08.022] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 08/25/2020] [Accepted: 08/26/2020] [Indexed: 11/29/2022] Open
Abstract
Genomic structure and content of Agrocybe aegerita mitochondrial DNA contain essential information regarding the evolution of this gourmet mushroom. In this study, eight isolates of A. aegerita were sequenced and assembled into complete mitochondrial genomes. The mtDNA of the isolate Ag0067 contained two genotypes, both of which were quadripartite architecture consisting of two identical inverted repeats, separated by a small single-copy region and a large single-copy region. The only difference was opposite directions of the small single-copy region. The mtDNAs ranged from 116,329 bp to 134,035 bp, harboring two large identical inverted repeats. Genes of plasmid-origin were present in regions flanked by inverted repeat ID2. Most of the core genes evolved at a relatively low rate, whereas five tRNA genes located in corresponding regions of Ag0002:1-14000 and Ag0002:50001-61000 showed higher diversity. A long fragment inversion (10 Kb) was suggested to have occurred during the differentiation of two main clades, leading to two different gene orders. The number and distribution of the introns varied greatly among the A. aegerita mtDNAs. Fast invasion of short insertions likely resulted in the diversity of introns as well as other non-coding regions, increasing the variation of the mtDNAs. We raised a model about the evolution of the large repeats to explain the unusual features of A. aegerita mtDNAs. This study constructed quadripartite architecture of A. aegerita mtDNAs analogous to chloroplast DNA, proposed an interconversion model of the divergent mitochondrial genotypes with large inverted repeats. The findings could increase our knowledge of fungal evolution.
Collapse
Affiliation(s)
- Xinrui Liu
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, Fujian 350002, China
| | - Xiaoping Wu
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, Fujian 350002, China
| | - Hao Tan
- Mushroom Research Center, Soil and Fertilizer Institute, Sichuan Academy of Agricultural Sciences, Chengdu 610000, China
- School of Bioengineering, Jiangnan University, Wuxi 214062, China
| | - Baogui Xie
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, Fujian 350002, China
| | - Youjin Deng
- College of Life Science, Fujian Agriculture and Forestry University, Fuzhou, Fujian 350002, China
| |
Collapse
|
35
|
Bartas M, Brázda V, Bohálová N, Cantara A, Volná A, Stachurová T, Malachová K, Jagelská EB, Porubiaková O, Červeň J, Pečinka P. In-Depth Bioinformatic Analyses of Nidovirales Including Human SARS-CoV-2, SARS-CoV, MERS-CoV Viruses Suggest Important Roles of Non-canonical Nucleic Acid Structures in Their Lifecycles. Front Microbiol 2020; 11:1583. [PMID: 32719673 PMCID: PMC7347907 DOI: 10.3389/fmicb.2020.01583] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 06/17/2020] [Indexed: 12/15/2022] Open
Abstract
Non-canonical nucleic acid structures play important roles in the regulation of molecular processes. Considering the importance of the ongoing coronavirus crisis, we decided to evaluate genomes of all coronaviruses sequenced to date (stated more broadly, the order Nidovirales) to determine if they contain non-canonical nucleic acid structures. We discovered much evidence of putative G-quadruplex sites and even much more of inverted repeats (IRs) loci, which in fact are ubiquitous along the whole genomic sequence and indicate a possible mechanism for genomic RNA packaging. The most notable enrichment of IRs was found inside 5'UTR for IRs of size 12+ nucleotides, and the most notable enrichment of putative quadruplex sites (PQSs) was located before 3'UTR, inside 5'UTR, and before mRNA. This indicates crucial regulatory roles for both IRs and PQSs. Moreover, we found multiple G-quadruplex binding motifs in human proteins having potential for binding of SARS-CoV-2 RNA. Non-canonical nucleic acids structures in Nidovirales and in novel SARS-CoV-2 are therefore promising druggable structures that can be targeted and utilized in the future.
Collapse
Affiliation(s)
- Martin Bartas
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Václav Brázda
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
- Faculty of Chemistry, Brno University of Technology, Brno, Czechia
| | - Natália Bohálová
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Alessio Cantara
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Adriana Volná
- Department of Physics, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Tereza Stachurová
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Kateřina Malachová
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Eva B. Jagelská
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
| | - Otília Porubiaková
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czechia
- Faculty of Chemistry, Brno University of Technology, Brno, Czechia
| | - Jiří Červeň
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| | - Petr Pečinka
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czechia
| |
Collapse
|
36
|
Global analysis of inverted repeat sequences in human gene promoters reveals their non-random distribution and association with specific biological pathways. Genomics 2020; 112:2772-2777. [DOI: 10.1016/j.ygeno.2020.03.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2019] [Revised: 01/02/2020] [Accepted: 03/20/2020] [Indexed: 12/11/2022]
|
37
|
Dupeyron M, Baril T, Bass C, Hayward A. Phylogenetic analysis of the Tc1/mariner superfamily reveals the unexplored diversity of pogo-like elements. Mob DNA 2020; 11:21. [PMID: 32612713 PMCID: PMC7325037 DOI: 10.1186/s13100-020-00212-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Accepted: 04/08/2020] [Indexed: 01/18/2023] Open
Abstract
Background Tc1/mariner transposons are widespread DNA transposable elements (TEs) that have made important contributions to the evolution of host genomic complexity in metazoans. However, the evolution and diversity of the Tc1/mariner superfamily remains poorly understood. Following recent developments in genome sequencing and the availability of a wealth of new genomes, Tc1/mariner TEs have been identified in many new taxa across the eukaryotic tree of life. To date, the majority of studies focussing on Tc1/mariner elements have considered only a single host lineage or just a small number of host lineages. Thus, much remains to be learnt about the evolution of Tc1/mariner TEs by performing analyses that consider elements that originate from across host diversity. Results We mined the non-redundant database of NCBI using BLASTp searches, with transposase sequences from a diverse set of reference Tc1/mariner elements as queries. A total of 5158 Tc1/mariner elements were retrieved and used to reconstruct evolutionary relationships within the superfamily. The resulting phylogeny is well resolved and includes several new groups of Tc1/mariner elements. In particular, we identify a new family of plant-genome restricted Tc1/mariner elements, which we call PlantMar. We also show that the pogo family is much larger and more diverse than previously appreciated, and we review evidence for a potential revision of its status to become a separate superfamily. Conclusions Our study provides an overview of Tc1-mariner phylogeny and summarises the impressive diversity of Tc1-mariner TEs among sequenced eukaryotes. Tc1/mariner TEs are successful in a wide range of eukaryotes, especially unikonts (the taxonomic supergroup containing Amoebozoa, Opisthokonta, Breviatea, and Apusomonadida). In particular, ecdysozoa, and especially arthropods, emerge as important hosts for Tc1/mariner elements (except the PlantMar family). Meanwhile, the pogo family, which is by far the largest Tc1/mariner family, also includes many elements from fungal and chordate genomes. Moreover, there is evidence of the repeated exaptation of pogo elements in vertebrates, including humans, in addition to the well-known example of CENP-B. Collectively, our findings provide a considerable advancement in understanding of Tc1/mariner elements, and more generally they suggest that much work remains to improve understanding of the diversity and evolution of DNA TEs.
Collapse
Affiliation(s)
- Mathilde Dupeyron
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| | - Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| | - Chris Bass
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| |
Collapse
|
38
|
Zhang R, Ge F, Li H, Chen Y, Zhao Y, Gao Y, Liu Z, Yang L. PCIR: a database of Plant Chloroplast Inverted Repeats. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2019:5611292. [PMID: 31696928 PMCID: PMC6835207 DOI: 10.1093/database/baz127] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 09/26/2019] [Accepted: 10/07/2019] [Indexed: 01/06/2023]
Abstract
Inverted repeats (IRs) serve as potential biomarkers for genomic instability, DNA replication and other genetic processes. However, little information can be found in databases to help researchers recognize potential IR nucleotides, explore junction sites and annotate related functional genes. Plant Chloroplast Inverted Repeats (PCIR) is an interactive, web-based platform containing various sequenced chloroplast genomes that enables detection, searching and visualization of large-scale detailed information on IRs. PCIR contains many datasets, including 21 433 IRs, 113 plants chloroplast genomes, 16 948 functional genes and 21 659 visual maps. This database offers an online prediction tool for detecting IRs based on DNA sequences. PCIR can also analyze phylogenetic relationships using IR information among different species and provide users with high-quality marker maps. This database will be a valuable resource for IR distribution patterns, related genes and architectural features.
Collapse
Affiliation(s)
- Rui Zhang
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| | - Fangfang Ge
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| | - Huayang Li
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| | - Yudong Chen
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| | - Ying Zhao
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| | - Ying Gao
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| | - Zhiguo Liu
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| | - Long Yang
- Agricultural Big-Data Research Center and College of Plant Protection, Shandong Agricultural University, Tai'an 271018, China
| |
Collapse
|
39
|
The Rich World of p53 DNA Binding Targets: The Role of DNA Structure. Int J Mol Sci 2019; 20:ijms20225605. [PMID: 31717504 PMCID: PMC6888028 DOI: 10.3390/ijms20225605] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Revised: 10/29/2019] [Accepted: 11/08/2019] [Indexed: 12/14/2022] Open
Abstract
The tumor suppressor functions of p53 and its roles in regulating the cell cycle, apoptosis, senescence, and metabolism are accomplished mainly by its interactions with DNA. p53 works as a transcription factor for a significant number of genes. Most p53 target genes contain so-called p53 response elements in their promoters, consisting of 20 bp long canonical consensus sequences. Compared to other transcription factors, which usually bind to one concrete and clearly defined DNA target, the p53 consensus sequence is not strict, but contains two repeats of a 5′RRRCWWGYYY3′ sequence; therefore it varies remarkably among target genes. Moreover, p53 binds also to DNA fragments that at least partially and often completely lack this consensus sequence. p53 also binds with high affinity to a variety of non-B DNA structures including Holliday junctions, cruciform structures, quadruplex DNA, triplex DNA, DNA loops, bulged DNA, and hemicatenane DNA. In this review, we summarize information of the interactions of p53 with various DNA targets and discuss the functional consequences of the rich world of p53 DNA binding targets for its complex regulatory functions.
Collapse
|
40
|
Čutová M, Manta J, Porubiaková O, Kaura P, Šťastný J, Jagelská EB, Goswami P, Bartas M, Brázda V. Divergent distributions of inverted repeats and G-quadruplex forming sequences in Saccharomyces cerevisiae. Genomics 2019; 112:1897-1901. [PMID: 31706022 DOI: 10.1016/j.ygeno.2019.11.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 09/13/2019] [Accepted: 11/01/2019] [Indexed: 12/17/2022]
Abstract
The importance of DNA structure in the regulation of basic cellular processes is an emerging field of research. Among local non-B DNA structures, inverted repeat (IR) sequences that form cruciforms and G-rich sequences that form G-quadruplexes (G4) are found in all prokaryotic and eukaryotic organisms and are targets for regulatory proteins. We analyzed IRs and G4 sequences in the genome of the most important biotechnology microorganism, S. cerevisiae. IR and G4-prone sequences are enriched in specific genomic locations and differ markedly between mitochondrial and nuclear DNA. While G4s are overrepresented in telomeres and regions surrounding tRNAs, IRs are most enriched in centromeres, rDNA, replication origins and surrounding tRNAs. Mitochondrial DNA is enriched in both IR and G4-prone sequences relative to the nuclear genome. This extensive analysis of local DNA structures adds to the emerging picture of their importance in genome maintenance, DNA replication and transcription of subsets of genes.
Collapse
Affiliation(s)
- Michaela Čutová
- Brno University of Technology, Faculty of Chemistry, Purkyňova 118, 612 00 Brno, Czech Republic
| | - Jacinta Manta
- Brno University of Technology, Faculty of Chemistry, Purkyňova 118, 612 00 Brno, Czech Republic
| | - Otília Porubiaková
- Brno University of Technology, Faculty of Chemistry, Purkyňova 118, 612 00 Brno, Czech Republic
| | - Patrik Kaura
- Brno University of Technology, Faculty of Mechanical Engineering, Technická 2896/2, 616 69 Brno, Czech Republic
| | - Jiří Šťastný
- Brno University of Technology, Faculty of Mechanical Engineering, Technická 2896/2, 616 69 Brno, Czech Republic; Mendel University in Brno, Zemědělská 1665/1, 61300 Brno, Czech Republic
| | - Eva B Jagelská
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Pratik Goswami
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Martin Bartas
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, Ostrava 710 00, Czech Republic
| | - Václav Brázda
- Brno University of Technology, Faculty of Chemistry, Purkyňova 118, 612 00 Brno, Czech Republic; Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic.
| |
Collapse
|
41
|
Brázda V, Kolomazník J, Lýsek J, Bartas M, Fojta M, Šťastný J, Mergny JL. G4Hunter web application: a web server for G-quadruplex prediction. Bioinformatics 2019; 35:3493-3495. [PMID: 30721922 PMCID: PMC6748775 DOI: 10.1093/bioinformatics/btz087] [Citation(s) in RCA: 108] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 01/08/2019] [Accepted: 02/04/2019] [Indexed: 12/26/2022] Open
Abstract
MOTIVATION Expanding research highlights the importance of guanine quadruplex structures. Therefore, easy-accessible tools for quadruplex analyses in DNA and RNA molecules are important for the scientific community. RESULTS We developed a web version of the G4Hunter application. This new web-based server is a platform-independent and user-friendly application for quadruplex analyses. It allows retrieval of gene/nucleotide sequence entries from NCBI databases and provides complete characterization of localization and quadruplex propensity of quadruplex-forming sequences. The G4Hunter web application includes an interactive graphical data representation with many useful options including visualization, sorting, data storage and export. AVAILABILITY AND IMPLEMENTATION G4Hunter web application can be accessed at: http://bioinformatics.ibp.cz. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Václav Brázda
- Department of biophysical chemistry and molecular oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czech Republic
| | - Jan Kolomazník
- Department of Informatics, Mendel University in Brno, Brno, Czech Republic
| | - Jiří Lýsek
- Department of Informatics, Mendel University in Brno, Brno, Czech Republic
| | - Martin Bartas
- Department of biophysical chemistry and molecular oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Miroslav Fojta
- Department of biophysical chemistry and molecular oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czech Republic
| | - Jiří Šťastný
- Department of Informatics, Mendel University in Brno, Brno, Czech Republic
- Brno University of Technology, Technicka 2896/2, 61969 Brno, Czech Republic
| | - Jean-Louis Mergny
- Department of biophysical chemistry and molecular oncology, Institute of Biophysics, Academy of Sciences of the Czech Republic, Brno, Czech Republic
- ARNA Laboratory, Inserm U1212, CNRS UMR5320, IECB, Université de Bordeaux, Pessac, France
| |
Collapse
|
42
|
Bartas M, Čutová M, Brázda V, Kaura P, Šťastný J, Kolomazník J, Coufal J, Goswami P, Červeň J, Pečinka P. The Presence and Localization of G-Quadruplex Forming Sequences in the Domain of Bacteria. Molecules 2019; 24:molecules24091711. [PMID: 31052562 PMCID: PMC6539912 DOI: 10.3390/molecules24091711] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 04/30/2019] [Accepted: 05/01/2019] [Indexed: 01/09/2023] Open
Abstract
The role of local DNA structures in the regulation of basic cellular processes is an emerging field of research. Amongst local non-B DNA structures, the significance of G-quadruplexes was demonstrated in the last decade, and their presence and functional relevance has been demonstrated in many genomes, including humans. In this study, we analyzed the presence and locations of G-quadruplex-forming sequences by G4Hunter in all complete bacterial genomes available in the NCBI database. G-quadruplex-forming sequences were identified in all species, however the frequency differed significantly across evolutionary groups. The highest frequency of G-quadruplex forming sequences was detected in the subgroup Deinococcus-Thermus, and the lowest frequency in Thermotogae. G-quadruplex forming sequences are non-randomly distributed and are favored in various evolutionary groups. G-quadruplex-forming sequences are enriched in ncRNA segments followed by mRNAs. Analyses of surrounding sequences showed G-quadruplex-forming sequences around tRNA and regulatory sequences. These data point to the unique and non-random localization of G-quadruplex-forming sequences in bacterial genomes.
Collapse
Affiliation(s)
- Martin Bartas
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic.
| | - Michaela Čutová
- Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00 Brno, Czech Republic.
| | - Václav Brázda
- Faculty of Chemistry, Brno University of Technology, Purkyňova 118, 612 00 Brno, Czech Republic.
- Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., Královopolská 135, 612 65 Brno, Czech Republic.
| | - Patrik Kaura
- Faculty of Mechanical Engineering, Brno University of Technology, Technicka 2896/2, 616 69 Brno, Czech Republic.
| | - Jiří Šťastný
- Faculty of Mechanical Engineering, Brno University of Technology, Technicka 2896/2, 616 69 Brno, Czech Republic.
- Department of Informatics, Mendel University in Brno, Zemedelska 1665/1, 61300 Brno, Czech Republic.
| | - Jan Kolomazník
- Department of Informatics, Mendel University in Brno, Zemedelska 1665/1, 61300 Brno, Czech Republic.
| | - Jan Coufal
- Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., Královopolská 135, 612 65 Brno, Czech Republic.
| | - Pratik Goswami
- Institute of Biophysics, Academy of Sciences of the Czech Republic v.v.i., Královopolská 135, 612 65 Brno, Czech Republic.
| | - Jiří Červeň
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic.
| | - Petr Pečinka
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic.
| |
Collapse
|
43
|
Dupeyron M, Singh KS, Bass C, Hayward A. Evolution of Mutator transposable elements across eukaryotic diversity. Mob DNA 2019; 10:12. [PMID: 30988700 PMCID: PMC6446971 DOI: 10.1186/s13100-019-0153-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 03/01/2019] [Indexed: 11/15/2022] Open
Abstract
Background Mutator-like elements (MULEs) are a significant superfamily of DNA transposons on account of their: (i) great transpositional activity and propensity for insertion in or near gene sequences, (ii) their consequent high mutagenic capacity, and, (iii) their tendency to acquire host gene fragments. Consequently, MULEs are important genetic tools and represent a key study system for research into host-transposon interactions. Yet, while several studies have focused on the impacts of MULEs on crop and fungus genomes, their evolution remains poorly explored. Results We perform comprehensive bioinformatic and phylogenetic analyses to address currently available MULE diversity and reconstruct evolution for the group. For this, we mine MULEs from online databases, and combine search results with available transposase sequences retrieved from previously published studies. Our analyses uncover two entirely new MULE clades that contain elements almost entirely restricted to arthropod hosts, considerably expanding the set of MULEs known from this group, suggesting that many additional MULEs may await discovery from further arthropod genomes. In several cases, close relationships occur between MULEs recovered from distantly related host organisms, suggesting that horizontal transfer events may have played an important role in the evolution of the group. However, it is apparent that MULEs from plants remain separate from MULEs identified from other host groups. MULE structure varies considerably across phylogeny, and TIR length is shown to vary greatly both within and between MULE groups. Our phylogeny suggests that MULE diversity is clustered in well-supported groups, typically according to host taxonomy. With reference to this, we make suggestions on how MULE diversity can be partitioned to provide a robust taxonomic framework. Conclusions Our study represents a considerable advance in the understanding of MULE diversity, host range and evolution, and provides a taxonomic framework for the classification of further MULE elements that await discovery. Our findings also raise a number of questions relating to MULE biology, suggesting that this group will provide a rich avenue for future study. Electronic supplementary material The online version of this article (10.1186/s13100-019-0153-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mathilde Dupeyron
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| | - Kumar S Singh
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| | - Chris Bass
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| | - Alexander Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall, TR10 9FE UK
| |
Collapse
|
44
|
Bartas M, Bažantová P, Brázda V, Liao JC, Červeň J, Pečinka P. Identification of Distinct Amino Acid Composition of Human Cruciform Binding Proteins. Mol Biol 2019. [DOI: 10.1134/s0026893319010023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
45
|
Cechová J, Lýsek J, Bartas M, Brázda V. Complex analyses of inverted repeats in mitochondrial genomes revealed their importance and variability. Bioinformatics 2019; 34:1081-1085. [PMID: 29126205 PMCID: PMC6030915 DOI: 10.1093/bioinformatics/btx729] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2017] [Accepted: 11/07/2017] [Indexed: 01/08/2023] Open
Abstract
Motivation The NCBI database contains mitochondrial DNA (mtDNA) genomes from numerous species. We investigated the presence and locations of inverted repeat sequences (IRs) in these mtDNA sequences, which are known to be important for regulating nuclear genomes. Results IRs were identified in mtDNA in all species. IR lengths and frequencies correlate with evolutionary age and the greatest variability was detected in subgroups of plants and fungi and the lowest variability in mammals. IR presence is non-random and evolutionary favoured. The frequency of IRs generally decreased with IR length, but not for IRs 24 or 30 bp long, which are 1.5 times more abundant. IRs are enriched in sequences from the replication origin, followed by D-loop, stem-loop and miscellaneous sequences, pointing to the importance of IRs in regulatory regions of mitochondrial DNA. Availability and implementation Data were produced using Palindrome analyser, freely available on the web at http://bioinformatics.ibp.cz. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jana Cechová
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, The Czech Academy of Sciences, 612 65 Brno, Czech Republic
| | - Jirí Lýsek
- Department of Informatics, Mendel University in Brno, 613 00 Brno, Czech Republic
| | - Martin Bartas
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, The Czech Academy of Sciences, 612 65 Brno, Czech Republic.,Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 701 03 Ostrava, Czech Republic
| | - Václav Brázda
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics, The Czech Academy of Sciences, 612 65 Brno, Czech Republic
| |
Collapse
|
46
|
Zou H, Jakovlić I, Zhang D, Chen R, Mahboob S, Al-Ghanim KA, Al-Misned F, Li WX, Wang GT. The complete mitochondrial genome of Cymothoa indica has a highly rearranged gene order and clusters at the very base of the Isopoda clade. PLoS One 2018; 13:e0203089. [PMID: 30180209 PMCID: PMC6122833 DOI: 10.1371/journal.pone.0203089] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Accepted: 08/14/2018] [Indexed: 11/18/2022] Open
Abstract
As a result of great diversity in life histories and a large number of described species, taxonomic and phylogenetic uncertainty permeates the entire crustacean order of Isopoda. Large molecular datasets capable of providing sufficiently high phylogenetic resolution, such as mitochondrial genomes (mitogenomes), are needed to infer their evolutionary history with confidence, but isopod mitogenomes remain remarkably poorly represented in public databases. We sequenced the complete mitogenome of Cymothoa indica, a species belonging to a family from which no mitochondrial genome was sequenced yet, Cymothoidae. The mitogenome (circular, 14484 bp, A+T = 63.8%) is highly compact, appears to be missing two tRNA genes (trnI and trnE), and exhibits a unique gene order with a large number of rearrangements. High compactness and the existence of palindromes indicate that the mechanism behind these rearrangements might be associated with linearization events in its evolutionary history, similar to those proposed for isopods from the Armadillidium genus (Oniscidea). Isopods might present an important model system to study the proposed discontinuity in the dynamics of mitochondrial genomic architecture evolution. Phylogenetic analyses (Bayesian Inference and Maximum Likelihood) conducted using nucleotide sequences of all mitochondrial genes resolved Oniscidea and Cymothoida suborders as paraphyletic. Cymothoa indica was resolved as a sister group (basal) to all remaining isopods, which challenges the accepted isopod phylogeny, where Cymothoida are the most derived, and Phreatoicidea the most basal isopod group. There is growing evidence that Cymothoida suborder might be split into two evolutionary distant clades, with parasitic species being the most basal split in the Isopoda clade, but a much larger amount of molecular resources carrying a high phylogenetic resolution will be needed to infer the remarkably complex evolutionary history of this group of animals with confidence.
Collapse
Affiliation(s)
- Hong Zou
- Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture, and State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, P. R. China
| | | | - Dong Zhang
- Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture, and State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, P. R. China
- University of Chinese Academy of Sciences, Beijing, P. R. China
| | - Rong Chen
- Bio-Transduction Lab, Biolake, Wuhan, P. R. China
| | - Shahid Mahboob
- Department of Zoology, College of Science, King Saud University, Riyadh, Saudi Arabia
- Department of Zoology, GC University, Faisalabad, Pakistan
| | | | - Fahad Al-Misned
- Department of Zoology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Wen-Xiang Li
- Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture, and State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, P. R. China
| | - Gui-Tang Wang
- Key Laboratory of Aquaculture Disease Control, Ministry of Agriculture, and State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, P. R. China
| |
Collapse
|
47
|
Mandal S, Selvam S, Cui Y, Hoque ME, Mao H. Mechanical Cooperativity in DNA Cruciform Structures. Chemphyschem 2018; 19:2627-2634. [PMID: 29992736 DOI: 10.1002/cphc.201800480] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Indexed: 01/20/2023]
Abstract
Unlike short-range chemical bonds that maintain chemical properties of a biological molecule, long-range mechanical interactions determine mechanochemical properties of molecules. Limited by experimental approaches, however, direct quantification of such mechanical interactions is challenging. Using magneto-optical tweezers, herein we found torque can change the topology and mechanochemical property of DNA cruciform, a naturally occurring structure consisting of two opposing hairpin arms. Both mechanical and thermodynamic stabilities of DNA cruciforms increase with positive torque, which have been attributed to the topological coupling between DNA template and the cruciform. The coupling exists simultaneously in both arms of a cruciform, which coordinates the folding and unfolding of the cruciform, leading to a mechanical cooperativity not observed previously. As DNA torque readily varies during transcriptions, our finding suggests that DNA cruciforms can modulate transcriptions by adjusting their properties according to the torque.
Collapse
Affiliation(s)
- Shankar Mandal
- Department of Chemistry & Biochemistry and School of Biomedical Sciences, Kent State University, Kent, OH, 44242, USA
| | - Sangeetha Selvam
- Department of Chemistry & Biochemistry and School of Biomedical Sciences, Kent State University, Kent, OH, 44242, USA
| | - Yunxi Cui
- Department of Chemistry & Biochemistry and School of Biomedical Sciences, Kent State University, Kent, OH, 44242, USA.,State Key Laboratory of Medicinal Chemical Biology, Nankai University, 94 Weijin Road, Tianjin, 300071, China
| | - Mohammed Enamul Hoque
- Department of Chemistry & Biochemistry and School of Biomedical Sciences, Kent State University, Kent, OH, 44242, USA
| | - Hanbin Mao
- Department of Chemistry & Biochemistry and School of Biomedical Sciences, Kent State University, Kent, OH, 44242, USA
| |
Collapse
|
48
|
Complex Analyses of Short Inverted Repeats in All Sequenced Chloroplast DNAs. BIOMED RESEARCH INTERNATIONAL 2018; 2018:1097018. [PMID: 30140690 PMCID: PMC6081594 DOI: 10.1155/2018/1097018] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Revised: 06/19/2018] [Accepted: 07/12/2018] [Indexed: 01/14/2023]
Abstract
Chloroplasts are key organelles in the management of oxygen in algae and plants and are therefore crucial for all living beings that consume oxygen. Chloroplasts typically contain a circular DNA molecule with nucleus-independent replication and heredity. Using "palindrome analyser" we performed complete analyses of short inverted repeats (S-IRs) in all chloroplast DNAs (cpDNAs) available from the NCBI genome database. Our results provide basic parameters of cpDNAs including comparative information on localization, frequency, and differences in S-IR presence. In a total of 2,565 cpDNA sequences available, the average frequency of S-IRs in cpDNA genomes is 45 S-IRs/per kbp, significantly higher than that found in mitochondrial DNA sequences. The frequency of S-IRs in cpDNAs generally decreased with S-IR length, but not for S-IRs 15, 22, 24, or 27 bp long, which are significantly more abundant than S-IRs with other lengths. These results point to the importance of specific S-IRs in cpDNA genomes. Moreover, comparison by Levenshtein distance of S-IR similarities showed that a limited number of S-IR sequences are shared in the majority of cpDNAs. S-IRs are not located randomly in cpDNAs, but are length-dependently enriched in specific locations, including the repeat region, stem, introns, and tRNA regions. The highest enrichment was found for 12 bp and longer S-IRs in the stem-loop region followed by 12 bp and longer S-IRs located before the repeat region. On the other hand, S-IRs are relatively rare in rRNA sequences and around introns. These data show nonrandom and conserved arrangements of S-IRs in chloroplast genomes.
Collapse
|
49
|
Čechová J, Coufal J, Jagelská EB, Fojta M, Brázda V. p73, like its p53 homolog, shows preference for inverted repeats forming cruciforms. PLoS One 2018; 13:e0195835. [PMID: 29668749 PMCID: PMC5905954 DOI: 10.1371/journal.pone.0195835] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Accepted: 04/01/2018] [Indexed: 12/12/2022] Open
Abstract
p73 is a member of the p53 protein family and has essential functions in several signaling pathways involved in development, differentiation, DNA damage responses and cancer. As a transcription factor, p73 achieves these functions by binding to consensus DNA sequences and p73 shares at least partial target DNA binding sequence specificity with p53. Transcriptional activation by p73 has been demonstrated for more than fifty p53 targets in yeast and/or human cancer cell lines. It has also been shown previously that p53 binding to DNA is strongly dependent on DNA topology and the presence of inverted repeats that can form DNA cruciforms, but whether p73 transcriptional activity has similar dependence has not been investigated. Therefore, we evaluated p73 binding to a set of p53-response elements with identical theoretical binding affinity in their linear state, but different probabilities to form extra helical structures. We show by a yeast-based assay that transactivation in vivo correlated more with the relative propensity of a response element to form cruciforms than to its expected in vitro DNA binding affinity. Structural features of p73 target sites are therefore likely to be an important determinant of its transactivation function.
Collapse
Affiliation(s)
- Jana Čechová
- The Czech Academy of Sciences, Institute of Biophysics, Královopolská, Brno, Czech Republic
- Department of Biochemistry, Faculty of Science, Masaryk University, Kotlarska, Brno, Czech Republic
| | - Jan Coufal
- The Czech Academy of Sciences, Institute of Biophysics, Královopolská, Brno, Czech Republic
| | - Eva B. Jagelská
- The Czech Academy of Sciences, Institute of Biophysics, Královopolská, Brno, Czech Republic
| | - Miroslav Fojta
- The Czech Academy of Sciences, Institute of Biophysics, Královopolská, Brno, Czech Republic
| | - Václav Brázda
- The Czech Academy of Sciences, Institute of Biophysics, Královopolská, Brno, Czech Republic
- * E-mail:
| |
Collapse
|
50
|
Miura O, Ogake T, Ohyama T. Requirement or exclusion of inverted repeat sequences with cruciform-forming potential in Escherichia coli revealed by genome-wide analyses. Curr Genet 2018; 64:945-958. [PMID: 29484452 PMCID: PMC6060812 DOI: 10.1007/s00294-018-0815-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Revised: 02/16/2018] [Accepted: 02/19/2018] [Indexed: 12/31/2022]
Abstract
Inverted repeat (IR) sequences are DNA sequences that read the same from 5' to 3' in each strand. Some IRs can form cruciforms under the stress of negative supercoiling, and these IRs are widely found in genomes. However, their biological significance remains unclear. The aim of the current study is to explore this issue further. We constructed the first Escherichia coli genome-wide comprehensive map of IRs with cruciform-forming potential. Based on the map, we performed detailed and quantitative analyses. Here, we report that IRs with cruciform-forming potential are statistically enriched in the following five regions: the adjacent regions downstream of the stop codon-coding sites (referred to as the stop codons), on and around the positions corresponding to mRNA ends (referred to as the gene ends), ~ 20 to ~45 bp upstream of the start codon-coding sites (referred to as the start codons) within the 5'-UTR (untranslated region), ~ 25 to ~ 60 bp downstream of the start codons, and promoter regions. For the adjacent regions downstream of the stop codons and on and around the gene ends, most of the IRs with a repeat unit length of ≥ 8 bp and a spacer size of ≤ 8 bp were parts of the intrinsic terminators, regardless of the location, and presumably used for Rho-independent transcription termination. In contrast, fewer IRs were present in the small region preceding the start codons. In E. coli, IRs with cruciform-forming potential are actively placed or excluded in the regulatory regions for the initiation and termination of transcription and translation, indicating their deep involvement or influence in these processes.
Collapse
Affiliation(s)
- Osamu Miura
- Department of Biology, Faculty of Education and Integrated Arts and Sciences, Waseda University, 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo, 162-8480, Japan
- Major in Integrative Bioscience and Biomedical Engineering, Graduate School of Science and Engineering, Waseda University, 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo, 162-8480, Japan
| | - Toshihiro Ogake
- Major in Integrative Bioscience and Biomedical Engineering, Graduate School of Science and Engineering, Waseda University, 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo, 162-8480, Japan
| | - Takashi Ohyama
- Department of Biology, Faculty of Education and Integrated Arts and Sciences, Waseda University, 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo, 162-8480, Japan.
- Major in Integrative Bioscience and Biomedical Engineering, Graduate School of Science and Engineering, Waseda University, 2-2 Wakamatsu-cho, Shinjuku-ku, Tokyo, 162-8480, Japan.
| |
Collapse
|