1
|
Direk T, Doluca O. Computational Identification and Illustrative Standard for Representation of Unimolecular G-Quadruplex Secondary Structures (CIIS-GQ). J Comput Aided Mol Des 2024; 38:35. [PMID: 39470927 DOI: 10.1007/s10822-024-00573-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Accepted: 08/30/2024] [Indexed: 11/01/2024]
Abstract
G-quadruplexes refer to a large group of nucleic acid-based structures. In recent years, they have been attracting attention due to their biological roles in the telomeres and promoter regions. These structures show wide diversity in topology, however, development of methods for structural classification of G-quadruplexes has been evaded for a long time. There has been a limited number of studies aiming to bring forth a secondary structure classification method. The situation was even more complex than imagined, since the discovery of bulged and mismatched G-quadruplexes while most of the available tools fail to distinguish these non-canonical G-quadruplex motifs. Moreover, the interpretation of their analysis output still requires expert knowledge. In this study, we propose a new method for identification of unimolecular G-Quadruplexes and classification by secondary structures based on three-dimensional structural data. Briefly, coordinates of guanines are processed to identify tetrads, loops and bulges. Then, we present the secondary structure in the form of a depiction which shows the loop types, bulges, and guanines that participate in each tetrad. Moreover, CIIS-GQ identifies non-guanine nucleotides that joins the G-tetrads and forms multiplets. Finally, the results of our study are compared with DSSR and ElTetrado classification methods, and the advantages of the proposed depiction method for representing secondary structures were discussed. The source code of the method can be accessed via https://github.com/TugayDirek/CIIS-GQ .
Collapse
Affiliation(s)
- Tugay Direk
- Department of Software Engineering, Izmir University of Economics, İzmir, Turkey
| | - Osman Doluca
- Department of Biomedical Engineering, Izmir University of Economics, İzmir, Turkey.
| |
Collapse
|
2
|
Choudhury SD, Kumar P, Choudhury D. Bioactive nutraceuticals as G4 stabilizers: potential cancer prevention and therapy-a critical review. NAUNYN-SCHMIEDEBERG'S ARCHIVES OF PHARMACOLOGY 2024; 397:3585-3616. [PMID: 38019298 DOI: 10.1007/s00210-023-02857-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 11/13/2023] [Indexed: 11/30/2023]
Abstract
G-quadruplexes (G4) are non-canonical, four-stranded, nucleic acid secondary structures formed in the guanine-rich sequences, where guanine nucleotides associate with each other via Hoogsteen hydrogen bonding. These structures are widely found near the functional regions of the mammalian genome, such as telomeres, oncogenic promoters, and replication origins, and play crucial regulatory roles in replication and transcription. Destabilization of G4 by various carcinogenic agents allows oncogene overexpression and extension of telomeric ends resulting in dysregulation of cellular growth-promoting oncogenesis. Therefore, targeting and stabilizing these G4 structures with potential ligands could aid cancer prevention and therapy. The field of G-quadruplex targeting is relatively nascent, although many articles have demonstrated the effect of G4 stabilization on oncogenic expressions; however, no previous study has provided a comprehensive analysis about the potency of a wide variety of nutraceuticals and some of their derivatives in targeting G4 and the lattice of oncogenic cell signaling cascade affected by them. In this review, we have discussed bioactive G4-stabilizing nutraceuticals, their sources, mode of action, and their influence on cellular signaling, and we believe our insight would bring new light to the current status of the field and motivate researchers to explore this relatively poorly studied arena.
Collapse
Affiliation(s)
- Satabdi Datta Choudhury
- Department of Chemistry and Biochemistry, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India
| | - Prateek Kumar
- School of Basic Sciences, Indian Institute of Technology (IIT), Mandi, Himachal Pradesh, 175005, India
| | - Diptiman Choudhury
- Department of Chemistry and Biochemistry, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India.
- Centre for Excellence in Emerging Materials, Thapar Institute of Engineering and Technology, Patiala, Punjab, 147004, India.
| |
Collapse
|
3
|
Zhong HS, Dong MJ, Gao F. G4Bank: A database of experimentally identified DNA G-quadruplex sequences. Interdiscip Sci 2023; 15:515-523. [PMID: 37389723 DOI: 10.1007/s12539-023-00577-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 06/18/2023] [Accepted: 06/19/2023] [Indexed: 07/01/2023]
Abstract
G-quadruplex (G4), a non-canonical nucleic acid structure, has been suggested to play a key role in important cellular processes including transcription, replication and cancer development. Recently, high-throughput sequencing approaches for G4 detection have provided a large amount of experimentally identified G4 data that reveal genome-wide G4 landscapes and enable the development of new methods for predicting potential G4s from sequences. Although several existing databases provide G4 experimental data and relevant biological information from different perspectives, there is no dedicated database to collect and analyze DNA G4 experimental data genome-widely. Here, we constructed G4Bank, a database of experimentally identified DNA G-quadruplex sequences. A total of 6,915,983 DNA G4s were collected from 13 organisms, and state-of-the-art prediction methods were performed to filter and analyze the G4 data. Therefore, G4Bank will facilitate users to access comprehensive G4 experimental data and enable sequence feature analysis of G4 for further investigation. The database of the experimentally identified DNA G-quadruplex sequences can be accessed at http://tubic.tju.edu.cn/g4bank/ .
Collapse
Affiliation(s)
- Hong-Sheng Zhong
- Department of Physics, School of Science, Tianjin University, Tianjin, 300072, China
| | - Mei-Jing Dong
- Department of Physics, School of Science, Tianjin University, Tianjin, 300072, China
| | - Feng Gao
- Department of Physics, School of Science, Tianjin University, Tianjin, 300072, China.
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, 300072, China.
- SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, 300072, China.
| |
Collapse
|
4
|
Zurkowski M, Zok T, Szachniuk M. DrawTetrado to create layer diagrams of G4 structures. Bioinformatics 2022; 38:3835-3836. [PMID: 35703937 PMCID: PMC9344840 DOI: 10.1093/bioinformatics/btac394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 05/13/2022] [Accepted: 06/13/2022] [Indexed: 11/14/2022] Open
Abstract
Motivation Quadruplexes are specific 3D structures found in nucleic acids. Due to the exceptional properties of these motifs, their exploration with the general-purpose bioinformatics methods can be problematic or insufficient. The same applies to visualizing their structure. A hand-drawn layer diagram is the most common way to represent the quadruplex anatomy. No molecular visualization software generates such a structural model based on atomic coordinates. Results DrawTetrado is an open-source Python program for automated visualization targeting the structures of quadruplexes and G4-helices. It generates static layer diagrams that represent structural data in a pseudo-3D perspective. The possibility to set color schemes, nucleotide labels, inter-element distances or angle of view allows for easy customization of the output drawing. Availability and implementation The program is available under the MIT license at https://github.com/RNApolis/drawtetrado.
Collapse
Affiliation(s)
- Michal Zurkowski
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, Poznan, 60-965, Poland
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, Poznan, 60-965, Poland
| | - Marta Szachniuk
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, Poznan, 60-965, Poland.,Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, Poznan, 61-704, Poland
| |
Collapse
|
5
|
Zok T, Kraszewska N, Miskiewicz J, Pielacinska P, Zurkowski M, Szachniuk M. ONQUADRO: a database of experimentally determined quadruplex structures. Nucleic Acids Res 2022; 50:D253-D258. [PMID: 34986600 PMCID: PMC8728301 DOI: 10.1093/nar/gkab1118] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 10/22/2021] [Accepted: 10/25/2021] [Indexed: 01/02/2023] Open
Abstract
ONQUADRO is an advanced database system that supports the study of the structures of canonical and non-canonical quadruplexes. It combines a relational database that collects comprehensive information on tetrads, quadruplexes, and G4-helices; programs to compute structure parameters and visualise the data; scripts for statistical analysis; automatic updates and newsletter modules; and a web application that provides a user interface. The database is a self-updating resource, with new information arriving once a week. The preliminary data are downloaded from the Protein Data Bank, processed, annotated, and completed. As of August 2021, ONQUADRO contains 1,661 tetrads, 518 quadruplexes, and 30 G4-helices found in 467 experimentally determined 3D structures of nucleic acids. Users can view and download their description: sequence, secondary structure (dot-bracket, classical diagram, arc diagram), tertiary structure (ball-and-stick, surface or vdw-ball model, layer diagram), planarity, twist, rise, chi angle (value and type), loop characteristics, strand directionality, metal ions, ONZ, and Webba da Silva classification (the latter by loop topology and tetrad combination), origin structure ID, assembly ID, experimental method, and molecule type. The database is freely available at https://onquadro.cs.put.poznan.pl/. It can be used on both desktop computers and mobile devices.
Collapse
Affiliation(s)
- Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Natalia Kraszewska
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Joanna Miskiewicz
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Paulina Pielacinska
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Michal Zurkowski
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
| | - Marta Szachniuk
- Institute of Computing Science, Poznan University of Technology, 60-965 Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| |
Collapse
|
6
|
Detecting G4 unwinding. Methods Enzymol 2022; 672:261-281. [DOI: 10.1016/bs.mie.2022.03.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
7
|
Mazzei L, Musiani F, Żerko S, Koźminski W, Cianci M, Beniamino Y, Ciurli S, Zambelli B. Structure, dynamics, and function of SrnR, a transcription factor for nickel-dependent gene expression. Metallomics 2021; 13:6445039. [PMID: 34850061 DOI: 10.1093/mtomcs/mfab069] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 11/18/2021] [Indexed: 11/14/2022]
Abstract
Streptomyces griseus, a bacterium producing antibacterial drugs and featuring possible application in phytoremediation, expresses two metal-dependent superoxide dismutase (SOD) enzymes, containing either Fe(II) or Ni(II) in their active site. In particular, the alternative expression of the two proteins occurs in a metal-dependent mode, with the Fe(II)-enzyme gene (sodF) repressed at high intracellular Ni(II) concentrations by a two-component system (TCS). This complex involves two proteins, namely SgSrnR and SgSrnQ, which represent the transcriptional regulator and the Ni(II) sensor of the system, respectively. SgSrnR belongs to the ArsR/SmtB family of metal-dependent transcription factors; in the apo-form and in the absence of SgSrnQ, it can bind the DNA operator of sodF, upregulating gene transcription. According to a recently proposed hypothesis, Ni(II) binding to SgSrnQ would promote its interaction with SgSrnR, causing the release of the complex from DNA and the consequent downregulation of the sodF expression. SgSrnQ is predicted to be highly disordered, thus the understanding, at the molecular level, of how the SgSrnR/SgSrnQ TCS specifically responds to Ni(II) requires the knowledge of the structural, dynamic, and functional features of SgSrnR. These were investigated synergistically in this work using X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, atomistic molecular dynamics calculations, isothermal titration calorimetry, and in silico molecular docking. The results reveal that the homodimeric apo-SgSrnR binds to its operator in a two-step process that involves the more rigid globular portion of the protein and leaves its largely disordered regions available to possibly interact with the disordered SgSrnQ in a Ni-dependent process.
Collapse
Affiliation(s)
- Luca Mazzei
- Laboratory of Bioinorganic Chemistry, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via Giuseppe Fanin 40, I-40127 Bologna. Italy
| | - Francesco Musiani
- Laboratory of Bioinorganic Chemistry, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via Giuseppe Fanin 40, I-40127 Bologna. Italy
| | - Szymon Żerko
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Wiktor Koźminski
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Żwirki i Wigury 101, 02-089, Warsaw, Poland
| | - Michele Cianci
- Department of Agricultural, Food and Environmental Sciences, Polytechnic University of Marche, Via Brecce Bianche, I-60131 Ancona, Italy
| | - Ylenia Beniamino
- Laboratory of Bioinorganic Chemistry, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via Giuseppe Fanin 40, I-40127 Bologna. Italy
| | - Stefano Ciurli
- Laboratory of Bioinorganic Chemistry, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via Giuseppe Fanin 40, I-40127 Bologna. Italy
| | - Barbara Zambelli
- Laboratory of Bioinorganic Chemistry, Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Via Giuseppe Fanin 40, I-40127 Bologna. Italy
| |
Collapse
|
8
|
Abstract
Quadruplex structures have been identified in a plethora of organisms where they play important functions in the regulation of molecular processes, and hence have been proposed as therapeutic targets for many diseases. In this paper we report the extensive bioinformatic analysis of the SARS-CoV-2 genome and related viruses using an upgraded version of the open-source algorithm G4-iM Grinder. This version improves the functionality of the software, including an easy way to determine the potential biological features affected by the candidates found. The quadruplex definitions of the algorithm were optimized for SARS-CoV-2. Using a lax quadruplex definition ruleset, which accepts amongst other parameters two residue G- and C-tracks, 512 potential quadruplex candidates were discovered. These sequences were evaluated by their in vitro formation probability, their position in the viral RNA, their uniqueness and their conservation rates (calculated in over seventeen thousand different COVID-19 clinical cases and sequenced at different times and locations during the ongoing pandemic). These results were then compared subsequently to other Coronaviridae members, other Group IV (+)ssRNA viruses and the entire viral realm. Sequences found in common with other viral species were further analyzed and characterized. Sequences with high scores unique to the SARS-CoV-2 were studied to investigate the variations amongst similar species. Quadruplex formation of the best candidates were then confirmed experimentally. Using NMR and CD spectroscopy, we found several highly stable RNA quadruplexes that may be suitable therapeutic targets for the SARS-CoV-2.
Collapse
|