1
Demir G, Alkan C. Characterizing microsatellite polymorphisms using assembly-based and mapping-based tools. Turk J Biol 2019; 43:264-273. [PMID: 31496881 PMCID: PMC6710001 DOI: 10.3906/biy-1903-16] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 01/09/2023]
Abstract
Microsatellite polymorphism has always been a challenge for genome assembly and sequence alignment due to sequencing errors, short read lengths, and high incidence of polymerase slippage in microsatellite regions. Although the information they carry is very valuable, microsatellite variations have not gained enough attention to become a routine step in genome sequence analysis pipelines. After the completion of the 1000 Genomes Project, which aimed to establish the most detailed genetic variation catalog for humans, the consortium released only two microsatellite prediction sets generated by two tools. Many other large research efforts have failed to shed light on microsatellite variations. We evaluated the performance of three different local assembly methods on three different experimental settings, focusing on genotype-based performance, coverage impact, and preprocessing including flanking regions. All these experiments supported our initial expectations on assembly. We also demonstrate that overlap-layout-consensus (OLC)-based assembly methods show higher sensitivity to microsatellite variant calling when compared to a de Bruijn graph-based approach. We conclude that assembly with OLC is the better method for genotyping microsatellites. Our pipeline is available at https://github.com/gulfemd/STRAssembly.
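The contrast between OLC and de Bruijn graph assembly can be illustrated with a toy example. The sketch below is not taken from the STRAssembly pipeline and does not reproduce the paper's experiments. It only shows one commonly cited intuition: when k is shorter than the repeat tract, two alleles with different repeat counts decompose into exactly the same set of k-mers, so the k-mer content alone does not encode the repeat count. The flank sequences, the `CAG` unit, the value of `k`, and the helper names `locus` and `kmers` are all hypothetical.

```python
def locus(n_units, unit="CAG", left="ACGTTGCA", right="TTGACCGT"):
    """Build a toy microsatellite locus: left flank + repeat tract + right flank.

    The flanks and the repeat unit are made-up illustration values.
    """
    return left + unit * n_units + right


def kmers(seq, k):
    """Return the set of distinct k-mers in seq (the raw material of a de Bruijn graph)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


# Two hypothetical alleles that differ only in repeat count.
short_allele = locus(10)
long_allele = locus(15)

k = 5  # deliberately much shorter than either repeat tract

# Identical k-mer sets: the repeat count is not recoverable from k-mer content alone.
same_kmers = kmers(short_allele, k) == kmers(long_allele, k)

# The full sequences still differ by five repeat units (15 bases).
length_diff = len(long_allele) - len(short_allele)

print(same_kmers, length_diff)
```

A sequence that spans the whole tract together with both flanks keeps the two alleles distinct, which is why the flanking regions mentioned in the abstract matter for genotyping.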
Affiliation(s)
- Gülfem Demir
- Department of Computer Engineering, Faculty of Engineering, Bilkent University, Bilkent, Ankara, Turkey
- Can Alkan
- Department of Computer Engineering, Faculty of Engineering, Bilkent University, Bilkent, Ankara, Turkey
2
Mukherjee S, Dohno C, Nakatani K. Design and Synthesis of Cyclic Mismatch-Binding Ligands (CMBLs) with Variable Linkers by Ring-Closing Metathesis and their Photophysical and DNA Repeat Binding Properties. Chemistry 2017; 23:11385-11396. [DOI: 10.1002/chem.201702064] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 05/08/2017] [Indexed: 02/06/2023]
Affiliation(s)
- Sanjukta Mukherjee
- Department of Regulatory Bioorganic Chemistry; The Institute of Scientific and Industrial Research; Osaka University; 8-1 Mihogaoka Ibaraki 567-0047 Japan
- National Centre for Biological Sciences (NCBS); Tata Institute of Fundamental Research (TIFR); Bellary Road Bangalore 560065 India
- Chikara Dohno
- Department of Regulatory Bioorganic Chemistry; The Institute of Scientific and Industrial Research; Osaka University; 8-1 Mihogaoka Ibaraki 567-0047 Japan
- Kazuhiko Nakatani
- Department of Regulatory Bioorganic Chemistry; The Institute of Scientific and Industrial Research; Osaka University; 8-1 Mihogaoka Ibaraki 567-0047 Japan
3
Mukherjee S, Dohno C, Asano K, Nakatani K. Cyclic mismatch binding ligand CMBL4 binds to the 5'-T-3'/5'-GG-3' site by inducing the flipping out of thymine base. Nucleic Acids Res 2016; 44:7090-9. [PMID: 27466390 PMCID: PMC5009760 DOI: 10.1093/nar/gkw672] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Received: 05/15/2016] [Revised: 07/15/2016] [Accepted: 07/19/2016] [Indexed: 12/13/2022]
Abstract
A newly designed cyclic bis-naphthyridine carbamate dimer, CMBL4, with limited conformational flexibility was synthesized and characterized. Absorption spectra revealed that the two naphthyridines in CMBL4 were stacked on each other in aqueous solutions. The most efficient binding of CMBL4 to DNA was observed for the sequence 5'-T-3'/5'-GG-3' (T/GG) with the formation of a 1:1 complex, which is one of the possible structural elements involved in the higher order structures of (TGG)n repeat DNA triggering the genome microdeletion. Surface plasmon resonance assay also showed the binding of CMBL4 to TGG repeat DNA. Potassium permanganate oxidation studies of the CMBL4-bound duplex containing the T/GG site showed that CMBL4 binding accelerated the oxidation of thymine at that site, which suggests the flipping out of the thymine base from a π-stack. Preferential binding was observed for CMBL4 compared with its acyclic variants, which suggests the marked significance of the macrocyclic structure for the recognition of the T/GG site.
Affiliation(s)
- Sanjukta Mukherjee
- Department of Regulatory Bioorganic Chemistry, The Institute of Scientific and Industrial Research (ISIR), Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
- Chikara Dohno
- Department of Regulatory Bioorganic Chemistry, The Institute of Scientific and Industrial Research (ISIR), Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
- Kaori Asano
- Comprehensive Analysis Center, The Institute of Scientific and Industrial Research (ISIR), Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
- Kazuhiko Nakatani
- Department of Regulatory Bioorganic Chemistry, The Institute of Scientific and Industrial Research (ISIR), Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka 567-0047, Japan
4
Khan N, Kolimi N, Rathinavelan T. Twisting right to left: A…A mismatch in a CAG trinucleotide repeat overexpansion provokes left-handed Z-DNA conformation. PLoS Comput Biol 2015; 11:e1004162. [PMID: 25876062 PMCID: PMC4395422 DOI: 10.1371/journal.pcbi.1004162] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Received: 09/25/2014] [Accepted: 01/28/2015] [Indexed: 12/02/2022]
Abstract
Conformational polymorphism of DNA is a major causative factor behind several incurable trinucleotide repeat expansion disorders that arise from overexpansion of trinucleotide repeats located in coding/non-coding regions of specific genes. Hairpin DNA structures that are formed due to overexpansion of CAG repeats lead to Huntington’s disorder and spinocerebellar ataxias. Nonetheless, the DNA hairpin stem structure, which generally embraces the B-form with canonical base pairs, is poorly understood in the context of the periodic noncanonical A…A mismatch found in CAG repeat overexpansion. Molecular dynamics simulations on DNA hairpin stems containing A…A mismatches in a CAG repeat overexpansion show that A…A dictates local Z-form irrespective of starting glycosyl conformation, in sharp contrast to canonical DNA duplex. Transition from B-to-Z is due to the mechanistic effect that originates from its pronounced nonisostericity with flanking canonical base pairs facilitated by base extrusion, backbone and/or base flipping. Based on these structural insights we envisage that such an unusual DNA structure of the CAG hairpin stem may have a role in disease pathogenesis. As this is the first study that delineates the influence of a single A…A mismatch in reversing DNA helicity, it would further have an impact on understanding DNA mismatch repair.

When a set of 3 nucleotides in a DNA sequence repeats beyond a certain number, it leads to incurable neurological or neuromuscular disorders. Such DNA sequences tend to form unusual DNA structures comprising base pairing schemes different from the canonical A…T/G…C base pairs. The influence of such unusual base pairing on the overall 3-dimensional structure of DNA and its impact on the pathogenesis of the disorder are not well understood. CAG repeat overexpansion, which leads to Huntington’s disorder and several spinocerebellar ataxias, forms a noncanonical A…A base pair in between canonical C…G and G…C base pairs. However, no detailed structural information is available on the influence of an A…A mismatch on a DNA structure under any sequence context. Here, we have shown for the first time that A…A base pairing in a CAG repeat provokes the formation of left-handed Z-DNA due to the pronounced structural dissimilarity of the A…A base pair with the G…C base pair, leading to a periodic B-Z junction. Thus, these results suggest that formation of a periodic B-Z junction may be one of the molecular bases for CAG repeat instability.
Affiliation(s)
- Noorain Khan
- Department of Biotechnology, Indian Institute of Technology Hyderabad, Kandi, Telangana State, India
- Narendar Kolimi
- Department of Biotechnology, Indian Institute of Technology Hyderabad, Kandi, Telangana State, India
5
Sun S, Zhang W, Chen X, Song H. The CAA repeat polymorphism in the ZFHX3 gene is associated with risk of coronary heart disease in a Chinese population. Tohoku J Exp Med 2015; 235:261-6. [PMID: 25797214 DOI: 10.1620/tjem.235.261] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 11/18/2022]
Abstract
Coronary heart disease (CHD) is a disease resulting from the interaction between genetic variations and environmental factors. Zinc finger homeobox 3 (ZFHX3) is a transcription factor and contains a poly-glutamine tract in a compositionally biased region that is encoded by exon 9, containing a cluster of CAG and CAA triplets followed by the polymorphic CAA repeats: (CAG)2(CAA)2(CAG)3CAACAG(CAA)nGCA. Thus, nine successive glutamine residues precede the poly-glutamine tract, encoded by the polymorphic CAA repeats. The aim of this study was to investigate the association of the CAA repeat polymorphism in exon 9 of the ZFHX3 gene with the risk of CHD in a Chinese population. The CAA repeat polymorphism was determined by polymerase chain reaction followed by DNA sequencing in 321 CHD patients. Genotype frequencies were compared using the non-parametric mood median test. Four alleles of CAG(CAA)10GCA, CAG(CAA)8GCA, CAG(CAA)9GCA, and CAG(CAA)11GCA were found in Chinese CHD patients in exon 9 of the ZFHX3 gene. The CAG(CAA)10GCA was a major allele (95.95%), and the CAG(CAA)8GCA was a minor allele (3.58%). The CAG(CAA)9GCA and CAG(CAA)11GCA were rare alleles (0.31% and 0.16%). The CAG(CAA)10GCA allele encodes a poly-glutamine tract of 19 residues. Importantly, the CHD patients homozygous for the CAG(CAA)10GCA allele had a higher risk of CHD, compared to the heterozygous patients carrying a CAG(CAA)8GCA allele. Moreover, the CAG(CAA)10GCA allele was significantly associated with hypertension, diabetes mellitus, or dyslipidemia (P < 0.05). Thus, the CAA repeat polymorphism in exon 9 of the ZFHX3 gene contributes to the CHD susceptibility in the Chinese population.
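The residue counts quoted in this abstract can be checked with a few lines of arithmetic. The sketch below is only an illustration, not part of the study's methods. It expands the motif (CAG)2(CAA)2(CAG)3CAACAG(CAA)nGCA for a given n and counts the leading run of glutamine codons. The helper names `build_allele` and `polyq_length` are hypothetical, and the codon table is deliberately limited to the three codons that occur in the motif (CAG and CAA encode glutamine, GCA encodes alanine in the standard genetic code).

```python
# Minimal codon table: only the codons that appear in the motif.
CODON_TO_AA = {"CAG": "Q", "CAA": "Q", "GCA": "A"}


def build_allele(n_caa):
    """Expand (CAG)2(CAA)2(CAG)3 CAA CAG (CAA)n GCA into a nucleotide string."""
    return "CAG" * 2 + "CAA" * 2 + "CAG" * 3 + "CAA" + "CAG" + "CAA" * n_caa + "GCA"


def polyq_length(seq):
    """Count consecutive glutamine codons from the start of an in-frame sequence."""
    count = 0
    for i in range(0, len(seq) - 2, 3):
        if CODON_TO_AA.get(seq[i:i + 3]) != "Q":
            break  # stop at the first non-glutamine codon (GCA here)
        count += 1
    return count


# Glutamines encoded before the polymorphic (CAA)n block: 2 + 2 + 3 + 1 + 1.
preceding = polyq_length(build_allele(0))

# Tract length for each allele reported in the abstract (n = 8, 9, 10, 11).
tract_lengths = {n: polyq_length(build_allele(n)) for n in (8, 9, 10, 11)}

print(preceding, tract_lengths)
```

With n = 10 the count is 9 + 10 = 19, matching the tract length given for the major allele. The other three lengths are derived from the same motif and are not stated in the abstract.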
Affiliation(s)
- Shunchang Sun
- Central Laboratory, Shenzhen Baoan Hospital, Southern Medical University
6
Abstract
Short tandem repeats (STRs), also known as microsatellites, have a wide range of applications, including medical genetics, forensics, and population genetics. High-throughput sequencing has the potential to profile large numbers of STRs, but cumbersome gapped alignment and STR-specific noise patterns hamper this task. We recently developed an algorithm, called lobSTR, to overcome these challenges and to accurately profile STRs from short reads. Here we describe how to use lobSTR to call STR variations from high-throughput sequencing datasets and to diagnose the quality of the calls.
Affiliation(s)
- Melissa Gymrek
- Harvard-MIT Division of Health Sciences and Technology, MIT, Cambridge, MA, USA
7
Krzysztoń-Russjan J, Zielonka D, Jackiewicz J, Kuśmirek S, Bubko I, Klimberg A, Marcinkowski JT, Anuszewska EL. A study of molecular changes relating to energy metabolism and cellular stress in people with Huntington's disease: looking for biomarkers. J Bioenerg Biomembr 2012; 45:71-85. [PMID: 23070563 DOI: 10.1007/s10863-012-9479-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Received: 06/26/2012] [Accepted: 10/04/2012] [Indexed: 12/18/2022]
Abstract
Huntington's disease (HD) is a neurodegenerative disorder characterized by a progressive motor and cognitive decline and the development of psychiatric symptoms. The origin of molecular and biochemical disturbances in HD is a mutation in the HTT gene, which is inherited in an autosomal dominant manner. The altered huntingtin protein is ubiquitously expressed in the CNS, as well as in peripheral tissues. In this study we measured metabolism-related changes in gene transcription in the blood of HD gene carriers (premanifest and manifest combined) versus 28 healthy controls. The comparison revealed statistically significant Global Pattern Recognition Fold Change (FC) for 6 mRNA transcripts, reflecting an increase of: MAOB (FC = 3.07; p = 0.0005), which encodes an outer mitochondrial membrane-bound enzyme called monoamine oxidase type B; TGM2 (FC = 1.8; p = 0.02), encoding a transglutaminase 2 that mediates cellular stress; SLC2A4 (FC = 1.64; p = 0.02), solute carrier family 2 (facilitated glucose transporter) member 4; and branched chain ketoacid dehydrogenase kinase (BCKDK) (FC = 1.34; p = 0.02); and a decrease of: LDHA (FC = -1.16; p = 0.03), lactate dehydrogenase A; and brain-derived neurotrophic factor (BDNF) (FC = -2.11; p = 0.03). These distinct changes coincided with HD progression. The analyses of gene transcription levels in sub-cohorts confirmed these changes and also revealed 28 statistically significant FCs of gene transcripts involved in ATP production and BCAA metabolism.
Affiliation(s)
- Jolanta Krzysztoń-Russjan
- Department of Biochemistry and Biopharmaceuticals, National Medicines Institute, Chelmska 30/34 Str., 00-725 Warsaw, Poland.
8
Begum NA, Honjo T. Evolutionary comparison of the mechanism of DNA cleavage with respect to immune diversity and genomic instability. Biochemistry 2012; 51:5243-56. [PMID: 22712724 DOI: 10.1021/bi3005895] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Indexed: 11/30/2022]
Abstract
It is generally assumed that the genetic mechanism for immune diversity is unique and distinct from that for general genome diversity, in part because of the high efficiency and strict regulation of immune diversity. This expectation was partially met by the discovery of RAG1 and -2, which catalyze V(D)J recombination to generate the immune repertoire of B and T lymphocyte receptors. RAG1 and -2 were later shown to be derived from a transposon. On the other hand, activation-induced cytidine deaminase (AID), which mediates both somatic hypermutation (SHM) and the class-switch recombination (CSR) of the immunoglobulin genes, evolved earlier than RAG1 and -2 in jawless vertebrates. This review compares immune diversity and general genome diversity from an evolutionary perspective, shedding light on the roles of DNA-cleaving enzymes and target recognition markers. This comparison revealed that AID-mediated SHM and CSR share the cleaving enzyme topoisomerase 1 with transcription-associated mutation (TAM) and triplet contraction, which is involved in many genetic diseases. These genome-altering events appear to target DNA with non-B structure, which is induced by the inefficient correction of the excessive supercoiling that is caused by active transcription. Furthermore, an epigenetic modification on chromatin (histone H3K4 trimethylation) is used as a mark for DNA cleavage sites in meiotic recombination, V(D)J recombination, CSR, and SHM. We conclude that acquired immune diversity evolved via the appearance of an AID orthologue that utilized a preexisting mechanism for genomic instability, such as TAM.
Affiliation(s)
- Nasim A Begum
- Department of Immunology and Genomic Medicine, Graduate School of Medicine, Kyoto University, Kyoto 606-8501, Japan