1
|
Ramos RM, Petroli RJ, D'Alessandre NDR, Guardia GDA, Afonso ACDF, Nishi MY, Domenice S, Galante PAF, Mendonca BB, Batista RL. Small Indels in the Androgen Receptor Gene: Phenotype Implications and Mechanisms of Mutagenesis. J Clin Endocrinol Metab 2023; 109:68-79. [PMID: 37572362 DOI: 10.1210/clinem/dgad470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 08/02/2023] [Accepted: 08/07/2023] [Indexed: 08/14/2023]
Abstract
CONTEXT Despite high abundance of small indels in human genomes, their precise roles and underlying mechanisms of mutagenesis in Mendelian disorders require further investigation. OBJECTIVE To profile the distribution, functional implications, and mechanisms of small indels in the androgen receptor (AR) gene in individuals with androgen insensitivity syndrome (AIS). METHODS We conducted a systematic review of previously reported indels within the coding region of the AR gene, including 3 novel indels. Distribution throughout the AR coding region was examined and compared with genomic population data. Additionally, we assessed their impact on the AIS phenotype and investigated potential mechanisms driving their occurrence. RESULTS A total of 82 indels in AIS were included. Notably, all frameshift indels exhibited complete AIS. The distribution of indels across the AR gene showed a predominance in the N-terminal domain, most leading to frameshift mutations. Small deletions accounted for 59.7%. Most indels occurred in nonrepetitive sequences, with 15.8% situated within triplet regions. Gene burden analysis demonstrated significant enrichment of frameshift indels in AIS compared with controls (P < .00001), and deletions were overrepresented in AIS (P < .00001). CONCLUSION Our findings underscore a robust genotype-phenotype relationship regarding small indels in the AR gene in AIS, with a vast majority presenting complete AIS. Triplet regions and homopolymeric runs emerged as prone loci for small indels within the AR. Most were frameshift indels, with polymerase slippage potentially explaining half of AR indel occurrences. Complex frameshift indels exhibited association with palindromic runs. These discoveries advance understanding of the genetic basis of AIS and shed light on potential mechanisms underlying pathogenic small indel events.
Collapse
Affiliation(s)
- Raquel Martinez Ramos
- Developmental Endocrinology Unit, Hormone and Molecular Genetics Laboratory (LIM/42), Endocrinology Division, Internal Medicine Department, Medical School, University of São Paulo (USP), São Paulo, SP, 05403-000, Brazil
| | - Reginaldo José Petroli
- Faculdade de Medicina da Universidade Federal de Alagoas (UFAL), Programa de Pós-Graduação em Ciências Médicas-UFAL, Maceió, AL, 57072-900, Brazil
| | | | | | - Ana Caroline de Freitas Afonso
- Developmental Endocrinology Unit, Hormone and Molecular Genetics Laboratory (LIM/42), Endocrinology Division, Internal Medicine Department, Medical School, University of São Paulo (USP), São Paulo, SP, 05403-000, Brazil
| | - Mirian Yumie Nishi
- Developmental Endocrinology Unit, Hormone and Molecular Genetics Laboratory (LIM/42), Endocrinology Division, Internal Medicine Department, Medical School, University of São Paulo (USP), São Paulo, SP, 05403-000, Brazil
| | - Sorahia Domenice
- Developmental Endocrinology Unit, Hormone and Molecular Genetics Laboratory (LIM/42), Endocrinology Division, Internal Medicine Department, Medical School, University of São Paulo (USP), São Paulo, SP, 05403-000, Brazil
| | | | - Berenice Bilharinho Mendonca
- Developmental Endocrinology Unit, Hormone and Molecular Genetics Laboratory (LIM/42), Endocrinology Division, Internal Medicine Department, Medical School, University of São Paulo (USP), São Paulo, SP, 05403-000, Brazil
| | - Rafael Loch Batista
- Developmental Endocrinology Unit, Hormone and Molecular Genetics Laboratory (LIM/42), Endocrinology Division, Internal Medicine Department, Medical School, University of São Paulo (USP), São Paulo, SP, 05403-000, Brazil
- Instituto do Câncer do Estado de São Paulo da Faculdade, de Medicina da Universidade de São Paulo (ICESP), São Paulo, SP, 01246-000, Brazil
| |
Collapse
|
2
|
Serebriiskii IG, Pavlov VA, Andrianov GV, Litwin S, Basickes S, Newberg JY, Frampton GM, Meyer JE, Golemis EA. Source, co-occurrence, and prognostic value of PTEN mutations or loss in colorectal cancer. NPJ Genom Med 2023; 8:40. [PMID: 38001126 PMCID: PMC10674024 DOI: 10.1038/s41525-023-00384-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 11/07/2023] [Indexed: 11/26/2023] Open
Abstract
Somatic PTEN mutations are common and have driver function in some cancer types. However, in colorectal cancers (CRCs), somatic PTEN-inactivating mutations occur at a low frequency (~8-9%), and whether these mutations are actively selected and promote tumor aggressiveness has been controversial. Analysis of genomic data from ~53,000 CRCs indicates that hotspot mutation patterns in PTEN partially reflect DNA-dependent selection pressures, but also suggests a strong selection pressure based on protein function. In microsatellite stable (MSS) tumors, PTEN alterations co-occur with mutations activating BRAF or PI3K, or with TP53 deletions, but not in CRC with microsatellite instability (MSI). Unexpectedly, PTEN deletions are associated with poor survival in MSS CRC, whereas PTEN mutations are associated with improved survival in MSI CRC. These and other data suggest use of PTEN as a prognostic marker is valid in CRC, but such use must consider driver mutation landscape, tumor subtype, and category of PTEN alteration.
Collapse
Affiliation(s)
- Ilya G Serebriiskii
- Program in Cell Signaling and Microenvironment, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA.
- Kazan Federal University, 420000, Kazan, Russian Federation.
| | - Valerii A Pavlov
- Program in Cell Signaling and Microenvironment, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA
- Moscow Institute of Physics and Technology, 141701, Dolgoprudny, Moscow Region, Russian Federation
| | - Grigorii V Andrianov
- Program in Cell Signaling and Microenvironment, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA
| | - Samuel Litwin
- Program in Cell Signaling and Microenvironment, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA
- Biostatistics and Bioinformatics Facility, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA
| | - Stanley Basickes
- Greenfield Manufacturing, 9800 Bustleton Ave, Philadelphia, PA, 19115, USA
| | - Justin Y Newberg
- Foundation Medicine, Inc., 150 Second St., Cambridge, MA, 02141, USA
| | | | - Joshua E Meyer
- Program in Cell Signaling and Microenvironment, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA
- Department of Radiation Oncology, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA
| | - Erica A Golemis
- Program in Cell Signaling and Microenvironment, Fox Chase Cancer Center, Philadelphia, PA, 19111, USA.
- Department of Cancer and Cellular Biology, Lewis Katz School of Medicine at Temple University, Philadelphia, PA, 19140, USA.
| |
Collapse
|
3
|
Orlov YL, Orlova NG. Bioinformatics tools for the sequence complexity estimates. Biophys Rev 2023; 15:1367-1378. [PMID: 37974990 PMCID: PMC10643780 DOI: 10.1007/s12551-023-01140-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 09/01/2023] [Indexed: 11/19/2023] Open
Abstract
We review current methods and bioinformatics tools for the text complexity estimates (information and entropy measures). The search DNA regions with extreme statistical characteristics such as low complexity regions are important for biophysical models of chromosome function and gene transcription regulation in genome scale. We discuss the complexity profiling for segmentation and delineation of genome sequences, search for genome repeats and transposable elements, and applications to next-generation sequencing reads. We review the complexity methods and new applications fields: analysis of mutation hotspots loci, analysis of short sequencing reads with quality control, and alignment-free genome comparisons. The algorithms implementing various numerical measures of text complexity estimates including combinatorial and linguistic measures have been developed before genome sequencing era. The series of tools to estimate sequence complexity use compression approaches, mainly by modification of Lempel-Ziv compression. Most of the tools are available online providing large-scale service for whole genome analysis. Novel machine learning applications for classification of complete genome sequences also include sequence compression and complexity algorithms. We present comparison of the complexity methods on the different sequence sets, the applications for gene transcription regulatory regions analysis. Furthermore, we discuss approaches and application of sequence complexity for proteins. The complexity measures for amino acid sequences could be calculated by the same entropy and compression-based algorithms. But the functional and evolutionary roles of low complexity regions in protein have specific features differing from DNA. The tools for protein sequence complexity aimed for protein structural constraints. It was shown that low complexity regions in protein sequences are conservative in evolution and have important biological and structural functions. Finally, we summarize recent findings in large scale genome complexity comparison and applications for coronavirus genome analysis.
Collapse
Affiliation(s)
- Yuriy L. Orlov
- The Digital Health Institute, I.M. Sechenov First Moscow State Medical University of the Russian Ministry of Health (Sechenov University), Moscow, 119991 Russia
- Institute of Cytology and Genetics SB RAS, 630090 Novosibirsk, Russia
- Agrarian and Technological Institute, Peoples’ Friendship University of Russia, 117198 Moscow, Russia
| | - Nina G. Orlova
- Department of Mathematics, Financial University under the Government of the Russian Federation, Moscow, 125167 Russia
| |
Collapse
|
4
|
Fang Y, Liu Y, Xu H, Zhu B. Performance evaluation of an in-house panel containing 59 autosomal InDels for forensic identification in Chinese Hui and Mongolian groups. Genomics 2023; 115:110552. [PMID: 36565793 DOI: 10.1016/j.ygeno.2022.110552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 10/31/2022] [Accepted: 12/20/2022] [Indexed: 12/24/2022]
Abstract
In recent years, a novel multiplex system containing two mini-short tandem repeats, 59 autosomal InDels, two Y-chromosomal InDels, and the Amelogenin gene with all amplicons less than 200 bp has been constructed and validated by ourselves for forensic degration sample, and its forensic application efficiency has been studied in Chinese some populations. Herein, the population genetic polymorphisms of these loci were investigated in Chinese Hui (n = 249) and Mongolian (n = 222) ethnic groups using direct multiplex amplification and capillary electrophoresis platform. The forensic identification efficiencies of this self-developed system were further evaluated in these two groups. And the results showed that the values of the combined power of discrimination were 0.9999999999999999999999999999006 (Hui) and 0.999999999999999999999999999738 (Mongolian), respectively. Moreover, the combined power of exclusion values were 0.99999817 (Hui) and 0.99999779 (Mongolian). The 59 autosomal InDels used in this study exhibited high forensic identification efficiencies in 10 East Asian populations, which was also expected to be a new powerful tool for identifying degraded biological materials in East Asian populations.
Collapse
Affiliation(s)
- Yating Fang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China; School of Basic Medical Sciences, Anhui Medical University, Anhui 230031, China
| | - Yanfang Liu
- Laboratory of Fundamental Nursing Research, School of Nursing, Guangdong Medical University, Dongguan, China
| | - Hui Xu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China; College of Forensic Medicine, Xi'an Jiaotong University Health Science Center, Xi'an 710061, China.
| |
Collapse
|
5
|
Jilani M, Turcan A, Haspel N, Jagodzinski F. Elucidating the Structural Impacts of Protein InDels. Biomolecules 2022; 12:1435. [PMID: 36291643 PMCID: PMC9599607 DOI: 10.3390/biom12101435] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 09/23/2022] [Accepted: 09/27/2022] [Indexed: 09/17/2023] Open
Abstract
The effects of amino acid insertions and deletions (InDels) remain a rather under-explored area of structural biology. These variations oftentimes are the cause of numerous disease phenotypes. In spite of this, research to study InDels and their structural significance remains limited, primarily due to a lack of experimental information and computational methods. In this work, we fill this gap by modeling InDels computationally; we investigate the rigidity differences between the wildtype and a mutant variant with one or more InDels. Further, we compare how structural effects due to InDels differ from the effects of amino acid substitutions, which are another type of amino acid mutation. We finish by performing a correlation analysis between our rigidity-based metrics and wet lab data for their ability to infer the effects of InDels on protein fitness.
Collapse
Affiliation(s)
- Muneeba Jilani
- Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125, USA
| | - Alistair Turcan
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA
| | - Nurit Haspel
- Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125, USA
| | - Filip Jagodzinski
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA
| |
Collapse
|
6
|
Chen J, Guo JT. Structural and functional analysis of somatic coding and UTR indels in breast and lung cancer genomes. Sci Rep 2021; 11:21178. [PMID: 34707120 PMCID: PMC8551294 DOI: 10.1038/s41598-021-00583-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 10/14/2021] [Indexed: 11/24/2022] Open
Abstract
Insertions and deletions (Indels) represent one of the major variation types in the human genome and have been implicated in diseases including cancer. To study the features of somatic indels in different cancer genomes, we investigated the indels from two large samples of cancer types: invasive breast carcinoma (BRCA) and lung adenocarcinoma (LUAD). Besides mapping somatic indels in both coding and untranslated regions (UTRs) from the cancer whole exome sequences, we investigated the overlap between these indels and transcription factor binding sites (TFBSs), the key elements for regulation of gene expression that have been found in both coding and non-coding sequences. Compared to the germline indels in healthy genomes, somatic indels contain more coding indels with higher than expected frame-shift (FS) indels in cancer genomes. LUAD has a higher ratio of deletions and higher coding and FS indel rates than BRCA. More importantly, these somatic indels in cancer genomes tend to locate in sequences with important functions, which can affect the core secondary structures of proteins and have a bigger overlap with predicted TFBSs in coding regions than the germline indels. The somatic CDS indels are also enriched in highly conserved nucleotides when compared with germline CDS indels.
Collapse
Affiliation(s)
- Jing Chen
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Jun-Tao Guo
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.
| |
Collapse
|
7
|
Bennett EP, Petersen BL, Johansen IE, Niu Y, Yang Z, Chamberlain CA, Met Ö, Wandall HH, Frödin M. INDEL detection, the 'Achilles heel' of precise genome editing: a survey of methods for accurate profiling of gene editing induced indels. Nucleic Acids Res 2020; 48:11958-11981. [PMID: 33170255 PMCID: PMC7708060 DOI: 10.1093/nar/gkaa975] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 10/05/2020] [Accepted: 10/15/2020] [Indexed: 12/11/2022] Open
Abstract
Advances in genome editing technologies have enabled manipulation of genomes at the single base level. These technologies are based on programmable nucleases (PNs) that include meganucleases, zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/CRISPR-associated 9 (Cas9) nucleases and have given researchers the ability to delete, insert or replace genomic DNA in cells, tissues and whole organisms. The great flexibility in re-designing the genomic target specificity of PNs has vastly expanded the scope of gene editing applications in life science, and shows great promise for development of the next generation gene therapies. PN technologies share the principle of inducing a DNA double-strand break (DSB) at a user-specified site in the genome, followed by cellular repair of the induced DSB. PN-elicited DSBs are mainly repaired by the non-homologous end joining (NHEJ) and the microhomology-mediated end joining (MMEJ) pathways, which can elicit a variety of small insertion or deletion (indel) mutations. If indels are elicited in a protein coding sequence and shift the reading frame, targeted gene knock out (KO) can readily be achieved using either of the available PNs. Despite the ease by which gene inactivation in principle can be achieved, in practice, successful KO is not only determined by the efficiency of NHEJ and MMEJ repair; it also depends on the design and properties of the PN utilized, delivery format chosen, the preferred indel repair outcomes at the targeted site, the chromatin state of the target site and the relative activities of the repair pathways in the edited cells. These variables preclude accurate prediction of the nature and frequency of PN induced indels. A key step of any gene KO experiment therefore becomes the detection, characterization and quantification of the indel(s) induced at the targeted genomic site in cells, tissues or whole organisms. In this survey, we briefly review naturally occurring indels and their detection. Next, we review the methods that have been developed for detection of PN-induced indels. We briefly outline the experimental steps and describe the pros and cons of the various methods to help users decide a suitable method for their editing application. We highlight recent advances that enable accurate and sensitive quantification of indel events in cells regardless of their genome complexity, turning a complex pool of different indel events into informative indel profiles. Finally, we review what has been learned about PN-elicited indel formation through the use of the new methods and how this insight is helping to further advance the genome editing field.
Collapse
Affiliation(s)
- Eric Paul Bennett
- Copenhagen Center for Glycomics, Department of Odontology and Molecular and Cellular Medicine, Faculty of Health Sciences, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | - Bent Larsen Petersen
- Department of Plant and Environmental Sciences, University of Copenhagen, DK-1871 Frederiksberg C, Denmark
| | - Ida Elisabeth Johansen
- Department of Plant and Environmental Sciences, University of Copenhagen, DK-1871 Frederiksberg C, Denmark
| | - Yiyuan Niu
- Biotech Research and Innovation Centre (BRIC), Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
- College of Animal Science and Technology, Northwest A&F University, Yangling Shaanxi, China
| | - Zhang Yang
- Copenhagen Center for Glycomics, Department of Odontology and Molecular and Cellular Medicine, Faculty of Health Sciences, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | | | - Özcan Met
- Center for Cancer Immune Therapy, Department of Oncology, Copenhagen University Hospital, Herlev, Denmark
- Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Hans H Wandall
- Copenhagen Center for Glycomics, Department of Odontology and Molecular and Cellular Medicine, Faculty of Health Sciences, University of Copenhagen, DK-2200 Copenhagen N, Denmark
| | - Morten Frödin
- Biotech Research and Innovation Centre (BRIC), Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
8
|
Zhao M, Qu H. circVAR database: genome-wide archive of genetic variants for human circular RNAs. BMC Genomics 2020; 21:750. [PMID: 33121433 PMCID: PMC7599103 DOI: 10.1186/s12864-020-07172-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Accepted: 10/21/2020] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND Circular RNAs (circRNAs) play important roles in regulating gene expression through binding miRNAs and RNA binding proteins. Genetic variation of circRNAs may affect complex traits/diseases by changing their binding efficiency to target miRNAs and proteins. There is a growing demand for investigations of the functions of genetic changes using large-scale experimental evidence. However, there is no online genetic resource for circRNA genes. RESULTS We performed extensive genetic annotation of 295,526 circRNAs integrated from circBase, circNet and circRNAdb. All pre-computed genetic variants were presented at our online resource, circVAR, with data browsing and search functionality. We explored the chromosome-based distribution of circRNAs and their associated variants. We found that, based on mapping to the 1000 Genomes and ClinVAR databases, chromosome 17 has a relatively large number of circRNAs and associated common and health-related genetic variants. Following the annotation of genome wide association studies (GWAS)-based circRNA variants, we found many non-coding variants within circRNAs, suggesting novel mechanisms for common diseases reported from GWAS studies. For cancer-based somatic variants, we found that chromosome 7 has many highly complex mutations that have been overlooked in previous research. CONCLUSION We used the circVAR database to collect SNPs and small insertions and deletions (INDELs) in putative circRNA regions and to identify their potential phenotypic information. To provide a reusable resource for the circRNA research community, we have published all the pre-computed genetic data concerning circRNAs and associated genes together with data query and browsing functions at http://soft.bioinfo-minzhao.org/circvar .
Collapse
Affiliation(s)
- Min Zhao
- School of Science and Engineering, University of the Sunshine Coast, Maroochydore DC, Queensland, 4558, Australia
| | - Hong Qu
- Center for Bioinformatics, State Key Laboratory of Protein and Plant Gene Research, College of Life Sciences, Peking University, Beijing, 100871, P. R. China.
| |
Collapse
|
9
|
Ding Y, Xue H, Ding X, Zhao Y, Zhao Z, Wang D, Wu J. On the complexity measures of mutation hotspots in human TP53 protein. CHAOS (WOODBURY, N.Y.) 2020; 30:073118. [PMID: 32752620 DOI: 10.1063/1.5143584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2019] [Accepted: 06/15/2020] [Indexed: 06/11/2023]
Abstract
The role of sequence complexity in 23 051 somatic missense mutations including 73 well-known mutation hotspots across 22 major cancers was studied in human TP53 proteins. A role for sequence complexity in TP53 protein mutations is suggested since (i) the mutation rate significantly increases in low amino acid pair bias complexity; (ii) probability distribution complexity increases following single point substitution mutations and strikingly increases after mutation at the mutation hotspots including six detectable hotspot mutations (R175, G245, R248, R249, R273, and R282); and (iii) the degree of increase in distribution complexity is significantly correlated with the frequency of missense mutations (r = -0.5758, P < 0.0001) across 20 major types of solid tumors. These results are consistent with the hypothesis that amino acid pair bias and distribution probability may be used as novel measures for protein sequence complexity, and the degree of complexity is related to its susceptibility to mutation, as such, it may be used as a predictor for modeling protein mutations in human cancers.
Collapse
Affiliation(s)
- Yan Ding
- Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Hongsheng Xue
- Institute for Translational Medicine, The Affiliated Zhongshan Hospital of Dalian University, Dalian 116001, China
| | - Xinjia Ding
- Department of Urology, The Second Affiliated Hospital of Dalian Medical University, Dalian 116023, China
| | - Yuqing Zhao
- Department of Urology, The Second Affiliated Hospital of Dalian Medical University, Dalian 116023, China
| | - Zhilong Zhao
- Institute for Translational Medicine, The Affiliated Zhongshan Hospital of Dalian University, Dalian 116001, China
| | - Dazhi Wang
- Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Jianlin Wu
- Institute for Translational Medicine, The Affiliated Zhongshan Hospital of Dalian University, Dalian 116001, China
| |
Collapse
|
10
|
Decoding whole-genome mutational signatures in 37 human pan-cancers by denoising sparse autoencoder neural network. Oncogene 2020; 39:5031-5041. [PMID: 32528130 PMCID: PMC7334101 DOI: 10.1038/s41388-020-1343-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Revised: 05/19/2020] [Accepted: 05/29/2020] [Indexed: 12/28/2022]
Abstract
Millions of somatic mutations have recently been discovered in cancer genomes. These mutations in cancer genomes occur due to internal and external mutagenesis forces. Decoding the mutational processes by examining their unique patterns has successfully revealed many known and novel signatures from whole exome data, but many still remain undiscovered. Here, we developed a deep learning approach, DeepMS, to decompose mutational signatures using 52,671,908 somatic mutations from 2780 highly curated cancer genomes with whole genome sequencing (WGS) in 37 cancer types/subtypes. With rigorous model training and comparison, we characterized 54 signatures for single base substitutions (SBSs), 11 for doublet base substitutions (DBSs) and 16 for small insertions and deletions (Indels). Compared to the previous methods, DeepMS could discover 37 SBS, 5 DBS and 9 Indel new signatures, many of which represent associations with DNA mismatch or base excision repair and cisplatin therapy mechanisms. We further developed a regression-based model to estimate the correlation between signatures and clinical and demographical phenotypes. The first deep learning model DeepMS on WGS somatic mutational profiles enable us identify more comprehensive context-based mutational signatures than traditional NMF approaches. Our work substantially expands the landscape of the naturally occurring mutational signatures in cancer genomes, and provides new insights into cancer biology.
Collapse
|
11
|
Trimidal SG, Benjamin R, Bae JE, Han MV, Kong E, Singer A, Williams TS, Yang B, Schiller MR. Can Designer Indels Be Tailored by Gene Editing?: Can Indels Be Customized? Bioessays 2019; 41:e1900126. [PMID: 31693213 DOI: 10.1002/bies.201900126] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Revised: 10/01/2019] [Indexed: 12/23/2022]
Abstract
Genome editing with engineered nucleases (GEENs) introduce site-specific DNA double-strand breaks (DSBs) and repairs DSBs via nonhomologous end-joining (NHEJ) pathways that eventually create indels (insertions/deletions) in a genome. Whether the features of indels resulting from gene editing could be customized is asked. A review of the literature reveals how gene editing technologies via NHEJ pathways impact gene editing. The survey consolidates a body of literature that suggests that the type (insertion, deletion, and complex) and the approximate length of indel edits can be somewhat customized with different GEENs and by manipulating the expression of key NHEJ genes. Structural data suggest that binding of GEENs to DNA may interfere with binding of key components of DNA repair complexes, favoring either classical- or alternative-NHEJ. The hypotheses have some limitations, but if validated, will enable scientists to better control indel makeup, holding promise for basic science and clinical applications of gene editing. Also see the video abstract here https://youtu.be/vTkJtUsLi3w.
Collapse
Affiliation(s)
- Sara G Trimidal
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Ronald Benjamin
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Ji Eun Bae
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Mira V Han
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Elizabeth Kong
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Aaron Singer
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Tyler S Williams
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| | - Bing Yang
- Donald Danforth Plant Science Center, St. Louis, MO, 63132, USA
| | - Martin R Schiller
- School of Life Sciences, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA.,Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, Las Vegas, NV, 89154, USA
| |
Collapse
|
12
|
Růžička M, Souček P, Kulhánek P, Radová L, Fajkusová L, Réblová K. Bending of DNA duplexes with mutation motifs. DNA Res 2019; 26:341-352. [PMID: 31230075 PMCID: PMC6704406 DOI: 10.1093/dnares/dsz013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 05/27/2019] [Indexed: 01/30/2023] Open
Abstract
Mutations can be induced by environmental factors but also arise spontaneously during DNA replication or due to deamination of methylated cytosines at CpG dinucleotides. Sites where mutations occur with higher frequency than would be expected by chance are termed hotspots while sites that contain mutations rarely are termed coldspots. Mutations are permanently scanned and repaired by repair systems. Among them, the mismatch repair targets base pair mismatches, which are discriminated from canonical base pairs by probing altered elasticity of DNA. Using biased molecular dynamics simulations, we investigated the elasticity of coldspots and hotspots motifs detected in human genes associated with inherited disorders, and also of motifs with Czech population hotspots and de novo mutations. Main attention was paid to mutations leading to G/T and A+/C pairs. We observed that hotspots without CpG/CpHpG sequences are less flexible than coldspots, which indicates that flexible sequences are more effectively repaired. In contrary, hotspots with CpG/CpHpG sequences exhibited increased flexibility as coldspots. Their mutability is more likely related to spontaneous deamination of methylated cytosines leading to C > T mutations, which are primarily targeted by base excision repair. We corroborated conclusions based on computer simulations by measuring melting curves of hotspots and coldspots containing G/T mismatch.
Collapse
Affiliation(s)
- Michal Růžička
- CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic.,Department of Condensed Matter Physics, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Přemysl Souček
- CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Petr Kulhánek
- CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic.,National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Lenka Radová
- CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Lenka Fajkusová
- Centre of Molecular Biology and Gene Therapy, University Hospital Brno, Brno, Czech Republic
| | - Kamila Réblová
- CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| |
Collapse
|
13
|
Abstract
The early radiation of Neoaves has been hypothesized to be an intractable “hard polytomy”. We explore the fundamental properties of insertion/deletion alleles (indels), an under-utilized form of genomic data with the potential to help solve this. We scored >5 million indels from >7000 pan-genomic intronic and ultraconserved element (UCE) loci in 48 representatives of all neoavian orders. We found that intronic and UCE indels exhibited less homoplasy than nucleotide (nt) data. Gene trees estimated using indel data were less resolved than those estimated using nt data. Nevertheless, Accurate Species TRee Algorithm (ASTRAL) species trees estimated using indels were generally similar to nt-based ASTRAL trees, albeit with lower support. However, the power of indel gene trees became clear when we combined them with nt gene trees, including a striking result for UCEs. The individual UCE indel and nt ASTRAL trees were incongruent with each other and with the intron ASTRAL trees; however, the combined indel+nt ASTRAL tree was much more congruent with the intronic trees. Finally, combining indel and nt data for both introns and UCEs provided sufficient power to reduce the scope of the polytomy that was previously proposed for several supraordinal lineages of Neoaves.
Collapse
|
14
|
Abstract
Genetic robustness, or the ability of an organism to maintain fitness in the presence of mutations, can be achieved via protein feedback loops. Recent evidence suggests that organisms may also respond to mutations by upregulating related gene(s) independently of protein feedback loops, a phenomenon called transcriptional adaptation. However, the prevalence of transcriptional adaptation and its underlying molecular mechanisms are unknown. Here, by analyzing several models of transcriptional adaptation in zebrafish and mouse, we show a requirement for mRNA degradation. Alleles that fail to transcribe the mutated gene do not display transcriptional adaptation and exhibit more severe phenotypes than alleles displaying mutant mRNA decay. Transcriptome analysis reveals the upregulation of a substantial proportion of the genes that exhibit sequence similarity with the mutated gene’s mRNA, suggesting a sequence dependent mechanism. Besides implications for our understanding of disease-causing mutations, these findings will help design mutant alleles with minimal transcriptional adaptation-derived compensation.
Collapse
|
15
|
Wagih O, Galardini M, Busby BP, Memon D, Typas A, Beltrao P. A resource of variant effect predictions of single nucleotide variants in model organisms. Mol Syst Biol 2018; 14:e8430. [PMID: 30573687 PMCID: PMC6301329 DOI: 10.15252/msb.20188430] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2018] [Revised: 11/19/2018] [Accepted: 11/21/2018] [Indexed: 12/18/2022] Open
Abstract
The effect of single nucleotide variants (SNVs) in coding and noncoding regions is of great interest in genetics. Although many computational methods aim to elucidate the effects of SNVs on cellular mechanisms, it is not straightforward to comprehensively cover different molecular effects. To address this, we compiled and benchmarked sequence and structure-based variant effect predictors and we computed the impact of nearly all possible amino acid and nucleotide variants in the reference genomes of Homo sapiens, Saccharomyces cerevisiae and Escherichia coli Studied mechanisms include protein stability, interaction interfaces, post-translational modifications and transcription factor binding sites. We apply this resource to the study of natural and disease coding variants. We also show how variant effects can be aggregated to generate protein complex burden scores that uncover protein complex to phenotype associations based on a set of newly generated growth profiles of 93 sequenced S. cerevisiae strains in 43 conditions. This resource is available through mutfunc (www.mutfunc.com), a tool by which users can query precomputed predictions by providing amino acid or nucleotide-level variants.
Collapse
Affiliation(s)
- Omar Wagih
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Marco Galardini
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Bede P Busby
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Danish Memon
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Athanasios Typas
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Pedro Beltrao
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge, UK
| |
Collapse
|
16
|
Růžička M, Kulhánek P, Radová L, Čechová A, Špačková N, Fajkusová L, Réblová K. DNA mutation motifs in the genes associated with inherited diseases. PLoS One 2017; 12:e0182377. [PMID: 28767725 PMCID: PMC5540541 DOI: 10.1371/journal.pone.0182377] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 07/17/2017] [Indexed: 11/18/2022] Open
Abstract
Mutations in human genes can be responsible for inherited genetic disorders and cancer. Mutations can arise due to environmental factors or spontaneously. It has been shown that certain DNA sequences are more prone to mutate. These sites are termed hotspots and exhibit a higher mutation frequency than expected by chance. In contrast, DNA sequences with lower mutation frequencies than expected by chance are termed coldspots. Mutation hotspots are usually derived from a mutation spectrum, which reflects particular population where an effect of a common ancestor plays a role. To detect coldspots/hotspots unaffected by population bias, we analysed the presence of germline mutations obtained from HGMD database in the 5-nucleotide segments repeatedly occurring in genes associated with common inherited disorders, in particular, the PAH, LDLR, CFTR, F8, and F9 genes. Statistically significant sequences (mutational motifs) rarely associated with mutations (coldspots) and frequently associated with mutations (hotspots) exhibited characteristic sequence patterns, e.g. coldspots contained purine tract while hotspots showed alternating purine-pyrimidine bases, often with the presence of CpG dinucleotide. Using molecular dynamics simulations and free energy calculations, we analysed the global bending properties of two selected coldspots and two hotspots with a G/T mismatch. We observed that the coldspots were inherently more flexible than the hotspots. We assume that this property might be critical for effective mismatch repair as DNA with a mutation recognized by MutSα protein is noticeably bent.
Collapse
Affiliation(s)
- Michal Růžička
- CEITEC—Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, Czech Republic
- Department of Condensed Matter Physics, Faculty of Science, Masaryk University, Kotlářská 2, Brno, Czech Republic
| | - Petr Kulhánek
- CEITEC—Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, Czech Republic
- National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 5, Brno, Czech Republic
| | - Lenka Radová
- CEITEC—Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, Czech Republic
| | - Andrea Čechová
- CEITEC—Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, Czech Republic
| | - Naďa Špačková
- Department of Condensed Matter Physics, Faculty of Science, Masaryk University, Kotlářská 2, Brno, Czech Republic
| | - Lenka Fajkusová
- Centre of Molecular Biology and Gene Therapy, University Hospital Brno and Masaryk University, Jihlavská 20, Brno, Czech Republic
| | - Kamila Réblová
- CEITEC—Central European Institute of Technology, Masaryk University, Kamenice 5, Brno, Czech Republic
- * E-mail:
| |
Collapse
|
17
|
Gu Y, Bhatta AK, Du X, Shao M, Keyal U, Zhang G, Hua Y. Identification of one Novel complex delins mutation and one recurrent mutation of ERCC8 gene in a Chinese family with Cockayne Syndrome A. J Eur Acad Dermatol Venereol 2017; 31:e394-e395. [PMID: 28273366 DOI: 10.1111/jdv.14201] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Y Gu
- Department of Dermatology, Affiliated Wuxi People's Hospital, Nanjing Medical University, Wuxi, Jiangsu, 214023, China
| | - A K Bhatta
- Institute of Photomedicine, Shanghai Skin Disease Hospital, Tongji University School of Medicine, 200443, Shanghai, China
| | - X Du
- Department of Dermatology, Affiliated Wuxi People's Hospital, Nanjing Medical University, Wuxi, Jiangsu, 214023, China
| | - M Shao
- Department of Dermatology, Affiliated Wuxi People's Hospital, Nanjing Medical University, Wuxi, Jiangsu, 214023, China
| | - U Keyal
- Institute of Photomedicine, Shanghai Skin Disease Hospital, Tongji University School of Medicine, 200443, Shanghai, China
| | - G Zhang
- Institute of Photomedicine, Shanghai Skin Disease Hospital, Tongji University School of Medicine, 200443, Shanghai, China
| | - Y Hua
- Department of Neurology, Affiliated Wuxi Children's Hospital, Nanjing Medical University, Wuxi, Jiangsu, 214023, China
| |
Collapse
|
18
|
A Small Indel Mutant Mouse Model of Epidermolytic Palmoplantar Keratoderma and Its Application to Mutant-specific shRNA Therapy. MOLECULAR THERAPY. NUCLEIC ACIDS 2016; 5:e299. [PMID: 27003758 PMCID: PMC5014458 DOI: 10.1038/mtna.2016.17] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 02/15/2016] [Indexed: 12/13/2022]
Abstract
Epidermolytic palmoplantar keratoderma (EPPK) is a relatively common autosomal-dominant skin disorder caused by mutations in the keratin 9 gene (KRT9), with few therapeutic options for the affected so far. Here, we report a knock-in transgenic mouse model that carried a small insertion–deletion (indel) mutant of Krt9, c.434delAinsGGCT (p.Tyr144delinsTrpLeu), corresponding to the human mutation KRT9/c.500delAinsGGCT (p.Tyr167delinsTrpLeu), which resulted in a human EPPK-like phenotype in the weight-stress areas of the fore- and hind-paws of both Krt9+/mut and Krt9mut/mut mice. The phenotype confirmed that EPPK is a dominant-negative condition, such that mice heterozygotic for the K9-mutant allele (Krt9+/mut) showed a clear EPPK-like phenotype. Then, we developed a mutant-specific short hairpin RNA (shRNA) therapy for EPPK mice. Mutant-specific shRNAs were systematically identified in vitro using a luciferase reporter gene assay and delivered into Krt9+/mut mice. shRNA-mediated knockdown of mutant protein resulted in almost normal morphology and functions of the skin, whereas the same shRNA had a negligible effect in wild-type K9 mice. Our results suggest that EPPK can be treated by gene therapy, and this has significant implications for future clinical application.
Collapse
|
19
|
Morine M, Kohmoto T, Masuda K, Inagaki H, Watanabe M, Naruto T, Kurahashi H, Maeda K, Imoto I. A unique TBX5 microdeletion with microinsertion detected in patient with Holt-Oram syndrome. Am J Med Genet A 2016; 167A:3192-6. [PMID: 26780237 DOI: 10.1002/ajmg.a.37359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 08/24/2015] [Indexed: 11/08/2022]
Abstract
Holt-Oram syndrome (HOS) is an autosomal dominant condition characterized by upper limb and congenital heart defects and caused by numerous germline mutations of TBX5 producing preterminal stop codons. Here, we report on a novel and unusual heterozygous TBX5 microdeletion with microinsertion (microindel) mutation (c.627delinsGTGACTCAGGAAACGCTTTCCTGA), which is predicted to synthesize a truncated TBX5 protein, detected in a sporadic patient with clinical features of HOS prenatally diagnosed by ultrasonography. This uncommon and relatively large inserted sequence contains sequences derived from nearby but not adjacent templates on both sense and antisense strands, suggesting two possible models, which require no repeat sequences, causing this complex microindel through the bypass of large DNA adducts via an error-prone DNA polymerase-mediated translesion synthesis.
Collapse
Affiliation(s)
- Mikio Morine
- General Perinatal Medical Center, Shikoku Medical Center for Children and Adults, Zentsuji, Japan
| | - Tomohiro Kohmoto
- Department of Human Genetics, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan.,Student Lab, Tokushima University Faculty of Medicine, Tokushima, Japan
| | - Kiyoshi Masuda
- Department of Human Genetics, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan
| | - Hidehito Inagaki
- Division of Molecular Genetics, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, Japan
| | - Miki Watanabe
- Department of Human Genetics, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan.,Student Lab, Tokushima University Faculty of Medicine, Tokushima, Japan
| | - Takuya Naruto
- Department of Stress Science, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan
| | - Hiroki Kurahashi
- Division of Molecular Genetics, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, Japan
| | - Kazuhisa Maeda
- General Perinatal Medical Center, Shikoku Medical Center for Children and Adults, Zentsuji, Japan
| | - Issei Imoto
- Department of Human Genetics, Institute of Biomedical Sciences, Tokushima University Graduate School, Tokushima, Japan
| |
Collapse
|
20
|
Pereza N, Ostojić S, Smirčić A, Hodžić A, Kapović M, Peterlin B. The -2549 insertion/deletion polymorphism in the promoter region of the VEGFA gene in couples with idiopathic recurrent spontaneous abortion. J Assist Reprod Genet 2015; 32:1789-94. [PMID: 26475697 DOI: 10.1007/s10815-015-0593-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2015] [Accepted: 10/01/2015] [Indexed: 12/17/2022] Open
Abstract
PURPOSE The vascular endothelial growth factor A (VEGFA) is crucial for normal vasculogenesis and angiogenesis during pregnancy, and alterations in the VEGFA gene expression were detected in women with idiopathic recurrent spontaneous abortion (IRSA) and spontaneously aborted conceptuses. Our aim was to evaluate whether there is an association between the functional -2549 insertion/deletion (I/D) polymorphism in the promoter region of the VEGFA gene and IRSA in reproductive couples. METHODS We performed a case-control study involving 149 women and their 140 partners with three or more IRSA and 149 control women and men. Allele-specific polymerase chain reaction was used for genotyping. RESULTS We found no association of the -2549 I/D polymorphism with IRSA in women. However, men with the DD genotype have a 1.75-fold increased risk of IRSA compared with men carrying the ID and II genotypes (95 % confidence interval (CI) = 1.05-2.93, P = 0.032). In addition, the D allele in men contributes to a 1.42-fold increased risk of IRSA (95 % CI = 1.02-1.97, P = 0.036) compared to men carrying the I allele. CONCLUSIONS Our results indicate that the -2549 I/D polymorphism in the VEGFA gene in men might be associated with IRSA. Additional genetic association studies including both partners, as well as expression studies, are needed to elucidate the role of this polymorphism in IRSA.
Collapse
Affiliation(s)
- Nina Pereza
- Department of Biology and Medical Genetics, Faculty of Medicine, University of Rijeka, B. Branchetta 20, Rijeka, 51000, Croatia
| | - Saša Ostojić
- Department of Biology and Medical Genetics, Faculty of Medicine, University of Rijeka, B. Branchetta 20, Rijeka, 51000, Croatia.
| | - Anamarija Smirčić
- Department of Biology and Medical Genetics, Faculty of Medicine, University of Rijeka, B. Branchetta 20, Rijeka, 51000, Croatia
| | - Alenka Hodžić
- Department of Gynaecology and Obstetrics, Clinical Institute of Medical Genetics, UMC Ljubljana, Ljubljana, 1000, Slovenia
| | - Miljenko Kapović
- Department of Biology and Medical Genetics, Faculty of Medicine, University of Rijeka, B. Branchetta 20, Rijeka, 51000, Croatia
| | - Borut Peterlin
- Department of Gynaecology and Obstetrics, Clinical Institute of Medical Genetics, UMC Ljubljana, Ljubljana, 1000, Slovenia
| |
Collapse
|
21
|
Simonti CN, Capra JA. The evolution of the human genome. Curr Opin Genet Dev 2015; 35:9-15. [PMID: 26338498 DOI: 10.1016/j.gde.2015.08.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Revised: 08/08/2015] [Accepted: 08/12/2015] [Indexed: 02/05/2023]
Abstract
Human genomes hold a record of the evolutionary forces that have shaped our species. Advances in DNA sequencing, functional genomics, and population genetic modeling have deepened our understanding of human demographic history, natural selection, and many other long-studied topics. These advances have also revealed many previously underappreciated factors that influence the evolution of the human genome, including functional modifications to DNA and histones, conserved 3D topological chromatin domains, structural variation, and heterogeneous mutation patterns along the genome. Using evolutionary theory as a lens to study these phenomena will lead to significant breakthroughs in understanding what makes us human and why we get sick.
Collapse
Affiliation(s)
- Corinne N Simonti
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA
| | - John A Capra
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN 37235, USA; Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA; Department of Biomedical Informatics, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
22
|
Song T, Xiao S, Yao D, Huang F, Hu M, Liang H. An efficient DNA-fueled molecular machine for the discrimination of single-base changes. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2014; 26:6181-5. [PMID: 25066311 DOI: 10.1002/adma.201402314] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2014] [Indexed: 05/18/2023]
Abstract
A new strategy for single-base polymorphism (SNP) detection based on the assembly of DNA-AuNPs (gold nanoparticles) driven by a DNA-fueled molecular machine, is established and optimized. It is highly efficient, works at room temperature, and is easy to handle. A single-base change on an oligonucleotide strand is unambiguously discriminated for either SNPs or insertions and deletions (indels). The strategy is demonstrated to detect a mutation in the breast cancer gene BRCA1 in homogeneous solution at room temperature.
Collapse
Affiliation(s)
- Tingjie Song
- CAS Key Laboratory of Soft Matter Chemistry, Collaborative Innovation Center of Chemistry for Energy Materials, Department of Polymer Science and Engineering, University of Science and Technology of China, Hefei, Anhui, 230026, PR China
| | | | | | | | | | | |
Collapse
|
23
|
Abstract
Models of codon evolution have attracted particular interest because of their unique capabilities to detect selection forces and their high fit when applied to sequence evolution. We described here a novel approach for modeling codon evolution, which is based on Kronecker product of matrices. The 61 × 61 codon substitution rate matrix is created using Kronecker product of three 4 × 4 nucleotide substitution matrices, the equilibrium frequency of codons, and the selection rate parameter. The entities of the nucleotide substitution matrices and selection rate are considered as parameters of the model, which are optimized by maximum likelihood. Our fully mechanistic model allows the instantaneous substitution matrix between codons to be fully estimated with only 19 parameters instead of 3,721, by using the biological interdependence existing between positions within codons. We illustrate the properties of our models using computer simulations and assessed its relevance by comparing the AICc measures of our model and other models of codon evolution on simulations and a large range of empirical data sets. We show that our model fits most biological data better compared with the current codon models. Furthermore, the parameters in our model can be interpreted in a similar way as the exchangeability rates found in empirical codon models.
Collapse
Affiliation(s)
- Maryam Zaheri
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Linda Dib
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| | - Nicolas Salamin
- Department of Ecology and Evolution, Biophore, University of Lausanne, 1015 Lausanne, SwitzerlandSwiss Institute of Bioinformatics, Genopode, Quartier Sorge, 1015 Lausanne, Switzerland
| |
Collapse
|
24
|
Asanuma H, Kashida H, Kamiya Y. De novo design of functional oligonucleotides with acyclic scaffolds. CHEM REC 2014; 14:1055-69. [PMID: 25171046 DOI: 10.1002/tcr.201402040] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Indexed: 01/20/2023]
Abstract
In this account, we demonstrate a new methodology for the de novo design of functional oligonucleotides with the acyclic scaffolds threoninol and serinol. Four functional motifs-wedge, interstrand-wedge, dimer, and cluster-have been prepared from natural DNA or RNA and functional base surrogates prepared from d-threoninol. The following applications of these motifs are described: (1) photoregulation of formation and dissociation of a DNA duplex modified with azobenzene, (2) sequence-specific detection of DNA using a fluorescent probe, (3) formation of fluorophore assemblies that mimic quantum dots, (4) improved strand selectivity of siRNA modified with a base surrogate, and (5) in vivo tracing of the RNAi pathway. Finally, we introduce artificial nucleic acids (XNAs) prepared from d-threoninol and serinol functionalized with each of the four nucleobases, which have unique properties compared with other acyclic XNAs. Functional oligonucleotides designed from acyclic scaffolds will be powerful tools for both DNA nanotechnology and biotechnology.
Collapse
Affiliation(s)
- Hiroyuki Asanuma
- Department of Molecular Design and Engineering, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8603, Japan.
| | | | | |
Collapse
|
25
|
Genetic analysis of West Nile virus isolates from an outbreak in Idaho, United States, 2006-2007. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2013; 10:4486-506. [PMID: 24065039 PMCID: PMC3799518 DOI: 10.3390/ijerph10094486] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Revised: 09/12/2013] [Accepted: 09/16/2013] [Indexed: 12/26/2022]
Abstract
West Nile virus (WNV) appeared in the U.S. in 1999 and has since become endemic, with yearly summer epidemics causing tens of thousands of cases of serious disease over the past 14 years. Analysis of WNV strains isolated during the 2006–2007 epidemic seasons demonstrates that a new genetic variant had emerged coincidentally with an intense outbreak in Idaho during 2006. The isolates belonging to the new variant carry a 13 nt deletion, termed ID-Δ13, located at the variable region of the 3′UTR, and are genetically related. The analysis of deletions and insertions in the 3′UTR of two major lineages of WNV revealed the presence of conserved repeats and two indel motifs in the variable region of the 3′UTR. One human and two bird isolates from the Idaho 2006–2007 outbreaks were sequenced using Illumina technology and within-host variability was analyzed. Continued monitoring of new genetic variants is important for public health as WNV continues to evolve.
Collapse
|
26
|
Sankaranarayanan K, Taleei R, Rahmanian S, Nikjoo H. Ionizing radiation and genetic risks. XVII. Formation mechanisms underlying naturally occurring DNA deletions in the human genome and their potential relevance for bridging the gap between induced DNA double-strand breaks and deletions in irradiated germ cells. Mutat Res 2013; 753:114-130. [PMID: 23948232 DOI: 10.1016/j.mrrev.2013.07.003] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2013] [Revised: 06/27/2013] [Accepted: 07/22/2013] [Indexed: 12/21/2022]
Abstract
While much is known about radiation-induced DNA double-strand breaks (DSBs) and their repair, the question of how deletions of different sizes arise as a result of the processing of DSBs by the cell's repair systems has not been fully answered. In order to bridge this gap between DSBs and deletions, we critically reviewed published data on mechanisms pertaining to: (a) repair of DNA DSBs (from basic studies in this area); (b) formation of naturally occurring structural variation (SV) - especially of deletions - in the human genome (from genomic studies) and (c) radiation-induced mutations and structural chromosomal aberrations in mammalian somatic cells (from radiation mutagenesis and radiation cytogenetic studies). The specific aim was to assess the relative importance of the postulated mechanisms in generating deletions in the human genome and examine whether empirical data on radiation-induced deletions in mouse germ cells are consistent with predictions of these mechanisms. The mechanisms include (a) NHEJ, a DSB repair process that does not require any homology and which functions in all stages of the cell cycle (and is of particular relevance in G0/G1); (b) MMEJ, also a DSB repair process but which requires microhomology and which presumably functions in all cell cycle stages; (c) NAHR, a recombination-based DSB repair mechanism which operates in prophase I of meiosis in germ cells; (d) MMBIR, a microhomology-mediated, replication-based mechanism which operates in the S phase of the cell cycle, and (e) strand slippage during replication (involved in the origin of small insertions and deletions (INDELs). Our analysis permits the inference that, between them, these five mechanisms can explain nearly all naturally occurring deletions of different sizes identified in the human genome, NAHR and MMBIR being potentially more versatile in this regard. With respect to radiation-induced deletions, the basic studies suggest that those arising as a result of the operation of NHEJ/MMEJ processes, as currently formulated, are expected to be relatively small. However, data on induced mutations in mouse spermatogonial stem cells (irradiation in G0/G1 phase of the cell cycle and DSB repair presumed to be via NHEJ predominantly) show that most are associated with deletions of different sizes, some in the megabase range. There is thus a 'discrepancy' between what the basic studies suggest and the empirical observations in mutagenesis studies. This discrepancy, however, is only an apparent but not a real one. It can be resolved by considering the issue of deletions in the broader context of and in conjunction with the organization of chromatin in chromosomes and nuclear architecture, the conceptual framework for which already exists in studies carried out during the past fifteen years or so. In this paper, we specifically hypothesize that repair of DSBs induced in chromatin loops may offer a basis to explain the induction of deletions of different sizes and suggest an approach to test the hypothesis. We emphasize that the bridging of the gap between induced DSB and resulting deletions of different sizes is critical for current efforts in computational modeling of genetic risks.
Collapse
Affiliation(s)
- Krishnaswami Sankaranarayanan
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden
| | - Reza Taleei
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden
| | - Shirin Rahmanian
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden
| | - Hooshang Nikjoo
- Radiation Biophysics Group, Department of Oncology-Pathology, Karolinska Instituet, Box 260, Stockholm SE 17176, Sweden.
| |
Collapse
|
27
|
Characterization of bud emergence 46 (BEM46) protein: sequence, structural, phylogenetic and subcellular localization analyses. Biochem Biophys Res Commun 2013; 438:526-32. [PMID: 23916612 DOI: 10.1016/j.bbrc.2013.07.103] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 07/25/2013] [Indexed: 02/04/2023]
Abstract
The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved "known unknown" eukaryotic genes. This warrants for a comprehensive phylogenetic analysis of this superfamily to unravel origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Usingcomparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow to differentiate fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineage. Furthermore, we unravel that BEM46 protein from N. crassa possess a novel endoplasmic-retention signal (PEKK) using GFP-fusion tagging experiments. We propose that three residues namely a serine 188S, a histidine 292H and an aspartic acid 262D are most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional analyses. The evolutionary history of BEM46 proteins is characterized by exonic indels in lineage specific manner.
Collapse
|
28
|
Storkanova G, Vlaskova H, Chuzhanova N, Zeman J, Stranecky V, Majer F, Peskova K, Luksan O, Jirsa M, Hrebicek M, Dvorakova L. Ornithine carbamoyltransferase deficiency: molecular characterization of 29 families. Clin Genet 2013; 84:552-9. [DOI: 10.1111/cge.12085] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2012] [Revised: 12/21/2012] [Accepted: 12/21/2012] [Indexed: 11/30/2022]
Affiliation(s)
- G Storkanova
- Institute of Inherited Metabolic Disorders, First Faculty of Medicine; Charles University in Prague, and General University Hospital in Prague; Czech Republic
| | - H Vlaskova
- Institute of Inherited Metabolic Disorders, First Faculty of Medicine; Charles University in Prague, and General University Hospital in Prague; Czech Republic
| | - N Chuzhanova
- School of Science and Technology; Nottingham Trent University; Nottingham NG11 8NS UK
| | - J Zeman
- Department of Paediatrics and Adolescent Medicine; First Faculty of Medicine, Charles University in Prague, and General University Hospital in Prague; Czech Republic
| | - V Stranecky
- Institute of Inherited Metabolic Disorders, First Faculty of Medicine; Charles University in Prague, and General University Hospital in Prague; Czech Republic
| | - F Majer
- Institute of Inherited Metabolic Disorders, First Faculty of Medicine; Charles University in Prague, and General University Hospital in Prague; Czech Republic
| | - K Peskova
- Institute of Inherited Metabolic Disorders, First Faculty of Medicine; Charles University in Prague, and General University Hospital in Prague; Czech Republic
| | - O Luksan
- Laboratory of Experimental Hepatology; Institute for Clinical and Experimental Medicine (IKEM); Prague Czech Republic
| | - M Jirsa
- Laboratory of Experimental Hepatology; Institute for Clinical and Experimental Medicine (IKEM); Prague Czech Republic
| | - M Hrebicek
- Institute of Inherited Metabolic Disorders, First Faculty of Medicine; Charles University in Prague, and General University Hospital in Prague; Czech Republic
| | - L Dvorakova
- Institute of Inherited Metabolic Disorders, First Faculty of Medicine; Charles University in Prague, and General University Hospital in Prague; Czech Republic
| |
Collapse
|
29
|
Cardoso-Moreira M, Arguello JR, Clark AG. Mutation spectrum of Drosophila CNVs revealed by breakpoint sequencing. Genome Biol 2012; 13:R119. [PMID: 23259534 PMCID: PMC4056370 DOI: 10.1186/gb-2012-13-12-r119] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2012] [Accepted: 12/22/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The detailed study of breakpoints associated with copy number variants (CNVs) can elucidate the mutational mechanisms that generate them and the comparison of breakpoints across species can highlight differences in genomic architecture that may lead to lineage-specific differences in patterns of CNVs. Here, we provide a detailed analysis of Drosophila CNV breakpoints and contrast it with similar analyses recently carried out for the human genome. RESULTS By applying split-read methods to a total of 10x coverage of 454 shotgun sequence across nine lines of D. melanogaster and by re-examining a previously published dataset of CNVs detected using tiling arrays, we identified the precise breakpoints of more than 600 insertions, deletions, and duplications. Contrasting these CNVs with those found in humans showed that in both taxa CNV breakpoints fall into three classes: blunt breakpoints; simple breakpoints associated with microhomology; and breakpoints with additional nucleotides inserted/deleted and no microhomology. In both taxa CNV breakpoints are enriched with non-B DNA sequence structures, which may impair DNA replication and/or repair. However, in contrast to human genomes, non-allelic homologous-recombination (NAHR) plays a negligible role in CNV formation in Drosophila. In flies, non-homologous repair mechanisms are responsible for simple, recurrent, and complex CNVs, including insertions of de novo sequence as large as 60 bp. CONCLUSIONS Humans and Drosophila differ considerably in the importance of homology-based mechanisms for the formation of CNVs, likely as a consequence of the differences in the abundance and distribution of both segmental duplications and transposable elements between the two genomes.
Collapse
|
30
|
Abstract
Griscelli syndrome (GS) is a rare autosomal recessive disorder associated with skin or hair hypopigmentation, hepatosplenomegaly, pancytopenia, and immunologic and central nervous system abnormalities. GS type II is caused by RAB27A mutations. We present RAB27A mutation analysis of 6 cases diagnosed as GS type II. Missense mutations (L26P and L130P) in 2 cases, deletion of 5 bases (514delCAAGC) in 2 cases, and 1 base deletion (148delA) in 2 cases were detected. This report has importance in phenotype-genotype correlation of different types of mutations including missense mutations and deletions within the RAB27A gene in GSII syndrome.
Collapse
|
31
|
Pelleymounter LL, Moon I, Johnson JA, Laederach A, Halvorsen M, Eckloff B, Abo R, Rossetti S. A novel application of pattern recognition for accurate SNP and indel discovery from high-throughput data: targeted resequencing of the glucocorticoid receptor co-chaperone FKBP5 in a Caucasian population. Mol Genet Metab 2011; 104:457-69. [PMID: 21917492 PMCID: PMC3224211 DOI: 10.1016/j.ymgme.2011.08.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/24/2011] [Revised: 08/18/2011] [Accepted: 08/18/2011] [Indexed: 11/28/2022]
Abstract
The detection of single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) with precision from high-throughput data remains a significant bioinformatics challenge. Accurate detection is necessary before next-generation sequencing can routinely be used in the clinic. In research, scientific advances are inhibited by gaps in data, exemplified by the underrepresented discovery of rare variants, variants in non-coding regions and indels. The continued presence of false positives and false negatives prevents full automation and requires additional manual verification steps. Our methodology presents applications of both pattern recognition and sensitivity analysis to eliminate false positives and aid in the detection of SNP/indel loci and genotypes from high-throughput data. We chose FK506-binding protein 51(FKBP5) (6p21.31) for our clinical target because of its role in modulating pharmacological responses to physiological and synthetic glucocorticoids and because of the complexity of the genomic region. We detected genetic variation across a 160 kb region encompassing FKBP5. 613 SNPs and 57 indels, including a 3.3 kb deletion were discovered. We validated our method using three independent data sets and, with Sanger sequencing and Affymetrix and Illumina microarrays, achieved 99% concordance. Furthermore we were able to detect 267 novel rare variants and assess linkage disequilibrium. Our results showed both a sensitivity and specificity of 98%, indicating near perfect classification between true and false variants. The process is scalable and amenable to automation, with the downstream filters taking only 1.5h to analyze 96 individuals simultaneously. We provide examples of how our level of precision uncovered the interactions of multiple loci, their predicted influences on mRNA stability, perturbations of the hsp90 binding site, and individual variation in FKBP5 expression. Finally we show how our discovery of rare variants may change current conceptions of evolution at this locus.
Collapse
Affiliation(s)
- Linda L Pelleymounter
- Department of Pharmacology, Department of Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA.
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Hamada M, Wijaya E, Frith MC, Asai K. Probabilistic alignments with quality scores: an application to short-read mapping toward accurate SNP/indel detection. ACTA ACUST UNITED AC 2011; 27:3085-92. [PMID: 21976422 DOI: 10.1093/bioinformatics/btr537] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
MOTIVATION Recent studies have revealed the importance of considering quality scores of reads generated by next-generation sequence (NGS) platforms in various downstream analyses. It is also known that probabilistic alignments based on marginal probabilities (e.g. aligned-column and/or gap probabilities) provide more accurate alignment than conventional maximum score-based alignment. There exists, however, no study about probabilistic alignment that considers quality scores explicitly, although the method is expected to be useful in SNP/indel callers and bisulfite mapping, because accurate estimation of aligned columns or gaps is important in those analyses. RESULTS In this study, we propose methods of probabilistic alignment that consider quality scores of (one of) the sequences as well as a usual score matrix. The method is based on posterior decoding techniques in which various marginal probabilities are computed from a probabilistic model of alignments with quality scores, and can arbitrarily trade-off sensitivity and positive predictive value (PPV) of prediction (aligned columns and gaps). The method is directly applicable to read mapping (alignment) toward accurate detection of SNPs and indels. Several computational experiments indicated that probabilistic alignments can estimate aligned columns and gaps accurately, compared with other mapping algorithms e.g. SHRiMP2, Stampy, BWA and Novoalign. The study also suggested that our approach yields favorable precision for SNP/indel calling.
Collapse
Affiliation(s)
- Michiaki Hamada
- Graduate School of Frontier Sciences, University of Tokyo, Kashiwa 277-8562, Japan.
| | | | | | | |
Collapse
|
33
|
Cooper DN, Bacolla A, Férec C, Vasquez KM, Kehrer-Sawatzki H, Chen JM. On the sequence-directed nature of human gene mutation: the role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease. Hum Mutat 2011; 32:1075-99. [PMID: 21853507 PMCID: PMC3177966 DOI: 10.1002/humu.21557] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2011] [Accepted: 06/17/2011] [Indexed: 12/21/2022]
Abstract
Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher order features of the genomic architecture. The human genome is now recognized to contain "pervasive architectural flaws" in that certain DNA sequences are inherently mutation prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here, we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of noncanonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair and may serve to increase mutation frequencies in generalized fashion (i.e., both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease.
Collapse
Affiliation(s)
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom.
| | | | | | | | | | | |
Collapse
|
34
|
Hickey G, Blanchette M. A probabilistic model for sequence alignment with context-sensitive indels. J Comput Biol 2011; 18:1449-64. [PMID: 21951055 DOI: 10.1089/cmb.2011.0157] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Probabilistic approaches for sequence alignment are usually based on pair Hidden Markov Models (HMMs) or Stochastic Context Free Grammars (SCFGs). Recent studies have shown a significant correlation between the content of short indels and their flanking regions, which by definition cannot be modelled by the above two approaches. In this work, we present a context-sensitive indel model based on a pair Tree-Adjoining Grammar (TAG), along with accompanying algorithms for efficient alignment and parameter estimation. The increased precision and statistical power of this model is shown on simulated and real genomic data. As the cost of sequencing plummets, the usefulness of comparative analysis is becoming limited by alignment accuracy rather than data availability. Our results will therefore have an impact on any type of downstream comparative genomics analyses that rely on alignments. Fine-grained studies of small functional regions or disease markers, for example, could be significantly improved by our method. The implementation is available at www.mcb.mcgill.ca/~blanchem/software.html.
Collapse
Affiliation(s)
- Glenn Hickey
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California 95064, USA.
| | | |
Collapse
|
35
|
Abstract
Sequence alignment (the grouping of homologous bases into one column) is fundamental to almost any task in comparative genomics. This translates to positing gaps in the genomic sequences to account for events of insertions and deletions (indels). The interrelationship between sequence alignment and phylogenetic reconstruction has drawn substantial attention recently with works showing the significance of differences in alignments. One of the plausible approaches in this direction is to grade the suitability of a tree to an associated alignment and vice verse. We here present a combinatorial (as opposed to statistical) approach based on the indel history. We show--both by simulations and by using real biological data from the Encyclopedia of DNA Elements (ENCODE)--that this criterion is sound. The novelty of our approach is the distinguishing between insertions and deletions, and augmenting the analysis with a dimension of "depth," extending it from the sequence space to the phylogenetic space. Using this approach, we perform a comprehensive study of indel characteristic behavior among mammals in both coding and non-coding regions. Our results show significant differences in indel patterns between coding and non-coding regions. We also show other characteristic patterns of indel evolution in the depth of the underlying phylogeny.
Collapse
Affiliation(s)
- Sagi Snir
- Department of Evolutionary Biology and the Institute of Evolution, Haifa University, Haifa, Israel.
| | | |
Collapse
|
36
|
Mu XJ, Lu ZJ, Kong Y, Lam HYK, Gerstein MB. Analysis of genomic variation in non-coding elements using population-scale sequencing data from the 1000 Genomes Project. Nucleic Acids Res 2011; 39:7058-76. [PMID: 21596777 PMCID: PMC3167619 DOI: 10.1093/nar/gkr342] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In the human genome, it has been estimated that considerably more sequence is under natural selection in non-coding regions [such as transcription-factor binding sites (TF-binding sites) and non-coding RNAs (ncRNAs)] compared to protein-coding ones. However, less attention has been paid to them. To study selective pressure on non-coding elements, we use next-generation sequencing data from the recently completed pilot phase of the 1000 Genomes Project, which, compared to traditional methods, allows for the characterization of a full spectrum of genomic variations, including single-nucleotide polymorphisms (SNPs), short insertions and deletions (indels) and structural variations (SVs). We develop a framework for combining these variation data with non-coding elements, calculating various population-based metrics to compare classes and subclasses of elements, and developing element-aware aggregation procedures to probe the internal structure of an element. Overall, we find that TF-binding sites and ncRNAs are less selectively constrained for SNPs than coding sequences (CDSs), but more constrained than a neutral reference. We also determine that the relative amounts of constraint for the three types of variations are, in general, correlated, but there are some differences: counter-intuitively, TF-binding sites and ncRNAs are more selectively constrained for indels than for SNPs, compared to CDSs. After inspecting the overall properties of a class of elements, we analyze selective pressure on subclasses within an element class, and show that the extent of selection is associated with the genomic properties of each subclass. We find, for instance, that ncRNAs with higher expression levels tend to be under stronger purifying selection, and the actual regions of TF-binding motifs are under stronger selective pressure than the corresponding peak regions. Further, we develop element-aware aggregation plots to analyze selective pressure across the linear structure of an element, with the confidence intervals evaluated using both simple bootstrapping and block bootstrapping techniques. We find, for example, that both micro-RNAs (particularly the seed regions) and their binding targets are under stronger selective pressure for SNPs than their immediate genomic surroundings. In addition, we demonstrate that substitutions in TF-binding motifs inversely correlate with site conservation, and SNPs unfavorable for motifs are under more selective constraints than favorable SNPs. Finally, to further investigate intra-element differences, we show that SVs have the tendency to use distinctive modes and mechanisms when they interact with genomic elements, such as enveloping whole gene(s) rather than disrupting them partially, as well as duplicating TF motifs in tandem.
Collapse
Affiliation(s)
- Xinmeng Jasmine Mu
- Program in Computational Biology and Bioinformatics, Department of Molecular Biophysics and Biochemistry, W.M. Keck Foundation Biotechnology Resource Laboratory, Yale University, New Haven, CT 06520, USA
| | | | | | | | | |
Collapse
|
37
|
Ivanov D, Hamby SE, Stenson PD, Phillips AD, Kehrer-Sawatzki H, Cooper DN, Chuzhanova N. Comparative analysis of germline and somatic microlesion mutational spectra in 17 human tumor suppressor genes. Hum Mutat 2011; 32:620-32. [PMID: 21432943 DOI: 10.1002/humu.21483] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2010] [Accepted: 02/07/2011] [Indexed: 12/17/2022]
Abstract
Mutations associated with tumorigenesis may either arise somatically or can be inherited through the germline. We performed a comparison of somatic, germline, shared (found in both soma and germline) and somatic recurrent mutational spectra for 17 human tumor suppressor genes, which focused upon missense single base-pair substitutions and microdeletions/microinsertions. Somatic and germline mutational spectra were similar in relation to C.G>T.A transitions but differed with respect to the frequency of A.T>G.C, A.T>T.A, and C.G>A.T substitutions. Shared missense mutations were characterized by higher mutability rates, greater physicochemical differences between wild-type and mutant residues, and a tendency to occur in evolutionarily conserved residues and within CpG/CpHpG oligonucleotides. Mononucleotide runs (≥4 bp) were identified as hotspots for shared microdeletions/microinsertions. Both germline and somatic microdeletions/microinsertions were found to be significantly overrepresented within the "indel-hotspot" motif, GTAAGT. Using a naïve Bayes' classifier trained to discriminate between five missense mutation groups, 63% of mutations in our dataset were on average correctly recognized. Applying this classifier to an independent dataset of probable driver mutations, we concluded that ∼50% of these somatic missense mutations possess features consistent with their being either shared or recurrent, suggesting that a disproportionate number of such lesions are likely to be drivers of tumorigenesis.
Collapse
Affiliation(s)
- Dobril Ivanov
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, United Kingdom
| | | | | | | | | | | | | |
Collapse
|
38
|
Vargas-Poussou R, Dahan K, Kahila D, Venisse A, Riveira-Munoz E, Debaix H, Grisart B, Bridoux F, Unwin R, Moulin B, Haymann JP, Vantyghem MC, Rigothier C, Dussol B, Godin M, Nivet H, Dubourg L, Tack I, Gimenez-Roqueplo AP, Houillier P, Blanchard A, Devuyst O, Jeunemaitre X. Spectrum of mutations in Gitelman syndrome. J Am Soc Nephrol 2011; 22:693-703. [PMID: 21415153 DOI: 10.1681/asn.2010090907] [Citation(s) in RCA: 154] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Gitelman's syndrome (GS) is a rare, autosomal recessive, salt-losing tubulopathy caused by mutations in the SLC12A3 gene, which encodes the thiazide-sensitive NaCl cotransporter (NCC). Because 18 to 40% of suspected GS patients carry only one SLC12A3 mutant allele, large genomic rearrangements may account for unidentified mutations. Here, we directly sequenced genomic DNA from a large cohort of 448 unrelated patients suspected of having GS. We found 172 distinct mutations, of which 100 were unreported previously. In 315 patients (70%), we identified two mutations; in 81 patients (18%), we identified one; and in 52 patients (12%), we did not detect a mutation. In 88 patients, we performed a search for large rearrangements by multiplex ligation-dependent probe amplification (MLPA) and found nine deletions and two duplications in 24 of the 51 heterozygous patients. A second technique confirmed each rearrangement. Based on the breakpoints of seven deletions, nonallelic homologous recombination by Alu sequences and nonhomologous end-joining probably favor these intragenic deletions. In summary, missense mutations account for approximately 59% of the mutations in Gitelman's syndrome, and there is a predisposition to large rearrangements (6% of our cases) caused by the presence of repeated sequences within the SLC12A3 gene.
Collapse
Affiliation(s)
- Rosa Vargas-Poussou
- Assistance Publique-Hôpitaux de Paris, Hôpital Européen Georges Pompidou, Service de Génétique, Paris, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Kashida H, Kondo N, Sekiguchi K, Asanuma H. Detection of three-base deletion by exciplex formation with perylene derivatives. Chem Commun (Camb) 2011; 47:6404-6. [DOI: 10.1039/c1cc11041a] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
40
|
Meyer K, Ueland PM. Use of matrix-assisted laser desorption/ionization time-of-flight mass spectrometry for multiplex genotyping. Adv Clin Chem 2011; 53:1-29. [PMID: 21404912 DOI: 10.1016/b978-0-12-385855-9.00001-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
After completion of the human genome project, the focus of geneticists has shifted to elucidation of gene function and genetic diversity to understand the mechanisms of complex diseases or variation of patient response in drug treatment. In the past decade, many different genotyping techniques have been described for the detection of single-nucleotide polymorphisms (SNPs) and other common polymorphic variants. Matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS) is among the most powerful and widely used genotyping technologies. The method offers great flexibility in assay design and enables highly accurate genotyping at high sample throughput. Different strategies for allele discrimination and quantification have been combined with MALDI (hybridization, ligation, cleavage, and primer extension). Approaches based on primer extension have become the most popular applications. This combination enables rapid and reliable multiplexing of SNPs and other common variants, and makes MALDI-TOF-MS well suited for large-scale studies in fine-mapping and verification of genome-wide scans. In contrast to standard genotyping, more demanding approaches have enabled genotyping of DNA pools, molecular haplotyping or the detection of free circulating DNA for prenatal or cancer diagnostics. In addition, MALDI can also be used in novel applications as DNA methylation analysis, expression profiling, and resequencing. This review gives an introduction to multiplex genotyping by MALDI-MS and will focus on the latest developments of this technology.
Collapse
|
41
|
Analysis of NF1 somatic mutations in cutaneous neurofibromas from patients with high tumor burden. Neurogenetics 2010; 11:391-400. [PMID: 20358387 DOI: 10.1007/s10048-010-0240-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2009] [Accepted: 03/01/2010] [Indexed: 01/09/2023]
Abstract
Neurofibromatosis type 1, (NF1) is a complex, autosomal dominant disorder characterized by benign and malignant tumors which result from NF1 gene mutations. The molecular mechanisms that underlie NF1 tumorigenesis are still poorly understood although inactivation of other modifying loci in conjunction with NF1 mutations is postulated to be involved. These modifying loci may include deficiencies in mismatch repair genes and elements involved in cell cycle regulation (TP53, RB1, and CDKN2A). We have analyzed the somatic mutations in 89 cutaneous neurofibromas derived from three unrelated NF1 patients with high tumor burden, by loss of heterozygosity (LOH) analysis of the NF1, TP53, RB1, and CDKN2A genes, by assessing microsatellite instability (MSI), by direct sequencing of the NF1, TP53, and several mismatch repair (MMR) genes and by multiplex ligation-dependent probe amplification of the NF1 and TP53 genes. The aim was both to assess the possible clonality of these tumors and also to assess the involvement of other potential genetic loci in the development of these neurofibromas. Somatic NF1 mutations were identified in 57 (64%) of neurofibroma samples. Each mutation was distinct demonstrating the independent origin of each tumor. While somatic LOH of the TP53 gene was identified in four tumors, no specific deletions or sequence variations were identified. LOH of markers flanking the RB1 gene was also found in one tumor but no CDKN2A mutations were detected. Although evidence of MSI was seen in 21 tumors, no MMR gene alterations were identified. The identification of LOH involving TP53 and RB1 loci is a novel finding in benign cutaneous neurofibromas possibly demonstrating an alternative underlying molecular mechanism associated with the development of these benign tumors from this cohort of patients.
Collapse
|
42
|
Ameur A, Wetterbom A, Feuk L, Gyllensten U. Global and unbiased detection of splice junctions from RNA-seq data. Genome Biol 2010; 11:R34. [PMID: 20236510 PMCID: PMC2864574 DOI: 10.1186/gb-2010-11-3-r34] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2009] [Revised: 03/08/2010] [Accepted: 03/17/2010] [Indexed: 11/10/2022] Open
Abstract
We have developed a new strategy for de novo prediction of splice junctions in short-read RNA-seq data, suitable for detection of novel splicing events and chimeric transcripts. When tested on mouse RNA-seq data, >31,000 splice events were predicted, of which 88% bridged between two regions separated by <or=100 kb, and 74% connected two exons of the same RefSeq gene. Our method also reports genomic rearrangements such as insertions and deletions.
Collapse
Affiliation(s)
- Adam Ameur
- Department of Genetics and Pathology, Rudbeck laboratory, Uppsala University, Uppsala, Sweden.
| | | | | | | |
Collapse
|
43
|
Kepp K, Org E, Sõber S, Kelgo P, Viigimaa M, Veldre G, Tõnisson N, Juhanson P, Putku M, Kindmark A, Kozich V, Laan M. Hypervariable intronic region in NCX1 is enriched in short insertion-deletion polymorphisms and showed association with cardiovascular traits. BMC MEDICAL GENETICS 2010; 11:15. [PMID: 20109173 PMCID: PMC2832636 DOI: 10.1186/1471-2350-11-15] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Accepted: 01/28/2010] [Indexed: 01/07/2023]
Abstract
Background Conserved non-coding regions (CNR) have been shown to harbor gene expression regulatory elements. Genetic variations in these regions may potentially contribute to complex disease susceptibility. Methods We targeted CNRs of cardiovascular disease (CVD) candidate gene, Na(+)-Ca(2+) exchanger (NCX1) with polymorphism screening among CVD patients (n = 46) using DHPLC technology. The flanking region (348 bp) of the 14 bp indel in intron 2 was further genotyped by DGGE assay in two Eastern-European CVD samples: essential hypertension (HYPEST; 470 cases, 652 controls) and coronary artery disease, CAD (CADCZ; 257 cases, controls 413). Genotype-phenotype associations were tested by regression analysis implemented in PLINK. Alignments of primate sequences were performed by ClustalW2. Results Nine of the identified NCX1 variants were either singletons or targeted by commercial platforms. The 14 bp intronic indel (rs11274804) was represented with substantial frequency in HYPEST (6.82%) and CADCZ (14.58%). Genotyping in Eastern-Europeans (n = 1792) revealed hypervariable nature of this locus, represented by seven alternative alleles. The alignments of human-chimpanzee-macaque sequences showed that the major human variant (allele frequency 90.45%) was actually a human-specific deletion compared to other primates. In humans, this deletion was surrounded by other short (5-43 bp) deletion variants and a duplication (40 bp) polymorphism possessing overlapping breakpoints. This indicates a potential indel hotspot, triggered by the initial deletion in human lineage. An association was detected between the carrier status of 14 bp indel ancestral allele and CAD (P = 0.0016, OR = 2.02; Bonferroni significance level alpha = 0.0045), but not with hypertension. The risk for the CAD development was even higher among the patients additionally diagnosed with metabolic syndrome (P = 0.0014, OR = 2.34). Consistent with the effect on metabolic processes, suggestive evidence for the association with heart rate, serum triglyceride and LDL levels was detected (P = 0.04). Conclusions Compared to SNPs targeted by large number of locus-specific and genome-wide assays, considerably less attention has been paid to short indel variants in the human genome. The data of genome dynamics, mutation rate and population genetics of short indels, as well as their impact on gene expressional profile and human disease susceptibility is limited. The characterization of NCX1 intronic hypervariable non-coding region enriched in human-specific indel variants contributes to this gap of knowledge.
Collapse
Affiliation(s)
- Katrin Kepp
- Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Zhang J, Xiao L, Yin Y, Sirois P, Gao H, Li K. A law of mutation: power decay of small insertions and small deletions associated with human diseases. Appl Biochem Biotechnol 2009; 162:321-8. [PMID: 19816659 DOI: 10.1007/s12010-009-8793-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2009] [Accepted: 09/24/2009] [Indexed: 11/28/2022]
Abstract
Indels in evolutionary studies are rapidly decayed obeying a power law. The present study analyzed the length distribution of small insertions and deletions associated with human diseases and confirmed that the decay pattern of these small mutations is similar to that of indels when the mutation datasets are large enough. The describable decay pattern of somatic mutations may have application in the evaluation of varied penetrance of different mutations and in association study of gene mutation with carcinogenesis.
Collapse
Affiliation(s)
- Jia Zhang
- Clinical Molecular Diagnostic Center, the Second Affiliated Hospital of Soochow University, Suzhou 215004, China
| | | | | | | | | | | |
Collapse
|
45
|
Stoltzfus A, Yampolsky LY. Climbing mount probable: mutation as a cause of nonrandomness in evolution. J Hered 2009; 100:637-47. [PMID: 19625453 DOI: 10.1093/jhered/esp048] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The classic view of evolution as "shifting gene frequencies" in the Modern Synthesis literally means that evolution is the modulation of existing variation ("standing variation"), as opposed to a "new mutations" view of evolution as a 2-step process of mutational origin followed by acceptance-or-rejection (via selection and drift). The latter view has received renewed attention, yet its implications for evolutionary causation still are not widely understood. We review theoretical results showing that this conception of evolution allows for a role of mutation as a cause of nonrandomness, a role that could be important but has been misconceived and associated misleadingly with neutral evolution. Specifically, biases in the introduction of variation, including mutational biases, may impose predictable biases on evolution, with no necessary dependence on neutrality. As an example of how important such effects may be, we present a new analysis partitioning the variance in mean rates of amino acid replacement during human-chimpanzee divergence to components of codon mutation and amino acid exchangeability. The results indicate that mutational effects are not merely important but account for most of the variance explained. The challenge that such results pose for comparative genomics is to address mutational effects as a necessary part of any analysis of causal factors. To meet this challenge requires developing knowledge of mutation as a biological process, understanding how mutation imposes propensities on evolution, and applying methods of analysis that incorporate mutational effects.
Collapse
Affiliation(s)
- Arlin Stoltzfus
- Center for Advanced Research in Biotechnology, Rockville, MD 20850, USA.
| | | |
Collapse
|
46
|
Atchaneeyasakul LO, Trinavarat A, Dulayajinda D, Kumpornsin K, Thongnoppakhun W, Yenchitsomanus PT, Limwongse C. Novel and De-novo TruncatingPAX6Mutations and Ocular Phenotypes in Thai Aniridia Patients. Ophthalmic Genet 2009; 27:21-7. [PMID: 16543198 DOI: 10.1080/13816810500481667] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
PURPOSE To describe the ophthalmic findings and mutation analyses of the PAX6 gene in Thai aniridia patients. METHODS Ten patients from six unrelated families underwent a comprehensive ophthalmic examination. Mutations in the PAX6 gene were screened by single-strand conformational polymorphism (SSCP) and direct DNA sequencing of the SSCP variants. RESULTS Seven patients developed cataracts and six developed glaucoma. Mutation analysis demonstrated four different truncating mutations, two of which were de novo. These included one novel insertion/deletion mutation (c.474del12insGA in exon 5) and three nonsense mutations. R203X and R240X are common recurrent mutations, while Q277X in exon 10 is novel. All mutations resulted in loss of function of the PAX6 protein. CONCLUSION Our data confirm inter- and intrafamilial variable phenotypic manifestations of which the underlying mechanisms may be haploinsufficiency or dominant-negative mutation.
Collapse
Affiliation(s)
- La-Ongsri Atchaneeyasakul
- Department of Ophthalmology, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Prannok Road, Bangkok-noi, Bangkok 10700, Thailand.
| | | | | | | | | | | | | |
Collapse
|
47
|
Kvikstad EM, Chiaromonte F, Makova KD. Ride the wavelet: A multiscale analysis of genomic contexts flanking small insertions and deletions. Genome Res 2009; 19:1153-64. [PMID: 19502380 DOI: 10.1101/gr.088922.108] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Recent studies have revealed that insertions and deletions (indels) are more different in their formation than previously assumed. What remains enigmatic is how the local DNA sequence context contributes to these differences. To investigate the relative impact of various molecular mechanisms to indel formation, we analyzed sequence contexts of indels in the non protein- or RNA-coding, nonrepetitive (NCNR) portion of the human genome. We considered small (<or=30-bp) indels occurring in the human lineage since its divergence from chimpanzee and used wavelet techniques to study, simultaneously for multiple scales, the spatial patterns of short sequence motifs associated with indel mutagenesis. In particular, we focused on motifs associated with DNA polymerase activity, topoisomerase cleavage, double-strand breaks (DSBs), and their repair. We came to the following conclusions. First, many motifs are characterized by unique enrichment profiles in the vicinity of indels vs. indel-free portions of the genome, verifying the importance of sequence context in indel mutagenesis. Second, only limited similarity in motif frequency profiles is evident flanking insertions vs. deletions, confirming differences in their mutagenesis. Third, substantial similarity in frequency profiles exists between pairs of individual motifs flanking insertions (and separately deletions), suggesting "cooperation" among motifs, and thus molecular mechanisms, during indel formation. Fourth, the wavelet analyses demonstrate that all these patterns are highly dependent on scale (the size of an interval considered). Finally, our results depict a model of indel mutagenesis comprising both replication and recombination (via repair of paused replication forks and site-specific recombination).
Collapse
Affiliation(s)
- Erika M Kvikstad
- Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania 16802, USA
| | | | | |
Collapse
|
48
|
Upadhyaya M, Spurlock G, Kluwe L, Chuzhanova N, Bennett E, Thomas N, Guha A, Mautner V. The spectrum of somatic and germline NF1 mutations in NF1 patients with spinal neurofibromas. Neurogenetics 2009; 10:251-63. [DOI: 10.1007/s10048-009-0178-0] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2008] [Accepted: 01/22/2009] [Indexed: 01/17/2023]
|
49
|
Fernández-Burriel M, Martínez-Rubio D, Lupo V, Pérez-Colosía V, Piñán-López E, Palau F, Espinós C. A novel delins mutation in the alpha-TTP gene in a family segregating ataxia with isolated vitamin E deficiency. Pediatr Res 2008; 64:262-4. [PMID: 18458655 DOI: 10.1203/pdr.0b013e31817d9bf7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Ataxia with isolated vitamin E deficiency is a rare autosomal recessive neurodegenerative disease due to mutations in the alpha-tocopherol transfer protein gene. In ataxia with isolated vitamin E deficiency, the biochemical hallmark is the low plasmatic levels of vitamin E and, in most of the patients, vitamin E supplementation allows a stabilization of the neurologic conditions. We have investigated the genetic cause of ataxia and reduced levels of vitamin E, and apolipoproteins A1 and B in a 16-y-old patient. Results revealed that our propositus is a compound heterozygote for the c.227_229delinsATT/c.744delA mutations in the alpha-tocopherol transfer protein gene, each inherited from one of the two parents. His sister is also a compound heterozygote for both mutations, and she presents a biochemical pattern similar to that of his brother. After receiving the vitamin E supplementation, plasmatic levels of vitamin E and apolipoprotein A1 have been normalized in the propositus. The detected mutations would justify the undetectable levels of vitamin E, but would not explain the also decreased levels of the apolipoproteins, as neither that after treatment with vitamin E, the levels of apolipoprotein B do not become normal. These findings suggest that other genes may play a role in producing this atypical biochemical profile.
Collapse
|
50
|
Scaringe WA, Li K, Gu D, Gonzalez KD, Chen Z, Hill KA, Sommer SS. Somatic microindels in human cancer: the insertions are highly error-prone and derive from nearby but not adjacent sense and antisense templates. Hum Mol Genet 2008; 17:2910-8. [PMID: 18632684 DOI: 10.1093/hmg/ddn190] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Somatic microindels (microdeletions with microinsertions) have been studied in normal mouse tissues using the Big Blue lacI transgenic mutation detection system. Here we analyze microindels in human cancers using an endogenous and transcribed gene, the TP53 gene. Microindel frequency, the enhancement of 1-2 microindels and other features are generally similar to that observed in the non-transcribed lacI gene in normal mouse tissues. The current larger sample of somatic microindels reveals recurroids: mutations in which deletions are identical and the co-localized insertion is similar. The data reveal that the inserted sequences derive from nearby but not adjacent sequences in contrast to the slippage that characterizes the great majority of pure microinsertions. The microindel inserted sequences derive from a template on the sense or antisense strand with similar frequency. The estimated error rate of the insertion process of 13% per bp is by far the largest reported in vivo, with the possible exception of somatic hypermutation in the immunoglobulin gene. The data constrain possible mechanisms of microindels and raise the question of whether microindels are 'scars' from the bypass of large DNA adducts by a translesional polymerase, e.g. the 'Tarzan model' presented herein.
Collapse
Affiliation(s)
- William A Scaringe
- Department of Molecular Genetics, City of Hope National Medical Center, 1500 E. Duarte Rd, Duarte, CA 91010, USA
| | | | | | | | | | | | | |
Collapse
|