Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bhangale TR, Rieder MJ, Livingston RJ, Nickerson DA. Comprehensive identification and characterization of diallelic insertion-deletion polymorphisms in 330 human candidate genes. Hum Mol Genet 2004;14:59-69. [PMID: 15525656 DOI: 10.1093/hmg/ddi006] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

For:	Bhangale TR, Rieder MJ, Livingston RJ, Nickerson DA. Comprehensive identification and characterization of diallelic insertion-deletion polymorphisms in 330 human candidate genes. Hum Mol Genet 2004;14:59-69. [PMID: 15525656 DOI: 10.1093/hmg/ddi006] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Number

Cited by Other Article(s)

Spisak S, Tisza V, Nuzzo PV, Seo JH, Pataki B, Ribli D, Sztupinszki Z, Bell C, Rohanizadegan M, Stillman DR, Alaiwi SA, Bartels AH, Papp M, Shetty A, Abbasi F, Lin X, Lawrenson K, Gayther SA, Pomerantz M, Baca S, Solymosi N, Csabai I, Szallasi Z, Gusev A, Freedman ML. A biallelic multiple nucleotide length polymorphism explains functional causality at 5p15.33 prostate cancer risk locus. Nat Commun 2023;14:5118. [PMID: 37612286 PMCID: PMC10447552 DOI: 10.1038/s41467-023-40616-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 08/03/2023] [Indexed: 08/25/2023] Open

Affiliation(s)

Sandor Spisak Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Viktoria Tisza Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Computational Health Informatics Program (CHIP) Boston Children's Hospital Harvard Medical School, Boston, MA, 02215, USA Institute of Enzymology, Research Centre for Natural Sciences, Budapest, 1117, Hungary
Pier Vitale Nuzzo Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Department of Internal Medicine, School of Medicine, University of Genoa, Genoa, Lgo R. Benzi 10, 16132, Italy
Ji-Heui Seo Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Balint Pataki Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Pázmány P. s. 1A, Budapest, 1117, Hungary
Dezso Ribli Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Pázmány P. s. 1A, Budapest, 1117, Hungary
Zsofia Sztupinszki Computational Health Informatics Program (CHIP) Boston Children's Hospital Harvard Medical School, Boston, MA, 02215, USA
Connor Bell Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Mersedeh Rohanizadegan Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
David R Stillman Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Sarah Abou Alaiwi Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Alan H Bartels Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Marton Papp Institute of Enzymology, Research Centre for Natural Sciences, Budapest, 1117, Hungary Centre for Bioinformatics, University of Veterinary Medicine, Istvan str. 2, Budapest, 1078, Hungary
Anamay Shetty Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Division of Genetics, Brigham & Women's Hospital, Boston, MA, USA
Forough Abbasi Women's Cancer Program, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA
Xianzhi Lin Women's Cancer Program, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA
Kate Lawrenson Women's Cancer Program, Samuel Oschin Comprehensive Cancer Institute, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA Center for Bioinformatics and Functional Genomics, Department of Biomedical Science, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA
Simon A Gayther Division of Gynecologic Oncology, Department of Obstetrics and Gynecology, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA Center for Bioinformatics and Functional Genomics, Department of Biomedical Science, Cedars-Sinai Medical Center, Los Angeles, CA, 90048, USA
Mark Pomerantz Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
Sylvan Baca Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA The Eli and Edythe L. Broad Institute, Cambridge, MA, 02142, USA
Norbert Solymosi Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Pázmány P. s. 1A, Budapest, 1117, Hungary
Istvan Csabai Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Pázmány P. s. 1A, Budapest, 1117, Hungary
Zoltan Szallasi Computational Health Informatics Program (CHIP) Boston Children's Hospital Harvard Medical School, Boston, MA, 02215, USA Department of Bioinformatics, Forensic and Insurance Medicine Semmelweis University, Budapest, Hungary Danish Cancer Society Research Center, Strandboulevarden 49, 2100, Copenhagen, Denmark National Korányi Institute of Pulmonology, Budapest, 1112, Hungary
Alexander Gusev Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA Division of Genetics, Brigham & Women's Hospital, Boston, MA, USA The Eli and Edythe L. Broad Institute, Cambridge, MA, 02142, USA
Matthew L Freedman Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA. Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, 02215, USA. The Eli and Edythe L. Broad Institute, Cambridge, MA, 02142, USA.

Collapse

Yao Y, Sun K, Yang Q, Zhou Z, Shao C, Qian X, Tang Q, Xie J. Assessing Autosomal InDel Loci With Multiple Insertions or Deletions of Random DNA Sequences in Human Genome. Front Genet 2022;12:809815. [PMID: 35178073 PMCID: PMC8844376 DOI: 10.3389/fgene.2021.809815] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 12/27/2021] [Indexed: 11/13/2022] Open

Fan H, He Y, Li S, Xie Q, Wang F, Du Z, Fang Y, Qiu P, Zhu B. Systematic Evaluation of a Novel 6-dye Direct and Multiplex PCR-CE-Based InDel Typing System for Forensic Purposes. Front Genet 2022;12:744645. [PMID: 35082827 PMCID: PMC8784372 DOI: 10.3389/fgene.2021.744645] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 10/29/2021] [Indexed: 12/16/2022] Open

Abstract

Insertion/deletion (InDel) polymorphisms, combined desirable characteristics of both short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs), are considerable potential in the fields of forensic practices and population genetics. However, most commercial InDel kits designed based on non-Asians limited extensive forensic applications in East Asian (EAS) populations. Recently, a novel 6-dye direct and multiplex PCR-CE-based typing system was designed on the basis of genome-wide EAS population data, which could amplify 60 molecular genetic markers, consisting of 57 autosomal InDels (A-InDels), 2 Y-chromosomal InDels (Y-InDels), and Amelogenin in a single PCR reaction and detect by capillary electrophoresis, simultaneously. In the present study, the DNA profiles of 279 unrelated individuals from the Hainan Li group were generated by the novel typing system. In addition, we collected two A-InDel sets to evaluate the forensic performances of the novel system in the 1,000 Genomes Project (1KG) populations and Hainan Li group. For the Universal A-InDel set (UAIS, containing 44 A-InDels) the cumulative power of discrimination (CPD) ranged from 1-1.03 × 10-14 to 1-1.27 × 10-18, and the cumulative power of exclusion (CPE) varied from 0.993634 to 0.999908 in the 1KG populations. For the East Asia-based A-InDel set (EAIS, containing 57 A-InDels) the CPD spanned from 1-1.32 × 10-23 to 1-9.42 × 10-24, and the CPE ranged from 0.999965 to 0.999997. In the Hainan Li group, the average heterozygote (He) was 0.4666 (0.2366-0.5448), and the polymorphism information content (PIC) spanned from 0.2116 to 0.3750 (mean PIC: 0.3563 ± 0.0291). In total, the CPD and CPE of 57 A-InDels were 1-1.32 × 10-23 and 0.999965, respectively. Consequently, the novel 6-dye direct and multiplex PCR-CE-based typing system could be considered as the reliable and robust tool for human identification and intercontinental population differentiation, and supplied additional information for kinship analysis in the 1KG populations and Hainan Li group.

Collapse

Chen J, Guo JT. Structural and functional analysis of somatic coding and UTR indels in breast and lung cancer genomes. Sci Rep 2021;11:21178. [PMID: 34707120 PMCID: PMC8551294 DOI: 10.1038/s41598-021-00583-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 10/14/2021] [Indexed: 11/24/2022] Open

Roberts R, Fair J. Genetics, its role in preventing the pandemic of coronary artery disease. Clin Cardiol 2021;44:771-779. [PMID: 34080689 PMCID: PMC8207986 DOI: 10.1002/clc.23627] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 04/23/2021] [Accepted: 04/30/2021] [Indexed: 01/14/2023] Open

Genetic variation in the Mauritian cynomolgus macaque population reflects variation in the human population. Gene 2021;787:145648. [PMID: 33848572 DOI: 10.1016/j.gene.2021.145648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Revised: 03/23/2021] [Accepted: 04/07/2021] [Indexed: 11/21/2022]

Roberts R, Chang CC. A Journey through Genetic Architecture and Predisposition of Coronary Artery Disease. Curr Genomics 2020;21:382-398. [PMID: 33093801 PMCID: PMC7536803 DOI: 10.2174/1389202921999200630145241] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 05/18/2020] [Accepted: 05/26/2020] [Indexed: 01/14/2023] Open

Liu Y, Jin X, Lan Q, Zhao C, Xu H, Xie T, Lan J, Tai Y, Zhu B. Forensic characteristic and population structure dissection of Shaanxi Han population in the light of diallelic deletion/insertion polymorphism data. Genomics 2020;112:3837-3845. [PMID: 32574833 DOI: 10.1016/j.ygeno.2020.06.028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2020] [Revised: 06/15/2020] [Accepted: 06/17/2020] [Indexed: 12/08/2022]

Affiliation(s)

Yanfang Liu Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Xiaoye Jin Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, 710004 Xi'an, China; Clinical Research Center of Shaanxi Province for Dental and Maxillofacial Diseases, College of Stomatology, Xi'an Jiaotong University, 710004, Xi'an, China; College of Forensic Medicine, Xi'an Jiaotong University Health Science Center, Xi'an, 710061, China
Qiong Lan Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Congying Zhao Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Hui Xu Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Tong Xie Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Jiangwei Lan Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Yunchun Tai Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Bofeng Zhu Multi-Omics Innovative Research Center of Forensic Identification; Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China; Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, 710004 Xi'an, China; Clinical Research Center of Shaanxi Province for Dental and Maxillofacial Diseases, College of Stomatology, Xi'an Jiaotong University, 710004, Xi'an, China.

Collapse

Whole genome detection of sequence and structural polymorphism in six diverse horses. PLoS One 2020;15:e0230899. [PMID: 32271776 PMCID: PMC7144971 DOI: 10.1371/journal.pone.0230899] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Accepted: 03/12/2020] [Indexed: 12/30/2022] Open

Abstract

The domesticated horse has played a unique role in human history, serving not just as a source of animal protein, but also as a catalyst for long-distance migration and military conquest. As a result, the horse developed unique physiological adaptations to meet the demands of both their climatic environment and their relationship with man. Completed in 2009, the first domesticated horse reference genome assembly (EquCab 2.0) produced most of the publicly available genetic variations annotations in this species. Yet, there are around 400 geographically and physiologically diverse breeds of horse. To enrich the current collection of genetic variants in the horse, we sequenced whole genomes from six horses of six different breeds: an American Miniature, a Percheron, an Arabian, a Mangalarga Marchador, a Native Mongolian Chakouyi, and a Tennessee Walking Horse, and mapped them to EquCab3.0 genome. Aside from extreme contrasts in body size, these breeds originate from diverse global locations and each possess unique adaptive physiology. A total of 1.3 billion reads were generated for the six horses with coverage between 15x to 24x per horse. After applying rigorous filtration, we identified and functionally annotated 17,514,723 Single Nucleotide Polymorphisms (SNPs), and 1,923,693 Insertions/Deletions (INDELs), as well as an average of 1,540 Copy Number Variations (CNVs) and 3,321 Structural Variations (SVs) per horse. Our results revealed putative functional variants including genes associated with size variation like LCORL gene (found in all horses), ZFAT in the Arabian, American Miniature and Percheron horses and ANKRD1 in the Native Mongolian Chakouyi horse. We detected a copy number variation in the Latherin gene that may be the result of evolutionary selection impacting thermoregulation by sweating, an important component of athleticism and heat tolerance. The newly discovered variants were formatted into user-friendly browser tracks and will provide a foundational database for future studies of the genetic underpinnings of diverse phenotypes within the horse.

The domesticated horse played a unique role in human history, serving not just as a source of dietary animal protein, but also as a catalyst for long-distance migration and military conquest. As a result, the horse developed unique physiological adaptations to meet the demands of both their climatic environment and their relationship with man. Although the completion of the horse reference genome allowed for the discovery of many genetic variants, the remarkable diversity across breeds of horse calls for additional effort to quantify the complete span of genetic polymorphism within this unique species. In this work, we present genome re-sequencing and variant detection analysis for six horses belonging to six different breeds representing different morphology, origins and vary in their physiological demands and response. We identified and annotated not just single nucleotide polymorphisms (SNPs), but also insertions and deletions (INDELs), copy number variations (CNVs) and structural variations (SVs). Our results illustrate novel sources of polymorphism and highlight potentially impactful variations for phenotypes of body size and conformation. We also detected a copy number loss in the Latherin gene that could be the result of an evolutionary selection affecting thermoregulation through sweating. Our newly discovered variants were formatted into easy-to-use tracks that can be easily accessed by researchers around the globe.

Collapse

Wang S, Yi X, Wu M, Zhao H, Liu S, Pan Y, Li Q, Tang X, Zhu Y, Sun X. Detection of key gene InDels in TGF-β pathway and its relationship with growth traits in four sheep breeds. Anim Biotechnol 2019;32:194-204. [PMID: 31625451 DOI: 10.1080/10495398.2019.1675682] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Prediction and management of CAD risk based on genetic stratification. Trends Cardiovasc Med 2019;30:328-334. [PMID: 31543237 DOI: 10.1016/j.tcm.2019.08.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 08/01/2019] [Accepted: 08/20/2019] [Indexed: 12/24/2022]

Hasan MS, Wu X, Zhang L. Uncovering missed indels by leveraging unmapped reads. Sci Rep 2019;9:11093. [PMID: 31366961 PMCID: PMC6668410 DOI: 10.1038/s41598-019-47405-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Accepted: 07/12/2019] [Indexed: 02/08/2023] Open

Fuertes MA, Rodrigo JR, Alonso C. Conserved Critical Evolutionary Gene Structures in Orthologs. J Mol Evol 2019;87:93-105. [PMID: 30815710 DOI: 10.1007/s00239-019-09889-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Accepted: 02/13/2019] [Indexed: 12/18/2022]

Lin M, Whitmire S, Chen J, Farrel A, Shi X, Guo JT. Effects of short indels on protein structure and function in human genomes. Sci Rep 2017;7:9313. [PMID: 28839204 PMCID: PMC5570956 DOI: 10.1038/s41598-017-09287-x] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 07/24/2017] [Indexed: 01/20/2023] Open

Genetics: Implications for Prevention and Management of Coronary Artery Disease. J Am Coll Cardiol 2017;68:2797-2818. [PMID: 28007143 DOI: 10.1016/j.jacc.2016.10.039] [Citation(s) in RCA: 78] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Revised: 10/12/2016] [Accepted: 10/24/2016] [Indexed: 12/21/2022]

An Incomplete Understanding of Human Genetic Variation. Genetics 2017;202:1251-4. [PMID: 27053122 DOI: 10.1534/genetics.115.180539] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Huddleston J, Chaisson MJP, Steinberg KM, Warren W, Hoekzema K, Gordon D, Graves-Lindsay TA, Munson KM, Kronenberg ZN, Vives L, Peluso P, Boitano M, Chin CS, Korlach J, Wilson RK, Eichler EE. Discovery and genotyping of structural variation from long-read haploid genome sequence data. Genome Res 2016;27:677-685. [PMID: 27895111 PMCID: PMC5411763 DOI: 10.1101/gr.214007.116] [Citation(s) in RCA: 227] [Impact Index Per Article: 28.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Accepted: 11/15/2016] [Indexed: 01/07/2023]

Affiliation(s)

John Huddleston Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
Mark J P Chaisson Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
Karyn Meltz Steinberg McDonnell Genome Institute, Department of Medicine, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63108, USA
Wes Warren McDonnell Genome Institute, Department of Medicine, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63108, USA
Kendra Hoekzema Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
David Gordon Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA
Tina A Graves-Lindsay McDonnell Genome Institute, Department of Medicine, Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63108, USA
Katherine M Munson Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
Zev N Kronenberg Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
Laura Vives Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA
Paul Peluso Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA
Matthew Boitano Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA
Chen-Shin Chin Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA
Jonas Korlach Pacific Biosciences of California, Incorporated, Menlo Park, California 94025, USA
Richard K Wilson Department of Pathology, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, USA
Evan E Eichler Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA

Collapse

Spinks PQ, Thomson RC, McCartney-Melstad E, Shaffer HB. Phylogeny and temporal diversification of the New World pond turtles (Emydidae). Mol Phylogenet Evol 2016;103:85-97. [DOI: 10.1016/j.ympev.2016.07.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 06/03/2016] [Accepted: 07/07/2016] [Indexed: 11/16/2022]

Wajnberg G, Passetti F. Using high-throughput sequencing transcriptome data for INDEL detection: challenges for cancer drug discovery. Expert Opin Drug Discov 2016;11:257-68. [PMID: 26787005 DOI: 10.1517/17460441.2016.1143813] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

Bobilev AM, McDougal ME, Taylor WL, Geisert EE, Netland PA, Lauderdale JD. Assessment of PAX6 alleles in 66 families with aniridia. Clin Genet 2016;89:669-77. [PMID: 26661695 DOI: 10.1111/cge.12708] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Revised: 12/03/2015] [Accepted: 12/04/2015] [Indexed: 12/18/2022]

PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs. Sci Rep 2015;5:17302. [PMID: 26612672 PMCID: PMC4661514 DOI: 10.1038/srep17302] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Accepted: 10/28/2015] [Indexed: 12/22/2022] Open

Hasan MS, Wu X, Zhang L. Performance evaluation of indel calling tools using real short-read data. Hum Genomics 2015;9:20. [PMID: 26286629 PMCID: PMC4545535 DOI: 10.1186/s40246-015-0042-2] [Citation(s) in RCA: 68] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2015] [Accepted: 07/20/2015] [Indexed: 12/16/2022] Open

Zhang G, Wang J, Yang J, Li W, Deng Y, Li J, Huang J, Hu S, Zhang B. Comparison and evaluation of two exome capture kits and sequencing platforms for variant calling. BMC Genomics 2015;16:581. [PMID: 26242175 PMCID: PMC4524363 DOI: 10.1186/s12864-015-1796-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2014] [Accepted: 07/23/2015] [Indexed: 12/30/2022] Open

Abstract

Background

To promote the clinical application of next-generation sequencing, it is important to obtain accurate and consistent variants of target genomic regions at low cost. Ion Proton, the latest updated semiconductor-based sequencing instrument from Life Technologies, is designed to provide investigators with an inexpensive platform for human whole exome sequencing that achieves a rapid turnaround time. However, few studies have comprehensively compared and evaluated the accuracy of variant calling between Ion Proton and Illumina sequencing platforms such as HiSeq 2000, which is the most popular sequencing platform for the human genome. The Ion Proton sequencer combined with the Ion TargetSeq™ Exome Enrichment Kit together make up TargetSeq-Proton, whereas SureSelect-Hiseq is based on the Agilent SureSelect Human All Exon v4 Kit and the HiSeq 2000 sequencer.

Results

Here, we sequenced exonic DNA from four human blood samples using both TargetSeq-Proton and SureSelect-HiSeq. We then called variants in the exonic regions that overlapped between the two exome capture kits (33.6 Mb). The rates of shared variant loci called by two sequencing platforms were from 68.0 to 75.3 % in four samples, whereas the concordance of co-detected variant loci reached 99 %. Sanger sequencing validation revealed that the validated rate of concordant single nucleotide polymorphisms (SNPs) (91.5 %) was higher than the SNPs specific to TargetSeq-Proton (60.0 %) or specific to SureSelect-HiSeq (88.3 %). With regard to 1-bp small insertions and deletions (InDels), the Sanger sequencing validated rates of concordant variants (100.0 %) and SureSelect-HiSeq-specific (89.6 %) were higher than those of TargetSeq-Proton-specific (15.8 %).

Conclusions

In the sequencing of exonic regions, a combination of using of two sequencing strategies (SureSelect-HiSeq and TargetSeq-Proton) increased the variant calling specificity for concordant variant loci and the sensitivity for variant loci called by any one platform. However, for the sequencing of platform-specific variants, the accuracy of variant calling by HiSeq 2000 was higher than that of Ion Proton, specifically for the InDel detection. Moreover, the variant calling software also influences the detection of SNPs and, specifically, InDels in Ion Proton exome sequencing.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-015-1796-6) contains supplementary material, which is available to authorized users.

Collapse

Lim JQ, Tennakoon C, Guan P, Sung WK. BatAlign: an incremental method for accurate alignment of sequencing reads. Nucleic Acids Res 2015;43:e107. [PMID: 26170239 PMCID: PMC4652746 DOI: 10.1093/nar/gkv533] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Accepted: 05/09/2015] [Indexed: 11/12/2022] Open

Kloosterman WP, Francioli LC, Hormozdiari F, Marschall T, Hehir-Kwa JY, Abdellaoui A, Lameijer EW, Moed MH, Koval V, Renkens I, van Roosmalen MJ, Arp P, Karssen LC, Coe BP, Handsaker RE, Suchiman ED, Cuppen E, Thung DT, McVey M, Wendl MC, Uitterlinden A, van Duijn CM, Swertz MA, Wijmenga C, van Ommen GB, Slagboom PE, Boomsma DI, Schönhuth A, Eichler EE, de Bakker PIW, Ye K, Guryev V. Characteristics of de novo structural changes in the human genome. Genome Res 2015;25:792-801. [PMID: 25883321 PMCID: PMC4448676 DOI: 10.1101/gr.185041.114] [Citation(s) in RCA: 94] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2014] [Accepted: 04/01/2015] [Indexed: 11/29/2022]

Affiliation(s)

Wigard P Kloosterman Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584CG, The Netherlands
Laurent C Francioli Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584CG, The Netherlands
Fereydoun Hormozdiari Department of Genome Sciences, University of Washington, Seattle, Washington 98105, USA
Tobias Marschall Life Sciences Group, Centrum voor Wiskunde en Informatica, Amsterdam 1098XG, The Netherlands
Jayne Y Hehir-Kwa Department of Human Genetics, Radboud University Medical Center, Nijmegen 6525GA, The Netherlands
Abdel Abdellaoui Department of Biological Psychology, VU University Amsterdam, Amsterdam 1081BT, The Netherlands
Eric-Wubbo Lameijer Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden 2300RC, The Netherlands
Matthijs H Moed Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden 2300RC, The Netherlands
Vyacheslav Koval Department of Internal Medicine, Erasmus Medical Center, Rotterdam 3000CA, The Netherlands
Ivo Renkens Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584CG, The Netherlands
Markus J van Roosmalen Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584CG, The Netherlands
Pascal Arp Department of Internal Medicine, Erasmus Medical Center, Rotterdam 3000CA, The Netherlands
Lennart C Karssen Department of Epidemiology, Erasmus Medical Center, Rotterdam 3000CA, The Netherlands
Bradley P Coe Department of Genome Sciences, University of Washington, Seattle, Washington 98105, USA
Robert E Handsaker Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA
Eka D Suchiman Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden 2300RC, The Netherlands
Edwin Cuppen Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584CG, The Netherlands
Djie Tjwan Thung Department of Human Genetics, Radboud University Medical Center, Nijmegen 6525GA, The Netherlands
Mitch McVey Department of Biology, Tufts University, Medford, Massachusetts 02115, USA
Michael C Wendl The Genome Institute, Washington University, St. Louis, Missouri 63108, USA; Department of Mathematics, Washington University, St. Louis, Missouri 63108, USA
André Uitterlinden Department of Internal Medicine, Erasmus Medical Center, Rotterdam 3000CA, The Netherlands; Department of Epidemiology, Erasmus Medical Center, Rotterdam 3000CA, The Netherlands
Cornelia M van Duijn Department of Epidemiology, Erasmus Medical Center, Rotterdam 3000CA, The Netherlands
Morris A Swertz Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen 9700RB, The Netherlands; Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen 9700RB, The Netherlands
Cisca Wijmenga Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen 9700RB, The Netherlands; Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen 9700RB, The Netherlands
GertJan B van Ommen Department of Human Genetics, Leiden University Medical Center, Leiden 2300RC, The Netherlands
P Eline Slagboom Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden 2300RC, The Netherlands
Dorret I Boomsma Department of Biological Psychology, VU University Amsterdam, Amsterdam 1081BT, The Netherlands
Alexander Schönhuth Life Sciences Group, Centrum voor Wiskunde en Informatica, Amsterdam 1098XG, The Netherlands
Evan E Eichler Department of Genome Sciences, University of Washington, Seattle, Washington 98105, USA
Paul I W de Bakker Department of Medical Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584CG, The Netherlands; Department of Epidemiology, University Medical Center Utrecht, Utrecht 3584CG, The Netherlands
Kai Ye The Genome Institute, Washington University, St. Louis, Missouri 63108, USA
Victor Guryev European Research Institute for the Biology of Ageing, University of Groningen, University Medical Center Groningen, Groningen 9713AD, The Netherlands

Collapse

Roberts R. A genetic basis for coronary artery disease. Trends Cardiovasc Med 2014;25:171-8. [PMID: 25453988 DOI: 10.1016/j.tcm.2014.10.008] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Revised: 10/10/2014] [Accepted: 10/10/2014] [Indexed: 01/29/2023]

Xu H, Deng W, Huang F, Xiao S, Liu G, Liang H. Enhanced DNA toehold exchange reaction on a chip surface to discriminate single-base changes. Chem Commun (Camb) 2014;50:14171-4. [DOI: 10.1039/c4cc07272c] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

Booker CS, Grattan DR. Identification of a truncated splice variant of IL-18 receptor alpha in the human and rat, with evidence of wider evolutionary conservation. PeerJ 2014;2:e560. [PMID: 25250214 PMCID: PMC4168765 DOI: 10.7717/peerj.560] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2014] [Accepted: 08/15/2014] [Indexed: 01/14/2023] Open

Yan Y, Yi G, Sun C, Qu L, Yang N. Genome-wide characterization of insertion and deletion variation in chicken using next generation sequencing. PLoS One 2014;9:e104652. [PMID: 25133774 PMCID: PMC4136736 DOI: 10.1371/journal.pone.0104652] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2014] [Accepted: 07/10/2014] [Indexed: 12/30/2022] Open

Abstract

Insertion and deletion (INDEL) is one of the main events contributing to genetic and phenotypic diversity, which receives less attention than SNP and large structural variation. To gain a better knowledge of INDEL variation in chicken genome, we applied next generation sequencing on 12 diverse chicken breeds at an average effective depth of 8.6. Over 1.3 million non-redundant short INDELs (1-49 bp) were obtained, the vast majority (92.48%) of which were novel. Follow-up validation assays confirmed that most (88.00%) of the randomly selected INDELs represent true variations. The majority (95.76%) of INDELs were less than 10 bp. Both the detected number and affected bases were larger for deletions than insertions. In total, INDELs covered 3.8 Mbp, corresponding to 0.36% of the chicken genome. The average genomic INDEL density was estimated as 0.49 per kb. INDELs were ubiquitous and distributed in a non-uniform fashion across chromosomes, with lower INDEL density in micro-chromosomes than in others, and some functional regions like exons and UTRs were prone to less INDELs than introns and intergenic regions. Nearly 620,253 INDELs fell in genic regions, 1,765 (0.28%) of which located in exons, spanning 1,358 (7.56%) unique Ensembl genes. Many of them are associated with economically important traits and some are the homologues of human disease-related genes. We demonstrate that sequencing multiple individuals at a medium depth offers a promising way for reliable identification of INDELs. The coding INDELs are valuable candidates for further elucidation of the association between genotypes and phenotypes. The chicken INDELs revealed by our study can be useful for future studies, including development of INDEL markers, construction of high density linkage map, INDEL arrays design, and hopefully, molecular breeding programs in chicken.

Collapse

Roberts R. Genetics of Coronary Artery Disease. Circ Res 2014;114:1890-903. [DOI: 10.1161/circresaha.114.302692] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Haberstick BC, Smolen A, Stetler GL, Tabor JW, Roy T, Rick Casey H, Pardo A, Roy F, Ryals LA, Hewitt C, Whitsel EA, Halpern CT, Killeya-Jones LA, Lessem JM, Hewitt JK, Harris KM. Simple sequence repeats in the national longitudinal study of adolescent health: an ethnically diverse resource for genetic analysis of health and behavior. Behav Genet 2014;44:487-97. [PMID: 24890516 DOI: 10.1007/s10519-014-9662-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2013] [Accepted: 05/08/2014] [Indexed: 12/16/2022]

Zhang X, Lin H, Zhao H, Hao Y, Mort M, Cooper DN, Zhou Y, Liu Y. Impact of human pathogenic micro-insertions and micro-deletions on post-transcriptional regulation. Hum Mol Genet 2014;23:3024-34. [PMID: 24436305 DOI: 10.1093/hmg/ddu019] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open

Rockah-Shmuel L, Tóth-Petróczy Á, Sela A, Wurtzel O, Sorek R, Tawfik DS. Correlated occurrence and bypass of frame-shifting insertion-deletions (InDels) to give functional proteins. PLoS Genet 2013;9:e1003882. [PMID: 24204297 PMCID: PMC3812077 DOI: 10.1371/journal.pgen.1003882] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2013] [Accepted: 09/02/2013] [Indexed: 11/19/2022] Open

Abstract

Short insertions and deletions (InDels) comprise an important part of the natural mutational repertoire. InDels are, however, highly deleterious, primarily because two-thirds result in frame-shifts. Bypass through slippage over homonucleotide repeats by transcriptional and/or translational infidelity is known to occur sporadically. However, the overall frequency of bypass and its relation to sequence composition remain unclear. Intriguingly, the occurrence of InDels and the bypass of frame-shifts are mechanistically related - occurring through slippage over repeats by DNA or RNA polymerases, or by the ribosome, respectively. Here, we show that the frequency of frame-shifting InDels, and the frequency by which they are bypassed to give full-length, functional proteins, are indeed highly correlated. Using a laboratory genetic drift, we have exhaustively mapped all InDels that occurred within a single gene. We thus compared the naive InDel repertoire that results from DNA polymerase slippage to the frame-shifting InDels tolerated following selection to maintain protein function. We found that InDels repeatedly occurred, and were bypassed, within homonucleotide repeats of 3–8 bases. The longer the repeat, the higher was the frequency of InDels formation, and the more frequent was their bypass. Besides an expected 8A repeat, other types of repeats, including short ones, and G and C repeats, were bypassed. Although obtained in vitro, our results indicate a direct link between the genetic occurrence of InDels and their phenotypic rescue, thus suggesting a potential role for frame-shifting InDels as bridging evolutionary intermediates.

Homonucleotide repeats are exceptionally prone to insertions and/or deletions of bases (InDels). However, unless they occur in a multiplicity of 3 bases, InDels disrupt the reading frame and are thus expected to be purged from coding regions. Homonucleotide repeats, however, are also vulnerable to slippage by RNA polymerases and the ribosome. Using laboratory evolution techniques, we systematically mapped the occurrence of InDels within a given gene, before and after selection. Our data indicate that frame-shifting InDels were frequently bypassed to give functional proteins at surprisingly high frequencies. Further, we found a strict correlation between the repeat length, the frequency of occurrence of InDels at the DNA level, and the likelihood of bypass by transcriptional/translational slippage. Our results suggest that frame-shifting InDels might comprise functional evolutionary intermediates, and an effective mean of sequence divergence (e.g. when an adjacent InDel restores the frame, resulting in altered sequence and, potentially, in an altered protein structure).

Collapse

Kvikstad EM, Duret L. Strong heterogeneity in mutation rate causes misleading hallmarks of natural selection on indel mutations in the human genome. Mol Biol Evol 2013;31:23-36. [PMID: 24113537 PMCID: PMC3879449 DOI: 10.1093/molbev/mst185] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Zeng F, Jiang R, Chen T. PyroHMMvar: a sensitive and accurate method to call short indels and SNPs for Ion Torrent and 454 data. Bioinformatics 2013;29:2859-68. [PMID: 23995392 DOI: 10.1093/bioinformatics/btt512] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Spinks PQ, Thomson RC, Pauly GB, Newman CE, Mount G, Shaffer HB. Misleading phylogenetic inferences based on single-exemplar sampling in the turtle genus Pseudemys. Mol Phylogenet Evol 2013;68:269-81. [PMID: 23583419 DOI: 10.1016/j.ympev.2013.03.031] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2012] [Revised: 03/05/2013] [Accepted: 03/25/2013] [Indexed: 11/16/2022]

Abstract

Reconstructing species trees for clades containing weakly delimited or incorrectly identified taxa is one of the most serious challenges facing systematists because building phylogenetic trees is generally predicated on correctly identifying species membership for the terminals in an analysis. A common practice, particularly in large-scale phylogenetic analyses, is to use single-exemplar sampling under the implicit assumption that the resulting phylogenetic trees will be poorly supported if the sampled taxa are not good species. We examine this fundamental assumption in the North American turtle genus Pseudemys, a group of common, widely distributed freshwater turtles whose species boundaries and phylogenetic relationships have challenged systematists for over half a century. We sequenced 10 nuclear and three mitochondrial genes from the nine currently recognized species and subspecies of Pseudemys using geographically-widespread sampling of each taxon, and analyzed the resulting 86-individual data set using population-genetic and phylogenetic methods. We found little or no evidence supporting the division of Pseudemys into its currently recognized species/subspecies. Rather, our data strongly suggest that the group has been oversplit and contains fewer species than currently recognized. Even so, when we conducted 100 replicated, single-exemplar phylogenetic analyses of these same nine taxa, most Bayesian trees were well resolved, had high posterior probabilities, and yet returned completely conflicting topologies. These analyses suggest that phylogenetic analyses based on single-exemplar sampling may recover trees that depend on the individuals that are sampled, rather than the underlying species tree that systematists assume they are estimating. Our results clearly indicate that final resolution of Pseudemys will require an integrated analysis of morphology and historical biogeographic data coupled with extensive geographic sampling and large amounts of molecular data, and we do not recommend taxonomic changes based on our analyses. If our 100-tree resampling experiments generalize to other taxa, they suggest that single-exemplar phylogenies should be interpreted with caution, particularly for groups where species are shallowly diverged or inadequately delimited.

Collapse

Genomics in cardiovascular disease. J Am Coll Cardiol 2013;61:2029-37. [PMID: 23524054 DOI: 10.1016/j.jacc.2012.12.054] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/12/2012] [Revised: 01/29/2013] [Accepted: 02/19/2013] [Indexed: 01/29/2023]

Montgomery SB, Goode DL, Kvikstad E, Albers CA, Zhang ZD, Mu XJ, Ananda G, Howie B, Karczewski KJ, Smith KS, Anaya V, Richardson R, Davis J, MacArthur DG, Sidow A, Duret L, Gerstein M, Makova KD, Marchini J, McVean G, Lunter G. The origin, evolution, and functional impact of short insertion-deletion variants identified in 179 human genomes. Genome Res 2013;23:749-61. [PMID: 23478400 PMCID: PMC3638132 DOI: 10.1101/gr.148718.112] [Citation(s) in RCA: 163] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Abstract

Short insertions and deletions (indels) are the second most abundant form of human genetic variation, but our understanding of their origins and functional effects lags behind that of other types of variants. Using population-scale sequencing, we have identified a high-quality set of 1.6 million indels from 179 individuals representing three diverse human populations. We show that rates of indel mutagenesis are highly heterogeneous, with 43%–48% of indels occurring in 4.03% of the genome, whereas in the remaining 96% their prevalence is 16 times lower than SNPs. Polymerase slippage can explain upwards of three-fourths of all indels, with the remainder being mostly simple deletions in complex sequence. However, insertions do occur and are significantly associated with pseudo-palindromic sequence features compatible with the fork stalling and template switching (FoSTeS) mechanism more commonly associated with large structural variations. We introduce a quantitative model of polymerase slippage, which enables us to identify indel-hypermutagenic protein-coding genes, some of which are associated with recurrent mutations leading to disease. Accounting for mutational rate heterogeneity due to sequence context, we find that indels across functional sequence are generally subject to stronger purifying selection than SNPs. We find that indel length modulates selection strength, and that indels affecting multiple functionally constrained nucleotides undergo stronger purifying selection. We further find that indels are enriched in associations with gene expression and find evidence for a contribution of nonsense-mediated decay. Finally, we show that indels can be integrated in existing genome-wide association studies (GWAS); although we do not find direct evidence that potentially causal protein-coding indels are enriched with associations to known disease-associated SNPs, our findings suggest that the causal variant underlying some of these associations may be indels.

Collapse

Lettre G. The search for genetic modifiers of disease severity in the β-hemoglobinopathies. Cold Spring Harb Perspect Med 2012;2:2/10/a015032. [PMID: 23028136 DOI: 10.1101/cshperspect.a015032] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Spinks PQ, Thomson RC, Zhang Y, Che J, Wu Y, Shaffer HB. Species boundaries and phylogenetic relationships in the critically endangered Asian box turtle genus Cuora. Mol Phylogenet Evol 2012;63:656-67. [PMID: 22649793 DOI: 10.1016/j.ympev.2012.02.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Yuan Q, Zhou Z, Lindell SG, Higley JD, Ferguson B, Thompson RC, Lopez JF, Suomi SJ, Baghal B, Baker M, Mash DC, Barr CS, Goldman D. The rhesus macaque is three times as diverse but more closely equivalent in damaging coding variation as compared to the human. BMC Genet 2012;13:52. [PMID: 22747632 PMCID: PMC3426462 DOI: 10.1186/1471-2156-13-52] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2011] [Accepted: 05/18/2012] [Indexed: 11/23/2022] Open

Abstract

Background

As a model organism in biomedicine, the rhesus macaque (Macaca mulatta) is the most widely used nonhuman primate. Although a draft genome sequence was completed in 2007, there has been no systematic genome-wide comparison of genetic variation of this species to humans. Comparative analysis of functional and nonfunctional diversity in this highly abundant and adaptable non-human primate could inform its use as a model for human biology, and could reveal how variation in population history and size alters patterns and levels of sequence variation in primates.

Results

We sequenced the mRNA transcriptome and H3K4me3-marked DNA regions in hippocampus from 14 humans and 14 rhesus macaques. Using equivalent methodology and sampling spaces, we identified 462,802 macaque SNPs, most of which were novel and disproportionately located in the functionally important genomic regions we had targeted in the sequencing. At least one SNP was identified in each of 16,797 annotated macaque genes. Accuracy of macaque SNP identification was conservatively estimated to be >90%. Comparative analyses using SNPs equivalently identified in the two species revealed that rhesus macaque has approximately three times higher SNP density and average nucleotide diversity as compared to the human. Based on this level of diversity, the effective population size of the rhesus macaque is approximately 80,000 which contrasts with an effective population size of less than 10,000 for humans. Across five categories of genomic regions, intergenic regions had the highest SNP density and average nucleotide diversity and CDS (coding sequences) the lowest, in both humans and macaques. Although there are more coding SNPs (cSNPs) per individual in macaques than in humans, the ratio of d_N/d_S is significantly lower in the macaque. Furthermore, the number of damaging nonsynonymous cSNPs (have damaging effects on protein functions from PolyPhen-2 prediction) in the macaque is more closely equivalent to that of the human.

Conclusions

This large panel of newly identified macaque SNPs enriched for functionally significant regions considerably expands our knowledge of genetic variation in the rhesus macaque. Comparative analysis reveals that this widespread, highly adaptable species is approximately three times as diverse as the human but more closely equivalent in damaging variation.

Collapse

Huang S, Yu T, Chen Z, Yuan S, Chen S, Xu A. More single-nucleotide mutations surround small insertions than small deletions in primates. Hum Mutat 2012;33:1099-106. [PMID: 22461281 DOI: 10.1002/humu.22085] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2011] [Accepted: 03/06/2012] [Indexed: 01/26/2023]

SPINKS PHILLIPQ, THOMSON ROBERTC, HUGHES BILL, MOXLEY BRAD, BROWN RAFE, DIESMOS ARVIN, SHAFFER HBRADLEY. Cryptic variation and the tragedy of unrecognized taxa: the case of international trade in the spiny turtle Heosemys spinosa (Testudines: Geoemydidae). Zool J Linn Soc 2012. [DOI: 10.1111/j.1096-3642.2011.00788.x] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Lu JT, Wang Y, Gibbs RA, Yu F. Characterizing linkage disequilibrium and evaluating imputation power of human genomic insertion-deletion polymorphisms. Genome Biol 2012;13:R15. [PMID: 22377349 PMCID: PMC3334570 DOI: 10.1186/gb-2012-13-2-r15] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2011] [Revised: 02/14/2012] [Accepted: 02/29/2012] [Indexed: 02/07/2023] Open

Lemos RR, Souza MBR, Oliveira JRM. Exploring the implications of INDELs in neuropsychiatric genetics: challenges and perspectives. J Mol Neurosci 2012;47:419-24. [PMID: 22350990 DOI: 10.1007/s12031-012-9714-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2011] [Accepted: 01/24/2012] [Indexed: 02/04/2023]

Abstract

The decade passed after publishing the Human Genome first draft faced an enormous growth at the understanding of the genomic variation among different subjects, populations, and groups of patients. Single nucleotide polymorphisms (SNPs) and insertion or deletions (INDELs) have been increasingly recognized as a major type of genetic variations, with potential impact in protein activities and gene expression changes observed in complex genetic traits, like neuropsychiatric diseases. INDELs represent the second most common class of variations after SNPs, but there is still an important gap between the number of INDELs reported and the actual knowledge about the functional implications of such variations. There are approximately 10 million SNPs already reported, and the human populations are expected to collectively harbor at least 1.6-2.5 million INDELs. One of the major challenges is to find better platforms to screen for INDELs in a high throughput manner. The discordance in between the data from different studies might be explained by the diverse approaches employed to sequence the genomes with variable platforms. Short INDEL variations increased the scope of genetic markers in human genetic diseases, and various studies showed that common microdeletions and smaller INDELs might be highly associated with neuropsychiatric diseases such as schizophrenia, autism, mental retardation, and Alzheimer disease. The rapidly increasing amount of resequencing, genotyping, and personal genome data generated by large-scale genetic human projects require the development of integrated bioinformatics tools able to efficiently manage and analyze these genetic data. Our group is currently dealing with different approaches that might optimize sequencing and bioinformatics analyses of short INDELs to broaden our research capabilities of identifying those intriguing genetic variations. Hopefully, INDELs might become a new trend in association studies in neuropsychiatric genetics since so far the level of significant and positive associations with the standard SNPs reported presents limited predictive application.

Collapse

Bansal V, Libiger O. A probabilistic method for the detection and genotyping of small indels from population-scale sequence data. ACTA ACUST UNITED AC 2011;27:2047-53. [PMID: 21653520 DOI: 10.1093/bioinformatics/btr344] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]

Mills RE, Pittard WS, Mullaney JM, Farooq U, Creasy TH, Mahurkar AA, Kemeza DM, Strassler DS, Ponting CP, Webber C, Devine SE. Natural genetic variation caused by small insertions and deletions in the human genome. Genome Res 2011;21:830-9. [PMID: 21460062 DOI: 10.1101/gr.115907.110] [Citation(s) in RCA: 168] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

The rhox homeobox gene cluster is imprinted and selectively targeted for regulation by histone h1 and DNA methylation. Mol Cell Biol 2011;31:1275-87. [PMID: 21245380 DOI: 10.1128/mcb.00734-10] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Mullaney JM, Mills RE, Pittard WS, Devine SE. Small insertions and deletions (INDELs) in human genomes. Hum Mol Genet 2010;19:R131-6. [PMID: 20858594 DOI: 10.1093/hmg/ddq400] [Citation(s) in RCA: 215] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open

Mullikin JC, Hansen NF, Shen L, Ebling H, Donahue WF, Tao W, Saranga DJ, Brand A, Rubenfield MJ, Young AC, Cruz P, Driscoll C, David V, Al-Murrani SWK, Locniskar MF, Abrahamsen MS, O'Brien SJ, Smith DR, Brockman JA. Light whole genome sequence for SNP discovery across domestic cat breeds. BMC Genomics 2010;11:406. [PMID: 20576142 PMCID: PMC2996934 DOI: 10.1186/1471-2164-11-406] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2009] [Accepted: 06/24/2010] [Indexed: 11/23/2022] Open

Affiliation(s)

James C Mullikin Genome Technology Branch and NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
Nancy F Hansen Genome Technology Branch and NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
Lei Shen Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
Heather Ebling Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
William F Donahue Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
Wei Tao Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
David J Saranga Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
Adrianne Brand Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
Marc J Rubenfield Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
Alice C Young Genome Technology Branch and NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
Pedro Cruz Genome Technology Branch and NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
Carlos Driscoll Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland 21702, USA
Victor David Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland 21702, USA
Samer WK Al-Murrani Hill's Pet Nutrition Inc., PO Box 1658, Topeka, KS 66601, USA
Mary F Locniskar Hill's Pet Nutrition Inc., PO Box 1658, Topeka, KS 66601, USA
Mitchell S Abrahamsen Hill's Pet Nutrition Inc., PO Box 1658, Topeka, KS 66601, USA
Stephen J O'Brien Laboratory of Genomic Diversity, National Cancer Institute, Frederick, Maryland 21702, USA
Douglas R Smith Agencourt Bioscience Corporation, Beverly, Massachusetts 01915, USA
Jeffrey A Brockman Hill's Pet Nutrition Inc., PO Box 1658, Topeka, KS 66601, USA

Collapse