1
|
Neijzen D, Lunter G. Unsupervised learning for medical data: A review of probabilistic factorization methods. Stat Med 2023; 42:5541-5554. [PMID: 37850249 DOI: 10.1002/sim.9924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 09/13/2023] [Indexed: 10/19/2023]
Abstract
We review popular unsupervised learning methods for the analysis of high-dimensional data encountered in, for example, genomics, medical imaging, cohort studies, and biobanks. We show that four commonly used methods, principal component analysis, K-means clustering, nonnegative matrix factorization, and latent Dirichlet allocation, can be written as probabilistic models underpinned by a low-rank matrix factorization. In addition to highlighting their similarities, this formulation clarifies the various assumptions and restrictions of each approach, which eases identifying the appropriate method for specific applications for applied medical researchers. We also touch upon the most important aspects of inference and model selection for the application of these methods to health data.
Collapse
Affiliation(s)
- Dorien Neijzen
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Gerton Lunter
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Weatherall Institute of Molecular Medicine, Oxford University, Oxford, UK
| |
Collapse
|
2
|
Cox EGM, Zhang W, van der Voort PHJ, Lunter G, Keus F, Snieder H. Genetic association studies in critically ill patients: protocol for a systematic review. Syst Rev 2023; 12:233. [PMID: 38093336 PMCID: PMC10716946 DOI: 10.1186/s13643-023-02401-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 11/23/2023] [Indexed: 12/17/2023] Open
Abstract
INTRODUCTION Patients in the intensive care unit (ICU) are highly heterogeneous in characteristics, their clinical course, and outcomes. Genetic variability may partly explain the variability and similarity in disease courses observed among critically ill patients and may identify clusters of subgroups. The aim of this study is to conduct a systematic review of all genetic association studies of critically ill patients with their outcomes. METHODS AND ANALYSIS This systematic review will be conducted and reported according to the HuGE Review Handbook V1.0. We will search PubMed, Embase, and the Cochrane Library for relevant studies. All types of genetic association studies that included acutely admitted medical and surgical adult ICU patients will be considered for this review. All studies will be selected according to predefined selection criteria, evaluated and assessed for risk of bias independently by two reviewers. Risk of bias will be assessed according to the HuGE Review Handbook V1.0 with some modifications reflecting recent insights. We will provide an overview of all included studies by reporting the characteristics of the study designs, the patients included in the studies, the genetic variables, and the outcomes evaluated. ETHICS AND DISSEMINATION We will use data from peer-reviewed published articles, and hence, there is no requirement for ethics approval. The results of this systematic review will be disseminated through publication in a peer-reviewed scientific journal. SYSTEMATIC REVIEW REGISTRATION PROSPERO CRD42021209744.
Collapse
Affiliation(s)
- Eline G M Cox
- Department of Critical Care, University Medical Center Groningen, Groningen, 9713 GZ, the Netherlands.
| | - Wenbo Zhang
- Department of Epidemiology, University Medical Center Groningen, Groningen, 9713 GZ, the Netherlands
| | - Peter H J van der Voort
- Department of Critical Care, University Medical Center Groningen, Groningen, 9713 GZ, the Netherlands
| | - Gerton Lunter
- Department of Epidemiology, University Medical Center Groningen, Groningen, 9713 GZ, the Netherlands
| | - Frederik Keus
- Department of Critical Care, University Medical Center Groningen, Groningen, 9713 GZ, the Netherlands
| | - Harold Snieder
- Department of Epidemiology, University Medical Center Groningen, Groningen, 9713 GZ, the Netherlands
| |
Collapse
|
3
|
Pagnamenta AT, Camps C, Giacopuzzi E, Taylor JM, Hashim M, Calpena E, Kaisaki PJ, Hashimoto A, Yu J, Sanders E, Schwessinger R, Hughes JR, Lunter G, Dreau H, Ferla M, Lange L, Kesim Y, Ragoussis V, Vavoulis DV, Allroggen H, Ansorge O, Babbs C, Banka S, Baños-Piñero B, Beeson D, Ben-Ami T, Bennett DL, Bento C, Blair E, Brasch-Andersen C, Bull KR, Cario H, Cilliers D, Conti V, Davies EG, Dhalla F, Dacal BD, Dong Y, Dunford JE, Guerrini R, Harris AL, Hartley J, Hollander G, Javaid K, Kane M, Kelly D, Kelly D, Knight SJL, Kreins AY, Kvikstad EM, Langman CB, Lester T, Lines KE, Lord SR, Lu X, Mansour S, Manzur A, Maroofian R, Marsden B, Mason J, McGowan SJ, Mei D, Mlcochova H, Murakami Y, Németh AH, Okoli S, Ormondroyd E, Ousager LB, Palace J, Patel SY, Pentony MM, Pugh C, Rad A, Ramesh A, Riva SG, Roberts I, Roy N, Salminen O, Schilling KD, Scott C, Sen A, Smith C, Stevenson M, Thakker RV, Twigg SRF, Uhlig HH, van Wijk R, Vona B, Wall S, Wang J, Watkins H, Zak J, Schuh AH, Kini U, Wilkie AOM, Popitsch N, Taylor JC. Structural and non-coding variants increase the diagnostic yield of clinical whole genome sequencing for rare diseases. Genome Med 2023; 15:94. [PMID: 37946251 PMCID: PMC10636885 DOI: 10.1186/s13073-023-01240-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 09/27/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND Whole genome sequencing is increasingly being used for the diagnosis of patients with rare diseases. However, the diagnostic yields of many studies, particularly those conducted in a healthcare setting, are often disappointingly low, at 25-30%. This is in part because although entire genomes are sequenced, analysis is often confined to in silico gene panels or coding regions of the genome. METHODS We undertook WGS on a cohort of 122 unrelated rare disease patients and their relatives (300 genomes) who had been pre-screened by gene panels or arrays. Patients were recruited from a broad spectrum of clinical specialties. We applied a bioinformatics pipeline that would allow comprehensive analysis of all variant types. We combined established bioinformatics tools for phenotypic and genomic analysis with our novel algorithms (SVRare, ALTSPLICE and GREEN-DB) to detect and annotate structural, splice site and non-coding variants. RESULTS Our diagnostic yield was 43/122 cases (35%), although 47/122 cases (39%) were considered solved when considering novel candidate genes with supporting functional data into account. Structural, splice site and deep intronic variants contributed to 20/47 (43%) of our solved cases. Five genes that are novel, or were novel at the time of discovery, were identified, whilst a further three genes are putative novel disease genes with evidence of causality. We identified variants of uncertain significance in a further fourteen candidate genes. The phenotypic spectrum associated with RMND1 was expanded to include polymicrogyria. Two patients with secondary findings in FBN1 and KCNQ1 were confirmed to have previously unidentified Marfan and long QT syndromes, respectively, and were referred for further clinical interventions. Clinical diagnoses were changed in six patients and treatment adjustments made for eight individuals, which for five patients was considered life-saving. CONCLUSIONS Genome sequencing is increasingly being considered as a first-line genetic test in routine clinical settings and can make a substantial contribution to rapidly identifying a causal aetiology for many patients, shortening their diagnostic odyssey. We have demonstrated that structural, splice site and intronic variants make a significant contribution to diagnostic yield and that comprehensive analysis of the entire genome is essential to maximise the value of clinical genome sequencing.
Collapse
Affiliation(s)
- Alistair T Pagnamenta
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Carme Camps
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Edoardo Giacopuzzi
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Human Technopole, Viale Rita Levi Montalcini 1, 20157, Milan, Italy
| | - John M Taylor
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Mona Hashim
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Eduardo Calpena
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Pamela J Kaisaki
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Akiko Hashimoto
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Jing Yu
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Edward Sanders
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Ron Schwessinger
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Jim R Hughes
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- University Medical Center Groningen, Groningen University, PO Box 72, 9700 AB, Groningen, The Netherlands
| | - Helene Dreau
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Matteo Ferla
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Lukas Lange
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Yesim Kesim
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Vassilis Ragoussis
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Dimitrios V Vavoulis
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Holger Allroggen
- Neurosciences Department, UHCW NHS Trust, Clifford Bridge Road, Coventry, CV2 2DX, UK
| | - Olaf Ansorge
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Christian Babbs
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Siddharth Banka
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Oxford Road, Manchester, M13 9WL, UK
| | - Benito Baños-Piñero
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - David Beeson
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Tal Ben-Ami
- Pediatric Hematology-Oncology Unit, Kaplan Medical Center, Rehovot, Israel
| | - David L Bennett
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Celeste Bento
- Hematology Department, Hospitais da Universidade de Coimbra, Coimbra, Portugal
| | - Edward Blair
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Charlotte Brasch-Andersen
- Department of Clinical Genetics, Odense University Hospital and Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Katherine R Bull
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
| | - Holger Cario
- Department of Pediatrics and Adolescent Medicine, University Medical Center, Eythstrasse 24, 89075, Ulm, Germany
| | - Deirdre Cilliers
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Valerio Conti
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - E Graham Davies
- Department of Immunology, Great Ormond Street Hospital for Children NHS Trust and UCL Great Ormond Street Institute of Child Health, Zayed Centre for Research, 2Nd Floor, 20C Guilford Street, London, WC1N 1DZ, UK
| | - Fatima Dhalla
- Department of Paediatrics, Institute of Developmental and Regenerative Medicine, IMS-Tetsuya Nakamura Building, Old Road Campus, Roosevelt Drive, Oxford, OX3 7TY, UK
| | - Beatriz Diez Dacal
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Yin Dong
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - James E Dunford
- Oxford NIHR Musculoskeletal BRC and Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Old Road, Oxford, OX3 7HE, UK
| | - Renzo Guerrini
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - Adrian L Harris
- Department of Oncology, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
| | - Jane Hartley
- Liver Unit, Birmingham Women's & Children's Hospital and University of Birmingham, Steelhouse Lane, Birmingham, B4 6NH, UK
| | - Georg Hollander
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Kassim Javaid
- Oxford NIHR Musculoskeletal BRC and Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Old Road, Oxford, OX3 7HE, UK
| | - Maureen Kane
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Pharmacy Hall North, Room 731, 20 N. Pine Street, Baltimore, MD, 21201, USA
| | - Deirdre Kelly
- Liver Unit, Birmingham Women's & Children's Hospital and University of Birmingham, Steelhouse Lane, Birmingham, B4 6NH, UK
| | - Dominic Kelly
- Children's Hospital, OUH NHS Foundation Trust, NIHR Oxford BRC, Headley Way, Oxford, OX3 9DU, UK
| | - Samantha J L Knight
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Alexandra Y Kreins
- Department of Immunology, Great Ormond Street Hospital for Children NHS Trust and UCL Great Ormond Street Institute of Child Health, Zayed Centre for Research, 2Nd Floor, 20C Guilford Street, London, WC1N 1DZ, UK
| | - Erika M Kvikstad
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Craig B Langman
- Feinberg School of Medicine, Northwestern University, 211 E Chicago Avenue, Chicago, IL, MS37, USA
| | - Tracy Lester
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Kate E Lines
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Simon R Lord
- Early Phase Clinical Trials Unit, Department of Oncology, University of Oxford, Cancer and Haematology Centre, Level 2 Administration Area, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Xin Lu
- Nuffield Department of Clinical Medicine, Ludwig Institute for Cancer Research, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
| | - Sahar Mansour
- St George's University Hospitals NHS Foundation Trust, Blackshore Road, Tooting, London, SW17 0QT, UK
| | - Adnan Manzur
- MRC Centre for Neuromuscular Diseases, National Hospital for Neurology and Neurosurgery, Queen Square, London, WC1N 3BG, UK
| | - Reza Maroofian
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, WC1N 3BG, UK
| | - Brian Marsden
- Nuffield Department of Medicine, Kennedy Institute, University of Oxford, Oxford, OX3 7BN, UK
| | - Joanne Mason
- Yourgene Health Headquarters, Skelton House, Lloyd Street North, Manchester Science Park, Manchester, M15 6SH, UK
| | - Simon J McGowan
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Davide Mei
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - Hana Mlcochova
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Yoshiko Murakami
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Andrea H Németh
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Steven Okoli
- Imperial College NHS Trust, Department of Haematology, Hammersmith Hospital, Du Cane Road, London, W12 0HS, UK
| | - Elizabeth Ormondroyd
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Level 6 West Wing, Oxford, OX3 9DU, JR, UK
| | - Lilian Bomme Ousager
- Department of Clinical Genetics, Odense University Hospital and Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Jacqueline Palace
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Smita Y Patel
- Clinical Immunology, John Radcliffe Hospital, Level 4A, Oxford, OX3 9DU, UK
| | - Melissa M Pentony
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Chris Pugh
- Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
| | - Aboulfazl Rad
- Department of Otolaryngology-Head & Neck Surgery, Tübingen Hearing Research Centre, Eberhard Karls University, Elfriede-Aulhorn-Str. 5, 72076, Tübingen, Germany
| | - Archana Ramesh
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Simone G Riva
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Irene Roberts
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Noémi Roy
- Department of Haematology, Oxford University Hospitals NHS Foundation Trust, Level 4, Haematology, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Outi Salminen
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Kyleen D Schilling
- Ann & Robert H. Lurie Children's Hospital of Chicago, 225 E Chicago Avenue, Chicago, IL, 60611, USA
| | - Caroline Scott
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Arjune Sen
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Conrad Smith
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Mark Stevenson
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Rajesh V Thakker
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Stephen R F Twigg
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Holm H Uhlig
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
- Translational Gastroenterology Unit, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Richard van Wijk
- UMC Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
| | - Barbara Vona
- Department of Otolaryngology-Head & Neck Surgery, Tübingen Hearing Research Centre, Eberhard Karls University, Elfriede-Aulhorn-Str. 5, 72076, Tübingen, Germany
- Institute of Human Genetics, University Medical Center Göttingen, Heinrich-Düker-Weg 12, 37073, Göttingen, Germany
- Institute for Auditory Neuroscience and InnerEarLab, University Medical Center Göttingen, Robert-Koch-Str. 40, 37075, Göttingen, Germany
| | - Steven Wall
- Oxford Craniofacial Unit, John Radcliffe Hospital, Level LG1, West Wing, Oxford, OX3 9DU, UK
| | - Jing Wang
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Hugh Watkins
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Level 6 West Wing, Oxford, OX3 9DU, JR, UK
| | - Jaroslav Zak
- Nuffield Department of Clinical Medicine, Ludwig Institute for Cancer Research, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
- Department of Immunology and Microbiology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA, 92037, USA
| | - Anna H Schuh
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Usha Kini
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Andrew O M Wilkie
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Niko Popitsch
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Biochemistry and Cell Biology, Max Perutz Labs, University of Vienna, Vienna BioCenter(VBC), Dr.-Bohr-Gasse 9, 1030, Vienna, Austria
| | - Jenny C Taylor
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK.
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK.
| |
Collapse
|
4
|
Zhang Y, Jiang X, Mentzer AJ, McVean G, Lunter G. Topic modeling identifies novel genetic loci associated with multimorbidities in UK Biobank. Cell Genom 2023; 3:100371. [PMID: 37601973 PMCID: PMC10435382 DOI: 10.1016/j.xgen.2023.100371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Revised: 05/04/2023] [Accepted: 07/07/2023] [Indexed: 08/22/2023]
Abstract
Many diseases show patterns of co-occurrence, possibly driven by systemic dysregulation of underlying processes affecting multiple traits. We have developed a method (treeLFA) for identifying such multimorbidities from routine health-care data, which combines topic modeling with an informative prior derived from medical ontology. We apply treeLFA to UK Biobank data and identify a variety of topics representing multimorbidity clusters, including a healthy topic. We find that loci identified using topic weights as traits in a genome-wide association study (GWAS) analysis, which we validated with a range of approaches, only partially overlap with loci from GWASs on constituent single diseases. We also show that treeLFA improves upon existing methods like latent Dirichlet allocation in various ways. Overall, our findings indicate that topic models can characterize multimorbidity patterns and that genetic analysis of these patterns can provide insight into the etiology of complex traits that cannot be determined from the analysis of constituent traits alone.
Collapse
Affiliation(s)
- Yidong Zhang
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
- Chinese Academy of Medical Sciences Oxford Institute, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, UK
- Department of Radiation Oncology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100006, China
| | - Xilin Jiang
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
- Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, UK
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
- Victor Phillip Dahdaleh Heart and Lung Research Institute, University of Cambridge, Cambridge CB2 0SR, UK
- Heart and Lung Research Institute, University of Cambridge, Cambridge CB2 0BB, UK
| | - Alexander J. Mentzer
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, UK
| | - Gil McVean
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford OX3 7LF, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, University of Oxford, Oxford OX3 9DS, UK
- Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen 9700 RB, the Netherlands
| |
Collapse
|
5
|
Niebuur J, Vonk JM, Du Y, de Bock GH, Lunter G, Krabbe PFM, Alizadeh BZ, Snieder H, Smidt N, Boezen M, Corpeleijn E. Lifestyle factors related to prevalent chronic disease multimorbidity: A population-based cross-sectional study. PLoS One 2023; 18:e0287263. [PMID: 37486939 PMCID: PMC10365307 DOI: 10.1371/journal.pone.0287263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 06/02/2023] [Indexed: 07/26/2023] Open
Abstract
BACKGROUND Multimorbidity is associated with poor quality of life, polypharmacy, health care costs and mortality, with those affected potentially benefitting from a healthy lifestyle. We assessed a comprehensive set of lifestyle factors in relation to multimorbidity with major chronic diseases. METHODS This cross-sectional study utilised baseline data for adults from the prospective Lifelines Cohort in the north of the Netherlands (N = 79,345). We defined multimorbidity as the co-existence of two or more chronic diseases (i.e. cardiovascular disease, cancer, respiratory disease, type 2 diabetes) and evaluated factors in six lifestyle domains (nutrition, physical (in)activity, substance abuse, sleep, stress, relationships) among groups by the number of chronic diseases (≥2, 1, 0). Multinomial logistic regression models were created, adjusted for appropriate confounders, and odds ratios (OR) with 95% confidence intervals (95%CI) were reported. RESULTS 3,712 participants had multimorbidity (4.7%, age 53.5 ± 12.5 years), and this group tended to have less healthy lifestyles. Compared to those without chronic diseases, those with multimorbidity reported physical inactivity more often (OR, 1.15; 95%CI, 1.06-1.25; not significant for one condition), chronic stress (OR, 2.14; 95%CI, 1.92-2.38) and inadequate sleep (OR, 1.70; 95%CI, 1.41-2.06); as expected, they more often watched television (OR, 1.70; 95%CI, 1.42-2.04) and currently smoked (OR, 1.91; 95%CI, 1.73-2.11), but they also had lower alcohol intakes (OR, 0.66; 95%CI, 0.59-0.74). CONCLUSIONS Chronic stress and poor sleep, in addition to physical inactivity and smoking, are lifestyle factors of great concern in patients with multimorbidity.
Collapse
Affiliation(s)
- Jacobien Niebuur
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Judith M. Vonk
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Yihui Du
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Geertruida H. de Bock
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Gerton Lunter
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Paul F. M. Krabbe
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Behrooz Z. Alizadeh
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Harold Snieder
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Nynke Smidt
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Marike Boezen
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Eva Corpeleijn
- Department of Epidemiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| |
Collapse
|
6
|
Moes HR, Ten Kate JM, Portman AT, van Harten B, van Kesteren ME, Mondria T, Lunter G, Buskens E, van Laar T. Timely referral for device-aided therapy in Parkinson's disease. Development of a screening tool. Parkinsonism Relat Disord 2023; 109:105359. [PMID: 36958065 DOI: 10.1016/j.parkreldis.2023.105359] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 02/08/2023] [Accepted: 03/05/2023] [Indexed: 03/11/2023]
Abstract
BACKGROUND Timely referral of Parkinson's disease (PD) patients to specialized centers for treatment with device-aided therapies (DAT) is suboptimal. OBJECTIVE To develop a screening tool for timely referral for DAT in PD and to compare the tool with the published 5-2-1 criteria. METHODS A cross-sectional, observational study was performed in 8 hospitals in the catchment area of a specialized movement disorder center in the Northern part of the Netherlands. The target population comprised PD patients not yet on DAT visiting the outpatient clinic of participating hospitals. The primary outcome was apparent eligibility for referral for DAT based on consensus by a panel of 5 experts in the field of DAT. Multivariable logistic regression modelling was used to develop a screening tool for eligibility for referral for DAT. Potential predictors were patient and disease characteristics as observed by attending neurologists. RESULTS In total, 259 consecutive PD patients were included, of whom 17 were deemed eligible for referral for DAT (point prevalence: 6.6%). Presence of response fluctuations and troublesome dyskinesias were the strongest independent predictors of being considered eligible. Both variables were included in the final model, as well as levodopa equivalent daily dose. Decision curve analysis revealed the new model outperforms the 5-2-1 criteria. A simple chart was constructed to provide guidance for referral. Discrimination of this simplified scoring system proved excellent (AUC after bootstrapping: 0.97). CONCLUSIONS Awaiting external validation, the developed screening tool already appears promising for timely referral and subsequent treatment with DAT in patients with PD.
Collapse
Affiliation(s)
- Harmen R Moes
- University of Groningen, University Medical Center Groningen, Department of Neurology, Groningen, the Netherlands.
| | - Jolien M Ten Kate
- University of Groningen, University Medical Center Groningen, Department of Neurology, Groningen, the Netherlands
| | - Axel T Portman
- Treant Zorggroep, Department of Neurology, Stadskanaal, the Netherlands
| | - Barbera van Harten
- Medical Center Leeuwarden, Department of Neurology, Leeuwarden, the Netherlands
| | | | - Tjeerd Mondria
- Antonius Hospital, Department of Neurology, Sneek, the Netherlands
| | - Gerton Lunter
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
| | - Erik Buskens
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
| | - Teus van Laar
- University of Groningen, University Medical Center Groningen, Department of Neurology, Groningen, the Netherlands
| |
Collapse
|
7
|
de Zwart A, Riezebos-Brilman A, Lunter G, Vonk J, Glanville AR, Gottlieb J, Permpalung N, Kerstjens H, Alffenaar JW, Verschuuren E. Respiratory Syncytial Virus, Human Metapneumovirus, and Parainfluenza Virus Infections in Lung Transplant Recipients: A Systematic Review of Outcomes and Treatment Strategies. Clin Infect Dis 2022; 74:2252-2260. [PMID: 35022697 PMCID: PMC9258934 DOI: 10.1093/cid/ciab969] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Indexed: 12/16/2022] Open
Abstract
Background Respiratory syncytial virus (RSV), parainfluenza virus (PIV), and human metapneumovirus (hMPV) are increasingly associated with chronic lung allograft dysfunction (CLAD) in lung transplant recipients (LTR). This systematic review primarily aimed to assess outcomes of RSV/PIV/hMPV infections in LTR and secondarily to assess evidence regarding the efficacy of ribavirin. Methods Relevant databases were queried and study outcomes extracted using a standardized method and summarized. Results Nineteen retrospective and 12 prospective studies were included (total 1060 cases). Pooled 30-day mortality was low (0–3%), but CLAD progression 180–360 days postinfection was substantial (pooled incidences 19–24%) and probably associated with severe infection. Ribavirin trended toward effectiveness for CLAD prevention in exploratory meta-analysis (odds ratio [OR] 0.61, [0.27–1.18]), although results were highly variable between studies. Conclusions RSV/PIV/hMPV infection was followed by a high CLAD incidence. Treatment options, including ribavirin, are limited. There is an urgent need for high-quality studies to provide better treatment options for these infections.
Collapse
Affiliation(s)
- Auke de Zwart
- University of Groningen, University Medical Center Groningen, Department of Pulmonary Medicine and Tuberculosis, Groningen, The Netherlands
| | | | - Gerton Lunter
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, The Netherlands
| | - Judith Vonk
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, The Netherlands
| | | | - Jens Gottlieb
- Department of Respiratory Medicine, Hannover Medical School, Hannover, Germany
| | - Nitipong Permpalung
- Division of Infectious Diseases, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA.,Department of Microbiology, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
| | - Huib Kerstjens
- University of Groningen, University Medical Center Groningen, Department of Pulmonary Medicine and Tuberculosis, Groningen, The Netherlands
| | - Jan-Willem Alffenaar
- School of Pharmacy, Faculty of Medicine and Health, University of Sydney, Sydney, Australia.,Westmead Hospital, Westmead, Australia.,Marie Bashir Institute of Infectious Diseases and Biosecurity, University of Sydney, Sydney, Australia
| | - Erik Verschuuren
- University of Groningen, University Medical Center Groningen, Department of Pulmonary Medicine and Tuberculosis, Groningen, The Netherlands
| |
Collapse
|
8
|
Abstract
Genotyping from sequencing is the basis of emerging strategies in the molecular breeding of polyploid plants. However, compared with the situation for diploids, in which genotyping accuracies are confidently determined with comprehensive benchmarks, polyploids have been neglected; there are no benchmarks measuring genotyping error rates for small variants using real sequencing reads. We previously introduced a variant calling method, Octopus, that accurately calls germline variants in diploids and somatic mutations in tumors. Here, we evaluate Octopus and other popular tools on whole-genome tetraploid and hexaploid data sets created using in silico mixtures of diploid Genome in a Bottle (GIAB) samples. We find that genotyping errors are abundant for typical sequencing depths but that Octopus makes 25% fewer errors than other methods on average. We supplement our benchmarks with concordance analysis in real autotriploid banana data sets.
Collapse
Affiliation(s)
- Daniel P Cooke
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, United Kingdom
| | - David C Wedge
- Manchester Cancer Research Centre, University of Manchester, Manchester M20 4GJ, United Kingdom
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, United Kingdom
- Department of Epidemiology, University Medical Center Groningen, 9713 GZ Groningen, The Netherlands
| |
Collapse
|
9
|
Locke DP, Hillier LW, Warren WC, Worley KC, Nazareth LV, Muzny DM, Yang SP, Wang Z, Chinwalla AT, Minx P, Mitreva M, Cook L, Delehaunty KD, Fronick C, Schmidt H, Fulton LA, Fulton RS, Nelson JO, Magrini V, Pohl C, Graves TA, Markovic C, Cree A, Dinh HH, Hume J, Kovar CL, Fowler GR, Lunter G, Meader S, Heger A, Ponting CP, Marques-Bonet T, Alkan C, Chen L, Cheng Z, Kidd JM, Eichler EE, White S, Searle S, Vilella AJ, Chen Y, Flicek P, Ma J, Raney B, Suh B, Burhans R, Herrero J, Haussler D, Faria R, Fernando O, Darré F, Farré D, Gazave E, Oliva M, Navarro A, Roberto R, Capozzi O, Archidiacono N, Della Valle G, Purgato S, Rocchi M, Konkel MK, Walker JA, Ullmer B, Batzer MA, Smit AFA, Hubley R, Casola C, Schrider DR, Hahn MW, Quesada V, Puente XS, Ordoñez GR, López-Otín C, Vinar T, Brejova B, Ratan A, Harris RS, Miller W, Kosiol C, Lawson HA, Taliwal V, Martins AL, Siepel A, RoyChoudhury A, Ma X, Degenhardt J, Bustamante CD, Gutenkunst RN, Mailund T, Dutheil JY, Hobolth A, Schierup MH, Ryder OA, Yoshinaga Y, de Jong PJ, Weinstock GM, Rogers J, Mardis ER, Gibbs RA, Wilson RK. Author Correction: Comparative and demographic analysis of orang-utan genomes. Nature 2022; 608:E36. [PMID: 35962045 PMCID: PMC9402433 DOI: 10.1038/s41586-022-04799-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Devin P. Locke
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - LaDeana W. Hillier
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Wesley C. Warren
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Kim C. Worley
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Lynne V. Nazareth
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Donna M. Muzny
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Shiaw-Pyng Yang
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Zhengyuan Wang
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Asif T. Chinwalla
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Pat Minx
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Makedonka Mitreva
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Lisa Cook
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Kim D. Delehaunty
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Catrina Fronick
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Heather Schmidt
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Lucinda A. Fulton
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Robert S. Fulton
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Joanne O. Nelson
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Vincent Magrini
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Craig Pohl
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Tina A. Graves
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Chris Markovic
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Andy Cree
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Huyen H. Dinh
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Jennifer Hume
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Christie L. Kovar
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Gerald R. Fowler
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Gerton Lunter
- grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK ,grid.270683.80000 0004 0641 4511Wellcome Trust Centre for Human Genetics, Oxford, UK
| | - Stephen Meader
- grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK
| | - Andreas Heger
- grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK
| | - Chris P. Ponting
- grid.4991.50000 0004 1936 8948MRC Functional Genomics Unit and Department of Physiology, Anatomy and Genetics, University of Oxford, Le Gros Clark Building, Oxford, UK
| | - Tomas Marques-Bonet
- grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA ,grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
| | - Can Alkan
- grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
| | - Lin Chen
- grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
| | - Ze Cheng
- grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
| | - Jeffrey M. Kidd
- grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA
| | - Evan E. Eichler
- grid.34477.330000000122986657Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington USA ,grid.413575.10000 0001 2167 1581Howard Hughes Medical Institute, Seattle, Washington USA
| | - Simon White
- grid.10306.340000 0004 0606 5382Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Stephen Searle
- grid.10306.340000 0004 0606 5382Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, UK
| | - Albert J. Vilella
- grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
| | - Yuan Chen
- grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
| | - Paul Flicek
- grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
| | - Jian Ma
- grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA ,grid.35403.310000 0004 1936 9991Present Address: Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois USA
| | - Brian Raney
- grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA
| | - Bernard Suh
- grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA
| | - Richard Burhans
- grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
| | - Javier Herrero
- grid.52788.300000 0004 0427 7672European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge UK
| | - David Haussler
- grid.205975.c0000 0001 0740 6917Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California USA
| | - Rui Faria
- grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain ,grid.5808.50000 0001 1503 7226CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Campus Agrário de Vairão, Vairão, Portugal
| | - Olga Fernando
- grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain ,grid.10772.330000000121511713Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Oeiras, Portugal
| | - Fleur Darré
- grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
| | - Domènec Farré
- grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
| | - Elodie Gazave
- grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
| | - Meritxell Oliva
- grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain
| | - Arcadi Navarro
- grid.5612.00000 0001 2172 2676IBE, Institut de Biologia Evolutiva (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Doctor Aiguader, 88, Barcelona, Spain ,grid.425902.80000 0000 9601 989XICREA (Institució Catalana de Recerca i Estudis Avançats) and INB (Instituto Nacional de Bioinformática) PRBB, Doctor Aiguader, 88, Barcelona, Spain
| | - Roberta Roberto
- grid.7644.10000 0001 0120 3326Department of Biology, University of Bari, Bari, Italy
| | - Oronzo Capozzi
- grid.7644.10000 0001 0120 3326Department of Biology, University of Bari, Bari, Italy
| | | | - Giuliano Della Valle
- grid.6292.f0000 0004 1757 1758Department of Biology, University of Bologna, Bologna, Italy
| | - Stefania Purgato
- grid.6292.f0000 0004 1757 1758Department of Biology, University of Bologna, Bologna, Italy
| | - Mariano Rocchi
- grid.7644.10000 0001 0120 3326Department of Biology, University of Bari, Bari, Italy
| | - Miriam K. Konkel
- grid.64337.350000 0001 0662 7451Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana USA
| | - Jerilyn A. Walker
- grid.64337.350000 0001 0662 7451Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana USA
| | - Brygg Ullmer
- grid.64337.350000 0001 0662 7451Center for Computation and Technology, Department of Computer Sciences, Louisiana State University, Baton Rouge, Louisiana USA
| | - Mark A. Batzer
- grid.64337.350000 0001 0662 7451Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana USA
| | - Arian F. A. Smit
- grid.64212.330000 0004 0463 2320Institute for Systems Biology, Seattle, Washington USA
| | - Robert Hubley
- grid.64212.330000 0004 0463 2320Institute for Systems Biology, Seattle, Washington USA
| | - Claudio Casola
- grid.411377.70000 0001 0790 959XDepartment of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana USA
| | - Daniel R. Schrider
- grid.411377.70000 0001 0790 959XDepartment of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana USA
| | - Matthew W. Hahn
- grid.411377.70000 0001 0790 959XDepartment of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana USA
| | - Victor Quesada
- grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
| | - Xose S. Puente
- grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
| | - Gonzalo R. Ordoñez
- grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
| | - Carlos López-Otín
- grid.10863.3c0000 0001 2164 6351Instituto Universitario de Oncologia, Departamento de Bioquimica y Biologia Molecular, Universidad de Oviedo, Oviedo, Spain
| | - Tomas Vinar
- grid.7634.60000000109409708Faculty of Mathematics, Physics and Informatics, Comenius University, Mlynska Dolina, Bratislava, Slovakia
| | - Brona Brejova
- grid.7634.60000000109409708Faculty of Mathematics, Physics and Informatics, Comenius University, Mlynska Dolina, Bratislava, Slovakia
| | - Aakrosh Ratan
- grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
| | - Robert S. Harris
- grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
| | - Webb Miller
- grid.29857.310000 0001 2097 4281Center for Comparative Genomics and Bioinformatics, Penn State University, University Park, Pennsylvania, USA
| | - Carolin Kosiol
- Institut für Populations genetik, Vetmeduni Vienna, Wien, Austria
| | - Heather A. Lawson
- grid.4367.60000 0001 2355 7002Department of Anatomy and Neurobiology, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Vikas Taliwal
- grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
| | - André L. Martins
- grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
| | - Adam Siepel
- grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
| | - Arindam RoyChoudhury
- grid.21729.3f0000000419368729Department of Biostatistics, Columbia University, New York, New York USA
| | - Xin Ma
- grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
| | - Jeremiah Degenhardt
- grid.5386.8000000041936877XDepartment of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York USA
| | - Carlos D. Bustamante
- grid.168010.e0000000419368956Department of Genetics, Stanford University, Stanford, California USA
| | - Ryan N. Gutenkunst
- grid.134563.60000 0001 2168 186XDepartment of Molecular and Cellular Biology, University of Arizona, Tucson, Arizona USA
| | - Thomas Mailund
- grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
| | - Julien Y. Dutheil
- grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
| | - Asger Hobolth
- grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
| | - Mikkel H. Schierup
- grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus C, Denmark
| | - Oliver A. Ryder
- grid.452788.40000 0004 0458 5309San Diego Zoo’s Institute for Conservation Research, Escondido, California USA
| | - Yuko Yoshinaga
- grid.414016.60000 0004 0433 7727Children’s Hospital Oakland Research Institute, Oakland, California USA
| | - Pieter J. de Jong
- grid.414016.60000 0004 0433 7727Children’s Hospital Oakland Research Institute, Oakland, California USA
| | - George M. Weinstock
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Jeffrey Rogers
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Elaine R. Mardis
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| | - Richard A. Gibbs
- grid.39382.330000 0001 2160 926XHuman Genome Sequencing Center, Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, Texas USA
| | - Richard K. Wilson
- grid.4367.60000 0001 2355 7002The Genome Center at Washington University, Washington University School of Medicine, Saint Louis, Missouri USA
| |
Collapse
|
10
|
Reijntjes B, van Suijlichem M, Woolderink JM, Bongers MY, Reesink-Peters N, Paulsen L, van der Hurk PJ, Kraayenbrink AA, Apperloo MJA, Slangen B, Schukken T, Tummers FHMP, van Kesteren PJM, Huirne JAF, Boskamp D, Lunter G, de Bock GH, Mourits MJE. Recurrence and survival after laparoscopy versus laparotomy without lymphadenectomy in early-stage endometrial cancer: Long-term outcomes of a randomised trial. Gynecol Oncol 2021; 164:265-270. [PMID: 34955237 DOI: 10.1016/j.ygyno.2021.12.019] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Revised: 12/11/2021] [Accepted: 12/13/2021] [Indexed: 11/04/2022]
Abstract
BACKGROUND Laparoscopic hysterectomy is accepted worldwide as the standard treatment option for early-stage endometrial cancer. However, there are limited data on long-term survival, particularly when no lymphadenectomy is performed. We compared the survival outcomes of total laparoscopic hysterectomy (TLH) and total abdominal hysterectomy (TAH), both without lymphadenectomy, for early-stage endometrial cancer up to 5 years postoperatively. METHODS Follow-up of a multi-centre, randomised controlled trial comparing TLH and TAH, without routine lymphadenectomy, for women with stage I endometrial cancer. Enrolment was between 2007 and 2009 by 2:1 randomisation to TLH or TAH. Outcomes were disease-free survival (DFS), overall survival (OS), disease-specific survival (DSS), and primary site of recurrence. Multivariable Cox regression analyses were adjusted for age, stage, grade, and radiotherapy with adjusted hazard ratios (aHR) and 95% confidence intervals (95%CI) reported. To test for significance, non-inferiority margins were defined. RESULTS In total, 279 women underwent a surgical procedure, of whom 263 (94%) had follow-up data. For the TLH (n = 175) and TAH (n = 88) groups, DFS (90.3% vs 84.1%; aHR[recurrence], 0.69; 95%CI, 0.31-1.52), OS (89.2% vs 82.8%; aHR[death], 0.60; 95%CI, 0.30-1.19), and DSS (95.0% vs 89.8%; aHR[death], 0.62; 95%CI, 0.23-1.70) were reported at 5 years. At a 10% significance level, and with a non-inferiority margin of 0.20, the null hypothesis of inferiority was rejected for all three outcomes. There were no port-site or wound metastases, and local recurrence rates were comparable. CONCLUSION Disease recurrence and 5-year survival rates were comparable between the TLH and TAH groups and comparable to studies with lymphadenectomy, supporting the widespread use of TLH without lymphadenectomy as the primary treatment for early-stage, low-grade endometrial cancer.
Collapse
Affiliation(s)
- Bianca Reijntjes
- Department of Gynaecology, University Medical Center Groningen, University of Groningen, the Netherlands
| | - Mieke van Suijlichem
- Department of Gynaecology, University Medical Center Groningen, University of Groningen, the Netherlands
| | - Jorien M Woolderink
- Department of Obstetrics and Gynaecology, Martini Hospital Groningen, the Netherlands
| | - Marlies Y Bongers
- Department of Obstetrics and Gynaecology, Maxima Medical Center Veldhoven, the Netherlands
| | - Nathalie Reesink-Peters
- Department of Obstetrics and Gynaecology, Medical Spectrum Twente Enschede/Hospital Group Twente Almelo, the Netherlands
| | - Lasse Paulsen
- Department of Obstetrics and Gynaecology, Wilhelmina Hospital Assen, the Netherlands
| | - Pieter J van der Hurk
- Department of Obstetrics and Gynaecology, Nij Smellinghe Hospital Drachten, the Netherlands
| | - Arjan A Kraayenbrink
- Department of Obstetrics and Gynaecology, Rijnstate Hospital Arnhem, the Netherlands
| | - Mirjam J A Apperloo
- Department of Obstetrics and Gynaecology, Medical Center Leeuwarden, the Netherlands
| | - Brigitte Slangen
- Department of Obstetrics and Gynaecology, Maastricht University Medical Center, the Netherlands
| | - Tineke Schukken
- Department of Obstetrics and Gynaecology, Antonius Hospital Sneek, the Netherlands
| | | | | | - Judith A F Huirne
- Department of Gynaecology, Amsterdam University Medical Center, the Netherlands
| | - Dieuwke Boskamp
- Department of Obstetrics and Gynaecology, VieCuri Medical Center Venlo, the Netherlands
| | - Gerton Lunter
- Department of Epidemiology, University Medical Center Groningen, University of Groningen, the Netherlands
| | - Geertruida H de Bock
- Department of Epidemiology, University Medical Center Groningen, University of Groningen, the Netherlands
| | - Marian J E Mourits
- Department of Gynaecology, University Medical Center Groningen, University of Groningen, the Netherlands.
| |
Collapse
|
11
|
Tapmeier TT, Rahmioglu N, Lin J, De Leo B, Obendorf M, Raveendran M, Fischer OM, Bafligil C, Guo M, Harris RA, Hess-Stumpp H, Laux-Biehlmann A, Lowy E, Lunter G, Malzahn J, Martin NG, Martinez FO, Manek S, Mesch S, Montgomery GW, Morris AP, Nagel J, Simmons HA, Brocklebank D, Shang C, Treloar S, Wells G, Becker CM, Oppermann U, Zollner TM, Kennedy SH, Kemnitz JW, Rogers J, Zondervan KT. Neuropeptide S receptor 1 is a nonhormonal treatment target in endometriosis. Sci Transl Med 2021; 13:13/608/eabd6469. [PMID: 34433639 DOI: 10.1126/scitranslmed.abd6469] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Revised: 02/25/2021] [Accepted: 08/06/2021] [Indexed: 12/28/2022]
Abstract
Endometriosis is a common chronic inflammatory condition causing pelvic pain and infertility in women, with limited treatment options and 50% heritability. We leveraged genetic analyses in two species with spontaneous endometriosis, humans and the rhesus macaque, to uncover treatment targets. We sequenced DNA from 32 human families contributing to a genetic linkage signal on chromosome 7p13-15 and observed significant overrepresentation of predicted deleterious low-frequency coding variants in NPSR1, the gene encoding neuropeptide S receptor 1, in cases (predominantly stage III/IV) versus controls (P = 7.8 × 10-4). Significant linkage to the region orthologous to human 7p13-15 was replicated in a pedigree of 849 rhesus macaques (P = 0.0095). Targeted association analyses in 3194 surgically confirmed, unrelated cases and 7060 controls revealed that a common insertion/deletion variant, rs142885915, was significantly associated with stage III/IV endometriosis (P = 5.2 × 10-5; odds ratio, 1.23; 95% CI, 1.09 to 1.39). Immunohistochemistry, qRT-PCR, and flow cytometry experiments demonstrated that NPSR1 was expressed in glandular epithelium from eutopic and ectopic endometrium, and on monocytes in peritoneal fluid. The NPSR1 inhibitor SHA 68R blocked NPSR1-mediated signaling, proinflammatory TNF-α release, and monocyte chemotaxis in vitro (P < 0.01), and led to a significant reduction of inflammatory cell infiltrate and abdominal pain (P < 0.05) in a mouse model of peritoneal inflammation as well as in a mouse model of endometriosis. We conclude that the NPSR1/NPS system is a genetically validated, nonhormonal target for the treatment of endometriosis with likely increased relevance to stage III/IV disease.
Collapse
Affiliation(s)
- Thomas T Tapmeier
- Oxford Endometriosis CaRe Centre, Nuffield Department of Women's & Reproductive Health, University of Oxford, Women's Centre, John Radcliffe Hospital, Oxford OX3 9DU, UK. .,Department of Obstetrics and Gynaecology, Monash University, Clayton, Victoria 3168, Australia
| | - Nilufer Rahmioglu
- Oxford Endometriosis CaRe Centre, Nuffield Department of Women's & Reproductive Health, University of Oxford, Women's Centre, John Radcliffe Hospital, Oxford OX3 9DU, UK.,Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Jianghai Lin
- Oxford Endometriosis CaRe Centre, Nuffield Department of Women's & Reproductive Health, University of Oxford, Women's Centre, John Radcliffe Hospital, Oxford OX3 9DU, UK.,Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University, Guangzhou, Guangdong 510632, PR China
| | - Bianca De Leo
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | - Maik Obendorf
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | | | - Oliver M Fischer
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | - Cemsel Bafligil
- Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Oxford OX3 7LD, UK
| | - Manman Guo
- Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Oxford OX3 7LD, UK
| | - Ronald Alan Harris
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Holger Hess-Stumpp
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | - Alexis Laux-Biehlmann
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | - Ernesto Lowy
- Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DS, UK
| | - Jessica Malzahn
- Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Oxford OX3 7LD, UK
| | - Nicholas G Martin
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland 4006, Australia
| | - Fernando O Martinez
- Faculty of Health and Medical Sciences, University of Surrey, Guildford GU2 7YH, UK
| | - Sanjiv Manek
- Department of Pathology, Oxford University Hospitals Foundation Trust, Oxford OX3 9DU, UK
| | - Stefanie Mesch
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | - Grant W Montgomery
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland 4006, Australia.,Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Andrew P Morris
- Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK.,Division of Musculoskeletal and Dermatological Sciences, University of Manchester, Manchester M13 9PL, UK
| | - Jens Nagel
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | - Heather A Simmons
- Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, WI 53715, USA
| | - Denise Brocklebank
- Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Catherine Shang
- Oxford Endometriosis CaRe Centre, Nuffield Department of Women's & Reproductive Health, University of Oxford, Women's Centre, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Susan Treloar
- QIMR Berghofer Medical Research Institute, Brisbane, Queensland 4006, Australia
| | - Graham Wells
- Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Oxford OX3 7LD, UK
| | - Christian M Becker
- Oxford Endometriosis CaRe Centre, Nuffield Department of Women's & Reproductive Health, University of Oxford, Women's Centre, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Udo Oppermann
- Nuffield Department of Orthopedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Oxford OX3 7LD, UK
| | - Thomas M Zollner
- Bayer AG Pharmaceuticals, Research & Development, Building S107, 13342 Berlin, Germany
| | - Stephen H Kennedy
- Oxford Endometriosis CaRe Centre, Nuffield Department of Women's & Reproductive Health, University of Oxford, Women's Centre, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Joseph W Kemnitz
- Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, WI 53715, USA.,Department of Cell & Regenerative Biology and Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA.,Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, WI 53715, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Krina T Zondervan
- Oxford Endometriosis CaRe Centre, Nuffield Department of Women's & Reproductive Health, University of Oxford, Women's Centre, John Radcliffe Hospital, Oxford OX3 9DU, UK. .,Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| |
Collapse
|
12
|
Cooke DP, Wedge DC, Lunter G. A unified haplotype-based method for accurate and comprehensive variant calling. Nat Biotechnol 2021; 39:885-892. [PMID: 33782612 PMCID: PMC7611855 DOI: 10.1038/s41587-021-00861-3] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Accepted: 02/18/2021] [Indexed: 01/31/2023]
Abstract
Almost all haplotype-based variant callers were designed specifically for detecting common germline variation in diploid populations, and give suboptimal results in other scenarios. Here we present Octopus, a variant caller that uses a polymorphic Bayesian genotyping model capable of modeling sequencing data from a range of experimental designs within a unified haplotype-aware framework. Octopus combines sequencing reads and prior information to phase-called genotypes of arbitrary ploidy, including those with somatic mutations. We show that Octopus accurately calls germline variants in individuals, including single nucleotide variants, indels and small complex replacements such as microinversions. Using a synthetic tumor data set derived from clean sequencing data from a sample with known germline haplotypes and observed mutations in a large cohort of tumor samples, we show that Octopus is more sensitive to low-frequency somatic variation, yet calls considerably fewer false positives than other methods. Octopus also outputs realigned evidence BAM files to aid validation and interpretation.
Collapse
Affiliation(s)
- Daniel P Cooke
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
| | - David C Wedge
- Manchester Cancer Research Centre, University of Manchester, Manchester, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Department of Epidemiology, University Medical Centre Groningen, Groningen, the Netherlands
| |
Collapse
|
13
|
Sergeant MJ, Hughes JR, Hentges L, Lunter G, Downes DJ, Taylor S. Multi Locus View: an extensible web-based tool for the analysis of genomic data. Commun Biol 2021; 4:623. [PMID: 34035422 PMCID: PMC8149710 DOI: 10.1038/s42003-021-02097-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Accepted: 03/31/2021] [Indexed: 01/10/2023] Open
Abstract
Tracking and understanding data quality, analysis and reproducibility are critical concerns in the biological sciences. This is especially true in genomics where next generation sequencing (NGS) based technologies such as ChIP-seq, RNA-seq and ATAC-seq are generating a flood of genome-scale data. However, such data are usually processed with automated tools and pipelines, generating tabular outputs and static visualisations. Interpretation is normally made at a high level without the ability to visualise the underlying data in detail. Conventional genome browsers are limited to browsing single locations and do not allow for interactions with the dataset as a whole. Multi Locus View (MLV), a web-based tool, has been developed to allow users to fluidly interact with genomics datasets at multiple scales. The user is able to browse the raw data, cluster, and combine the data with other analysis and annotate the data. User datasets can then be shared with other users or made public for quick assessment from the academic community. MLV is publically available at https://mlv.molbiol.ox.ac.uk .
Collapse
Affiliation(s)
- Martin J Sergeant
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Jim R Hughes
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Lance Hentges
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Gerton Lunter
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- University Medical Centre Groningen, Department of Epidemiology, University of Groningen, Groningen, The Netherlands
| | - Damien J Downes
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Stephen Taylor
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
| |
Collapse
|
14
|
Roberts HE, Lopopolo M, Pagnamenta AT, Sharma E, Parkes D, Lonie L, Freeman C, Knight SJL, Lunter G, Dreau H, Lockstone H, Taylor JC, Schuh A, Bowden R, Buck D. Short and long-read genome sequencing methodologies for somatic variant detection; genomic analysis of a patient with diffuse large B-cell lymphoma. Sci Rep 2021; 11:6408. [PMID: 33742045 PMCID: PMC7979876 DOI: 10.1038/s41598-021-85354-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Accepted: 02/24/2021] [Indexed: 12/14/2022] Open
Abstract
Recent advances in throughput and accuracy mean that the Oxford Nanopore Technologies PromethION platform is a now a viable solution for genome sequencing. Much of the validation of bioinformatic tools for this long-read data has focussed on calling germline variants (including structural variants). Somatic variants are outnumbered many-fold by germline variants and their detection is further complicated by the effects of tumour purity/subclonality. Here, we evaluate the extent to which Nanopore sequencing enables detection and analysis of somatic variation. We do this through sequencing tumour and germline genomes for a patient with diffuse B-cell lymphoma and comparing results with 150 bp short-read sequencing of the same samples. Calling germline single nucleotide variants (SNVs) from specific chromosomes of the long-read data achieved good specificity and sensitivity. However, results of somatic SNV calling highlight the need for the development of specialised joint calling algorithms. We find the comparative genome-wide performance of different tools varies significantly between structural variant types, and suggest long reads are especially advantageous for calling large somatic deletions and duplications. Finally, we highlight the utility of long reads for phasing clinically relevant variants, confirming that a somatic 1.6 Mb deletion and a p.(Arg249Met) mutation involving TP53 are oriented in trans.
Collapse
Affiliation(s)
- Hannah E Roberts
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Maria Lopopolo
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Alistair T Pagnamenta
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, UK
| | - Eshita Sharma
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Duncan Parkes
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Lorne Lonie
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Colin Freeman
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Samantha J L Knight
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Department of Epidemiology, University Medical Centre Groningen, Groningen, The Netherlands
| | - Helene Dreau
- Oxford University Hospitals NHS Trust, Oxford, UK
- Department of Haematology, University of Oxford, Oxford, UK
| | - Helen Lockstone
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Jenny C Taylor
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, UK.
| | - Anna Schuh
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, UK.
- Oxford University Hospitals NHS Trust, Oxford, UK.
- Department of Oncology, University of Oxford, Oxford, UK.
| | - Rory Bowden
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - David Buck
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| |
Collapse
|
15
|
Henderson D, Zhu S(J, Cole CB, Lunter G. Demographic inference from multiple whole genomes using a particle filter for continuous Markov jump processes. PLoS One 2021; 16:e0247647. [PMID: 33651801 PMCID: PMC7924771 DOI: 10.1371/journal.pone.0247647] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 02/10/2021] [Indexed: 12/12/2022] Open
Abstract
Demographic events shape a population's genetic diversity, a process described by the coalescent-with-recombination model that relates demography and genetics by an unobserved sequence of genealogies along the genome. As the space of genealogies over genomes is large and complex, inference under this model is challenging. Formulating the coalescent-with-recombination model as a continuous-time and -space Markov jump process, we develop a particle filter for such processes, and use waypoints that under appropriate conditions allow the problem to be reduced to the discrete-time case. To improve inference, we generalise the Auxiliary Particle Filter for discrete-time models, and use Variational Bayes to model the uncertainty in parameter estimates for rare events, avoiding biases seen with Expectation Maximization. Using real and simulated genomes, we show that past population sizes can be accurately inferred over a larger range of epochs than was previously possible, opening the possibility of jointly analyzing multiple genomes under complex demographic models. Code is available at https://github.com/luntergroup/smcsmc.
Collapse
Affiliation(s)
| | - Sha (Joe) Zhu
- Wellcome Centre for Human Genetics, Oxford, United Kingdom
- Big Data Institute, Oxford, United Kingdom
| | - Christopher B. Cole
- MRC Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford, United Kingdom
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford, United Kingdom
- Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| |
Collapse
|
16
|
Schwessinger R, Gosden M, Downes D, Brown RC, Oudelaar AM, Telenius J, Teh YW, Lunter G, Hughes JR. DeepC: predicting 3D genome folding using megabase-scale transfer learning. Nat Methods 2020; 17:1118-1124. [PMID: 33046896 PMCID: PMC7610627 DOI: 10.1038/s41592-020-0960-3] [Citation(s) in RCA: 73] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Accepted: 08/20/2020] [Indexed: 01/29/2023]
Abstract
Predicting the impact of noncoding genetic variation requires interpreting it in the context of three-dimensional genome architecture. We have developed deepC, a transfer-learning-based deep neural network that accurately predicts genome folding from megabase-scale DNA sequence. DeepC predicts domain boundaries at high resolution, learns the sequence determinants of genome folding and predicts the impact of both large-scale structural and single base-pair variations.
Collapse
Affiliation(s)
- Ron Schwessinger
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Matthew Gosden
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Damien Downes
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Richard C Brown
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - A Marieke Oudelaar
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Jelena Telenius
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Yee Whye Teh
- Department of Statistics, University of Oxford, Oxford, UK
| | - Gerton Lunter
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
| | - Jim R Hughes
- MRC Molecular Haematology Unit, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
- MRC WIMM Centre for Computational Biology, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK.
| |
Collapse
|
17
|
Brown RC, Lunter G. An equivariant Bayesian convolutional network predicts recombination hotspots and accurately resolves binding motifs. Bioinformatics 2020; 35:2177-2184. [PMID: 30481258 PMCID: PMC6596897 DOI: 10.1093/bioinformatics/bty964] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Revised: 11/13/2018] [Accepted: 11/26/2018] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Convolutional neural networks (CNNs) have been tremendously successful in many contexts, particularly where training data are abundant and signal-to-noise ratios are large. However, when predicting noisily observed phenotypes from DNA sequence, each training instance is only weakly informative, and the amount of training data is often fundamentally limited, emphasizing the need for methods that make optimal use of training data and any structure inherent in the process. RESULTS Here we show how to combine equivariant networks, a general mathematical framework for handling exact symmetries in CNNs, with Bayesian dropout, a version of Monte Carlo dropout suggested by a reinterpretation of dropout as a variational Bayesian approximation, to develop a model that exhibits exact reverse-complement symmetry and is more resistant to overtraining. We find that this model combines improved prediction consistency with better predictive accuracy compared to standard CNN implementations and state-of-art motif finders. We use our network to predict recombination hotspots from sequence, and identify binding motifs for the recombination-initiation protein PRDM9 previously unobserved in this data, which were recently validated by high-resolution assays. The network achieves a predictive accuracy comparable to that attainable by a direct assay of the H3K4me3 histone mark, a proxy for PRDM9 binding. AVAILABILITY AND IMPLEMENTATION https://github.com/luntergroup/EquivariantNetworks. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
18
|
Fowler A, Galson JD, Trück J, Kelly DF, Lunter G. Inferring B cell specificity for vaccines using a Bayesian mixture model. BMC Genomics 2020; 21:176. [PMID: 32087698 PMCID: PMC7036227 DOI: 10.1186/s12864-020-6571-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Accepted: 02/10/2020] [Indexed: 12/30/2022] Open
Abstract
Background Vaccines have greatly reduced the burden of infectious disease, ranking in their impact on global health second only after clean water. Most vaccines confer protection by the production of antibodies with binding affinity for the antigen, which is the main effector function of B cells. This results in short term changes in the B cell receptor (BCR) repertoire when an immune response is launched, and long term changes when immunity is conferred. Analysis of antibodies in serum is usually used to evaluate vaccine response, however this is limited and therefore the investigation of the BCR repertoire provides far more detail for the analysis of vaccine response. Results Here, we introduce a novel Bayesian model to describe the observed distribution of BCR sequences and the pattern of sharing across time and between individuals, with the goal to identify vaccine-specific BCRs. We use data from two studies to assess the model and estimate that we can identify vaccine-specific BCRs with 69% sensitivity. Conclusion Our results demonstrate that statistical modelling can capture patterns associated with vaccine response and identify vaccine specific B cells in a range of different data sets. Additionally, the B cells we identify as vaccine specific show greater levels of sequence similarity than expected, suggesting that there are additional signals of vaccine response, not currently considered, which could improve the identification of vaccine specific B cells.
Collapse
Affiliation(s)
- Anna Fowler
- Department of Biostatistics, University of Liverpool, Liverpool, UK.
| | - Jacob D Galson
- University Children's Hospital Zurich and the Children's Research Center, University of Zurich, Zurich, Switzerland
| | - Johannes Trück
- University Children's Hospital Zurich and the Children's Research Center, University of Zurich, Zurich, Switzerland
| | - Dominic F Kelly
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| |
Collapse
|
19
|
Abstract
Motivation The Li and Stephens model, which approximates the coalescent describing the pattern of variation in a population, underpins a range of key tools and results in genetics. Although highly efficient compared to the coalescent, standard implementations of this model still cannot deal with the very large reference cohorts that are starting to become available, and practical implementations use heuristics to achieve reasonable runtimes. Results Here I describe a new, exact algorithm (‘fastLS’) that implements the Li and Stephens model and achieves runtimes independent of the size of the reference cohort. Key to achieving this runtime is the use of the Burrows-Wheeler transform, allowing the algorithm to efficiently identify partial haplotype matches across a cohort. I show that the proposed data structure is very similar to, and generalizes, Durbin’s positional Burrows-Wheeler transform.
Collapse
Affiliation(s)
- Gerton Lunter
- University of Oxford, Wellcome Centre for Human Genetics, Oxford, UK
| |
Collapse
|
20
|
Henderson D, Lunter G. Efficient inference in state-space models through adaptive learning in online Monte Carlo expectation maximization. Comput Stat 2019; 35:1319-1344. [PMID: 32764847 PMCID: PMC7382664 DOI: 10.1007/s00180-019-00937-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2018] [Accepted: 11/22/2019] [Indexed: 11/04/2022]
Abstract
Expectation maximization (EM) is a technique for estimating maximum-likelihood parameters of a latent variable model given observed data by alternating between taking expectations of sufficient statistics, and maximizing the expected log likelihood. For situations where sufficient statistics are intractable, stochastic approximation EM (SAEM) is often used, which uses Monte Carlo techniques to approximate the expected log likelihood. Two common implementations of SAEM, Batch EM (BEM) and online EM (OEM), are parameterized by a "learning rate", and their efficiency depend strongly on this parameter. We propose an extension to the OEM algorithm, termed Introspective Online Expectation Maximization (IOEM), which removes the need for specifying this parameter by adapting the learning rate to trends in the parameter updates. We show that our algorithm matches the efficiency of the optimal BEM and OEM algorithms in multiple models, and that the efficiency of IOEM can exceed that of BEM/OEM methods with optimal learning rates when the model has many parameters. Finally we use IOEM to fit two models to a financial time series. A Python implementation is available at https://github.com/luntergroup/IOEM.git.
Collapse
Affiliation(s)
- Donna Henderson
- Wellcome Centre of Human Genetics, University of Oxford, Oxford, OX3 7BN UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, Unversity of Oxford, Oxford, OX3 9DS UK
| |
Collapse
|
21
|
Hoehn KB, Vander Heiden JA, Zhou JQ, Lunter G, Pybus OG, Kleinstein SH. Repertoire-wide phylogenetic models of B cell molecular evolution reveal evolutionary signatures of aging and vaccination. Proc Natl Acad Sci U S A 2019; 116:22664-22672. [PMID: 31636219 PMCID: PMC6842591 DOI: 10.1073/pnas.1906020116] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
In order to produce effective antibodies, B cells undergo rapid somatic hypermutation (SHM) and selection for binding affinity to antigen via a process called affinity maturation. The similarities between this process and evolution by natural selection have led many groups to use phylogenetic methods to characterize the development of immunological memory, vaccination, and other processes that depend on affinity maturation. However, these applications are limited by the fact that most phylogenetic models are designed to be applied to individual lineages comprising genetically diverse sequences, while B cell repertoires often consist of hundreds to thousands of separate low-diversity lineages. Further, several features of affinity maturation violate important assumptions in standard phylogenetic models. Here, we introduce a hierarchical phylogenetic framework that integrates information from all lineages in a repertoire to more precisely estimate model parameters while simultaneously incorporating the unique features of SHM. We demonstrate the power of this repertoire-wide approach by characterizing previously undescribed phenomena in affinity maturation. First, we find evidence consistent with age-related changes in SHM hot-spot targeting. Second, we identify a consistent relationship between increased tree length and signs of increased negative selection, apparent in the repertoires of recently vaccinated subjects and those without any known recent infections or vaccinations. This suggests that B cell lineages shift toward negative selection over time as a general feature of affinity maturation. Our study provides a framework for undertaking repertoire-wide phylogenetic testing of SHM hypotheses and provides a means of characterizing dynamics of mutation and selection during affinity maturation.
Collapse
Affiliation(s)
- Kenneth B Hoehn
- Department of Pathology, Yale School of Medicine, New Haven, CT 06520
| | - Jason A Vander Heiden
- Department of Bioinformatics & Computational Biology, Genentech, South San Francisco, CA 94080
| | - Julian Q Zhou
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06511
| | - Gerton Lunter
- Wellcome Centre for Human Genetics, Oxford OX3 7BN, United Kingdom
| | - Oliver G Pybus
- Department of Zoology, University of Oxford, Oxford OX1 3PS, United Kingdom
| | - Steven H Kleinstein
- Department of Pathology, Yale School of Medicine, New Haven, CT 06520;
- Interdepartmental Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06511
| |
Collapse
|
22
|
Bowden R, Davies RW, Heger A, Pagnamenta AT, de Cesare M, Oikkonen LE, Parkes D, Freeman C, Dhalla F, Patel SY, Popitsch N, Ip CLC, Roberts HE, Salatino S, Lockstone H, Lunter G, Taylor JC, Buck D, Simpson MA, Donnelly P. Sequencing of human genomes with nanopore technology. Nat Commun 2019; 10:1869. [PMID: 31015479 PMCID: PMC6478738 DOI: 10.1038/s41467-019-09637-5] [Citation(s) in RCA: 102] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Accepted: 03/19/2019] [Indexed: 12/17/2022] Open
Abstract
Whole-genome sequencing (WGS) is becoming widely used in clinical medicine in diagnostic contexts and to inform treatment choice. Here we evaluate the potential of the Oxford Nanopore Technologies (ONT) MinION long-read sequencer for routine WGS by sequencing the reference sample NA12878 and the genome of an individual with ataxia-pancytopenia syndrome and severe immune dysregulation. We develop and apply a novel reference panel-free analytical method to infer and then exploit phase information which improves single-nucleotide variant (SNV) calling performance from otherwise modest levels. In the clinical sample, we identify and directly phase two non-synonymous de novo variants in SAMD9L, (OMIM #159550) inferring that they lie on the same paternal haplotype. Whilst consensus SNV-calling error rates from ONT data remain substantially higher than those from short-read methods, we demonstrate the substantial benefits of analytical innovation. Ongoing improvements to base-calling and SNV-calling methodology must continue for nanopore sequencing to establish itself as a primary method for clinical WGS.
Collapse
Affiliation(s)
- Rory Bowden
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Robert W Davies
- Genomics plc, Oxford, OX1 1JD, UK
- Program in Genetics and Genomic Biology and The Centre for Applied Genomics, Hospital for Sick Children, Toronto, M5G 0A4, Canada
| | | | - Alistair T Pagnamenta
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
| | | | - Laura E Oikkonen
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Duncan Parkes
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Colin Freeman
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Fatima Dhalla
- Department of Clinical Immunology, Oxford University Hospitals, Oxford, OX3 9DU, UK
- Developmental Immunology Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, OX3 9DS, UK
| | - Smita Y Patel
- Department of Clinical Immunology, Oxford University Hospitals, Oxford, OX3 9DU, UK
- Clinical Immunology Group, National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
| | - Niko Popitsch
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
- Children's Cancer Research Institute, St. Anna Kinderkrebsforschung, 1090, Vienna, Austria
| | - Camilla L C Ip
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Hannah E Roberts
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Silvia Salatino
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Helen Lockstone
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Gerton Lunter
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- Genomics plc, Oxford, OX1 1JD, UK
| | - Jenny C Taylor
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
- National Institute for Health Research Oxford Biomedical Research Centre, Oxford, OX4 2PG, UK
| | - David Buck
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | | | - Peter Donnelly
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK.
- Genomics plc, Oxford, OX1 1JD, UK.
- Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.
| |
Collapse
|
23
|
Abstract
Background Transposable elements (TEs) are mobile genetic sequences that randomly propagate within their host’s genome. This mobility has the potential to affect gene transcription and cause disease. However, TEs are technically challenging to identify, which complicates efforts to assess the impact of TE insertions on disease. Here we present a targeted sequencing protocol and computational pipeline to identify polymorphic and novel TE insertions using next-generation sequencing: TE-NGS. The method simultaneously targets the three subfamilies that are responsible for the majority of recent TE activity (L1HS, AluYa5/8, and AluYb8/9) thereby obviating the need for multiple experiments and reducing the amount of input material required. Results Here we describe the laboratory protocol and detection algorithm, and a benchmark experiment for the reference genome NA12878. We demonstrate a substantial enrichment for on-target fragments, and high sensitivity and precision to both reference and NA12878-specific insertions. We report 17 previously unreported loci for this individual which are supported by orthogonal long-read evidence, and we identify 1470 polymorphic and novel TEs in 12 additional samples that were previously undocumented in databases of insertion polymorphisms. Conclusions We anticipate that future applications of TE-NGS alongside exome sequencing of patients with sporadic disease will reduce the number of unresolved cases, and improve estimates of the contribution of TEs to human genetic disease. Electronic supplementary material The online version of this article (10.1186/s12864-018-4485-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Erika M Kvikstad
- Wellcome Trust Centre for Human Genetics, Oxford, UK. .,National Institute for Health Research Comprehensive Biomedical Research Centre, Oxford, UK.
| | - Paolo Piazza
- Wellcome Trust Centre for Human Genetics, Oxford, UK.,Department of Medicine, Imperial College London, London, UK
| | - Jenny C Taylor
- Wellcome Trust Centre for Human Genetics, Oxford, UK.,National Institute for Health Research Comprehensive Biomedical Research Centre, Oxford, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, Oxford, UK
| |
Collapse
|
24
|
Fowler A, Mahamdallie S, Ruark E, Seal S, Ramsay E, Clarke M, Uddin I, Wylie H, Strydom A, Lunter G, Rahman N. Accurate clinical detection of exon copy number variants in a targeted NGS panel using DECoN. Wellcome Open Res 2016; 1:20. [PMID: 28459104 PMCID: PMC5409526 DOI: 10.12688/wellcomeopenres.10069.1] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Background: Targeted next generation sequencing (NGS) panels are increasingly being used in clinical genomics to increase capacity, throughput and affordability of gene testing. Identifying whole exon deletions or duplications (termed exon copy number variants, ‘exon CNVs’) in exon-targeted NGS panels has proved challenging, particularly for single exon CNVs. Methods: We developed a tool for the
Detection of
Exon
Copy
Number variants (DECoN), which is optimised for analysis of exon-targeted NGS panels in the clinical setting. We evaluated DECoN performance using 96 samples with independently validated exon CNV data. We performed simulations to evaluate DECoN detection performance of single exon CNVs and to evaluate performance using different coverage levels and sample numbers. Finally, we implemented DECoN in a clinical laboratory that tests
BRCA1 and
BRCA2 with the TruSight Cancer Panel (TSCP). We used DECoN to analyse 1,919 samples, validating exon CNV detections by multiplex ligation-dependent probe amplification (MLPA). Results: In the evaluation set, DECoN achieved 100% sensitivity and 99% specificity for BRCA exon CNVs, including identification of 8 single exon CNVs. DECoN also identified 14/15 exon CNVs in 8 other genes. Simulations of all possible BRCA single exon CNVs gave a mean sensitivity of 98% for deletions and 95% for duplications. DECoN performance remained excellent with different levels of coverage and sample numbers; sensitivity and specificity was >98% with the typical NGS run parameters. In the clinical pipeline, DECoN automatically analyses pools of 48 samples at a time, taking 24 minutes per pool, on average. DECoN detected 24 BRCA exon CNVs, of which 23 were confirmed by MLPA, giving a false discovery rate of 4%. Specificity was 99.7%. Conclusions: DECoN is a fast, accurate, exon CNV detection tool readily implementable in research and clinical NGS pipelines. It has high sensitivity and specificity and acceptable false discovery rate. DECoN is freely available at
www.icr.ac.uk/decon.
Collapse
Affiliation(s)
- Anna Fowler
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Shazia Mahamdallie
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Elise Ruark
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Sheila Seal
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Emma Ramsay
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Matthew Clarke
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK
| | - Imran Uddin
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Harriet Wylie
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Ann Strydom
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Nazneen Rahman
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK.,Cancer Genetics Unit, The Royal Marsden NHS Foundation Trust, Sutton, UK
| |
Collapse
|
25
|
Galson JD, Trück J, Clutterbuck EA, Fowler A, Cerundolo V, Pollard AJ, Lunter G, Kelly DF. Erratum to: B-cell repertoire dynamics after sequential hepatitis B vaccination and evidence for cross-reactive B-cell activation. Genome Med 2016; 8:81. [PMID: 27488676 PMCID: PMC4973058 DOI: 10.1186/s13073-016-0337-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 07/19/2016] [Indexed: 11/10/2022] Open
Affiliation(s)
- Jacob D Galson
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK.
| | - Johannes Trück
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
- Paediatric Immunology, University Children's Hospital Zürich, Zürich, 8032, Switzerland
| | - Elizabeth A Clutterbuck
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
| | - Anna Fowler
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Vincenzo Cerundolo
- Medical Research Council Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, OX3 9DS, UK
| | - Andrew J Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Dominic F Kelly
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
| |
Collapse
|
26
|
Galson JD, Trück J, Clutterbuck EA, Fowler A, Cerundolo V, Pollard AJ, Lunter G, Kelly DF. B-cell repertoire dynamics after sequential hepatitis B vaccination and evidence for cross-reactive B-cell activation. Genome Med 2016; 8:68. [PMID: 27312086 PMCID: PMC4910312 DOI: 10.1186/s13073-016-0322-z] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2016] [Accepted: 05/27/2016] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND A diverse B-cell repertoire is essential for recognition and response to infectious and vaccine antigens. High-throughput sequencing of B-cell receptor (BCR) genes can now be used to study the B-cell repertoire at great depth and may shed more light on B-cell responses than conventional immunological methods. Here, we use high-throughput BCR sequencing to provide novel insight into B-cell dynamics following a primary course of hepatitis B vaccination. METHODS Nine vaccine-naïve participants were administered three doses of hepatitis B vaccine (months 0, 1, and 2 or 7). High-throughput Illumina sequencing of the total BCR repertoire was combined with targeted sequencing of sorted vaccine antigen-enriched B cells to analyze the longitudinal response of both the total and vaccine-specific repertoire after each vaccine. ELISpot was used to determine vaccine-specific cell numbers following each vaccine. RESULTS Deconvoluting the vaccine-specific from total BCR repertoire showed that vaccine-specific sequence clusters comprised <0.1 % of total sequence clusters, and had certain stereotypic features. The vaccine-specific BCR sequence clusters were expanded after each of the three vaccine doses, despite no vaccine-specific B cells being detected by ELISpot after the first vaccine dose. These vaccine-specific BCR clusters detected after the first vaccine dose had distinct properties compared to those detected after subsequent doses; they were more mutated, present at low frequency even prior to vaccination, and appeared to be derived from more mature B cells. CONCLUSIONS These results demonstrate the high-sensitivity of our vaccine-specific BCR analysis approach and suggest an alternative view of the B-cell response to novel antigens. In the response to the first vaccine dose, many vaccine-specific BCR clusters appeared to largely derive from previously activated cross-reactive B cells that have low affinity for the vaccine antigen, and subsequent doses were required to yield higher affinity B cells.
Collapse
Affiliation(s)
- Jacob D Galson
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK.
| | - Johannes Trück
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
- Paediatric Immunology, University Children's Hospital Zürich, Zürich, 8032, Switzerland
| | - Elizabeth A Clutterbuck
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
| | - Anna Fowler
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Vincenzo Cerundolo
- Medical Research Council Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, OX3 9DS, UK
| | - Andrew J Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Dominic F Kelly
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, UK
| |
Collapse
|
27
|
Abstract
B-cell receptors (BCRs) are membrane-bound immunoglobulins that recognize and bind foreign proteins (antigens). BCRs are formed through random somatic changes of germline DNA, creating a vast repertoire of unique sequences that enable individuals to recognize a diverse range of antigens. After encountering antigen for the first time, BCRs undergo a process of affinity maturation, whereby cycles of rapid somatic mutation and selection lead to improved antigen binding. This constitutes an accelerated evolutionary process that takes place over days or weeks. Next-generation sequencing of the gene regions that determine BCR binding has begun to reveal the diversity and dynamics of BCR repertoires in unprecedented detail. Although this new type of sequence data has the potential to revolutionize our understanding of infection dynamics, quantitative analysis is complicated by the unique biology and high diversity of BCR sequences. Models and concepts from molecular evolution and phylogenetics that have been applied successfully to rapidly evolving pathogen populations are increasingly being adopted to study BCR diversity and divergence within individuals. However, BCR dynamics may violate key assumptions of many standard evolutionary methods, as they do not descend from a single ancestor, and experience biased mutation. Here, we review the application of evolutionary models to BCR repertoires and discuss the issues we believe need be addressed for this interdisciplinary field to flourish.
Collapse
Affiliation(s)
- Kenneth B Hoehn
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| | - Anna Fowler
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Oliver G Pybus
- Department of Zoology, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
28
|
Galson JD, Trück J, Fowler A, Clutterbuck EA, Münz M, Cerundolo V, Reinhard C, van der Most R, Pollard AJ, Lunter G, Kelly DF. Analysis of B Cell Repertoire Dynamics Following Hepatitis B Vaccination in Humans, and Enrichment of Vaccine-specific Antibody Sequences. EBioMedicine 2015; 2:2070-9. [PMID: 26844287 PMCID: PMC4703725 DOI: 10.1016/j.ebiom.2015.11.034] [Citation(s) in RCA: 69] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Revised: 11/18/2015] [Accepted: 11/18/2015] [Indexed: 12/20/2022] Open
Abstract
Generating a diverse B cell immunoglobulin repertoire is essential for protection against infection. The repertoire in humans can now be comprehensively measured by high-throughput sequencing. Using hepatitis B vaccination as a model, we determined how the total immunoglobulin sequence repertoire changes following antigen exposure in humans, and compared this to sequences from vaccine-specific sorted cells. Clonal sequence expansions were seen 7 days after vaccination, which correlated with vaccine-specific plasma cell numbers. These expansions caused an increase in mutation, and a decrease in diversity and complementarity-determining region 3 sequence length in the repertoire. We also saw an increase in sequence convergence between participants 14 and 21 days after vaccination, coinciding with an increase of vaccine-specific memory cells. These features allowed development of a model for in silico enrichment of vaccine-specific sequences from the total repertoire. Identifying antigen-specific sequences from total repertoire data could aid our understanding B cell driven immunity, and be used for disease diagnostics and vaccine evaluation.
Collapse
Affiliation(s)
- Jacob D. Galson
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford OX3 7LE, United Kingdom
- Welcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Johannes Trück
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford OX3 7LE, United Kingdom
- Paediatric Immunology, University Children's Hospital, Zürich, 8032, Switzerland
| | - Anna Fowler
- Welcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Elizabeth A. Clutterbuck
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford OX3 7LE, United Kingdom
| | - Márton Münz
- Welcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Vincenzo Cerundolo
- Medical Research Council Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford OX3 9DS, United Kingdom
| | | | | | - Andrew J. Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford OX3 7LE, United Kingdom
| | - Gerton Lunter
- Welcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Dominic F. Kelly
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford OX3 7LE, United Kingdom
| |
Collapse
|
29
|
Galson JD, Clutterbuck EA, Trück J, Ramasamy MN, Münz M, Fowler A, Cerundolo V, Pollard AJ, Lunter G, Kelly DF. BCR repertoire sequencing: different patterns of B-cell activation after two Meningococcal vaccines. Immunol Cell Biol 2015; 93:885-95. [PMID: 25976772 PMCID: PMC4551417 DOI: 10.1038/icb.2015.57] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2015] [Revised: 05/11/2015] [Accepted: 05/11/2015] [Indexed: 12/24/2022]
Abstract
Next-generation sequencing was used to investigate the B-cell receptor heavy chain transcript repertoire of different B-cell subsets (naive, marginal zone (MZ), immunoglobulin M (IgM) memory and IgG memory) at baseline, and of plasma cells (PCs) 7 days following administration of serogroup ACWY meningococcal polysaccharide and protein-polysaccharide conjugate vaccines. Baseline B-cell subsets could be distinguished from each other using a small number of repertoire properties (clonality, mutation from germline and complementarity-determining region 3 (CDR3) length) that were conserved between individuals. However, analyzing the CDR3 amino-acid sequence (which is particularly important for antigen binding) of the baseline subsets showed few sequences shared between individuals. In contrast, day 7 PCs demonstrated nearly 10-fold greater sequence sharing between individuals than the baseline subsets, consistent with the PCs being induced by the vaccine antigen and sharing specificity for a more limited range of epitopes. By annotating PC sequences based on IgG subclass usage and mutation, and also comparing them with the sequences of the baseline cell subsets, we were able to identify different signatures after the polysaccharide and conjugate vaccines. PCs produced after conjugate vaccination were predominantly IgG1, and most related to IgG memory cells. In contrast, after polysaccharide vaccination, the PCs were predominantly IgG2, less mutated and were equally likely to be related to MZ, IgM memory or IgG memory cells. High-throughput B-cell repertoire sequencing thus provides a unique insight into patterns of B-cell activation not possible from more conventional measures of immunogenicity.
Collapse
Affiliation(s)
- Jacob D. Galson
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, United Kingdom
| | - Elizabeth A. Clutterbuck
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, United Kingdom
| | - Johannes Trück
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, United Kingdom
| | - Maheshi N. Ramasamy
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, United Kingdom
| | - Márton Münz
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Anna Fowler
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Vincenzo Cerundolo
- Medical Research Council Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, OX3 9DS, United Kingdom
| | - Andrew J. Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, United Kingdom
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom
| | - Dominic F. Kelly
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Center, Oxford, OX3 7LE, United Kingdom
| |
Collapse
|
30
|
Galson JD, Trück J, Fowler A, Münz M, Cerundolo V, Pollard AJ, Lunter G, Kelly DF. In-Depth Assessment of Within-Individual and Inter-Individual Variation in the B Cell Receptor Repertoire. Front Immunol 2015; 6:531. [PMID: 26528292 PMCID: PMC4601265 DOI: 10.3389/fimmu.2015.00531] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Accepted: 09/28/2015] [Indexed: 11/24/2022] Open
Abstract
High-throughput sequencing of the B cell receptor (BCR) repertoire can provide rapid characterization of the B cell response in a wide variety of applications in health, after vaccination and in infectious, inflammatory and immune-driven disease, and is starting to yield clinical applications. However, the interpretation of repertoire data is compromised by a lack of studies to assess the intra and inter-individual variation in the BCR repertoire over time in healthy individuals. We applied a standardized isotype-specific BCR repertoire deep sequencing protocol to a single highly sampled participant, and then evaluated the method in 9 further participants to comprehensively describe such variation. We assessed total repertoire metrics of mutation, diversity, VJ gene usage and isotype subclass usage as well as tracking specific BCR sequence clusters. There was good assay reproducibility (both in PCR amplification and biological replicates), but we detected striking fluctuations in the repertoire over time that we hypothesize may be due to subclinical immune activation. Repertoire properties were unique for each individual, which could partly be explained by a decrease in IgG2 with age, and genetic differences at the immunoglobulin locus. There was a small repertoire of public clusters (0.5, 0.3, and 1.4% of total IgA, IgG, and IgM clusters, respectively), which was enriched for expanded clusters containing sequences with suspected specificity toward antigens that should have been historically encountered by all participants through prior immunization or infection. We thus provide baseline BCR repertoire information that can be used to inform future study design, and aid in interpretation of results from these studies. Furthermore, our results indicate that BCR repertoire studies could be used to track changes in the public repertoire in and between populations that might relate to population immunity against infectious diseases, and identify the characteristics of inflammatory and immunological diseases.
Collapse
Affiliation(s)
- Jacob D. Galson
- Oxford Vaccine Group, Department of Paediatrics, The NIHR Oxford Biomedical Research Center, University of Oxford, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Johannes Trück
- Oxford Vaccine Group, Department of Paediatrics, The NIHR Oxford Biomedical Research Center, University of Oxford, Oxford, UK
- Paediatric Immunology, University Children’s Hospital, Zürich, Switzerland
| | - Anna Fowler
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Márton Münz
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Vincenzo Cerundolo
- Medical Research Council Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, UK
| | - Andrew J. Pollard
- Oxford Vaccine Group, Department of Paediatrics, The NIHR Oxford Biomedical Research Center, University of Oxford, Oxford, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Dominic F. Kelly
- Oxford Vaccine Group, Department of Paediatrics, The NIHR Oxford Biomedical Research Center, University of Oxford, Oxford, UK
| |
Collapse
|
31
|
Münz M, Ruark E, Renwick A, Ramsay E, Clarke M, Mahamdallie S, Cloke V, Seal S, Strydom A, Lunter G, Rahman N. CSN and CAVA: variant annotation tools for rapid, robust next-generation sequencing analysis in the clinical setting. Genome Med 2015; 7:76. [PMID: 26315209 PMCID: PMC4551696 DOI: 10.1186/s13073-015-0195-6] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 07/02/2015] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Next-generation sequencing (NGS) offers unprecedented opportunities to expand clinical genomics. It also presents challenges with respect to integration with data from other sequencing methods and historical data. Provision of consistent, clinically applicable variant annotation of NGS data has proved difficult, particularly of indels, an important variant class in clinical genomics. Annotation in relation to a reference genome sequence, the DNA strand of coding transcripts and potential alternative variant representations has not been well addressed. Here we present tools that address these challenges to provide rapid, standardized, clinically appropriate annotation of NGS data in line with existing clinical standards. METHODS We developed a clinical sequencing nomenclature (CSN), a fixed variant annotation consistent with the principles of the Human Genome Variation Society (HGVS) guidelines, optimized for automated variant annotation of NGS data. To deliver high-throughput CSN annotation we created CAVA (Clinical Annotation of VAriants), a fast, lightweight tool designed for easy incorporation into NGS pipelines. CAVA allows transcript specification, appropriately accommodates the strand of a gene transcript and flags variants with alternative annotations to facilitate clinical interpretation and comparison with other datasets. We evaluated CAVA in exome data and a clinical BRCA1/BRCA2 gene testing pipeline. RESULTS CAVA generated CSN calls for 10,313,034 variants in the ExAC database in 13.44 hours, and annotated the ICR1000 exome series in 6.5 hours. Evaluation of 731 different indels from a single individual revealed 92 % had alternative representations in left aligned and right aligned data. Annotation of left aligned data, as performed by many annotation tools, would thus give clinically discrepant annotation for the 339 (46 %) indels in genes transcribed from the forward DNA strand. By contrast, CAVA provides the correct clinical annotation for all indels. CAVA also flagged the 370 indels with alternative representations of a different functional class, which may profoundly influence clinical interpretation. CAVA annotation of 50 BRCA1/BRCA2 gene mutations from a clinical pipeline gave 100 % concordance with Sanger data; only 8/25 BRCA2 mutations were correctly clinically annotated by other tools. CONCLUSIONS CAVA is a freely available tool that provides rapid, robust, high-throughput clinical annotation of NGS data, using a standardized clinical sequencing nomenclature.
Collapse
Affiliation(s)
- Márton Münz
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK.
| | - Elise Ruark
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Anthony Renwick
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Emma Ramsay
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Matthew Clarke
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Shazia Mahamdallie
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK. .,TGLclinical, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Victoria Cloke
- TGLclinical, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Sheila Seal
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK. .,TGLclinical, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Ann Strydom
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK. .,TGLclinical, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK.
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK.
| | - Nazneen Rahman
- Division of Genetics & Epidemiology, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK. .,TGLclinical, The Institute of Cancer Research, 15 Cotswold Road, London, SM2 5NG, UK. .,The Royal Marsden NHS Foundation Trust, Downs Road, Sutton, SM2 5PT, UK.
| |
Collapse
|
32
|
Taylor JC, Martin HC, Lise S, Broxholme J, Cazier JB, Rimmer A, Kanapin A, Lunter G, Fiddy S, Allan C, Aricescu AR, Attar M, Babbs C, Becq J, Beeson D, Bento C, Bignell P, Blair E, Buckle VJ, Bull K, Cais O, Cario H, Chapel H, Copley RR, Cornall R, Craft J, Dahan K, Davenport EE, Dendrou C, Devuyst O, Fenwick AL, Flint J, Fugger L, Gilbert RD, Goriely A, Green A, Greger IH, Grocock R, Gruszczyk AV, Hastings R, Hatton E, Higgs D, Hill A, Holmes C, Howard M, Hughes L, Humburg P, Johnson D, Karpe F, Kingsbury Z, Kini U, Knight JC, Krohn J, Lamble S, Langman C, Lonie L, Luck J, McCarthy D, McGowan SJ, McMullin MF, Miller KA, Murray L, Németh AH, Nesbit MA, Nutt D, Ormondroyd E, Oturai AB, Pagnamenta A, Patel SY, Percy M, Petousi N, Piazza P, Piret SE, Polanco-Echeverry G, Popitsch N, Powrie F, Pugh C, Quek L, Robbins PA, Robson K, Russo A, Sahgal N, van Schouwenburg PA, Schuh A, Silverman E, Simmons A, Sørensen PS, Sweeney E, Taylor J, Thakker RV, Tomlinson I, Trebes A, Twigg SR, Uhlig HH, Vyas P, Vyse T, Wall SA, Watkins H, Whyte MP, Witty L, Wright B, Yau C, Buck D, Humphray S, Ratcliffe PJ, Bell JI, Wilkie AO, Bentley D, Donnelly P, McVean G. Factors influencing success of clinical genome sequencing across a broad spectrum of disorders. Nat Genet 2015; 47:717-726. [PMID: 25985138 PMCID: PMC4601524 DOI: 10.1038/ng.3304] [Citation(s) in RCA: 263] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2014] [Accepted: 04/22/2015] [Indexed: 12/12/2022]
Abstract
To assess factors influencing the success of whole-genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases or families across a broad spectrum of disorders in whom previous screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritization. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease-causing variants in 21% of cases, with the proportion increasing to 34% (23/68) for mendelian disorders and 57% (8/14) in family trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, although only 4 were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis but also highlight many outstanding challenges.
Collapse
Affiliation(s)
- Jenny C Taylor
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Hilary C Martin
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Stefano Lise
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - John Broxholme
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | | | - Andy Rimmer
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Alexander Kanapin
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Simon Fiddy
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Chris Allan
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - A Radu Aricescu
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Moustafa Attar
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Christian Babbs
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | | | - David Beeson
- Neurosciences Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Celeste Bento
- Hematology Department, Centro Hospitalar e Universitário de Coimbra, Portugal
| | - Patricia Bignell
- Molecular Haematology Department, Oxford University Hospitals NHS Trust, Oxford, UK
| | - Edward Blair
- Department of Clinical Genetics, Oxford University Hospitals NHS Trust, Oxford, UK
| | - Veronica J Buckle
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Katherine Bull
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- Centre for Cellular and Molecular Physiology, University of Oxford, Oxford, UK
| | - Ondrej Cais
- Neurobiology Division, MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Holger Cario
- Department of Pediatrics and Adolescent Medicine, University Medical Center, Ulm, Germany
| | - Helen Chapel
- Primary Immunodeficiency Unit, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Richard R Copley
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Richard Cornall
- Centre for Cellular and Molecular Physiology, University of Oxford, Oxford, UK
| | - Jude Craft
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Karin Dahan
- Centre de Génétique Humaine, Institut de Génétique et de Pathologie, Gosselies, Belgium
- Cliniques Universitaires Saint-Luc, Université Catholique de Louvain, Brussels, Belgium
| | - Emma E Davenport
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Calliope Dendrou
- MRC Human Immunology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Olivier Devuyst
- Institute of Physiology, Zurich Center for Integrative Human Physiology, University of Zurich, Zurich, Switzerland
| | - Aimée L Fenwick
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Jonathan Flint
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Lars Fugger
- MRC Human Immunology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Rodney D Gilbert
- University Hospital Southampton NHS Foundation Trust, University of Southampton, Southampton, UK
| | - Anne Goriely
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Angie Green
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Ingo H Greger
- Neurobiology Division, MRC Laboratory of Molecular Biology, Cambridge, UK
| | | | - Anja V Gruszczyk
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Robert Hastings
- Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Edouard Hatton
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Doug Higgs
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Adrian Hill
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- The Jenner Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Chris Holmes
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- Department of Statistics, University of Oxford, Oxford, UK
| | - Malcolm Howard
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Linda Hughes
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Peter Humburg
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - David Johnson
- Craniofacial Unit, Department of Plastic and Reconstructive Surgery, Oxford University Hospitals NHS Trust, Oxford, UK
| | - Fredrik Karpe
- Oxford Laboratory for Integrative Physiology, Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Oxford, UK
| | | | - Usha Kini
- Department of Clinical Genetics, Oxford University Hospitals NHS Trust, Oxford, UK
| | - Julian C Knight
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Jonathan Krohn
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Sarah Lamble
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Craig Langman
- Kidney Diseases, Feinberg School of Medicine, Northwestern University and the Ann and Robert H Lurie Children's Hospital of Chicago, Chicago, Illinois, USA
| | - Lorne Lonie
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Joshua Luck
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Davis McCarthy
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Simon J McGowan
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | | | - Kerry A Miller
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Lisa Murray
- Illumina Cambridge Limited, Saffron Walden, UK
| | - Andrea H Németh
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| | - M Andrew Nesbit
- Academic Endocrine Unit, Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Oxford, UK
| | - David Nutt
- Centre for Neuropsychopharmacology, Division of Brain Sciences, Imperial College, London, UK
| | - Elizabeth Ormondroyd
- Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Annette Bang Oturai
- Danish Multiple Sclerosis Center, Department of Neurology, Copenhagen University Hospital, Copenhagen, Denmark
| | - Alistair Pagnamenta
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Smita Y Patel
- Primary Immunodeficiency Unit, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Melanie Percy
- Department of Haematology, Belfast City Hospital, Belfast, UK
| | - Nayia Petousi
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Paolo Piazza
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Sian E Piret
- Academic Endocrine Unit, Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Oxford, UK
| | | | - Niko Popitsch
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Fiona Powrie
- Translational Gastroenterology Unit, University of Oxford, Oxford, UK
| | - Chris Pugh
- Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Lynn Quek
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Peter A Robbins
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, UK
| | - Kathryn Robson
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Alexandra Russo
- Department of Pediatrics, University Hospital, Mainz, Germany
| | - Natasha Sahgal
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | | | - Anna Schuh
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Department of Oncology, University of Oxford, Oxford, UK
| | - Earl Silverman
- Division of Rheumatology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Alison Simmons
- MRC Human Immunology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Translational Gastroenterology Unit, University of Oxford, Oxford, UK
| | - Per Soelberg Sørensen
- Danish Multiple Sclerosis Center, Department of Neurology, Copenhagen University Hospital, Copenhagen, Denmark
| | - Elizabeth Sweeney
- Department of Clinical Genetics, Liverpool Women's NHS Foundation Trust, Liverpool, UK
| | - John Taylor
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Oxford NHS Regional Molecular Genetics Laboratory, Oxford University Hospitals NHS Trust, Oxford, UK
| | - Rajesh V Thakker
- Academic Endocrine Unit, Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Oxford, UK
| | - Ian Tomlinson
- NIHR Comprehensive Biomedical Research Centre, Oxford, UK
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Amy Trebes
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Stephen Rf Twigg
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Holm H Uhlig
- Translational Gastroenterology Unit, University of Oxford, Oxford, UK
| | - Paresh Vyas
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Tim Vyse
- Division of Genetics, King's College London, Guy's Hospital, London, UK
| | - Steven A Wall
- Craniofacial Unit, Department of Plastic and Reconstructive Surgery, Oxford University Hospitals NHS Trust, Oxford, UK
| | - Hugh Watkins
- Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Michael P Whyte
- Center for Metabolic Bone Disease and Molecular Research, Shriners Hospital for Children, St Louis, Missouri, USA
| | - Lorna Witty
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Ben Wright
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Chris Yau
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - David Buck
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | | | | | - John I Bell
- Office of the Regius Professor of Medicine, University of Oxford, Oxford, UK
| | - Andrew Om Wilkie
- Clinical Genetics Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | | | - Peter Donnelly
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
- Department of Statistics, University of Oxford, Oxford, UK
| | - Gilean McVean
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| |
Collapse
|
33
|
Jevons SJ, Green A, Lunter G, Kartsonaki C, Buck D, Piazza P, Kiltie AE. High-throughput DNA Sequencing Identifies Novel CtIP (RBBP8) Variants in Muscle-invasive Bladder Cancer Patients. Bladder Cancer 2015; 1:31-44. [PMID: 30561437 PMCID: PMC6218178 DOI: 10.3233/blc-150007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
BACKGROUND Germline mutations in DNA damage signalling and repair genes predispose individuals to cancer. Rare germline variants may also increase cancer risk and be predictive of outcomes following cancer treatments, but require high-throughput sequencing (HTS) for detection in large cohorts. OBJECTIVE To use a dual indexing system on a HTS platform to detect novel variants in CtIP (RBBP8) which may be associated with clinical outcomes following radiotherapy treatment for bladder cancer. METHODS All exons and flanking introns of CtIP were amplified from germline DNA from bladder cancer patients using seven primer pairs by automated long-range PCR. Amplicons were pooled, fragmented and ligated to adaptor sequences. One of 96 tag sequences was introduced at each end by PCR. Sequencing was performed on a single flow cell of an Illumina MiSeq. Reads were mapped by Stampy and variants called by Platypus. For phasing experiments, target regions were amplified and cloned for Sanger sequencing. RESULTS Of 201 samples, 160 were successfully amplified. Eleven CtIP variants were called, within the exons and 15 bp adjacent intronic DNA, including eight known variants from the 1000 Genomes project, plus three previously unreported variants now confirmed by Sanger sequencing. In two individuals, phasing experiments showed two variants of interest to be on separate alleles, likely to result in stronger impairment of gene function. CONCLUSIONS We have demonstrated proof of principle for dual indexing on 160 samples on one MiSeq flow cell sequencing surface, and show that for the CtIP gene multiplexing of up to 720 samples would provide sufficient coverage to achieve >98% detection power for rare germline variation, reducing HTS costs substantially.
Collapse
Affiliation(s)
- Sarah J. Jevons
- CRUK/MRC Oxford Institute for Radiation Oncology, Department of Oncology, University of Oxford, Oxford, UK
| | - Angela Green
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Christiana Kartsonaki
- CRUK/MRC Oxford Institute for Radiation Oncology, Department of Oncology, University of Oxford, Oxford, UK
| | - David Buck
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Paolo Piazza
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Anne E. Kiltie
- CRUK/MRC Oxford Institute for Radiation Oncology, Department of Oncology, University of Oxford, Oxford, UK
| |
Collapse
|
34
|
Staab PR, Zhu S, Metzler D, Lunter G. scrm: efficiently simulating long sequences using the approximated coalescent with recombination. ACTA ACUST UNITED AC 2015; 31:1680-2. [PMID: 25596205 PMCID: PMC4426833 DOI: 10.1093/bioinformatics/btu861] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2014] [Accepted: 12/23/2014] [Indexed: 11/13/2022]
Abstract
Motivation: Coalescent-based simulation software for genomic sequences allows the efficient in silico generation of short- and medium-sized genetic sequences. However, the simulation of genome-size datasets as produced by next-generation sequencing is currently only possible using fairly crude approximations. Results: We present the sequential coalescent with recombination model (SCRM), a new method that efficiently and accurately approximates the coalescent with recombination, closing the gap between current approximations and the exact model. We present an efficient implementation and show that it can simulate genomic-scale datasets with an essentially correct linkage structure. Availability and implementation: The open source implementation scrm is freely available at https://scrm.github.io under the conditions of the GPLv3 license. Contact:staab@bio.lmu.de or gerton.lunter@well.ox.ac.uk. Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Paul R Staab
- Department of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany and Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Sha Zhu
- Department of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany and Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Dirk Metzler
- Department of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany and Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Gerton Lunter
- Department of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany and Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| |
Collapse
|
35
|
Trück J, Ramasamy MN, Galson JD, Rance R, Parkhill J, Lunter G, Pollard AJ, Kelly DF. Identification of antigen-specific B cell receptor sequences using public repertoire analysis. J Immunol 2014; 194:252-261. [PMID: 25392534 DOI: 10.4049/jimmunol.1401405] [Citation(s) in RCA: 92] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
High-throughput sequencing allows detailed study of the BCR repertoire postimmunization, but it remains unclear to what extent the de novo identification of Ag-specific sequences from the total BCR repertoire is possible. A conjugate vaccine containing Haemophilus influenzae type b (Hib) and group C meningococcal polysaccharides, as well as tetanus toxoid (TT), was used to investigate the BCR repertoire of adult humans following immunization and to test the hypothesis that public or convergent repertoire analysis could identify Ag-specific sequences. A number of Ag-specific BCR sequences have been reported for Hib and TT, which made a vaccine containing these two Ags an ideal immunological stimulus. Analysis of identical CDR3 amino acid sequences that were shared by individuals in the postvaccine repertoire identified a number of known Hib-specific sequences but only one previously described TT sequence. The extension of this analysis to nonidentical, but highly similar, CDR3 amino acid sequences revealed a number of other TT-related sequences. The anti-Hib avidity index postvaccination strongly correlated with the relative frequency of Hib-specific sequences, indicating that the postvaccination public BCR repertoire may be related to more conventional measures of immunogenicity correlating with disease protection. Analysis of public BCR repertoire provided evidence of convergent BCR evolution in individuals exposed to the same Ags. If this finding is confirmed, the public repertoire could be used for rapid and direct identification of protective Ag-specific BCR sequences from peripheral blood.
Collapse
Affiliation(s)
- Johannes Trück
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford, and the NIHR Oxford Biomedical Research Centre, Oxford OX3 7LE, United Kingdom
| | - Maheshi N Ramasamy
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford, and the NIHR Oxford Biomedical Research Centre, Oxford OX3 7LE, United Kingdom
| | - Jacob D Galson
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford, and the NIHR Oxford Biomedical Research Centre, Oxford OX3 7LE, United Kingdom
| | - Richard Rance
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Julian Parkhill
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Gerton Lunter
- The Wellcome Trust Centre for Human Genetics, Oxford OX3 7BN, United Kingdom
| | - Andrew J Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford, and the NIHR Oxford Biomedical Research Centre, Oxford OX3 7LE, United Kingdom
| | - Dominic F Kelly
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford, and the NIHR Oxford Biomedical Research Centre, Oxford OX3 7LE, United Kingdom
| |
Collapse
|
36
|
Rimmer A, Phan H, Mathieson I, Iqbal Z, Twigg SRF, Wilkie AOM, McVean G, Lunter G. Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications. Nat Genet 2014; 46:912-918. [PMID: 25017105 DOI: 10.1038/ng.3036] [Citation(s) in RCA: 689] [Impact Index Per Article: 68.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2013] [Accepted: 06/23/2014] [Indexed: 12/19/2022]
Abstract
High-throughput DNA sequencing technology has transformed genetic research and is starting to make an impact on clinical practice. However, analyzing high-throughput sequencing data remains challenging, particularly in clinical settings where accuracy and turnaround times are critical. We present a new approach to this problem, implemented in a software package called Platypus. Platypus achieves high sensitivity and specificity for SNPs, indels and complex polymorphisms by using local de novo assembly to generate candidate variants, followed by local realignment and probabilistic haplotype estimation. It is an order of magnitude faster than existing tools and generates calls from raw aligned read data without preprocessing. We demonstrate the performance of Platypus in clinically relevant experimental designs by comparing with SAMtools and GATK on whole-genome and exome-capture data, by identifying de novo variation in 15 parent-offspring trios with high sensitivity and specificity, and by estimating human leukocyte antigen genotypes directly from variant calls.
Collapse
Affiliation(s)
- Andy Rimmer
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Hang Phan
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Iain Mathieson
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Zamin Iqbal
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Stephen R F Twigg
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Headington, Oxford, UK
| | | | - Andrew O M Wilkie
- Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Headington, Oxford, UK
| | - Gil McVean
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK.,Department of Statistics, University of Oxford, Oxford, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| |
Collapse
|
37
|
Lamble S, Batty E, Attar M, Buck D, Bowden R, Lunter G, Crook D, El-Fahmawi B, Piazza P. Improved workflows for high throughput library preparation using the transposome-based Nextera system. BMC Biotechnol 2013; 13:104. [PMID: 24256843 PMCID: PMC4222894 DOI: 10.1186/1472-6750-13-104] [Citation(s) in RCA: 128] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 10/25/2013] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND The Nextera protocol, which utilises a transposome based approach to create libraries for Illumina sequencing, requires pure DNA template, an accurate assessment of input concentration and a column clean-up that limits its applicability for high-throughput sample preparation. We addressed the identified limitations to develop a robust workflow that supports both rapid and high-throughput projects also reducing reagent costs. RESULTS We show that an initial bead-based normalisation step can remove the need for quantification and improves sample purity. A 75% cost reduction was achieved with a low-volume modified protocol which was tested over genomes with different GC content to demonstrate its robustness. Finally we developed a custom set of index tags and primers which increase the number of samples that can simultaneously be sequenced on a single lane of an Illumina instrument. CONCLUSIONS We addressed the bottlenecks of Nextera library construction to produce a modified protocol which harnesses the full power of the Nextera kit and allows the reproducible construction of libraries on a high-throughput scale reducing the associated cost of the kit.
Collapse
Affiliation(s)
- Sarah Lamble
- Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK
| | - Elizabeth Batty
- Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK
| | - Moustafa Attar
- Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK
| | - David Buck
- Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK
| | - Rory Bowden
- Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK
| | - Derrick Crook
- Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital, Headley Way, OX3 9DU Oxford, UK
- Oxford NIHR Biomedical Research Centre, John Radcliffe Hospital, Headley Way, OX3 9DU Oxford, UK
| | - Bassam El-Fahmawi
- Axygen Inc., A corning Subsidiary, 33120 Central Avenue, 94587 Union City, CA, USA
| | - Paolo Piazza
- Wellcome Trust Centre for Human Genetics, OX3 7BN Oxford, UK
| |
Collapse
|
38
|
Abstract
MOTIVATION A common question in genomic analysis is whether two sets of genomic intervals overlap significantly. This question arises, for example, when interpreting ChIP-Seq or RNA-Seq data in functional terms. Because genome organization is complex, answering this question is non-trivial. SUMMARY We present Genomic Association Test (GAT), a tool for estimating the significance of overlap between multiple sets of genomic intervals. GAT implements a null model that the two sets of intervals are placed independently of one another, but allows each set's density to depend on external variables, for example, isochore structure or chromosome identity. GAT estimates statistical significance based on simulation and controls for multiple tests using the false discovery rate. AVAILABILITY GAT's source code, documentation and tutorials are available at http://code.google.com/p/genomic-association-tester.
Collapse
Affiliation(s)
- Andreas Heger
- MRC CGAT Programme and Functional Genomics Unit, MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, UK.
| | | | | | | | | |
Collapse
|
39
|
Montgomery SB, Goode DL, Kvikstad E, Albers CA, Zhang ZD, Mu XJ, Ananda G, Howie B, Karczewski KJ, Smith KS, Anaya V, Richardson R, Davis J, MacArthur DG, Sidow A, Duret L, Gerstein M, Makova KD, Marchini J, McVean G, Lunter G. The origin, evolution, and functional impact of short insertion-deletion variants identified in 179 human genomes. Genome Res 2013; 23:749-61. [PMID: 23478400 PMCID: PMC3638132 DOI: 10.1101/gr.148718.112] [Citation(s) in RCA: 163] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Short insertions and deletions (indels) are the second most abundant form of human genetic variation, but our understanding of their origins and functional effects lags behind that of other types of variants. Using population-scale sequencing, we have identified a high-quality set of 1.6 million indels from 179 individuals representing three diverse human populations. We show that rates of indel mutagenesis are highly heterogeneous, with 43%–48% of indels occurring in 4.03% of the genome, whereas in the remaining 96% their prevalence is 16 times lower than SNPs. Polymerase slippage can explain upwards of three-fourths of all indels, with the remainder being mostly simple deletions in complex sequence. However, insertions do occur and are significantly associated with pseudo-palindromic sequence features compatible with the fork stalling and template switching (FoSTeS) mechanism more commonly associated with large structural variations. We introduce a quantitative model of polymerase slippage, which enables us to identify indel-hypermutagenic protein-coding genes, some of which are associated with recurrent mutations leading to disease. Accounting for mutational rate heterogeneity due to sequence context, we find that indels across functional sequence are generally subject to stronger purifying selection than SNPs. We find that indel length modulates selection strength, and that indels affecting multiple functionally constrained nucleotides undergo stronger purifying selection. We further find that indels are enriched in associations with gene expression and find evidence for a contribution of nonsense-mediated decay. Finally, we show that indels can be integrated in existing genome-wide association studies (GWAS); although we do not find direct evidence that potentially causal protein-coding indels are enriched with associations to known disease-associated SNPs, our findings suggest that the causal variant underlying some of these associations may be indels.
Collapse
Affiliation(s)
- Stephen B Montgomery
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, 1211, Switzerland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Bull KR, Rimmer AJ, Siggs OM, Miosge LA, Roots CM, Enders A, Bertram EM, Crockford TL, Whittle B, Potter PK, Simon MM, Mallon AM, Brown SDM, Beutler B, Goodnow CC, Lunter G, Cornall RJ. Unlocking the bottleneck in forward genetics using whole-genome sequencing and identity by descent to isolate causative mutations. PLoS Genet 2013; 9:e1003219. [PMID: 23382690 PMCID: PMC3561070 DOI: 10.1371/journal.pgen.1003219] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2012] [Accepted: 11/20/2012] [Indexed: 12/27/2022] Open
Abstract
Forward genetics screens with N-ethyl-N-nitrosourea (ENU) provide a powerful way to illuminate gene function and generate mouse models of human disease; however, the identification of causative mutations remains a limiting step. Current strategies depend on conventional mapping, so the propagation of affected mice requires non-lethal screens; accurate tracking of phenotypes through pedigrees is complex and uncertain; out-crossing can introduce unexpected modifiers; and Sanger sequencing of candidate genes is inefficient. Here we show how these problems can be efficiently overcome using whole-genome sequencing (WGS) to detect the ENU mutations and then identify regions that are identical by descent (IBD) in multiple affected mice. In this strategy, we use a modification of the Lander-Green algorithm to isolate causative recessive and dominant mutations, even at low coverage, on a pure strain background. Analysis of the IBD regions also allows us to calculate the ENU mutation rate (1.54 mutations per Mb) and to model future strategies for genetic screens in mice. The introduction of this approach will accelerate the discovery of causal variants, permit broader and more informative lethal screens to be used, reduce animal costs, and herald a new era for ENU mutagenesis.
Collapse
Affiliation(s)
- Katherine R. Bull
- Nuffield Department of Medicine and Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
- MRC Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, United Kingdom
| | - Andrew J. Rimmer
- Nuffield Department of Medicine and Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Owen M. Siggs
- Nuffield Department of Medicine and Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
- MRC Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, United Kingdom
| | - Lisa A. Miosge
- Department of Immunology, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
| | - Carla M. Roots
- Department of Immunology, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
| | - Anselm Enders
- Department of Immunology, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
| | - Edward M. Bertram
- Department of Immunology, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
- Australian Phenomics Facility, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
| | - Tanya L. Crockford
- Nuffield Department of Medicine and Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
- MRC Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, United Kingdom
| | - Belinda Whittle
- Australian Phenomics Facility, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
| | | | | | | | | | - Bruce Beutler
- UT Southwestern Medical Center, Dallas, Texas, United States of America
| | - Christopher C. Goodnow
- Department of Immunology, The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
| | - Gerton Lunter
- Nuffield Department of Medicine and Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
| | - Richard J. Cornall
- Nuffield Department of Medicine and Wellcome Trust Centre for Human Genetics, Oxford University, Oxford, United Kingdom
- MRC Human Immunology Unit, Weatherall Institute of Molecular Medicine, Oxford, United Kingdom
| |
Collapse
|
41
|
Mailund T, Halager AE, Westergaard M, Dutheil JY, Munch K, Andersen LN, Lunter G, Prüfer K, Scally A, Hobolth A, Schierup MH. A new isolation with migration model along complete genomes infers very different divergence processes among closely related great ape species. PLoS Genet 2012; 8:e1003125. [PMID: 23284294 PMCID: PMC3527290 DOI: 10.1371/journal.pgen.1003125] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2012] [Accepted: 10/14/2012] [Indexed: 11/18/2022] Open
Abstract
We present a hidden Markov model (HMM) for inferring gradual isolation between two populations during speciation, modelled as a time interval with restricted gene flow. The HMM describes the history of adjacent nucleotides in two genomic sequences, such that the nucleotides can be separated by recombination, can migrate between populations, or can coalesce at variable time points, all dependent on the parameters of the model, which are the effective population sizes, splitting times, recombination rate, and migration rate. We show by extensive simulations that the HMM can accurately infer all parameters except the recombination rate, which is biased downwards. Inference is robust to variation in the mutation rate and the recombination rate over the sequence and also robust to unknown phase of genomes unless they are very closely related. We provide a test for whether divergence is gradual or instantaneous, and we apply the model to three key divergence processes in great apes: (a) the bonobo and common chimpanzee, (b) the eastern and western gorilla, and (c) the Sumatran and Bornean orang-utan. We find that the bonobo and chimpanzee appear to have undergone a clear split, whereas the divergence processes of the gorilla and orang-utan species occurred over several hundred thousands years with gene flow stopping quite recently. We also apply the model to the Homo/Pan speciation event and find that the most likely scenario involves an extended period of gene flow during speciation.
Collapse
Affiliation(s)
- Thomas Mailund
- Bioinformatics Research Center, Aarhus University, Aarhus, Denmark.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Lise S, Clarkson Y, Perkins E, Kwasniewska A, Sadighi Akha E, Parolin Schnekenberg R, Suminaite D, Hope J, Baker I, Gregory L, Green A, Allan C, Lamble S, Jayawant S, Quaghebeur G, Cader MZ, Hughes S, Armstrong RJE, Kanapin A, Rimmer A, Lunter G, Mathieson I, Cazier JB, Buck D, Taylor JC, Bentley D, McVean G, Donnelly P, Knight SJL, Jackson M, Ragoussis J, Németh AH. Recessive mutations in SPTBN2 implicate β-III spectrin in both cognitive and motor development. PLoS Genet 2012; 8:e1003074. [PMID: 23236289 PMCID: PMC3516553 DOI: 10.1371/journal.pgen.1003074] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 09/21/2012] [Indexed: 11/19/2022] Open
Abstract
β-III spectrin is present in the brain and is known to be important in the function of the cerebellum. Heterozygous mutations in SPTBN2, the gene encoding β-III spectrin, cause Spinocerebellar Ataxia Type 5 (SCA5), an adult-onset, slowly progressive, autosomal-dominant pure cerebellar ataxia. SCA5 is sometimes known as "Lincoln ataxia," because the largest known family is descended from relatives of the United States President Abraham Lincoln. Using targeted capture and next-generation sequencing, we identified a homozygous stop codon in SPTBN2 in a consanguineous family in which childhood developmental ataxia co-segregates with cognitive impairment. The cognitive impairment could result from mutations in a second gene, but further analysis using whole-genome sequencing combined with SNP array analysis did not reveal any evidence of other mutations. We also examined a mouse knockout of β-III spectrin in which ataxia and progressive degeneration of cerebellar Purkinje cells has been previously reported and found morphological abnormalities in neurons from prefrontal cortex and deficits in object recognition tasks, consistent with the human cognitive phenotype. These data provide the first evidence that β-III spectrin plays an important role in cortical brain development and cognition, in addition to its function in the cerebellum; and we conclude that cognitive impairment is an integral part of this novel recessive ataxic syndrome, Spectrin-associated Autosomal Recessive Cerebellar Ataxia type 1 (SPARCA1). In addition, the identification of SPARCA1 and normal heterozygous carriers of the stop codon in SPTBN2 provides insights into the mechanism of molecular dominance in SCA5 and demonstrates that the cell-specific repertoire of spectrin subunits underlies a novel group of disorders, the neuronal spectrinopathies, which includes SCA5, SPARCA1, and a form of West syndrome.
Collapse
Affiliation(s)
- Stefano Lise
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- NIHR Biomedical Research Centre Oxford, Oxford, United Kingdom
| | - Yvonne Clarkson
- Centre for Integrative Physiology, Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Edinburgh, United Kingdom
| | - Emma Perkins
- Centre for Integrative Physiology, Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Edinburgh, United Kingdom
| | - Alexandra Kwasniewska
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
| | - Elham Sadighi Akha
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- NIHR Biomedical Research Centre Oxford, Oxford, United Kingdom
| | - Ricardo Parolin Schnekenberg
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- School of Medicine, Universidade Positivo, Curitiba, Brazil
| | - Daumante Suminaite
- Centre for Integrative Physiology, Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Edinburgh, United Kingdom
| | - Jilly Hope
- Centre for Integrative Physiology, Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Edinburgh, United Kingdom
| | - Ian Baker
- Russell Cairns Unit, Oxford University Hospitals NHS Trust, Oxford, United Kingdom
| | - Lorna Gregory
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Angie Green
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Chris Allan
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Sarah Lamble
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Sandeep Jayawant
- Department of Paediatrics, Oxford University Hospitals NHS Trust, Oxford, United Kingdom
| | - Gerardine Quaghebeur
- Department of Neuroradiology, Oxford University Hospitals NHS Trust, Oxford, United Kingdom
| | - M. Zameel Cader
- Department of Anatomy, Physiology, and Genetics, University of Oxford, Oxford, United Kingdom
| | - Sarah Hughes
- Royal Berkshire Foundation Trust Hospital, Reading, United Kingdom
| | - Richard J. E. Armstrong
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
- Royal Berkshire Foundation Trust Hospital, Reading, United Kingdom
| | - Alexander Kanapin
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Andrew Rimmer
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Iain Mathieson
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Jean-Baptiste Cazier
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - David Buck
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Jenny C. Taylor
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- NIHR Biomedical Research Centre Oxford, Oxford, United Kingdom
| | | | - Gilean McVean
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Peter Donnelly
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Samantha J. L. Knight
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- NIHR Biomedical Research Centre Oxford, Oxford, United Kingdom
| | - Mandy Jackson
- Centre for Integrative Physiology, Euan MacDonald Centre for Motor Neurone Disease Research, University of Edinburgh, Edinburgh, United Kingdom
| | - Jiannis Ragoussis
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
| | - Andrea H. Németh
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- NIHR Biomedical Research Centre Oxford, Oxford, United Kingdom
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, United Kingdom
- Department of Clinical Genetics, Oxford University Hospitals NHS Trust, Oxford, United Kingdom
| |
Collapse
|
43
|
Prüfer K, Munch K, Hellmann I, Akagi K, Miller JR, Walenz B, Koren S, Sutton G, Kodira C, Winer R, Knight JR, Mullikin JC, Meader SJ, Ponting CP, Lunter G, Higashino S, Hobolth A, Dutheil J, Karakoç E, Alkan C, Sajjadian S, Catacchio CR, Ventura M, Marques-Bonet T, Eichler EE, André C, Atencia R, Mugisha L, Junhold J, Patterson N, Siebauer M, Good JM, Fischer A, Ptak SE, Lachmann M, Symer DE, Mailund T, Schierup MH, Andrés AM, Kelso J, Pääbo S. The bonobo genome compared with the chimpanzee and human genomes. Nature 2012; 486:527-31. [PMID: 22722832 DOI: 10.1038/nature11128] [Citation(s) in RCA: 284] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2011] [Accepted: 04/05/2012] [Indexed: 01/09/2023]
Abstract
Two African apes are the closest living relatives of humans: the chimpanzee (Pan troglodytes) and the bonobo (Pan paniscus). Although they are similar in many respects, bonobos and chimpanzees differ strikingly in key social and sexual behaviours, and for some of these traits they show more similarity with humans than with each other. Here we report the sequencing and assembly of the bonobo genome to study its evolutionary relationship with the chimpanzee and human genomes. We find that more than three per cent of the human genome is more closely related to either the bonobo or the chimpanzee genome than these are to each other. These regions allow various aspects of the ancestry of the two ape species to be reconstructed. In addition, many of the regions that overlap genes may eventually help us understand the genetic basis of phenotypes that humans share with one of the two apes to the exclusion of the other.
Collapse
Affiliation(s)
- Kay Prüfer
- Max Planck Institute for Evolutionary Anthropology, D-04103 Leipzig, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Westesson O, Lunter G, Paten B, Holmes I. Accurate reconstruction of insertion-deletion histories by statistical phylogenetics. PLoS One 2012; 7:e34572. [PMID: 22536326 PMCID: PMC3335033 DOI: 10.1371/journal.pone.0034572] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2012] [Accepted: 03/05/2012] [Indexed: 11/24/2022] Open
Abstract
The Multiple Sequence Alignment (MSA) is a computational abstraction that represents a partial summary either of indel history, or of structural similarity. Taking the former view (indel history), it is possible to use formal automata theory to generalize the phylogenetic likelihood framework for finite substitution models (Dayhoff's probability matrices and Felsenstein's pruning algorithm) to arbitrary-length sequences. In this paper, we report results of a simulation-based benchmark of several methods for reconstruction of indel history. The methods tested include a relatively new algorithm for statistical marginalization of MSAs that sums over a stochastically-sampled ensemble of the most probable evolutionary histories. For mammalian evolutionary parameters on several different trees, the single most likely history sampled by our algorithm appears less biased than histories reconstructed by other MSA methods. The algorithm can also be used for alignment-free inference, where the MSA is explicitly summed out of the analysis. As an illustration of our method, we discuss reconstruction of the evolutionary histories of human protein-coding genes.
Collapse
Affiliation(s)
- Oscar Westesson
- University of California Berkeley and University of California San Francisco Graduate Program in Bioengineering, University of California, Berkeley, California, United States of America
| | - Gerton Lunter
- Wellcome Trust Center for Human Genetics, Oxford, Oxford, United Kingdom
| | - Benedict Paten
- Baskin School of Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Ian Holmes
- University of California Berkeley and University of California San Francisco Graduate Program in Bioengineering, University of California, Berkeley, California, United States of America
| |
Collapse
|
45
|
Auton A, Fledel-Alon A, Pfeifer S, Venn O, Ségurel L, Street T, Leffler EM, Bowden R, Aneas I, Broxholme J, Humburg P, Iqbal Z, Lunter G, Maller J, Hernandez RD, Melton C, Venkat A, Nobrega MA, Bontrop R, Myers S, Donnelly P, Przeworski M, McVean G. A fine-scale chimpanzee genetic map from population sequencing. Science 2012; 336:193-8. [PMID: 22422862 PMCID: PMC3532813 DOI: 10.1126/science.1216872] [Citation(s) in RCA: 208] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
To study the evolution of recombination rates in apes, we developed methodology to construct a fine-scale genetic map from high-throughput sequence data from 10 Western chimpanzees, Pan troglodytes verus. Compared to the human genetic map, broad-scale recombination rates tend to be conserved, but with exceptions, particularly in regions of chromosomal rearrangements and around the site of ancestral fusion in human chromosome 2. At fine scales, chimpanzee recombination is dominated by hotspots, which show no overlap with those of humans even though rates are similarly elevated around CpG islands and decreased within genes. The hotspot-specifying protein PRDM9 shows extensive variation among Western chimpanzees, and there is little evidence that any sequence motifs are enriched in hotspots. The contrasting locations of hotspots provide a natural experiment, which demonstrates the impact of recombination on base composition.
Collapse
Affiliation(s)
- Adam Auton
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
- Department of Genetics, Albert Einstein College of Medicine, New York, New York, USA
| | - Adi Fledel-Alon
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
| | - Susanne Pfeifer
- Department of Statistics, 1 South Parks Road, University of Oxford, Oxford, OX1 3TG, UK
| | - Oliver Venn
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
| | - Laure Ségurel
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois, USA
| | - Teresa Street
- Department of Statistics, 1 South Parks Road, University of Oxford, Oxford, OX1 3TG, UK
| | - Ellen M. Leffler
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
| | - Rory Bowden
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
- Department of Statistics, 1 South Parks Road, University of Oxford, Oxford, OX1 3TG, UK
- Oxford Biomedical Research Centre, Nuffield Department of Medicine, University of Oxford, Oxford, OX3 9DU, UK
| | - Ivy Aneas
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
| | - John Broxholme
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
| | - Peter Humburg
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
| | - Zamin Iqbal
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
| | - Julian Maller
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
- Department of Statistics, 1 South Parks Road, University of Oxford, Oxford, OX1 3TG, UK
| | - Ryan D. Hernandez
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94143-0912, USA
| | - Cord Melton
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
| | - Aarti Venkat
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois, USA
| | - Marcelo A. Nobrega
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
| | - Ronald Bontrop
- Department of Comparative Genetics and Refinement, Biomedical Primate Research Center, Lange Kleiweg 139 2288 GJ, Rijswijk, Netherlands
| | - Simon Myers
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
- Department of Statistics, 1 South Parks Road, University of Oxford, Oxford, OX1 3TG, UK
| | - Peter Donnelly
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
- Department of Statistics, 1 South Parks Road, University of Oxford, Oxford, OX1 3TG, UK
| | - Molly Przeworski
- Department of Human Genetics, University of Chicago, Chicago, Illinois, USA
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, USA
| | - Gil McVean
- Wellcome Trust Centre for Human Genetics, Roosevelt Drive, Oxford, OX3 7BN, UK
- Department of Statistics, 1 South Parks Road, University of Oxford, Oxford, OX1 3TG, UK
| |
Collapse
|
46
|
Eizirik DL, Sammeth M, Bouckenooghe T, Bottu G, Sisino G, Igoillo-Esteve M, Ortis F, Santin I, Colli ML, Barthson J, Bouwens L, Hughes L, Gregory L, Lunter G, Marselli L, Marchetti P, McCarthy MI, Cnop M. The human pancreatic islet transcriptome: expression of candidate genes for type 1 diabetes and the impact of pro-inflammatory cytokines. PLoS Genet 2012; 8:e1002552. [PMID: 22412385 PMCID: PMC3297576 DOI: 10.1371/journal.pgen.1002552] [Citation(s) in RCA: 342] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2011] [Accepted: 01/10/2012] [Indexed: 01/06/2023] Open
Abstract
Type 1 diabetes (T1D) is an autoimmune disease in which pancreatic beta cells are killed by infiltrating immune cells and by cytokines released by these cells. Signaling events occurring in the pancreatic beta cells are decisive for their survival or death in diabetes. We have used RNA sequencing (RNA–seq) to identify transcripts, including splice variants, expressed in human islets of Langerhans under control conditions or following exposure to the pro-inflammatory cytokines interleukin-1β (IL-1β) and interferon-γ (IFN-γ). Based on this unique dataset, we examined whether putative candidate genes for T1D, previously identified by GWAS, are expressed in human islets. A total of 29,776 transcripts were identified as expressed in human islets. Expression of around 20% of these transcripts was modified by pro-inflammatory cytokines, including apoptosis- and inflammation-related genes. Chemokines were among the transcripts most modified by cytokines, a finding confirmed at the protein level by ELISA. Interestingly, 35% of the genes expressed in human islets undergo alternative splicing as annotated in RefSeq, and cytokines caused substantial changes in spliced transcripts. Nova1, previously considered a brain-specific regulator of mRNA splicing, is expressed in islets and its knockdown modified splicing. 25/41 of the candidate genes for T1D are expressed in islets, and cytokines modified expression of several of these transcripts. The present study doubles the number of known genes expressed in human islets and shows that cytokines modify alternative splicing in human islet cells. Importantly, it indicates that more than half of the known T1D candidate genes are expressed in human islets. This, and the production of a large number of chemokines and cytokines by cytokine-exposed islets, reinforces the concept of a dialog between pancreatic islets and the immune system in T1D. This dialog is modulated by candidate genes for the disease at both the immune system and beta cell level. Pancreatic beta cells are destroyed by the immune system in type 1 diabetes mellitus, causing insulin dependence for life. Candidate genes for diabetes contribute to this process by acting both at the immune system and, as we suggest here, at the pancreatic beta cell level. We have utilized a novel technology, RNA sequencing, to define all transcripts expressed in human pancreatic islets under basal conditions and following exposure to cytokines, pro-inflammatory mediators that contribute to trigger diabetes. Our observations double the number of known genes present in human islets and indicate that >60% of the candidate genes for type 1 diabetes are expressed in beta cells. The data also show that pro-inflammatory cytokines modify alternative splicing in human islets, a process that may generate novel RNAs and proteins recognizable by the immune system. This, taken together with the findings that pancreatic beta cells themselves express and release many cytokines and chemokines (proteins that attract immune cells), indicates that early type 1 diabetes is characterized by a dialog between beta cells and the immune system. We suggest that candidate genes for diabetes function at least in part as “writers” for the beta cell words in this dialog.
Collapse
Affiliation(s)
- Décio L. Eizirik
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
- * E-mail: (DLE); (MC)
| | - Michael Sammeth
- Functional Bioinformatics (FBI), Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain
| | - Thomas Bouckenooghe
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Guy Bottu
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Giorgia Sisino
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Mariana Igoillo-Esteve
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Fernanda Ortis
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Izortze Santin
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Maikel L. Colli
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Jenny Barthson
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Luc Bouwens
- Cell Differentiation Unit, Diabetes Research Centre, Vrije Universiteit Brussel (VUB), Brussels, Belgium
| | - Linda Hughes
- Oxford Centre for Diabetes, Endocrinology, and Metabolism (OCDEM), Churchill Hospital, Oxford, United Kingdom
| | - Lorna Gregory
- Oxford Centre for Diabetes, Endocrinology, and Metabolism (OCDEM), Churchill Hospital, Oxford, United Kingdom
| | - Gerton Lunter
- Oxford Centre for Diabetes, Endocrinology, and Metabolism (OCDEM), Churchill Hospital, Oxford, United Kingdom
| | - Lorella Marselli
- Department of Endocrinology and Metabolism, University of Pisa, Pisa, Italy
| | - Piero Marchetti
- Department of Endocrinology and Metabolism, University of Pisa, Pisa, Italy
| | - Mark I. McCarthy
- Oxford Centre for Diabetes, Endocrinology, and Metabolism (OCDEM), Churchill Hospital, Oxford, United Kingdom
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Oxford NIHR Biomedical Research Centre, Churchill Hospital, Oxford, United Kingdom
| | - Miriam Cnop
- Laboratory of Experimental Medicine, Université Libre de Bruxelles (ULB), Brussels, Belgium
- Division of Endocrinology, Erasmus Hospital, Université Libre de Bruxelles (ULB), Brussels, Belgium
- * E-mail: (DLE); (MC)
| |
Collapse
|
47
|
Scally A, Dutheil JY, Hillier LW, Jordan GE, Goodhead I, Herrero J, Hobolth A, Lappalainen T, Mailund T, Marques-Bonet T, McCarthy S, Montgomery SH, Schwalie PC, Tang YA, Ward MC, Xue Y, Yngvadottir B, Alkan C, Andersen LN, Ayub Q, Ball EV, Beal K, Bradley BJ, Chen Y, Clee CM, Fitzgerald S, Graves TA, Gu Y, Heath P, Heger A, Karakoc E, Kolb-Kokocinski A, Laird GK, Lunter G, Meader S, Mort M, Mullikin JC, Munch K, O'Connor TD, Phillips AD, Prado-Martinez J, Rogers AS, Sajjadian S, Schmidt D, Shaw K, Simpson JT, Stenson PD, Turner DJ, Vigilant L, Vilella AJ, Whitener W, Zhu B, Cooper DN, de Jong P, Dermitzakis ET, Eichler EE, Flicek P, Goldman N, Mundy NI, Ning Z, Odom DT, Ponting CP, Quail MA, Ryder OA, Searle SM, Warren WC, Wilson RK, Schierup MH, Rogers J, Tyler-Smith C, Durbin R. Insights into hominid evolution from the gorilla genome sequence. Nature 2012; 483:169-75. [PMID: 22398555 PMCID: PMC3303130 DOI: 10.1038/nature10842] [Citation(s) in RCA: 457] [Impact Index Per Article: 38.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2011] [Accepted: 01/10/2012] [Indexed: 12/13/2022]
Abstract
Gorillas are humans’ closest living relatives after chimpanzees, and are of comparable importance for the study of human origins and evolution. Here we present the assembly and analysis of a genome sequence for the western lowland gorilla, and compare the whole genomes of all extant great ape genera. We propose a synthesis of genetic and fossil evidence consistent with placing the human-chimpanzee and human-chimpanzee-gorilla speciation events at approximately 6 and 10 million years ago (Mya). In 30% of the genome, gorilla is closer to human or chimpanzee than the latter are to each other; this is rarer around coding genes, indicating pervasive selection throughout great ape evolution, and has functional consequences in gene expression. A comparison of protein coding genes reveals approximately 500 genes showing accelerated evolution on each of the gorilla, human and chimpanzee lineages, and evidence for parallel acceleration, particularly of genes involved in hearing. We also compare the western and eastern gorilla species, estimating an average sequence divergence time 1.75 million years ago, but with evidence for more recent genetic exchange and a population bottleneck in the eastern species. The use of the genome sequence in these and future analyses will promote a deeper understanding of great ape biology and evolution.
Collapse
Affiliation(s)
- Aylwyn Scally
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
MacArthur DG, Balasubramanian S, Frankish A, Huang N, Morris J, Walter K, Jostins L, Habegger L, Pickrell JK, Montgomery SB, Albers CA, Zhang ZD, Conrad DF, Lunter G, Zheng H, Ayub Q, DePristo MA, Banks E, Hu M, Handsaker RE, Rosenfeld JA, Fromer M, Jin M, Mu XJ, Khurana E, Ye K, Kay M, Saunders GI, Suner MM, Hunt T, Barnes IHA, Amid C, Carvalho-Silva DR, Bignell AH, Snow C, Yngvadottir B, Bumpstead S, Cooper DN, Xue Y, Romero IG, Wang J, Li Y, Gibbs RA, McCarroll SA, Dermitzakis ET, Pritchard JK, Barrett JC, Harrow J, Hurles ME, Gerstein MB, Tyler-Smith C. A systematic survey of loss-of-function variants in human protein-coding genes. Science 2012; 335:823-8. [PMID: 22344438 PMCID: PMC3299548 DOI: 10.1126/science.1215040] [Citation(s) in RCA: 869] [Impact Index Per Article: 72.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Genome-sequencing studies indicate that all humans carry many genetic variants predicted to cause loss of function (LoF) of protein-coding genes, suggesting unexpected redundancy in the human genome. Here we apply stringent filters to 2951 putative LoF variants obtained from 185 human genomes to determine their true prevalence and properties. We estimate that human genomes typically contain ~100 genuine LoF variants with ~20 genes completely inactivated. We identify rare and likely deleterious LoF alleles, including 26 known and 21 predicted severe disease-causing variants, as well as common LoF variants in nonessential genes. We describe functional and evolutionary differences between LoF-tolerant and recessive disease genes and a method for using these differences to prioritize candidate genes found in clinical sequencing studies.
Collapse
|
49
|
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R. The variant call format and VCFtools. Bioinformatics 2011. [PMID: 21653522 DOI: 10.1093/bioinformatics/btr3301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
SUMMARY The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. AVAILABILITY http://vcftools.sourceforge.net
Collapse
Affiliation(s)
- Petr Danecek
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R. The variant call format and VCFtools. Bioinformatics 2011; 27:2156-8. [PMID: 21653522 PMCID: PMC3137218 DOI: 10.1093/bioinformatics/btr330] [Citation(s) in RCA: 7769] [Impact Index Per Article: 597.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability:http://vcftools.sourceforge.net Contact:rd@sanger.ac.uk
Collapse
Affiliation(s)
- Petr Danecek
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SA, UK
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|