1
|
Baudrier L, Benamozig O, Langley J, Chopra S, Kalashnikova T, Benaoudia S, Singh G, Mahoney DJ, Wright NAM, Billon P. One-pot DTECT enables rapid and efficient capture of genetic signatures for precision genome editing and clinical diagnostics. CELL REPORTS METHODS 2024; 4:100698. [PMID: 38301655 PMCID: PMC10921016 DOI: 10.1016/j.crmeth.2024.100698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 12/05/2023] [Accepted: 01/09/2024] [Indexed: 02/03/2024]
Abstract
The detection of genomic sequences and their alterations is crucial for basic research and clinical diagnostics. However, current methodologies are costly and time-consuming and require outsourcing sample preparation, processing, and analysis to genomic companies. Here, we establish One-pot DTECT, a platform that expedites the detection of genetic signatures, only requiring a short incubation of a PCR product in an optimized one-pot mixture. One-pot DTECT enables qualitative, quantitative, and visual detection of biologically relevant variants, such as cancer mutations, and nucleotide changes introduced by prime editing and base editing into cancer cells and human primary T cells. Notably, One-pot DTECT achieves quantification accuracy for targeted genetic signatures comparable with Sanger and next-generation sequencing. Furthermore, its effectiveness as a diagnostic platform is demonstrated by successfully detecting sickle cell variants in blood and saliva samples. Altogether, One-pot DTECT offers an efficient, versatile, adaptable, and cost-effective alternative to traditional methods for detecting genomic signatures.
Collapse
Affiliation(s)
- Lou Baudrier
- The University of Calgary, Cumming School of Medicine, Department of Biochemistry and Molecular Biology, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada; Robson DNA Science Centre, Calgary, AB, Canada; Arnie Charbonneau Cancer Institute, Calgary, AB, Canada
| | - Orléna Benamozig
- The University of Calgary, Cumming School of Medicine, Department of Biochemistry and Molecular Biology, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada; Robson DNA Science Centre, Calgary, AB, Canada; Arnie Charbonneau Cancer Institute, Calgary, AB, Canada
| | - Jethro Langley
- The University of Calgary, Cumming School of Medicine, Department of Biochemistry and Molecular Biology, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada; Robson DNA Science Centre, Calgary, AB, Canada; Arnie Charbonneau Cancer Institute, Calgary, AB, Canada
| | - Sanchit Chopra
- The University of Calgary, Cumming School of Medicine, Department of Biochemistry and Molecular Biology, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada; Robson DNA Science Centre, Calgary, AB, Canada; Arnie Charbonneau Cancer Institute, Calgary, AB, Canada
| | - Tatiana Kalashnikova
- Alberta Children's Hospital Research Institute, Calgary, AB, Canada; The University of Calgary, Cumming School of Medicine, Department of Pediatrics, 28 Oki Drive NW, Calgary, AB T3B 6A8, Canada
| | - Sacha Benaoudia
- Arnie Charbonneau Cancer Institute, Calgary, AB, Canada; Alberta Children's Hospital Research Institute, Calgary, AB, Canada
| | - Gurpreet Singh
- Alberta Children's Hospital Research Institute, Calgary, AB, Canada; The University of Calgary, Cumming School of Medicine, Department of Pediatrics, 28 Oki Drive NW, Calgary, AB T3B 6A8, Canada
| | - Douglas J Mahoney
- The University of Calgary, Cumming School of Medicine, Department of Biochemistry and Molecular Biology, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada; Arnie Charbonneau Cancer Institute, Calgary, AB, Canada; Alberta Children's Hospital Research Institute, Calgary, AB, Canada; Snyder Institute for Chronic Disease, Calgary, AB, Canada; Department of Microbiology, Immunology and Infectious Disease, Calgary, AB, Canada
| | - Nicola A M Wright
- Alberta Children's Hospital Research Institute, Calgary, AB, Canada; The University of Calgary, Cumming School of Medicine, Department of Pediatrics, 28 Oki Drive NW, Calgary, AB T3B 6A8, Canada
| | - Pierre Billon
- The University of Calgary, Cumming School of Medicine, Department of Biochemistry and Molecular Biology, 3330 Hospital Drive NW, Calgary, AB T2N 4N1, Canada; Robson DNA Science Centre, Calgary, AB, Canada; Arnie Charbonneau Cancer Institute, Calgary, AB, Canada.
| |
Collapse
|
2
|
Viswan NA, Bhalla US. Understanding molecular signaling cascades in neural disease using multi-resolution models. Curr Opin Neurobiol 2023; 83:102808. [PMID: 37972535 DOI: 10.1016/j.conb.2023.102808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 10/10/2023] [Accepted: 10/19/2023] [Indexed: 11/19/2023]
Abstract
If the genome defines the program for the operations of a cell, signaling networks execute it. These cascades of chemical, cell-biological, structural, and trafficking events span milliseconds (e.g., synaptic release) to potentially a lifetime (e.g., stabilization of dendritic spines). In principle almost every aspect of neuronal function, particularly at the synapse, depends on signaling. Thus dysfunction of these cascades, whether through mutations, local dysregulation, or infection, leads to disease. The sheer complexity of these pathways is matched by the range of diseases and the diversity of their phenotypes. In this review, we discuss how to build computational models, how these models are essential to tackle this complexity, and the benefits of using families of models at different levels of detail to understand signaling in health and disease.
Collapse
Affiliation(s)
- Nisha Ann Viswan
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bellary Road, Bengaluru, 560065, India; The University of Trans-Disciplinary Health Sciences and Technology, Bangalore, India. https://twitter.com/nishanna
| | - Upinder Singh Bhalla
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bellary Road, Bengaluru, 560065, India.
| |
Collapse
|
3
|
Llargués-Sistac G, Bonjoch L, Castellvi-Bel S. HAP1, a new revolutionary cell model for gene editing using CRISPR-Cas9. Front Cell Dev Biol 2023; 11:1111488. [PMID: 36936678 PMCID: PMC10020200 DOI: 10.3389/fcell.2023.1111488] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 02/22/2023] [Indexed: 03/06/2023] Open
Abstract
The use of next-generation sequencing (NGS) technologies has been instrumental in the characterization of the mutational landscape of complex human diseases like cancer. But despite the enormous rise in the identification of disease candidate genetic variants, their functionality is yet to be fully elucidated in order to have a clear implication in patient care. Haploid human cell models have become the tool of choice for functional gene studies, since they only contain one copy of the genome and can therefore show the unmasked phenotype of genetic variants. Over the past few years, the human near-haploid cell line HAP1 has widely been consolidated as one of the favorite cell line models for functional genetic studies. Its rapid turnover coupled with the fact that only one allele needs to be modified in order to express the subsequent desired phenotype has made this human cell line a valuable tool for gene editing by CRISPR-Cas9 technologies. This review examines the recent uses of the HAP1 cell line model in functional genetic studies and high-throughput genetic screens using the CRISPR-Cas9 system. It covers its use in an attempt to develop new and relevant disease models to further elucidate gene function, and create new ways to understand the genetic basis of human diseases. We will cover the advantages and potential of the use of CRISPR-Cas9 technology on HAP1 to easily and efficiently study the functional interpretation of gene function and human single-nucleotide genetic variants of unknown significance identified through NGS technologies, and its implications for changes in clinical practice and patient care.
Collapse
|
4
|
Marchetti F, Corsello G. Genetics and"democracy". Ital J Pediatr 2022; 48:202. [PMID: 36572899 PMCID: PMC9793583 DOI: 10.1186/s13052-022-01391-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 11/27/2022] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND The spread of knowledge on the important implications of a diagnosis of genetic disease does not correspond to a sharing of the knowledge and equal rights of children. MAIN BODY It is estimated that about 5% of newborns may have a rare disease that in some cases, if diagnosed early, could have specific treatments that may be able to modify the natural history of the disease. However, in most countries the diagnosis during the first hours of life is limited to a few diseases, due to the high costs and time required for genetic investigations with classical methods. Recently, experimental projects to subject all newborns to a complete DNA analysis, with Next Generation Sequencing techniques, to detect any genetic pathologies as early as possible, have been reported in some countries. The late diagnosis of some genetic diseases that have treatment plans, such as spinal muscular atrophy, can be a serious damage, for anyone who has seen and accompanied the life of a child with this disease and his/her family, before and after, the recent availability of therapies which, if started very early, can lead to an almost normal life. Rapid sequencing and genetic diagnosis are a crucial part of directing inpatient management and this resource should be accessible not only to academic medical centers but also in community settings. CONCLUSIONS It is time for a profound reflection that places in Italy, as in other countries, the use of genetic tests in neonatal and pediatric age based on principles of evidence, ethics, and democracy and on clear national guidelines, which also consider organizational aspects.
Collapse
Affiliation(s)
- Federico Marchetti
- grid.415207.50000 0004 1760 3756Department of Pediatrics, Santa Maria Delle Croci Hospital, Viale Randi 5, 48121 Ravenna, Italy
| | - Giovanni Corsello
- grid.10776.370000 0004 1762 5517Department of Sciences for Health Promotion and Mother and Child Care ”G. D’Alessandro”, University of Palermo, Palermo, Italy
| |
Collapse
|
5
|
Pan H, Chai W, Liu X, Yu T, Sun L, Yan M. DYNC1H1 regulates NSCLC cell growth and metastasis by IFN-γ-JAK-STAT signaling and is associated with an aberrant immune response. Exp Cell Res 2021; 409:112897. [PMID: 34717919 DOI: 10.1016/j.yexcr.2021.112897] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 10/21/2021] [Accepted: 10/24/2021] [Indexed: 02/07/2023]
Abstract
It is urgent to identify new biomarkers and therapeutic targets to ameliorate the clinical prognosis of patients with lung cancer. The functional significance and molecular mechanism of dynein cytoplasmic 1 heavy chain 1 (DYNC1H1) in nonsmall cell lung cancer (NSCLC) progression is still elusive. In our current study, publicly available data and Western blotting experiments confirmed that DYNC1H1 expression was upregulated in lung cancer samples compared with noncancerous samples. Quantitative real-time PCR (qPCR) results indicated that high DYNC1H1 expression in lung cancer tissues was significantly associated with clinical tumor stage and distal metastasis; moreover, its high expression was negatively correlated with prognosis. Functional experiments demonstrated that DYNC1H1 loss of function caused a significant decrease in cell viability and cell proliferative ability, inhibition of the cell cycle, and promotion of both migration potential and invasion potential in vitro. Animal experiments by tail vein injection of lung cancer cells showed that DYNC1H1 knockdown significantly decreased lung cancer metastasis. Mechanistically, the results from a human protein array showed changes in the IFN-γ-JAK-STAT signaling pathway, and analysis of The Cancer Genome Atlas (TCGA) immune data demonstrated that disturbance of the immune microenvironment might be involved in the impaired growth and metastatic ability mediated by DYNC1H1 loss in NSCLC. DYNC1H1 might serve as a promising biological marker of prognosis and a potential clinical therapeutic target for patients with NSCLC.
Collapse
Affiliation(s)
- Hongyu Pan
- Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Wenjun Chai
- Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Xiaoli Liu
- Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Tao Yu
- Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Lei Sun
- Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| | - Mingxia Yan
- Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| |
Collapse
|
6
|
González-Sánchez JC, Ibrahim MFR, Leist IC, Weise KR, Russell RB. Mechnetor: a web server for exploring protein mechanism and the functional context of genetic variants. Nucleic Acids Res 2021; 49:W366-W374. [PMID: 34076240 PMCID: PMC8262711 DOI: 10.1093/nar/gkab399] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Revised: 04/06/2021] [Accepted: 05/31/2021] [Indexed: 01/02/2023] Open
Abstract
Advances in DNA sequencing and proteomics mean that researchers must now regularly interrogate thousands of positional gene/protein changes in order to find those relevant for potential clinical application or biological insights. The abundance of already known information on protein interactions, mechanism, and tertiary structure provides the possible means to understand these changes rapidly, though a careful and systematic integration of these diverse datasets is first needed. For this purpose, we developed Mechnetor, a tool that allows users to quickly explore and visualize integrated mechanistic data for proteins or interactions of interest. Central to the system is a careful cataloguing of diverse sources of protein interaction mechanism, and an efficient means to visualize interactions between relevant and/or known protein regions. The result is a finer resolution interaction network that provides more immediate clues as to points of intervention or mechanistic understanding. Users can import protein, interactions, genetic variants or post-translational modifications and see these data in the best known mechanistic context. We demonstrate the tool with topical examples in human genetic diseases and cancer genomics. The tool is freely available at: mechnetor.russelllab.org.
Collapse
Affiliation(s)
- Juan Carlos González-Sánchez
- BioQuant, Heidelberg University, Heidelberg 69120, Germany.,Biochemistry Center (BZH), Heidelberg University, Heidelberg 69120, Germany
| | - Mustafa F R Ibrahim
- BioQuant, Heidelberg University, Heidelberg 69120, Germany.,Biochemistry Center (BZH), Heidelberg University, Heidelberg 69120, Germany
| | - Ivo C Leist
- BioQuant, Heidelberg University, Heidelberg 69120, Germany.,Biochemistry Center (BZH), Heidelberg University, Heidelberg 69120, Germany
| | - Kyle R Weise
- BioQuant, Heidelberg University, Heidelberg 69120, Germany.,Biochemistry Center (BZH), Heidelberg University, Heidelberg 69120, Germany
| | - Robert B Russell
- BioQuant, Heidelberg University, Heidelberg 69120, Germany.,Biochemistry Center (BZH), Heidelberg University, Heidelberg 69120, Germany
| |
Collapse
|
7
|
Conde-Cid M, Arias-Estévez M, Núñez-Delgado A. How to study SARS-CoV-2 in soils? ENVIRONMENTAL RESEARCH 2021; 198:110464. [PMID: 33188764 PMCID: PMC7658612 DOI: 10.1016/j.envres.2020.110464] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 10/18/2020] [Accepted: 11/09/2020] [Indexed: 05/08/2023]
Abstract
Wastewater based epidemiology is increasingly being considered as a potentially useful tool for early warning about eventual new COVID-19 outbreaks. In addition, some authors are investigating on the detection and quantification of SARS-CoV-2 in sewage sludge. However, no paper has been published up to date indicating how this virus could be quantified in soil samples. In view of that, we review available data searching for methodological approaches that could guide on the quantification of SARS-CoV-2 (and even other pathogenic microorganisms) in soils.
Collapse
Affiliation(s)
- Manuel Conde-Cid
- Soil Science and Agricultural Chemistry, Faculty of Sciences, University of Vigo, 32004, Ourense, Spain
| | - Manuel Arias-Estévez
- Soil Science and Agricultural Chemistry, Faculty of Sciences, University of Vigo, 32004, Ourense, Spain
| | - Avelino Núñez-Delgado
- Dept. Soil Science and Agricultural Chemistry, Engineering Polytechnic School, University of Santiago de Compostela, Campus Univ, Lugo, Spain.
| |
Collapse
|
8
|
Lemaire M, Noone D, Lapeyraque AL, Licht C, Frémeaux-Bacchi V. Inherited Kidney Complement Diseases. Clin J Am Soc Nephrol 2021; 16:942-956. [PMID: 33536243 PMCID: PMC8216622 DOI: 10.2215/cjn.11830720] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
In the past 20 years, we have witnessed tremendous advances in our ability to diagnose and treat genetic diseases of the kidney caused by complement dysregulation. Staggering progress was realized toward a better understanding of the genetic underpinnings and pathophysiology of many forms of atypical hemolytic uremic syndrome (aHUS) and C3-dominant glomerulopathies that are driven by complement system abnormalities. Many of these seminal discoveries paved the way for the design and characterization of several innovative therapies, some of which have already radically improved patients' outcomes. This review offers a broad overview of the exciting developments that have occurred in the recent past, with a particular focus on single-gene (or Mendelian), complement-driven aHUS and C3-dominant glomerulopathies that should be of interest to both nephrologists and kidney researchers. The discussion is restricted to genes with robust associations with both aHUS and C3-dominant glomerulopathies (complement factor H, complement component 3, complement factor H-related proteins) or only aHUS (complement factor B, complement factor I, and membrane cofactor protein). Key questions and challenges are highlighted, along with potential avenues for future directions.
Collapse
Affiliation(s)
- Mathieu Lemaire
- Division of Nephrology, The Hospital for Sick Children, Toronto, Ontario, Canada,Cell Biology Program, SickKids Research Institute, Toronto, Ontario, Canada,Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Damien Noone
- Division of Nephrology, The Hospital for Sick Children, Toronto, Ontario, Canada,Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Anne-Laure Lapeyraque
- Division of Nephrology, Sainte-Justine University Hospital Center, Montreal, Quebec, Canada,Department of Pediatrics, Faculty of Medicine, University of Montréal, Québec, Canada
| | - Christoph Licht
- Division of Nephrology, The Hospital for Sick Children, Toronto, Ontario, Canada,Cell Biology Program, SickKids Research Institute, Toronto, Ontario, Canada,Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Véronique Frémeaux-Bacchi
- Assistance Publique-Hôpitaux de Paris, Hôpital Européen Georges Pompidou, Laboratory of Immunology, Paris, France
| |
Collapse
|
9
|
Lemaire M, Noone D, Lapeyraque AL, Licht C, Frémeaux-Bacchi V. Inherited Kidney Complement Diseases. CLINICAL JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY : CJASN 2021. [PMID: 33536243 DOI: 10.2215/cjn.11830720)] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
In the past 20 years, we have witnessed tremendous advances in our ability to diagnose and treat genetic diseases of the kidney caused by complement dysregulation. Staggering progress was realized toward a better understanding of the genetic underpinnings and pathophysiology of many forms of atypical hemolytic uremic syndrome (aHUS) and C3-dominant glomerulopathies that are driven by complement system abnormalities. Many of these seminal discoveries paved the way for the design and characterization of several innovative therapies, some of which have already radically improved patients' outcomes. This review offers a broad overview of the exciting developments that have occurred in the recent past, with a particular focus on single-gene (or Mendelian), complement-driven aHUS and C3-dominant glomerulopathies that should be of interest to both nephrologists and kidney researchers. The discussion is restricted to genes with robust associations with both aHUS and C3-dominant glomerulopathies (complement factor H, complement component 3, complement factor H-related proteins) or only aHUS (complement factor B, complement factor I, and membrane cofactor protein). Key questions and challenges are highlighted, along with potential avenues for future directions.
Collapse
Affiliation(s)
- Mathieu Lemaire
- Division of Nephrology, The Hospital for Sick Children, Toronto, Ontario, Canada .,Cell Biology Program, SickKids Research Institute, Toronto, Ontario, Canada.,Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Damien Noone
- Division of Nephrology, The Hospital for Sick Children, Toronto, Ontario, Canada.,Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Anne-Laure Lapeyraque
- Division of Nephrology, Sainte-Justine University Hospital Center, Montreal, Quebec, Canada.,Department of Pediatrics, Faculty of Medicine, University of Montréal, Québec, Canada
| | - Christoph Licht
- Division of Nephrology, The Hospital for Sick Children, Toronto, Ontario, Canada.,Cell Biology Program, SickKids Research Institute, Toronto, Ontario, Canada.,Department of Paediatrics, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Véronique Frémeaux-Bacchi
- Assistance Publique-Hôpitaux de Paris, Hôpital Européen Georges Pompidou, Laboratory of Immunology, Paris, France
| |
Collapse
|
10
|
Remec ZI, Trebusak Podkrajsek K, Repic Lampret B, Kovac J, Groselj U, Tesovnik T, Battelino T, Debeljak M. Next-Generation Sequencing in Newborn Screening: A Review of Current State. Front Genet 2021; 12:662254. [PMID: 34122514 PMCID: PMC8188483 DOI: 10.3389/fgene.2021.662254] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 04/13/2021] [Indexed: 12/27/2022] Open
Abstract
Newborn screening was first introduced at the beginning of the 1960s with the successful implementation of the first phenylketonuria screening programs. Early expansion of the included disorders was slow because each additional disorder screened required a separate test. Subsequently, the technological advancements of biochemical methodology enabled the scaling-up of newborn screening, most notably with the implementation of tandem mass spectrometry. In recent years, we have witnessed a remarkable progression of high-throughput sequencing technologies, which has resulted in a continuous decrease of both cost and time required for genetic analysis. This has enabled more widespread use of the massive multiparallel sequencing. Genomic sequencing is now frequently used in clinical applications, and its implementation in newborn screening has been intensively advocated. The expansion of newborn screening has raised many clinical, ethical, legal, psychological, sociological, and technological concerns over time. This review provides an overview of the current state of next-generation sequencing regarding newborn screening including current recommendations and potential challenges for the use of such technologies in newborn screening.
Collapse
Affiliation(s)
- Ziga I. Remec
- Clinical Institute for Special Laboratory Diagnostics, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Katarina Trebusak Podkrajsek
- Clinical Institute for Special Laboratory Diagnostics, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
- Faculty of Medicine, Institute of Biochemistry and Molecular Genetics, University of Ljubljana, Ljubljana, Slovenia
| | - Barbka Repic Lampret
- Clinical Institute for Special Laboratory Diagnostics, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Jernej Kovac
- Clinical Institute for Special Laboratory Diagnostics, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Urh Groselj
- Department of Endocrinology, Diabetes and Metabolic Diseases, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
- Chair of Pediatrics, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Tine Tesovnik
- Clinical Institute for Special Laboratory Diagnostics, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
| | - Tadej Battelino
- Department of Endocrinology, Diabetes and Metabolic Diseases, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
- Chair of Pediatrics, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Marusa Debeljak
- Clinical Institute for Special Laboratory Diagnostics, University Children’s Hospital, University Medical Centre Ljubljana, Ljubljana, Slovenia
- Faculty of Medicine, Institute of Biochemistry and Molecular Genetics, University of Ljubljana, Ljubljana, Slovenia
| |
Collapse
|
11
|
Conde-Cid M, Arias-Estévez M, Núñez-Delgado A. SARS-CoV-2 and other pathogens could be determined in liquid samples from soils. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2021; 273:116445. [PMID: 33454620 PMCID: PMC7789913 DOI: 10.1016/j.envpol.2021.116445] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 12/14/2020] [Accepted: 12/27/2020] [Indexed: 05/14/2023]
Abstract
In the current COVID-19 pandemic, SARS-CoV-2 has been quantified in wastewater in various countries, and wastewater based epidemiology has been proposed as a potential early warning tool for new outbreaks. However, even taking into account that poorly treated wastewater and sewage sludge may be spread on soils, there is no published paper dealing with the quantification of the virus in soil-related liquid samples, as could be runoff, leachates, or soil solution. To fill this gap, the authors of this piece propose reflections on the development of a methodological approach for the quantification of SARS-CoV-2 (and eventually other pathogens) in soil-related liquid samples.
Collapse
Affiliation(s)
- Manuel Conde-Cid
- Soil Science and Agricultural Chemistry, Faculty of Sciences, University of Vigo, 32004, Ourense, Spain
| | - Manuel Arias-Estévez
- Soil Science and Agricultural Chemistry, Faculty of Sciences, University of Vigo, 32004, Ourense, Spain
| | - Avelino Núñez-Delgado
- Dept. Soil Science and Agricultural Chemistry, Engineering Polytechnic School, University of Santiago de Compostela, Campus Univ. Lugo, Spain.
| |
Collapse
|
12
|
Albright MBN, Thompson J, Kroeger ME, Johansen R, Ulrich DEM, Gallegos-Graves LV, Munsky B, Dunbar J. Differences in substrate use linked to divergent carbon flow during litter decomposition. FEMS Microbiol Ecol 2020; 96:5867763. [DOI: 10.1093/femsec/fiaa135] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Accepted: 07/02/2020] [Indexed: 12/20/2022] Open
Abstract
ABSTRACT
Discovering widespread microbial processes that create variation in soil carbon (C) cycling within ecosystems may improve soil C modeling. Toward this end, we screened 206 soil communities decomposing plant litter in a common garden microcosm environment and examined features linked to divergent patterns of C flow. C flow was measured as carbon dioxide (CO2) and dissolved organic carbon (DOC) from 44-days of litter decomposition. Two large groups of microbial communities representing ‘high’ and ‘low’ DOC phenotypes from original soil and 44-day microcosm samples were down-selected for fungal and bacterial profiling. Metatranscriptomes were also sequenced from a smaller subset of communities in each group. The two groups exhibited differences in average rate of CO2 production, demonstrating that the divergent patterns of C flow arose from innate functional constraints on C metabolism, not a time-dependent artefact. To infer functional constraints, we identified features – traits at the organism, pathway or gene level – linked to the high and low DOC phenotypes using RNA-Seq approaches and machine learning approaches. Substrate use differed across the high and low DOC phenotypes. Additional features suggested that divergent patterns of C flow may be driven in part by differences in organism interactions that affect DOC abundance directly or indirectly by controlling community structure.
Collapse
Affiliation(s)
- Michaeline B N Albright
- Biosciences Division, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
| | - Jaron Thompson
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, USA
| | - Marie E Kroeger
- Biosciences Division, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
| | - Renee Johansen
- Biosciences Division, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
| | - Danielle E M Ulrich
- Biosciences Division, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
| | | | - Brian Munsky
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523, USA
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO 80523, USA
| | - John Dunbar
- Biosciences Division, Los Alamos National Laboratory, Mailstop M888, Los Alamos, NM 87545, USA
| |
Collapse
|
13
|
Clement K, Hsu JY, Canver MC, Joung JK, Pinello L. Technologies and Computational Analysis Strategies for CRISPR Applications. Mol Cell 2020; 79:11-29. [PMID: 32619467 PMCID: PMC7497852 DOI: 10.1016/j.molcel.2020.06.012] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2019] [Revised: 03/12/2020] [Accepted: 06/05/2020] [Indexed: 12/21/2022]
Abstract
The CRISPR-Cas system offers a programmable platform for eukaryotic genome and epigenome editing. The ability to perform targeted genetic and epigenetic perturbations enables researchers to perform a variety of tasks, ranging from investigating questions in basic biology to potentially developing novel therapeutics for the treatment of disease. While CRISPR systems have been engineered to target DNA and RNA with increased precision, efficiency, and flexibility, assays to identify off-target editing are becoming more comprehensive and sensitive. Furthermore, techniques to perform high-throughput genome and epigenome editing can be paired with a variety of readouts and are uncovering important cellular functions and mechanisms. These technological advances drive and are driven by accompanying computational approaches. Here, we briefly present available CRISPR technologies and review key computational advances and considerations for various CRISPR applications. In particular, we focus on the analysis of on- and off-target editing and CRISPR pooled screen data.
Collapse
Affiliation(s)
- Kendell Clement
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA; Department of Pathology, Harvard Medical School, Boston, MA, USA; The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jonathan Y Hsu
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA; Department of Pathology, Harvard Medical School, Boston, MA, USA; The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Matthew C Canver
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA; Department of Pathology, Harvard Medical School, Boston, MA, USA; The Broad Institute of MIT and Harvard, Cambridge, MA, USA; Department of Pathology and Laboratory Medicine, New York-Presbyterian Hospital, Weill Cornell Medicine, New York, NY, USA
| | - J Keith Joung
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA; Department of Pathology, Harvard Medical School, Boston, MA, USA
| | - Luca Pinello
- Molecular Pathology Unit, Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA, USA; Department of Pathology, Harvard Medical School, Boston, MA, USA; The Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
14
|
Peng X, Li HD, Wu FX, Wang J. Identifying the tissues-of-origin of circulating cell-free DNAs is a promising way in noninvasive diagnostics. Brief Bioinform 2020; 22:5840077. [PMID: 32427285 DOI: 10.1093/bib/bbaa060] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 03/16/2020] [Accepted: 03/25/2020] [Indexed: 12/18/2022] Open
Abstract
Advances in sequencing technologies facilitate personalized disease-risk profiling and clinical diagnosis. In recent years, some great progress has been made in noninvasive diagnoses based on cell-free DNAs (cfDNAs). It exploits the fact that dead cells release DNA fragments into the circulation, and some DNA fragments carry information that indicates their tissues-of-origin (TOOs). Based on the signals used for identifying the TOOs of cfDNAs, the existing methods can be classified into three categories: cfDNA mutation-based methods, methylation pattern-based methods and cfDNA fragmentation pattern-based methods. In cfDNA mutation-based methods, the SNP information or the detected mutations in driven genes of certain diseases are employed to identify the TOOs of cfDNAs. Methylation pattern-based methods are developed to identify the TOOs of cfDNAs based on the tissue-specific methylation patterns. In cfDNA fragmentation pattern-based methods, cfDNA fragmentation patterns, such as nucleosome positioning or preferred end coordinates of cfDNAs, are used to predict the TOOs of cfDNAs. In this paper, the strategies and challenges in each category are reviewed. Furthermore, the representative applications based on the TOOs of cfDNAs, including noninvasive prenatal testing, noninvasive cancer screening, transplantation rejection monitoring and parasitic infection detection, are also reviewed. Moreover, the challenges and future work in identifying the TOOs of cfDNAs are discussed. Our research provides a comprehensive picture of the development and challenges in identifying the TOOs of cfDNAs, which may benefit bioinformatics researchers to develop new methods to improve the identification of the TOOs of cfDNAs.
Collapse
|
15
|
DNA Microsystems for Biodiagnosis. MICROMACHINES 2020; 11:mi11040445. [PMID: 32340280 PMCID: PMC7231314 DOI: 10.3390/mi11040445] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 04/21/2020] [Accepted: 04/22/2020] [Indexed: 12/16/2022]
Abstract
Researchers are continuously making progress towards diagnosis and treatment of numerous diseases. However, there are still major issues that are presenting many challenges for current medical diagnosis. On the other hand, DNA nanotechnology has evolved significantly over the last three decades and is highly interdisciplinary. With many potential technologies derived from the field, it is natural to begin exploring and incorporating its knowledge to develop DNA microsystems for biodiagnosis in order to help address current obstacles, such as disease detection and drug resistance. Here, current challenges in disease detection are presented along with standard methods for diagnosis. Then, a brief overview of DNA nanotechnology is introduced along with its main attractive features for constructing biodiagnostic microsystems. Lastly, suggested DNA-based microsystems are discussed through proof-of-concept demonstrations with improvement strategies for standard diagnostic approaches.
Collapse
|
16
|
Iancu D, Ashton E. Inherited Renal Tubulopathies-Challenges and Controversies. Genes (Basel) 2020; 11:genes11030277. [PMID: 32150856 PMCID: PMC7140864 DOI: 10.3390/genes11030277] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 02/29/2020] [Accepted: 02/29/2020] [Indexed: 12/23/2022] Open
Abstract
Electrolyte homeostasis is maintained by the kidney through a complex transport function mostly performed by specialized proteins distributed along the renal tubules. Pathogenic variants in the genes encoding these proteins impair this function and have consequences on the whole organism. Establishing a genetic diagnosis in patients with renal tubular dysfunction is a challenging task given the genetic and phenotypic heterogeneity, functional characteristics of the genes involved and the number of yet unknown causes. Part of these difficulties can be overcome by gathering large patient cohorts and applying high-throughput sequencing techniques combined with experimental work to prove functional impact. This approach has led to the identification of a number of genes but also generated controversies about proper interpretation of variants. In this article, we will highlight these challenges and controversies.
Collapse
Affiliation(s)
- Daniela Iancu
- UCL-Centre for Nephrology, Royal Free Campus, University College London, Rowland Hill Street, London NW3 2PF, UK
- Correspondence: ; Tel.: +44-2381204172; Fax: +44-020-74726476
| | - Emma Ashton
- Rare & Inherited Disease Laboratory, London North Genomic Laboratory Hub, Great Ormond Street Hospital for Children National Health Service Foundation Trust, Levels 4-6 Barclay House 37, Queen Square, London WC1N 3BH, UK;
| |
Collapse
|
17
|
Brosh RM, Matson SW. History of DNA Helicases. Genes (Basel) 2020; 11:genes11030255. [PMID: 32120966 PMCID: PMC7140857 DOI: 10.3390/genes11030255] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Revised: 02/18/2020] [Accepted: 02/20/2020] [Indexed: 12/13/2022] Open
Abstract
Since the discovery of the DNA double helix, there has been a fascination in understanding the molecular mechanisms and cellular processes that account for: (i) the transmission of genetic information from one generation to the next and (ii) the remarkable stability of the genome. Nucleic acid biologists have endeavored to unravel the mysteries of DNA not only to understand the processes of DNA replication, repair, recombination, and transcription but to also characterize the underlying basis of genetic diseases characterized by chromosomal instability. Perhaps unexpectedly at first, DNA helicases have arisen as a key class of enzymes to study in this latter capacity. From the first discovery of ATP-dependent DNA unwinding enzymes in the mid 1970's to the burgeoning of helicase-dependent pathways found to be prevalent in all kingdoms of life, the story of scientific discovery in helicase research is rich and informative. Over four decades after their discovery, we take this opportunity to provide a history of DNA helicases. No doubt, many chapters are left to be written. Nonetheless, at this juncture we are privileged to share our perspective on the DNA helicase field - where it has been, its current state, and where it is headed.
Collapse
Affiliation(s)
- Robert M. Brosh
- Section on DNA Helicases, Laboratory of Molecular Gerontology, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA
- Correspondence: (R.M.B.J.); (S.W.M.); Tel.: +1-410-558-8578 (R.M.B.J.); +1-919-962-0005 (S.W.M.)
| | - Steven W. Matson
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Correspondence: (R.M.B.J.); (S.W.M.); Tel.: +1-410-558-8578 (R.M.B.J.); +1-919-962-0005 (S.W.M.)
| |
Collapse
|
18
|
Cho Y, Lee S, Hong JH, Kim BJ, Hong WY, Jung J, Lee HB, Sung J, Kim HN, Kim HL, Jung J. Development of the variant calling algorithm, ADIScan, and its use to estimate discordant sequences between monozygotic twins. Nucleic Acids Res 2019; 46:e92. [PMID: 29873758 PMCID: PMC6125643 DOI: 10.1093/nar/gky445] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2017] [Accepted: 05/15/2018] [Indexed: 12/30/2022] Open
Abstract
Calling variants from next-generation sequencing (NGS) data or discovering discordant sequences between two NGS data sets is challenging. We developed a computer algorithm, ADIScan1, to call variants by comparing the fractions of allelic reads in a tester to the universal reference genome. We then created ADIScan2 by modifying the algorithm to directly compare two sets of NGS data and predict discordant sequences between two testers. ADIScan1 detected >99.7% of variants called by GATK with an additional 724 393 SNVs. ADIScan2 identified ∼500 candidates of discordant sequences in each of two pairs of the monozygotic twins. About 200 of these candidates were included in the ∼2800 predicted by VarScan2. We verified 66 true discordant sequences among the candidates that ADIScan2 and VarScan2 exclusively predicted. ADIScan2 detected many discordant sequences overlooked by VarScan2 and Mutect, which specialize in detecting low frequency mutations in genetically heterogeneous cancerous tissues. Numbers of verified sequences alone were >5 times more than expected based on recently estimated mutation rates from whole genome sequences. Estimated post-zygotic mutation rates were 1.68 × 10−7 in this study. ADIScan1 and 2 would complement existing tools in screening causative mutations of diverse genetic diseases and comparing two sets of genome sequences, respectively.
Collapse
Affiliation(s)
- Yangrae Cho
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon 34025, Republic of Korea.,DFTBA, CALS, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Sunho Lee
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon 34025, Republic of Korea.,School of Computer Science and Engineering, Seoul National University, Seoul, 151-742, Republic of Korea
| | - Jong Hui Hong
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon 34025, Republic of Korea.,Research Institute of Pharmaceutical Sciences, College of Pharmacy, Seoul National University, Seoul 08826, Republic of Korea
| | - Byong Joon Kim
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon 34025, Republic of Korea
| | - Woon-Young Hong
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon 34025, Republic of Korea
| | - Jongcheol Jung
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon 34025, Republic of Korea
| | - Hyang Burm Lee
- DFTBA, CALS, Chonnam National University, Gwangju 61186, Republic of Korea
| | - Joohon Sung
- Complex Disease and Genome Epidemiology Branch, Department of Epidemiology, School of Public Health, Seoul National University, Seoul 08826, Republic of Korea
| | - Han-Na Kim
- Department of Biochemistry, School of Medicine, Ewha Woman's University, Seoul 07985, Republic of Korea
| | - Hyung-Lae Kim
- Department of Biochemistry, School of Medicine, Ewha Woman's University, Seoul 07985, Republic of Korea
| | - Jongsun Jung
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon 34025, Republic of Korea
| |
Collapse
|
19
|
Ergun MA, Citirik M, Bilgili G, Ergun SG, Polat G. A novel RP1 mutation demonstrated in a Turkish family with autosomal recessive retinitis pigmentosa. GENE REPORTS 2018. [DOI: 10.1016/j.genrep.2018.01.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
20
|
Piñeiro-Yáñez E, Reboiro-Jato M, Gómez-López G, Perales-Patón J, Troulé K, Rodríguez JM, Tejero H, Shimamura T, López-Casas PP, Carretero J, Valencia A, Hidalgo M, Glez-Peña D, Al-Shahrour F. PanDrugs: a novel method to prioritize anticancer drug treatments according to individual genomic data. Genome Med 2018; 10:41. [PMID: 29848362 PMCID: PMC5977747 DOI: 10.1186/s13073-018-0546-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2017] [Accepted: 05/04/2018] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Large-sequencing cancer genome projects have shown that tumors have thousands of molecular alterations and their frequency is highly heterogeneous. In such scenarios, physicians and oncologists routinely face lists of cancer genomic alterations where only a minority of them are relevant biomarkers to drive clinical decision-making. For this reason, the medical community agrees on the urgent need of methodologies to establish the relevance of tumor alterations, assisting in genomic profile interpretation, and, more importantly, to prioritize those that could be clinically actionable for cancer therapy. RESULTS We present PanDrugs, a new computational methodology to guide the selection of personalized treatments in cancer patients using the variant lists provided by genome-wide sequencing analyses. PanDrugs offers the largest database of drug-target associations available from well-known targeted therapies to preclinical drugs. Scoring data-driven gene cancer relevance and drug feasibility PanDrugs interprets genomic alterations and provides a prioritized evidence-based list of anticancer therapies. Our tool represents the first drug prescription strategy applying a rational based on pathway context, multi-gene markers impact and information provided by functional experiments. Our approach has been systematically applied to TCGA patients and successfully validated in a cancer case study with a xenograft mouse model demonstrating its utility. CONCLUSIONS PanDrugs is a feasible method to identify potentially druggable molecular alterations and prioritize drugs to facilitate the interpretation of genomic landscape and clinical decision-making in cancer patients. Our approach expands the search of druggable genomic alterations from the concept of cancer driver genes to the druggable pathway context extending anticancer therapeutic options beyond already known cancer genes. The methodology is public and easily integratable with custom pipelines through its programmatic API or its docker image. The PanDrugs webtool is freely accessible at http://www.pandrugs.org .
Collapse
Affiliation(s)
- Elena Piñeiro-Yáñez
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
| | - Miguel Reboiro-Jato
- Computer Science Department - University of Vigo, Vigo, Spain
- Biomedical Research Centre (CINBIO), Vigo, Spain
| | - Gonzalo Gómez-López
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
| | - Javier Perales-Patón
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
| | - Kevin Troulé
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
| | | | - Héctor Tejero
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
| | - Takeshi Shimamura
- Loyola University Chicago Stritch School of Medicine, Maywood, IL, USA
| | - Pedro Pablo López-Casas
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
| | - Julián Carretero
- Department of Physiology - University of Valencia, Valencia, Spain
| | - Alfonso Valencia
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
| | - Manuel Hidalgo
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain
- Beth Israel Deaconess Medical Center, Boston, USA
| | - Daniel Glez-Peña
- Computer Science Department - University of Vigo, Vigo, Spain
- Biomedical Research Centre (CINBIO), Vigo, Spain
| | - Fátima Al-Shahrour
- Spanish National Cancer Research Centre (CNIO), 3rd Melchor Fernandez Almagro st., E-28029, Madrid, Spain.
| |
Collapse
|
21
|
Li J, Shi L, Zhang K, Zhang Y, Hu S, Zhao T, Teng H, Li X, Jiang Y, Ji L, Sun Z. VarCards: an integrated genetic and clinical database for coding variants in the human genome. Nucleic Acids Res 2018; 46:D1039-D1048. [PMID: 29112736 PMCID: PMC5753295 DOI: 10.1093/nar/gkx1039] [Citation(s) in RCA: 141] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 10/16/2017] [Accepted: 10/18/2017] [Indexed: 12/24/2022] Open
Abstract
A growing number of genomic tools and databases were developed to facilitate the interpretation of genomic variants, particularly in coding regions. However, these tools are separately available in different online websites or databases, making it challenging for general clinicians, geneticists and biologists to obtain the first-hand information regarding some particular variants and genes of interest. Starting with coding regions and splice sties, we artificially generated all possible single nucleotide variants (n = 110 154 363) and cataloged all reported insertion and deletions (n = 1 223 370). We then annotated these variants with respect to functional consequences from more than 60 genomic data sources to develop a database, named VarCards (http://varcards.biols.ac.cn/), by which users can conveniently search, browse and annotate the variant- and gene-level implications of given variants, including the following information: (i) functional effects; (ii) functional consequences through different in silico algorithms; (iii) allele frequencies in different populations; (iv) disease- and phenotype-related knowledge; (v) general meaningful gene-level information; and (vi) drug-gene interactions. As a case study, we successfully employed VarCards in interpretation of de novo mutations in autism spectrum disorders. In conclusion, VarCards provides an intuitive interface of necessary information for researchers to prioritize candidate variations and genes.
Collapse
Affiliation(s)
- Jinchen Li
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, Hunan 410078, China
- Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
| | - Leisheng Shi
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Kun Zhang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Yi Zhang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Shanshan Hu
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Tingting Zhao
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Huajing Teng
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| | - Xianfeng Li
- Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| | - Yi Jiang
- Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410078, China
| | - Liying Ji
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
| | - Zhongsheng Sun
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, Zhejiang 325025, China
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing 100101, China
| |
Collapse
|
22
|
Kleyner R, Malcolmson J, Tegay D, Ward K, Maughan A, Maughan G, Nelson L, Wang K, Robison R, Lyon GJ. KBG syndrome involving a single-nucleotide duplication in ANKRD11. Cold Spring Harb Mol Case Stud 2017; 2:a001131. [PMID: 27900361 PMCID: PMC5111005 DOI: 10.1101/mcs.a001131] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
KBG syndrome is a rare autosomal dominant genetic condition characterized by neurological involvement and distinct facial, hand, and skeletal features. More than 70 cases have been reported; however, it is likely that KBG syndrome is underdiagnosed because of lack of comprehensive characterization of the heterogeneous phenotypic features. We describe the clinical manifestations in a male currently 13 years of age, who exhibited symptoms including epilepsy, severe developmental delay, distinct facial features, and hand anomalies, without a positive genetic diagnosis. Subsequent exome sequencing identified a novel de novo heterozygous single base pair duplication (c.6015dupA) in ANKRD11, which was validated by Sanger sequencing. This single-nucleotide duplication is predicted to lead to a premature stop codon and loss of function in ANKRD11, thereby implicating it as contributing to the proband's symptoms and yielding a molecular diagnosis of KBG syndrome. Before molecular diagnosis, this syndrome was not recognized in the proband, as several key features of the disorder were mild and were not recognized by clinicians, further supporting the concept of variable expressivity in many disorders. Although a diagnosis of cerebral folate deficiency has also been given, its significance for the proband's condition remains uncertain.
Collapse
Affiliation(s)
- Robert Kleyner
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Janet Malcolmson
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA;; Genetic Counseling Graduate Program, Long Island University (LIU), Brookville, New York 11548, USA
| | - David Tegay
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Kenneth Ward
- Affiliated Genetics, Inc., Salt Lake City, Utah 84109, USA
| | | | - Glenn Maughan
- KBG Syndrome Foundation, West Jordan, Utah 84088, USA
| | - Lesa Nelson
- Affiliated Genetics, Inc., Salt Lake City, Utah 84109, USA
| | - Kai Wang
- Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, California 90089, USA;; Department of Psychiatry & Behavioral Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California 90033, USA;; Utah Foundation for Biomedical Research, Salt Lake City, Utah 84107, USA
| | - Reid Robison
- Utah Foundation for Biomedical Research, Salt Lake City, Utah 84107, USA
| | - Gholson J Lyon
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA;; Utah Foundation for Biomedical Research, Salt Lake City, Utah 84107, USA
| |
Collapse
|
23
|
Gosalia N, Economides AN, Dewey FE, Balasubramanian S. MAPPIN: a method for annotating, predicting pathogenicity and mode of inheritance for nonsynonymous variants. Nucleic Acids Res 2017; 45:10393-10402. [PMID: 28977528 PMCID: PMC5737764 DOI: 10.1093/nar/gkx730] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Accepted: 08/21/2017] [Indexed: 01/24/2023] Open
Abstract
Nonsynonymous single nucleotide variants (nsSNVs) constitute about 50% of known disease-causing mutations and understanding their functional impact is an area of active research. Existing algorithms predict pathogenicity of nsSNVs; however, they are unable to differentiate heterozygous, dominant disease-causing variants from heterozygous carrier variants that lead to disease only in the homozygous state. Here, we present MAPPIN (Method for Annotating, Predicting Pathogenicity, and mode of Inheritance for Nonsynonymous variants), a prediction method which utilizes a random forest algorithm to distinguish between nsSNVs with dominant, recessive, and benign effects. We apply MAPPIN to a set of Mendelian disease-causing mutations and accurately predict pathogenicity for all mutations. Furthermore, MAPPIN predicts mode of inheritance correctly for 70.3% of nsSNVs. MAPPIN also correctly predicts pathogenicity for 87.3% of mutations from the Deciphering Developmental Disorders Study with a 78.5% accuracy for mode of inheritance. When tested on a larger collection of mutations from the Human Gene Mutation Database, MAPPIN is able to significantly discriminate between mutations in known dominant and recessive genes. Finally, we demonstrate that MAPPIN outperforms CADD and Eigen in predicting disease inheritance modes for all validation datasets. To our knowledge, MAPPIN is the first nsSNV pathogenicity prediction algorithm that provides mode of inheritance predictions, adding another layer of information for variant prioritization.
Collapse
Affiliation(s)
- Nehal Gosalia
- Regeneron Genetics Center, Tarrytown, NY 10591, USA.,Regeneron Pharmaceuticals, Tarrytown, NY 10591, USA
| | - Aris N Economides
- Regeneron Genetics Center, Tarrytown, NY 10591, USA.,Regeneron Pharmaceuticals, Tarrytown, NY 10591, USA
| | | | | |
Collapse
|
24
|
Cho Y, Lee CH, Jeong EG, Kim MH, Hong JH, Ko Y, Lee B, Yun G, Kim BJ, Jung J, Jung J, Lee JS. Prevalence of Rare Genetic Variations and Their Implications in NGS-data Interpretation. Sci Rep 2017; 7:9810. [PMID: 28851938 PMCID: PMC5574920 DOI: 10.1038/s41598-017-09247-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 07/21/2017] [Indexed: 12/30/2022] Open
Abstract
Next-generation sequencing (NGS) technology has improved enough to discover mutations associated with genetic diseases. Our study evaluated the feasibility of targeted NGS as a primary screening tool to detect causal variants and subsequently predict genetic diseases. We performed parallel computations on 3.7-megabase-targeted regions to detect disease-causing mutations in 103 participants consisting of 81 patients and 22 controls. Data analysis of the participants took about 6 hours using local databases and 200 nodes of a supercomputer. All variants in the selected genes led on average to 3.6 putative diseases for each patient while variants restricted to disease-causing genes identified the correct disease. Notably, only 12% of predicted causal variants were recorded as causal mutations in public databases: 88% had no or insufficient records. In this study, most genetic diseases were caused by rare mutations and public records were inadequate. Most rare variants, however, were not associated with genetic diseases. These data implied that novel, rare variants should not be ignored but interpreted in conjunction with additional clinical data. This step is needed so appropriate advice can be given to primary doctors and parents, thus fulfilling the purpose of this method as a primary screen for rare genetic diseases.
Collapse
Affiliation(s)
- Yangrae Cho
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
- DFTBA, CALS, Chonnam National University, Gwangju, 61186, Republic of Korea
| | - Chul-Ho Lee
- Department of Clinical Genetics, Department of Pediatrics, Yonsei University College of Medicine, Seoul, 03722, Republic of Korea
| | - Eun-Goo Jeong
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
| | - Min-Ho Kim
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
| | - Jong Hui Hong
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
| | - Younhee Ko
- Department of Clinical Genetics, Department of Pediatrics, Yonsei University College of Medicine, Seoul, 03722, Republic of Korea
- Department of Biomedical Engineering, Hankuk University of Foreign Studies, Yongin-si, Gyeonggi-do, 17035, Republic of Korea
| | - Bomnun Lee
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
| | - Gilly Yun
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
| | - Byong Joon Kim
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
| | - Jongcheol Jung
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea
| | - Jongsun Jung
- Syntekabio Incorporated, Techno-2ro B-512, Yuseong-gu, Daejeon, 34025, Republic of Korea.
| | - Jin-Sung Lee
- Department of Clinical Genetics, Department of Pediatrics, Yonsei University College of Medicine, Seoul, 03722, Republic of Korea.
| |
Collapse
|
25
|
de la Campa EÁ, Padilla N, de la Cruz X. Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence. BMC Genomics 2017; 18:569. [PMID: 28812538 PMCID: PMC5558188 DOI: 10.1186/s12864-017-3914-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Background Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider their results as supporting evidence in medical decision processes. Application of this simple rule certainly decreases the error rate of in silico pathogenicity assignments. However, when predictors disagree this rule results in the rejection of potentially valuable information for a number of variants. In this work, we focus on these variants of the protein sequence and develop specific predictors to help improve the success rate of their annotation. Results We have used a set of 59,442 protein sequence variants (15,723 pathological and 43,719 neutral) from 228 proteins to identify those cases for which pathogenicity predictors disagree. We have repeated this process for all the possible combinations of five known methods (SIFT, PolyPhen-2, PON-P2, CADD and MutationTaster2). For each resulting subset we have trained a specific pathogenicity predictor. We find that these specific predictors are able to discriminate between neutral and pathogenic variants, with a success rate different from random. They tend to outperform the constitutive methods but this trend decreases as the performance of the constitutive predictor improves (e.g. with PON-P2 and PolyPhen-2). We also find that specific methods outperform standard consensus methods (Condel and CAROL). Conclusion Focusing development efforts on the case of variants for which known methods disagree we may obtain pathogenicity predictors with improved performances. Although we have not yet reached the success rate that allows the use of this computational evidence in a clinical setting, the simplicity of the approach indicates that more advanced methods may reach this goal in a close future. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3914-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Elena Álvarez de la Campa
- Research Unit in Translational Bioinformatics, Vall d'Hebron Institute of Research (VHIR), Universitat Autònoma de Barcelona, Barcelona, Spain.,Department of Molecular Genomics, Instituto de Biología Molecular de Barcelona (IBMB), Consejo Superior de Investigaciones Científicas (CSIC), Barcelona, Spain
| | - Natàlia Padilla
- Research Unit in Translational Bioinformatics, Vall d'Hebron Institute of Research (VHIR), Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Xavier de la Cruz
- Research Unit in Translational Bioinformatics, Vall d'Hebron Institute of Research (VHIR), Universitat Autònoma de Barcelona, Barcelona, Spain. .,ICREA, Barcelona, Spain.
| |
Collapse
|
26
|
Li Q, Wang K. InterVar: Clinical Interpretation of Genetic Variants by the 2015 ACMG-AMP Guidelines. Am J Hum Genet 2017; 100:267-280. [PMID: 28132688 DOI: 10.1016/j.ajhg.2017.01.004] [Citation(s) in RCA: 623] [Impact Index Per Article: 89.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Accepted: 12/30/2016] [Indexed: 12/14/2022] Open
Abstract
In 2015, the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) published updated standards and guidelines for the clinical interpretation of sequence variants with respect to human diseases on the basis of 28 criteria. However, variability between individual interpreters can be extensive because of reasons such as the different understandings of these guidelines and the lack of standard algorithms for implementing them, yet computational tools for semi-automated variant interpretation are not available. To address these problems, we propose a suite of methods for implementing these criteria and have developed a tool called InterVar to help human reviewers interpret the clinical significance of variants. InterVar can take a pre-annotated or VCF file as input and generate automated interpretation on 18 criteria. Furthermore, we have developed a companion web server, wInterVar, to enable user-friendly variant interpretation with an automated interpretation step and a manual adjustment step. These tools are especially useful for addressing severe congenital or very early-onset developmental disorders with high penetrance. Using results from a few published sequencing studies, we demonstrate the utility of InterVar in significantly reducing the time to interpret the clinical significance of sequence variants.
Collapse
|
27
|
Ergun MA, Bilgili G, Hamurcu U, Ertan A. Whole exome sequencing reveals a mutation in an osteogenesis imperfecta patient. Meta Gene 2017. [DOI: 10.1016/j.mgene.2016.08.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
28
|
Mitsui SN, Yasue A, Masuda K, Naruto T, Minegishi Y, Oyadomari S, Noji S, Imoto I, Tanaka E. Novel human mutation and CRISPR/Cas genome-edited mice reveal the importance of C-terminal domain of MSX1 in tooth and palate development. Sci Rep 2016; 6:38398. [PMID: 27917906 PMCID: PMC5137164 DOI: 10.1038/srep38398] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2016] [Accepted: 11/08/2016] [Indexed: 01/08/2023] Open
Abstract
Several mutations, located mainly in the MSX1 homeodomain, have been identified in non-syndromic tooth agenesis predominantly affecting premolars and third molars. We identified a novel frameshift mutation of the highly conserved C-terminal domain of MSX1, known as Msx homology domain 6 (MH6), in a Japanese family with non-syndromic tooth agenesis. To investigate the importance of MH6 in tooth development, Msx1 was targeted in mice with CRISPR/Cas system. Although heterozygous MH6 disruption did not alter craniofacial development, homozygous mice exhibited agenesis of lower incisors with or without cleft palate at E16.5. In addition, agenesis of the upper third molars and the lower second and third molars were observed in 4-week-old mutant mice. Although the upper second molars were present, they were abnormally small. These results suggest that the C-terminal domain of MSX1 is important for tooth and palate development, and demonstrate that that CRISPR/Cas system can be used as a tool to assess causality of human disorders in vivo and to study the importance of conserved domains in genes.
Collapse
Affiliation(s)
- Silvia Naomi Mitsui
- Department of Orthodontics and Dentofacial Orthopedics, Institute of Biomedical Sciences, Tokushima University Graduate School, 3-18-15 Kuramoto-cho, Tokushima 770-8504, Japan.,Division of Molecular Biology, Institute of Advanced Enzyme Research, Tokushima University, 3-18-15 Kuramoto-cho, Tokushima 770-8503, Japan
| | - Akihiro Yasue
- Department of Orthodontics and Dentofacial Orthopedics, Institute of Biomedical Sciences, Tokushima University Graduate School, 3-18-15 Kuramoto-cho, Tokushima 770-8504, Japan
| | - Kiyoshi Masuda
- Department of Human Genetics, Institute of Biomedical Sciences, Tokushima University Graduate School, 3-18-15 Kuramoto-cho, Tokushima 770-8503, Japan
| | - Takuya Naruto
- Department of Human Genetics, Institute of Biomedical Sciences, Tokushima University Graduate School, 3-18-15 Kuramoto-cho, Tokushima 770-8503, Japan
| | - Yoshiyuki Minegishi
- Division of Molecular Medicine, Institute of Advanced Enzyme Research, Tokushima University, 3-18-15 Kuramoto-cho, Tokushima 770-8503, Japan
| | - Seiichi Oyadomari
- Division of Molecular Biology, Institute of Advanced Enzyme Research, Tokushima University, 3-18-15 Kuramoto-cho, Tokushima 770-8503, Japan
| | - Sumihare Noji
- Tokushima University, 2-24 Shinkura-cho, Tokushima 770-8501, Japan
| | - Issei Imoto
- Department of Human Genetics, Institute of Biomedical Sciences, Tokushima University Graduate School, 3-18-15 Kuramoto-cho, Tokushima 770-8503, Japan
| | - Eiji Tanaka
- Department of Orthodontics and Dentofacial Orthopedics, Institute of Biomedical Sciences, Tokushima University Graduate School, 3-18-15 Kuramoto-cho, Tokushima 770-8504, Japan
| |
Collapse
|
29
|
do Valle ÍF, Giampieri E, Simonetti G, Padella A, Manfrini M, Ferrari A, Papayannidis C, Zironi I, Garonzi M, Bernardi S, Delledonne M, Martinelli G, Remondini D, Castellani G. Optimized pipeline of MuTect and GATK tools to improve the detection of somatic single nucleotide polymorphisms in whole-exome sequencing data. BMC Bioinformatics 2016; 17:341. [PMID: 28185561 PMCID: PMC5123378 DOI: 10.1186/s12859-016-1190-7] [Citation(s) in RCA: 75] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Detecting somatic mutations in whole exome sequencing data of cancer samples has become a popular approach for profiling cancer development, progression and chemotherapy resistance. Several studies have proposed software packages, filters and parametrizations. However, many research groups reported low concordance among different methods. We aimed to develop a pipeline which detects a wide range of single nucleotide mutations with high validation rates. We combined two standard tools - Genome Analysis Toolkit (GATK) and MuTect - to create the GATK-LODN method. As proof of principle, we applied our pipeline to exome sequencing data of hematological (Acute Myeloid and Acute Lymphoblastic Leukemias) and solid (Gastrointestinal Stromal Tumor and Lung Adenocarcinoma) tumors. We performed experiments on simulated data to test the sensitivity and specificity of our pipeline. RESULTS The software MuTect presented the highest validation rate (90 %) for mutation detection, but limited number of somatic mutations detected. The GATK detected a high number of mutations but with low specificity. The GATK-LODN increased the performance of the GATK variant detection (from 5 of 14 to 3 of 4 confirmed variants), while preserving mutations not detected by MuTect. However, GATK-LODN filtered more variants in the hematological samples than in the solid tumors. Experiments in simulated data demonstrated that GATK-LODN increased both specificity and sensitivity of GATK results. CONCLUSION We presented a pipeline that detects a wide range of somatic single nucleotide variants, with good validation rates, from exome sequencing data of cancer samples. We also showed the advantage of combining standard algorithms to create the GATK-LODN method, that increased specificity and sensitivity of GATK results. This pipeline can be helpful in discovery studies aimed to profile the somatic mutational landscape of cancer genomes.
Collapse
Affiliation(s)
- Ítalo Faria do Valle
- Department of Physics and Astronomy, University of Bologna, Bologna, Italy
- CAPES Foundation, Ministry of Education of Brazil, Brasília, DF, Brazil
| | - Enrico Giampieri
- Department of Physics and Astronomy, University of Bologna, Bologna, Italy
| | - Giorgia Simonetti
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Antonella Padella
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Marco Manfrini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Anna Ferrari
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Cristina Papayannidis
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Isabella Zironi
- Department of Physics and Astronomy, University of Bologna, Bologna, Italy
| | - Marianna Garonzi
- Department of Biotechnology, University of Verona, Verona, Italy
| | - Simona Bernardi
- Unit of Blood Diseases and Stem Cell Transplantation, Department of Clinical and Experimental Sciences, University of Brescia, Brescia, Italy
| | - Massimo Delledonne
- Department of Biotechnology, University of Verona, Verona, Italy
- Personal Genomics, Verona, Italy
| | - Giovanni Martinelli
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Daniel Remondini
- Department of Physics and Astronomy, University of Bologna, Bologna, Italy.
| | - Gastone Castellani
- Department of Physics and Astronomy, University of Bologna, Bologna, Italy
| |
Collapse
|
30
|
Bertier G, Hétu M, Joly Y. Unsolved challenges of clinical whole-exome sequencing: a systematic literature review of end-users' views. BMC Med Genomics 2016; 9:52. [PMID: 27514372 PMCID: PMC4982236 DOI: 10.1186/s12920-016-0213-6] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 07/28/2016] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Whole-exome sequencing (WES) consists in the capture, sequencing and analysis of all exons in the human genome. Originally developed in the research context, this technology is now increasingly used clinically to inform patient care. The implementation of WES into healthcare poses significant organizational, regulatory, and ethical hurdles, which are widely discussed in the literature. METHODS In order to inform future policy decisions on the integration of WES into standard clinical practice, we performed a systematic literature review to identify the most important challenges directly reported by technology users. RESULTS Out of 2094 articles, we selected and analyzed 147 which reported a total of 23 different challenges linked to the production, analysis, reporting and sharing of patients' WES data. Interpretation of variants of unknown significance, incidental findings, and the cost and reimbursement of WES-based tests were the most reported challenges across all articles. CONCLUSIONS WES is already used in the clinical setting, and may soon be considered the standard of care for specific medical conditions. Yet, technology users are calling for certain standards and guidelines to be published before this technology replaces more focused approaches such as gene panels sequencing. In addition, a number of infrastructural adjustments will have to be made for clinics to store, process and analyze the amounts of data produced by WES.
Collapse
Affiliation(s)
- Gabrielle Bertier
- Center of Genomics and Policy, McGill University, 740 Dr. Penfield Avenue, Montreal, Quebec H3A 0G1 Canada
- UMR 1027, Inserm, University of Toulouse III - Paul Sabatier, 37 allées Jules Guesde, F-31000 Toulouse, France
| | - Martin Hétu
- Center of Genomics and Policy, McGill University, 740 Dr. Penfield Avenue, Montreal, Quebec H3A 0G1 Canada
| | - Yann Joly
- Center of Genomics and Policy, McGill University, 740 Dr. Penfield Avenue, Montreal, Quebec H3A 0G1 Canada
| |
Collapse
|
31
|
Gazzo AM, Daneels D, Cilia E, Bonduelle M, Abramowicz M, Van Dooren S, Smits G, Lenaerts T. DIDA: A curated and annotated digenic diseases database. Nucleic Acids Res 2015; 44:D900-7. [PMID: 26481352 PMCID: PMC4702791 DOI: 10.1093/nar/gkv1068] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2015] [Accepted: 10/05/2015] [Indexed: 02/07/2023] Open
Abstract
DIDA (DIgenic diseases DAtabase) is a novel database that provides for the first time detailed information on genes and associated genetic variants involved in digenic diseases, the simplest form of oligogenic inheritance. The database is accessible via http://dida.ibsquare.be and currently includes 213 digenic combinations involved in 44 different digenic diseases. These combinations are composed of 364 distinct variants, which are distributed over 136 distinct genes. The web interface provides browsing and search functionalities, as well as documentation and help pages, general database statistics and references to the original publications from which the data have been collected. The possibility to submit novel digenic data to DIDA is also provided. Creating this new repository was essential as current databases do not allow one to retrieve detailed records regarding digenic combinations. Genes, variants, diseases and digenic combinations in DIDA are annotated with manually curated information and information mined from other online resources. Next to providing a unique resource for the development of new analysis methods, DIDA gives clinical and molecular geneticists a tool to find the most comprehensive information on the digenic nature of their diseases of interest.
Collapse
Affiliation(s)
- Andrea M Gazzo
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe CP 263, 1050 Brussels, Belgium MLG, Département d'Informatique, Université Libre de Bruxelles, Boulevard du Triomphe, CP 212, 1050 Brussels, Belgium Center for Medical Genetics, Reproduction and Genetics, Reproduction Genetics and Regenerative Medicine, Vrije Universiteit Brussel, UZ Brussel, Laarbeeklaan 101, 1090 Brussels, Belgium
| | - Dorien Daneels
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe CP 263, 1050 Brussels, Belgium Center for Medical Genetics, Reproduction and Genetics, Reproduction Genetics and Regenerative Medicine, Vrije Universiteit Brussel, UZ Brussel, Laarbeeklaan 101, 1090 Brussels, Belgium
| | - Elisa Cilia
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe CP 263, 1050 Brussels, Belgium MLG, Département d'Informatique, Université Libre de Bruxelles, Boulevard du Triomphe, CP 212, 1050 Brussels, Belgium
| | - Maryse Bonduelle
- Center for Medical Genetics, Reproduction and Genetics, Reproduction Genetics and Regenerative Medicine, Vrije Universiteit Brussel, UZ Brussel, Laarbeeklaan 101, 1090 Brussels, Belgium
| | - Marc Abramowicz
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe CP 263, 1050 Brussels, Belgium Center for Medical Genetics, Hôpital Erasme, Université Libre de Bruxelles, Route de Lennik 808, 1070 Brussels, Belgium
| | - Sonia Van Dooren
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe CP 263, 1050 Brussels, Belgium Center for Medical Genetics, Reproduction and Genetics, Reproduction Genetics and Regenerative Medicine, Vrije Universiteit Brussel, UZ Brussel, Laarbeeklaan 101, 1090 Brussels, Belgium
| | - Guillaume Smits
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe CP 263, 1050 Brussels, Belgium Center for Medical Genetics, Hôpital Erasme, Université Libre de Bruxelles, Route de Lennik 808, 1070 Brussels, Belgium Genetics, Hôpital Universitaire des Enfants Reine Fabiola, Université Libre de Bruxelles, Avenue JJ Crocq 15, 1020 Brussels, Belgium
| | - Tom Lenaerts
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe CP 263, 1050 Brussels, Belgium MLG, Département d'Informatique, Université Libre de Bruxelles, Boulevard du Triomphe, CP 212, 1050 Brussels, Belgium AI lab, Vakgroep Computerwetenschappen, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
| |
Collapse
|
32
|
Yang H, Wang K. Genomic variant annotation and prioritization with ANNOVAR and wANNOVAR. Nat Protoc 2015; 10:1556-66. [PMID: 26379229 DOI: 10.1038/nprot.2015.105] [Citation(s) in RCA: 601] [Impact Index Per Article: 66.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Recent developments in sequencing techniques have enabled rapid and high-throughput generation of sequence data, democratizing the ability to compile information on large amounts of genetic variations in individual laboratories. However, there is a growing gap between the generation of raw sequencing data and the extraction of meaningful biological information. Here, we describe a protocol to use the ANNOVAR (ANNOtate VARiation) software to facilitate fast and easy variant annotations, including gene-based, region-based and filter-based annotations on a variant call format (VCF) file generated from human genomes. We further describe a protocol for gene-based annotation of a newly sequenced nonhuman species. Finally, we describe how to use a user-friendly and easily accessible web server called wANNOVAR to prioritize candidate genes for a Mendelian disease. The variant annotation protocols take 5-30 min of computer time, depending on the size of the variant file, and 5-10 min of hands-on time. In summary, through the command-line tool and the web server, these protocols provide a convenient means to analyze genetic variants generated in humans and other species.
Collapse
Affiliation(s)
- Hui Yang
- Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, California, USA.,Neuroscience Graduate Program, University of Southern California, Los Angeles, California, USA
| | - Kai Wang
- Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, California, USA.,Department of Psychiatry, University of Southern California, Los Angeles, California, USA.,Department of Preventive Medicine, Division of Bioinformatics, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
33
|
Yang H, Robinson PN, Wang K. Phenolyzer: phenotype-based prioritization of candidate genes for human diseases. Nat Methods 2015; 12:841-3. [PMID: 26192085 PMCID: PMC4718403 DOI: 10.1038/nmeth.3484] [Citation(s) in RCA: 263] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Accepted: 05/18/2015] [Indexed: 12/21/2022]
Abstract
Prior biological knowledge and phenotype information may help to identify disease genes from human whole-genome and whole-exome sequencing studies. We developed Phenolyzer (http://phenolyzer.usc.edu), a tool that uses prior information to implicate genes involved in diseases. Phenolyzer exhibits superior performance over competing methods for prioritizing Mendelian and complex disease genes, based on disease or phenotype terms entered as free text.
Collapse
Affiliation(s)
- Hui Yang
- Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, California, USA
- Neuroscience Graduate Program, University of Southern California, Los Angeles, California, USA
| | - Peter N Robinson
- Institute for Medical and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
- Max Planck Institute for Molecular Genetics, Berlin, Germany
- Berlin Brandenburg Center for Regenerative Therapies (BCRT), Charité-Universitätsmedizin Berlin, Berlin, Germany
- Institute for Bioinformatics, Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | - Kai Wang
- Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, California, USA
- Department of Psychiatry, University of Southern California, Los Angeles, California, USA
- Division of Bioinformatics, Department of Preventive Medicine, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
34
|
Milner LC, Garrison NA, Cho MK, Altman RB, Hudgins L, Galli SJ, Lowe HJ, Schrijver I, Magnus DC. Genomics in the clinic: ethical and policy challenges in clinical next-generation sequencing programs at early adopter USA institutions. Per Med 2015; 12:269-282. [PMID: 29771644 DOI: 10.2217/pme.14.88] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Next-generation sequencing (NGS) technologies are poised to revolutionize clinical diagnosis and treatment, but raise significant ethical and policy challenges. This review examines NGS program challenges through a synthesis of published literature, website and conference presentation content, and interviews at early-adopting institutions in the USA. Institutions are proactively addressing policy challenges related to the management and technical aspects of program development. However, ethical challenges related to patient-related aspects have not been fully addressed. These complex challenges present opportunities to develop comprehensive and standardized regulations across programs. Understanding the strengths, weaknesses and current practices of evolving NGS program approaches are important considerations for institutions developing NGS services, policymakers regulating or funding NGS programs and physicians and patients considering NGS services.
Collapse
Affiliation(s)
- Lauren C Milner
- Stanford Center for Biomedical Ethics, Stanford University School of Medicine, Stanford, CA, USA
| | - Nanibaa' A Garrison
- Stanford Center for Biomedical Ethics, Stanford University School of Medicine, Stanford, CA, USA.,Center for Biomedical Ethics & Society, Departments of Pediatrics & Anthropology, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Mildred K Cho
- Stanford Center for Biomedical Ethics, Stanford University School of Medicine, Stanford, CA, USA.,Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Russ B Altman
- Department of Bioengineering, Stanford University School of Medicine, Stanford, CA, USA.,Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA.,Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Louanne Hudgins
- Division of Medical Genetics, Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Stephen J Galli
- Stanford Center for Genomics & Personalized Medicine, Stanford University School of Medicine, Stanford, CA, USA.,Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA.,Department of Microbiology & Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Henry J Lowe
- Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Iris Schrijver
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA.,Stanford Center for Genomics & Personalized Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - David C Magnus
- Stanford Center for Biomedical Ethics, Stanford University School of Medicine, Stanford, CA, USA.,Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| |
Collapse
|
35
|
Kotze MJ, Lückhoff HK, Peeters AV, Baatjes K, Schoeman M, van der Merwe L, Grant KA, Fisher LR, van der Merwe N, Pretorius J, van Velden DP, Myburgh EJ, Pienaar FM, van Rensburg SJ, Yako YY, September AV, Moremi KE, Cronje FJ, Tiffin N, Bouwens CSH, Bezuidenhout J, Apffelstaedt JP, Hough FS, Erasmus RT, Schneider JW. Genomic medicine and risk prediction across the disease spectrum. Crit Rev Clin Lab Sci 2015; 52:120-37. [PMID: 25597499 DOI: 10.3109/10408363.2014.997930] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Genomic medicine is based on the knowledge that virtually every medical condition, disease susceptibility or response to treatment is caused, regulated or influenced by genes. Genetic testing may therefore add value across the disease spectrum, ranging from single-gene disorders with a Mendelian inheritance pattern to complex multi-factorial diseases. The critical factors for genomic risk prediction are to determine: (1) where the genomic footprint of a particular susceptibility or dysfunction resides within this continuum, and (2) to what extent the genetic determinants are modified by environmental exposures. Regarding the small subset of highly penetrant monogenic disorders, a positive family history and early disease onset are mostly sufficient to determine the appropriateness of genetic testing in the index case and to inform pre-symptomatic diagnosis in at-risk family members. In more prevalent polygenic non-communicable diseases (NCDs), the use of appropriate eligibility criteria is required to ensure a balance between benefit and risk. An additional screening step may therefore be necessary to identify individuals most likely to benefit from genetic testing. This need provided the stimulus for the development of a pathology-supported genetic testing (PSGT) service as a new model for the translational implementation of genomic medicine in clinical practice. PSGT is linked to the establishment of a research database proven to be an invaluable resource for the validation of novel and previously described gene-disease associations replicated in the South African population for a broad range of NCDs associated with increased cardio-metabolic risk. The clinical importance of inquiry concerning family history in determining eligibility for personalized genotyping was supported beyond its current limited role in diagnosing or screening for monogenic subtypes of NCDs. With the recent introduction of advanced microarray-based breast cancer subtyping, genetic testing has extended beyond the genome of the host to also include tumor gene expression profiling for chemotherapy selection. The decreasing cost of next generation sequencing over recent years, together with improvement of both laboratory and computational protocols, enables the mapping of rare genetic disorders and discovery of shared genetic risk factors as novel therapeutic targets across diagnostic boundaries. This article reviews the challenges, successes, increasing inter-disciplinary integration and evolving strategies for extending PSGT towards exome and whole genome sequencing (WGS) within a dynamic framework. Specific points of overlap are highlighted between the application of PSGT and exome or WGS, as the next logical step in genetically uncharacterized patients for whom a particular disease pattern and/or therapeutic failure are not adequately accounted for during the PSGT pre-screen. Discrepancies between different next generation sequencing platforms and low concordance among variant-calling pipelines caution against offering exome or WGS as a stand-alone diagnostic approach. The public reference human genome sequence (hg19) contains minor alleles at more than 1 million loci and variant calling using an advanced major allele reference genome sequence is crucial to ensure data integrity. Understanding that genomic risk prediction is not deterministic but rather probabilistic provides the opportunity for disease prevention and targeted treatment in a way that is unique to each individual patient.
Collapse
Affiliation(s)
- Maritha J Kotze
- Division of Anatomical Pathology, Department of Pathology, Faculty of Medicine and Health Sciences, Stellenbosch University , Cape Town , South Africa
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Molparia B, Pham PH, Torkamani A. Symptom-driven idiopathic disease gene identification. Genet Med 2015; 17:859-65. [PMID: 25590976 PMCID: PMC4861313 DOI: 10.1038/gim.2014.202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2014] [Accepted: 12/12/2014] [Indexed: 12/30/2022] Open
Abstract
Purpose Rare genetic variants are the major cause of Mendelian disorders, yet only half of described genetic diseases have been causally linked to a gene. In addition, the total number of rare genetic diseases is projected to be far greater than that of those already described. Whole-genome sequencing of patients with subsequent genetic and functional analysis is a powerful way to describe these gene anomalies. However, this approach results in tens to hundreds of candidate disease-causative genes, and the identification of additional individuals suffering from the same disorder can be difficult because of rarity and phenotypic heterogeneity. Methods We describe a genetic network–based method to rank candidate genes identified in family-based sequencing studies, termed phenotype informed network (PIN) ranking. Furthermore, we present a case study as an extension of the PIN ranking method in which disease symptoms drive the network ranking and identification of the disease-causative gene. Results We demonstrate, through simulation, that our method is capable of identifying the correct disease-causative gene in a majority of cases. PIN-rank is available at https://genomics.scripps.edu/pin-rank/. Conclusion We have developed a method to prioritize candidate disease-causative genes based on symptoms that would be useful for both the prioritization of candidates and the identification of additional subjects.
Collapse
Affiliation(s)
- Bhuvan Molparia
- Scripps Translational Science Institute, La Jolla, California, USA.,Department of Integrative Structural and Computational Biology, Scripps Research Institute, La Jolla, California, USA
| | | | - Ali Torkamani
- Scripps Translational Science Institute, La Jolla, California, USA.,Department of Integrative Structural and Computational Biology, Scripps Research Institute, La Jolla, California, USA.,Cypher Genomics, La Jolla, California, USA.,Scripps Health, La Jolla, California, USA.,Department of Molecular and Experimental Medicine, Scripps Research Institute, La Jolla, California, USA
| |
Collapse
|
37
|
O'Rawe JA, Ferson S, Lyon GJ. Accounting for uncertainty in DNA sequencing data. Trends Genet 2015; 31:61-6. [PMID: 25579994 DOI: 10.1016/j.tig.2014.12.002] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Revised: 12/12/2014] [Accepted: 12/15/2014] [Indexed: 12/30/2022]
Abstract
Science is defined in part by an honest exposition of the uncertainties that arise in measurements and propagate through calculations and inferences, so that the reliabilities of its conclusions are made apparent. The recent rapid development of high-throughput DNA sequencing technologies has dramatically increased the number of measurements made at the biochemical and molecular level. These data come from many different DNA-sequencing technologies, each with their own platform-specific errors and biases, which vary widely. Several statistical studies have tried to measure error rates for basic determinations, but there are no general schemes to project these uncertainties so as to assess the surety of the conclusions drawn about genetic, epigenetic, and more general biological questions. We review here the state of uncertainty quantification in DNA sequencing applications, describe sources of error, and propose methods that can be used for accounting and propagating these errors and their uncertainties through subsequent calculations.
Collapse
Affiliation(s)
- Jason A O'Rawe
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, NY, USA; Stony Brook University, Stony Brook, NY, USA; Applied Biomathematics, Setauket, NY, USA.
| | - Scott Ferson
- Stony Brook University, Stony Brook, NY, USA; Applied Biomathematics, Setauket, NY, USA.
| | - Gholson J Lyon
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, NY, USA; Stony Brook University, Stony Brook, NY, USA; Utah Foundation for Biomedical Research, Salt Lake City, UT, USA.
| |
Collapse
|
38
|
Datt M, Sharma A. Evolutionary and structural annotation of disease-associated mutations in human aminoacyl-tRNA synthetases. BMC Genomics 2014; 15:1063. [PMID: 25476837 PMCID: PMC4298046 DOI: 10.1186/1471-2164-15-1063] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2014] [Accepted: 11/20/2014] [Indexed: 11/10/2022] Open
Abstract
Background Mutation(s) in proteins are a natural byproduct of evolution but can also cause serious diseases. Aminoacyl-tRNA synthetases (aaRSs) are indispensable components of all cellular protein translational machineries, and in humans they drive translation in both cytoplasm and mitochondria. Mutations in aaRSs have been implicated in a plethora of diseases including neurological conditions, metabolic disorders and cancer. Results We have developed an algorithmic approach for genome-wide analyses of sequence substitutions that combines evolutionary, structural and functional information. This pipeline enabled us to super-annotate human aaRS mutations and analyze their linkage to health disorders. Our data suggest that in some but not all cases, aaRS mutations occur in functional and structural sectors where they can manifest their pathological effects by altering enzyme activity or causing structural instability. Further, mutations appear in both solvent exposed and buried regions of aaRSs indicating that these alterations could lead to dysfunctional enzymes resulting in abnormal protein translation routines by affecting inter-molecular interactions or by disruption of non-bonded interactions. Overall, the prevalence of mutations is much higher in mitochondrial aaRSs, and the two most often mutated aaRSs are mitochondrial glutamyl-tRNA synthetase and dual localized glycyl-tRNA synthetase. Out of 63 mutations annotated in this work, only 12 (~20%) were observed in regions that could directly affect aminoacylation activity via either binding to ATP/amino-acid, tRNA or by involvement in dimerization. Mutations in structural cores or at potential biomolecular interfaces account for ~55% mutations while remaining mutations (~25%) remain structurally un-annotated. Conclusion This work provides a comprehensive structural framework within which most defective human aaRSs have been structurally analyzed. The methodology described here could be employed to annotate mutations in other protein families in a high-throughput manner. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1063) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Amit Sharma
- Structural and Computational Biology group, International Center for Genetic Engineering and Biotechnology, New Delhi 110067, India.
| |
Collapse
|
39
|
Shi L, Li B, Huang Y, Ling X, Liu T, Lyon GJ, Xu A, Wang K. "Genotype-first" approaches on a curious case of idiopathic progressive cognitive decline. BMC Med Genomics 2014; 7:66. [PMID: 25466957 PMCID: PMC4267425 DOI: 10.1186/s12920-014-0066-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 11/20/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In developing countries, many cases with rare neurological diseases remain undiagnosed due to limited diagnostic experience. We encountered a case in China where two siblings both began to develop idiopathic progressive cognitive decline starting from age six, and were suspected to have an undiagnosed neurological disease. METHODS Initial clinical assessments included review of medical history, comprehensive physical examination, genetic testing for metabolic diseases, blood tests and brain imaging. We performed exome sequencing with Agilent SureSelect exon capture and Illumina HiSeq2000 platform, followed by variant annotation and selection of rare, shared mutations that fit a recessive model of inheritance. To assess functional impacts of candidate variants, we performed extensive biochemical tests in blood and urine, and examined their possible roles by protein structure modeling. RESULTS Exome sequencing identified NAGLU as the most likely candidate gene with compound heterozygous mutations (chr17:40695717C > T and chr17:40693129A > G in hg19 coordinate), which were documented to be pathogenic. Sanger sequencing confirmed the recessive patterns of inheritance, leading to a genetic diagnosis of Sanfilippo syndrome (mucopolysaccharidosis IIIB). Biochemical tests confirmed the complete loss of activity of alpha-N-acetylglucosaminidase (encoded by NAGLU) in blood, as well as significantly elevated dermatan sulfate and heparan sulfate in urine. Structure modeling revealed the mechanism on how the two variants affect protein structural stability. CONCLUSIONS Successful diagnosis of a rare genetic disorder with an atypical phenotypic presentation confirmed that such "genotype-first" approaches can particularly succeed in areas of the world with insufficient medical genetics expertise and with cost-prohibitive in-depth phenotyping.
Collapse
Affiliation(s)
- Lingling Shi
- Guangdong-Hongkong-Macau Institute of CNS Regeneration, Jinan University, Guangzhou, Guangdong, 510623, China. .,Guangdong Medical Key Laboratory of Brain Function and Diseases, Jinan University, Guangzhou, Guangdong, 510623, China. .,GHM Collaboration and Innovation Center for Tissue Regeneration and Repair, Jinan University, Guangzhou, Guangdong, 510623, China.
| | - Bingxiao Li
- Neonatal Intensive Care Unit, The 1st Affiliated Hospital, Jinan University, Guangzhou, Guangdong, 510623, China.
| | - Yonglan Huang
- Department of Endocrinology and Metabolism, Guangzhou Women and Children's Medical Center, Guangzhou, Guangdong, 510623, China.
| | - Xueying Ling
- Medical Imaging Center, The 1st Affiliated Hospital, Jinan University, Guangzhou, Guangdong, 510623, China.
| | - Tianyun Liu
- Department of Genetics, Stanford University, Stanford, CA, 94305, USA.
| | - Gholson J Lyon
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11797, USA.
| | - Anding Xu
- Department of Neurology, The 1st Affiliated Hospital, Jinan University, Guangzhou, Guangdong, 510632, China.
| | - Kai Wang
- Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, CA, 90089, USA. .,Department of Psychiatry & Behavioral Sciences, University of Southern California, Los Angeles, CA, 90089, USA.
| |
Collapse
|
40
|
Fang H, Wu Y, Narzisi G, O'Rawe JA, Barrón LTJ, Rosenbaum J, Ronemus M, Iossifov I, Schatz MC, Lyon GJ. Reducing INDEL calling errors in whole genome and exome sequencing data. Genome Med 2014; 6:89. [PMID: 25426171 PMCID: PMC4240813 DOI: 10.1186/s13073-014-0089-z] [Citation(s) in RCA: 120] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2014] [Accepted: 10/16/2014] [Indexed: 12/30/2022] Open
Abstract
Background INDELs, especially those disrupting protein-coding regions of the genome, have been strongly associated with human diseases. However, there are still many errors with INDEL variant calling, driven by library preparation, sequencing biases, and algorithm artifacts. Methods We characterized whole genome sequencing (WGS), whole exome sequencing (WES), and PCR-free sequencing data from the same samples to investigate the sources of INDEL errors. We also developed a classification scheme based on the coverage and composition to rank high and low quality INDEL calls. We performed a large-scale validation experiment on 600 loci, and find high-quality INDELs to have a substantially lower error rate than low-quality INDELs (7% vs. 51%). Results Simulation and experimental data show that assembly based callers are significantly more sensitive and robust for detecting large INDELs (>5 bp) than alignment based callers, consistent with published data. The concordance of INDEL detection between WGS and WES is low (53%), and WGS data uniquely identifies 10.8-fold more high-quality INDELs. The validation rate for WGS-specific INDELs is also much higher than that for WES-specific INDELs (84% vs. 57%), and WES misses many large INDELs. In addition, the concordance for INDEL detection between standard WGS and PCR-free sequencing is 71%, and standard WGS data uniquely identifies 6.3-fold more low-quality INDELs. Furthermore, accurate detection with Scalpel of heterozygous INDELs requires 1.2-fold higher coverage than that for homozygous INDELs. Lastly, homopolymer A/T INDELs are a major source of low-quality INDEL calls, and they are highly enriched in the WES data. Conclusions Overall, we show that accuracy of INDEL detection with WGS is much greater than WES even in the targeted region. We calculated that 60X WGS depth of coverage from the HiSeq platform is needed to recover 95% of INDELs detected by Scalpel. While this is higher than current sequencing practice, the deeper coverage may save total project costs because of the greater accuracy and sensitivity. Finally, we investigate sources of INDEL errors (for example, capture deficiency, PCR amplification, homopolymers) with various data that will serve as a guideline to effectively reduce INDEL errors in genome sequencing. Electronic supplementary material The online version of this article (doi:10.1186/s13073-014-0089-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Han Fang
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA ; Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
| | - Yiyang Wu
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA
| | - Giuseppe Narzisi
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; New York Genome Center, New York, NY USA
| | - Jason A O'Rawe
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA
| | - Laura T Jimenez Barrón
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Centro de Ciencias Genomicas, Universidad Nacional Autonoma de Mexico, Cuernavaca, Morelos Mexico
| | - Julie Rosenbaum
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
| | - Michael Ronemus
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
| | - Ivan Iossifov
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
| | - Michael C Schatz
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA
| | - Gholson J Lyon
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, NY USA ; Stony Brook University, 100 Nicolls Rd, Stony Brook, NY USA
| |
Collapse
|
41
|
Belizário JE. The humankind genome: from genetic diversity to the origin of human diseases. Genome 2014; 56:705-16. [PMID: 24433206 DOI: 10.1139/gen-2013-0125] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
Collapse
Affiliation(s)
- Jose E Belizário
- Departamento de Farmacologia, Instituto de Ciências Biomédicas da Universidade de São Paulo, Avenida Lineu Prestes, 1524 CEP 05508-900, São Paulo, SP, Brazil
| |
Collapse
|
42
|
Freudenberg-Hua Y, Freudenberg J, Vacic V, Abhyankar A, Emde AK, Ben-Avraham D, Barzilai N, Oschwald D, Christen E, Koppel J, Greenwald B, Darnell RB, Germer S, Atzmon G, Davies P. Disease variants in genomes of 44 centenarians. Mol Genet Genomic Med 2014; 2:438-50. [PMID: 25333069 PMCID: PMC4190879 DOI: 10.1002/mgg3.86] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2014] [Revised: 04/29/2014] [Accepted: 04/29/2014] [Indexed: 12/14/2022] Open
Abstract
To identify previously reported disease mutations that are compatible with extraordinary longevity, we screened the coding regions of the genomes of 44 Ashkenazi Jewish centenarians. Individual genome sequences were generated with 30× coverage on the Illumina HiSeq 2000 and single-nucleotide variants were called with the genome analysis toolkit (GATK). We identified 130 coding variants that were annotated as “pathogenic” or “likely pathogenic” based on the ClinVar database and that are infrequent in the general population. These variants were previously reported to cause a wide range of degenerative, neoplastic, and cardiac diseases with autosomal dominant, autosomal recessive, and X-linked inheritance. Several of these variants are located in genes that harbor actionable incidental findings, according to the recommendations of the American College of Medical Genetics. In addition, we found risk variants for late-onset neurodegenerative diseases, such as the APOE ε4 allele that was even present in a homozygous state in one centenarian who did not develop Alzheimer's disease. Our data demonstrate that the incidental finding of certain reported disease variants in an individual genome may not preclude an extraordinarily long life. When the observed variants are encountered in the context of clinical sequencing, it is thus important to exercise caution in justifying clinical decisions. In genome sequences of 44 Ashkenazi centenarians, we identified many coding variants that were annotated as “pathogenic” or “likely pathogenic” based on the ClinVar database. Our data demonstrate that the incidental finding of certain reported disease variants in an individual genome may not preclude an extraordinarily long life. When the observed variants are encountered in the context of clinical sequencing, it is thus important to exercise caution in justifying clinical decisions.
Collapse
Affiliation(s)
- Yun Freudenberg-Hua
- The Litwin-Zucker Research Center for the Study of Alzheimer's Disease and Memory Disorders, The Feinstein Institute for Medical Research, North Shore-LIJ Manhasset, New York, 11030 ; Division of Geriatric Psychiatry, Zucker Hillside Hospital, North Shore-LIJ Glen Oaks, New York, 11040
| | - Jan Freudenberg
- Robert S. Boas Center for Genomics and Human Genetics, The Feinstein Institute for Medical Research, North Shore-LIJ Manhasset, New York, 11030
| | - Vladimir Vacic
- New York Genome Center 101 Avenue of the Americas, New York, New York, 10013
| | - Avinash Abhyankar
- New York Genome Center 101 Avenue of the Americas, New York, New York, 10013
| | - Anne-Katrin Emde
- New York Genome Center 101 Avenue of the Americas, New York, New York, 10013
| | - Danny Ben-Avraham
- Institute for Aging Research Departments of Medicine and Genetics, Albert Einstein College of Medicine 1300 Morris Park Avenue, Bronx, New York, 10461
| | - Nir Barzilai
- Institute for Aging Research Departments of Medicine and Genetics, Albert Einstein College of Medicine 1300 Morris Park Avenue, Bronx, New York, 10461
| | - Dayna Oschwald
- New York Genome Center 101 Avenue of the Americas, New York, New York, 10013
| | - Erika Christen
- The Litwin-Zucker Research Center for the Study of Alzheimer's Disease and Memory Disorders, The Feinstein Institute for Medical Research, North Shore-LIJ Manhasset, New York, 11030
| | - Jeremy Koppel
- The Litwin-Zucker Research Center for the Study of Alzheimer's Disease and Memory Disorders, The Feinstein Institute for Medical Research, North Shore-LIJ Manhasset, New York, 11030 ; Division of Geriatric Psychiatry, Zucker Hillside Hospital, North Shore-LIJ Glen Oaks, New York, 11040
| | - Blaine Greenwald
- Division of Geriatric Psychiatry, Zucker Hillside Hospital, North Shore-LIJ Glen Oaks, New York, 11040
| | - Robert B Darnell
- New York Genome Center 101 Avenue of the Americas, New York, New York, 10013 ; Department of Molecular Neuro-Oncology, Howard Hughes Medical Institute, The Rockefeller University 1230 York Avenue, New York, New York, 10065
| | - Soren Germer
- New York Genome Center 101 Avenue of the Americas, New York, New York, 10013
| | - Gil Atzmon
- Institute for Aging Research Departments of Medicine and Genetics, Albert Einstein College of Medicine 1300 Morris Park Avenue, Bronx, New York, 10461
| | - Peter Davies
- The Litwin-Zucker Research Center for the Study of Alzheimer's Disease and Memory Disorders, The Feinstein Institute for Medical Research, North Shore-LIJ Manhasset, New York, 11030
| |
Collapse
|
43
|
Reis LM, Tyler RC, Semina EV. Identification of a novel C-terminal extension mutation in EPHA2 in a family affected with congenital cataract. Mol Vis 2014; 20:836-42. [PMID: 24940039 PMCID: PMC4057250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2013] [Accepted: 06/11/2013] [Indexed: 11/10/2022] Open
Abstract
PURPOSE Congenital cataracts occur in 3-4 per 10,000 live births and account for 5% to 20% of pediatric blindness worldwide. With more than 37 genes known to be associated with isolated congenital cataract, whole exome sequencing (WES) was recently introduced as an efficient method for screening all known factors. METHODS Whole exome analysis in two members of a four-generation pedigree affected with dominant congenital cataract and glaucoma was performed by WES; co-segregation analysis of identified variants in all pedigree members was completed by Sanger sequencing. RESULTS Analysis of the WES data identified a novel pathogenic variant in EPHA2, c.2925dupC, p.(Ile976Hisfs*37), that demonstrated complete cosegregation with the phenotype in the pedigree. The mutation occurs in the final amino acid before the stop codon of the normal EPHA2 protein and is predicted to produce a mutant protein with an erroneous C-terminal extension of 35 amino acids. Nine other families have been previously reported with dominant congenital/juvenile cataracts and mutations in EPHA2. Two additional likely loss-of-function variants in genes known to cause dominant congenital cataract were considered and excluded based on control data and cosegregation analysis: a nonsense variant in CYRBB3, c.547G>T, p.(Glu183*), and a splicing variant in CRYBA2, c.446+1G>A. CONCLUSIONS Identification of a novel pathogenic EPHA2 allele further implicates this gene in congenital cataract. This is only the second EPHA2 mutation that specifically affects the most C-terminal PSD95/Dlg/ZO1 (PDZ)-binding motif and the third pathogenic allele associated with an erroneous C-terminal extension beyond the normal stop codon.
Collapse
Affiliation(s)
- Linda M. Reis
- Department of Pediatrics and Children’s Research Institute, Medical College of Wisconsin, Milwaukee, WI
| | - Rebecca C. Tyler
- Department of Pediatrics and Children’s Research Institute, Medical College of Wisconsin, Milwaukee, WI
| | - Elena V. Semina
- Department of Pediatrics and Children’s Research Institute, Medical College of Wisconsin, Milwaukee, WI,Department of Cell Biology, Neurobiology and Anatomy, Medical College of Wisconsin, Milwaukee, WI
| |
Collapse
|
44
|
Danielsson K, Mun LJ, Lordemann A, Mao J, Lin CHJ. Next-generation sequencing applied to rare diseases genomics. Expert Rev Mol Diagn 2014; 14:469-87. [PMID: 24702023 DOI: 10.1586/14737159.2014.904749] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Genomics has revolutionized the study of rare diseases. In this review, we overview the latest technological development, rare disease discoveries, implementation obstacles and bioethical challenges. First, we discuss the technology of genome and exome sequencing, including the different next-generation platforms and exome enrichment technologies. Second, we survey the pioneering centers and discoveries for rare diseases, including few of the research institutions that have contributed to the field, as well as an overview survey of different types of rare diseases that have had new discoveries due to next-generation sequencing. Third, we discuss the obstacles and challenges that allow for clinical implementation, including returning of results, informed consent and privacy. Last, we discuss possible outlook as clinical genomics receives wider adoption, as third-generation sequencing is coming onto the horizon, and some needs in informatics and software to further advance the field.
Collapse
Affiliation(s)
- Krissi Danielsson
- Rare Genomics Institute, 4100 Forest Park Ave, Suite 204, St. Louis, MO 63108, USA
| | | | | | | | | |
Collapse
|
45
|
Yao J, Zhang KX, Kramer M, Pellegrini M, McCombie WR. FamAnn: an automated variant annotation pipeline to facilitate target discovery for family-based sequencing studies. ACTA ACUST UNITED AC 2014; 30:1175-1176. [PMID: 24395755 DOI: 10.1093/bioinformatics/btt749] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2012] [Accepted: 12/20/2013] [Indexed: 12/17/2022]
Abstract
FamAnn is an automated variant annotation pipeline designed for facilitating target discovery for family-based sequencing studies. It can apply a different inheritance pattern or a de novo mutations discovery model to each family and select single nucleotide variants and small insertions and deletions segregating in each family or shared by multiple families. It also provides a variety of variant annotations and retains and annotates all transcripts hit by a single variant. Excel-compatible outputs including all annotated variants segregating in each family or shared by multiple families will be provided for users to prioritize variants based on their customized thresholds. A list of genes that harbor the segregating variants will be provided as well for possible pathway/network analyses. FamAnn uses the de facto community standard Variant Call Format as the input format and can be applied to whole exome, genome or targeted resequencing data. AVAILABILITY https://sites.google.com/site/famannotation/home CONTACT: jianchaoyao@gmail.com, kelvinzhang@mednet.ucla.edu, mccombie@cshl.edu Supplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jianchao Yao
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA, Department of Biological Chemistry, Howard Hughes Medical Institute, University of California, Los Angeles, CA 90095, USA and Department of Molecular, Cellular, and Developmental Biology, University of California, Los Angeles, CA 90095, USA
| | - Kelvin Xi Zhang
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA, Department of Biological Chemistry, Howard Hughes Medical Institute, University of California, Los Angeles, CA 90095, USA and Department of Molecular, Cellular, and Developmental Biology, University of California, Los Angeles, CA 90095, USA
| | - Melissa Kramer
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA, Department of Biological Chemistry, Howard Hughes Medical Institute, University of California, Los Angeles, CA 90095, USA and Department of Molecular, Cellular, and Developmental Biology, University of California, Los Angeles, CA 90095, USA
| | - Matteo Pellegrini
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA, Department of Biological Chemistry, Howard Hughes Medical Institute, University of California, Los Angeles, CA 90095, USA and Department of Molecular, Cellular, and Developmental Biology, University of California, Los Angeles, CA 90095, USA
| | - W Richard McCombie
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA, Department of Biological Chemistry, Howard Hughes Medical Institute, University of California, Los Angeles, CA 90095, USA and Department of Molecular, Cellular, and Developmental Biology, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
46
|
Pierri CL, Palmieri F, De Grassi A. Single-nucleotide evolution quantifies the importance of each site along the structure of mitochondrial carriers. Cell Mol Life Sci 2014; 71:349-64. [PMID: 23800987 PMCID: PMC11113836 DOI: 10.1007/s00018-013-1389-y] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2013] [Revised: 05/10/2013] [Accepted: 05/27/2013] [Indexed: 11/28/2022]
Abstract
Mitochondrial carriers are membrane-embedded proteins consisting of a tripartite structure, a three-fold pseudo-symmetry, related sequences, and similar folding whose main function is to catalyze the transport of various metabolites, nucleotides, and coenzymes across the inner mitochondrial membrane. In this study, the evolutionary rate in vertebrates was screened at each of the approximately 50,000 nucleotides corresponding to the amino acids of the 53 human mitochondrial carriers. Using this information as a starting point, a scoring system was developed to quantify the evolutionary pressure acting on each site of the common mitochondrial carrier structure and estimate its functional or structural relevance. The degree of evolutionary selection varied greatly among all sites, but it was highly similar among the three symmetric positions in the tripartite structure, known as symmetry-related sites or triplets, suggesting that each triplet constitutes an evolutionary unit. Based on evolutionary selection, 111 structural sites (37 triplets) were found to be important. These sites play a key role in structure/function of mitochondrial carriers and are involved in either conformational changes (sites of the gates, proline-glycine levels, and aromatic belts) or in binding and specificity of the transported substrates (sites of the substrate-binding area in between the two gates). Furthermore, the evolutionary pressure analysis revealed that the matrix short helix sites underwent different degrees of selection with high inter-paralog variability. Evidence is presented that these sites form a new sequence motif in a subset of mitochondrial carriers, including the ADP/ATP translocator, and play a regulatory function by interacting with ligands and/or proteins of the mitochondrial matrix.
Collapse
Affiliation(s)
- Ciro Leonardo Pierri
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Via Orabona 4, 70125 Bari, Italy
| | - Ferdinando Palmieri
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Via Orabona 4, 70125 Bari, Italy
- Center of Excellence in Comparative Genomics, University of Bari, Bari, Italy
| | - Anna De Grassi
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari, Via Orabona 4, 70125 Bari, Italy
- Département Systématique et Evolution, Ecole Pratique des Hautes Etudes, Muséum National d’Histoire Naturelle, Paris, France
| |
Collapse
|
47
|
Wang S, Xing J. A primer for disease gene prioritization using next-generation sequencing data. Genomics Inform 2013; 11:191-9. [PMID: 24465230 PMCID: PMC3897846 DOI: 10.5808/gi.2013.11.4.191] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2013] [Revised: 11/18/2013] [Accepted: 11/21/2013] [Indexed: 01/21/2023] Open
Abstract
High-throughput next-generation sequencing (NGS) technology produces a tremendous amount of raw sequence data. The challenges for researchers are to process the raw data, to map the sequences to genome, to discover variants that are different from the reference genome, and to prioritize/rank the variants for the question of interest. The recent development of many computational algorithms and programs has vastly improved the ability to translate sequence data into valuable information for disease gene identification. However, the NGS data analysis is complex and could be overwhelming for researchers who are not familiar with the process. Here, we outline the analysis pipeline and describe some of the most commonly used principles and tools for analyzing NGS data for disease gene identification.
Collapse
Affiliation(s)
- Shuoguo Wang
- Department of Genetics, The State University of New Jersey, Piscataway, NJ 08854, USA. ; Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | - Jinchuan Xing
- Department of Genetics, The State University of New Jersey, Piscataway, NJ 08854, USA. ; Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| |
Collapse
|
48
|
O’Rawe JA, Fang H, Rynearson S, Robison R, Kiruluta ES, Higgins G, Eilbeck K, Reese MG, Lyon GJ. Integrating precision medicine in the study and clinical treatment of a severely mentally ill person. PeerJ 2013; 1:e177. [PMID: 24109560 PMCID: PMC3792182 DOI: 10.7717/peerj.177] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2013] [Accepted: 09/16/2013] [Indexed: 01/02/2023] Open
Abstract
Background. In recent years, there has been an explosion in the number of technical and medical diagnostic platforms being developed. This has greatly improved our ability to more accurately, and more comprehensively, explore and characterize human biological systems on the individual level. Large quantities of biomedical data are now being generated and archived in many separate research and clinical activities, but there exists a paucity of studies that integrate the areas of clinical neuropsychiatry, personal genomics and brain-machine interfaces. Methods. A single person with severe mental illness was implanted with the Medtronic Reclaim(®) Deep Brain Stimulation (DBS) Therapy device for Obsessive Compulsive Disorder (OCD), targeting his nucleus accumbens/anterior limb of the internal capsule. Programming of the device and psychiatric assessments occurred in an outpatient setting for over two years. His genome was sequenced and variants were detected in the Illumina Whole Genome Sequencing Clinical Laboratory Improvement Amendments (CLIA)-certified laboratory. Results. We report here the detailed phenotypic characterization, clinical-grade whole genome sequencing (WGS), and two-year outcome of a man with severe OCD treated with DBS. Since implantation, this man has reported steady improvement, highlighted by a steady decline in his Yale-Brown Obsessive Compulsive Scale (YBOCS) score from ∼38 to a score of ∼25. A rechargeable Activa RC neurostimulator battery has been of major benefit in terms of facilitating a degree of stability and control over the stimulation. His psychiatric symptoms reliably worsen within hours of the battery becoming depleted, thus providing confirmatory evidence for the efficacy of DBS for OCD in this person. WGS revealed that he is a heterozygote for the p.Val66Met variant in BDNF, encoding a member of the nerve growth factor family, and which has been found to predispose carriers to various psychiatric illnesses. He carries the p.Glu429Ala allele in methylenetetrahydrofolate reductase (MTHFR) and the p.Asp7Asn allele in ChAT, encoding choline O-acetyltransferase, with both alleles having been shown to confer an elevated susceptibility to psychoses. We have found thousands of other variants in his genome, including pharmacogenetic and copy number variants. This information has been archived and offered to this person alongside the clinical sequencing data, so that he and others can re-analyze his genome for years to come. Conclusions. To our knowledge, this is the first study in the clinical neurosciences that integrates detailed neuropsychiatric phenotyping, deep brain stimulation for OCD and clinical-grade WGS with management of genetic results in the medical treatment of one person with severe mental illness. We offer this as an example of precision medicine in neuropsychiatry including brain-implantable devices and genomics-guided preventive health care.
Collapse
Affiliation(s)
- Jason A. O’Rawe
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, NY, USA
- Stony Brook University, Stony Brook, NY, USA
| | - Han Fang
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, NY, USA
- Stony Brook University, Stony Brook, NY, USA
| | - Shawn Rynearson
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, USA
| | - Reid Robison
- Utah Foundation for Biomedical Research, Salt Lake City, UT, USA
| | | | | | - Karen Eilbeck
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, USA
| | | | - Gholson J. Lyon
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, NY, USA
- Stony Brook University, Stony Brook, NY, USA
- Utah Foundation for Biomedical Research, Salt Lake City, UT, USA
| |
Collapse
|
49
|
Wang K, Kim C, Bradfield J, Guo Y, Toskala E, Otieno FG, Hou C, Thomas K, Cardinale C, Lyon GJ, Golhar R, Hakonarson H. Whole-genome DNA/RNA sequencing identifies truncating mutations in RBCK1 in a novel Mendelian disease with neuromuscular and cardiac involvement. Genome Med 2013; 5:67. [PMID: 23889995 PMCID: PMC3971341 DOI: 10.1186/gm471] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2013] [Revised: 07/15/2013] [Accepted: 07/26/2013] [Indexed: 12/19/2022] Open
Abstract
Background Whole-exome sequencing has identified the causes of several Mendelian diseases by analyzing multiple unrelated cases, but it is more challenging to resolve the cause of extremely rare and suspected Mendelian diseases from individual families. We identified a family quartet with two children, both affected with a previously unreported disease, characterized by progressive muscular weakness and cardiomyopathy, with normal intelligence. During the course of the study, we identified one additional unrelated patient with a comparable phenotype. Methods We performed whole-genome sequencing (Complete Genomics platform), whole-exome sequencing (Agilent SureSelect exon capture and Illumina Genome Analyzer II platform), SNP genotyping (Illumina HumanHap550 SNP array) and Sanger sequencing on blood samples, as well as RNA-Seq (Illumina HiSeq platform) on transformed lymphoblastoid cell lines. Results From whole-genome sequence data, we identified RBCK1, a gene encoding an E3 ubiquitin-protein ligase, as the most likely candidate gene, with two protein-truncating mutations in probands in the first family. However, exome data failed to nominate RBCK1 as a candidate gene, due to poor regional coverage. Sanger sequencing identified a private homozygous splice variant in RBCK1 in the proband in the second family, yet SNP genotyping revealed a 1.2Mb copy-neutral region of homozygosity covering RBCK1. RNA-Seq confirmed aberrant splicing of RBCK1 transcripts, resulting in truncated protein products. Conclusions While the exact mechanism by which these mutations cause disease is unknown, our study represents an example of how the combined use of whole-genome DNA and RNA sequencing can identify a disease-predisposing gene for a novel and extremely rare Mendelian disease.
Collapse
Affiliation(s)
- Kai Wang
- Zilkha Neurogenetic Institute, Keck School of Medicine, University of Southern California, 1501 San Pablo St, Los Angeles, CA 90089, USA ; Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Cecilia Kim
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Jonathan Bradfield
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Yunfei Guo
- Zilkha Neurogenetic Institute, Keck School of Medicine, University of Southern California, 1501 San Pablo St, Los Angeles, CA 90089, USA
| | - Elina Toskala
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Frederick G Otieno
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Cuiping Hou
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Kelly Thomas
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Christopher Cardinale
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Gholson J Lyon
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA ; Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, One Bungtown Rd, NY 11724, USA
| | - Ryan Golhar
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, 3615 Civic Center Blvd, Philadelphia, PA 19104, USA ; Department of Pediatrics, University of Pennsylvania School of Medicine, 3451 Walnut St, Philadelphia, PA 19104, USA
| |
Collapse
|
50
|
Veeramah KR, Johnstone L, Karafet TM, Wolf D, Sprissler R, Salogiannis J, Barth-Maron A, Greenberg ME, Stuhlmann T, Weinert S, Jentsch T, Pazzi M, Restifo LL, Talwar D, Erickson RP, Hammer MF. Exome sequencing reveals new causal mutations in children with epileptic encephalopathies. Epilepsia 2013; 54:1270-81. [PMID: 23647072 PMCID: PMC3700577 DOI: 10.1111/epi.12201] [Citation(s) in RCA: 223] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/19/2013] [Indexed: 02/06/2023]
Abstract
PURPOSE The management of epilepsy in children is particularly challenging when seizures are resistant to antiepileptic medications, or undergo many changes in seizure type over time, or have comorbid cognitive, behavioral, or motor deficits. Despite efforts to classify such epilepsies based on clinical and electroencephalographic criteria, many children never receive a definitive etiologic diagnosis. Whole exome sequencing (WES) is proving to be a highly effective method for identifying de novo variants that cause neurologic disorders, especially those associated with abnormal brain development. Herein we explore the utility of WES for identifying candidate causal de novo variants in a cohort of children with heterogeneous sporadic epilepsies without etiologic diagnoses. METHODS We performed WES (mean coverage approximately 40×) on 10 trios comprised of unaffected parents and a child with sporadic epilepsy characterized by difficult-to-control seizures and some combination of developmental delay, epileptic encephalopathy, autistic features, cognitive impairment, or motor deficits. Sequence processing and variant calling were performed using standard bioinformatics tools. A custom filtering system was used to prioritize de novo variants of possible functional significance for validation by Sanger sequencing. KEY FINDINGS In 9 of 10 probands, we identified one or more de novo variants predicted to alter protein function, for a total of 15. Four probands had de novo mutations in genes previously shown to harbor heterozygous mutations in patients with severe, early onset epilepsies (two in SCN1A, and one each in CDKL5 and EEF1A2). In three children, the de novo variants were in genes with functional roles that are plausibly relevant to epilepsy (KCNH5, CLCN4, and ARHGEF15). The variant in KCNH5 alters one of the highly conserved arginine residues of the voltage sensor of the encoded voltage-gated potassium channel. In vitro analyses using cell-based assays revealed that the CLCN4 mutation greatly impaired ion transport by the ClC-4 2Cl(-) /H(+) -exchanger and that the mutation in ARHGEF15 reduced GEF exchange activity of the gene product, Ephexin5, by about 50%. Of interest, these seven probands all presented with seizures within the first 6 months of life, and six of these have intractable seizures. SIGNIFICANCE The finding that 7 of 10 children carried de novo mutations in genes of known or plausible clinical significance to neuronal excitability suggests that WES will be of use for the molecular genetic diagnosis of sporadic epilepsies in children, especially when seizures are of early onset and difficult to control.
Collapse
Affiliation(s)
- Krishna R Veeramah
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ, 85721, USA
| | - Laurel Johnstone
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ, 85721, USA
| | - Tatiana M Karafet
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ, 85721, USA
| | - Daniel Wolf
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ, 85721, USA
| | - Ryan Sprissler
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ, 85721, USA
| | - John Salogiannis
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Asa Barth-Maron
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | | | - Till Stuhlmann
- Leibniz-Institut für Molekulare Pharmakologie (FMP) and Max-Delbrück-Centrum für Molekulare Medizin (MDC), 13125 Berlin, Germany
| | - Stefanie Weinert
- Leibniz-Institut für Molekulare Pharmakologie (FMP) and Max-Delbrück-Centrum für Molekulare Medizin (MDC), 13125 Berlin, Germany
| | - Thomas Jentsch
- Leibniz-Institut für Molekulare Pharmakologie (FMP) and Max-Delbrück-Centrum für Molekulare Medizin (MDC), 13125 Berlin, Germany
| | | | - Linda L Restifo
- Department of Neurology, Arizona Health Science Center, Tucson AZ 85724, USA
- Department of Neuroscience, University of Arizona, Tucson, AZ 85821, USA
- Department of Cellular & Molecular Medicine, Arizona Health Science Center, Tucson, AZ 85724, USA
| | - Dinesh Talwar
- Center for Neurosciences, Tucson, AZ 85718, USA
- Department of Neurology, Arizona Health Science Center, Tucson AZ 85724, USA
- Department of Pediatrics, Arizona Health Science Center, Tucson AZ 85724, USA
| | - Robert P Erickson
- Department of Pediatrics, Arizona Health Science Center, Tucson AZ 85724, USA
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Michael F Hammer
- ARL Division of Biotechnology, University of Arizona, Tucson, AZ, 85721, USA
| |
Collapse
|