1
|
Tran Mau-Them F, Overs A, Bruel AL, Duquet R, Thareau M, Denommé-Pichon AS, Vitobello A, Sorlin A, Safraou H, Nambot S, Delanne J, Moutton S, Racine C, Engel C, De Giraud d’Agay M, Lehalle D, Goldenberg A, Willems M, Coubes C, Genevieve D, Verloes A, Capri Y, Perrin L, Jacquemont ML, Lambert L, Lacaze E, Thevenon J, Hana N, Van-Gils J, Dubucs C, Bizaoui V, Gerard-Blanluet M, Lespinasse J, Mercier S, Guerrot AM, Maystadt I, Tisserant E, Faivre L, Philippe C, Duffourd Y, Thauvin-Robinet C. Combining globally search for a regular expression and print matching lines with bibliographic monitoring of genomic database improves diagnosis. Front Genet 2023; 14:1122985. [PMID: 37152996 PMCID: PMC10157399 DOI: 10.3389/fgene.2023.1122985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 02/13/2023] [Indexed: 05/09/2023] Open
Abstract
Introduction: Exome sequencing has a diagnostic yield ranging from 25% to 70% in rare diseases and regularly implicates genes in novel disorders. Retrospective data reanalysis has demonstrated strong efficacy in improving diagnosis, but poses organizational difficulties for clinical laboratories. Patients and methods: We applied a reanalysis strategy based on intensive prospective bibliographic monitoring along with direct application of the GREP command-line tool (to "globally search for a regular expression and print matching lines") in a large ES database. For 18 months, we submitted the same five keywords of interest [(intellectual disability, (neuro)developmental delay, and (neuro)developmental disorder)] to PubMed on a daily basis to identify recently published novel disease-gene associations or new phenotypes in genes already implicated in human pathology. We used the Linux GREP tool and an in-house script to collect all variants of these genes from our 5,459 exome database. Results: After GREP queries and variant filtration, we identified 128 genes of interest and collected 56 candidate variants from 53 individuals. We confirmed causal diagnosis for 19/128 genes (15%) in 21 individuals and identified variants of unknown significance for 19/128 genes (15%) in 23 individuals. Altogether, GREP queries for only 128 genes over a period of 18 months permitted a causal diagnosis to be established in 21/2875 undiagnosed affected probands (0.7%). Conclusion: The GREP query strategy is efficient and less tedious than complete periodic reanalysis. It is an interesting reanalysis strategy to improve diagnosis.
Collapse
Affiliation(s)
- Frédéric Tran Mau-Them
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
- *Correspondence: Frédéric Tran Mau-Them,
| | - Alexis Overs
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
| | - Ange-Line Bruel
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
| | - Romain Duquet
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
| | - Mylene Thareau
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
| | - Anne-Sophie Denommé-Pichon
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
| | - Antonio Vitobello
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
| | - Arthur Sorlin
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
| | - Hana Safraou
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
| | - Sophie Nambot
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Julian Delanne
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Sebastien Moutton
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Caroline Racine
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Camille Engel
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
| | | | - Daphne Lehalle
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Alice Goldenberg
- Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, Rouen, France
- Department of Genetics and Reference Center for Developmental Disorders, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Marjolaine Willems
- Département de Génétique Médicale Maladies Rares et Médecine Personnalisée, Centre de Référence Maladies Rares Anomalies du Développement, Hôpital Arnaud de Villeneuve, Université Montpellier, Montpellier, France
| | - Christine Coubes
- Département de Génétique Médicale Maladies Rares et Médecine Personnalisée, Centre de Référence Maladies Rares Anomalies du Développement, Hôpital Arnaud de Villeneuve, Université Montpellier, Montpellier, France
| | - David Genevieve
- Département de Génétique Médicale Maladies Rares et Médecine Personnalisée, Centre de Référence Maladies Rares Anomalies du Développement, Hôpital Arnaud de Villeneuve, Université Montpellier, Montpellier, France
| | - Alain Verloes
- Centre de Référence Anomalies du Développement et Syndromes Malformatifs, Department of Medical Genetics, AP-HPNord- Université de Paris, Hôpital Robert Debré, Paris, France
- INSERM UMR 1141, Paris, France
| | - Yline Capri
- Service de Génétique Clinique, CHU Robert Debré, Paris, France
| | - Laurence Perrin
- Service de Génétique Clinique, CHU Robert Debré, Paris, France
| | - Marie-Line Jacquemont
- Unité de Génétique Médicale, Pole Femme-Mère-Enfant, Groupe Hospitalier Sud Réunion, CHU de La Réunion, La Réunion, France
| | | | - Elodie Lacaze
- Unité de Génétique Médicale, Groupe Hospitalier du Havre, Le Havre, France
| | - Julien Thevenon
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Nadine Hana
- Département de Génétique, Assistance Publique-Hôpitaux de Paris, Hôpital Bichat, Paris, France
- INSERM U1148, Laboratory for Vascular Translational Science, Université Paris de Paris, Hôpital Bichat, Paris, France
| | - Julien Van-Gils
- Service de Génétique Médicale, CHU de Bordeaux, Bordeaux, France
| | - Charlotte Dubucs
- Department of Medical Genetics, Toulouse University Hospital, Toulouse, France
| | - Varoona Bizaoui
- Service de Génétique, Centre Hospitalier Universitaire Caen Normandie, Caen, France
| | | | | | - Sandra Mercier
- Service de Génétique Médicale, CHU Nantes, Nantes, France
| | - Anne-Marie Guerrot
- Department of Genetics and Reference Center for Developmental Disorders, Normandie Univ, UNIROUEN, CHU Rouen, Rouen, France
- Inserm U1245, FHU G4 Génomique, Rouen, France
| | - Isabelle Maystadt
- Centre de Génétique Humaine, Institut de Pathologie et de Génétique, Gosselies, Belgium
| | - Emilie Tisserant
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
| | - Laurence Faivre
- INSERM UMR1231 GAD, Dijon, France
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| | - Christophe Philippe
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
| | - Yannis Duffourd
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
| | - Christel Thauvin-Robinet
- Unité Fonctionnelle Innovation en Diagnostic Génomique des maladies rares, CHU Dijon, Dijon, France
- INSERM UMR1231 GAD, Dijon, France
- Centre de Référence Maladies Rares “Anomalies du développement et syndromes malformatifs”, Centre de Génétique, FHUTRANSLAD et Institut GIMI, CHU Dijon Bourgogne, Dijon, France
| |
Collapse
|
2
|
Wang P, Mo Y, Wang Y, Fei Y, Huang J, Ni J, Xu ZF. Macadamia germplasm and genomic database (MacadamiaGGD): A comprehensive platform for germplasm innovation and functional genomics in Macadamia. Front Plant Sci 2022; 13:1007266. [PMID: 36388568 PMCID: PMC9646992 DOI: 10.3389/fpls.2022.1007266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Accepted: 10/11/2022] [Indexed: 06/16/2023]
Abstract
As an important nut crop species, macadamia continues to gain increased amounts of attention worldwide. Nevertheless, with the vast increase in macadamia omic data, it is becoming difficult for researchers to effectively process and utilize the information. In this work, we developed the first integrated germplasm and genomic database for macadamia (MacadamiaGGD), which includes five genomes of four species; three chloroplast and mitochondrial genomes; genome annotations; transcriptomic data for three macadamia varieties, germplasm data for four species and 262 main varieties; nine genetic linkage maps; and 35 single-nucleotide polymorphisms (SNPs). The database serves as a valuable collection of simple sequence repeat (SSR) markers, including both markers that are based on macadamia genomic sequences and developed in this study and markers developed previously. MacadamiaGGD is also integrated with multiple bioinformatic tools, such as search, JBrowse, BLAST, primer designer, sequence fetch, enrichment analysis, multiple sequence alignment, genome alignment, and gene homology annotation, which allows users to conveniently analyze their data of interest. MacadamiaGGD is freely available online (http://MacadamiaGGD.net). We believe that the database and additional information of the SSR markers can help scientists better understand the genomic sequence information of macadamia and further facilitate molecular breeding efforts of this species.
Collapse
Affiliation(s)
- Pan Wang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Yi Mo
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Yi Wang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Yuchong Fei
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Jianting Huang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Jun Ni
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| | - Zeng-Fu Xu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Forestry, Guangxi University, Nanning, China
- Key Laboratory of National Forestry and Grassland Administration for Fast-Growing Tree Breeding and Cultivation in Central and Southern China, College of Forestry, Guangxi University, Nanning, China
| |
Collapse
|
3
|
Sanders M, Lawlor JMJ, Li X, Schuen JN, Millard SL, Zhang X, Buck L, Grysko B, Uhl KL, Hinds D, Stenger CL, Morris M, Lamb N, Levy H, Bupp C, Prokop JW. Genomic, transcriptomic, and protein landscape profile of CFTR and cystic fibrosis. Hum Genet 2021; 140:423-439. [PMID: 32734384 PMCID: PMC7855842 DOI: 10.1007/s00439-020-02211-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2020] [Accepted: 07/25/2020] [Indexed: 01/18/2023]
Abstract
Cystic Fibrosis (CF) is caused most often by removal of amino acid 508 (Phe508del, deltaF508) within CFTR, yet dozens of additional CFTR variants are known to give rise to CF and many variants in the genome are known to contribute to CF pathology. To address CFTR coding variants, we developed a sequence-to-structure-to-dynamic matrix for all amino acids of CFTR using 233 vertebrate species, CFTR structure within a lipid membrane, and 20 ns of molecular dynamic simulation to assess known variants from the CFTR1, CFTR2, ClinVar, TOPmed, gnomAD, and COSMIC databases. Surprisingly, we identify 18 variants of uncertain significance within CFTR from diverse populations that are heritable and a likely cause of CF that have been understudied due to nonexistence in Caucasian populations. In addition, 15 sites within the genome are known to modulate CF pathology, where we have identified one genome region (chr11:34754985-34836401) that contributes to CF through modulation of expression of a noncoding RNA in epithelial cells. These 15 sites are just the beginning of understanding comodifiers of CF, where utilization of eQTLs suggests many additional genomics of CFTR expressing cells that can be influenced by genomic background of CFTR variants. This work highlights that many additional insights of CF genetics are needed, particularly as pharmaceutical interventions increase in the coming years.
Collapse
Affiliation(s)
- Morgan Sanders
- Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, 400 Monroe Ave NW, Grand Rapids, MI, 49503, USA
| | - James M J Lawlor
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Xiaopeng Li
- Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, 400 Monroe Ave NW, Grand Rapids, MI, 49503, USA
| | - John N Schuen
- Pediatric Pulmonology, Helen DeVos Children's Hospital, Grand Rapids, MI, 49503, USA
| | - Susan L Millard
- Pediatric Pulmonology, Helen DeVos Children's Hospital, Grand Rapids, MI, 49503, USA
| | - Xi Zhang
- Department of Pediatrics, Division of Pulmonary Medicine, National Jewish Health, Denver, CO, 80206, USA
| | - Leah Buck
- Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, 400 Monroe Ave NW, Grand Rapids, MI, 49503, USA
- Department of Mathematics, University of North Alabama, Florence, AL, 35632, USA
| | - Bethany Grysko
- Spectrum Health Medical Genetics, Grand Rapids, MI, 49503, USA
| | - Katie L Uhl
- Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, 400 Monroe Ave NW, Grand Rapids, MI, 49503, USA
| | - David Hinds
- Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, 400 Monroe Ave NW, Grand Rapids, MI, 49503, USA
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Cynthia L Stenger
- Department of Mathematics, University of North Alabama, Florence, AL, 35632, USA
| | - Michele Morris
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Neil Lamb
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Hara Levy
- Department of Pediatrics, Division of Pulmonary Medicine, National Jewish Health, Denver, CO, 80206, USA
| | - Caleb Bupp
- Spectrum Health Medical Genetics, Grand Rapids, MI, 49503, USA
| | - Jeremy W Prokop
- Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, 400 Monroe Ave NW, Grand Rapids, MI, 49503, USA.
- Department of Pharmacology and Toxicology, Michigan State University, East Lansing, MI, 48824, USA.
| |
Collapse
|
4
|
Gong Y, Kang NK, Kim YU, Wang Z, Wei L, Xin Y, Shen C, Wang Q, You W, Lim JM, Jeong SW, Park YI, Oh HM, Pan K, Poliner E, Yang G, Li-Beisson Y, Li Y, Hu Q, Poetsch A, Farre EM, Chang YK, Jeong WJ, Jeong BR, Xu J. The NanDeSyn database for Nannochloropsis systems and synthetic biology. Plant J 2020; 104:1736-1745. [PMID: 33103271 DOI: 10.1111/tpj.15025] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Revised: 09/10/2020] [Accepted: 09/23/2020] [Indexed: 06/11/2023]
Abstract
Nannochloropsis species, unicellular industrial oleaginous microalgae, are model organisms for microalgal systems and synthetic biology. To facilitate community-based annotation and mining of the rapidly accumulating functional genomics resources, we have initiated an international consortium and present a comprehensive multi-omics resource database named Nannochloropsis Design and Synthesis (NanDeSyn; http://nandesyn.single-cell.cn). Via the Tripal toolkit, it features user-friendly interfaces hosting genomic resources with gene annotations and transcriptomic and proteomic data for six Nannochloropsis species, including two updated genomes of Nannochloropsis oceanica IMET1 and Nannochloropsis salina CCMP1776. Toolboxes for search, Blast, synteny view, enrichment analysis, metabolic pathway analysis, a genome browser, etc. are also included. In addition, functional validation of genes is indicated based on phenotypes of mutants and relevant bibliography. Furthermore, epigenomic resources are also incorporated, especially for sequencing of small RNAs including microRNAs and circular RNAs. Such comprehensive and integrated landscapes of Nannochloropsis genomics and epigenomics will promote and accelerate community efforts in systems and synthetic biology of these industrially important microalgae.
Collapse
Affiliation(s)
- Yanhai Gong
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
| | - Nam K Kang
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Korea
- Carl R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL, 61801, USA
| | - Young U Kim
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Korea
| | - Zengbin Wang
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
| | - Li Wei
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
| | - Yi Xin
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
| | - Chen Shen
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
| | - Qintao Wang
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
| | - Wuxin You
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- Department of Plant Biochemistry, Ruhr University Bochum, Bochum, Germany
| | - Jong-Min Lim
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Korea
| | - Suk-Won Jeong
- Department of Biological Sciences, Chungnam National University, Daejeon, 34134, Korea
| | - Youn-Il Park
- Department of Biological Sciences, Chungnam National University, Daejeon, 34134, Korea
| | - Hee-Mock Oh
- Cell Factory Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, 34141, Korea
| | - Kehou Pan
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
- Laboratory of Applied Microalgae, College of Fisheries, Ocean University of China, Qingdao, 266003, China
| | - Eric Poliner
- MSU-DOE Plant Research Laboratory, Michigan State University, East Lansing, MI, 48824, USA
| | - Guanpin Yang
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
- College of Marine Life Sciences, Ocean University of China, Qingdao, Shandong, 266003, China
- Institutes of Evolution and Marine Biodiversity, Ocean University of China, Qingdao, Shandong, 266003, China
| | - Yonghua Li-Beisson
- Aix Marseille Univ, CEA, CNRS, Institut de Biosciences et Biotechnologies Aix-Marseille, CEA Cadarache, 13108, Saint Paul-Lez-Durance, France
| | - Yantao Li
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, University of Maryland, Baltimore County, Baltimore, MD, 21202, USA
| | - Qiang Hu
- Center for Microalgal Biotechnology and Biofuels, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China
| | - Ansgar Poetsch
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
- Department of Plant Biochemistry, Ruhr University Bochum, Bochum, Germany
- College of Marine Life Sciences, Ocean University of China, Qingdao, Shandong, 266003, China
| | - Eva M Farre
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Yong K Chang
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Korea
| | - Won-Joong Jeong
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Korea
| | - Byeong-Ryool Jeong
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Korea
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, 44919, Korea
| | - Jian Xu
- Single-Cell Center, CAS Key Laboratory of Biofuels and Shandong Key Laboratory of Energy Genetics, Shandong Institute of Energy Research, Qingdao Institute of BioEnergy and Bioprocess Technology (QIBEBT), Chinese Academy of Sciences, Qingdao, Shandong, 266101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266237, China
| |
Collapse
|
5
|
Wain KE, Palen E, Savatt JM, Shuman D, Finucane B, Seeley A, Challman TD, Myers SM, Martin CL. The value of genomic variant ClinVar submissions from clinical providers: Beyond the addition of novel variants. Hum Mutat 2018; 39:1660-1667. [PMID: 30311381 PMCID: PMC6190575 DOI: 10.1002/humu.23607] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Revised: 06/28/2018] [Accepted: 08/02/2018] [Indexed: 12/15/2022]
Abstract
With the increasing use of clinical genomic testing across broad medical disciplines, the need for data sharing and curation efforts to improve variant interpretation is paramount. The National Center for Biotechnology Information (NCBI) ClinVar database facilitates these efforts by serving as a repository for clinical assertions about genomic variants and associations with disease. Most variant submissions are from clinical laboratories, which may lack clinical details. Laboratories may also choose not to submit all variants. Clinical providers can contribute to variant interpretation improvements by submitting variants to ClinVar with their own assertions and supporting evidence. The medical genetics team at Geisinger's Autism & Developmental Medicine Institute routinely reviews the clinical significance of all variants obtained through clinical genomic testing, using published ACMG/AMP guidelines, clinical correlation, and post-test clinical data. We describe the submission of 148 sequence and 155 copy number variants to ClinVar as "provider interpretations." Of these, 192 (63.4%) were novel to ClinVar. Detailed clinical data were provided for 298 (98.3%), and when available, segregation data and follow-up clinical correlation or testing was included. This contribution marks the first large-scale submission from a neurodevelopmental clinical setting and illustrates the importance of clinical providers in collaborative efforts to improve variant interpretation.
Collapse
Affiliation(s)
- Karen E Wain
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Emily Palen
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Juliann M Savatt
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Devin Shuman
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Brenda Finucane
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Andrea Seeley
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Thomas D Challman
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Scott M Myers
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| | - Christa Lese Martin
- Autism & Developmental Medicine Institute, Geisinger, Lewisburg, Pennsylvania
| |
Collapse
|
6
|
Abstract
MetReS (Metabolic Reconstruction Server) is a genomic database that is shared between two software applications that address important biological problems. Biblio-MetReS is a data-mining tool that enables the reconstruction of molecular networks based on automated text-mining analysis of published scientific literature. Homol-MetReS allows functional (re)annotation of proteomes, to properly identify both the individual proteins involved in the processes of interest and their function. The main goal of this work was to identify the areas where the performance of the MetReS database performance could be improved and to test whether this improvement would scale to larger datasets and more complex types of analysis. The study was started with a relational database, MySQL, which is the current database server used by the applications. We also tested the performance of an alternative data-handling framework, Apache Hadoop. Hadoop is currently used for large-scale data processing. We found that this data handling framework is likely to greatly improve the efficiency of the MetReS applications as the dataset and the processing needs increase by several orders of magnitude, as expected to happen in the near future.
Collapse
Affiliation(s)
- Jordi Vilaplana
- 1 Department of Computer Science and INSPIRES, University of Lleida , Lleida, Spain
| | - Rui Alves
- 2 Department of Basic Medical Sciences and IRBLleida, University of Lleida , Lleida, Spain
| | - Francesc Solsona
- 1 Department of Computer Science and INSPIRES, University of Lleida , Lleida, Spain
| | - Jordi Mateo
- 1 Department of Computer Science and INSPIRES, University of Lleida , Lleida, Spain
| | - Ivan Teixidó
- 1 Department of Computer Science and INSPIRES, University of Lleida , Lleida, Spain
| | - Marc Pifarré
- 1 Department of Computer Science and INSPIRES, University of Lleida , Lleida, Spain
| |
Collapse
|
7
|
Poczai P, Varga I, Laos M, Cseh A, Bell N, Valkonen JPT, Hyvönen J. Advances in plant gene-targeted and functional markers: a review. Plant Methods 2013; 9:6. [PMID: 23406322 PMCID: PMC3583794 DOI: 10.1186/1746-4811-9-6] [Citation(s) in RCA: 105] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2012] [Accepted: 02/05/2013] [Indexed: 05/03/2023]
Abstract
Public genomic databases have provided new directions for molecular marker development and initiated a shift in the types of PCR-based techniques commonly used in plant science. Alongside commonly used arbitrarily amplified DNA markers, other methods have been developed. Targeted fingerprinting marker techniques are based on the well-established practices of arbitrarily amplified DNA methods, but employ novel methodological innovations such as the incorporation of gene or promoter elements in the primers. These markers provide good reproducibility and increased resolution by the concurrent incidence of dominant and co-dominant bands. Despite their promising features, these semi-random markers suffer from possible problems of collision and non-homology analogous to those found with randomly generated fingerprints. Transposable elements, present in abundance in plant genomes, may also be used to generate fingerprints. These markers provide increased genomic coverage by utilizing specific targeted sites and produce bands that mostly seem to be homologous. The biggest drawback with most of these techniques is that prior genomic information about retrotransposons is needed for primer design, prohibiting universal applications. Another class of recently developed methods exploits length polymorphism present in arrays of multi-copy gene families such as cytochrome P450 and β-tubulin genes to provide cross-species amplification and transferability. A specific class of marker makes use of common features of plant resistance genes to generate bands linked to a given phenotype, or to reveal genetic diversity. Conserved DNA-based strategies have limited genome coverage and may fail to reveal genetic diversity, while resistance genes may be under specific evolutionary selection. Markers may also be generated from functional and/or transcribed regions of the genome using different gene-targeting approaches coupled with the use of RNA information. Such techniques have the potential to generate phenotypically linked functional markers, especially when fingerprints are generated from the transcribed or expressed region of the genome. It is to be expected that these recently developed techniques will generate larger datasets, but their shortcomings should also be acknowledged and carefully investigated.
Collapse
Affiliation(s)
- Péter Poczai
- Plant Biology, Department of Biosciences, University of Helsinki, PO Box 65, 00014, Helsinki, FIN, Finland
| | - Ildikó Varga
- Plant Biology, Department of Biosciences, University of Helsinki, PO Box 65, 00014, Helsinki, FIN, Finland
| | - Maarja Laos
- Institute of Biotechnology, University of Helsinki, PO Box 65, 00014, Helsinki, FIN, Finland
| | - András Cseh
- Agricultural Institute, Centre of Agricultural Research, Hungarian Academy of Sciences, PO Box 19, H-2462, Martonvásár, Hungary
| | - Neil Bell
- Plant Biology, Department of Biosciences, University of Helsinki, PO Box 65, 00014, Helsinki, FIN, Finland
- Botanical Museum, University of Helsinki, PO Box 7, 00014, Helsinki, FIN, Finland
| | - Jari PT Valkonen
- Department of Agricultural Sciences, University of Helsinki, PO Box 27, 00014, Helsinki, FIN, Finland
| | - Jaakko Hyvönen
- Plant Biology, Department of Biosciences, University of Helsinki, PO Box 65, 00014, Helsinki, FIN, Finland
| |
Collapse
|