1
|
Giardine BM, Joly P, Pissard S, Wajcman H, K Chui DH, Hardison RC, Patrinos GP. Clinically relevant updates of the HbVar database of human hemoglobin variants and thalassemia mutations. Nucleic Acids Res 2021; 49:D1192-D1196. [PMID: 33125055 PMCID: PMC7778921 DOI: 10.1093/nar/gkaa959] [Citation(s) in RCA: 57] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/05/2020] [Accepted: 10/07/2020] [Indexed: 11/21/2022] Open
Abstract
HbVar (http://globin.bx.psu.edu/hbvar) is a widely-used locus-specific database (LSDB) launched 20 years ago by a multi-center academic effort to provide timely information on the numerous genomic variants leading to hemoglobin variants and all types of thalassemia and hemoglobinopathies. Here, we report several advances for the database. We made clinically relevant updates of HbVar, implemented as additional querying options in the HbVar query page, allowing the user to explore the clinical phenotype of compound heterozygous patients. We also made significant improvements to the HbVar front page, making comparative data querying, analysis and output more user-friendly. We continued to expand and enrich the regular data content, involving 1820 variants, 230 of which are new entries. We also increased the querying potential and expanded the usefulness of HbVar database in the clinical setting. These several additions, expansions and updates should improve the utility of HbVar both for the globin research community and in a clinical setting.
Collapse
Affiliation(s)
- Belinda M Giardine
- The Pennsylvania State University, Center for Computational Biology and Bioinformatics, University Park, PA, USA
| | - Philippe Joly
- Biochimie des pathologies érythrocytaires, Laboratoire de Biochimie et Biologie Moléculaire Grand-Est, Groupement hospitalier Est, Hospices Civils de Lyon, Bron, France.,Laboratoire Interuniversitaire de Biologie de la Motricité (LIBM) EA7424, Equipe "Biologie vasculaire et du globule rouge'', Université Claude Bernard Lyon 1, COMUE Lyon, France
| | - Serge Pissard
- Assistance Publique Hopitaux de Paris), Department of Genetics GHU (Groupe Hospitalier Universitaire Henri Mondor) H. Mondor and Institut Mondor de Recherche biomedicale - INSERM U955 eq2, Creteil France
| | | | - David H K Chui
- Boston University School of Medicine, Department of Medicine, Pathology and Laboratory Medicine, Boston, MA, USA
| | - Ross C Hardison
- The Pennsylvania State University, Center for Computational Biology and Bioinformatics, University Park, PA, USA.,Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | - George P Patrinos
- University of Patras, School of Health Sciences, Department of Pharmacy, Laboratory of Pharmacogenomics and Individualized Therapy, Patras, Greece.,Erasmus University Medical Center Rotterdam, Faculty of Medicine and Health Sciences, Department of Pathology, Bioinformatics Unit, Rotterdam, the Netherlands.,United Arab Emirates University, College of Medicine and Health Sciences, Department of Pathology, Al-Ain, UAE.,United Arab Emirates University, Zayed Center of Health Sciences, Al-Ain, UAE
| |
Collapse
|
2
|
Kounelis F, Kanterakis A, Kanavos A, Pandi MT, Kordou Z, Manusama O, Vonitsanos G, Katsila T, Tsermpini EE, Lauschke VM, Koromina M, van der Spek PJ, Patrinos GP. Documentation of clinically relevant genomic biomarker allele frequencies in the next-generation FINDbase worldwide database. Hum Mutat 2020; 41:1112-1122. [PMID: 32248568 DOI: 10.1002/humu.24018] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 02/25/2020] [Accepted: 03/22/2020] [Indexed: 01/24/2023]
Abstract
FINDbase (http://www.findbase.org) is a comprehensive data resource recording the prevalence of clinically relevant genomic variants in various populations worldwide, such as pathogenic variants underlying genetic disorders as well as pharmacogenomic biomarkers that can guide drug treatment. Here, we report significant new developments and technological advancements in the database architecture, leading to a completely revamped database structure, querying interface, accompanied with substantial extensions of data content and curation. In particular, the FINDbase upgrade further improves the user experience by introducing responsive features that support a wide variety of mobile and stationary devices, while enhancing computational runtime due to the use of a modern Javascript framework such as ReactJS. Data collection is significantly enriched, with the data records being divided in a Public and Private version, the latter being accessed on the basis of data contribution, according to the microattribution approach, while the front end was redesigned to support the new functionalities and querying tools. The abovementioned updates further enhance the impact of FINDbase, improve the overall user experience, facilitate further data sharing by microattribution, and strengthen the role of FINDbase as a key resource for personalized medicine applications and personalized public health.
Collapse
Affiliation(s)
- Fotios Kounelis
- Department of Computer Engineering and Informatics, Faculty of Engineering, University of Patras, Patras, Greece
| | - Alexandros Kanterakis
- Biomedical Informatics Laboratory, Foundation of Research and Technology Hellas, Heraklion, Greece.,Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Andreas Kanavos
- Department of Computer Engineering and Informatics, Faculty of Engineering, University of Patras, Patras, Greece.,Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Maria-Theodora Pandi
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece.,Bioinformatics Unit, Department of Pathology, Faculty of Medicine and Health Sciences, Medical Center, Erasmus University, Rotterdam, The Netherlands
| | - Zoe Kordou
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Olivia Manusama
- Department of Immunology, Faculty of Medicine and Health Sciences, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Gerasimos Vonitsanos
- Department of Computer Engineering and Informatics, Faculty of Engineering, University of Patras, Patras, Greece.,Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Theodora Katsila
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | | | - Volker M Lauschke
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden
| | - Maria Koromina
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Peter J van der Spek
- Bioinformatics Unit, Department of Pathology, Faculty of Medicine and Health Sciences, Medical Center, Erasmus University, Rotterdam, The Netherlands
| | - George P Patrinos
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece.,Bioinformatics Unit, Department of Pathology, Faculty of Medicine and Health Sciences, Medical Center, Erasmus University, Rotterdam, The Netherlands.,Zayed Center of Health Sciences, United Arab Emirates University, Al-Ain, United Arab Emirates.,Department of Pathology, College of Medicine and Health Sciences, United Arab Emirates University, Al-Ain, United Arab Emirates
| |
Collapse
|
3
|
Gradishar W, Johnson K, Brown K, Mundt E, Manley S. Clinical Variant Classification: A Comparison of Public Databases and a Commercial Testing Laboratory. Oncologist 2017; 22:797-803. [PMID: 28408614 PMCID: PMC5507641 DOI: 10.1634/theoncologist.2016-0431] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Accepted: 01/05/2017] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND There is a growing move to consult public databases following receipt of a genetic test result from a clinical laboratory; however, the well-documented limitations of these databases call into question how often clinicians will encounter discordant variant classifications that may introduce uncertainty into patient management. Here, we evaluate discordance in BRCA1 and BRCA2 variant classifications between a single commercial testing laboratory and a public database commonly consulted in clinical practice. MATERIALS AND METHODS BRCA1 and BRCA2 variant classifications were obtained from ClinVar and compared with the classifications from a reference laboratory. Full concordance and discordance were determined for variants whose ClinVar entries were of the same pathogenicity (pathogenic, benign, or uncertain). Variants with conflicting ClinVar classifications were considered partially concordant if ≥1 of the listed classifications agreed with the reference laboratory classification. RESULTS Four thousand two hundred and fifty unique BRCA1 and BRCA2 variants were available for analysis. Overall, 73.2% of classifications were fully concordant and 12.3% were partially concordant. The remaining 14.5% of variants had discordant classifications, most of which had a definitive classification (pathogenic or benign) from the reference laboratory compared with an uncertain classification in ClinVar (14.0%). CONCLUSION Here, we show that discrepant classifications between a public database and single reference laboratory potentially account for 26.7% of variants in BRCA1 and BRCA2. The time and expertise required of clinicians to research these discordant classifications call into question the practicality of checking all test results against a database and suggest that discordant classifications should be interpreted with these limitations in mind. IMPLICATIONS FOR PRACTICE With the increasing use of clinical genetic testing for hereditary cancer risk, accurate variant classification is vital to ensuring appropriate medical management. There is a growing move to consult public databases following receipt of a genetic test result from a clinical laboratory; however, we show that up to 26.7% of variants in BRCA1 and BRCA2 have discordant classifications between ClinVar and a reference laboratory. The findings presented in this paper serve as a note of caution regarding the utility of database consultation.
Collapse
Affiliation(s)
- William Gradishar
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
- Myriad Genetic Laboratories, Inc., Salt Lake City, Utah, USA
| | - KariAnne Johnson
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
- Myriad Genetic Laboratories, Inc., Salt Lake City, Utah, USA
| | - Krystal Brown
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
- Myriad Genetic Laboratories, Inc., Salt Lake City, Utah, USA
| | - Erin Mundt
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
- Myriad Genetic Laboratories, Inc., Salt Lake City, Utah, USA
| | - Susan Manley
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
- Myriad Genetic Laboratories, Inc., Salt Lake City, Utah, USA
| |
Collapse
|
4
|
Viennas E, Komianou A, Mizzi C, Stojiljkovic M, Mitropoulou C, Muilu J, Vihinen M, Grypioti P, Papadaki S, Pavlidis C, Zukic B, Katsila T, van der Spek PJ, Pavlovic S, Tzimas G, Patrinos GP. Expanded national database collection and data coverage in the FINDbase worldwide database for clinically relevant genomic variation allele frequencies. Nucleic Acids Res 2016; 45:D846-D853. [PMID: 27924022 PMCID: PMC5210643 DOI: 10.1093/nar/gkw949] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Accepted: 10/12/2016] [Indexed: 01/20/2023] Open
Abstract
FINDbase (http://www.findbase.org) is a comprehensive data repository that records the prevalence of clinically relevant genomic variants in various populations worldwide, such as pathogenic variants leading mostly to monogenic disorders and pharmacogenomics biomarkers. The database also records the incidence of rare genetic diseases in various populations, all in well-distinct data modules. Here, we report extensive data content updates in all data modules, with direct implications to clinical pharmacogenomics. Also, we report significant new developments in FINDbase, namely (i) the release of a new version of the ETHNOS software that catalyzes development curation of national/ethnic genetic databases, (ii) the migration of all FINDbase data content into 90 distinct national/ethnic mutation databases, all built around Microsoft's PivotViewer (http://www.getpivot.com) software (iii) new data visualization tools and (iv) the interrelation of FINDbase with DruGeVar database with direct implications in clinical pharmacogenomics. The abovementioned updates further enhance the impact of FINDbase, as a key resource for Genomic Medicine applications.
Collapse
Affiliation(s)
- Emmanouil Viennas
- University of Patras, Faculty of Engineering, Department of Computer Engineering and Informatics, GR-26504, Patras, Greece
| | - Angeliki Komianou
- Department of Pharmacy, School of Health Sciences, University of Patras, GR-26504, Patras, Greece
| | - Clint Mizzi
- Erasmus University Medical Center, Faculty of Medicine and Health Sciences, Department of Bioinformatics, NL-3015 CN, Rotterdam, The Netherlands.,University of Malta, Faculty of Medicine and Surgery, Department of Physiology and Biochemistry, MSD 2090, Malta
| | - Maja Stojiljkovic
- Institute of Molecular Genetics and Genetic Engineering, University of Belgrade, Laboratory of Molecular Biomedicine, 11010, Belgrade, Serbia
| | | | - Juha Muilu
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, FI-00014, Helsinki, Finland
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, SE-22100, Lund, Sweden
| | - Panagiota Grypioti
- Department of Pharmacy, School of Health Sciences, University of Patras, GR-26504, Patras, Greece
| | - Styliani Papadaki
- Department of Pharmacy, School of Health Sciences, University of Patras, GR-26504, Patras, Greece
| | - Cristiana Pavlidis
- Department of Pharmacy, School of Health Sciences, University of Patras, GR-26504, Patras, Greece
| | - Branka Zukic
- University of Malta, Faculty of Medicine and Surgery, Department of Physiology and Biochemistry, MSD 2090, Malta
| | - Theodora Katsila
- Department of Pharmacy, School of Health Sciences, University of Patras, GR-26504, Patras, Greece
| | - Peter J van der Spek
- Erasmus University Medical Center, Faculty of Medicine and Health Sciences, Department of Bioinformatics, NL-3015 CN, Rotterdam, The Netherlands
| | - Sonja Pavlovic
- University of Malta, Faculty of Medicine and Surgery, Department of Physiology and Biochemistry, MSD 2090, Malta
| | - Giannis Tzimas
- Department of Computer and Informatics Engineering, Technological Educational Institute of Western Greece, GR-30020, Patras, Greece
| | - George P Patrinos
- Department of Pharmacy, School of Health Sciences, University of Patras, GR-26504, Patras, Greece .,Erasmus University Medical Center, Faculty of Medicine and Health Sciences, Department of Bioinformatics, NL-3015 CN, Rotterdam, The Netherlands
| |
Collapse
|
5
|
Niroula A, Vihinen M. Variation Interpretation Predictors: Principles, Types, Performance, and Choice. Hum Mutat 2016; 37:579-97. [DOI: 10.1002/humu.22987] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2015] [Accepted: 03/07/2016] [Indexed: 12/18/2022]
Affiliation(s)
- Abhishek Niroula
- Department of Experimental Medical Science; Lund University; BMC B13 Lund SE-22184 Sweden
| | - Mauno Vihinen
- Department of Experimental Medical Science; Lund University; BMC B13 Lund SE-22184 Sweden
| |
Collapse
|
6
|
Dalgleish R. LSDBs and How They Have Evolved. Hum Mutat 2016; 37:532-9. [DOI: 10.1002/humu.22979] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 02/18/2016] [Indexed: 01/10/2023]
Affiliation(s)
- Raymond Dalgleish
- Department of Genetics; University of Leicester; Leicester United Kingdom
| |
Collapse
|
7
|
Savige J, Dalgleish R, Cotton RG, den Dunnen JT, Macrae F, Povey S. The Human Variome Project: ensuring the quality of DNA variant databases in inherited renal disease. Pediatr Nephrol 2015; 30:1893-901. [PMID: 25384529 DOI: 10.1007/s00467-014-2994-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2014] [Revised: 10/09/2014] [Accepted: 10/15/2014] [Indexed: 02/02/2023]
Abstract
A recent review identified 60 common inherited renal diseases caused by DNA variants in 132 different genes. These diseases can be diagnosed with DNA sequencing, but each gene probably also has a thousand normal variants. Many more normal variants have been characterised by individual laboratories than are reported in the literature or found in publicly accessible collections. At present, testing laboratories must assess each novel change they identify for pathogenicity, even when this has been done elsewhere previously, and the distinction between normal and disease-associated variants is particularly an issue with the recent surge in exomic sequencing and gene discovery projects. The Human Variome Project recommends the establishment of gene-specific DNA variant databases to facilitate the sharing of DNA variants and decisions about likely disease causation. Databases improve diagnostic accuracy and testing efficiency, and reduce costs. They also help with genotype-phenotype correlations and predictive algorithms. The Human Variome Project advocates databases that use standardised descriptions, are up-to-date, include clinical information and are freely available. Currently, the genes affected in the most common inherited renal diseases correspond to 350 different variant databases, many of which are incomplete or have insufficient clinical details for genotype-phenotype correlations. Assistance is needed from nephrologists to maximise the usefulness of these databases for the diagnosis and management of inherited renal disease.
Collapse
Affiliation(s)
- Judy Savige
- The University of Melbourne, Melbourne Health, Melbourne, Australia. .,Department of Medicine, Royal Melbourne Hospital, The University of Melbourne, Parkville, VIC, 3050, Australia.
| | | | - Richard Gh Cotton
- Human Variome Project, The University of Melbourne, Melbourne, Australia
| | - Johan T den Dunnen
- Human and Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Finlay Macrae
- The University of Melbourne, Melbourne Health, Melbourne, Australia.,Colorectal Medicine and Genetics, The Royal Melbourne Hospital, Parkville, Australia
| | - Sue Povey
- Research Department of Genetics, Evolution and Environment, University College London, London, UK
| |
Collapse
|
8
|
Comparison of locus-specific databases for BRCA1 and BRCA2 variants reveals disparity in variant classification within and among databases. J Community Genet 2015; 6:351-9. [PMID: 25782689 PMCID: PMC4567983 DOI: 10.1007/s12687-015-0220-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Accepted: 02/18/2015] [Indexed: 10/27/2022] Open
Abstract
Genetic variants of uncertain clinical significance (VUSs) are a common outcome of clinical genetic testing. Locus-specific variant databases (LSDBs) have been established for numerous disease-associated genes as a research tool for the interpretation of genetic sequence variants to facilitate variant interpretation via aggregated data. If LSDBs are to be used for clinical practice, consistent and transparent criteria regarding the deposition and interpretation of variants are vital, as variant classifications are often used to make important and irreversible clinical decisions. In this study, we performed a retrospective analysis of 2017 consecutive BRCA1 and BRCA2 genetic variants identified from 24,650 consecutive patient samples referred to our laboratory to establish an unbiased dataset representative of the types of variants seen in the US patient population, submitted by clinicians and researchers for BRCA1 and BRCA2 testing. We compared the clinical classifications of these variants among five publicly accessible BRCA1 and BRCA2 variant databases: BIC, ClinVar, HGMD (paid version), LOVD, and the UMD databases. Our results show substantial disparity of variant classifications among publicly accessible databases. Furthermore, it appears that discrepant classifications are not the result of a single outlier but widespread disagreement among databases. This study also shows that databases sometimes favor a clinical classification when current best practice guidelines (ACMG/AMP/CAP) would suggest an uncertain classification. Although LSDBs have been well established for research applications, our results suggest several challenges preclude their wider use in clinical practice.
Collapse
|
9
|
Cutting GR. Annotating DNA variants is the next major goal for human genetics. Am J Hum Genet 2014; 94:5-10. [PMID: 24387988 PMCID: PMC3882730 DOI: 10.1016/j.ajhg.2013.12.008] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2013] [Indexed: 12/29/2022] Open
Abstract
Clinical genetic testing has undergone a dramatic transformation in the past two decades. Diagnostic laboratories that previously tested for well-established disease-causing DNA variants in a handful of genes have evolved into sequencing factories identifying thousands of variants of known and unknown medical consequence. Sorting out what does and does not cause disease in our genomes is the next great challenge in making genetics a central feature of healthcare. I propose that closing the gap in our ability to interpret variation responsible for Mendelian disorders provides a grand and unprecedented opportunity for geneticists. Human geneticists are well placed to coordinate a systematic evaluation of variants in collaboration with basic scientists and clinicians. Sharing of knowledge, data, methods, and tools will aid both researchers and healthcare workers in achieving their common goal of defining the pathogenic potential of variants. Generation of variant annotations will inform genetic testing and will deepen our understanding of gene and protein function, thereby aiding the search for molecular targeted therapies.
Collapse
Affiliation(s)
- Garry R Cutting
- McKusick-Nathans Institute of Genetic Medicine and Department of Pediatrics, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.
| |
Collapse
|
10
|
Abstract
In this chapter we aim to provide an overview of DNA variant databases, commonly known as Locus-Specific Databases (LSDBs), or Gene-Disease Specific Databases (GDSDBs), but the term variant database will be used for simplicity. We restrict this overview to germ-line variants, particularly as related to Mendelian diseases, which are diseases caused by a variant in a single gene. Common difficulties associated with variant databases and some proposed solutions are reviewed. Finally, systems where technical solutions have been implemented are discussed. This work will be useful for anyone wishing to establish their own variant database, or to learn about the global picture of variant databases, and the technical challenges to be overcome.
Collapse
Affiliation(s)
- John-Paul Plazzer
- Department of Colorectal Medicine and Genetics, Royal Melbourne Hospital, RMH, Grattan Street, Parkville, VIC, 3050, Australia,
| | | |
Collapse
|
11
|
Giardine B, Borg J, Viennas E, Pavlidis C, Moradkhani K, Joly P, Bartsakoulia M, Riemer C, Miller W, Tzimas G, Wajcman H, Hardison RC, Patrinos GP. Updates of the HbVar database of human hemoglobin variants and thalassemia mutations. Nucleic Acids Res 2013; 42:D1063-9. [PMID: 24137000 PMCID: PMC3964999 DOI: 10.1093/nar/gkt911] [Citation(s) in RCA: 327] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
HbVar (http://globin.bx.psu.edu/hbvar) is one of the oldest and most appreciated locus-specific databases launched in 2001 by a multi-center academic effort to provide timely information on the genomic alterations leading to hemoglobin variants and all types of thalassemia and hemoglobinopathies. Database records include extensive phenotypic descriptions, biochemical and hematological effects, associated pathology and ethnic occurrence, accompanied by mutation frequencies and references. Here, we report updates to >600 HbVar entries, inclusion of population-specific data for 28 populations and 27 ethnic groups for α-, and β-thalassemias and additional querying options in the HbVar query page. HbVar content was also inter-connected with two other established genetic databases, namely FINDbase (http://www.findbase.org) and Leiden Open-Access Variation database (http://www.lovd.nl), which allows comparative data querying and analysis. HbVar data content has contributed to the realization of two collaborative projects to identify genomic variants that lie on different globin paralogs. Most importantly, HbVar data content has contributed to demonstrate the microattribution concept in practice. These updates significantly enriched the database content and querying potential, enhanced the database profile and data quality and broadened the inter-relation of HbVar with other databases, which should increase the already high impact of this resource to the globin and genetic database community.
Collapse
Affiliation(s)
- Belinda Giardine
- The Pennsylvania State University, Center for Comparative Genomics and Bioinformatics, University Park, PA, USA, Department of Applied Biomedical Science, Faculty of Health Sciences, University of Malta, Msida, Malta, MGC-Department of Cell Biology and Genetics, Erasmus MC, Faculty of Medicine and Health Sciences, Rotterdam, The Netherlands, Department of Computer Engineering and Informatics, University of Patras, Faculty of Engineering, Patras, Greece, Department of Pharmacy, University of Patras, School of Health Sciences, Patras, Greece, Department of Medical Genetics, Laboratory of Cytogenetics, Institute of Biology, Nantes, France, Hôpital Edouard Herriot, Unité de Pathologie Moléculaire du Globule Rouge, Lyon, France, Department of Computer and Informatics Engineering, Technological Educational Institute of Western Greece, Patras, Greece, INSERM U955, CHU Henri Mondor, Creteil, France and Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
12
|
Clarity against the odds: standards for describing DNA sequence variants. Pathology 2012; 45:101-3. [PMID: 23250040 DOI: 10.1097/pat.0b013e32835c882b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
13
|
Sobrido MJ, Cacheiro P, Carracedo A, Bertram L. Databases for neurogenetics: introduction, overview, and challenges. Hum Mutat 2012; 33:1311-4. [PMID: 22890789 DOI: 10.1002/humu.22164] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The importance for research and clinical utility of mutation databases, as well as the issues and difficulties entailed in their construction, is discussed within the Human Variome Project. While general principles and standards can apply to most human diseases, some specific questions arise when dealing with the nature of genetic neurological disorders. So far, publically accessible mutation databases exist for only about half of the genes causing neurogenetic disorders; and a considerable work is clearly still needed to optimize their content. The current landscape, main challenges, some potential solutions, and future perspectives on genetic databases for disorders of the nervous system are reviewed in this special issue of Human Mutation on neurogenetics.
Collapse
Affiliation(s)
- María-Jesús Sobrido
- Fundación Pública Galega de Medicina Xenómica-SERGAS, Santiago de Compostela, Galicia, Spain.
| | | | | | | |
Collapse
|
14
|
Pandey KR, Maden N, Poudel B, Pradhananga S, Sharma AK. The curation of genetic variants: difficulties and possible solutions. GENOMICS PROTEOMICS & BIOINFORMATICS 2012; 10:317-25. [PMID: 23317699 PMCID: PMC5054708 DOI: 10.1016/j.gpb.2012.06.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 02/05/2012] [Revised: 05/27/2012] [Accepted: 06/20/2012] [Indexed: 11/15/2022]
Abstract
The curation of genetic variants from biomedical articles is required for various clinical and research purposes. Nowadays, establishment of variant databases that include overall information about variants is becoming quite popular. These databases have immense utility, serving as a user-friendly information storehouse of variants for information seekers. While manual curation is the gold standard method for curation of variants, it can turn out to be time-consuming on a large scale thus necessitating the need for automation. Curation of variants described in biomedical literature may not be straightforward mainly due to various nomenclature and expression issues. Though current trends in paper writing on variants is inclined to the standard nomenclature such that variants can easily be retrieved, we have a massive store of variants in the literature that are present as non-standard names and the online search engines that are predominantly used may not be capable of finding them. For effective curation of variants, knowledge about the overall process of curation, nature and types of difficulties in curation, and ways to tackle the difficulties during the task are crucial. Only by effective curation, can variants be correctly interpreted. This paper presents the process and difficulties of curation of genetic variants with possible solutions and suggestions from our work experience in the field including literature support. The paper also highlights aspects of interpretation of genetic variants and the importance of writing papers on variants following standard and retrievable methods.
Collapse
Affiliation(s)
- Kapil Raj Pandey
- Deerwalk Services, Kathmandu 44602, Nepal
- Department of Microbiology, Bangalore University, Bangalore 560001, India
| | - Narendra Maden
- Deerwalk Services, Kathmandu 44602, Nepal
- Central Department of Microbiology, Tribhuvan University, Kathmandu 44613, Nepal
- Corresponding author.
| | - Barsha Poudel
- Department of Bioinformatics, Wageningen University, Wageningen 6700 AA-6799 ZZ, Netherlands
| | - Sailendra Pradhananga
- Deerwalk Services, Kathmandu 44602, Nepal
- Department of Biotechnology, Kathmandu University, Kavrepalanchok 45200, Nepal
| | - Amit Kumar Sharma
- Deerwalk Services, Kathmandu 44602, Nepal
- Department of Biotechnology, Kathmandu University, Kavrepalanchok 45200, Nepal
| |
Collapse
|
15
|
Nair PS, Vihinen M. VariBench: A Benchmark Database for Variations. Hum Mutat 2012; 34:42-9. [DOI: 10.1002/humu.22204] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Accepted: 07/31/2012] [Indexed: 12/21/2022]
|
16
|
Byrne M, Fokkema IF, Lancaster O, Adamusiak T, Ahonen-Bishopp A, Atlan D, Béroud C, Cornell M, Dalgleish R, Devereau A, Patrinos GP, Swertz MA, Taschner PE, Thorisson GA, Vihinen M, Brookes AJ, Muilu J. VarioML framework for comprehensive variation data representation and exchange. BMC Bioinformatics 2012; 13:254. [PMID: 23031277 PMCID: PMC3507772 DOI: 10.1186/1471-2105-13-254] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 09/23/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Sharing of data about variation and the associated phenotypes is a critical need, yet variant information can be arbitrarily complex, making a single standard vocabulary elusive and re-formatting difficult. Complex standards have proven too time-consuming to implement. RESULTS The GEN2PHEN project addressed these difficulties by developing a comprehensive data model for capturing biomedical observations, Observ-OM, and building the VarioML format around it. VarioML pairs a simplified open specification for describing variants, with a toolkit for adapting the specification into one's own research workflow. Straightforward variant data can be captured, federated, and exchanged with no overhead; more complex data can be described, without loss of compatibility. The open specification enables push-button submission to gene variant databases (LSDBs) e.g., the Leiden Open Variation Database, using the Cafe Variome data publishing service, while VarioML bidirectionally transforms data between XML and web-application code formats, opening up new possibilities for open source web applications building on shared data. A Java implementation toolkit makes VarioML easily integrated into biomedical applications. VarioML is designed primarily for LSDB data submission and transfer scenarios, but can also be used as a standard variation data format for JSON and XML document databases and user interface components. CONCLUSIONS VarioML is a set of tools and practices improving the availability, quality, and comprehensibility of human variation information. It enables researchers, diagnostic laboratories, and clinics to share that information with ease, clarity, and without ambiguity.
Collapse
Affiliation(s)
- Myles Byrne
- Institute for Molecular Medicine Finland-FIMM, University of Helsinki, Helsinki, Finland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Patrinos GP, Cooper DN, van Mulligen E, Gkantouna V, Tzimas G, Tatum Z, Schultes E, Roos M, Mons B. Microattribution and nanopublication as means to incentivize the placement of human genome variation data into the public domain. Hum Mutat 2012; 33:1503-12. [PMID: 22736453 DOI: 10.1002/humu.22144] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2012] [Accepted: 05/23/2012] [Indexed: 11/07/2022]
Abstract
The advances in bioinformatics required to annotate human genomic variants and to place them in public data repositories have not kept pace with their discovery. Moreover, a law of diminishing returns has begun to operate both in terms of data publication and submission. Although the continued deposition of such data in the public domain is essential to maximize both their scientific and clinical utility, rewards for data sharing are few, representing a serious practical impediment to data submission. To date, two main strategies have been adopted as a means to encourage the submission of human genomic variant data: (1) database journal linkups involving the affiliation of a scientific journal with a publicly available database and (2) microattribution, involving the unambiguous linkage of data to their contributors via a unique identifier. The latter could in principle lead to the establishment of a microcitation-tracking system that acknowledges individual endeavor and achievement. Both approaches could incentivize potential data contributors, thereby encouraging them to share their data with the scientific community. Here, we summarize and critically evaluate approaches that have been proposed to address current deficiencies in data attribution and discuss ways in which they could become more widely adopted as novel scientific publication modalities.
Collapse
Affiliation(s)
- George P Patrinos
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece.
| | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Viennas E, Gkantouna V, Ioannou M, Georgitsi M, Rigou M, Poulas K, Patrinos GP, Tzimas G. Population-ethnic group specific genome variation allele frequency data: a querying and visualization journey. Genomics 2012; 100:93-101. [PMID: 22659238 DOI: 10.1016/j.ygeno.2012.05.009] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Revised: 05/09/2012] [Accepted: 05/21/2012] [Indexed: 01/28/2023]
Abstract
National/ethnic mutation databases aim to document the genetic heterogeneity in various populations and ethnic groups worldwide. We have previously reported the development and upgrade of FINDbase (www.findbase.org), a database recording causative mutations and pharmacogenomic marker allele frequencies in various populations around the globe. Although this database has recently been upgraded, we continuously try to enhance its functionality by providing more advanced visualization tools that would further assist effective data querying and comparisons. We are currently experimenting in various visualization techniques on the existing FINDbase causative mutation data collection aiming to provide a dynamic research tool for the worldwide scientific community. We have developed an interactive web-based application for population-based mutation data retrieval. It supports sophisticated data exploration allowing users to apply advanced filtering criteria upon a set of multiple views of the underlying data collection and enables browsing the relationships between individual datasets in a novel and meaningful way.
Collapse
Affiliation(s)
- Emmanouil Viennas
- Department of Computer Engineering and Informatics, University of Patras, Patras, Greece
| | | | | | | | | | | | | | | |
Collapse
|
19
|
Ortutay C, Vihinen M. Conserved and quickly evolving immunome genes have different evolutionary paths. Hum Mutat 2012; 33:1456-63. [PMID: 22623381 DOI: 10.1002/humu.22125] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2012] [Accepted: 05/15/2012] [Indexed: 12/11/2022]
Abstract
Genetic, transcript, and protein level variations have important functional and evolutionary consequences. We performed systematic data collection and analysis of copy-number variations, single-nucleotide polymorphisms, disease-causing variations, messenger RNA splicing variants, and protein posttranslational modifications for the genes and proteins essential for human immune system. Information about polymorphic and evolutionarily fixed genetic variations was used to group immunome genes to the most conserved and the most quickly changing ones under directed selection during the recent immunome evolution. Gene Ontology terms related to adaptive immunity are associated with gene groups subject to recent directing selection. In addition, several other characteristics of the immunome genes and proteins in these two categories have statistically significant differences. The presented findings question the usability of directed mouse genes as models for human diseases and conditions and shed light on the fine tuning of human immunity and its diverse functions.
Collapse
Affiliation(s)
- Csaba Ortutay
- Institute of Biomedical Technology, University of Tampere, Tampere, Finland
| | | |
Collapse
|
20
|
Humbertclaude V, Hamroun D, Bezzou K, Bérard C, Boespflug-Tanguy O, Bommelaer C, Campana-Salort E, Cances C, Chabrol B, Commare MC, Cuisset JM, de Lattre C, Desnuelle C, Echenne B, Halbert C, Jonquet O, Labarre-Vila A, N'Guyen-Morel MA, Pages M, Pepin JL, Petitjean T, Pouget J, Ollagnon-Roman E, Richelme C, Rivier F, Sacconi S, Tiffreau V, Vuillerot C, Picot MC, Claustres M, Béroud C, Tuffery-Giraud S. Motor and respiratory heterogeneity in Duchenne patients: implication for clinical trials. Eur J Paediatr Neurol 2012; 16:149-60. [PMID: 21920787 DOI: 10.1016/j.ejpn.2011.07.001] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/09/2011] [Revised: 07/13/2011] [Accepted: 07/17/2011] [Indexed: 01/06/2023]
Abstract
AIMS Our objective was to clarify the clinical heterogeneity in Duchenne muscular dystrophy (DMD). METHODS The French dystrophinopathy database provided clinical, histochemical and molecular data of 278 DMD patients (mean longitudinal follow-up: 14.2 years). Diagnosis was based on mutation identification in the DMD gene. Three groups were defined according to the age at ambulation loss: before 8 years (group A); between 8 and 11 years (group B); between 11 and 16 years (group C). RESULTS Motor and respiratory declines were statistically different between the three groups, as opposed to heart involvement. When acquired, running ability was lost at the mean age of 5.41 (group A), 7.11 (group B), 9.19 (group C) years; climbing stairs ability at 6.24 (group A), 7.99 (group B), 10,42 (group C) years, and ambulation at 7.10 (group A), 9.25 (group B), 12.01 (group C) years. Pulmonary growth stopped at 10.26 (group A), 12.45 (group B), 14.58 (group C) years. Then, forced vital capacity decreased at the rate of 8.83 (group A), 7.52 (group B), 6.03 (group C) percent per year. Phenotypic variability did not rely on specific mutational spectrum. CONCLUSION Beside the most common form of DMD (group B), we provide detailed description on two extreme clinical subgroups: a severe one (group A) characterized by early severe motor and respiratory decline and a milder subgroup (group C). Compared to group B or C, four to six times fewer patients from group A are needed to detect the same decrease in disease progression in a clinical trial.
Collapse
|
21
|
Li Z, Liu X, Wen J, Xu Y, Zhao X, Li X, Liu L, Zhang X. DRUMS: a human disease related unique gene mutation search engine. Hum Mutat 2012; 32:E2259-65. [PMID: 21913285 DOI: 10.1002/humu.21556] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
With the completion of the human genome project and the development of new methods for gene variant detection, the integration of mutation data and its phenotypic consequences has become more important than ever. Among all available resources, locus-specific databases (LSDBs) curate one or more specific genes' mutation data along with high-quality phenotypes. Although some genotype-phenotype data from LSDB have been integrated into central databases little effort has been made to integrate all these data by a search engine approach. In this work, we have developed disease related unique gene mutation search engine (DRUMS), a search engine for human disease related unique gene mutation as a convenient tool for biologists or physicians to retrieve gene variant and related phenotype information. Gene variant and phenotype information were stored in a gene-centred relational database. Moreover, the relationships between mutations and diseases were indexed by the uniform resource identifier from LSDB, or another central database. By querying DRUMS, users can access the most popular mutation databases under one interface. DRUMS could be treated as a domain specific search engine. By using web crawling, indexing, and searching technologies, it provides a competitively efficient interface for searching and retrieving mutation data and their relationships to diseases. The present system is freely accessible at http://www.scbit.org/glif/new/drums/index.html.
Collapse
Affiliation(s)
- Zuofeng Li
- School of Life Sciences and Technology, Tongji University, Shanghai, China
| | | | | | | | | | | | | | | |
Collapse
|
22
|
Capriotti E, Nehrt NL, Kann MG, Bromberg Y. Bioinformatics for personal genome interpretation. Brief Bioinform 2012; 13:495-512. [PMID: 22247263 DOI: 10.1093/bib/bbr070] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
An international consortium released the first draft sequence of the human genome 10 years ago. Although the analysis of this data has suggested the genetic underpinnings of many diseases, we have not yet been able to fully quantify the relationship between genotype and phenotype. Thus, a major current effort of the scientific community focuses on evaluating individual predispositions to specific phenotypic traits given their genetic backgrounds. Many resources aim to identify and annotate the specific genes responsible for the observed phenotypes. Some of these use intra-species genetic variability as a means for better understanding this relationship. In addition, several online resources are now dedicated to collecting single nucleotide variants and other types of variants, and annotating their functional effects and associations with phenotypic traits. This information has enabled researchers to develop bioinformatics tools to analyze the rapidly increasing amount of newly extracted variation data and to predict the effect of uncharacterized variants. In this work, we review the most important developments in the field--the databases and bioinformatics tools that will be of utmost importance in our concerted effort to interpret the human variome.
Collapse
Affiliation(s)
- Emidio Capriotti
- Department of Mathematics and Computer Science, University of Balearic Islands, ctra. de Valldemossa Km 7.5, Palma de Mallorca, 07122 Spain.
| | | | | | | |
Collapse
|
23
|
Vihinen M, den Dunnen JT, Dalgleish R, Cotton RGH. Guidelines for establishing locus specific databases. Hum Mutat 2011; 33:298-305. [PMID: 22052659 DOI: 10.1002/humu.21646] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2011] [Accepted: 10/25/2011] [Indexed: 11/06/2022]
Abstract
Information about genetic variation has been collected for some 20 years into registries, known as locus specific databases (LSDBs), which nowadays often contain information in addition to the actual genetic variation. Several issues have to be taken into account when considering establishing and maintaining LSDBs and these have been discussed previously in a number of articles describing guidelines and recommendations. This information is widely scattered and, for a newcomer, it would be difficult to obtain the latest information and guidance. Here, a sequence of steps essential for establishing an LSDB is discussed together with guidelines for each step. Curators need to collect information from various sources, code it in systematic way, and distribute to the research and clinical communities. In doing this, ethical issues have to be taken into account. To facilitate integration of information to, for example, analyze genotype-phenotype correlations, systematic data representation using established nomenclatures, data models, and ontologies is essential. LSDB curation and maintenance comprises a number of tasks that can be managed by following logical steps. These resources are becoming ever more important and new curators are essential to ensure that we will have expertly curated databases for all disease-related genes in the near future.
Collapse
Affiliation(s)
- Mauno Vihinen
- Institute of Biomedical Technology, University of Tampere, Finland.
| | | | | | | |
Collapse
|
24
|
Celli J, Dalgleish R, Vihinen M, Taschner PEM, den Dunnen JT. Curating gene variant databases (LSDBs): toward a universal standard. Hum Mutat 2011; 33:291-7. [PMID: 21990126 DOI: 10.1002/humu.21626] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Accepted: 09/21/2011] [Indexed: 01/27/2023]
Abstract
Gene variant databases or Locus-Specific DataBases (LSDBs) are used to collect and display information on sequence variants on a gene-by-gene basis. Their most frequent use is in relation to DNA-based diagnostics, giving clinicians and scientists easy access to an up-to-date overview of all gene variants identified worldwide and whether they influence the function of the gene ("pathogenic or not"). While literature on gene variant databases is extensive, little has been published on the process of database curation itself. Based on our extensive experience as LSDB curators and our contributions to database curation courses, we discuss the subject of database curation. We describe the tasks involved, the steps to take, and the issues that might occur. Our overview is a first step toward establishing overall guidelines for database curation and ultimately covers one aspect of establishing quality-assured gene variant databases.
Collapse
Affiliation(s)
- Jacopo Celli
- Human and Clinical Genetics, Leiden University Medical Center, Leiden, Netherlands
| | | | | | | | | |
Collapse
|
25
|
Lopes P, Dalgleish R, Oliveira JL. WAVe: web analysis of the variome. Hum Mutat 2011; 32:729-34. [DOI: 10.1002/humu.21499] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2010] [Accepted: 03/01/2011] [Indexed: 01/17/2023]
|
26
|
Webb AJ, Thorisson GA, Brookes AJ. An informatics project and online "Knowledge Centre" supporting modern genotype-to-phenotype research. Hum Mutat 2011; 32:543-50. [PMID: 21438073 DOI: 10.1002/humu.21469] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2011] [Accepted: 01/28/2011] [Indexed: 11/06/2022]
Abstract
Explosive growth in the generation of genotype-to-phenotype (G2P) data necessitates a concerted effort to tackle the logistical and informatics challenges this presents. The GEN2PHEN Project represents one such effort, with a broad strategy of uniting disparate G2P resources into a hybrid centralized-federated network. This is achieved through a holistic strategy focussed on three overlapping areas: data input standards and pipelines through which to submit and collect data (data in); federated, independent, extendable, yet interoperable database platforms on which to store and curate widely diverse datasets (data storage); and data formats and mechanisms with which to exchange, combine, and extract data (data exchange and output). To fully leverage this data network, we have constructed the "G2P Knowledge Centre" (http://www.gen2phen.org). This central platform provides holistic searching of the G2P data domain allied with facilities for data annotation and user feedback, access to extensive G2P and informatics resources, and tools for constructing online working communities centered on the G2P domain. Through the efforts of GEN2PHEN, and through combining data with broader community-derived knowledge, the Knowledge Centre opens up exciting possibilities for organizing, integrating, sharing, and interpreting new waves of G2P data in a collaborative fashion.
Collapse
Affiliation(s)
- Adam J Webb
- Department of Genetics, University of Leicester, University Road, Leicester, United Kingdom.
| | | | | | | |
Collapse
|
27
|
Ullman-Cullere MH, Mathew JP. Emerging landscape of genomics in the Electronic Health Record for personalized medicine. Hum Mutat 2011; 32:512-6. [PMID: 21309042 DOI: 10.1002/humu.21456] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2010] [Accepted: 01/19/2011] [Indexed: 11/11/2022]
Abstract
The Information Technology (IT) roadmap for personalized medicine requires Electronic Health Records (EHRs), extension of Healthcare IT (HIT) standards, and understanding of how genetics/genomics should be integrated into the clinical applications. For reduced overall costs and development times, these three initiatives should run in parallel. EHRs must contain structured data and infrastructure that enables quality analysis, Clinical Decision Support (CDS) and messaging within the healthcare information network. Fortunately, as a result of sustained financial commitment to nongenetic-based healthcare, the industry has HIT data standards and understanding of EHR functionality that improves patient safety and outcomes while reducing overall healthcare costs. However, the HIT standards and EHR functional requirements, needed for personalized medicine, are only beginning to support simple genetic tests and need significant extension. In addition, our understanding of the clinical implications of genomic data is evolving and translation of new discovery into clinical care remains a challenge. Therefore, priority areas include CDS, educational resources, and knowledgebases for the EHR, clinical and research data warehouses, messaging frameworks, and continued review of healthcare policies and regulations supporting personalized medicine. Where core infrastructure remains to be developed and implemented, funding is needed for pilot projects, data standards, policy, and stakeholder collaboration.
Collapse
Affiliation(s)
- Mollie H Ullman-Cullere
- Clinical and Translational Informatics, Dana-Farber Cancer Institute, Boston, Massachusetts 02115-6084, USA.
| | | |
Collapse
|
28
|
Georgitsi M, Viennas E, Gkantouna V, Christodoulopoulou E, Zagoriti Z, Tafrali C, Ntellos F, Giannakopoulou O, Boulakou A, Vlahopoulou P, Kyriacou E, Tsaknakis J, Tsakalidis A, Poulas K, Tzimas G, Patrinos GP. Population-specific documentation of pharmacogenomic markers and their allelic frequencies in FINDbase. Pharmacogenomics 2011; 12:49-58. [DOI: 10.2217/pgs.10.169] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Aims: Population and ethnic group-specific allele frequencies of pharmacogenomic markers are poorly documented and not systematically collected in structured data repositories. We developed the Frequency of Inherited Disorders Pharmacogenomics database (FINDbase-PGx), a separate module of the FINDbase, aiming to systematically document pharmacogenomic allele frequencies in various populations and ethnic groups worldwide. Materials & methods: We critically collected and curated 214 scientific articles reporting pharmacogenomic markers allele frequencies in various populations and ethnic groups worldwide. Subsequently, in order to host the curated data, support data visualization and data mining, we developed a website application, utilizing Microsoft™ PivotViewer software. Results: Curated allelic frequency data pertaining to 144 pharmacogenomic markers across 14 genes, representing approximately 87,000 individuals from 150 populations worldwide, are currently included in FINDbase-PGx. A user-friendly query interface allows for easy data querying, based on numerous content criteria, such as population, ethnic group, geographical region, gene, drug and rare allele frequency. Conclusion: FINDbase-PGx is a comprehensive database, which, unlike other pharmacogenomic knowledgebases, fulfills the much needed requirement to systematically document pharmacogenomic allelic frequencies in various populations and ethnic groups worldwide.
Collapse
Affiliation(s)
- Marianthi Georgitsi
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Emmanouil Viennas
- Department of Computer Engineering & Informatics, Faculty of Engineering, University of Patras, Patras, Greece
| | - Vassiliki Gkantouna
- Department of Computer Engineering & Informatics, Faculty of Engineering, University of Patras, Patras, Greece
| | | | - Zoi Zagoriti
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Christina Tafrali
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Fotios Ntellos
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Olga Giannakopoulou
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Athanassia Boulakou
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Panagiota Vlahopoulou
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Eva Kyriacou
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - John Tsaknakis
- Department of Computer Engineering & Informatics, Faculty of Engineering, University of Patras, Patras, Greece
| | - Athanassios Tsakalidis
- Department of Computer Engineering & Informatics, Faculty of Engineering, University of Patras, Patras, Greece
| | - Konstantinos Poulas
- Department of Pharmacy, School of Health Sciences, University of Patras, Patras, Greece
| | - Giannis Tzimas
- Department of Computer Engineering & Informatics, Faculty of Engineering, University of Patras, Patras, Greece
| | | |
Collapse
|
29
|
Georgitsi M, Viennas E, Antoniou DI, Gkantouna V, van Baal S, Petricoin EF, Poulas K, Tzimas G, Patrinos GP. FINDbase: a worldwide database for genetic variation allele frequencies updated. Nucleic Acids Res 2010; 39:D926-32. [PMID: 21113021 PMCID: PMC3013745 DOI: 10.1093/nar/gkq1236] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Frequency of INherited Disorders database (FIND base; http://www.findbase.org) records frequencies of causative genetic variations worldwide. Database records include the population and ethnic group or geographical region, the disorder name and the related gene, accompanied by links to any related external resources and the genetic variation together with its frequency in that population. In addition to the regular data content updates, we report the following significant advances: (i) the systematic collection and thorough documentation of population/ethnic group-specific pharmacogenomic markers allele frequencies for 144 markers in 14 genes of pharmacogenomic interest from different classes of drug-metabolizing enzymes and transporters, representing 150 populations and ethnic groups worldwide; (ii) the development of new data querying and visualization tools in the expanded FINDbase data collection, built around Microsoft's PivotViewer software (http://www.getpivot.com), based on Microsoft Silverlight technology (http://www.silverlight.net) that facilitates querying of large data sets and visualizing the results; and (iii) the establishment of the first database journal, by affiliating FINDbase with Human Genomics and Proteomics, a new open-access scientific journal, which would serve as a prime example of a non-profit model for sustainable database funding.
Collapse
Affiliation(s)
- Marianthi Georgitsi
- Department of Pharmacy, School of Health Sciences, Faculty of Engineering, University of Patras, Patras, Greece
| | | | | | | | | | | | | | | | | |
Collapse
|