1
|
Seven years since the launch of the Matchmaker Exchange: The evolution of genomic matchmaking. Hum Mutat 2022; 43:659-667. [PMID: 35537081 PMCID: PMC9133175 DOI: 10.1002/humu.24373] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 03/22/2022] [Indexed: 11/09/2022]
Abstract
The Matchmaker Exchange (MME) was launched in 2015 to provide a robust mechanism to discover novel disease-gene relationships. It operates as a federated network connecting databases holding relevant data using a common application programming interface, where two or more users are looking for a match for the same gene (two-sided matchmaking). Seven years from its launch, it is clear that the MME is making outstanding contributions to understanding the morbid anatomy of the genome. The number of unique genes present across the MME has steadily increased over time; there are currently >13,520 unique genes (~68% of all protein-coding genes) connected across the MME's eight genomic matchmaking nodes, GeneMatcher, DECIPHER, PhenomeCentral, MyGene2, seqr, Initiative on Rare and Undiagnosed Disease, PatientMatcher, and the RD-Connect Genome-Phenome Analysis Platform. The collective data set accessible across the MME currently includes more than 120,000 cases from over 12,000 contributors in 98 countries. The discovery of potential new disease-gene relationships is happening daily and international collaborative teams are moving these advances forward to publication, now numbering well over 500. Expansion of data sharing into routine clinical practice by clinicians, genetic counselors, and clinical laboratories has ensured access to discovery for even more individuals with undiagnosed rare genetic diseases. Tens of thousands of patients and their family members have been directly or indirectly impacted by the discoveries facilitated by two-sided genomic matchmaking. MME supports further connections to the literature (PubCaseFinder) and to human and model organism resources (Monarch Initiative) and scientists (ModelMatcher). Efforts are now underway to explore additional approaches to matchmaking at the gene or variant level where there is only one querier (one-sided matchmaking). Genomic matchmaking has proven its utility over the past 7 years and will continue to facilitate discoveries in the years to come.
Collapse
|
2
|
PhenomeCentral: 7 years of rare disease matchmaking. Hum Mutat 2022; 43:674-681. [PMID: 35165961 DOI: 10.1002/humu.24348] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 02/04/2022] [Accepted: 02/08/2022] [Indexed: 11/08/2022]
Abstract
A major challenge in validating genetic causes for patients with rare diseases (RDs) is the difficulty in identifying other RD patients with overlapping phenotypes and variants in the same candidate gene. This process, known as matchmaking, requires robust data sharing solutions in order to be effective. In 2014 we launched PhenomeCentral, a RD data repository capable of collecting computer-readable genotypic and phenotypic data for the purposes of RD matchmaking. Over the past 7 years PhenomeCentral's features have been expanded and its dataset has consistently grown. There are currently 1,615 users registered on PhenomeCentral, which have contributed over 12,000 patient cases. Most of these cases contain detailed phenotypic terms, with a significant portion also providing genomic sequence data or other forms of clinical information. Matchmaking within PhenomeCentral, and with connections to other data repositories in the Matchmaker Exchange, have collectively resulted in over 60,000 matches, which have facilitated multiple gene discoveries. The collection of deep phenotypic and genotypic data has also positioned PhenomeCentral well to support next generation of matchmaking initiatives that utilize genome sequencing data, ensuring that PhenomeCentral will remain a useful tool in solving undiagnosed RD cases in the years to come. This article is protected by copyright. All rights reserved.
Collapse
|
3
|
Diagnostic testing laboratories are valuable partners for disease gene discovery: 5-year experience with GeneMatcher. Hum Mutat 2022; 43:772-781. [PMID: 35143109 PMCID: PMC9313781 DOI: 10.1002/humu.24342] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Revised: 02/01/2022] [Accepted: 02/07/2022] [Indexed: 12/01/2022]
Abstract
Although the rates of disease gene discovery have steadily increased with the expanding use of genome and exome sequencing by clinical and research laboratories, only ~16% of genes in the genome have confirmed disease associations. Here we describe our clinical laboratory's experience utilizing GeneMatcher, an online portal designed to promote disease gene discovery and data sharing. Since 2016, we submitted 246 candidates from 243 unique genes to GeneMatcher, of which 111 (45%) are now clinically characterized. Submissions meeting our candidate gene‐reporting criteria based on a scoring system using patient and molecular‐weighted evidence were significantly more likely to be characterized as of October 2021 versus genes that did not meet our clinical‐reporting criteria (p = 0.025). We reported relevant findings related to these newly characterized gene–disease associations in 477 probands. In 218 (46%) instances, we issued reclassifications after an initial negative or candidate gene (uncertain) report. We coauthored 104 publications delineating gene–disease relationships, including descriptions of new associations (60%), additional supportive evidence (13%), subsequent descriptive cohorts (23%), and phenotypic expansions (4%). Clinical laboratories are pivotal for disease gene discovery efforts and can screen phenotypes based on genotype matches, contact clinicians of relevant cases, and issue proactive reclassification reports.
Collapse
|
4
|
Advances in the development of PubCaseFinder, including the new application programming interface and matching algorithm. Hum Mutat 2022; 43:734-742. [PMID: 35143083 PMCID: PMC9305291 DOI: 10.1002/humu.24341] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 01/17/2022] [Accepted: 02/07/2022] [Indexed: 11/11/2022]
Abstract
Over 10,000 rare genetic diseases have been identified, and millions of newborns are affected by severe rare genetic diseases each year. A variety of Human Phenotype Ontology (HPO)-based clinical decision support systems (CDSS) and patient repositories have been developed to support clinicians in diagnosing patients with suspected rare genetic diseases. In September 2017, we released PubCaseFinder (https://pubcasefinder.dbcls.jp), a web-based CDSS that provides ranked lists of genetic and rare diseases using HPO-based phenotypic similarities, where top-listed diseases represent the most likely differential diagnosis. We also developed a Matchmaker Exchange (MME) application programming interface (API) to query PubCaseFinder, which has been adopted by several patient repositories. In this paper, we describe notable updates regarding PubCaseFinder, the GeneYenta matching algorithm implemented in PubCaseFinder, and the PubCaseFinder API. The updated GeneYenta matching algorithm improves the performance of the CDSS automated differential diagnosis function. Moreover, the updated PubCaseFinder and new API empower patient repositories participating in MME and medical professionals to actively use HPO-based resources. This article is protected by copyright. All rights reserved.
Collapse
|
5
|
Genomic Data Sharing for Novel Mendelian Disease Gene Discovery: The Matchmaker Exchange. Annu Rev Genomics Hum Genet 2020; 21:305-326. [PMID: 32339034 DOI: 10.1146/annurev-genom-083118-014915] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In the last decade, exome and/or genome sequencing has become a common test in the diagnosis of individuals with features of a rare Mendelian disorder. Despite its success, this test leaves the majority of tested individuals undiagnosed. This review describes the Matchmaker Exchange (MME), a federated network established to facilitate the solving of undiagnosed rare-disease cases through data sharing. MME supports genomic matchmaking, the act of connecting two or more parties looking for cases with similar phenotypes and variants in the same candidate genes. An application programming interface currently connects six matchmaker nodes-the Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources (DECIPHER), GeneMatcher, PhenomeCentral, seqr, MyGene2, and the Initiative on Rare and Undiagnosed Diseases (IRUD) Exchange-resulting in a collective data set spanning more than 150,000 cases from more than 11,000 contributors in 88 countries. Here, we describe the successes and challenges of MME, its individual matchmaking nodes, plans for growing the network, and considerations for future directions.
Collapse
|
6
|
matchbox: An open-source tool for patient matching via the Matchmaker Exchange. Hum Mutat 2018; 39:1827-1834. [PMID: 30240502 DOI: 10.1002/humu.23655] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2018] [Revised: 08/29/2018] [Accepted: 09/18/2018] [Indexed: 12/11/2022]
Abstract
Rare disease investigators constantly face challenges in identifying additional cases to build evidence for gene-disease causality. The Matchmaker Exchange (MME) addresses this limitation by providing a mechanism for matching patients across genomic centers via a federated network. The MME has revolutionized searching for additional cases by making it possible to query across institutional boundaries, so that what was once a laborious and manual process of contacting researchers is now automated and computable. However, while the MME network is beginning to scale, the growth of additional nodes is limited by the lack of easy-to-use solutions that can be implemented by any rare disease database owner, even one without significant software engineering resources. Here, we describe matchbox, which is an open-source, platform-independent, portable bridge between any given rare disease genomic center and the MME network, which has already led to novel gene discoveries. We also describe how matchbox greatly reduces the barrier to participation by overcoming challenges for new databases to join the MME.
Collapse
|
7
|
Abstract
Provision of a molecularly confirmed diagnosis in a timely manner for children and adults with rare genetic diseases shortens their "diagnostic odyssey," improves disease management, and fosters genetic counseling with respect to recurrence risks while assuring reproductive choices. In a general clinical genetics setting, the current diagnostic rate is approximately 50%, but for those who do not receive a molecular diagnosis after the initial genetics evaluation, that rate is much lower. Diagnostic success for these more challenging affected individuals depends to a large extent on progress in the discovery of genes associated with, and mechanisms underlying, rare diseases. Thus, continued research is required for moving toward a more complete catalog of disease-related genes and variants. The International Rare Diseases Research Consortium (IRDiRC) was established in 2011 to bring together researchers and organizations invested in rare disease research to develop a means of achieving molecular diagnosis for all rare diseases. Here, we review the current and future bottlenecks to gene discovery and suggest strategies for enabling progress in this regard. Each successful discovery will define potential diagnostic, preventive, and therapeutic opportunities for the corresponding rare disease, enabling precision medicine for this patient population.
Collapse
|
8
|
Mitigating false-positive associations in rare disease gene discovery. Hum Mutat 2016; 36:998-1003. [PMID: 26378430 DOI: 10.1002/humu.22847] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 07/19/2015] [Indexed: 11/09/2022]
Abstract
Clinical sequencing is expanding, but causal variants are still not identified in the majority of cases. These unsolved cases can aid in gene discovery when individuals with similar phenotypes are identified in systems such as the Matchmaker Exchange. We describe risks for gene discovery in this growing set of unsolved cases. In a set of rare disease cases with the same phenotype, it is not difficult to find two individuals with the same phenotype that carry variants in the same gene. We quantify the risk of false-positive association in a cohort of individuals with the same phenotype, using the prior probability of observing a variant in each gene from over 60,000 individuals (Exome Aggregation Consortium). Based on the number of individuals with a genic variant, cohort size, specific gene, and mode of inheritance, we calculate a P value that the match represents a true association. A match in two of 10 patients in MECP2 is statistically significant (P = 0.0014), whereas a match in TTN would not reach significance, as expected (P > 0.999). Finally, we analyze the probability of matching in clinical exome cases to estimate the number of cases needed to identify genes related to different disorders. We offer Rare Disease Match, an online tool to mitigate the uncertainty of false-positive associations.
Collapse
|
9
|
The Matchmaker Exchange: a platform for rare disease gene discovery. Hum Mutat 2016; 36:915-21. [PMID: 26295439 DOI: 10.1002/humu.22858] [Citation(s) in RCA: 340] [Impact Index Per Article: 42.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2015] [Accepted: 07/21/2015] [Indexed: 12/21/2022]
Abstract
There are few better examples of the need for data sharing than in the rare disease community, where patients, physicians, and researchers must search for "the needle in a haystack" to uncover rare, novel causes of disease within the genome. Impeding the pace of discovery has been the existence of many small siloed datasets within individual research or clinical laboratory databases and/or disease-specific organizations, hoping for serendipitous occasions when two distant investigators happen to learn they have a rare phenotype in common and can "match" these cases to build evidence for causality. However, serendipity has never proven to be a reliable or scalable approach in science. As such, the Matchmaker Exchange (MME) was launched to provide a robust and systematic approach to rare disease gene discovery through the creation of a federated network connecting databases of genotypes and rare phenotypes using a common application programming interface (API). The core building blocks of the MME have been defined and assembled. Three MME services have now been connected through the API and are available for community use. Additional databases that support internal matching are anticipated to join the MME network as it continues to grow.
Collapse
|
10
|
The Matchmaker Exchange API: automating patient matching through the exchange of structured phenotypic and genotypic profiles. Hum Mutat 2016; 36:922-7. [PMID: 26255989 DOI: 10.1002/humu.22850] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Accepted: 07/24/2015] [Indexed: 01/28/2023]
Abstract
Despite the increasing prevalence of clinical sequencing, the difficulty of identifying additional affected families is a key obstacle to solving many rare diseases. There may only be a handful of similar patients worldwide, and their data may be stored in diverse clinical and research databases. Computational methods are necessary to enable finding similar patients across the growing number of patient repositories and registries. We present the Matchmaker Exchange Application Programming Interface (MME API), a protocol and data format for exchanging phenotype and genotype profiles to enable matchmaking among patient databases, facilitate the identification of additional cohorts, and increase the rate with which rare diseases can be researched and diagnosed. We designed the API to be straightforward and flexible in order to simplify its adoption on a large number of data types and workflows. We also provide a public test data set, curated from the literature, to facilitate implementation of the API and development of new matching algorithms. The initial version of the API has been successfully implemented by three members of the Matchmaker Exchange and was immediately able to reproduce previously identified matches and generate several new leads currently being validated. The API is available at https://github.com/ga4gh/mme-apis.
Collapse
|
11
|
Use of model organism and disease databases to support matchmaking for human disease gene discovery. Hum Mutat 2015; 36:979-84. [PMID: 26269093 PMCID: PMC5473253 DOI: 10.1002/humu.22857] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Accepted: 07/22/2015] [Indexed: 11/10/2022]
Abstract
The Matchmaker Exchange application programming interface (API) allows searching a patient's genotypic or phenotypic profiles across clinical sites, for the purposes of cohort discovery and variant disease causal validation. This API can be used not only to search for matching patients, but also to match against public disease and model organism data. This public disease data enable matching known diseases and variant-phenotype associations using phenotype semantic similarity algorithms developed by the Monarch Initiative. The model data can provide additional evidence to aid diagnosis, suggest relevant models for disease mechanism and treatment exploration, and identify collaborators across the translational divide. The Monarch Initiative provides an implementation of this API for searching multiple integrated sources of data that contextualize the knowledge about any given patient or patient family into the greater biomedical knowledge landscape. While this corpus of data can aid diagnosis, it is also the beginning of research to improve understanding of rare human diseases.
Collapse
|
12
|
PhenomeCentral: a portal for phenotypic and genotypic matchmaking of patients with rare genetic diseases. Hum Mutat 2015; 36:931-40. [PMID: 26251998 DOI: 10.1002/humu.22851] [Citation(s) in RCA: 101] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Accepted: 07/28/2015] [Indexed: 01/18/2023]
Abstract
The discovery of disease-causing mutations typically requires confirmation of the variant or gene in multiple unrelated individuals, and a large number of rare genetic diseases remain unsolved due to difficulty identifying second families. To enable the secure sharing of case records by clinicians and rare disease scientists, we have developed the PhenomeCentral portal (https://phenomecentral.org). Each record includes a phenotypic description and relevant genetic information (exome or candidate genes). PhenomeCentral identifies similar patients in the database based on semantic similarity between clinical features, automatically prioritized genes from whole-exome data, and candidate genes entered by the users, enabling both hypothesis-free and hypothesis-driven matchmaking. Users can then contact other submitters to follow up on promising matches. PhenomeCentral incorporates data for over 1,000 patients with rare genetic diseases, contributed by the FORGE and Care4Rare Canada projects, the US NIH Undiagnosed Diseases Program, the EU Neuromics and ANDDIrare projects, as well as numerous independent clinicians and scientists. Though the majority of these records have associated exome data, most lack a molecular diagnosis. PhenomeCentral has already been used to identify causative mutations for several patients, and its ability to find matching patients and diagnose these diseases will grow with each additional patient that is entered.
Collapse
|
13
|
Participant-driven matchmaking in the genomic era. Hum Mutat 2015; 36:965-73. [PMID: 26252162 DOI: 10.1002/humu.22852] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Accepted: 07/15/2015] [Indexed: 01/16/2023]
Abstract
Whole-genome and whole-exome sequencing are increasingly useful diagnostic tools for novel monogenic conditions. In order to confirm diagnoses made using these technologies, genomic matchmaking-the matching of cases with similar phenotypic and/or genotypic profiles, to narrow the number of candidate genes or ascertain a condition's etiology with greater certainty-is essential. Yet, due to current limitations on the size of matchmaking networks and data sets available to support them, identifying a match can be difficult. We argue that matchmaking efforts led by affected individuals and their families-participant-led efforts-offer a twofold solution to this need, in that participants both have the capacity to access larger networks and to provide more detailed sets of phenotypic and genotypic data. These features of participant-led efforts have the potential to increase the value of matchmaking networks, both in terms of number of matches and in terms of the overall energy of the network. We provide two examples of participant-led matchmaking, and propose a model for scaling these efforts.
Collapse
|
14
|
Abstract
The Undiagnosed Diseases Network (UDN) builds on the successes of the Undiagnosed Diseases Program at the National Institutes of Health (NIH UDP). Through support from the NIH Common Fund, a coordinating center, six additional clinical sites, and two sequencing cores comprise the UDN. The objectives of the UDN are to: (1) improve the level of diagnosis and care for patients with undiagnosed diseases through the development of common protocols designed by an enlarged community of investigators across the network; (2) facilitate research into the etiology of undiagnosed diseases, by collecting and sharing standardized, high-quality clinical and laboratory data including genotyping, phenotyping, and environmental exposure data; and (3) create an integrated and collaborative research community across multiple clinical sites, and among laboratory and clinical investigators, to investigate the pathophysiology of these rare diseases and to identify options for patient management. Broad-based data sharing is at the core of achieving these objectives, and the UDN is establishing the policies and governance structure to support broad data sharing.
Collapse
|
15
|
Cafe Variome: general-purpose software for making genotype-phenotype data discoverable in restricted or open access contexts. Hum Mutat 2015. [PMID: 26224250 DOI: 10.1002/humu.22841] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Biomedical data sharing is desirable, but problematic. Data "discovery" approaches-which establish the existence rather than the substance of data-precisely connect data owners with data seekers, and thereby promote data sharing. Cafe Variome (http://www.cafevariome.org) was therefore designed to provide a general-purpose, Web-based, data discovery tool that can be quickly installed by any genotype-phenotype data owner, or network of data owners, to make safe or sensitive content appropriately discoverable. Data fields or content of any type can be accommodated, from simple ID and label fields through to extensive genotype and phenotype details based on ontologies. The system provides a "shop window" in front of data, with main interfaces being a simple search box and a powerful "query-builder" that enable very elaborate queries to be formulated. After a successful search, counts of records are reported grouped by "openAccess" (data may be directly accessed), "linkedAccess" (a source link is provided), and "restrictedAccess" (facilitated data requests and subsequent provision of approved records). An administrator interface provides a wide range of options for system configuration, enabling highly customized single-site or federated networks to be established. Current uses include rare disease data discovery, patient matchmaking, and a Beacon Web service.
Collapse
|
16
|
Matching two independent cohorts validates DPH1 as a gene responsible for autosomal recessive intellectual disability with short stature, craniofacial, and ectodermal anomalies. Hum Mutat 2015. [PMID: 26220823 DOI: 10.1002/humu.22843] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Recently, Alazami et al. (2015) identified 33 putative candidate disease genes for neurogenetic disorders. One such gene was DPH1, in which a homozygous missense mutation was associated with a 3C syndrome-like phenotype in four patients from a single extended family. Here, we report a second homozygous missense variant in DPH1, seen in four members of a founder population, and associated with a phenotype initially reminiscent of Sensenbrenner syndrome. This postpublication "match" validates DPH1 as a gene underlying syndromic intellectual disability with short stature and craniofacial and ectodermal anomalies, reminiscent of, but distinct from, 3C and Sensenbrenner syndromes. This validation took several years after the independent discoveries due to the absence of effective methods for sharing both candidate phenotype and genotype data between investigators. Sharing of data via Web-based anonymous data exchange servers will play an increasingly important role toward more efficient identification of the molecular basis for rare Mendelian disorders.
Collapse
|
17
|
Abstract
Here, we describe an overview and update on GeneMatcher (http://www.genematcher.org), a freely accessible Web-based tool developed as part of the Baylor-Hopkins Center for Mendelian Genomics. We created GeneMatcher with the goal of identifying additional individuals with rare phenotypes who had variants in the same candidate disease gene. We also wanted to facilitate connections to basic scientists working on orthologous genes in model systems with the goal of connecting their work to human Mendelian phenotypes. Meeting these goals will enhance the identification of novel Mendelian genes. Launched in September, 2013, GeneMatcher now has 2,178 candidate genes from 486 submitters spread across 38 countries entered in the database (June 1, 2015). GeneMatcher is also part of the Matchmaker Exchange (http://matchmakerexchange.org/) with an Application Programing Interface enabling submitters to query other databases of genetic variants and phenotypes without having to create accounts and data entries in multiple systems.
Collapse
|