1
|
CAGI, the Critical Assessment of Genome Interpretation, establishes progress and prospects for computational genetic variant interpretation methods. Genome Biol 2024; 25:53. [PMID: 38389099 PMCID: PMC10882881 DOI: 10.1186/s13059-023-03113-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2023] [Accepted: 11/17/2023] [Indexed: 02/24/2024] Open
Abstract
BACKGROUND The Critical Assessment of Genome Interpretation (CAGI) aims to advance the state-of-the-art for computational prediction of genetic variant impact, particularly where relevant to disease. The five complete editions of the CAGI community experiment comprised 50 challenges, in which participants made blind predictions of phenotypes from genetic data, and these were evaluated by independent assessors. RESULTS Performance was particularly strong for clinical pathogenic variants, including some difficult-to-diagnose cases, and extends to interpretation of cancer-related variants. Missense variant interpretation methods were able to estimate biochemical effects with increasing accuracy. Assessment of methods for regulatory variants and complex trait disease risk was less definitive and indicates performance potentially suitable for auxiliary use in the clinic. CONCLUSIONS Results show that while current methods are imperfect, they have major utility for research and clinical applications. Emerging methods and increasingly large, robust datasets for training and assessment promise further progress ahead.
Collapse
|
2
|
Enabling the clinical application of artificial intelligence in genomics: a perspective of the AMIA Genomics and Translational Bioinformatics Workgroup. J Am Med Inform Assoc 2024; 31:536-541. [PMID: 38037121 PMCID: PMC10797281 DOI: 10.1093/jamia/ocad211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 10/09/2023] [Accepted: 10/26/2023] [Indexed: 12/02/2023] Open
Abstract
OBJECTIVE Given the importance AI in genomics and its potential impact on human health, the American Medical Informatics Association-Genomics and Translational Biomedical Informatics (GenTBI) Workgroup developed this assessment of factors that can further enable the clinical application of AI in this space. PROCESS A list of relevant factors was developed through GenTBI workgroup discussions in multiple in-person and online meetings, along with review of pertinent publications. This list was then summarized and reviewed to achieve consensus among the group members. CONCLUSIONS Substantial informatics research and development are needed to fully realize the clinical potential of such technologies. The development of larger datasets is crucial to emulating the success AI is achieving in other domains. It is important that AI methods do not exacerbate existing socio-economic, racial, and ethnic disparities. Genomic data standards are critical to effectively scale such technologies across institutions. With so much uncertainty, complexity and novelty in genomics and medicine, and with an evolving regulatory environment, the current focus should be on using these technologies in an interface with clinicians that emphasizes the value each brings to clinical decision-making.
Collapse
|
3
|
HerediGene Population Study IT infrastructure: A model to support genomic research recruitment and precision public health. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2024; 2023:689-698. [PMID: 38222332 PMCID: PMC10785925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
The HerediGene Population Study is a large research study focused on identifying new genetic biomarkers for disease prevention, diagnosis, prognosis, and development of new therapeutics. A substantial IT infrastructure evolved to reach enrollment targets and return results to participants. More than 170,000 participants have been enrolled in the study to date, with 5.87% of those whole genome sequenced and 0.46% of those genotyped harboring pathogenic variants. Among other purposes, this infrastructure supports: (1) identifying candidates from clinical criteria, (2) monitoring for qualifying clinical events (e.g., blood draw), (3) contacting candidates, (4) obtaining consent electronically, (5) initiating lab orders, (6) integrating consent and lab orders into clinical workflow, (7) de-identifying samples and clinical data, (8) shipping/transmitting samples and clinical data, (9) genotyping/sequencing samples, (10) and re-identifying and returning results for participants where applicable. This study may serve as a model for similar genomic research and precision public health initiatives.
Collapse
|
4
|
A resonant sextuplet of sub-Neptunes transiting the bright star HD 110067. Nature 2023; 623:932-937. [PMID: 38030780 DOI: 10.1038/s41586-023-06692-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 09/28/2023] [Indexed: 12/01/2023]
Abstract
Planets with radii between that of the Earth and Neptune (hereafter referred to as 'sub-Neptunes') are found in close-in orbits around more than half of all Sun-like stars1,2. However, their composition, formation and evolution remain poorly understood3. The study of multiplanetary systems offers an opportunity to investigate the outcomes of planet formation and evolution while controlling for initial conditions and environment. Those in resonance (with their orbital periods related by a ratio of small integers) are particularly valuable because they imply a system architecture practically unchanged since its birth. Here we present the observations of six transiting planets around the bright nearby star HD 110067. We find that the planets follow a chain of resonant orbits. A dynamical study of the innermost planet triplet allowed the prediction and later confirmation of the orbits of the rest of the planets in the system. The six planets are found to be sub-Neptunes with radii ranging from 1.94R⊕ to 2.85R⊕. Three of the planets have measured masses, yielding low bulk densities that suggest the presence of large hydrogen-dominated atmospheres.
Collapse
|
5
|
Repurposing Normal Chromosomal Microarray Data to Harbor Genetic Insights into Congenital Heart Disease. BIOLOGY 2023; 12:1290. [PMID: 37887000 PMCID: PMC10604103 DOI: 10.3390/biology12101290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 09/08/2023] [Accepted: 09/08/2023] [Indexed: 10/28/2023]
Abstract
About 15% of congenital heart disease (CHD) patients have a known pathogenic copy number variant. The majority of their chromosomal microarray (CMA) tests are deemed normal. Diagnostic interpretation typically ignores microdeletions smaller than 100 kb. We hypothesized that unreported microdeletions are enriched for CHD genes. We analyzed "normal" CMAs of 1762 patients who were evaluated at a pediatric referral center, of which 319 (18%) had CHD. Using CMAs from monozygotic twins or replicates from the same individual, we established a size threshold based on probe count for the reproducible detection of small microdeletions. Genes in the microdeletions were sequentially filtered by their nominal association with a CHD diagnosis, the expression level in the fetal heart, and the deleteriousness of a loss-of-function mutation. The subsequent enrichment for CHD genes was assessed using the presence of known or potentially novel genes implicated by a large whole-exome sequencing study of CHD. The unreported microdeletions were modestly enriched for both known CHD genes and those of unknown significance identified using their de novo mutation in CHD patients. Our results show that readily available "normal" CMA data can be a fruitful resource for genetic discovery and that smaller deletions should receive more attention in clinical evaluation.
Collapse
|
6
|
Paving a pathway for large-scale utilization of genomics in precision medicine and population health. FRONTIERS IN SOCIOLOGY 2023; 8:1122488. [PMID: 37274607 PMCID: PMC10235789 DOI: 10.3389/fsoc.2023.1122488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 05/02/2023] [Indexed: 06/06/2023]
Abstract
Having worked with two large population sequencing initiatives, the separation between the potential for genomics in precision medicine and the current reality have become clear. To realize this potential requires workflows, policies, and technical architectures that are foreign to most healthcare systems. Many historical processes and regulatory barriers currently impede our progress. The future of precision medicine includes genomic data being widely available at the point of care with systems in place to manage its efficient utilization. To achieve such vision requires substantial changes in billing, reimbursement, and reporting as well as the development of new systemic and technical architectures within the healthcare system. Clinical geneticist roles will evolve into managing precision health frameworks and genetic counselors will serve crucial roles in both leading and supporting precision medicine through the implementation and maintenance of precision medicine architectures. Our current path has many obstacles that hold us back, leaving preventable deaths in the wake. Reengineering our healthcare systems to support genomics can have a major impact on patient outcomes and allow us to realize the long-sought promises of precision medicine.
Collapse
|
7
|
Quantifying the phenome-wide disease burden of obesity using electronic health records and genomics. Obesity (Silver Spring) 2022; 30:2477-2488. [PMID: 36372681 PMCID: PMC9691570 DOI: 10.1002/oby.23561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 07/08/2022] [Accepted: 07/11/2022] [Indexed: 11/15/2022]
Abstract
OBJECTIVE High BMI is associated with many comorbidities and mortality. This study aimed to elucidate the overall clinical risk of obesity using a genome- and phenome-wide approach. METHODS This study performed a phenome-wide association study of BMI using a clinical cohort of 736,726 adults. This was followed by genetic association studies using two separate cohorts: one consisting of 65,174 adults in the Electronic Medical Records and Genomics (eMERGE) Network and another with 405,432 participants in the UK Biobank. RESULTS Class 3 obesity was associated with 433 phenotypes, representing 59.3% of all billing codes in individuals with severe obesity. A genome-wide polygenic risk score for BMI, accounting for 7.5% of variance in BMI, was associated with 296 clinical diseases, including strong associations with type 2 diabetes, sleep apnea, hypertension, and chronic liver disease. In all three cohorts, 199 phenotypes were associated with class 3 obesity and polygenic risk for obesity, including novel associations such as increased risk of renal failure, venous insufficiency, and gastroesophageal reflux. CONCLUSIONS This combined genomic and phenomic systematic approach demonstrated that obesity has a strong genetic predisposition and is associated with a considerable burden of disease across all disease classes.
Collapse
|
8
|
The Development of an Infrastructure to Facilitate the Use of Whole Genome Sequencing for Population Health. J Pers Med 2022; 12:jpm12111867. [PMID: 36579594 PMCID: PMC9693138 DOI: 10.3390/jpm12111867] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/29/2022] [Accepted: 11/04/2022] [Indexed: 11/09/2022] Open
Abstract
The clinical use of genomic analysis has expanded rapidly resulting in an increased availability and utility of genomic information in clinical care. We have developed an infrastructure utilizing informatics tools and clinical processes to facilitate the use of whole genome sequencing data for population health management across the healthcare system. Our resulting framework scaled well to multiple clinical domains in both pediatric and adult care, although there were domain specific challenges that arose. Our infrastructure was complementary to existing clinical processes and well-received by care providers and patients. Informatics solutions were critical to the successful deployment and scaling of this program. Implementation of genomics at the scale of population health utilizes complicated technologies and processes that for many health systems are not supported by current information systems or in existing clinical workflows. To scale such a system requires a substantial clinical framework backed by informatics tools to facilitate the flow and management of data. Our work represents an early model that has been successful in scaling to 29 different genes with associated genetic conditions in four clinical domains. Work is ongoing to optimize informatics tools; and to identify best practices for translation to smaller healthcare systems.
Collapse
|
9
|
User testing of a diagnostic decision support system with machine-assisted chart review to facilitate clinical genomic diagnosis. BMJ Health Care Inform 2021; 28:bmjhci-2021-100331. [PMID: 33962988 PMCID: PMC8108675 DOI: 10.1136/bmjhci-2021-100331] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 04/01/2021] [Accepted: 04/20/2021] [Indexed: 11/21/2022] Open
Abstract
Objectives There is a need in clinical genomics for systems that assist in clinical diagnosis, analysis of genomic information and periodic reanalysis of results, and can use information from the electronic health record to do so. Such systems should be built using the concepts of human-centred design, fit within clinical workflows and provide solutions to priority problems. Methods We adapted a commercially available diagnostic decision support system (DDSS) to use extracted findings from a patient record and combine them with genomic variant information in the DDSS interface. Three representative patient cases were created in a simulated clinical environment for user testing. A semistructured interview guide was created to illuminate factors relevant to human factors in CDS design and organisational implementation. Results Six individuals completed the user testing process. Tester responses were positive and noted good fit with real-world clinical genetics workflow. Technical issues related to interface, interaction and design were minor and fixable. Testers suggested solving issues related to terminology and usability through training and infobuttons. Time savings was estimated at 30%–50% and additional uses such as in-house clinical variant analysis were suggested for increase fit with workflow and to further address priority problems. Conclusion This study provides preliminary evidence for usability, workflow fit, acceptability and implementation potential of a modified DDSS that includes machine-assisted chart review. Continued development and testing using principles from human-centred design and implementation science are necessary to improve technical functionality and acceptability for multiple stakeholders and organisational implementation potential to improve the genomic diagnosis process.
Collapse
|
10
|
Developing and Optimizing Innovative Tools to Address Familial Hypercholesterolemia Underdiagnosis: Identification Methods, Patient Activation, and Cascade Testing for Familial Hypercholesterolemia. CIRCULATION-GENOMIC AND PRECISION MEDICINE 2021; 14:e003120. [PMID: 33480803 PMCID: PMC7892261 DOI: 10.1161/circgen.120.003120] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Background: Familial hypercholesterolemia (FH) is the most common cardiovascular genetic disorder and, if left untreated, is associated with increased risk of premature atherosclerotic cardiovascular disease, the leading cause of preventable death in the United States. Although FH is common, fatal, and treatable, it is underdiagnosed and undertreated due to a lack of systematic methods to identify individuals with FH and limited uptake of cascade testing. Methods and Results: This mixed-method, multi-stage study will optimize, test, and implement innovative approaches for both FH identification and cascade testing in 3 aims. To improve identification of individuals with FH, in Aim 1, we will compare and refine automated phenotype-based and genomic approaches to identify individuals likely to have FH. To improve cascade testing uptake for at-risk individuals, in Aim 2, we will use a patient-centered design thinking process to optimize and develop novel, active family communication methods. Using a prospective, observational pragmatic trial, we will assess uptake and effectiveness of each family communication method on cascade testing. Guided by an implementation science framework, in Aim 3, we will develop a comprehensive guide to identify individuals with FH. Using the Conceptual Model for Implementation Research, we will evaluate implementation outcomes including feasibility, acceptability, and perceived sustainability as well as health outcomes related to the optimized methods and tools developed in Aims 1 and 2. Conclusions: Data generated from this study will address barriers and gaps in care related to underdiagnosis of FH by developing and optimizing tools to improve FH identification and cascade testing.
Collapse
|
11
|
Infant acid suppression use is associated with the development of eosinophilic esophagitis. Dis Esophagus 2020; 33:5874728. [PMID: 32696950 DOI: 10.1093/dote/doaa073] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 06/18/2020] [Accepted: 06/20/2020] [Indexed: 12/11/2022]
Abstract
Eosinophilic esophagitis (EoE) is an esophageal allergic inflammatory disorder often presenting with infant/toddler gastroesophageal reflux symptoms refractory to treatment, including acid suppression trials with histamine H2 antagonists and proton pump inhibitors. We propose to evaluate the impact of infant acid suppressant exposure in EoE. Geisinger's pediatric EoE cases were matched to controls (1:5 EoE case control ratio) using age, race, sex, and ages at other diagnoses of asthma, eczema, and environmental allergies, totaling 526 EoE cases and 2,630 controls. Comparisons between EoE cases and matched controls were tested with regard to rates of acid suppression use with H2 antagonists and PPIs during infancy. Our analyses found the use of acid suppression in infancy was positively associated with EoE: PPI (5.7% EoE cases vs. 1.6% controls; P < 0.0001), H2 antagonists (8.8% EoE cases vs. 4.5% controls; P < 0.0001). Additionally, analysis of EoE cases using acid suppression during infancy indicated a likelihood for the diagnosis with EoE at an earlier age. Early acid suppression use in infants is significantly associated with the diagnosis of EoE in childhood in this well-matched retrospective cohort study. The potential link warrants additional investigation. Our study further reinforces the evidence-based stewardship of acid suppressant use, especially in our most vulnerable populations.
Collapse
|
12
|
Abstract
BACKGROUND The extent to which obesity and genetics determine postoperative complications is incompletely understood. METHODS We performed a retrospective study using two population cohorts with electronic health record (EHR) data. The first included 736,726 adults with body mass index (BMI) recorded between 1990 and 2017 at Vanderbilt University Medical Center. The second cohort consisted of 65,174 individuals from 12 institutions contributing EHR and genome-wide genotyping data to the Electronic Medical Records and Genomics (eMERGE) Network. Pairwise logistic regression analyses were used to measure the association of BMI categories with postoperative complications derived from International Classification of Disease-9 codes, including postoperative infection, incisional hernia, and intestinal obstruction. A genetic risk score was constructed from 97 obesity-risk single-nucleotide polymorphisms for a Mendelian randomization study to determine the association of genetic risk of obesity on postoperative complications. Logistic regression analyses were adjusted for sex, age, site, and race/principal components. RESULTS Individuals with overweight or obese BMI (≥25 kg/m2) had increased risk of incisional hernia (odds ratio [OR] 1.7-5.5, p < 3.1 × 10-20), and people with obesity (BMI ≥ 30 kg/m2) had increased risk of postoperative infection (OR 1.2-2.3, p < 2.5 × 10-5). In the eMERGE cohort, genetically predicted BMI was associated with incisional hernia (OR 2.1 [95% CI 1.8-2.5], p = 1.4 × 10-6) and postoperative infection (OR 1.6 [95% CI 1.4-1.9], p = 3.1 × 10-6). Association findings were similar after limitation of the cohorts to those who underwent abdominal procedures. CONCLUSIONS Clinical and Mendelian randomization studies suggest that obesity, as measured by BMI, is associated with the development of postoperative incisional hernia and infection.
Collapse
|
13
|
Genomic Information for Clinicians in the Electronic Health Record: Lessons Learned From the Clinical Genome Resource Project and the Electronic Medical Records and Genomics Network. Front Genet 2019; 10:1059. [PMID: 31737042 PMCID: PMC6830110 DOI: 10.3389/fgene.2019.01059] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Accepted: 10/03/2019] [Indexed: 01/05/2023] Open
Abstract
Genomic knowledge is being translated into clinical care. To fully realize the value, it is critical to place credible information in the hands of clinicians in time to support clinical decision making. The electronic health record is an essential component of clinician workflow. Utilizing the electronic health record to present information to support the use of genomic medicine in clinical care to improve outcomes represents a tremendous opportunity. However, there are numerous barriers that prevent the effective use of the electronic health record for this purpose. The electronic health record working groups of the Electronic Medical Records and Genomics (eMERGE) Network and the Clinical Genome Resource (ClinGen) project, along with other groups, have been defining these barriers, to allow the development of solutions that can be tested using implementation pilots. In this paper, we present “lessons learned” from these efforts to inform future efforts leading to the development of effective and sustainable solutions that will support the realization of genomic medicine.
Collapse
|
14
|
Future directions for high-throughput splicing assays in precision medicine. Hum Mutat 2019; 40:1225-1234. [PMID: 31297895 DOI: 10.1002/humu.23866] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 07/02/2019] [Accepted: 07/06/2019] [Indexed: 11/12/2022]
Abstract
Classification of variants of unknown significance is a challenging technical problem in clinical genetics. As up to one-third of disease-causing mutations are thought to affect pre-mRNA splicing, it is important to accurately classify splicing mutations in patient sequencing data. Several consortia and healthcare systems have conducted large-scale patient sequencing studies, which discover novel variants faster than they can be classified. Here, we compare the advantages and limitations of several high-throughput splicing assays aimed at mitigating this bottleneck, and describe a data set of ~5,000 variants that we analyzed using our Massively Parallel Splicing Assay (MaPSy). The Critical Assessment of Genome Interpretation group (CAGI) organized a challenge, in which participants submitted machine learning models to predict the splicing effects of variants in this data set. We discuss the winning submission of the challenge (MMSplice) which outperformed existing software. Finally, we highlight methods to overcome the limitations of MaPSy and similar assays, such as tissue-specific splicing, the effect of surrounding sequence context, classifying intronic variants, synthesizing large exons, and amplifying complex libraries of minigene species. Further development of these assays will greatly benefit the field of clinical genetics, which lack high-throughput methods for variant interpretation.
Collapse
|
15
|
Comorbidity Characterization Among eMERGE Institutions: A Pilot Evaluation with the Johns Hopkins Adjusted Clinical Groups® System. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2019; 2019:145-152. [PMID: 31258966 PMCID: PMC6568092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Electronic health records (EHR) are valuable to define phenotype selection algorithms used to identify cohorts ofpatients for sequencing or genome wide association studies (GWAS). To date, the electronic medical records and genomics (eMERGE) network institutions have developed and applied such algorithms to identify cohorts with associated DNA samples used to discover new genetic associations. For complex diseases, there are benefits to stratifying cohorts using comorbidities in order to identify their genetic determinants. The objective of this study was to: (a) characterize comorbidities in a range of phenotype-selected cohorts using the Johns Hopkins Adjusted Clinical Groups® (ACG®) System, (b) assess the frequency of important comorbidities in three commonly studied GWAS phenotypes, and (c) compare the comorbidity characterization of cases and controls. Our analysis demonstrates a framework to characterize comorbidities using the ACG system and identified differences in mean chronic condition count among GWAS cases and controls. Thus, we believe there is great potential to use the ACG system to characterize comorbidities among genetic cohorts selected based on EHR phenotypes.
Collapse
|
16
|
Semantic integration of clinical laboratory tests from electronic health records for deep phenotyping and biomarker discovery. NPJ Digit Med 2019; 2:32. [PMID: 31119199 PMCID: PMC6527418 DOI: 10.1038/s41746-019-0110-4] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 04/18/2019] [Indexed: 12/22/2022] Open
Abstract
Electronic Health Record (EHR) systems typically define laboratory test results using the Laboratory Observation Identifier Names and Codes (LOINC) and can transmit them using Fast Healthcare Interoperability Resource (FHIR) standards. LOINC has not yet been semantically integrated with computational resources for phenotype analysis. Here, we provide a method for mapping LOINC-encoded laboratory test results transmitted in FHIR standards to Human Phenotype Ontology (HPO) terms. We annotated the medical implications of 2923 commonly used laboratory tests with HPO terms. Using these annotations, our software assesses laboratory test results and converts each result into an HPO term. We validated our approach with EHR data from 15,681 patients with respiratory complaints and identified known biomarkers for asthma. Finally, we provide a freely available SMART on FHIR application that can be used within EHR systems. Our approach allows readily available laboratory tests in EHR to be reused for deep phenotyping and exploits the hierarchical structure of HPO to integrate distinct tests that have comparable medical interpretations for association studies.
Collapse
|
17
|
A dust-enshrouded tidal disruption event with a resolved radio jet in a galaxy merger. Science 2018; 361:482-485. [PMID: 29903886 DOI: 10.1126/science.aao4669] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 06/01/2018] [Indexed: 11/02/2022]
Abstract
Tidal disruption events (TDEs) are transient flares produced when a star is ripped apart by the gravitational field of a supermassive black hole (SMBH). We have observed a transient source in the western nucleus of the merging galaxy pair Arp 299 that radiated >1.5 × 1052 erg at infrared and radio wavelengths but was not luminous at optical or x-ray wavelengths. We interpret this as a TDE with much of its emission reradiated at infrared wavelengths by dust. Efficient reprocessing by dense gas and dust may explain the difference between theoretical predictions and observed luminosities of TDEs. The radio observations resolve an expanding and decelerating jet, probing the jet formation and evolution around a SMBH.
Collapse
|
18
|
Abstract
Twenty years ago, chromosomal abnormalities were the only identifiable genetic causes of a small fraction of congenital heart defects (CHD). Today, a de novo or inherited genetic abnormality can be identified as pathogenic in one-third of cases. We refer to them here as monogenic causes, insofar as the genetic abnormality has a readily detectable, large effect. What explains the other two-thirds? This review considers a complex genetic basis. That is, a combination of genetic mutations or variants that individually may have little or no detectable effect contribute to the pathogenesis of a heart defect. Genes in the embryo that act directly in cardiac developmental pathways have received the most attention, but genes in the mother that establish the gestational milieu via pathways related to metabolism and aging also have an effect. A growing body of evidence highlights the pathogenic significance of genetic interactions in the embryo and maternal effects that have a genetic basis. The investigation of CHD as guided by a complex genetic model could help estimate risk more precisely and logically lead to a means of prevention.
Collapse
|
19
|
Astronomical algorithms for automated analysis of tissue protein expression in breast cancer. Br J Cancer 2013; 108:602-12. [PMID: 23329232 PMCID: PMC3593538 DOI: 10.1038/bjc.2012.558] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2012] [Revised: 11/15/2012] [Accepted: 11/19/2012] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND High-throughput evaluation of tissue biomarkers in oncology has been greatly accelerated by the widespread use of tissue microarrays (TMAs) and immunohistochemistry. Although TMAs have the potential to facilitate protein expression profiling on a scale to rival experiments of tumour transcriptomes, the bottleneck and imprecision of manually scoring TMAs has impeded progress. METHODS We report image analysis algorithms adapted from astronomy for the precise automated analysis of IHC in all subcellular compartments. The power of this technique is demonstrated using over 2000 breast tumours and comparing quantitative automated scores against manual assessment by pathologists. RESULTS All continuous automated scores showed good correlation with their corresponding ordinal manual scores. For oestrogen receptor (ER), the correlation was 0.82, P<0.0001, for BCL2 0.72, P<0.0001 and for HER2 0.62, P<0.0001. Automated scores showed excellent concordance with manual scores for the unsupervised assignment of cases to 'positive' or 'negative' categories with agreement rates of up to 96%. CONCLUSION The adaptation of astronomical algorithms coupled with their application to large annotated study cohorts, constitutes a powerful tool for the realisation of the enormous potential of digital pathology.
Collapse
|
20
|
Predicting the start week of respiratory syncytial virus outbreaks using real time weather variables. BMC Med Inform Decis Mak 2010; 10:68. [PMID: 21044325 PMCID: PMC2987968 DOI: 10.1186/1472-6947-10-68] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2010] [Accepted: 11/02/2010] [Indexed: 11/11/2022] Open
Abstract
Background Respiratory Syncytial Virus (RSV), a major cause of bronchiolitis, has a large impact on the census of pediatric hospitals during outbreak seasons. Reliable prediction of the week these outbreaks will start, based on readily available data, could help pediatric hospitals better prepare for large outbreaks. Methods Naïve Bayes (NB) classifier models were constructed using weather data from 1985-2008 considering only variables that are available in real time and that could be used to forecast the week in which an RSV outbreak will occur in Salt Lake County, Utah. Outbreak start dates were determined by a panel of experts using 32,509 records with ICD-9 coded RSV and bronchiolitis diagnoses from Intermountain Healthcare hospitals and clinics for the RSV seasons from 1985 to 2008. Results NB models predicted RSV outbreaks up to 3 weeks in advance with an estimated sensitivity of up to 67% and estimated specificities as high as 94% to 100%. Temperature and wind speed were the best overall predictors, but other weather variables also showed relevance depending on how far in advance the predictions were made. The weather conditions predictive of an RSV outbreak in our study were similar to those that lead to temperature inversions in the Salt Lake Valley. Conclusions We demonstrate that Naïve Bayes (NB) classifier models based on weather data available in real time have the potential to be used as effective predictive models. These models may be able to predict the week that an RSV outbreak will occur with clinical relevance. Their clinical usefulness will be field tested during the next five years.
Collapse
|
21
|
PathGrid: a service-orientated architecture for microscopy image analysis. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2010; 368:3937-3952. [PMID: 20643686 DOI: 10.1098/rsta.2010.0158] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
This paper describes 'PathGrid'--an analysis and data integration system, developed initially to meet the demands in the analysis of medical microscopy imaging data. An overview of the current system is given, describing the techniques used in developing the data handling infrastructure and the analysis algorithm development. The use of software created in the context of systems designed for the astronomy domain is noted, specifically infrastructure from the astronomy virtual observatory movement for data discovery, access and workflow management, and astronomical image analysis software adapted for the analysis of high-throughput astronomy imaging surveys. This paper notes the applicability of the techniques from the astronomy domain. The testbed infrastructure deployment is described, emphasizing its speed and ease of use and support. The validity of the analysis techniques is confirmed through the pilot study described here--with the application to a large sample of immunohistochemistry microscopy data obtained in part for assessing the oestrogen receptor status of breast cancers. The analysis showed that the specificity and sensitivity values for the automatic scoring using PathGrid were within the errors of those obtained via a 'gold standard' manual pathologist scoring.
Collapse
|