1
|
Zaks N, Mahjani B, Reichenberg A, Birnbaum R. CLINICAL AND COGNITIVE PHENOTYPING OF COPY NUMBER VARIANTS PATHOGENIC FOR NEURODEVELOPMENTAL DISORDERS FROM A MULTI-ANCESTRY BIOBANK. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.07.16.24310489. [PMID: 39072027 PMCID: PMC11275656 DOI: 10.1101/2024.07.16.24310489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Background Rare copy number variants (CNVs) are pathogenic for neurodevelopmental disorders (NDDs) and effect neurocognitive impairment. In aggregate, NDD CNVs may present in up to 2% of population cohorts with implications for neuropsychiatric disease risk and cognitive health. However, analyses of NDD CNVs in biobanks or population cohorts have been hindered by limited clinical or cognitive phenotypes, and a lack of ancestral diversity. In the current proof-of-concept study, NDD CNV carriers were recontacted from BioMe, a multi-ancestry biobank derived from the Mount Sinai healthcare system, to enable 'deep phenotyping' beyond electronic health record outcomes. Methods From BioMe biobank, 892 adult participants were recontacted, including 335 harboring NDD CNVs, 217 with schizophrenia and 340 neurotypical controls as comparators. Clinical and cognitive assessments were administered to each recruited participant. Results Seventy-three participants completed study assessments (mean age=48.8 years; 66% female; 36% African, 26% European, 34% Hispanic), or 8% of the recontacted subset, including 30 NDD CNV carriers across 15 loci. Among NDD CNV carriers, assessments indicated 40% with mood and anxiety disorders, 30% with learning disorders, and 13% with a history of special education. NDD CNV carriers were significantly cognitively impaired compared to controls on digit span backwards (Beta=-1.76, FDR=0.04) and digit span sequencing (Beta=-2.01, FDR=0.04). Conclusions Feasibility of "recall-by-genotype" from a multi-ancestry biobank was established for NDD CNV carriers, along with comparator groups. The current study corroborated past reports of NDD CNVs effects of cognitive impairment, while elucidating clinical phenotypes for recalled individuals. Future "recall-by-genotype" studies may further facilitate clinical characterization of disease-relevant genomic variants.
Collapse
Affiliation(s)
- Nina Zaks
- Department of Child and Adolescent Psychiatry, NYU Langone Health
| | - Behrang Mahjani
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai
| | - Abraham Reichenberg
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai
- Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai
| | - Rebecca Birnbaum
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai
- Department of Genetic and Genomics Sciences, Icahn School of Medicine at Mount Sinai
| |
Collapse
|
2
|
Breeyear JH, Mautz BS, Keaton JM, Hellwege JN, Torstenson ES, Liang J, Bray MJ, Giri A, Warren HR, Munroe PB, Velez Edwards DR, Zhu X, Li C, Edwards TL. A new test for trait mean and variance detects unreported loci for blood-pressure variation. Am J Hum Genet 2024; 111:954-965. [PMID: 38614075 PMCID: PMC11080606 DOI: 10.1016/j.ajhg.2024.03.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 03/20/2024] [Accepted: 03/22/2024] [Indexed: 04/15/2024] Open
Abstract
Variability in quantitative traits has clinical, ecological, and evolutionary significance. Most genetic variants identified for complex quantitative traits have only a detectable effect on the mean of trait. We have developed the mean-variance test (MVtest) to simultaneously model the mean and log-variance of a quantitative trait as functions of genotypes and covariates by using estimating equations. The advantages of MVtest include the facts that it can detect effect modification, that multiple testing can follow conventional thresholds, that it is robust to non-normal outcomes, and that association statistics can be meta-analyzed. In simulations, we show control of type I error of MVtest over several alternatives. We identified 51 and 37 previously unreported associations for effects on blood-pressure variance and mean, respectively, in the UK Biobank. Transcriptome-wide association studies revealed 633 significant unique gene associations with blood-pressure mean variance. MVtest is broadly applicable to studies of complex quantitative traits and provides an important opportunity to detect novel loci.
Collapse
Affiliation(s)
- Joseph H Breeyear
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Brian S Mautz
- Population Analytics and Insights, Data Sciences, Janssen Research and Development, Spring House, PA, USA
| | - Jacob M Keaton
- Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jacklyn N Hellwege
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Eric S Torstenson
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Jingjing Liang
- Department of Pharmacy Practice and Science, University of Arizona, Tucson, AZ, USA
| | - Michael J Bray
- Department of Maternal and Fetal Medicine, Orlando Health, Orlando, FL, USA; Genetic Counseling Program, Bay Path University, Longmeadow, MA, USA
| | - Ayush Giri
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA; Division of Quantitative Sciences, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Helen R Warren
- Center of Clinical Pharmacology and Precision Medicine, Queen Mary University, London, England
| | - Patricia B Munroe
- Center of Clinical Pharmacology and Precision Medicine, Queen Mary University, London, England
| | - Digna R Velez Edwards
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Quantitative Sciences, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Xiaofeng Zhu
- Department of Epidemiology and Biostatistics, Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | - Chun Li
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA
| | - Todd L Edwards
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
| |
Collapse
|
3
|
Yan C, Ong HH, Grabowska ME, Krantz MS, Su WC, Dickson AL, Peterson JF, Feng Q, Roden DM, Stein CM, Kerchberger VE, Malin BA, Wei WQ. Large language models facilitate the generation of electronic health record phenotyping algorithms. J Am Med Inform Assoc 2024:ocae072. [PMID: 38613820 DOI: 10.1093/jamia/ocae072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 02/21/2024] [Accepted: 03/22/2024] [Indexed: 04/15/2024] Open
Abstract
OBJECTIVES Phenotyping is a core task in observational health research utilizing electronic health records (EHRs). Developing an accurate algorithm demands substantial input from domain experts, involving extensive literature review and evidence synthesis. This burdensome process limits scalability and delays knowledge discovery. We investigate the potential for leveraging large language models (LLMs) to enhance the efficiency of EHR phenotyping by generating high-quality algorithm drafts. MATERIALS AND METHODS We prompted four LLMs-GPT-4 and GPT-3.5 of ChatGPT, Claude 2, and Bard-in October 2023, asking them to generate executable phenotyping algorithms in the form of SQL queries adhering to a common data model (CDM) for three phenotypes (ie, type 2 diabetes mellitus, dementia, and hypothyroidism). Three phenotyping experts evaluated the returned algorithms across several critical metrics. We further implemented the top-rated algorithms and compared them against clinician-validated phenotyping algorithms from the Electronic Medical Records and Genomics (eMERGE) network. RESULTS GPT-4 and GPT-3.5 exhibited significantly higher overall expert evaluation scores in instruction following, algorithmic logic, and SQL executability, when compared to Claude 2 and Bard. Although GPT-4 and GPT-3.5 effectively identified relevant clinical concepts, they exhibited immature capability in organizing phenotyping criteria with the proper logic, leading to phenotyping algorithms that were either excessively restrictive (with low recall) or overly broad (with low positive predictive values). CONCLUSION GPT versions 3.5 and 4 are capable of drafting phenotyping algorithms by identifying relevant clinical criteria aligned with a CDM. However, expertise in informatics and clinical experience is still required to assess and further refine generated algorithms.
Collapse
Affiliation(s)
- Chao Yan
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Henry H Ong
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Monika E Grabowska
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Matthew S Krantz
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Wu-Chen Su
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Alyson L Dickson
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Josh F Peterson
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - QiPing Feng
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Dan M Roden
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - C Michael Stein
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - V Eric Kerchberger
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Bradley A Malin
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Computer Science, Vanderbilt University, Nashville, TN 37203, United States
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37203, United States
- Department of Computer Science, Vanderbilt University, Nashville, TN 37203, United States
| |
Collapse
|
4
|
Mizuno S, Wagata M, Nagaie S, Ishikuro M, Obara T, Tamiya G, Kuriyama S, Tanaka H, Yaegashi N, Yamamoto M, Sugawara J, Ogishima S. Development of phenotyping algorithms for hypertensive disorders of pregnancy (HDP) and their application in more than 22,000 pregnant women. Sci Rep 2024; 14:6292. [PMID: 38491024 PMCID: PMC10943000 DOI: 10.1038/s41598-024-55914-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 02/28/2024] [Indexed: 03/18/2024] Open
Abstract
Recently, many phenotyping algorithms for high-throughput cohort identification have been developed. Prospective genome cohort studies are critical resources for precision medicine, but there are many hurdles in the precise cohort identification. Consequently, it is important to develop phenotyping algorithms for cohort data collection. Hypertensive disorders of pregnancy (HDP) is a leading cause of maternal morbidity and mortality. In this study, we developed, applied, and validated rule-based phenotyping algorithms of HDP. Two phenotyping algorithms, algorithms 1 and 2, were developed according to American and Japanese guidelines, and applied into 22,452 pregnant women in the Birth and Three-Generation Cohort Study of the Tohoku Medical Megabank project. To precise cohort identification, we analyzed both structured data (e.g., laboratory and physiological tests) and unstructured clinical notes. The identified subtypes of HDP were validated against reference standards. Algorithms 1 and 2 identified 7.93% and 8.08% of the subjects as having HDP, respectively, along with their HDP subtypes. Our algorithms were high performing with high positive predictive values (0.96 and 0.90 for algorithms 1 and 2, respectively). Overcoming the hurdle of precise cohort identification from large-scale cohort data collection, we achieved both developed and implemented phenotyping algorithms, and precisely identified HDP patients and their subtypes from large-scale cohort data collection.
Collapse
Affiliation(s)
- Satoshi Mizuno
- Department of Informatics for Genomic Medicine, Tohoku Medical Megabank Organization, Tohoku University, 2-1, Seiryo-Machi, Aoba-Ku, Sendai, Miyagi, 980-8575, Japan
| | - Maiko Wagata
- Department of Feto-Maternal Medical Science, Tohoku Medical Megabank Organization, Tohoku University, Miyagi, Japan
| | - Satoshi Nagaie
- Department of Informatics for Genomic Medicine, Tohoku Medical Megabank Organization, Tohoku University, 2-1, Seiryo-Machi, Aoba-Ku, Sendai, Miyagi, 980-8575, Japan
| | - Mami Ishikuro
- Department of Molecular Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Miyagi, Japan
| | - Taku Obara
- Department of Molecular Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Miyagi, Japan
| | - Gen Tamiya
- Department of Statistical Genetics and Genomics, Tohoku Medical Megabank Organization, Tohoku University, Miyagi, Japan
| | - Shinichi Kuriyama
- Department of Molecular Epidemiology, Tohoku Medical Megabank Organization, Tohoku University, Miyagi, Japan
| | | | - Nobuo Yaegashi
- Department of Gynecology and Obstetrics, Tohoku University Graduate School of Medicine, Tohoku University, Miyagi, Japan
| | - Masayuki Yamamoto
- Department of Biochemistry and Molecular Biology, Tohoku Medical Megabank Organization, Tohoku University, Miyagi, Japan
| | - Junichi Sugawara
- Department of Gynecology and Obstetrics, Tohoku University Graduate School of Medicine, Tohoku University, Miyagi, Japan
- Suzuki Memorial Hospital, 3-5-5, Satonomori, Iwanumashi, Miyagi, Japan
| | - Soichi Ogishima
- Department of Informatics for Genomic Medicine, Tohoku Medical Megabank Organization, Tohoku University, 2-1, Seiryo-Machi, Aoba-Ku, Sendai, Miyagi, 980-8575, Japan.
- Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University, Miyagi, Japan.
| |
Collapse
|
5
|
Lo Faro V, Johansson T, Johansson Å. The risk of venous thromboembolism in oral contraceptive users: the role of genetic factors-a prospective cohort study of 240,000 women in the UK Biobank. Am J Obstet Gynecol 2024; 230:360.e1-360.e13. [PMID: 37734636 DOI: 10.1016/j.ajog.2023.09.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 08/31/2023] [Accepted: 09/13/2023] [Indexed: 09/23/2023]
Abstract
BACKGROUND More than 150 million women worldwide use oral contraceptives. Women with inherited thrombophilia and carriers of certain thrombophilia gene variants, such as factor V Leiden and the prothrombin, are at an increased risk for venous thromboembolism, especially when combined with oral contraceptive use. Venous thromboembolism is a complex disorder involving many genetic risk factors, and recently, polygenic risk scores have been proposed to capture a significant proportion of the genetic risk of venous thromboembolism. OBJECTIVE The aim of this study was to estimate the risk for developing venous thromboembolism when initiating oral contraceptive use (first 2 years) and during continued use among women with a high genetic liability. STUDY DESIGN We used a prospective study design in which 244,420 participants from the UK Biobank were followed from birth. The effect of oral contraceptive use during the first 2 years and in the remaining years of oral contraceptive use on the risk of developing venous thromboembolism was estimated using a Cox regression with a time-dependent exposure variable. Women were stratified according to their polygenic risk scores and whether they were carriers of factor V Leiden and/or prothrombin variants. RESULTS When genetic risk was not considered, an increased risk for venous thromboembolism was observed during the first 2 years of oral contraceptive use (hazard ratio, 3.09; 95% confidence interval, 3.00-3.20) but not during continued use (hazard ratio, 0.92; 95% confidence interval, 0.80-1.05). However, when genetic risk was considered, women in the highest polygenic risk score category had a more pronounced risk of developing a venous thromboembolism during the first 2 years of oral contraceptive use (hazard ratio, 6.35; 95% confidence interval, 4.98-8.09), and a high risk was also observed among factor V Leiden (hazard ratio, 5.73; 95% confidence interval, 5.31-6.17) and prothrombin variant carriers (hazard ratio, 5.23; 95% confidence interval, 4.67 - 5.87). A high polygenic risk score in combination with being a factor V Leiden and prothrombin variant carrier conferred the highest risk for developing a venous thromboembolism during the first 2 years of oral contraceptive use (hazard ratio, 14.8; 95% confidence interval, 9.28-23.6). Women with a high genetic liability also had an increased risk during continued use but it was less pronounced, and the highest risk was conferred to carriers of both factor V Leiden and the prothrombin variant (hazard ratio, 4.93; 95% confidence interval, 3.16-7.7). CONCLUSION Evaluating polygenic risk can identify additional venous thromboembolism risk that is not captured in the commonly investigated genes for inherited thrombophilia. Our results indicate that oral contraceptive use is associated with an increased risk for developing a venous thromboembolism, particularly among women with a high genetic predisposition, and that oral contraceptive use dramatically increases the risk thereof short after initiation of use, which decreases with continued use. This suggests that the polygenic risk score could be used to identify women who are at high risk for developing a venous thromboembolism and advise them on alternative methods of contraception.
Collapse
Affiliation(s)
- Valeria Lo Faro
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.
| | - Therese Johansson
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden; Centre for Women's Mental Health during the Reproductive Lifespan - Womher, Uppsala University, Sweden
| | - Åsa Johansson
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| |
Collapse
|
6
|
Lennon NJ, Kottyan LC, Kachulis C, Abul-Husn NS, Arias J, Belbin G, Below JE, Berndt SI, Chung WK, Cimino JJ, Clayton EW, Connolly JJ, Crosslin DR, Dikilitas O, Velez Edwards DR, Feng Q, Fisher M, Freimuth RR, Ge T, Glessner JT, Gordon AS, Patterson C, Hakonarson H, Harden M, Harr M, Hirschhorn JN, Hoggart C, Hsu L, Irvin MR, Jarvik GP, Karlson EW, Khan A, Khera A, Kiryluk K, Kullo I, Larkin K, Limdi N, Linder JE, Loos RJF, Luo Y, Malolepsza E, Manolio TA, Martin LJ, McCarthy L, McNally EM, Meigs JB, Mersha TB, Mosley JD, Musick A, Namjou B, Pai N, Pesce LL, Peters U, Peterson JF, Prows CA, Puckelwartz MJ, Rehm HL, Roden DM, Rosenthal EA, Rowley R, Sawicki KT, Schaid DJ, Smit RAJ, Smith JL, Smoller JW, Thomas M, Tiwari H, Toledo DM, Vaitinadin NS, Veenstra D, Walunas TL, Wang Z, Wei WQ, Weng C, Wiesner GL, Yin X, Kenny EE. Selection, optimization and validation of ten chronic disease polygenic risk scores for clinical implementation in diverse US populations. Nat Med 2024; 30:480-487. [PMID: 38374346 PMCID: PMC10878968 DOI: 10.1038/s41591-024-02796-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 01/02/2024] [Indexed: 02/21/2024]
Abstract
Polygenic risk scores (PRSs) have improved in predictive performance, but several challenges remain to be addressed before PRSs can be implemented in the clinic, including reduced predictive performance of PRSs in diverse populations, and the interpretation and communication of genetic results to both providers and patients. To address these challenges, the National Human Genome Research Institute-funded Electronic Medical Records and Genomics (eMERGE) Network has developed a framework and pipeline for return of a PRS-based genome-informed risk assessment to 25,000 diverse adults and children as part of a clinical study. From an initial list of 23 conditions, ten were selected for implementation based on PRS performance, medical actionability and potential clinical utility, including cardiometabolic diseases and cancer. Standardized metrics were considered in the selection process, with additional consideration given to strength of evidence in African and Hispanic populations. We then developed a pipeline for clinical PRS implementation (score transfer to a clinical laboratory, validation and verification of score performance), and used genetic ancestry to calibrate PRS mean and variance, utilizing genetically diverse data from 13,475 participants of the All of Us Research Program cohort to train and test model parameters. Finally, we created a framework for regulatory compliance and developed a PRS clinical report for return to providers and for inclusion in an additional genome-informed risk assessment. The initial experience from eMERGE can inform the approach needed to implement PRS-based testing in diverse clinical settings.
Collapse
Affiliation(s)
| | - Leah C Kottyan
- Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH, USA
| | | | | | - Josh Arias
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Gillian Belbin
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Sonja I Berndt
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - James J Cimino
- University of Alabama at Birmingham, Birmingham, AL, USA
| | | | | | - David R Crosslin
- Tulane University, New Orleans, LA, USA
- University of Washington, Seattle, WA, USA
| | | | | | - QiPing Feng
- Vanderbilt University Medical Center, Nashville, TN, USA
| | | | | | - Tian Ge
- Mass General Brigham, Boston, MA, USA
| | | | | | | | | | - Maegan Harden
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Margaret Harr
- Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Joel N Hirschhorn
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Boston Children's Hospital, Boston, MA, USA
| | - Clive Hoggart
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Li Hsu
- Fred Hutchinson Cancer Center, Seattle, WA, USA
| | | | | | | | | | - Amit Khera
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | | | - Katie Larkin
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nita Limdi
- University of Alabama at Birmingham, Birmingham, AL, USA
| | | | - Ruth J F Loos
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Yuan Luo
- Northwestern University, Evanston, IL, USA
| | | | - Teri A Manolio
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Lisa J Martin
- Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH, USA
| | - Li McCarthy
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | | | - Tesfaye B Mersha
- Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH, USA
| | | | | | - Bahram Namjou
- Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH, USA
| | - Nihal Pai
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | | | | | - Cynthia A Prows
- Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH, USA
| | | | - Heidi L Rehm
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Dan M Roden
- Vanderbilt University Medical Center, Nashville, TN, USA
| | | | - Robb Rowley
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | | | | | | | | | | | - Hemant Tiwari
- University of Alabama at Birmingham, Birmingham, AL, USA
| | | | | | | | | | - Zhe Wang
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Wei-Qi Wei
- Vanderbilt University Medical Center, Nashville, TN, USA
| | | | | | | | - Eimear E Kenny
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
7
|
Mollalo A, Hamidi B, Lenert L, Alekseyenko AV. Application of Spatial Analysis for Electronic Health Records: Characterizing Patient Phenotypes and Emerging Trends. RESEARCH SQUARE 2024:rs.3.rs-3443865. [PMID: 37886509 PMCID: PMC10602163 DOI: 10.21203/rs.3.rs-3443865/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]
Abstract
Background Electronic health records (EHR) commonly contain patient addresses that provide valuable data for geocoding and spatial analysis, enabling more comprehensive descriptions of individual patients for clinical purposes. Despite the widespread use of EHR in clinical decision support and interventions, no systematic review has examined the extent to which spatial analysis is used to characterize patient phenotypes. Objective This study reviews advanced spatial analyses that employed individual-level health data from EHR within the US to characterize patient phenotypes. Methods We systematically evaluated English-language peer-reviewed articles from PubMed/MEDLINE, Scopus, Web of Science, and Google Scholar databases from inception to August 20, 2023, without imposing constraints on time, study design, or specific health domains. Results Only 49 articles met the eligibility criteria. These articles utilized diverse spatial methods, with a predominant focus on clustering techniques, while spatiotemporal analysis (frequentist and Bayesian) and modeling were relatively underexplored. A noteworthy surge (n = 42, 85.7%) in publications was observed post-2017. The publications investigated a variety of adult and pediatric clinical areas, including infectious disease, endocrinology, and cardiology, using phenotypes defined over a range of data domains, such as demographics, diagnoses, and visits. The primary health outcomes investigated were asthma, hypertension, and diabetes. Notably, patient phenotypes involving genomics, imaging, and notes were rarely utilized. Conclusions This review underscores the growing interest in spatial analysis of EHR-derived data and highlights knowledge gaps in clinical health, phenotype domains, and spatial methodologies. Additionally, this review proposes guidelines for harnessing the potential of spatial analysis to enhance the context of individual patients for future clinical decision support.
Collapse
|
8
|
Li R, Benz L, Duan R, Denny JC, Hakonarson H, Mosley JD, Smoller JW, Wei WQ, Ritchie MD, Moore JH, Chen Y. mixWAS: An efficient distributed algorithm for mixed-outcomes genome-wide association studies. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.01.09.24301073. [PMID: 38260403 PMCID: PMC10802662 DOI: 10.1101/2024.01.09.24301073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2024]
Abstract
Genome-wide association studies (GWAS) have been instrumental in identifying genetic associations for various diseases and traits. However, uncovering genetic underpinnings among traits beyond univariate phenotype associations remains a challenge. Multi-phenotype associations (MPA), or genetic pleiotropy, offer important insights into shared genes and pathways among traits, enhancing our understanding of genetic architectures of complex diseases. GWAS of biobank-linked electronic health record (EHR) data are increasingly being utilized to identify MPA among various traits and diseases. However, methodologies that can efficiently take advantage of distributed EHR to detect MPA are still lacking. Here, we introduce mixWAS, a novel algorithm that efficiently and losslessly integrates multiple EHRs via summary statistics, allowing the detection of MPA among mixed phenotypes while accounting for heterogeneities across EHRs. Simulations demonstrate that mixWAS outperforms the widely used MPA detection method, Phenome-wide association study (PheWAS), across diverse scenarios. Applying mixWAS to data from seven EHRs in the US, we identified 4,534 MPA among blood lipids, BMI, and circulatory diseases. Validation in an independent EHR data from UK confirmed 97.7% of the associations. mixWAS fundamentally improves the detection of MPA and is available as a free, open-source software.
Collapse
Affiliation(s)
- Ruowang Li
- Department of Computational Biomedicine, Cedars-Sinai Medical Center
| | - Luke Benz
- Department of Biostatistics, Harvard T.H. Chan School of Public Health
| | - Rui Duan
- Department of Biostatistics, Harvard T.H. Chan School of Public Health
| | - Joshua C Denny
- National Human Genome Research Institute, National Institutes of Health
| | - Hakon Hakonarson
- Division of Human Genetics, Children's Hospital of Philadelphia
- Center for Applied Genomics, Children's Hospital of Philadelphia
- Department of Pediatrics, University of Pennsylvania, Perelman School of Medicine
| | - Jonathan D Mosley
- Department of Medicine, Vanderbilt University Medical Center
- Department of Biomedical Informatics, Vanderbilt University Medical Center
| | - Jordan W Smoller
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital
- Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center
| | - Marylyn D Ritchie
- Department of Genetics and Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania
| | - Jason H Moore
- Department of Computational Biomedicine, Cedars-Sinai Medical Center
| | - Yong Chen
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania
| |
Collapse
|
9
|
Lieb W, Strathmann EA, Röder C, Jacobs G, Gaede KI, Richter G, Illig T, Krawczak M. Population-Based Biobanking. Genes (Basel) 2024; 15:66. [PMID: 38254956 PMCID: PMC10815030 DOI: 10.3390/genes15010066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 12/22/2023] [Accepted: 12/25/2023] [Indexed: 01/24/2024] Open
Abstract
Population-based biobanking is an essential element of medical research that has grown substantially over the last two decades, and many countries are currently pursuing large national biobanking initiatives. The rise of individual biobanks is paralleled by various networking activities in the field at both the national and international level, such as BBMRI-ERIC in the EU. A significant contribution to population-based biobanking comes from large cohort studies and national repositories, including the United Kingdom Biobank (UKBB), the CONSTANCES project in France, the German National Cohort (NAKO), LifeLines in the Netherlands, FinnGen in Finland, and the All of Us project in the U.S. At the same time, hospital-based biobanking has also gained importance in medical research. We describe some of the scientific questions that can be addressed particularly well by the use of population-based biobanks, including the discovery and calibration of biomarkers and the identification of molecular correlates of health parameters and disease states. Despite the tremendous progress made so far, some major challenges to population-based biobanking still remain, including the need to develop strategies for the long-term sustainability of biobanks, the handling of incidental findings, and the linkage of sample-related and sample-derived data to other relevant resources.
Collapse
Affiliation(s)
- Wolfgang Lieb
- Institute of Epidemiology and Biobank Popgen, Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany; (E.A.S.); (C.R.)
- German Centre for Lung Research (DZL), Airway Research Centre North (ARCN), 22927 Großhansdorf, Germany; (K.I.G.); (G.R.); (T.I.)
- PopGen 2.0 Biobanking Network (P2N), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany;
| | - Eike A. Strathmann
- Institute of Epidemiology and Biobank Popgen, Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany; (E.A.S.); (C.R.)
| | - Christian Röder
- Institute of Epidemiology and Biobank Popgen, Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany; (E.A.S.); (C.R.)
- PopGen 2.0 Biobanking Network (P2N), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany;
- Institute for Experimental Cancer Research (IET), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany
| | - Gunnar Jacobs
- Institute of Epidemiology and Biobank Popgen, Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany; (E.A.S.); (C.R.)
- PopGen 2.0 Biobanking Network (P2N), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany;
| | - Karoline I. Gaede
- German Centre for Lung Research (DZL), Airway Research Centre North (ARCN), 22927 Großhansdorf, Germany; (K.I.G.); (G.R.); (T.I.)
- PopGen 2.0 Biobanking Network (P2N), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany;
- BioMaterialBank (BMB) North, Research Center Borstel, Leibniz Lung Center, 23845 Borstel, Germany
| | - Gesine Richter
- German Centre for Lung Research (DZL), Airway Research Centre North (ARCN), 22927 Großhansdorf, Germany; (K.I.G.); (G.R.); (T.I.)
- PopGen 2.0 Biobanking Network (P2N), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany;
- Institute of Experimental Medicine (IEM), Division of Biomedical Ethics, Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany
| | - Thomas Illig
- German Centre for Lung Research (DZL), Airway Research Centre North (ARCN), 22927 Großhansdorf, Germany; (K.I.G.); (G.R.); (T.I.)
- Hannover Unified Biobank (HUB), Hannover Medical School, 30625 Hannover, Germany
- German Center for Lung Research (DZL), Biomedical Research in Endstage and Obstructive Lung Disease Hannover (BREATH), 30625 Hannover, Germany
| | - Michael Krawczak
- PopGen 2.0 Biobanking Network (P2N), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany;
- Institute of Medical Informatics and Statistics (IMIS), Kiel University, University Hospital Schleswig-Holstein, Campus Kiel, 24105 Kiel, Germany
| |
Collapse
|
10
|
Patil R, Ashraf F, Dayeh SA, Prakash SK. Genomic Medicine Guidance: A Point-of-Care App for Heritable Thoracic Aortic Diseases. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.22.23299696. [PMID: 38234729 PMCID: PMC10793513 DOI: 10.1101/2023.12.22.23299696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Genetic testing can determine familial and personal risks for heritable thoracic aortic aneurysms and dissections (TAD). The 2022 ACC/AHA guidelines for TAD recommend management decisions based on the specific gene mutation. However, many clinicians lack sufficient comfort or insight to integrate genetic information into clinical practice. We therefore developed the Genomic Medicine Guidance (GMG) app, an interactive point-of care tool to inform clinicians and patients about TAD diagnosis, treatment, and surveillance. GMG is a REDCap-based app that combines publicly available genetic data and clinical recommendations based on the TAD guidelines into one translational education tool. TAD genetic information in GMG was sourced from the Montalcino Aortic Consortium, a worldwide collaboration of TAD centers of excellence, and the NIH genetic repositories ClinVar and ClinGen. The app streamlines data on the 13 most frequently mutated TAD genes with 2,286 unique pathogenic mutations that cause TAD so that users receive comprehensive recommendations for diagnostic testing, imaging, surveillance, medical therapy, preventative surgical repair, as well as guidance for exercise safety and management during pregnancy. The app output can be displayed in a clinician view or exported as an informative pamphlet in a patient-friendly format. The overall goal of the GMG app is to make genomic medicine more accessible to clinicians and patients, while serving as a unifying platform for research. We anticipate that these features will be catalysts for collaborative projects that aim to understand the spectrum of genetic variants that contribute to TAD.
Collapse
Affiliation(s)
- Rohan Patil
- Department of Internal Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Fatima Ashraf
- McWilliams School of Bioinformatics, University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Samer Abu Dayeh
- Department of Internal Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Siddharth K. Prakash
- Department of Internal Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, Texas, USA
| |
Collapse
|
11
|
Bednorz A, Mak JKL, Jylhävä J, Religa D. Use of Electronic Medical Records (EMR) in Gerontology: Benefits, Considerations and a Promising Future. Clin Interv Aging 2023; 18:2171-2183. [PMID: 38152074 PMCID: PMC10752027 DOI: 10.2147/cia.s400887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 11/05/2023] [Indexed: 12/29/2023] Open
Abstract
Electronic medical records (EMRs) have many benefits in clinical research in gerontology, enabling data analysis, development of prognostic tools and disease risk prediction. EMRs also offer a range of advantages in clinical practice, such as comprehensive medical records, streamlined communication with healthcare providers, remote data access, and rapid retrieval of test results, ultimately leading to increased efficiency, enhanced patient safety, and improved quality of care in gerontology, which includes benefits like reduced medication use and better patient history taking and physical examination assessments. The use of artificial intelligence (AI) and machine learning (ML) approaches on EMRs can further improve disease diagnosis, symptom classification, and support clinical decision-making. However, there are also challenges related to data quality, data entry errors, as well as the ethics and safety of using AI in healthcare. This article discusses the future of EMRs in gerontology and the application of AI and ML in clinical research. Ethical and legal issues surrounding data sharing and the need for healthcare professionals to critically evaluate and integrate these technologies are also emphasized. The article concludes by discussing the challenges related to the use of EMRs in research as well as in their primary intended use, the daily clinical practice.
Collapse
Affiliation(s)
- Adam Bednorz
- John Paul II Geriatric Hospital, Katowice, Poland
- Institute of Psychology, Humanitas Academy, Sosnowiec, Poland
| | - Jonathan K L Mak
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Juulia Jylhävä
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Faculty of Social Sciences (Health Sciences) and Gerontology Research Center (GEREC), University of Tampere, Tampere, Finland
| | - Dorota Religa
- Division of Clinical Geriatrics, Department of Neurobiology, Care sciences and Society, Karolinska Institutet, Stockholm, Sweden
- Theme Inflammation and Aging, Karolinska University Hospital, Huddinge, Sweden
| |
Collapse
|
12
|
Wang Y, Stroh JN, Hripcsak G, Low Wang CC, Bennett TD, Wrobel J, Der Nigoghossian C, Mueller SW, Claassen J, Albers DJ. A methodology of phenotyping ICU patients from EHR data: High-fidelity, personalized, and interpretable phenotypes estimation. J Biomed Inform 2023; 148:104547. [PMID: 37984547 PMCID: PMC10802138 DOI: 10.1016/j.jbi.2023.104547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/13/2023] [Accepted: 11/16/2023] [Indexed: 11/22/2023]
Abstract
OBJECTIVE Computing phenotypes that provide high-fidelity, time-dependent characterizations and yield personalized interpretations is challenging, especially given the complexity of physiological and healthcare systems and clinical data quality. This paper develops a methodological pipeline to estimate unmeasured physiological parameters and produce high-fidelity, personalized phenotypes anchored to physiological mechanics from electronic health record (EHR). METHODS A methodological phenotyping pipeline is developed that computes new phenotypes defined with unmeasurable computational biomarkers quantifying specific physiological properties in real time. Working within the inverse problem framework, this pipeline is applied to the glucose-insulin system for ICU patients using data assimilation to estimate an established mathematical physiological model with stochastic optimization. This produces physiological model parameter vectors of clinically unmeasured endocrine properties, here insulin secretion, clearance, and resistance, estimated for individual patient. These physiological parameter vectors are used as inputs to unsupervised machine learning methods to produce phenotypic labels and discrete physiological phenotypes. These phenotypes are inherently interpretable because they are based on parametric physiological descriptors. To establish potential clinical utility, the computed phenotypes are evaluated with external EHR data for consistency and reliability and with clinician face validation. RESULTS The phenotype computation was performed on a cohort of 109 ICU patients who received no or short-acting insulin therapy, rendering continuous and discrete physiological phenotypes as specific computational biomarkers of unmeasured insulin secretion, clearance, and resistance on time windows of three days. Six, six, and five discrete phenotypes were found in the first, middle, and last three-day periods of ICU stays, respectively. Computed phenotypic labels were predictive with an average accuracy of 89%. External validation of discrete phenotypes showed coherence and consistency in clinically observable differences based on laboratory measurements and ICD 9/10 codes and clinical concordance from face validity. A particularly clinically impactful parameter, insulin secretion, had a concordance accuracy of 83%±27%. CONCLUSION The new physiological phenotypes computed with individual patient ICU data and defined by estimates of mechanistic model parameters have high physiological fidelity, are continuous, time-specific, personalized, interpretable, and predictive. This methodology is generalizable to other clinical and physiological settings and opens the door for discovering deeper physiological information to personalize medical care.
Collapse
Affiliation(s)
- Yanran Wang
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, 13001 East 17th Place, 3rd Floor, Mail Stop B119, Aurora, CO 80045, United States of America; Department of Biomedical Informatics, University of Colorado School of Medicine, Anschutz Health Sciences Building, 1890 N. Revere Court, Mailstop F600, Aurora, CO 80045, United States of America.
| | - J N Stroh
- Department of Biomedical Informatics, University of Colorado School of Medicine, Anschutz Health Sciences Building, 1890 N. Revere Court, Mailstop F600, Aurora, CO 80045, United States of America; Department of Biomedical Engineering, University of Colorado, 12705 East Montview Boulevard, Suite 100, Aurora, CO 80045, United States of America
| | - George Hripcsak
- Biomedical Informatics, Columbia University, 622 W. 168th Street, PH20, New York, NY 10032, United States of America
| | - Cecilia C Low Wang
- Division of Endocrinology, Metabolism and Diabetes, Department of Medicine, University of Colorado School of Medicine, 12801 East 17th Avenue, 7103, Aurora, CO 80045, United States of America
| | - Tellen D Bennett
- Department of Biomedical Informatics, University of Colorado School of Medicine, Anschutz Health Sciences Building, 1890 N. Revere Court, Mailstop F600, Aurora, CO 80045, United States of America
| | - Julia Wrobel
- Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, 1518 Clifton Rd, NE Atlanta, GA 30322, United States of America
| | - Caroline Der Nigoghossian
- Columbia University School of Nursing, 560 West 168th Street, New York, NY 10032, United States of America
| | - Scott W Mueller
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of Colorado Anschutz Medical Campus, 12850 East Montview Boulevard, Aurora, CO 80045, United States of America
| | - Jan Claassen
- The Neurological Institute of New York, Columbia University Irving Medical Center, 710 West 168th Street, New York NY 10032, United States of America
| | - D J Albers
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, 13001 East 17th Place, 3rd Floor, Mail Stop B119, Aurora, CO 80045, United States of America; Department of Biomedical Informatics, University of Colorado School of Medicine, Anschutz Health Sciences Building, 1890 N. Revere Court, Mailstop F600, Aurora, CO 80045, United States of America; Department of Biomedical Engineering, University of Colorado, 12705 East Montview Boulevard, Suite 100, Aurora, CO 80045, United States of America; Biomedical Informatics, Columbia University, 622 W. 168th Street, PH20, New York, NY 10032, United States of America
| |
Collapse
|
13
|
Linder JE, Tao R, Chung WK, Kiryluk K, Liu C, Weng C, Connolly JJ, Hakonarson H, Harr M, Leppig KA, Jarvik GP, Veenstra DL, Aufox S, Chisholm RL, Gordon AS, Hoell C, Rasmussen-Torvik LJ, Smith ME, Holm IA, Miller EM, Prows CA, Elskeally O, Kullo IJ, Lee C, Jose S, Manolio TA, Rowley R, Padi-Adjirackor NA, Wilmayani NK, City B, Wei WQ, Wiesner GL, Rahm AK, Williams JL, Williams MS, Peterson JF. Prospective, multi-site study of healthcare utilization after actionable monogenic findings from clinical sequencing. Am J Hum Genet 2023; 110:1950-1958. [PMID: 37883979 PMCID: PMC10645563 DOI: 10.1016/j.ajhg.2023.10.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 10/05/2023] [Accepted: 10/08/2023] [Indexed: 10/28/2023] Open
Abstract
As large-scale genomic screening becomes increasingly prevalent, understanding the influence of actionable results on healthcare utilization is key to estimating the potential long-term clinical impact. The eMERGE network sequenced individuals for actionable genes in multiple genetic conditions and returned results to individuals, providers, and the electronic health record. Differences in recommended health services (laboratory, imaging, and procedural testing) delivered within 12 months of return were compared among individuals with pathogenic or likely pathogenic (P/LP) findings to matched individuals with negative findings before and after return of results. Of 16,218 adults, 477 unselected individuals were found to have a monogenic risk for arrhythmia (n = 95), breast cancer (n = 96), cardiomyopathy (n = 95), colorectal cancer (n = 105), or familial hypercholesterolemia (n = 86). Individuals with P/LP results more frequently received services after return (43.8%) compared to before return (25.6%) of results and compared to individuals with negative findings (24.9%; p < 0.0001). The annual cost of qualifying healthcare services increased from an average of $162 before return to $343 after return of results among the P/LP group (p < 0.0001); differences in the negative group were non-significant. The mean difference-in-differences was $149 (p < 0.0001), which describes the increased cost within the P/LP group corrected for cost changes in the negative group. When stratified by individual conditions, significant cost differences were observed for arrhythmia, breast cancer, and cardiomyopathy. In conclusion, less than half of individuals received billed health services after monogenic return, which modestly increased healthcare costs for payors in the year following return.
Collapse
Affiliation(s)
- Jodell E Linder
- Vanderbilt University Medical Center, Nashville, TN 37203, USA.
| | - Ran Tao
- Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | | | | | - Cong Liu
- Columbia University, New York, NY 10032, USA
| | | | - John J Connolly
- Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Hakon Hakonarson
- Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA; Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Margaret Harr
- Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Kathleen A Leppig
- Genetic Services, Kaiser Permanente of Washington, Seattle, WA 98195, USA
| | - Gail P Jarvik
- University of Washington Medical Center, Departments of Medicine (Medical Genetics) and Genome Sciences, Seattle, WA 98195, USA
| | - David L Veenstra
- University of Washington, Department of Pharmacy, Seattle, WA 98195, USA
| | - Sharon Aufox
- Northwestern University, Center for Genetic Medicine, Chicago, IL 60611, USA
| | - Rex L Chisholm
- Northwestern University, Center for Genetic Medicine, Chicago, IL 60611, USA
| | - Adam S Gordon
- Northwestern University, Center for Genetic Medicine, Chicago, IL 60611, USA
| | - Christin Hoell
- Northwestern University, Center for Genetic Medicine, Chicago, IL 60611, USA
| | | | - Maureen E Smith
- Northwestern University, Center for Genetic Medicine, Chicago, IL 60611, USA
| | | | - Erin M Miller
- Division of Cardiology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA; Department of Pediatrics, University of Cincinnati, College of Medicine, Cincinnati, OH, USA
| | - Cynthia A Prows
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH 45229, USA
| | | | | | | | - Sheethal Jose
- National Human Genome Research Institute, Bethesda, MD 20892, USA
| | - Teri A Manolio
- National Human Genome Research Institute, Bethesda, MD 20892, USA
| | - Robb Rowley
- National Human Genome Research Institute, Bethesda, MD 20892, USA
| | | | | | - Brittany City
- Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Wei-Qi Wei
- Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | | | | | - Janet L Williams
- Department of Genomic Health, Geisinger, Danville, PA 17822, USA
| | - Marc S Williams
- Department of Genomic Health, Geisinger, Danville, PA 17822, USA
| | - Josh F Peterson
- Vanderbilt University Medical Center, Nashville, TN 37203, USA
| |
Collapse
|
14
|
Ferar K, Hall TO, Crawford DC, Rowley R, Satterfield BA, Li R, Gragert L, Karlson EW, de Andrade M, Kullo IJ, McCarty CA, Kho A, Hayes MG, Ritchie MD, Crane PK, Mirel DB, Carlson C, Connolly JJ, Hakonarson H, Crenshaw AT, Carrell D, Luo Y, Dikilitas O, Denny JC, Jarvik GP, Crosslin DR. Genetic variation in the human leukocyte antigen region confers susceptibility to Clostridioides difficile infection. Sci Rep 2023; 13:18532. [PMID: 37898691 PMCID: PMC10613277 DOI: 10.1038/s41598-023-45649-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Accepted: 10/22/2023] [Indexed: 10/30/2023] Open
Abstract
Clostridioides difficile (C. diff.) infection (CDI) is a leading cause of hospital acquired diarrhea in North America and Europe and a major cause of morbidity and mortality. Known risk factors do not fully explain CDI susceptibility, and genetic susceptibility is suggested by the fact that some patients with colons that are colonized with C. diff. do not develop any infection while others develop severe or recurrent infections. To identify common genetic variants associated with CDI, we performed a genome-wide association analysis in 19,861 participants (1349 cases; 18,512 controls) from the Electronic Medical Records and Genomics (eMERGE) Network. Using logistic regression, we found strong evidence for genetic variation in the DRB locus of the MHC (HLA) II region that predisposes individuals to CDI (P > 1.0 × 10-14; OR 1.56). Altered transcriptional regulation in the HLA region may play a role in conferring susceptibility to this opportunistic enteric pathogen.
Collapse
Affiliation(s)
- Kathleen Ferar
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA.
| | - Taryn O Hall
- Optum Genomics, UnitedHealth Group, Minnetonka, MN, USA
| | - Dana C Crawford
- Department of Population and Quantitative Health Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
- Department of Genetics and Genome Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | - Robb Rowley
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Rongling Li
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Loren Gragert
- Division of Biomedical Informatics and Genomics, John W. Deming Department of Medicine, Tulane University School of Medicine, New Orleans, LA, USA
| | | | - Mariza de Andrade
- Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN, USA
| | - Iftikhar J Kullo
- Department of Cardiovascular Diseases, Mayo Clinic, Rochester, MN, USA
| | - Catherine A McCarty
- University of Minnesota Medical School, Duluth, MN, USA
- Center for Human Genetics, Marshfield Clinic Research Foundation, Marshfield, WI, USA
| | - Abel Kho
- Divisions of General Internal Medicine and Preventive Medicine, Northwestern University, Chicago, IL, USA
| | - M Geoffrey Hayes
- Division of Endocrinology, Metabolism, and Molecular Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Marylyn D Ritchie
- Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Pennsylvania State University, University Park, PA, USA
| | - Paul K Crane
- Division of General Internal Medicine, University of Washington, Seattle, WA, USA
| | | | - Christopher Carlson
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - John J Connolly
- Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Hakon Hakonarson
- Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - David Carrell
- Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA
| | - Yuan Luo
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Ozan Dikilitas
- Department of Cardiovascular Diseases, Mayo Clinic, Rochester, MN, USA
| | - Joshua C Denny
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
| | - Gail P Jarvik
- Department of Medicine (Medical Genetics), University of Washington Medical Center, Seattle, WA, USA
| | - David R Crosslin
- Division of Biomedical Informatics and Genomics, John W. Deming Department of Medicine, Tulane University School of Medicine, New Orleans, LA, USA.
| |
Collapse
|
15
|
Cervera de la Cruz P, Shabani M. Conceptualizing fairness in the secondary use of health data for research: A scoping review. Account Res 2023:1-30. [PMID: 37851101 DOI: 10.1080/08989621.2023.2271394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 10/12/2023] [Indexed: 10/19/2023]
Abstract
With the introduction of the European Health Data Space (EHDS), the secondary use of health data for research purposes is attracting more attention. Secondary health data processing promises to address novel research questions, inform the design of future research and improve healthcare delivery generally. To comply with the existing data protection regulations, the secondary data use must be fair, among other things. However, there is no clear understanding of what fairness means in the context of secondary use of health data for scientific research purposes. In response, we conducted a scoping review of argument-based literature to explore how fairness in the secondary use of health data has been conceptualized. A total of 35 publications were included in the final synthesis after abstract and full-text screening. Using an inductive approach and a thematic analysis, our review has revealed that balancing individual and public interests, reducing power asymmetries, setting conditions for commercial involvement, and implementing benefit sharing are essential to guarantee fair secondary use research. The findings of this review can inform current and future research practices and policy development to adequately address concerns about fairness in the secondary use of health data.
Collapse
Affiliation(s)
| | - Mahsa Shabani
- Metamedica, Faculty of Law and Criminology, University of Ghent, Ghent, Belgium
- Law Centre for Health and Life, Faculty of Law, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
16
|
Wang Y, Stroh JN, Hripcsak G, Low Wang CC, Bennett TD, Wrobel J, Der Nigoghossian C, Mueller S, Claassen J, Albers DJ. A methodology of phenotyping ICU patients from EHR data: high-fidelity, personalized, and interpretable phenotypes estimation. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.03.15.23287315. [PMID: 37662404 PMCID: PMC10473766 DOI: 10.1101/2023.03.15.23287315] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Objective Computing phenotypes that provide high-fidelity, time-dependent characterizations and yield personalized interpretations is challenging, especially given the complexity of physiological and healthcare systems and clinical data quality. This paper develops a methodological pipeline to estimate unmeasured physiological parameters and produce high-fidelity, personalized phenotypes anchored to physiological mechanics from electronic health record (EHR). Methods A methodological phenotyping pipeline is developed that computes new phenotypes defined with unmeasurable computational biomarkers quantifying specific physiological properties in real time. Working within the inverse problem framework, this pipeline is applied to the glucose-insulin system for ICU patients using data assimilation to estimate an established mathematical physiological model with stochastic optimization. This produces physiological model parameter vectors of clinically unmeasured endocrine properties, here insulin secretion, clearance, and resistance, estimated for individual patient. These physiological parameter vectors are used as inputs to unsupervised machine learning methods to produce phenotypic labels and discrete physiological phenotypes. These phenotypes are inherently interpretable because they are based on parametric physiological descriptors. To establish potential clinical utility, the computed phenotypes are evaluated with external EHR data for consistency and reliability and with clinician face validation. Results The phenotype computation was performed on a cohort of 109 ICU patients who received no or short-acting insulin therapy, rendering continuous and discrete physiological phenotypes as specific computational biomarkers of unmeasured insulin secretion, clearance, and resistance on time windows of three days. Six, six, and five discrete phenotypes were found in the first, middle, and last three-day periods of ICU stays, respectively. Computed phenotypic labels were predictive with an average accuracy of 89%. External validation of discrete phenotypes showed coherence and consistency in clinically observable differences based on laboratory measurements and ICD 9/10 codes and clinical concordance from face validity. A particularly clinically impactful parameter, insulin secretion, had a concordance accuracy of 83% ± 27%. Conclusion The new physiological phenotypes computed with individual patient ICU data and defined by estimates of mechanistic model parameters have high physiological fidelity, are continuous, time-specific, personalized, interpretable, and predictive. This methodology is generalizable to other clinical and physiological settings and opens the door for discovering deeper physiological information to personalize medical care.
Collapse
|
17
|
Cruz LA, Cooke Bailey JN, Crawford DC. Importance of Diversity in Precision Medicine: Generalizability of Genetic Associations Across Ancestry Groups Toward Better Identification of Disease Susceptibility Variants. Annu Rev Biomed Data Sci 2023; 6:339-356. [PMID: 37196357 PMCID: PMC10720270 DOI: 10.1146/annurev-biodatasci-122220-113250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Genome-wide association studies (GWAS) revolutionized our understanding of common genetic variation and its impact on common human disease and traits. Developed and adopted in the mid-2000s, GWAS led to searchable genotype-phenotype catalogs and genome-wide datasets available for further data mining and analysis for the eventual development of translational applications. The GWAS revolution was swift and specific, including almost exclusively populations of European descent, to the neglect of the majority of the world's genetic diversity. In this narrative review, we recount the GWAS landscape of the early years that established a genotype-phenotype catalog that is now universally understood to be inadequate for a complete understanding of complex human genetics. We then describe approaches taken to augment the genotype-phenotype catalog, including the study populations, collaborative consortia, and study design approaches aimed to generalize and then ultimately discover genome-wide associations in non-European descent populations. The collaborations and data resources established in the efforts to diversify genomic findings undoubtedly provide the foundations of the next chapters of genetic association studies with the advent of budget-friendly whole-genome sequencing.
Collapse
Affiliation(s)
- Lauren A Cruz
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
- Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, USA
| | - Jessica N Cooke Bailey
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, Ohio, USA
- Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, USA
| | - Dana C Crawford
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, Ohio, USA
- Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, USA
| |
Collapse
|
18
|
Liao SY, Jacobson S, Hamzeh NY, Culver DA, Barkes BQ, Mroz M, Macphail K, Pacheco K, Patel DC, Wasfi YS, Koth LL, Langefeld CD, Leach SM, White E, Montgomery C, Maier LA, Fingerlin TE. Genome-wide association study identifies multiple HLA loci for sarcoidosis susceptibility. Hum Mol Genet 2023; 32:2669-2678. [PMID: 37399103 PMCID: PMC10407706 DOI: 10.1093/hmg/ddad067] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Revised: 03/30/2023] [Accepted: 04/19/2023] [Indexed: 07/05/2023] Open
Abstract
Sarcoidosis is a complex systemic disease. Our study aimed to (1) identify novel alleles associated with sarcoidosis susceptibility; (2) provide an in-depth evaluation of HLA alleles and sarcoidosis susceptibility and (3) integrate genetic and transcription data to identify risk loci that may more directly impact disease pathogenesis. We report a genome-wide association study of 1335 sarcoidosis cases and 1264 controls of European descent (EA) and investigate associated alleles in a study of African Americans (AA: 1487 cases and 1504 controls). The EA and AA cohort was recruited from multiple United States sites. HLA alleles were imputed and tested for association with sarcoidosis susceptibility. Expression quantitative locus and colocalization analysis were performed using a subset of subjects with transcriptome data. Forty-nine SNPs in the HLA region in HLA-DRA, -DRB9, -DRB5, -DQA1 and BRD2 genes were significantly associated with sarcoidosis susceptibility in EA, rs3129888 was also a risk variant for sarcoidosis in AA. Classical HLA alleles DRB1*0101, DQA1*0101 and DQB1*0501, which are highly correlated, were also associated with sarcoidosis. rs3135287 near HLA-DRA was associated with HLA-DRA expression in peripheral blood mononuclear cells and bronchoalveolar lavage from subjects and lung tissue and whole blood from GTEx. We identified six novel SNPs (out of the seven SNPs representing the 49 significant SNPs) and nine HLA alleles associated with sarcoidosis susceptibility in the largest EA population. We also replicated our findings in an AA population. Our study reiterates the potential role of antigen recognition and/or presentation HLA class II genes in sarcoidosis pathogenesis.
Collapse
Affiliation(s)
- Shu-Yi Liao
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
- Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Colorado School of Public Health, University of Colorado Denver–Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Sean Jacobson
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
| | - Nabeel Y Hamzeh
- Department of Medicine, University of Iowa, Iowa City, IA 52242, USA
| | - Daniel A Culver
- Department of Medicine, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Briana Q Barkes
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
| | - Margarita Mroz
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
| | - Kristyn Macphail
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
| | - Karin Pacheco
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
- Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Colorado School of Public Health, University of Colorado Denver–Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Divya C Patel
- Department of Medicine, Division of Pulmonary, Critical Care and Sleep Medicine, University of Florida, Gainesville, FL 32610, USA
| | | | - Laura L Koth
- Department of Medicine, University of California-San Fransisco, San Fransisco, CA 94143, USA
| | - Carl D Langefeld
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC 27101, USA
- Wake Forest University School of Medicine, Center for Precision Medicine, Winston-Salem, NC 27101, USA
| | - Sonia M Leach
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
| | - Elizabeth White
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
| | | | - Lisa A Maier
- Department of Medicine, National Jewish Health, Denver, CO 80206, USA
- Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Colorado School of Public Health, University of Colorado Denver–Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Tasha E Fingerlin
- Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Colorado School of Public Health, University of Colorado Denver–Anschutz Medical Campus, Aurora, CO 80045, USA
- Department of Immunology and Genomic Medicine, National Jewish Health, Denver, CO 80206, USA
| | | |
Collapse
|
19
|
Clayton EW, Smith ME, Anderson KC, Chung WK, Connolly JJ, Fullerton SM, McGowan ML, Peterson JF, Prows CA, Sabatello M, Holm IA. Studying the impact of translational genomic research: Lessons from eMERGE. Am J Hum Genet 2023; 110:1021-1033. [PMID: 37343562 PMCID: PMC10357472 DOI: 10.1016/j.ajhg.2023.05.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 05/22/2023] [Accepted: 05/23/2023] [Indexed: 06/23/2023] Open
Abstract
Two major goals of the Electronic Medical Record and Genomics (eMERGE) Network are to learn how best to return research results to patient/participants and the clinicians who care for them and also to assess the impact of placing these results in clinical care. Yet since its inception, the Network has confronted a host of challenges in achieving these goals, many of which had ethical, legal, or social implications (ELSIs) that required consideration. Here, we share impediments we encountered in recruiting participants, returning results, and assessing their impact, all of which affected our ability to achieve the goals of eMERGE, as well as the steps we took to attempt to address these obstacles. We divide the domains in which we experienced challenges into four broad categories: (1) study design, including recruitment of more diverse groups; (2) consent; (3) returning results to participants and their health care providers (HCPs); and (4) assessment of follow-up care of participants and measuring the impact of research on participants and their families. Since most phases of eMERGE have included children as well as adults, we also address the particular ELSI posed by including pediatric populations in this research. We make specific suggestions for improving translational genomic research to ensure that future projects can effectively return results and assess their impact on patient/participants and providers if the goals of genomic-informed medicine are to be achieved.
Collapse
Affiliation(s)
- Ellen Wright Clayton
- Center for Biomedical Ethics and Society, Departments of Pediatrics and Health Policy, Vanderbilt University Medical Center, Nashville, TN 37203, USA.
| | - Maureen E Smith
- Department of Medicine, Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Katherine C Anderson
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Wendy K Chung
- Departments of Pediatrics and Medicine, Columbia University, New York, NY 10032, USA
| | - John J Connolly
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Stephanie M Fullerton
- Department of Bioethics & Humanities, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Michelle L McGowan
- Biomedical Ethics Research Program, Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905, USA; Department of Women's, Gender, and Sexuality Studies, University of Cincinnati, Cincinnati, OH 45221, USA
| | - Josh F Peterson
- Center for Precision Medicine, Department of Biomedical Informatics, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37203, USA
| | - Cynthia A Prows
- Divisions of Human Genetics and Patient Services, Cincinnati Children's Hospital, Cincinnati, OH 45229, USA
| | - Maya Sabatello
- Center for Precision Medicine & Genomics, Department of Medicine, and Division of Ethics, Department of Medical Humanities & Ethics Columbia University Vagelos College of Physicians and Surgeons, NY, NY 10032, USA
| | - Ingrid A Holm
- Division of Genetics and Genomics, Boston Children's Hospital; Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
20
|
Spotnitz M, Acharya N, Cimino JJ, Murphy S, Namjou B, Crimmins N, Walunas T, Liu C, Crosslin D, Benoit B, Rosenthal E, Pacheco JA, Ostropolets A, Reyes Nieva H, Patterson JS, Richter LR, Callahan TJ, Elhussein A, Pang C, Kiryluk K, Nestor J, Khan A, Mohan S, Minty E, Chung W, Wei WQ, Natarajan K, Weng C. A metadata framework for computational phenotypes. JAMIA Open 2023; 6:ooad032. [PMID: 37181728 PMCID: PMC10168627 DOI: 10.1093/jamiaopen/ooad032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 04/10/2023] [Accepted: 04/21/2023] [Indexed: 05/16/2023] Open
Abstract
With the burgeoning development of computational phenotypes, it is increasingly difficult to identify the right phenotype for the right tasks. This study uses a mixed-methods approach to develop and evaluate a novel metadata framework for retrieval of and reusing computational phenotypes. Twenty active phenotyping researchers from 2 large research networks, Electronic Medical Records and Genomics and Observational Health Data Sciences and Informatics, were recruited to suggest metadata elements. Once consensus was reached on 39 metadata elements, 47 new researchers were surveyed to evaluate the utility of the metadata framework. The survey consisted of 5-Likert multiple-choice questions and open-ended questions. Two more researchers were asked to use the metadata framework to annotate 8 type-2 diabetes mellitus phenotypes. More than 90% of the survey respondents rated metadata elements regarding phenotype definition and validation methods and metrics positively with a score of 4 or 5. Both researchers completed annotation of each phenotype within 60 min. Our thematic analysis of the narrative feedback indicates that the metadata framework was effective in capturing rich and explicit descriptions and enabling the search for phenotypes, compliance with data standards, and comprehensive validation metrics. Current limitations were its complexity for data collection and the entailed human costs.
Collapse
Affiliation(s)
- Matthew Spotnitz
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Nripendra Acharya
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - James J Cimino
- Informatics Institute, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Shawn Murphy
- Laboratory of Computer Science, Mass General Brigham, Boston, Massachusetts, USA
- Department of Neurology, Mass General Brigham, Boston, Massachusetts, USA
| | - Bahram Namjou
- Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA
| | - Nancy Crimmins
- Department of Pediatrics, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio, USA
| | - Theresa Walunas
- Department of Medicine, Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - Cong Liu
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - David Crosslin
- Division of Biomedical Informatics and Genomics, Tulane University School of Medicine, New Orleans, Louisiana, USA
| | - Barbara Benoit
- Department of Research Information Science & Computing, Mass General Brigham, Boston, Massachusetts, USA
| | | | - Jennifer A Pacheco
- Center for Genetic Medicine, Northwestern University, Chicago, Illinois, USA
| | - Anna Ostropolets
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Harry Reyes Nieva
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Jason S Patterson
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Lauren R Richter
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Ahmed Elhussein
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Chao Pang
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Jordan Nestor
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Atlas Khan
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Sumit Mohan
- Division of Nephrology, Department of Medicine, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
- Department of Epidemiology, Columbia University Mailman School of Public Health, New York, New York, USA
| | - Evan Minty
- Department of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Wendy Chung
- Department of Pediatrics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee, USA
| | - Karthik Natarajan
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Vagelos College of Physicians & Surgeons, Columbia University Irving Medical Center, New York, New York, USA
| |
Collapse
|
21
|
Chen F, Madduri RK, Rodriguez AA, Darst BF, Chou A, Sheng X, Wang A, Shen J, Saunders EJ, Rhie SK, Bensen JT, Ingles SA, Kittles RA, Strom SS, Rybicki BA, Nemesure B, Isaacs WB, Stanford JL, Zheng W, Sanderson M, John EM, Park JY, Xu J, Wang Y, Berndt SI, Huff CD, Yeboah ED, Tettey Y, Lachance J, Tang W, Rentsch CT, Cho K, Mcmahon BH, Biritwum RB, Adjei AA, Tay E, Truelove A, Niwa S, Sellers TA, Yamoah K, Murphy AB, Crawford DC, Patel AV, Bush WS, Aldrich MC, Cussenot O, Petrovics G, Cullen J, Neslund-Dudas CM, Stern MC, Kote-Jarai Z, Govindasami K, Cook MB, Chokkalingam AP, Hsing AW, Goodman PJ, Hoffmann TJ, Drake BF, Hu JJ, Keaton JM, Hellwege JN, Clark PE, Jalloh M, Gueye SM, Niang L, Ogunbiyi O, Idowu MO, Popoola O, Adebiyi AO, Aisuodionoe-Shadrach OI, Ajibola HO, Jamda MA, Oluwole OP, Nwegbu M, Adusei B, Mante S, Darkwa-Abrahams A, Mensah JE, Diop H, Van Den Eeden SK, Blanchet P, Fowke JH, Casey G, Hennis AJ, Lubwama A, Thompson IM, Leach R, Easton DF, Preuss MH, Loos RJ, Gundell SM, Wan P, Mohler JL, Fontham ET, Smith GJ, Taylor JA, Srivastava S, Eeles RA, Carpten JD, Kibel AS, Multigner L, Parent MÉ, Menegaux F, Cancel-Tassin G, Klein EA, Andrews C, Rebbeck TR, Brureau L, Ambs S, Edwards TL, Watya S, Chanock SJ, Witte JS, Blot WJ, Michael Gaziano J, Justice AC, Conti DV, Haiman CA. Evidence of Novel Susceptibility Variants for Prostate Cancer and a Multiancestry Polygenic Risk Score Associated with Aggressive Disease in Men of African Ancestry. Eur Urol 2023; 84:13-21. [PMID: 36872133 PMCID: PMC10424812 DOI: 10.1016/j.eururo.2023.01.022] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 12/14/2022] [Accepted: 01/24/2023] [Indexed: 03/06/2023]
Abstract
BACKGROUND Genetic factors play an important role in prostate cancer (PCa) susceptibility. OBJECTIVE To discover common genetic variants contributing to the risk of PCa in men of African ancestry. DESIGN, SETTING, AND PARTICIPANTS We conducted a meta-analysis of ten genome-wide association studies consisting of 19378 cases and 61620 controls of African ancestry. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS Common genotyped and imputed variants were tested for their association with PCa risk. Novel susceptibility loci were identified and incorporated into a multiancestry polygenic risk score (PRS). The PRS was evaluated for associations with PCa risk and disease aggressiveness. RESULTS AND LIMITATIONS Nine novel susceptibility loci for PCa were identified, of which seven were only found or substantially more common in men of African ancestry, including an African-specific stop-gain variant in the prostate-specific gene anoctamin 7 (ANO7). A multiancestry PRS of 278 risk variants conferred strong associations with PCa risk in African ancestry studies (odds ratios [ORs] >3 and >5 for men in the top PRS decile and percentile, respectively). More importantly, compared with men in the 40-60% PRS category, men in the top PRS decile had a significantly higher risk of aggressive PCa (OR = 1.23, 95% confidence interval = 1.10-1.38, p = 4.4 × 10-4). CONCLUSIONS This study demonstrates the importance of large-scale genetic studies in men of African ancestry for a better understanding of PCa susceptibility in this high-risk population and suggests a potential clinical utility of PRS in differentiating between the risks of developing aggressive and nonaggressive disease in men of African ancestry. PATIENT SUMMARY In this large genetic study in men of African ancestry, we discovered nine novel prostate cancer (PCa) risk variants. We also showed that a multiancestry polygenic risk score was effective in stratifying PCa risk, and was able to differentiate risk of aggressive and nonaggressive disease.
Collapse
Affiliation(s)
- Fei Chen
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | | | | | - Burcu F Darst
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Alisha Chou
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Xin Sheng
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Anqi Wang
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Jiayi Shen
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | | | - Suhn K Rhie
- Department of Biochemistry and Molecular Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Jeannette T Bensen
- Department of Epidemiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA; Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Sue A Ingles
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Rick A Kittles
- Department of Population Sciences, City of Hope Comprehensive Cancer Center, Duarte, CA, USA
| | - Sara S Strom
- Department of Epidemiology, University of Texas M.D. Anderson Cancer Center, Houston, TX, USA
| | - Benjamin A Rybicki
- Department of Public Health Sciences, Henry Ford Hospital, Detroit, MI, USA
| | - Barbara Nemesure
- Department of Family, Population and Preventive Medicine, Stony Brook University, Stony Brook, NY, USA
| | - William B Isaacs
- James Buchanan Brady Urological Institute, Johns Hopkins Hospital and Medical Institution, Baltimore, MD, USA
| | - Janet L Stanford
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Wei Zheng
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Maureen Sanderson
- Department of Family and Community Medicine, Meharry Medical College, Nashville, TN, USA
| | - Esther M John
- Department of Medicine, Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Jong Y Park
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
| | - Jianfeng Xu
- Program for Personalized Cancer Care and Department of Surgery, NorthShore University HealthSystem, Evanston, IL, USA
| | - Ying Wang
- Department of Population Science, American Cancer Society, Kennesaw, GA, USA
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD, USA
| | - Chad D Huff
- Department of Epidemiology, University of Texas M.D. Anderson Cancer Center, Houston, TX, USA
| | | | - Yao Tettey
- Department of Pathology, University of Ghana, Accra, Ghana; Korle Bu Teaching Hospital, Accra, Ghana
| | - Joseph Lachance
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - Wei Tang
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Christopher T Rentsch
- Yale School of Medicine, New Haven, CT, USA; VA Connecticut Healthcare System, West Haven, CT, USA; Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, London, UK
| | - Kelly Cho
- Division of Aging, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; VA Boston Healthcare System, Jamaica Plain, MA, USA
| | - Benjamin H Mcmahon
- Theoretical Biology Division, Los Alamos National Lab, Los Alamos, NM, USA
| | | | - Andrew A Adjei
- Department of Pathology, University of Ghana Medical School, Accra, Ghana
| | - Evelyn Tay
- Korle Bu Teaching Hospital, Accra, Ghana
| | | | | | - Thomas A Sellers
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
| | - Kosj Yamoah
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA; Department of Radiation Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Adam B Murphy
- Department of Urology, Northwestern University, Chicago, IL, USA
| | - Dana C Crawford
- Department of Population and Quantitative Health Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | - Alpa V Patel
- Department of Population Science, American Cancer Society, Kennesaw, GA, USA
| | - William S Bush
- Department of Population and Quantitative Health Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | - Melinda C Aldrich
- Division of Epidemiology, Department of Thoracic Surgery, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Olivier Cussenot
- Department of Urology and Predictive Onco-Urology Group, Sorbonne Université, GRC 5 Predictive Onco-Urology, APHP-Sorbonne Université, Paris, France; CeRePP, Tenon Hospital, Paris, France
| | - Gyorgy Petrovics
- Department of Surgery, Center for Prostate Disease Research, Uniformed Services University of the Health Sciences, Bethesda, MD, USA
| | - Jennifer Cullen
- Department of Population and Quantitative Health Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA; Department of Surgery, Center for Prostate Disease Research, Uniformed Services University of the Health Sciences, Bethesda, MD, USA
| | | | - Mariana C Stern
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | | | | | - Michael B Cook
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD, USA
| | | | - Ann W Hsing
- Department of Medicine, Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Phyllis J Goodman
- SWOG Statistical Center, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Thomas J Hoffmann
- Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA, USA
| | - Bettina F Drake
- Division of Public Health Sciences, Washington University School of Medicine, St. Louis, MO, USA
| | - Jennifer J Hu
- The University of Miami School of Medicine, Sylvester Comprehensive Cancer Center, Miami, FL, USA
| | - Jacob M Keaton
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jacklyn N Hellwege
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Division of Genetic Medicine, Department of Medicine, Vanderbilt Genetics Institute, Nashville, TN, USA
| | - Peter E Clark
- Atrium Health/Levine Cancer Institute, Charlotte, NC, USA
| | | | | | | | - Olufemi Ogunbiyi
- College of Medicine, University of Ibadan and University College Hospital, Ibadan, Nigeria
| | - Michael O Idowu
- College of Medicine, University of Ibadan and University College Hospital, Ibadan, Nigeria
| | - Olufemi Popoola
- College of Medicine, University of Ibadan and University College Hospital, Ibadan, Nigeria
| | - Akindele O Adebiyi
- College of Medicine, University of Ibadan and University College Hospital, Ibadan, Nigeria
| | - Oseremen I Aisuodionoe-Shadrach
- College of Health Sciences, University of Abuja, University of Abuja Teaching Hospital and Cancer Science Center, Abuja, Nigeria
| | - Hafees O Ajibola
- College of Health Sciences, University of Abuja, University of Abuja Teaching Hospital and Cancer Science Center, Abuja, Nigeria
| | - Mustapha A Jamda
- College of Health Sciences, University of Abuja, University of Abuja Teaching Hospital and Cancer Science Center, Abuja, Nigeria
| | - Olabode P Oluwole
- College of Health Sciences, University of Abuja, University of Abuja Teaching Hospital and Cancer Science Center, Abuja, Nigeria
| | - Maxwell Nwegbu
- College of Health Sciences, University of Abuja, University of Abuja Teaching Hospital and Cancer Science Center, Abuja, Nigeria
| | | | | | | | | | - Halimatou Diop
- Laboratoires Bacteriologie et Virologie, Hôpital Aristide Le Dantec, Dakar, Senegal
| | - Stephen K Van Den Eeden
- Division of Research, Kaiser Permanente, Northern California, Oakland, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA
| | - Pascal Blanchet
- CHU de Pointe-à-Pitre, Univ Antilles, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), Pointe-à-Pitre, Guadeloupe, France
| | - Jay H Fowke
- Department of Preventive Medicine, Division of Epidemiology, The University of Tennessee Health Science Center, Memphis, TN, USA
| | - Graham Casey
- Department of Public Health Science, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Anselm J Hennis
- Department of Family, Population and Preventive Medicine, Stony Brook University, Stony Brook, NY, USA
| | | | - Ian M Thompson
- CHRISTUS Santa Rosa Medical Center Hospital, San Antonio, TX, USA
| | - Robin Leach
- Department of Urology, Cancer Therapy and Research Center, University of Texas Health Science Center at San Antonio, San Antonio, TX, USA
| | - Douglas F Easton
- Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Strangeways Research Laboratory, Cambridge, UK
| | - Michael H Preuss
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ruth J Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Susan M Gundell
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Peggy Wan
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - James L Mohler
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA; Department of Urology, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Elizabeth T Fontham
- School of Public Health, Louisiana State University Health Sciences Center, New Orleans, LA, USA
| | - Gary J Smith
- Department of Urology, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Jack A Taylor
- Epidemiology Branch, National Institute of Environmental Health Sciences, Research Triangle Park, NC, USA; Laboratory of Molecular Carcinogenesis, National Institute of Environmental Health Sciences, Research Triangle Park, NC, USA
| | - Shiv Srivastava
- Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC, USA
| | - Rosaline A Eeles
- The Institute of Cancer Research, London, UK; Royal Marsden NHS Foundation Trust, London, UK
| | - John D Carpten
- Department of Translational Genomics, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Adam S Kibel
- Department of Urology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Luc Multigner
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), Rennes, France
| | - Marie-Élise Parent
- Epidemiology and Biostatistics Unit, Centre Armand-Frappier Santé Biotechnologie, Institut national de la recherche scientifique, Laval, QC, Canada
| | - Florence Menegaux
- Cancer & Environment Group, Center for Research in Epidemiology and Population Health (CESP), INSERM, University Paris-Sud, University Paris-Saclay, Villejuif Cédex, France; Paris-Sud University, Villejuif Cédex, France
| | - Geraldine Cancel-Tassin
- Department of Urology and Predictive Onco-Urology Group, Sorbonne Université, GRC 5 Predictive Onco-Urology, APHP-Sorbonne Université, Paris, France; CeRePP, Tenon Hospital, Paris, France
| | - Eric A Klein
- Cleveland Clinic Lerner Research Institute, Cleveland, OH, USA
| | - Caroline Andrews
- Harvard TH Chan School of Public Health and Division of Population Sciences, Dana Farber Cancer Institute, Boston, MA, USA; Glickman Urological & Kidney Institute, Cleveland, OH, USA
| | - Timothy R Rebbeck
- Harvard TH Chan School of Public Health and Division of Population Sciences, Dana Farber Cancer Institute, Boston, MA, USA
| | - Laurent Brureau
- CHU de Pointe-à-Pitre, Univ Antilles, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), Pointe-à-Pitre, Guadeloupe, France
| | - Stefan Ambs
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Todd L Edwards
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
| | | | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, Bethesda, MD, USA
| | - John S Witte
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA, USA; Stanford Cancer Institute, Stanford University, Stanford, CA, USA
| | - William J Blot
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; International Epidemiology Institute, Rockville, MD, USA
| | - J Michael Gaziano
- Division of Aging, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; VA Boston Healthcare System, Boston, MA, USA
| | - Amy C Justice
- Yale School of Medicine, New Haven, CT, USA; VA Connecticut Healthcare System, West Haven, CT, USA
| | - David V Conti
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Christopher A Haiman
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
22
|
Lennon NJ, Kottyan LC, Kachulis C, Abul-Husn N, Arias J, Belbin G, Below JE, Berndt S, Chung W, Cimino JJ, Clayton EW, Connolly JJ, Crosslin D, Dikilitas O, Velez Edwards DR, Feng Q, Fisher M, Freimuth R, Ge T, Glessner JT, Gordon A, Guiducci C, Hakonarson H, Harden M, Harr M, Hirschhorn J, Hoggart C, Hsu L, Irvin R, Jarvik GP, Karlson EW, Khan A, Khera A, Kiryluk K, Kullo I, Larkin K, Limdi N, Linder JE, Loos R, Luo Y, Malolepsza E, Manolio T, Martin LJ, McCarthy L, Meigs JB, Mersha TB, Mosley J, Namjou B, Pai N, Pesce LL, Peters U, Peterson J, Prows CA, Puckelwartz MJ, Rehm H, Roden D, Rosenthal EA, Rowley R, Sawicki KT, Schaid D, Schmidlen T, Smit R, Smith J, Smoller JW, Thomas M, Tiwari H, Toledo D, Vaitinadin NS, Veenstra D, Walunas T, Wang Z, Wei WQ, Weng C, Wiesner G, Xianyong Y, Kenny E. Selection, optimization, and validation of ten chronic disease polygenic risk scores for clinical implementation in diverse populations. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.05.25.23290535. [PMID: 37333246 PMCID: PMC10275001 DOI: 10.1101/2023.05.25.23290535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Polygenic risk scores (PRS) have improved in predictive performance supporting their use in clinical practice. Reduced predictive performance of PRS in diverse populations can exacerbate existing health disparities. The NHGRI-funded eMERGE Network is returning a PRS-based genome-informed risk assessment to 25,000 diverse adults and children. We assessed PRS performance, medical actionability, and potential clinical utility for 23 conditions. Standardized metrics were considered in the selection process with additional consideration given to strength of evidence in African and Hispanic populations. Ten conditions were selected with a range of high-risk thresholds: atrial fibrillation, breast cancer, chronic kidney disease, coronary heart disease, hypercholesterolemia, prostate cancer, asthma, type 1 diabetes, obesity, and type 2 diabetes. We developed a pipeline for clinical PRS implementation, used genetic ancestry to calibrate PRS mean and variance, created a framework for regulatory compliance, and developed a PRS clinical report. eMERGE's experience informs the infrastructure needed to implement PRS-based implementation in diverse clinical settings.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Li Hsu
- Fred Hutchinson Cancer Center and University of Washington
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Ulrike Peters
- Fred Hutchinson Cancer Center and University of Washington
| | | | | | | | | | - Dan Roden
- Vanderbilt University Medical Center
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
La Cava WG, Lee PC, Ajmal I, Ding X, Solanki P, Cohen JB, Moore JH, Herman DS. A flexible symbolic regression method for constructing interpretable clinical prediction models. NPJ Digit Med 2023; 6:107. [PMID: 37277550 DOI: 10.1038/s41746-023-00833-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Accepted: 05/05/2023] [Indexed: 06/07/2023] Open
Abstract
Machine learning (ML) models trained for triggering clinical decision support (CDS) are typically either accurate or interpretable but not both. Scaling CDS to the panoply of clinical use cases while mitigating risks to patients will require many ML models be intuitively interpretable for clinicians. To this end, we adapted a symbolic regression method, coined the feature engineering automation tool (FEAT), to train concise and accurate models from high-dimensional electronic health record (EHR) data. We first present an in-depth application of FEAT to classify hypertension, hypertension with unexplained hypokalemia, and apparent treatment-resistant hypertension (aTRH) using EHR data for 1200 subjects receiving longitudinal care in a large healthcare system. FEAT models trained to predict phenotypes adjudicated by chart review had equivalent or higher discriminative performance (p < 0.001) and were at least three times smaller (p < 1 × 10-6) than other potentially interpretable models. For aTRH, FEAT generated a six-feature, highly discriminative (positive predictive value = 0.70, sensitivity = 0.62), and clinically intuitive model. To assess the generalizability of the approach, we tested FEAT on 25 benchmark clinical phenotyping tasks using the MIMIC-III critical care database. Under comparable dimensionality constraints, FEAT's models exhibited higher area under the receiver-operating curve scores than penalized linear models across tasks (p < 6 × 10-6). In summary, FEAT can train EHR prediction models that are both intuitively interpretable and accurate, which should facilitate safe and effective scaling of ML-triggered CDS to the panoply of potential clinical use cases and healthcare practices.
Collapse
Affiliation(s)
- William G La Cava
- Computational Health Informatics Program, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| | - Paul C Lee
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Imran Ajmal
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Xiruo Ding
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Priyanka Solanki
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Jordana B Cohen
- Division of Renal-Electrolyte and Hypertension, Department of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Jason H Moore
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Daniel S Herman
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
24
|
Keloth VK, Banda JM, Gurley M, Heider PM, Kennedy G, Liu H, Liu F, Miller T, Natarajan K, V Patterson O, Peng Y, Raja K, Reeves RM, Rouhizadeh M, Shi J, Wang X, Wang Y, Wei WQ, Williams AE, Zhang R, Belenkaya R, Reich C, Blacketer C, Ryan P, Hripcsak G, Elhadad N, Xu H. Representing and utilizing clinical textual data for real world studies: An OHDSI approach. J Biomed Inform 2023; 142:104343. [PMID: 36935011 PMCID: PMC10428170 DOI: 10.1016/j.jbi.2023.104343] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 01/21/2023] [Accepted: 03/13/2023] [Indexed: 03/19/2023]
Abstract
Clinical documentation in electronic health records contains crucial narratives and details about patients and their care. Natural language processing (NLP) can unlock the information conveyed in clinical notes and reports, and thus plays a critical role in real-world studies. The NLP Working Group at the Observational Health Data Sciences and Informatics (OHDSI) consortium was established to develop methods and tools to promote the use of textual data and NLP in real-world observational studies. In this paper, we describe a framework for representing and utilizing textual data in real-world evidence generation, including representations of information from clinical text in the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), the workflow and tools that were developed to extract, transform and load (ETL) data from clinical notes into tables in OMOP CDM, as well as current applications and specific use cases of the proposed OHDSI NLP solution at large consortia and individual institutions with English textual data. Challenges faced and lessons learned during the process are also discussed to provide valuable insights for researchers who are planning to implement NLP solutions in real-world studies.
Collapse
Affiliation(s)
- Vipina K Keloth
- Section of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA
| | - Juan M Banda
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Michael Gurley
- Lurie Cancer Center, Northwestern University, Chicago, Illinois, USA
| | - Paul M Heider
- Biomedical Informatics Center, Medical University of South Carolina, Charleston, SC, USA
| | - Georgina Kennedy
- Ingham Institute for Applied Medical Research, Sydney, Australia
| | - Hongfang Liu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, USA
| | - Feifan Liu
- Department of Population and Quantitative Health Sciences, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Timothy Miller
- Computational Health Informatics Program, Boston Children's Hospital, and Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Karthik Natarajan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Olga V Patterson
- VA Informatics and Computing Infrastructure, Department of Veterans Affairs Salt Lake City Health Care System, Salt Lake City, Utah, USA; Division of Epidemiology, Department of Internal Medicine, School of Medicine, University of Utah, Salt Lake City, Utah, USA; Verily Life Sciences, Mountain View, CA, USA
| | - Yifan Peng
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
| | - Kalpana Raja
- Section of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA
| | - Ruth M Reeves
- TN Valley Healthcare System, U.S. Department of Veterans Affairs, Nashville, TN, USA; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Masoud Rouhizadeh
- Department of Pharmaceutical Outcomes & Policy, University of Florida, Gainesville, FL, USA; Biomedical Informatics and Data Science, Johns Hopkins University, Baltimore, MD, USA
| | - Jianlin Shi
- VA Informatics and Computing Infrastructure, Department of Veterans Affairs Salt Lake City Health Care System, Salt Lake City, Utah, USA; Division of Epidemiology, Department of Internal Medicine, School of Medicine, University of Utah, Salt Lake City, Utah, USA; Department of Biomedical Informatics, University of Utah, Salt Lake City, USA
| | - Xiaoyan Wang
- Sema4 Mount Sinai Genomics Incorporation, Stamford, CT, USA
| | - Yanshan Wang
- Department of Health Information Management, Department of Biomedical Informatics, and Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | | | - Rui Zhang
- Institute for Health Informatics, and Department of Pharmaceutical Care & Health Systems, University of Minnesota, Minneapolis, MN, USA
| | | | | | - Clair Blacketer
- Janssen Pharmaceutical Research and Development LLC, Titusville, NJ, USA; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Patrick Ryan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA; Janssen Pharmaceutical Research and Development LLC, Titusville, NJ, USA
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Noémie Elhadad
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.
| | - Hua Xu
- Section of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA.
| |
Collapse
|
25
|
Song J, Forrest N, Gordon A, Kottyan L, Mittendorf KF, Wei WQ, Ramsey-Goldman R, Walunas T, Kho A. Utilization of electronic health record data to evaluate the association of urban environment with systemic lupus erythematosus symptoms. Rheumatology (Oxford) 2023; 62:e180-e181. [PMID: 36383166 PMCID: PMC10234196 DOI: 10.1093/rheumatology/keac647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 11/03/2022] [Accepted: 11/09/2022] [Indexed: 11/17/2022] Open
Affiliation(s)
- Janet Song
- Center for Health Information Partnerships, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Division of General Internal Medicine and Geriatrics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Noah Forrest
- Center for Health Information Partnerships, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Division of General Internal Medicine and Geriatrics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Adam Gordon
- Department of Pharmacology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Leah Kottyan
- Center for Autoimmune Genomics and Etiology, Division of Human Genetics, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Kathleen F Mittendorf
- Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Rosalind Ramsey-Goldman
- Division of Rheumatology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Theresa Walunas
- Center for Health Information Partnerships, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Division of General Internal Medicine and Geriatrics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Abel Kho
- Center for Health Information Partnerships, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Division of General Internal Medicine and Geriatrics, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| |
Collapse
|
26
|
Zhang HG, Honerlaw JP, Maripuri M, Samayamuthu MJ, Beaulieu-Jones BR, Baig HS, L'Yi S, Ho YL, Morris M, Panickan VA, Wang X, Weber GM, Liao KP, Visweswaran S, Tan BWQ, Yuan W, Gehlenborg N, Muralidhar S, Ramoni RB, Kohane IS, Xia Z, Cho K, Cai T, Brat GA. Potential pitfalls in the use of real-world data for studying long COVID. Nat Med 2023; 29:1040-1043. [PMID: 37055567 PMCID: PMC10205658 DOI: 10.1038/s41591-023-02274-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/15/2023]
Affiliation(s)
- Harrison G Zhang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
- Division of Rheumatology, Inflammation, and Immunity, Brigham and Women's Hospital, Boston, MA, USA
| | - Jacqueline P Honerlaw
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
| | - Monika Maripuri
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
| | | | | | - Huma S Baig
- Department of Surgery, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Sehi L'Yi
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Yuk-Lam Ho
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
| | - Michele Morris
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA
| | | | - Xuan Wang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Griffin M Weber
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Katherine P Liao
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
- Division of Rheumatology, Inflammation, and Immunity, Brigham and Women's Hospital, Boston, MA, USA
| | - Shyam Visweswaran
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA
| | - Bryce W Q Tan
- Department of Medicine, National University Hospital, Singapore, Singapore, Singapore
| | - William Yuan
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Nils Gehlenborg
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Sumitra Muralidhar
- Office of Research and Development, US Department of Veterans Affairs, Washington DC, USA
| | - Rachel B Ramoni
- Office of Research and Development, US Department of Veterans Affairs, Washington DC, USA
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Zongqi Xia
- Department of Neurology, University of Pittsburgh, Pittsburgh, PA, USA
| | - Kelly Cho
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
| | - Tianxi Cai
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), VA Boston Healthcare System, Boston, MA, USA
| | - Gabriel A Brat
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
27
|
Sun TY, Bhave SA, Altosaar J, Elhadad N. Assessing Phenotype Definitions for Algorithmic Fairness. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2023; 2022:1032-1041. [PMID: 37128361 PMCID: PMC10148336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Phenotyping is a core, routine activity in observational health research. Cohorts impact downstream analyses, such as how a condition is characterized, how patient risk is defined, and what treatments are studied. It is thus critical to ensure that cohorts are representative of all patients, independently of their demographics or social determinants of health. In this paper, we propose a set of best practices to assess the fairness of phenotype definitions. We leverage established fairness metrics commonly used in predictive models and relate them to commonly used epidemiological metrics. We describe an empirical study for Crohn's disease and diabetes type 2, each with multiple phenotype definitions taken from the literature across gender and race. We show that the different phenotype definitions exhibit widely varying and disparate performance according to the different fairness metrics and subgroups. We hope that the proposed best practices can help in constructing fair and inclusive phenotype definitions.
Collapse
|
28
|
Linder JE, Allworth A, Bland HT, Caraballo PJ, Chisholm RL, Clayton EW, Crosslin DR, Dikilitas O, DiVietro A, Esplin ED, Forman S, Freimuth RR, Gordon AS, Green R, Harden MV, Holm IA, Jarvik GP, Karlson EW, Labrecque S, Lennon NJ, Limdi NA, Mittendorf KF, Murphy SN, Orlando L, Prows CA, Rasmussen LV, Rasmussen-Torvik L, Rowley R, Sawicki KT, Schmidlen T, Terek S, Veenstra D, Velez Edwards DR, Absher D, Abul-Husn NS, Alsip J, Bangash H, Beasley M, Below JE, Berner ES, Booth J, Chung WK, Cimino JJ, Connolly J, Davis P, Devine B, Fullerton SM, Guiducci C, Habrat ML, Hain H, Hakonarson H, Harr M, Haverfield E, Hernandez V, Hoell C, Horike-Pyne M, Hripcsak G, Irvin MR, Kachulis C, Karavite D, Kenny EE, Khan A, Kiryluk K, Korf B, Kottyan L, Kullo IJ, Larkin K, Liu C, Malolepsza E, Manolio TA, May T, McNally EM, Mentch F, Miller A, Mooney SD, Murali P, Mutai B, Muthu N, Namjou B, Perez EF, Puckelwartz MJ, Rakhra-Burris T, Roden DM, Rosenthal EA, Saadatagah S, Sabatello M, Schaid DJ, Schultz B, Seabolt L, Shaibi GQ, Sharp RR, Shirts B, Smith ME, Smoller JW, Sterling R, Suckiel SA, Thayer J, Tiwari HK, Trinidad SB, Walunas T, Wei WQ, Wells QS, Weng C, Wiesner GL, Wiley K, Peterson JF. Returning integrated genomic risk and clinical recommendations: The eMERGE study. Genet Med 2023; 25:100006. [PMID: 36621880 PMCID: PMC10085845 DOI: 10.1016/j.gim.2023.100006] [Citation(s) in RCA: 32] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 12/16/2022] [Accepted: 12/21/2022] [Indexed: 01/09/2023] Open
Abstract
PURPOSE Assessing the risk of common, complex diseases requires consideration of clinical risk factors as well as monogenic and polygenic risks, which in turn may be reflected in family history. Returning risks to individuals and providers may influence preventive care or use of prophylactic therapies for those individuals at high genetic risk. METHODS To enable integrated genetic risk assessment, the eMERGE (electronic MEdical Records and GEnomics) network is enrolling 25,000 diverse individuals in a prospective cohort study across 10 sites. The network developed methods to return cross-ancestry polygenic risk scores, monogenic risks, family history, and clinical risk assessments via a genome-informed risk assessment (GIRA) report and will assess uptake of care recommendations after return of results. RESULTS GIRAs include summary care recommendations for 11 conditions, education pages, and clinical laboratory reports. The return of high-risk GIRA to individuals and providers includes guidelines for care and lifestyle recommendations. Assembling the GIRA required infrastructure and workflows for ingesting and presenting content from multiple sources. Recruitment began in February 2022. CONCLUSION Return of a novel report for communicating monogenic, polygenic, and family history-based risk factors will inform the benefits of integrated genetic risk assessment for routine health care.
Collapse
Affiliation(s)
- Jodell E Linder
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN
| | - Aimee Allworth
- Division of Medical Genetics, Department of Medicine, University of Washington Medical Center, Seattle, WA
| | - Harris T Bland
- Department of Biomedical Informatics and Department of Medicine, Vanderbilt University Medical Center, Nashville, TN
| | - Pedro J Caraballo
- Department of Internal Medicine and Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN
| | - Rex L Chisholm
- Center for Genetic Medicine, Northwestern University, Chicago, IL
| | - Ellen Wright Clayton
- Center for Biomedical Ethics and Society, Vanderbilt University Medical Center, Nashville, TN
| | - David R Crosslin
- Division of Biomedical Informatics and Genomics, John W. Deming Department of Medicine, Tulane University School of Medicine, New Orleans, LA
| | - Ozan Dikilitas
- Mayo Clinician Investigator Training Program, Department of Internal Medicine and Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Alanna DiVietro
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN
| | | | - Sophie Forman
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN
| | - Robert R Freimuth
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
| | - Adam S Gordon
- Department of Pharmacology, Feinberg School of Medicine, and Center for Genetic Medicine, Northwestern University, Chicago, IL
| | - Richard Green
- Department of Biomedical Informatics and Medical Education, University of Washington Medical Center, Seattle, WA
| | | | - Ingrid A Holm
- Division of Genetics and Genomics and Manton Center for Orphan Diseases Research, Boston Children's Hospital, Department of Pediatrics, Harvard Medical School, Boston, MA
| | - Gail P Jarvik
- Division of Medical Genetics, Department of Medicine and Department of Genome Science, University of Washington Medical Center, Seattle, WA
| | - Elizabeth W Karlson
- Division of Rheumatology, Inflammation and Immunity, Department of Medicine, Brigham and Women's Hospital, Boston, MA
| | - Sofia Labrecque
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN
| | | | - Nita A Limdi
- Department of Neurology, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL
| | - Kathleen F Mittendorf
- Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN
| | - Shawn N Murphy
- Department of Neurology, Massachusetts General Hospital, Boston, MA
| | - Lori Orlando
- Center for Applied Genomics and Precision Medicine, Duke University, Durham, NC
| | - Cynthia A Prows
- Divisions of Human Genetics and Patient Services, Cincinnati Children's Hospital Medical Center, Cincinnati, OH
| | - Luke V Rasmussen
- Department of Preventive Medicine, Northwestern University, Chicago, IL
| | | | - Robb Rowley
- Division of Genomic Medicine, National Human Genome Research Institute, Bethesda, MD
| | - Konrad Teodor Sawicki
- Department of Cardiology and Center for Genetic Medicine, Northwestern University, Chicago, IL
| | | | - Shannon Terek
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - David Veenstra
- School of Pharmacy, University of Washington, Seattle, WA
| | - Digna R Velez Edwards
- Division of Quantitative Science, Department of Obstetrics and Gynecology, Department of Biomedical Sciences, Vanderbilt University Medical Center, Nashville, TN
| | | | - Noura S Abul-Husn
- Institute for Genomic Health, Department of Medicine, Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
| | | | - Hana Bangash
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Mark Beasley
- Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL
| | - Jennifer E Below
- Division of Genetic Medicine, Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN
| | - Eta S Berner
- Department of Health Services Administration, University of Alabama at Birmingham, Birmingham, AL
| | - James Booth
- Department of Emergency Medicine, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL
| | - Wendy K Chung
- Departments of Pediatrics and Medicine, Columbia University Irving Medical Center, Columbia University, New York, NY
| | - James J Cimino
- Division of General Internal Medicine and the Informatics Institute, Department of Medicine, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL
| | - John Connolly
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Patrick Davis
- Department of Biomedical Informatics and Medical Education, University of Washington Medical Center, Seattle, WA
| | - Beth Devine
- School of Pharmacy, University of Washington, Seattle, WA
| | - Stephanie M Fullerton
- Department of Bioethics and Humanities, University of Washington School of Medicine, Seattle, WA
| | | | - Melissa L Habrat
- Department of Biomedical Informatics and Medical Education, University of Washington Medical Center, Seattle, WA
| | - Heather Hain
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Margaret Harr
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA
| | | | | | - Christin Hoell
- Department of Obstetrics & Gynecology and Center for Genetic Medicine, Northwestern University, Chicago, IL
| | - Martha Horike-Pyne
- Division of Medical Genetics, Department of Medicine, University of Washington Medical Center, Seattle, WA
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University Irving Medical Center, Columbia University, New York, NY
| | - Marguerite R Irvin
- Department of Epidemiology, School of Public Health, University of Alabama at Birmingham, Birmingham, AL
| | | | - Dean Karavite
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Eimear E Kenny
- Institute for Genomic Health, Department of Medicine, Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Atlas Khan
- Division of Nephrology, Department of Medicine, Columbia University Irving Medical Center, New York, NY
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Columbia University Irving Medical Center, New York, NY
| | - Bruce Korf
- Department of Genetics, Heersink School of Medicine, University of Alabama at Birmingham, Birmingham, AL
| | - Leah Kottyan
- The Center for Autoimmune Genomics and Etiology, Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Katie Larkin
- Broad Institute of MIT and Harvard, Cambridge, MA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University Irving Medical Center, Columbia University, New York, NY
| | | | - Teri A Manolio
- Division of Genomic Medicine, National Human Genome Research Institute, Bethesda, MD
| | - Thomas May
- Elson S. Floyd College of Medicine, Washington State University, Vancouver, WA
| | | | - Frank Mentch
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Alexandra Miller
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Sean D Mooney
- Department of Biomedical Informatics and Medical Education, University of Washington Medical Center, Seattle, WA
| | - Priyanka Murali
- Division of Medical Genetics, Department of Medicine, University of Washington Medical Center, Seattle, WA
| | - Brenda Mutai
- Division of Medical Genetics, Department of Medicine, University of Washington Medical Center, Seattle, WA
| | - Naveen Muthu
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Bahram Namjou
- The Center for Autoimmune Genomics and Etiology, Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH
| | - Emma F Perez
- Department of Medicine, Brigham and Women's Hospital, Mass General Brigham Personalized Medicine, Boston, MA
| | - Megan J Puckelwartz
- Department of Pharmacology, Feinberg School of Medicine, and Center for Genetic Medicine, Northwestern University, Chicago, IL
| | | | - Dan M Roden
- Departments of Medicine, Pharmacology, and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
| | - Elisabeth A Rosenthal
- Division of Medical Genetics, Department of Medicine, University of Washington Medical Center, Seattle, WA
| | | | - Maya Sabatello
- Division of Nephrology, Department of Medicine & Division of Ethics, Department of Medical Humanities and Ethics, Columbia University Irving Medical Center, New York, NY
| | - Dan J Schaid
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN
| | - Baergen Schultz
- Division of Genomic Medicine, National Human Genome Research Institute, Bethesda, MD
| | - Lynn Seabolt
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN
| | - Gabriel Q Shaibi
- Center for Health Promotion and Disease Prevention, Arizona State University, Phoenix, AZ
| | - Richard R Sharp
- Biomedical Ethics Program, Department of Quantitative Health Science, Mayo Clinic, Rochester, MN
| | - Brian Shirts
- Department of Laboratory Medicine & Pathology, University of Washington Medical Center, Seattle, WA
| | - Maureen E Smith
- Department of Cardiology and Center for Genetic Medicine, Northwestern University, Chicago, IL
| | - Jordan W Smoller
- Department of Psychiatry and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
| | - Rene Sterling
- Division of Genomics and Society, National Human Genome Research Institute, Bethesda, MD
| | - Sabrina A Suckiel
- The Institute for Genomic Health, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Jeritt Thayer
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Hemant K Tiwari
- Department of Biostatistics, School of Public Health, University of Alabama at Birmingham, Birmingham, AL
| | - Susan B Trinidad
- Department of Bioethics and Humanities, University of Washington School of Medicine, Seattle, WA
| | - Theresa Walunas
- Department of Medicine and Center for Health Information Partnerships, Northwestern University, Chicago, IL
| | - Wei-Qi Wei
- Department of Biomedical Informatics and Department of Medicine, Vanderbilt University Medical Center, Nashville, TN
| | - Quinn S Wells
- Division of Cardiovascular Medicine, Department of Medicine, Vanderbilt University School of Medicine, Nashville, TN
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University Irving Medical Center, Columbia University, New York, NY
| | - Georgia L Wiesner
- Division of Genetic Medicine, Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN
| | - Ken Wiley
- Division of Genomic Medicine, National Human Genome Research Institute, Bethesda, MD
| | - Josh F Peterson
- Center for Precision Medicine, Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN.
| |
Collapse
|
29
|
Verbitsky M, Krishnamurthy S, Krithivasan P, Hughes D, Khan A, Marasà M, Vena N, Khosla P, Zhang J, Lim TY, Glessner JT, Weng C, Shang N, Shen Y, Hripcsak G, Hakonarson H, Ionita-Laza I, Levy B, Kenny EE, Loos RJ, Kiryluk K, Sanna-Cherchi S, Crosslin DR, Furth S, Warady BA, Igo RP, Iyengar SK, Wong CS, Parsa A, Feldman HI, Gharavi AG. Genomic Disorders in CKD across the Lifespan. J Am Soc Nephrol 2023; 34:607-618. [PMID: 36302597 PMCID: PMC10103259 DOI: 10.1681/asn.2022060725] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 09/15/2022] [Indexed: 01/24/2023] Open
Abstract
SIGNIFICANCE STATEMENT Pathogenic structural genetic variants, also known as genomic disorders, have been associated with pediatric CKD. This study extends those results across the lifespan, with genomic disorders enriched in both pediatric and adult patients compared with controls. In the Chronic Renal Insufficiency Cohort study, genomic disorders were also associated with lower serum Mg, lower educational performance, and a higher risk of death. A phenome-wide association study confirmed the link between kidney disease and genomic disorders in an unbiased way. Systematic detection of genomic disorders can provide a molecular diagnosis and refine prediction of risk and prognosis. BACKGROUND Genomic disorders (GDs) are associated with many comorbid outcomes, including CKD. Identification of GDs has diagnostic utility. METHODS We examined the prevalence of GDs among participants in the Chronic Kidney Disease in Children (CKiD) cohort II ( n =248), Chronic Renal Insufficiency Cohort (CRIC) study ( n =3375), Columbia University CKD Biobank (CU-CKD; n =1986), and the Family Investigation of Nephropathy and Diabetes (FIND; n =1318) compared with 30,746 controls. We also performed a phenome-wide association analysis (PheWAS) of GDs in the electronic MEdical Records and GEnomics (eMERGE; n =11,146) cohort. RESULTS We found nine out of 248 (3.6%) CKiD II participants carried a GD, replicating prior findings in pediatric CKD. We also identified GDs in 72 out of 6679 (1.1%) adult patients with CKD in the CRIC, CU-CKD, and FIND cohorts, compared with 199 out of 30,746 (0.65%) GDs in controls (OR, 1.7; 95% CI, 1.3 to 2.2). Among adults with CKD, we found recurrent GDs at the 1q21.1, 16p11.2, 17q12, and 22q11.2 loci. The 17q12 GD (diagnostic of renal cyst and diabetes syndrome) was most frequent, present in 1:252 patients with CKD and diabetes. In the PheWAS, dialysis and neuropsychiatric phenotypes were the top associations with GDs. In CRIC participants, GDs were associated with lower serum magnesium, lower educational achievement, and higher mortality risk. CONCLUSION Undiagnosed GDs are detected both in children and adults with CKD. Identification of GDs in these patients can enable a precise genetic diagnosis, inform prognosis, and help stratify risk in clinical studies. GDs could also provide a molecular explanation for nephropathy and comorbidities, such as poorer neurocognition for a subset of patients.
Collapse
Affiliation(s)
- Miguel Verbitsky
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | | | - Priya Krithivasan
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Daniel Hughes
- Institute for Genomic Medicine, Columbia University Medical Center, New York, New York
| | - Atlas Khan
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Maddalena Marasà
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Natalie Vena
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Pavan Khosla
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Junying Zhang
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Tze Y. Lim
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Joseph T. Glessner
- Center for Applied Genomics and Department of Pediatrics, Perelman School of Medicine, Philadelphia, Pennsylvania
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, New York
| | - Ning Shang
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
- Department of Biomedical Informatics, Columbia University, New York, New York
| | - Yufeng Shen
- Department of Systems Biology and Columbia Genome Center, Columbia University, New York, New York
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University, New York, New York
| | - Hakon Hakonarson
- Center for Applied Genomics and Department of Pediatrics, Perelman School of Medicine, Philadelphia, Pennsylvania
| | | | - Brynn Levy
- Department of Pathology and Cell Biology, Columbia University, New York, New York
| | - Eimear E. Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, New York
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York
| | - Ruth J.F. Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark
| | - Krzysztof Kiryluk
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - Simone Sanna-Cherchi
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| | - David R. Crosslin
- Division of Biomedical Informatics and Genomics, Tulane University School of Medicine, New Orleans, Louisiana
| | - Susan Furth
- Departments of Pediatrics and Epidemiology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, Pennsylvania
| | - Bradley A. Warady
- Department of Pediatrics, University of Missouri-Kansas City School of Medicine, Kansas City, Missouri
| | - Robert P. Igo
- Department of Population and Quantitative Health Sciences, Case Western Reserve University and Louis Stoke, Cleveland, Ohio
| | - Sudha K. Iyengar
- Department of Population and Quantitative Health Sciences, Case Western Reserve University and Louis Stoke, Cleveland, Ohio
| | - Craig S. Wong
- Division of Pediatric Nephrology, University of New Mexico Children’s Hospital, Albuquerque, New Mexico
| | - Afshin Parsa
- Division of Kidney, Urologic, and Hematologic Diseases, National Institute of Diabetes and Digestive and Kidney Diseases, Bethesda, Maryland
| | - Harold I. Feldman
- Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, Philadelphia, Pennsylvania
- Department of Medicine, Perelman School of Medicine, Philadelphia, Pennsylvania
- Center for Clinical Epidemiology and Biostatistics, Perelman School of Medicine, Philadelphia, Pennsylvania
| | - Ali G. Gharavi
- Division of Nephrology, Department of Medicine, Columbia University, New York, New York
| |
Collapse
|
30
|
Brandt PS, Kho A, Luo Y, Pacheco JA, Walunas TL, Hakonarson H, Hripcsak G, Liu C, Shang N, Weng C, Walton N, Carrell DS, Crane PK, Larson EB, Chute CG, Kullo IJ, Carroll R, Denny J, Ramirez A, Wei WQ, Pathak J, Wiley LK, Richesson R, Starren JB, Rasmussen LV. Characterizing variability of electronic health record-driven phenotype definitions. J Am Med Inform Assoc 2023; 30:427-437. [PMID: 36474423 PMCID: PMC9933077 DOI: 10.1093/jamia/ocac235] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 10/19/2022] [Accepted: 11/23/2022] [Indexed: 12/12/2022] Open
Abstract
OBJECTIVE The aim of this study was to analyze a publicly available sample of rule-based phenotype definitions to characterize and evaluate the variability of logical constructs used. MATERIALS AND METHODS A sample of 33 preexisting phenotype definitions used in research that are represented using Fast Healthcare Interoperability Resources and Clinical Quality Language (CQL) was analyzed using automated analysis of the computable representation of the CQL libraries. RESULTS Most of the phenotype definitions include narrative descriptions and flowcharts, while few provide pseudocode or executable artifacts. Most use 4 or fewer medical terminologies. The number of codes used ranges from 5 to 6865, and value sets from 1 to 19. We found that the most common expressions used were literal, data, and logical expressions. Aggregate and arithmetic expressions are the least common. Expression depth ranges from 4 to 27. DISCUSSION Despite the range of conditions, we found that all of the phenotype definitions consisted of logical criteria, representing both clinical and operational logic, and tabular data, consisting of codes from standard terminologies and keywords for natural language processing. The total number and variety of expressions are low, which may be to simplify implementation, or authors may limit complexity due to data availability constraints. CONCLUSIONS The phenotype definitions analyzed show significant variation in specific logical, arithmetic, and other operators but are all composed of the same high-level components, namely tabular data and logical expressions. A standard representation for phenotype definitions should support these formats and be modular to support localization and shared logic.
Collapse
Affiliation(s)
- Pascal S Brandt
- Department of Biomedical and Medical Education, University of Washington, Seattle, Washington, USA
| | - Abel Kho
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| | - Yuan Luo
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| | - Jennifer A Pacheco
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| | - Theresa L Walunas
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University, New York, New York, USA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University, New York, New York, USA
| | - Ning Shang
- Department of Biomedical Informatics, Columbia University, New York, New York, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, New York, USA
| | - Nephi Walton
- Intermountain Precision Genomics, Intermountain Healthcare, St George, Utah, USA
| | - David S Carrell
- Kaiser Permanente Washington Health Research Institute, Seattle, Washington, USA
| | - Paul K Crane
- Department of Medicine, University of Washington, Seattle, Washington, USA
| | - Eric B Larson
- Department of Medicine, University of Washington, Seattle, Washington, USA
- Department of Health Services, University of Washington, Seattle, Washington, USA
| | - Christopher G Chute
- Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, Maryland, USA
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Robert Carroll
- Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Josh Denny
- All of Us Research Program, National Institutes of Health, Bethesda, Maryland, USA
| | - Andrea Ramirez
- Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Jyoti Pathak
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Laura K Wiley
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, Colorado, USA
| | - Rachel Richesson
- Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, Michigan, USA
| | - Justin B Starren
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| | - Luke V Rasmussen
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| |
Collapse
|
31
|
Broadaway KA, Yin X, Williamson A, Parsons VA, Wilson EP, Moxley AH, Vadlamudi S, Varshney A, Jackson AU, Ahuja V, Bornstein SR, Corbin LJ, Delgado GE, Dwivedi OP, Fernandes Silva L, Frayling TM, Grallert H, Gustafsson S, Hakaste L, Hammar U, Herder C, Herrmann S, Højlund K, Hughes DA, Kleber ME, Lindgren CM, Liu CT, Luan J, Malmberg A, Moissl AP, Morris AP, Perakakis N, Peters A, Petrie JR, Roden M, Schwarz PEH, Sharma S, Silveira A, Strawbridge RJ, Tuomi T, Wood AR, Wu P, Zethelius B, Baldassarre D, Eriksson JG, Fall T, Florez JC, Fritsche A, Gigante B, Hamsten A, Kajantie E, Laakso M, Lahti J, Lawlor DA, Lind L, März W, Meigs JB, Sundström J, Timpson NJ, Wagner R, Walker M, Wareham NJ, Watkins H, Barroso I, O'Rahilly S, Grarup N, Parker SC, Boehnke M, Langenberg C, Wheeler E, Mohlke KL. Loci for insulin processing and secretion provide insight into type 2 diabetes risk. Am J Hum Genet 2023; 110:284-299. [PMID: 36693378 PMCID: PMC9943750 DOI: 10.1016/j.ajhg.2023.01.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 01/03/2023] [Indexed: 01/25/2023] Open
Abstract
Insulin secretion is critical for glucose homeostasis, and increased levels of the precursor proinsulin relative to insulin indicate pancreatic islet beta-cell stress and insufficient insulin secretory capacity in the setting of insulin resistance. We conducted meta-analyses of genome-wide association results for fasting proinsulin from 16 European-ancestry studies in 45,861 individuals. We found 36 independent signals at 30 loci (p value < 5 × 10-8), which validated 12 previously reported loci for proinsulin and ten additional loci previously identified for another glycemic trait. Half of the alleles associated with higher proinsulin showed higher rather than lower effects on glucose levels, corresponding to different mechanisms. Proinsulin loci included genes that affect prohormone convertases, beta-cell dysfunction, vesicle trafficking, beta-cell transcriptional regulation, and lysosomes/autophagy processes. We colocalized 11 proinsulin signals with islet expression quantitative trait locus (eQTL) data, suggesting candidate genes, including ARSG, WIPI1, SLC7A14, and SIX3. The NKX6-3/ANK1 proinsulin signal colocalized with a T2D signal and an adipose ANK1 eQTL signal but not the islet NKX6-3 eQTL. Signals were enriched for islet enhancers, and we showed a plausible islet regulatory mechanism for the lead signal in the MADD locus. These results show how detailed genetic studies of an intermediate phenotype can elucidate mechanisms that may predispose one to disease.
Collapse
Affiliation(s)
- K Alaine Broadaway
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | - Xianyong Yin
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Alice Williamson
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge School of Clinical Medicine, Cambridge, UK; University of Cambridge Metabolic Research Laboratories, Wellcome Trust-MRC Institute of Metabolic Science, Department of Clinical Biochemistry, University of Cambridge, Cambridge, UK
| | - Victoria A Parsons
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | - Emma P Wilson
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | - Anne H Moxley
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | | | - Arushi Varshney
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Anne U Jackson
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Vasudha Ahuja
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Stefan R Bornstein
- Department of Internal Medicine, Metabolic and Vascular Medicine, MedicCal Faculty Carl Gustav Carus, Dresden, Germany; Helmholtz Zentrum München, Paul Langerhans Institute Dresden, University Hospital and Faculty of Medicine, TU Dresden, Dresden, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - Laura J Corbin
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | | | - Om P Dwivedi
- University of Helsinki, Helsinki, Finland; Folkhälsan Research Center, Helsinki, Finland
| | | | | | - Harald Grallert
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany; Institute of Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - Stefan Gustafsson
- Department of Medical Sciences, Clinical Epidemiology, Uppsala University, Uppsala, Sweden
| | - Liisa Hakaste
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Ulf Hammar
- Department of Medical Sciences, Molecular Epidemiology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Christian Herder
- German Center for Diabetes Research, Neuherberg, Germany; Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Düsseldorf, Germany; Department of Endocrinology and Diabetology, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Sandra Herrmann
- Department of Internal Medicine, Prevention and Care of Diabetes, Medical Faculty Carl Gustav Carus, Dresden, Germany; Helmholtz Zentrum München, Paul Langerhans Institute Dresden, University Hospital and Faculty of Medicine, TU Dresden, Dresden, Germany
| | | | - David A Hughes
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Marcus E Kleber
- Medical Faculty Mannheim, Heidelberg University, Mannheim, BW, Germany; SYNLAB MVZ Humangenetik Mannheim, Mannheim, BW, Germany
| | - Cecilia M Lindgren
- Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK; Nuffield Department of Population Health, University of Oxford, Oxford, UK; Wellcome Trust Centre Human Genetics, University of Oxford, Oxford, UK; Broad Institute, Cambridge, MA, USA
| | - Ching-Ti Liu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Jian'an Luan
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - Anni Malmberg
- Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Angela P Moissl
- Institute of Nutritional Sciences, Friedrich-Schiller-University, Jena, Germany; Competence Cluster for Nutrition and Cardiovascular Health, Halle-Jena-Leipzig, Germany; Medical Faculty Mannheim, Heidelberg University, Mannheim, BW, Germany
| | - Andrew P Morris
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, The University of Manchester, Manchester, UK
| | - Nikolaos Perakakis
- Department of Internal Medicine, Metabolic and Vascular Medicine, MedicCal Faculty Carl Gustav Carus, Dresden, Germany; Helmholtz Zentrum München, Paul Langerhans Institute Dresden, University Hospital and Faculty of Medicine, TU Dresden, Dresden, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - Annette Peters
- Institute of Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - John R Petrie
- School of Health and Wellbeing, University of Glasgow, Glasgow, UK
| | - Michael Roden
- Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Düsseldorf, Germany; Department of Endocrinology and Diabetology, Medical Faculty and University Hospital Düsseldorf, Heinrich Heine University Düsseldorf, Düsseldorf, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - Peter E H Schwarz
- Department of Internal Medicine, Prevention and Care of Diabetes, Medical Faculty Carl Gustav Carus, Dresden, Germany; Helmholtz Zentrum München, Paul Langerhans Institute Dresden, University Hospital and Faculty of Medicine, TU Dresden, Dresden, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - Sapna Sharma
- German Center for Diabetes Research, Neuherberg, Germany; Research Unit of Molecular Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany; Institute of Epidemiology, Helmholtz Zentrum München-German Research Center for Environmental Health, Neuherberg, Germany; Chair of Food Chemistry and Molecular Sensory Science, Technische Universität München, Freising, Germany
| | - Angela Silveira
- Department of Medicine Solna, Division of Cardiovascular Medicine, Karolinska Institutet, Stockholm, Sweden; Oxford Biomedical Research Centre, Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Rona J Strawbridge
- Institute of Health and Wellbeing, Mental Health and Wellbeing, University of Glasgow, Glasgow, UK; Department of Medicine Solna, Division of Cardiovascular Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Tiinamaija Tuomi
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland; Folkhälsan Research Center, Helsinki, Finland; Abdominal Center, Endocrinology, Helsinki University Hospital, Helsinki, Finland
| | - Andrew R Wood
- Genetics of Complex Traits, College of Medicine and Health, University of Exeter, Exeter, UK
| | - Peitao Wu
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Björn Zethelius
- Department of Geriatrics, Uppsala University, Uppsala, Sweden
| | - Damiano Baldassarre
- Department of Medical Biotechnology and Translational Medicine, Università degli Studi di Milano, Milan, Italy; Cardiovascular Prevention Area, Centro Cardiologico Monzino I.R.C.C.S., Milan, Italy
| | - Johan G Eriksson
- Department of General Practice and Primary Health Care, Faculty of Medicine, University of Helsinki, Helsinki, Finland; Folkhälsan Research Centre, Helsinki, Finland; Department of Obstetrics and Gynecology, Yong Loo Lin School of Medicine, National University Singapore, Singapore, Singapore
| | - Tove Fall
- Department of Medical Sciences, Molecular Epidemiology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Jose C Florez
- Diabetes Unit and Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA; Programs in Metabolism and Medical & Population Genetics, Broad Institute, Cambridge, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Andreas Fritsche
- Department of Internal Medicine, Diabetology, Tübingen, Germany; Institute for Diabetes Research and Metabolic Diseases, Helmholtz Center Munich, University of Tübingen, Tübingen, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - Bruna Gigante
- Department of Medicine Solna, Division of Cardiovascular Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Anders Hamsten
- Department of Medicine Solna, Division of Cardiovascular Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Eero Kajantie
- Population Health Unit, Finnish Institute for Health and Welfare, Helsinki, Finland; PEDEGO Research Unit, MRC Oulu, Oulu University Hospital and University of Oulu, Oulu, Finland; Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway; Children's Hospital, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
| | - Markku Laakso
- Institute of Clinical Medicine, University of Eastern Finland, Kuopio, Finland
| | - Jari Lahti
- Department of Psychology and Logopedics, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Deborah A Lawlor
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Lars Lind
- Department of Medical Sciences, Clinical Epidemiology, Uppsala University, Uppsala, Sweden
| | - Winfried März
- Synlab Academy, SYNLAB Holding Deutschland GmbH, Mannheim, BW, Germany; Medical Faculty Mannheim, Heidelberg University, Mannheim, BW, Germany
| | - James B Meigs
- Department of Medicine, Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA; Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
| | - Johan Sundström
- Department of Medical Sciences, Clinical Epidemiology, Uppsala University, Uppsala, Sweden
| | - Nicholas J Timpson
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK; Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Robert Wagner
- Department of Internal Medicine, Diabetology, Tübingen, Germany; Institute for Diabetes Research and Metabolic Diseases, Helmholtz Center Munich, University of Tübingen, Tübingen, Germany; German Center for Diabetes Research, Neuherberg, Germany
| | - Mark Walker
- Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - Nicholas J Wareham
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge School of Clinical Medicine, Cambridge, UK; Health Data Research UK, Gibbs Building, London, UK
| | - Hugh Watkins
- Division of Cardiovascular Medicine, Radcliffe Department of Medicine, University of Oxford, Oxford, UK
| | - Inês Barroso
- Exeter Centre of Excellence for Diabetes Research, Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK
| | - Stephen O'Rahilly
- MRC Metabolic Diseases Unit, Wellcome Trust-Medical Research Council Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Niels Grarup
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Stephen Cj Parker
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA; Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA; Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| | - Michael Boehnke
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA; Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Claudia Langenberg
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge School of Clinical Medicine, Cambridge, UK; Computational Medicine, Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany; Precision Healthcare University Research Institute, Queen Mary University of London, London, UK
| | - Eleanor Wheeler
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge School of Clinical Medicine, Cambridge, UK.
| | - Karen L Mohlke
- Department of Genetics, University of North Carolina, Chapel Hill, NC, USA.
| |
Collapse
|
32
|
Obeng AO, Scott SA, Kaszemacher T, Ellis SB, Mejia A, Gomez A, Nadukuru R, Abul-Husn NS, Vega A, Waite E, Gottesman O, Cho J, Bottinger EP. Prescriber Adoption of SLCO1B1 Genotype-Guided Simvastatin Clinical Decision Support in a Clinical Pharmacogenetics Program. Clin Pharmacol Ther 2023; 113:321-327. [PMID: 36372942 DOI: 10.1002/cpt.2773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 10/08/2022] [Indexed: 11/15/2022]
Abstract
Pharmacogenetic implementation programs are increasingly feasible due to the availability of clinical guidelines for implementation research. The utilization of these resources has been reported with selected drug-gene pairs; however, little is known about how prescribers respond to pharmacogenetic recommendations for statin therapy. We prospectively assessed prescriber interaction with point-of-care clinical decision support (CDS) to guide simvastatin therapy for a diverse cohort of primary care patients enrolled in a clinical pharmacogenetics program. Of the 1,639 preemptively genotyped patients, 298 (18.2%) had an intermediate function (IF) OATP1B1 phenotype and 25 (1.53%) had a poor function (PF) phenotype, predicted by a common single nucleotide variant in the SLCO1B1 gene (c.521T>C; rs4149056). Clinicians were presented with CDS when simvastatin was prescribed for patients with IF or PF through the electronic health record. Importantly, 64.2% of the CDS deployed at the point-of-care was accepted by the prescribers and resulted in prescription changes. Statin intensity was found to significantly influence prescriber adoption of the pharmacogenetic-guided CDS, whereas patient gender or race, prescriber type, or pharmacogenetic training status did not significantly influence adoption. This study demonstrates that primary care providers readily adopt pharmacogenetic information to guide statin therapy for the majority of patients with preemptive genotype data.
Collapse
Affiliation(s)
- Aniwaa Owusu Obeng
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,Pharmacy Department, The Mount Sinai Hospital, New York, New York, USA.,Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Stuart A Scott
- Department of Pathology, Stanford University, Stanford, California, USA.,Clinical Genomics Laboratory, Stanford Health Care, Palo Alto, California, USA
| | - Tom Kaszemacher
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Stephen B Ellis
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Ana Mejia
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Alanna Gomez
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Rajiv Nadukuru
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Noura S Abul-Husn
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,Division of Genomic Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,23andMe Inc., Sunnyvale, California, USA
| | - Aida Vega
- Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,Mount Sinai Faculty Practice Associates, Primary Care Program, The Mount Sinai Health system, New York, New York, USA
| | - Eva Waite
- Mount Sinai Faculty Practice Associates, Primary Care Program, The Mount Sinai Health system, New York, New York, USA
| | - Omri Gottesman
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,Empirico Inc., San Diego, California, USA
| | - Judy Cho
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,Department of Pathology, Molecular and Cell-Based Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Erwin P Bottinger
- Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, New York, USA.,Digital Health Center, Hasso Plattner Institute, University of Potsdam, Potsdam, Germany
| |
Collapse
|
33
|
Abstract
Polygenic scores quantify inherited risk by integrating information from many common sites of DNA variation into a single number. Rapid increases in the scale of genetic association studies and new statistical algorithms have enabled development of polygenic scores that meaningfully measure-as early as birth-risk of coronary artery disease. These newer-generation polygenic scores identify up to 8% of the population with triple the normal risk based on genetic variation alone, and these individuals cannot be identified on the basis of family history or clinical risk factors alone. For those identified with increased genetic risk, evidence supports risk reduction with at least two interventions, adherence to a healthy lifestyle and cholesterol-lowering therapies, that can substantially reduce risk. Alongside considerable enthusiasm for the potential of polygenic risk estimation to enable a new era of preventive clinical medicine is recognition of a need for ongoing research into how best to ensure equitable performance across diverse ancestries, how and in whom to assess the scores in clinical practice, as well as randomized trials to confirm clinical utility.
Collapse
Affiliation(s)
- Aniruddh P Patel
- Division of Cardiology and Center for Genomic Medicine, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA; , .,Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA
| | - Amit V Khera
- Division of Cardiology and Center for Genomic Medicine, Department of Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA; , .,Cardiovascular Disease Initiative, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Department of Medicine, Harvard Medical School, Boston, Massachusetts, USA.,Verve Therapeutics, Cambridge, Massachusetts, USA
| |
Collapse
|
34
|
Yang S, Varghese P, Stephenson E, Tu K, Gronsbell J. Machine learning approaches for electronic health records phenotyping: a methodical review. J Am Med Inform Assoc 2023; 30:367-381. [PMID: 36413056 PMCID: PMC9846699 DOI: 10.1093/jamia/ocac216] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 09/27/2022] [Accepted: 10/27/2022] [Indexed: 11/23/2022] Open
Abstract
OBJECTIVE Accurate and rapid phenotyping is a prerequisite to leveraging electronic health records for biomedical research. While early phenotyping relied on rule-based algorithms curated by experts, machine learning (ML) approaches have emerged as an alternative to improve scalability across phenotypes and healthcare settings. This study evaluates ML-based phenotyping with respect to (1) the data sources used, (2) the phenotypes considered, (3) the methods applied, and (4) the reporting and evaluation methods used. MATERIALS AND METHODS We searched PubMed and Web of Science for articles published between 2018 and 2022. After screening 850 articles, we recorded 37 variables on 100 studies. RESULTS Most studies utilized data from a single institution and included information in clinical notes. Although chronic conditions were most commonly considered, ML also enabled the characterization of nuanced phenotypes such as social determinants of health. Supervised deep learning was the most popular ML paradigm, while semi-supervised and weakly supervised learning were applied to expedite algorithm development and unsupervised learning to facilitate phenotype discovery. ML approaches did not uniformly outperform rule-based algorithms, but deep learning offered a marginal improvement over traditional ML for many conditions. DISCUSSION Despite the progress in ML-based phenotyping, most articles focused on binary phenotypes and few articles evaluated external validity or used multi-institution data. Study settings were infrequently reported and analytic code was rarely released. CONCLUSION Continued research in ML-based phenotyping is warranted, with emphasis on characterizing nuanced phenotypes, establishing reporting and evaluation standards, and developing methods to accommodate misclassified phenotypes due to algorithm errors in downstream applications.
Collapse
Affiliation(s)
- Siyue Yang
- Department of Statistical Sciences, University of Toronto, Toronto, Ontario, Canada
| | | | - Ellen Stephenson
- Department of Family & Community Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Karen Tu
- Department of Family & Community Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Jessica Gronsbell
- Department of Statistical Sciences, University of Toronto, Toronto, Ontario, Canada
- Department of Family & Community Medicine, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
35
|
Schuster ALR, Crossnohere NL, Paskett J, Thomas N, Hampel H, Ma Q, Tiner JC, Paskett ED, Bridges JFP. Promoting patient engagement in cancer genomics research programs: An environmental scan. Front Genet 2023; 14:1053613. [PMID: 36741312 PMCID: PMC9889863 DOI: 10.3389/fgene.2023.1053613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/06/2023] [Indexed: 01/19/2023] Open
Abstract
Background: A national priority in the United States is to promote patient engagement in cancer genomics research, especially among diverse and understudied populations. Several cancer genomics research programs have emerged to accomplish this priority, yet questions remain about the meaning and methods of patient engagement. This study explored how cancer genomics research programs define engagement and what strategies they use to engage patients across stages in the conduct of research. Methods: An environmental scan was conducted of cancer genomics research programs focused on patient engagement. Research programs were identified and characterized using materials identified from publicly available sources (e.g., websites), a targeted literature review, and interviews with key informants. Descriptive information about the programs and their definitions of engagement, were synthesized using thematic analysis. The engagement strategies were synthesized and mapped to different stages in the conduct of research, including recruitment, consent, data collection, sharing results, and retention. Results: Ten research programs were identified, examples of which include the Cancer Moonshot Biobank, the MyPART Network, NCI-CONNECT, and the Participant Engagement and Cancer Genome Sequencing (PE-CGS) Network. All programs aimed to include understudied or underrepresented populations. Based on publicly available information, four programs explicitly defined engagement. These definitions similarly characterized engagement as being interpersonal, reciprocal, and continuous. Five general strategies of engagement were identified across the programs: 1) digital (such as websites) and 2) non-digital communications (such as radio broadcasts, or printed brochures); 3) partnering with community organizations; 4) providing incentives; and 5) affiliating with non-academic medical centers. Digital communications were the only strategy used across all stages of the conduct of research. Programs tailored these strategies to their study goals, including overcoming barriers to research participation among diverse populations. Conclusion: Programs studying cancer genomics are deeply committed to increasing research participation among diverse populations through patient engagement. Yet, the field needs to reach a consensus on the meaning of patient engagement, develop a taxonomy of patient engagement measures in cancer genomics research, and identify optimal strategies to engage patients in cancer genomics. Addressing these needs could enable patient engagement to fulfill its potential and accelerate the pace of cancer genomic discoveries.
Collapse
Affiliation(s)
- Anne L. R. Schuster
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States,*Correspondence: Anne L. R. Schuster,
| | - Norah L. Crossnohere
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| | - Jonathan Paskett
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| | - Neena Thomas
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| | - Heather Hampel
- Division of Clinical Cancer Genomics, City of Hope National Medical Center, Duarte, CA, United States,Division of Human Genetics, The Ohio State University Wexner Medical Center, Columbus, OH, United States
| | - Qin Ma
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| | - Jessica C. Tiner
- Epidemiology and Genomics Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD, United States
| | - Electra D. Paskett
- Division of Cancer Prevention and Control, Department of Internal Medicine, College of Medicine, The Ohio State University, Columbus, OH, United States,Division of Epidemiology, College of Public Health, The Ohio State University, Columbus, OH, United States
| | - John F. P. Bridges
- Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States
| |
Collapse
|
36
|
Hellwege JN, Dorn C, Irvin MR, Limdi NA, Cimino J, Beasley TM, Tsao PS, Damrauer SM, Roden DM, Velez Edwards DR, Wei WQ, Edwards TL. Predictive models for abdominal aortic aneurysms using polygenic scores and PheWAS-derived risk factors. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2023; 28:425-436. [PMID: 36540997 PMCID: PMC9782709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Abdominal aortic aneurysms (AAA) are common enlargements of the abdominal aorta which can grow larger until rupture, often leading to death. Detection of AAA is often by ultrasonography and screening recommendations are mostly directed at men over 65 with a smoking history. Recent large-scale genome-wide association studies have identified genetic loci associated with AAA risk. We combined known risk factors, polygenic risk scores (PRS) and precedent clinical diagnoses from electronic health records (EHR) to develop predictive models for AAA, and compared performance against screening recommendations. The PRS included genome-wide summary statistics from the Million Veteran Program and FinnGen (10,467 cases, 378,713 controls of European ancestry), with optimization in Vanderbilt's BioVU and validated in the eMERGE Network, separately across both White and Black participants. Candidate diagnoses were identified through a temporally-oriented Phenome-wide association study in independent EHR data from Vanderbilt, and features were selected via elastic net. We calculated C-statistics in eMERGE for models including PRS, phecodes, and covariates using regression weights from BioVU. The AUC for the full model in the test set was 0.883 (95% CI 0.873-0.892), 0.844 (0.836-0.851) for covariates only, 0.613 (95% CI 0.604-0.622) when using primary USPSTF screening criteria, and 0.632 (95% CI 0.623-0.642) using primary and secondary criteria. Brier scores were between 0.003 and 0.023 for our models indicating good calibration, and net reclassification improvement over combined primary and secondary USPSTF criteria was 0.36-0.60. We provide PRS for AAA which are strongly associated with AAA risk and add to predictive model performance. These models substantially improve identification of people at risk of a AAA diagnosis compared with existing guidelines, with evidence of potential applicability in minority populations.
Collapse
Affiliation(s)
- Jacklyn N Hellwege
- Division of Genetic Medicine, Department of Medicine, Vanderbilt Genetics Institute Vanderbilt University Medical Center 2525 West End Ave. Ste 700, Nashville, TN, 37203, USA,
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Seviiri M, Law MH, Ong JS, Gharahkhani P, Fontanillas P, Olsen CM, Whiteman DC, MacGregor S. A multi-phenotype analysis reveals 19 susceptibility loci for basal cell carcinoma and 15 for squamous cell carcinoma. Nat Commun 2022; 13:7650. [PMID: 36496446 PMCID: PMC9741635 DOI: 10.1038/s41467-022-35345-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 11/29/2022] [Indexed: 12/13/2022] Open
Abstract
Basal cell carcinoma and squamous cell carcinoma are the most common skin cancers, and have genetic overlap with melanoma, pigmentation traits, autoimmune diseases, and blood biochemistry biomarkers. In this multi-trait genetic analysis of over 300,000 participants from Europe, Australia and the United States, we reveal 78 risk loci for basal cell carcinoma (19 previously unknown and replicated) and 69 for squamous cell carcinoma (15 previously unknown and replicated). The previously unknown risk loci are implicated in cancer development and progression (e.g. CDKL1), pigmentation (e.g. TPCN2), cardiometabolic (e.g. FADS2), and immune-regulatory pathways for innate immunity (e.g. IFIH1), and HIV-1 viral load modulation (e.g. CCR5). We also report an optimised polygenic risk score for effective risk stratification for keratinocyte cancer in the Canadian Longitudinal Study of Aging (794 cases and 18139 controls), which could facilitate skin cancer surveillance e.g. in high risk subpopulations such as transplantees.
Collapse
Affiliation(s)
- Mathias Seviiri
- Statistical Genetics Lab, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia.
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, Brisbane, QLD, Australia.
- Center for Genomics and Personalised Health, Queensland University of Technology, Brisbane, QLD, Australia.
| | - Matthew H Law
- Statistical Genetics Lab, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, Brisbane, QLD, Australia
| | - Jue-Sheng Ong
- Statistical Genetics Lab, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Puya Gharahkhani
- Statistical Genetics Lab, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | | | - Catherine M Olsen
- Cancer Control Group, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
- Faculty of Medicine, University of Queensland, Brisbane, QLD, Australia
| | - David C Whiteman
- Faculty of Medicine, University of Queensland, Brisbane, QLD, Australia
| | - Stuart MacGregor
- Statistical Genetics Lab, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, Brisbane, QLD, Australia
| |
Collapse
|
38
|
Cusick M, Velupillai S, Downs J, Campion TR, Sholle ET, Dutta R, Pathak J. Portability of natural language processing methods to detect suicidality from clinical text in US and UK electronic health records. JOURNAL OF AFFECTIVE DISORDERS REPORTS 2022; 10:100430. [PMID: 36644339 PMCID: PMC9835770 DOI: 10.1016/j.jadr.2022.100430] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Background In the global effort to prevent death by suicide, many academic medical institutions are implementing natural language processing (NLP) approaches to detect suicidality from unstructured clinical text in electronic health records (EHRs), with the hope of targeting timely, preventative interventions to individuals most at risk of suicide. Despite the international need, the development of these NLP approaches in EHRs has been largely local and not shared across healthcare systems. Methods In this study, we developed a process to share NLP approaches that were individually developed at King's College London (KCL), UK and Weill Cornell Medicine (WCM), US - two academic medical centers based in different countries with vastly different healthcare systems. We tested and compared the algorithms' performance on manually annotated clinical notes (KCL: n = 4,911 and WCM = 837). Results After a successful technical porting of the NLP approaches, our quantitative evaluation determined that independently developed NLP approaches can detect suicidality at another healthcare organization with a different EHR system, clinical documentation processes, and culture, yet do not achieve the same level of success as at the institution where the NLP algorithm was developed (KCL approach: F1-score 0.85 vs. 0.68, WCM approach: F1-score 0.87 vs. 0.72). Limitations Independent NLP algorithm development and patient cohort selection at the two institutions comprised direct comparability. Conclusions Shared use of these NLP approaches is a critical step forward towards improving data-driven algorithms for early suicide risk identification and timely prevention.
Collapse
Affiliation(s)
- Marika Cusick
- WeiCornell Medicine, 402 E. 67th St., New York, NY 10065, USA, South London and Maudsley NHS Foundation Trust, London, UK, Corresponding author. (M. Cusick)
| | - Sumithra Velupillai
- IoPPN, King’s College London, London, UK, South London and Maudsley NHS Foundation Trust, London, UK
| | - Johnny Downs
- IoPPN, King’s College London, London, UK, South London and Maudsley NHS Foundation Trust, London, UK
| | - Thomas R. Campion
- WeiCornell Medicine, 402 E. 67th St., New York, NY 10065, USA, South London and Maudsley NHS Foundation Trust, London, UK
| | - Evan T. Sholle
- WeiCornell Medicine, 402 E. 67th St., New York, NY 10065, USA, South London and Maudsley NHS Foundation Trust, London, UK
| | - Rina Dutta
- IoPPN, King’s College London, London, UK, South London and Maudsley NHS Foundation Trust, London, UK
| | - Jyotishman Pathak
- WeiCornell Medicine, 402 E. 67th St., New York, NY 10065, USA, South London and Maudsley NHS Foundation Trust, London, UK
| |
Collapse
|
39
|
Zheng HW, Ranganath VK, Perry LC, Chetrit DA, Criner KM, Pham AQ, Seto R, Vangala S, Elashoff DA, Bui AA. Evaluation of an automated phenotyping algorithm for rheumatoid arthritis. J Biomed Inform 2022; 135:104214. [DOI: 10.1016/j.jbi.2022.104214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 09/24/2022] [Accepted: 09/26/2022] [Indexed: 11/16/2022]
|
40
|
Piekos JA, Hellwege JN, Zhang Y, Torstenson ES, Jarvik GP, Dikilitas O, Kullo IJ, Schaid DJ, Crosslin DR, Pendergrass SA, Lee MTM, Roden D, Denny JC, Edwards TL, Velez Edwards DR. Uterine fibroid polygenic risk score (PRS) associates and predicts risk for uterine fibroid. Hum Genet 2022; 141:1739-1748. [PMID: 35226188 PMCID: PMC9420161 DOI: 10.1007/s00439-022-02442-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 02/11/2022] [Indexed: 11/29/2022]
Abstract
Uterine fibroids (UF) are common pelvic tumors in women, heritable, and genome-wide association studies (GWAS) have identified ~ 30 loci associated with increased risk in UF. Using summary statistics from a previously published UF GWAS performed in a non-Hispanic European Ancestry (NHW) female subset from the Electronic Medical Records and Genomics (eMERGE) Network, we constructed a polygenic risk score (PRS) for UF. UF-PRS was developed using PRSice and optimized in the separate clinical population of BioVU. PRS was validated using parallel methods of 10-fold cross-validation logistic regression and phenome-wide association study (PheWAS) in a seperate subset of eMERGE NHW females (validation set), excluding samples used in GWAS. PRSice determined pt < 0.001 and after linkage disequilibrium pruning (r2 < 0.2), 4458 variants were in the PRS which was significant (pseudo-R2 = 0.0018, p = 0.041). 10-fold cross-validation logistic regression modeling of validation set revealed the model had an area under the curve (AUC) value of 0.60 (95% confidence interval [CI] 0.58-0.62) when plotted in a receiver operator curve (ROC). PheWAS identified six phecodes associated with the PRS with the most significant phenotypes being 218 'benign neoplasm of uterus' and 218.1 'uterine leiomyoma' (p = 1.94 × 10-23, OR 1.31 [95% CI 1.26-1.37] and p = 3.50 × 10-23, OR 1.32 [95% CI 1.26-1.37]). We have developed and validated the first PRS for UF. We find our PRS has predictive ability for UF and captures genetic architecture of increased risk for UF that can be used in further studies.
Collapse
Affiliation(s)
- Jacqueline A Piekos
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37203, USA
| | - Jacklyn N Hellwege
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37203, USA
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
| | - Yanfei Zhang
- Genomic Medicine Institute, Geisinger Health Systems, Danville, PA, 17822, USA
| | - Eric S Torstenson
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37203, USA
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
| | - Gail P Jarvik
- Departments of Medicine (Medical Genetics) and Genome Sciences, University of Washington Medical Center, Seattle, WA, 98195, USA
| | - Ozan Dikilitas
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, 55905, USA
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, 55905, USA
| | - Daniel J Schaid
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, 55905, USA
| | - David R Crosslin
- Departments of Medicine (Medical Genetics) and Genome Sciences, University of Washington Medical Center, Seattle, WA, 98195, USA
| | | | - Ming Ta Michael Lee
- Genomic Medicine Institute, Geisinger Health Systems, Danville, PA, 17822, USA
| | - Dan Roden
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37203, USA
- Division of Clinical Pharmacology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
| | - Josh C Denny
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37203, USA
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
| | - Todd L Edwards
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37203, USA
- Division of Epidemiology, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37203, USA
| | - Digna R Velez Edwards
- Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37203, USA.
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, 37203, USA.
- Division of Quantitative Science, Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN, 37203, USA.
| |
Collapse
|
41
|
Thibord F, Klarin D, Brody JA, Chen MH, Levin MG, Chasman DI, Goode EL, Hveem K, Teder-Laving M, Martinez-Perez A, Aïssi D, Daian-Bacq D, Ito K, Natarajan P, Lutsey PL, Nadkarni GN, de Vries PS, Cuellar-Partida G, Wolford BN, Pattee JW, Kooperberg C, Braekkan SK, Li-Gao R, Saut N, Sept C, Germain M, Judy RL, Wiggins KL, Ko D, O’Donnell CJ, Taylor KD, Giulianini F, De Andrade M, Nøst TH, Boland A, Empana JP, Koyama S, Gilliland T, Do R, Huffman JE, Wang X, Zhou W, Soria JM, Souto JC, Pankratz N, Haessler J, Hindberg K, Rosendaal FR, Turman C, Olaso R, Kember RL, Bartz TM, Lynch JA, Heckbert SR, Armasu SM, Brumpton B, Smadja DM, Jouven X, Komuro I, Clapham KR, Loos RJ, Willer CJ, Sabater-Lleal M, Pankow JS, Reiner AP, Morelli VM, Ridker PM, van Hylckama Vlieg A, Deleuze JF, Kraft P, Rader DJ, Lee KM, Psaty BM, Skogholt AH, Emmerich J, Suchon P, Rich SS, Vy HMT, Tang W, Jackson RD, Hansen JB, Morange PE, Kabrhel C, Trégouët DA, Damrauer SM, Johnson AD, Smith NL. Cross-Ancestry Investigation of Venous Thromboembolism Genomic Predictors. Circulation 2022; 146:1225-1242. [PMID: 36154123 PMCID: PMC10152894 DOI: 10.1161/circulationaha.122.059675] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 08/09/2022] [Indexed: 01/24/2023]
Abstract
BACKGROUND Venous thromboembolism (VTE) is a life-threatening vascular event with environmental and genetic determinants. Recent VTE genome-wide association studies (GWAS) meta-analyses involved nearly 30 000 VTE cases and identified up to 40 genetic loci associated with VTE risk, including loci not previously suspected to play a role in hemostasis. The aim of our research was to expand discovery of new genetic loci associated with VTE by using cross-ancestry genomic resources. METHODS We present new cross-ancestry meta-analyzed GWAS results involving up to 81 669 VTE cases from 30 studies, with replication of novel loci in independent populations and loci characterization through in silico genomic interrogations. RESULTS In our genetic discovery effort that included 55 330 participants with VTE (47 822 European, 6320 African, and 1188 Hispanic ancestry), we identified 48 novel associations, of which 34 were replicated after correction for multiple testing. In our combined discovery-replication analysis (81 669 VTE participants) and ancestry-stratified meta-analyses (European, African, and Hispanic), we identified another 44 novel associations, which are new candidate VTE-associated loci requiring replication. In total, across all GWAS meta-analyses, we identified 135 independent genomic loci significantly associated with VTE risk. A genetic risk score of the significantly associated loci in Europeans identified a 6-fold increase in risk for those in the top 1% of scores compared with those with average scores. We also identified 31 novel transcript associations in transcriptome-wide association studies and 8 novel candidate genes with protein quantitative-trait locus Mendelian randomization analyses. In silico interrogations of hemostasis and hematology traits and a large phenome-wide association analysis of the 135 GWAS loci provided insights to biological pathways contributing to VTE, with some loci contributing to VTE through well-characterized coagulation pathways and others providing new data on the role of hematology traits, particularly platelet function. Many of the replicated loci are outside of known or currently hypothesized pathways to thrombosis. CONCLUSIONS Our cross-ancestry GWAS meta-analyses identified new loci associated with VTE. These findings highlight new pathways to thrombosis and provide novel molecules that may be useful in the development of improved antithrombosis treatments.
Collapse
Affiliation(s)
- Florian Thibord
- Population Sciences Branch, Division of Intramural Research, National Heart, Lung and Blood Institute, 73 Mt. Wayte Ave, Suite #2, Framingham, MA, 01702, USA
- The Framingham Heart Study, Boston University and NHLBI, 73 Mt. Wayte Ave, Suite #2, Framingham, MA, 01702, USA
| | - Derek Klarin
- Division of Vascular Surgery, Stanford University School of Medicine, Stanford, CA, 94305, USA
- VA Palo Alto Healthcare System, Palo Alto, CA, 94550, USA
| | - Jennifer A. Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
| | - Ming-Huei Chen
- Population Sciences Branch, Division of Intramural Research, National Heart, Lung and Blood Institute, 73 Mt. Wayte Ave, Suite #2, Framingham, MA, 01702, USA
- The Framingham Heart Study, Boston University and NHLBI, 73 Mt. Wayte Ave, Suite #2, Framingham, MA, 01702, USA
| | - Michael G. Levin
- Division of Cardiovascular Medicine, Department of Medicine, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA, 19104, USA
| | - Daniel I. Chasman
- Division of Preventive Medicine, Brigham and Women’s Hospital, 900 Commonwealth Ave, Boston, MA, 02215, USA
- Harvard Medical School, Boston, MA, 02115, USA
| | - Ellen L. Goode
- Department of Quantitative Health Sciences, Mayo Clinic, 200 First Street SW, Rochester, MN, 55905, USA
| | - Kristian Hveem
- HUNT Research Center, Department of Public Health and Nursing, Norwegian University of Science and Technology, Forskningsvegen 2, Levanger, 7600, Norway
- K.G. Jebsen Centre for Genetic Epidemiology, Department of Public Health and Nursing, Norwegian University of Science and Technology, Håkon Jarls gate 11, Trondheim, 7030, Norway
| | - Maris Teder-Laving
- Institute of Genomics, University of Tartu, Riia 23b, Tartu, Tartu, 51010, Estonia
| | - Angel Martinez-Perez
- Genomics of Complex Disease Unit, Institut d’Investigació Biomèdica Sant Pau (IIB SANT PAU), St Quinti 77-79, Barcelona, 8041, Spain
| | - Dylan Aïssi
- Bordeaux Population Health Research Center, University of Bordeaux, 146 rue Léo Saignat, Bordeaux, 33076, France
- UMR1219, INSERM, 146 rue Léo Saignat, Bordeaux, 33076, France
| | - Delphine Daian-Bacq
- Centre National de Recherche en Génomique Humaine, CEA, Université Paris-Saclay, 2 Rue Gaston Crémieux, Evry, 91057, France
- Laboratory of Excellence on Medical Genomics, GenMed, France
| | - Kaoru Ito
- Laboratory for Cardiovascular Genomics and Informatics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehirocho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
| | - Pradeep Natarajan
- Cardiovascular Research Center, Massachusetts General Hospital, 185 Cambridge Street, Boston, MA, 02446, USA
- Program in Medical and Population Genetics and the Cardiovascular Disease Initiative, Broad Institute of Harvard & MIT, 75 Ames St, Cambridge, MA, 02142, USA
- Department of Medicine, Harvard Medical School, Shattuck St, Boston, MA, 02115, USA
| | - Pamela L. Lutsey
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, 1300 South Second Street, Minneapolis, MN, 55454, USA
| | - Girish N. Nadkarni
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gu stave L. Levy Pl, New York, NY, 10029, USA
- Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Paul S. de Vries
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, University of Texas Health Science Center at Houston, 1200 Pressler St, Houston, TX, 77030, USA
| | | | - Brooke N. Wolford
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jack W. Pattee
- Division of Biostatistics, University of Minnesota, 420 Delaware Street SE, Minneapolis, MN, 55455, USA
- Center for Innovative Design & Analysis and Department of Biostatistics & Informatics, Colorado School of Public Health, 13001 East 17th Place, Aurora, CO, 80045, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA
| | - Sigrid K. Braekkan
- Thrombosis Research Center (TREC), UiT - The Arctic University of Norway, Universitetsvegen 57, Tromsø, 9037, Norway
- Division of internal medicine, University Hospital of North Norway, Tromsø, 9038, Norway
| | - Ruifang Li-Gao
- Clinical Epidemiology, Leiden University Medical Center, PO Box 9600, Leiden, 2300 RC, The Netherlands
| | - Noemie Saut
- Hematology Laboratory, La Timone University Hospital of Marseille, 264 Rue Saint-Pierre, Marseille, 13385, France
| | - Corriene Sept
- Department of Epidemiology, Harvard TH Chan Harvard School of Public Health, 655 Huntington Ave., Building II, Boston, MA, 02115, USA
| | - Marine Germain
- Bordeaux Population Health Research Center, University of Bordeaux, 146 rue Léo Saignat, Bordeaux, 33076, France
- UMR1219, INSERM, 146 rue Léo Saignat, Bordeaux, 33076, France
- Laboratory of Excellence on Medical Genomics, GenMed, France
| | - Renae L. Judy
- Surgery, University of Pennsylvania, 3401 Walnut Street, Philadelphia, PA, 19104, USA
| | - Kerri L. Wiggins
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
| | - Darae Ko
- The Framingham Heart Study, Boston University and NHLBI, 73 Mt. Wayte Ave, Suite #2, Framingham, MA, 01702, USA
- Section of Cardiovascular Medicine, Boston University School of Medicine, 85 East Newton Street, Boston, MA, 02118, USA
| | - Christopher J. O’Donnell
- Cardiology Section, Department of Medicine, VA Boston Healthcare System, Boston, MA, 02132, USA
- Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA
| | - Kent D. Taylor
- Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation, 1124 W Carson St., Torrance, CA, 90502, USA
| | - Franco Giulianini
- Division of Preventive Medicine, Brigham and Women’s Hospital, 900 Commonwealth Ave, Boston, MA, 02215, USA
| | - Mariza De Andrade
- Department of Quantitative Health Sciences, Mayo Clinic, 200 First Street SW, Rochester, MN, 55905, USA
| | - Therese H. Nøst
- K.G. Jebsen Centre for Genetic Epidemiology, Department of Public Health and Nursing, Norwegian University of Science and Technology, Håkon Jarls gate 11, Trondheim, 7030, Norway
| | - Anne Boland
- Centre National de Recherche en Génomique Humaine, CEA, Université Paris-Saclay, 2 Rue Gaston Crémieux, Evry, 91057, France
- Laboratory of Excellence on Medical Genomics, GenMed, France
| | - Jean-Philippe Empana
- Integrative Epidemiology of cardiovascular diseases, Université Paris Cité, Paris Cardiovascular Research Center (PARCC), 56 rue Leblanc, Paris, 75015, France
- Department of Cardiology, APHP, Hopital Européen Georges Pompidou, 20 rue Leblanc, Paris, 75015, France
| | - Satoshi Koyama
- Laboratory for Cardiovascular Genomics and Informatics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehirocho, Tsurumi-ku, Yokohama, Kanagawa, 230-0045, Japan
- Cardiovascular Research Center, Massachusetts General Hospital, 185 Cambridge Street, Boston, MA, 02446, USA
- Program in Medical and Population Genetics and the Cardiovascular Disease Initiative, Broad Institute of Harvard & MIT, 75 Ames St, Cambridge, MA, 02142, USA
| | - Thomas Gilliland
- Cardiovascular Research Center, Massachusetts General Hospital, 185 Cambridge Street, Boston, MA, 02446, USA
- Program in Medical and Population Genetics and the Cardiovascular Disease Initiative, Broad Institute of Harvard & MIT, 75 Ames St, Cambridge, MA, 02142, USA
- Department of Medicine, Harvard Medical School, Shattuck St, Boston, MA, 02115, USA
| | - Ron Do
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gu stave L. Levy Pl, New York, NY, 10029, USA
- BioMe Phenomics Center, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Jennifer E. Huffman
- MAVERIC, VA Boston Heathcare System, 2 Avenue de Lafayette, Boston, MA, 02111, USA
| | - Xin Wang
- 23andMe, Inc., 223 N Mathilda Ave, Sunnyvale, CA, 94086, USA
| | - Wei Zhou
- Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, 55 Fruit Street, Boston, MA, 02114, USA
| | - Jose Manuel Soria
- Genomics of Complex Disease Unit, Institut d’Investigació Biomèdica Sant Pau (IIB SANT PAU), St Quinti 77-79, Barcelona, 8041, Spain
| | - Juan Carlos Souto
- Genomics of Complex Disease Unit, Institut d’Investigació Biomèdica Sant Pau (IIB SANT PAU), St Quinti 77-79, Barcelona, 8041, Spain
- Unit of Thrombosis and Hemostasis, Hospital de la Santa Creu i Sant Pau, St Quinti 89, Barcelona, 8041, Spain
| | - Nathan Pankratz
- Department of Laboratory Medicine and Pathology, University of Minnesota, 420 Delaware Street SE, Minneapolis, MN, 55455, USA
| | - Jeffery Haessler
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA
| | - Kristian Hindberg
- Thrombosis Research Center (TREC), UiT - The Arctic University of Norway, Universitetsvegen 57, Tromsø, 9037, Norway
| | - Frits R. Rosendaal
- Clinical Epidemiology, Leiden University Medical Center, PO Box 9600, Leiden, 2300 RC, The Netherlands
| | - Constance Turman
- Department of Epidemiology, Harvard TH Chan Harvard School of Public Health, 655 Huntington Ave., Building II, Boston, MA, 02115, USA
| | - Robert Olaso
- Centre National de Recherche en Génomique Humaine, CEA, Université Paris-Saclay, 2 Rue Gaston Crémieux, Evry, 91057, France
- Laboratory of Excellence on Medical Genomics, GenMed, France
| | - Rachel L. Kember
- Psychiatry, University of Pennsylvania, 3401 Walnut Street, Philadelphia, PA, 19104, USA
| | - Traci M. Bartz
- Cardiovascular Health Research Unit, Departments of Biostatistics and Medicine, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
| | - Julie A. Lynch
- VA Informatics & Computing Infrastructure, VA Salt Lake City Healthcare System, 500 Foothills Drive, Salt Lake City, UT, 84148, USA
- Epidemiology, University of Utah, 500 Foothills Drive, Salt Lake City, UT, 84148, USA
| | - Susan R. Heckbert
- Department of Epidemiology, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
| | - Sebastian M. Armasu
- Department of Quantitative Health Sciences, Mayo Clinic, 200 First Street SW, Rochester, MN, 55905, USA
| | - Ben Brumpton
- K.G. Jebsen Centre for Genetic Epidemiology, Department of Public Health and Nursing, Norwegian University of Science and Technology, Håkon Jarls gate 11, Trondheim, 7030, Norway
| | - David M. Smadja
- Hematology Department and Biosurgical Research Lab (Carpentier Foundation), European Georges Pompidou Hospital, Assistance Publique Hôpitaux de Paris, 20 rue Leblanc, Paris, 75015, France
- Innovative Therapies in Haemostasis, INSERM, Université de Paris, 4 avenue de l’Observatoire, Paris, 75270, France
| | - Xavier Jouven
- Integrative Epidemiology of cardiovascular diseases, Université Paris Descartes, Sorbonne Paris Cité, 56 rue Leblanc, Paris, 75015, France
- Paris Cardiovascular Research Center, Inserm U970, Université Paris Descartes, Sorbonne Paris Cité, 20 rue Leblanc, Paris, 75015, France
| | - Issei Komuro
- Department of Cardiovascular Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, Tokyo, Tokyo, 113-8655, Japan
| | - Katharine R. Clapham
- Program in Medical and Population Genetics and the Cardiovascular Disease Initiative, Broad Institute of Harvard & MIT, 75 Ames St, Cambridge, MA, 02142, USA
- Department of Medicine, Harvard Medical School, Shattuck St, Boston, MA, 02115, USA
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, 900 Commonwealth Ave, Boston, MA, 02215, USA
| | - Ruth J.F. Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
| | - Cristen J. Willer
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Maria Sabater-Lleal
- Genomics of Complex Disease Unit, Institut d’Investigació Biomèdica Sant Pau (IIB SANT PAU), St Quinti 77-79, Barcelona, 8041, Spain
- Cardiovascular Medicine Unit, Department of Medicine, Karolinska Institutet, Center for Molecular Medicine, Stockholm, 17176, Sweden
| | - James S. Pankow
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, 1300 South Second Street, Minneapolis, MN, 55454, USA
| | - Alexander P. Reiner
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA
- Department of Epidemiology, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
| | - Vania M. Morelli
- Thrombosis Research Center (TREC), UiT - The Arctic University of Norway, Universitetsvegen 57, Tromsø, 9037, Norway
- Division of internal medicine, University Hospital of North Norway, Tromsø, 9038, Norway
| | - Paul M. Ridker
- Division of Preventive Medicine, Brigham and Women’s Hospital, 900 Commonwealth Ave, Boston, MA, 02215, USA
- Harvard Medical School, Boston, MA, 02115, USA
| | - Astrid van Hylckama Vlieg
- Clinical Epidemiology, Leiden University Medical Center, PO Box 9600, Leiden, 2300 RC, The Netherlands
| | - Jean-François Deleuze
- Centre National de Recherche en Génomique Humaine, CEA, Université Paris-Saclay, 2 Rue Gaston Crémieux, Evry, 91057, France
- Laboratory of Excellence on Medical Genomics, GenMed, France
- Centre D’Etude du Polymorphisme Humain, Fondation Jean Dausset, 27 rue Juliette Dodu, Paris, 75010, France
| | - Peter Kraft
- Department of Epidemiology, Harvard TH Chan Harvard School of Public Health, 655 Huntington Ave., Building II, Boston, MA, 02115, USA
| | - Daniel J. Rader
- Departments of Medicine and Genetics and Cardiovascular Institute, Perelman School of Medicine, University of Pennsylvania, 3400 Civic Center Blvd, Philadelphia, PA, 19104, USA
| | | | | | | | | | | | - Kyung Min Lee
- VA Informatics & Computing Infrastructure, VA Salt Lake City Healthcare System, 500 Foothills Drive, Salt Lake City, UT, 84148, USA
| | - Bruce M. Psaty
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
- Department of Epidemiology, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
- Department of Health Systems and Population Heath, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
| | - Anne Heidi Skogholt
- K.G. Jebsen Centre for Genetic Epidemiology, Department of Public Health and Nursing, Norwegian University of Science and Technology, Håkon Jarls gate 11, Trondheim, 7030, Norway
| | - Joseph Emmerich
- Department of vascular medicine, Paris Saint-Joseph Hospital Group, University of Paris, 185 rue Raymond Losserand, Paris, 75674, France
- UMR1153, INSERM CRESS, 185 rue Raymond Losserand, Paris, 75674, France
| | - Pierre Suchon
- Hematology Laboratory, La Timone University Hospital of Marseille, 264 Rue Saint-Pierre, Marseille, 13385, France
- C2VN, INSERM, INRAE, Aix-Marseille University, 27, bd Jean Moulin, Marseille, 13385, France
| | - Stephen S. Rich
- Center for Public Health Genomics, University of Virginia, 3242 West Complex, Charlottesville, VA, 22908-0717, USA
| | - Ha My T. Vy
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Pl, New York, NY, 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1 Gu stave L. Levy Pl, New York, NY, 10029, USA
| | - Weihong Tang
- Division of Epidemiology and Community Health, School of Public Health, University of Minnesota, 1300 South Second Street, Minneapolis, MN, 55454, USA
| | - Rebecca D. Jackson
- College of Medicine, Ohio State University, 376 W. 10th Ave, Columbus, OH, 43210, USA
| | - John-Bjarne Hansen
- Thrombosis Research Center (TREC), UiT - The Arctic University of Norway, Universitetsvegen 57, Tromsø, 9037, Norway
- Division of internal medicine, University Hospital of North Norway, Tromsø, 9038, Norway
| | - Pierre-Emmanuel Morange
- Hematology Laboratory, La Timone University Hospital of Marseille, 264 Rue Saint-Pierre, Marseille, 13385, France
- C2VN, INSERM, INRAE, Aix-Marseille University, 27, bd Jean Moulin, Marseille, 13385, France
| | - Christopher Kabrhel
- Emergency Medicine, Massachusetts General Hospital, Zero Emerson Place, Suite 3B, Boston, MA, 02114, USA
- Emergency Medicine, Harvard Medical School, Zero Emerson Place, Suite 3B, Boston, MA, 02114, USA
| | - David-Alexandre Trégouët
- Bordeaux Population Health Research Center, University of Bordeaux, 146 rue Léo Saignat, Bordeaux, 33076, France
- UMR1219, INSERM, 146 rue Léo Saignat, Bordeaux, 33076, France
- Laboratory of Excellence on Medical Genomics, GenMed, France
| | - Scott M. Damrauer
- Corporal Michael J. Crescenz Philadelphia VA Medical Center, 3900 Woodland Ave, Philadelphia, PA, 19104, USA
- Department of Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Andrew D. Johnson
- Population Sciences Branch, Division of Intramural Research, National Heart, Lung and Blood Institute, 73 Mt. Wayte Ave, Suite #2, Framingham, MA, 01702, USA
- The Framingham Heart Study, Boston University and NHLBI, 73 Mt. Wayte Ave, Suite #2, Framingham, MA, 01702, USA
| | - Nicholas L. Smith
- Department of Epidemiology, University of Washington, 1730 Minor Ave, Suite 1360, Seattle, WA, 98101, USA
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA, 98101, USA
- Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA, 98108, USA
| |
Collapse
|
42
|
Sherafati A, Elsekaily O, Saadatagah S, Kochan DC, Lee C, Wiesner GL, Liu C, Dellefave-Castillo L, Namjou B, Perez EF, Salvati ZM, Connolly JJ, Hakonarson H, Williams MS, Jarvik GP, Chung WK, McNally EM, Manolio TA, Kullo IJ. Pathogenic variants in arteriopathy genes detected in a targeted sequencing study: Penetrance and 1-year outcomes after return of results. Genet Med 2022; 24:2123-2133. [PMID: 35943490 PMCID: PMC9837827 DOI: 10.1016/j.gim.2022.07.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Revised: 07/04/2022] [Accepted: 07/04/2022] [Indexed: 01/18/2023] Open
Abstract
PURPOSE We estimated the penetrance of pathogenic/likely pathogenic (P/LP) variants in arteriopathy-related genes and assessed near-term outcomes following return of results. METHODS Participants (N = 24,520) in phase III of the Electronic Medical Records and Genomics network underwent targeted sequencing of 68 actionable genes, including 9 genes associated with arterial aneurysmal diseases. Penetrance was estimated on the basis of the presence of relevant clinical traits. Outcomes occurring within 1 year of return of results included new diagnoses, referral to a specialist, new tests ordered, surveillance initiated, and new medications started. RESULTS P/LP variants were present in 34 participants. The average penetrance across genes was 59%, ranging from 86% for FBN1 variants to 25% for SMAD3. Of 16 participants in whom results were returned, 1-year outcomes occurred in 63%. A new diagnosis was made in 44% of the participants, 56% were referred to a specialist, a new test was ordered in 44%, surveillance was initiated in 31%, and a new medication was started in 31%. CONCLUSION Penetrance of P/LP variants in arteriopathy-related genes, identified in a large, targeted sequencing study, was variable and overall lower than that reported in clinical cohorts. Meaningful outcomes within the first year were noted in 63% of participants who received results.
Collapse
Affiliation(s)
- Alborz Sherafati
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Omar Elsekaily
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | | | - David C Kochan
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Christopher Lee
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN
| | - Georgia L Wiesner
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY
| | - Lisa Dellefave-Castillo
- Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Bahram Namjou
- Center for Autoimmune Genomics and Etiology (CAGE), Cincinnati Children's Hospital Medical Center, Cincinnati, OH
| | - Emma F Perez
- Department of Medicine, Brigham and Women's Hospital, Boston MA
| | | | - John J Connolly
- The Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA
| | - Hakon Hakonarson
- The Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA; Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA; Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA; Division of Pulmonary Medicine, Children's Hospital of Philadelphia, Philadelphia, PA
| | | | - Gail P Jarvik
- Division of Medical Genetics, Department of Medicine, University of Washington Medical Center, Seattle, WA; Department of Genome Sciences, University of Washington Medical Center, Seattle, WA
| | - Wendy K Chung
- Department of Pediatrics, Columbia University Irving Medical Center, New York, NY; Department of Medicine, Columbia University Irving Medical Center, New York, NY
| | - Elizabeth M McNally
- Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Teri A Manolio
- Division of Genomic Medicine, National Human Genome Research Institute, Bethesda, MD
| | - Iftikhar J Kullo
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN; Gonda Vascular Center, Mayo Clinic, Rochester, MN.
| |
Collapse
|
43
|
The Genetic Architecture of the Etiology of Lower Extremity Peripheral Artery Disease: Current Knowledge and Future Challenges in the Era of Genomic Medicine. Int J Mol Sci 2022; 23:ijms231810481. [PMID: 36142394 PMCID: PMC9499674 DOI: 10.3390/ijms231810481] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 09/05/2022] [Accepted: 09/06/2022] [Indexed: 12/24/2022] Open
Abstract
Lower extremity artery disease (LEAD), caused by atherosclerotic obstruction of the arteries of the lower limb extremities, has exhibited an increase in mortality and morbidity worldwide. The phenotypic variability of LEAD is correlated with its complex, multifactorial etiology. In addition to traditional risk factors, it has been shown that the interaction between genetic factors (epistasis) or between genes and the environment potentially have an independent role in the development and progression of LEAD. In recent years, progress has been made in identifying genetic variants associated with LEAD, by Genome-Wide Association Studies (GWAS), Whole Exome Sequencing (WES) studies, and epigenetic profiling. The aim of this review is to present the current knowledge about the genetic factors involved in the etiopathogenic mechanisms of LEAD, as well as possible directions for future research. We analyzed data from the literature, starting with candidate gene-based association studies, and then continuing with extensive association studies, such as GWAS and WES. The results of these studies showed that the genetic architecture of LEAD is extremely heterogeneous. In the future, the identification of new genetic factors will allow for the development of targeted molecular therapies, and the use of polygenic risk scores (PRS) to identify individuals at an increased risk of LEAD will allow for early prophylactic measures and personalized therapy to improve their prognosis.
Collapse
|
44
|
Jennings MV, Lee H, Rocha DB, Bianchi SB, Coombes BJ, Crist RC, Faucon AB, Hu Y, Kember RL, Mallard TT, Niarchou M, Poulsen MN, Straub P, Urman RD, Walsh CG, Davis LK, Smoller JW, Troiani V, Sanchez-Roige S. Identifying High-Risk Comorbidities Associated with Opioid Use Patterns Using Electronic Health Record Prescription Data. Complex Psychiatry 2022; 8:47-55. [PMID: 36545045 PMCID: PMC9669950 DOI: 10.1159/000525313] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 03/23/2022] [Indexed: 12/24/2022] Open
Abstract
Introduction Opioid use disorders (OUDs) constitute a major public health issue, and we urgently need alternative methods for characterizing risk for OUD. Electronic health records (EHRs) are useful tools for understanding complex medical phenotypes but have been underutilized for OUD because of challenges related to underdiagnosis, binary diagnostic frameworks, and minimally characterized reference groups. As a first step in addressing these challenges, a new paradigm is warranted that characterizes risk for opioid prescription misuse on a continuous scale of severity, i.e., as a continuum. Methods Across sites within the PsycheMERGE network, we extracted prescription opioid data and diagnoses that co-occur with OUD (including psychiatric and substance use disorders, pain-related diagnoses, HIV, and hepatitis C) for over 2.6 million patients across three health registries (Vanderbilt University Medical Center, Mass General Brigham, Geisinger) between 2005 and 2018. We defined three groups based on levels of opioid exposure: no prescriptions, minimal exposure, and chronic exposure and then compared the comorbidity profiles of these groups to the full registries and to those with OUD diagnostic codes. Results Our results confirm that EHR data reflects known higher prevalence of substance use disorders, psychiatric disorders, medical, and pain diagnoses in patients with OUD diagnoses and chronic opioid use. Comorbidity profiles that distinguish opioid exposure are strikingly consistent across large health systems, indicating the phenotypes described in this new quantitative framework are robust to health systems differences. Conclusion This work indicates that EHR prescription opioid data can serve as a platform to characterize complex risk markers for OUD using existing data.
Collapse
Affiliation(s)
- Mariela V Jennings
- Department of Psychiatry, University of California San Diego, La Jolla, California, USA
| | - Hyunjoon Lee
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Daniel B Rocha
- Phenomic Analytics and Clinical Data Core, Geisinger, Danville, Pennsylvania, USA
| | - Sevim B Bianchi
- Department of Psychiatry, University of California San Diego, La Jolla, California, USA
| | - Brandon J Coombes
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
| | - Richard C Crist
- Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Annika B Faucon
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Yirui Hu
- Department of Population Health Sciences, Geisinger, Danville, Pennsylvania, USA
| | - Rachel L Kember
- Department of Psychiatry, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Travis T Mallard
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Maria Niarchou
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Melissa N Poulsen
- Department of Population Health Sciences, Geisinger, Danville, Pennsylvania, USA
| | - Peter Straub
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Richard D Urman
- Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Colin G Walsh
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA.,Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA.,Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Lea K Davis
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA.,Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA.,Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Jordan W Smoller
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA.,Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Vanessa Troiani
- Geisinger Clinic, Geisinger, Danville, Pennsylvania, USA.,Department of Imaging Science and Innovation, Geisinger, Danville, Pennsylvania, USA.,Neuroscience Institute, Geisinger, Danville, Pennsylvania, USA.,Department of Basic Sciences, Geisinger Commonwealth School of Medicine, Scranton, Pennsylvania, USA
| | - Sandra Sanchez-Roige
- Department of Psychiatry, University of California San Diego, La Jolla, California, USA.,Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | | |
Collapse
|
45
|
Brandt PS, Pacheco JA, Adekkanattu P, Sholle ET, Abedian S, Stone DJ, Knaack DM, Xu J, Xu Z, Peng Y, Benda NC, Wang F, Luo Y, Jiang G, Pathak J, Rasmussen LV. Design and validation of a FHIR-based EHR-driven phenotyping toolbox. J Am Med Inform Assoc 2022; 29:1449-1460. [PMID: 35799370 PMCID: PMC9382394 DOI: 10.1093/jamia/ocac063] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Revised: 04/04/2022] [Accepted: 06/17/2022] [Indexed: 12/14/2022] Open
Abstract
OBJECTIVES To develop and validate a standards-based phenotyping tool to author electronic health record (EHR)-based phenotype definitions and demonstrate execution of the definitions against heterogeneous clinical research data platforms. MATERIALS AND METHODS We developed an open-source, standards-compliant phenotyping tool known as the PhEMA Workbench that enables a phenotype representation using the Fast Healthcare Interoperability Resources (FHIR) and Clinical Quality Language (CQL) standards. We then demonstrated how this tool can be used to conduct EHR-based phenotyping, including phenotype authoring, execution, and validation. We validated the performance of the tool by executing a thrombotic event phenotype definition at 3 sites, Mayo Clinic (MC), Northwestern Medicine (NM), and Weill Cornell Medicine (WCM), and used manual review to determine precision and recall. RESULTS An initial version of the PhEMA Workbench has been released, which supports phenotype authoring, execution, and publishing to a shared phenotype definition repository. The resulting thrombotic event phenotype definition consisted of 11 CQL statements, and 24 value sets containing a total of 834 codes. Technical validation showed satisfactory performance (both NM and MC had 100% precision and recall and WCM had a precision of 95% and a recall of 84%). CONCLUSIONS We demonstrate that the PhEMA Workbench can facilitate EHR-driven phenotype definition, execution, and phenotype sharing in heterogeneous clinical research data environments. A phenotype definition that integrates with existing standards-compliant systems, and the use of a formal representation facilitates automation and can decrease potential for human error.
Collapse
Affiliation(s)
- Pascal S Brandt
- Corresponding Author: Pascal S. Brandt, Department of Biomedical Informatics & Medical Education, University of Washington, Box 358047, Seattle, WA 98195, USA;
| | - Jennifer A Pacheco
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| | - Prakash Adekkanattu
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Evan T Sholle
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Sajjad Abedian
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Daniel J Stone
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, Minnesota, USA
| | - David M Knaack
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, Minnesota, USA
| | - Jie Xu
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Zhenxing Xu
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Yifan Peng
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Natalie C Benda
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Fei Wang
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Yuan Luo
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| | - Guoqian Jiang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, Minnesota, USA
| | - Jyotishman Pathak
- Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, New York, USA
| | - Luke V Rasmussen
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA
| |
Collapse
|
46
|
Blair DR, Hoffmann TJ, Shieh JT. Common genetic variation associated with Mendelian disease severity revealed through cryptic phenotype analysis. Nat Commun 2022; 13:3675. [PMID: 35760791 PMCID: PMC9237040 DOI: 10.1038/s41467-022-31030-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 05/30/2022] [Indexed: 11/09/2022] Open
Abstract
Clinical heterogeneity is common in Mendelian disease, but small sample sizes make it difficult to identify specific contributing factors. However, if a disease represents the severely affected extreme of a spectrum of phenotypic variation, then modifier effects may be apparent within a larger subset of the population. Analyses that take advantage of this full spectrum could have substantially increased power. To test this, we developed cryptic phenotype analysis, a model-based approach that infers quantitative traits that capture disease-related phenotypic variability using qualitative symptom data. By applying this approach to 50 Mendelian diseases in two cohorts, we identify traits that reliably quantify disease severity. We then conduct genome-wide association analyses for five of the inferred cryptic phenotypes, uncovering common variation that is predictive of Mendelian disease-related diagnoses and outcomes. Overall, this study highlights the utility of computationally-derived phenotypes and biobank-scale cohorts for investigating the complex genetic architecture of Mendelian diseases.
Collapse
Affiliation(s)
- David R Blair
- Division of Medical Genetics, Department of Pediatrics, Benioff Children's Hospital, San Francisco, CA, USA.
| | - Thomas J Hoffmann
- Institute for Human Genetics, San Francisco, CA, USA.,Department of Epidemiology and Biostatistics, University of California, San Francisco, CA, USA
| | - Joseph T Shieh
- Division of Medical Genetics, Department of Pediatrics, Benioff Children's Hospital, San Francisco, CA, USA. .,Institute for Human Genetics, San Francisco, CA, USA.
| |
Collapse
|
47
|
Feng YCA, Stanaway IB, Connolly JJ, Denny JC, Luo Y, Weng C, Wei WQ, Weiss ST, Karlson EW, Smoller JW. Psychiatric manifestations of rare variation in medically actionable genes: a PheWAS approach. BMC Genomics 2022; 23:385. [PMID: 35590255 PMCID: PMC9121574 DOI: 10.1186/s12864-022-08600-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 04/22/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND As genomic sequencing moves closer to clinical implementation, there has been an increasing acceptance of returning incidental findings to research participants and patients for mutations in highly penetrant, medically actionable genes. A curated list of genes has been recommended by the American College of Medical Genetics and Genomics (ACMG) for return of incidental findings. However, the pleiotropic effects of these genes are not fully known. Such effects could complicate genetic counseling when returning incidental findings. In particular, there has been no systematic evaluation of psychiatric manifestations associated with rare variation in these genes. RESULTS Here, we leveraged a targeted sequence panel and real-world electronic health records from the eMERGE network to assess the burden of rare variation in the ACMG-56 genes and two psychiatric-associated genes (CACNA1C and TCF4) across common mental health conditions in 15,181 individuals of European descent. As a positive control, we showed that this approach replicated the established association between rare mutations in LDLR and hypercholesterolemia with no visible inflation from population stratification. However, we did not identify any genes significantly enriched with rare deleterious variants that confer risk for common psychiatric disorders after correction for multiple testing. Suggestive associations were observed between depression and rare coding variation in PTEN (P = 1.5 × 10-4), LDLR (P = 3.6 × 10-4), and CACNA1S (P = 5.8 × 10-4). We also observed nominal associations between rare variants in KCNQ1 and substance use disorders (P = 2.4 × 10-4), and APOB and tobacco use disorder (P = 1.1 × 10-3). CONCLUSIONS Our results do not support an association between psychiatric disorders and incidental findings in medically actionable gene mutations, but power was limited with the available sample sizes. Given the phenotypic and genetic complexity of psychiatric phenotypes, future work will require a much larger sequencing dataset to determine whether incidental findings in these genes have implications for risk of psychopathology.
Collapse
Affiliation(s)
- Yen-Chen A Feng
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA. .,Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, USA. .,Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei, Taiwan. .,Master of Public Health Program, National Taiwan University, Taipei, Taiwan.
| | - Ian B Stanaway
- Division of Nephrology, School of Medicine, Kidney Research Institute, University of Washington, Seattle, WA, USA
| | - John J Connolly
- Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Joshua C Denny
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.,Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.,All of Us Research Program, National Institutes of Health, Besthesda, MD, USA
| | - Yuan Luo
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Wei-Qi Wei
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Scott T Weiss
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.,Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Elizabeth W Karlson
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.,Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Boston, MA, USA
| | - Jordan W Smoller
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA. .,Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA, USA. .,Center for Precision Psychiatry, Massachusetts General Hospital, Boston, MA, USA.
| |
Collapse
|
48
|
Mo H, Denny JC. The U.S. National Library of Medicine’s impact on precision and genomic medicine. INFORMATION SERVICES & USE 2022; 42:71-80. [PMID: 35600119 PMCID: PMC9108560 DOI: 10.3233/isu-210144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Precision medicine offers the potential to improve health through deeper understandings of the lifestyle, biological, and environmental influences on health. Under Dr. Donald A. B. Lindberg’s leadership, the U.S. National Library of Medicine (NLM) has developed the central reference resources for biomedical research and molecular laboratory medicine that enable precision medicine. The hosting and curation of biomedical knowledge repositories and data by NLM enable quality information reachable for providers and researchers throughout the world. NLM has been supporting the innovation of electronic health record systems to implement computability and secondary use for biomedical research, producing the scale of linked health and molecular datasets necessary for precision medicine discovery.
Collapse
Affiliation(s)
- Huan Mo
- National Human Genome Research Institute, National Institutes of Health, , , USA
| | - Joshua C. Denny
- National Human Genome Research Institute, National Institutes of Health, , , USA
| |
Collapse
|
49
|
Cruz Navarro J, Ponce Mejia LL, Robertson C. A Precision Medicine Agenda in Traumatic Brain Injury. Front Pharmacol 2022; 13:713100. [PMID: 35370671 PMCID: PMC8966615 DOI: 10.3389/fphar.2022.713100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 02/25/2022] [Indexed: 11/13/2022] Open
Abstract
Traumatic brain injury remains a leading cause of death and disability across the globe. Substantial uncertainty in outcome prediction continues to be the rule notwithstanding the existing prediction models. Additionally, despite very promising preclinical data, randomized clinical trials (RCTs) of neuroprotective strategies in moderate and severe TBI have failed to demonstrate significant treatment effects. Better predictive models are needed, as the existing validated ones are more useful in prognosticating poor outcome and do not include biomarkers, genomics, proteonomics, metabolomics, etc. Invasive neuromonitoring long believed to be a "game changer" in the care of TBI patients have shown mixed results, and the level of evidence to support its widespread use remains insufficient. This is due in part to the extremely heterogenous nature of the disease regarding its etiology, pathology and severity. Currently, the diagnosis of traumatic brain injury (TBI) in the acute setting is centered on neurological examination and neuroimaging tools such as CT scanning and MRI, and its treatment has been largely confronted using a "one-size-fits-all" approach, that has left us with many unanswered questions. Precision medicine is an innovative approach for TBI treatment that considers individual variability in genes, environment, and lifestyle and has expanded across the medical fields. In this article, we briefly explore the field of precision medicine in TBI including biomarkers for therapeutic decision-making, multimodal neuromonitoring, and genomics.
Collapse
Affiliation(s)
- Jovany Cruz Navarro
- Departments of Anesthesiology and Neurosurgery, Baylor College of Medicine, Houston, TX, United States
| | - Lucido L Ponce Mejia
- Departments of Neurosurgery and Neurology, LSU Health Science Center, New Orleans, LA, United States
| | - Claudia Robertson
- Department of Neurosurgery, Baylor College of Medicine, Houston, TX, United States
| |
Collapse
|
50
|
Kothari C, Srivastava S, Kousa Y, Izem R, Gierdalski M, Kim D, Good A, Dies KA, Geisel G, Morizono H, Gallo V, Pomeroy SL, Garden GA, Guay-Woodford L, Sahin M, Avillach P. Validation of a computational phenotype for finding patients eligible for genetic testing for pathogenic PTEN variants across three centers. J Neurodev Disord 2022; 14:24. [PMID: 35321655 PMCID: PMC8943944 DOI: 10.1186/s11689-022-09434-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Accepted: 03/04/2022] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Computational phenotypes are most often combinations of patient billing codes that are highly predictive of disease using electronic health records (EHR). In the case of rare diseases that can only be diagnosed by genetic testing, computational phenotypes identify patient cohorts for genetic testing and possible diagnosis. This article details the validation of a computational phenotype for PTEN hamartoma tumor syndrome (PHTS) against the EHR of patients at three collaborating clinical research centers: Boston Children's Hospital, Children's National Hospital, and the University of Washington. METHODS A combination of billing codes from the International Classification of Diseases versions 9 and 10 (ICD-9 and ICD-10) for diagnostic criteria postulated by a research team at Cleveland Clinic was used to identify patient cohorts for genetic testing from the clinical data warehouses at the three research centers. Subsequently, the EHR-including billing codes, clinical notes, and genetic reports-of these patients were reviewed by clinical experts to identify patients with PHTS. RESULTS The PTEN genetic testing yield of the computational phenotype, the number of patients who needed to be genetically tested for incidence of pathogenic PTEN gene variants, ranged from 82 to 94% at the three centers. CONCLUSIONS Computational phenotypes have the potential to enable the timely and accurate diagnosis of rare genetic diseases such as PHTS by identifying patient cohorts for genetic sequencing and testing.
Collapse
Affiliation(s)
- Cartik Kothari
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
| | - Siddharth Srivastava
- Department of Neurology, Rosamund Stone Zander Translational Neuroscience Center, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Youssef Kousa
- Division of Neurology, Children's National Hospital, Washington, DC, 20010, USA.,Department of Genomics and Precision Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC, 20052, USA
| | - Rima Izem
- Division of Biostatistics and Study Methodology, Children's National Research Institute, Silver Spring, MD, 20910, USA
| | - Marcin Gierdalski
- Division of Biostatistics and Study Methodology, Children's National Hospital, Washington, DC, 20010, USA
| | - Dongkyu Kim
- Division of Biostatistics and Study Methodology, Children's National Hospital, Washington, DC, 20010, USA
| | - Amy Good
- Institute for Translational Health Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Kira A Dies
- Department of Neurology, Rosamund Stone Zander Translational Neuroscience Center, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Gregory Geisel
- Department of Neurology, Rosamund Stone Zander Translational Neuroscience Center, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Hiroki Morizono
- Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, 20010, USA.,Department of Genomics and Precision Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC, 20052, USA
| | - Vittorio Gallo
- Center for Neuroscience Research, Children's National Research Institute, Children's National Hospital, Washington, DC, 20010, USA
| | - Scott L Pomeroy
- Department of Neurology, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Gwenn A Garden
- Department of Neurology and Center on Human Development and Disability, University of Washington, Seattle, WA, 98195, USA.,Department of Neurology, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Lisa Guay-Woodford
- Center for Translational Research, Children's National Hospital, Washington, DC, 20010, USA
| | - Mustafa Sahin
- Department of Neurology, Rosamund Stone Zander Translational Neuroscience Center, Boston Children's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Paul Avillach
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA.
| |
Collapse
|