1
|
Nam Y, Kim J, Jung SH, Woerner J, Suh EH, Lee DG, Shivakumar M, Lee ME, Kim D. Harnessing Artificial Intelligence in Multimodal Omics Data Integration: Paving the Path for the Next Frontier in Precision Medicine. Annu Rev Biomed Data Sci 2024; 7:225-250. [PMID: 38768397 DOI: 10.1146/annurev-biodatasci-102523-103801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
The integration of multiomics data with detailed phenotypic insights from electronic health records marks a paradigm shift in biomedical research, offering unparalleled holistic views into health and disease pathways. This review delineates the current landscape of multimodal omics data integration, emphasizing its transformative potential in generating a comprehensive understanding of complex biological systems. We explore robust methodologies for data integration, ranging from concatenation-based to transformation-based and network-based strategies, designed to harness the intricate nuances of diverse data types. Our discussion extends from incorporating large-scale population biobanks to dissecting high-dimensional omics layers at the single-cell level. The review underscores the emerging role of large language models in artificial intelligence, anticipating their influence as a near-future pivot in data integration approaches. Highlighting both achievements and hurdles, we advocate for a concerted effort toward sophisticated integration models, fortifying the foundation for groundbreaking discoveries in precision medicine.
Collapse
Affiliation(s)
- Yonghyun Nam
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Jaesik Kim
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Sang-Hyuk Jung
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Jakob Woerner
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Erica H Suh
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Dong-Gi Lee
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Manu Shivakumar
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Matthew E Lee
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| | - Dokyoon Kim
- Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA;
| |
Collapse
|
2
|
Hrytsenko Y, Shea B, Elgart M, Kurniansyah N, Lyons G, Morrison AC, Carson AP, Haring B, Mitchell BD, Psaty BM, Jaeger BC, Gu CC, Kooperberg C, Levy D, Lloyd-Jones D, Choi E, Brody JA, Smith JA, Rotter JI, Moll M, Fornage M, Simon N, Castaldi P, Casanova R, Chung RH, Kaplan R, Loos RJF, Kardia SLR, Rich SS, Redline S, Kelly T, O'Connor T, Zhao W, Kim W, Guo X, Ida Chen YD, Sofer T. Machine learning models for predicting blood pressure phenotypes by combining multiple polygenic risk scores. Sci Rep 2024; 14:12436. [PMID: 38816422 PMCID: PMC11139858 DOI: 10.1038/s41598-024-62945-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 05/22/2024] [Indexed: 06/01/2024] Open
Abstract
We construct non-linear machine learning (ML) prediction models for systolic and diastolic blood pressure (SBP, DBP) using demographic and clinical variables and polygenic risk scores (PRSs). We developed a two-model ensemble, consisting of a baseline model, where prediction is based on demographic and clinical variables only, and a genetic model, where we also include PRSs. We evaluate the use of a linear versus a non-linear model at both the baseline and the genetic model levels and assess the improvement in performance when incorporating multiple PRSs. We report the ensemble model's performance as percentage variance explained (PVE) on a held-out test dataset. A non-linear baseline model improved the PVEs from 28.1 to 30.1% (SBP) and 14.3% to 17.4% (DBP) compared with a linear baseline model. Including seven PRSs in the genetic model computed based on the largest available GWAS of SBP/DBP improved the genetic model PVE from 4.8 to 5.1% (SBP) and 4.7 to 5% (DBP) compared to using a single PRS. Adding additional 14 PRSs computed based on two independent GWASs further increased the genetic model PVE to 6.3% (SBP) and 5.7% (DBP). PVE differed across self-reported race/ethnicity groups, with primarily all non-White groups benefitting from the inclusion of additional PRSs. In summary, non-linear ML models improves BP prediction in models incorporating diverse populations.
Collapse
Affiliation(s)
- Yana Hrytsenko
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Benjamin Shea
- CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Michael Elgart
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | | | - Genevieve Lyons
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Alanna C Morrison
- Department of Epidemiology, School of Public Health, Human Genetics Center, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - April P Carson
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - Bernhard Haring
- Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
- Department of Medicine III, Saarland University, Homburg, Saarland, Germany
| | - Braxton D Mitchell
- Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Bruce M Psaty
- Department of Medicine, University of Washington, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
- Health Systems and Population Health, University of Washington, Seattle, WA, USA
| | - Byron C Jaeger
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - C Charles Gu
- The Center for Biostatistics and Data Science, Washington University, St. Louis, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Daniel Levy
- The Population Sciences Branch of the National Heart, Lung and Blood Institute, Bethesda, MD, USA
- The Framingham Heart Study, Framingham, MA, USA
| | - Donald Lloyd-Jones
- Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
| | - Eunhee Choi
- Columbia Hypertension Laboratory, Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
| | - Jennifer A Brody
- Department of Medicine, University of Washington, Seattle, WA, USA
- Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Jerome I Rotter
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Matthew Moll
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- VA Boston Healthcare System, West Roxbury, MA, USA
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, USA
| | - Myriam Fornage
- Department of Epidemiology, School of Public Health, Human Genetics Center, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Noah Simon
- Department of Biostatistics, School of Public Health, University of Washington, Seattle, WA, USA
| | - Peter Castaldi
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Ramon Casanova
- Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Ren-Hua Chung
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Taipei City, Taiwan
| | - Robert Kaplan
- Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Ruth J F Loos
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty for Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Sharon L R Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Susan Redline
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
| | - Tanika Kelly
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Timothy O'Connor
- Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
- Program in Health Equity and Population Health, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Wei Zhao
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Wonji Kim
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, USA
| | - Xiuqing Guo
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Yii-Der Ida Chen
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Tamar Sofer
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, USA.
- CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA, USA.
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Center for Life Sciences CLS-934, 3 Blackfan St., Boston, MA, 02115, USA.
| |
Collapse
|
3
|
Moll M, Hecker J, Platig J, Zhang J, Ghosh AJ, Pratte KA, Wang RS, Hill D, Konigsberg IR, Chiles JW, Hersh CP, Castaldi PJ, Glass K, Dy JG, Sin DD, Tal-Singer R, Mouded M, Rennard SI, Anderson GP, Kinney GL, Bowler RP, Curtis JL, McDonald ML, Silverman EK, Hobbs BD, Cho MH. Polygenic and transcriptional risk scores identify chronic obstructive pulmonary disease subtypes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.20.24307621. [PMID: 38826461 PMCID: PMC11142287 DOI: 10.1101/2024.05.20.24307621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
Rationale Genetic variants and gene expression predict risk of chronic obstructive pulmonary disease (COPD), but their effect on COPD heterogeneity is unclear. Objectives Define high-risk COPD subtypes using both genetics (polygenic risk score, PRS) and blood gene expression (transcriptional risk score, TRS) and assess differences in clinical and molecular characteristics. Methods We defined high-risk groups based on PRS and TRS quantiles by maximizing differences in protein biomarkers in a COPDGene training set and identified these groups in COPDGene and ECLIPSE test sets. We tested multivariable associations of subgroups with clinical outcomes and compared protein-protein interaction networks and drug repurposing analyses between high-risk groups. Measurements and Main Results We examined two high-risk omics-defined groups in non-overlapping test sets (n=1,133 NHW COPDGene, n=299 African American (AA) COPDGene, n=468 ECLIPSE). We defined "High activity" (low PRS/high TRS) and "severe risk" (high PRS/high TRS) subgroups. Participants in both subgroups had lower body-mass index (BMI), lower lung function, and alterations in metabolic, growth, and immune signaling processes compared to a low-risk (low PRS, low TRS) reference subgroup. "High activity" but not "severe risk" participants had greater prospective FEV 1 decline (COPDGene: -51 mL/year; ECLIPSE: - 40 mL/year) and their proteomic profiles were enriched in gene sets perturbed by treatment with 5-lipoxygenase inhibitors and angiotensin-converting enzyme (ACE) inhibitors. Conclusions Concomitant use of polygenic and transcriptional risk scores identified clinical and molecular heterogeneity amongst high-risk individuals. Proteomic and drug repurposing analysis identified subtype-specific enrichment for therapies and suggest prior drug repurposing failures may be explained by patient selection.
Collapse
|
4
|
Song W, Shi Y, Lin GN. Haplotype function score improves biological interpretation and cross-ancestry polygenic prediction of human complex traits. eLife 2024; 12:RP92574. [PMID: 38639992 PMCID: PMC11031082 DOI: 10.7554/elife.92574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/20/2024] Open
Abstract
We propose a new framework for human genetic association studies: at each locus, a deep learning model (in this study, Sei) is used to calculate the functional genomic activity score for two haplotypes per individual. This score, defined as the Haplotype Function Score (HFS), replaces the original genotype in association studies. Applying the HFS framework to 14 complex traits in the UK Biobank, we identified 3619 independent HFS-trait associations with a significance of p < 5 × 10-8. Fine-mapping revealed 2699 causal associations, corresponding to a median increase of 63 causal findings per trait compared with single-nucleotide polymorphism (SNP)-based analysis. HFS-based enrichment analysis uncovered 727 pathway-trait associations and 153 tissue-trait associations with strong biological interpretability, including 'circadian pathway-chronotype' and 'arachidonic acid-intelligence'. Lastly, we applied least absolute shrinkage and selection operator (LASSO) regression to integrate HFS prediction score with SNP-based polygenic risk scores, which showed an improvement of 16.1-39.8% in cross-ancestry polygenic prediction. We concluded that HFS is a promising strategy for understanding the genetic basis of human complex traits.
Collapse
Affiliation(s)
- Weichen Song
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Bioengineering, Shanghai Jiao Tong UniversityShanghaiChina
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong UniversityShanghaiChina
| | - Yongyong Shi
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Brain Science, Shanghai Jiao Tong UniversityShanghaiChina
- Biomedical Sciences Institute of Qingdao University (Qingdao Branch of SJTU Bio-X12 Institutes), Qingdao UniversityQingdaoChina
| | - Guan Ning Lin
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, School of Bioengineering, Shanghai Jiao Tong UniversityShanghaiChina
| |
Collapse
|
5
|
Hou Y, Dai H, Chen N, Zhao Z, Wang Q, Hou T, Zheng J, Wang T, Li M, Lin H, Wang S, Zheng R, Lu J, Xu Y, Chen Y, Liu R, Ning G, Wang W, Bi Y, Wang J, Xu M. Whole Blood-based Transcriptional Risk Score for Nonobese Type 2 Diabetes Predicts Dynamic Changes in Glucose Metabolism. J Clin Endocrinol Metab 2023; 109:114-124. [PMID: 37555255 PMCID: PMC10735316 DOI: 10.1210/clinem/dgad466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 08/02/2023] [Accepted: 08/04/2023] [Indexed: 08/10/2023]
Abstract
CONTEXT The performance of peripheral blood transcriptional markers in evaluating risk of type 2 diabetes (T2D) with normal body mass index (BMI) is unknown. OBJECTIVE We developed a whole blood-based transcriptional risk score (wb-TRS) for nonobese T2D and assessed its contributions on disease risk and dynamic changes in glucose metabolism. METHODS Using a community-based cohort with blood transcriptome data, we developed the wb-TRS in 1105 participants aged ≥40 years who maintained a normal BMI for up to 10 years, and we validated the wb-TRS in an external dataset. Potential biological significance was explored. RESULTS The wb-TRS included 144 gene transcripts. Compared to the lowest tertile, wb-TRS in tertile 3 was associated with 8.91-fold (95% CI, 3.53-22.5) higher risk and each 1-unit increment was associated with 2.63-fold (95% CI, 1.87-3.68) higher risk of nonobese T2D. Furthermore, baseline wb-TRS significantly associated with dynamic changes in average, daytime, nighttime, and 24-hour glucose, HbA1c values, and area under the curve of glucose measured by continuous glucose monitoring over 6 months of intervention. The wb-TRS improved the prediction performance for nonobese T2D, combined with fasting glucose, triglycerides, and demographic and anthropometric parameters. Multi-contrast gene set enrichment (Mitch) analysis implicated oxidative phosphorylation, mTORC1 signaling, and cholesterol metabolism involved in nonobese T2D pathogenesis. CONCLUSION A whole blood-based nonobese T2D-associated transcriptional risk score was validated to predict dynamic changes in glucose metabolism. These findings suggested several biological pathways involved in the pathogenesis of nonobese T2D.
Collapse
Affiliation(s)
- Yanan Hou
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Huajie Dai
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Na Chen
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Zhiyun Zhao
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Qi Wang
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Tianzhichao Hou
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Jie Zheng
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Tiange Wang
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Mian Li
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Hong Lin
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Shuangyuan Wang
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Ruizhi Zheng
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Jieli Lu
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Yu Xu
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Yuhong Chen
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Ruixin Liu
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Guang Ning
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Weiqing Wang
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Yufang Bi
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Jiqiu Wang
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Min Xu
- Department of Endocrine and Metabolic Diseases, Shanghai Institute of Endocrine and Metabolic Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Shanghai National Clinical Research Center for Metabolic Diseases, Key Laboratory for Endocrine and Metabolic Diseases of the National Health Commission of the PR China, Shanghai Key Laboratory for Endocrine Tumor, State Key Laboratory of Medical Genomics, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| |
Collapse
|
6
|
Kendall TJ, Jimenez-Ramos M, Turner F, Ramachandran P, Minnier J, McColgan MD, Alam M, Ellis H, Dunbar DR, Kohnen G, Konanahalli P, Oien KA, Bandiera L, Menolascina F, Juncker-Jensen A, Alexander D, Mayor C, Guha IN, Fallowfield JA. An integrated gene-to-outcome multimodal database for metabolic dysfunction-associated steatotic liver disease. Nat Med 2023; 29:2939-2953. [PMID: 37903863 PMCID: PMC10667096 DOI: 10.1038/s41591-023-02602-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 09/20/2023] [Indexed: 11/01/2023]
Abstract
Metabolic dysfunction-associated steatotic liver disease (MASLD) is the commonest cause of chronic liver disease worldwide and represents an unmet precision medicine challenge. We established a retrospective national cohort of 940 histologically defined patients (55.4% men, 44.6% women; median body mass index 31.3; 32% with type 2 diabetes) covering the complete MASLD severity spectrum, and created a secure, searchable, open resource (SteatoSITE). In 668 cases and 39 controls, we generated hepatic bulk RNA sequencing data and performed differential gene expression and pathway analysis, including exploration of gender-specific differences. A web-based gene browser was also developed. We integrated histopathological assessments, transcriptomic data and 5.67 million days of time-stamped longitudinal electronic health record data to define disease-stage-specific gene expression signatures, pathogenic hepatic cell subpopulations and master regulator networks associated with adverse outcomes in MASLD. We constructed a 15-gene transcriptional risk score to predict future hepatic decompensation events (area under the receiver operating characteristic curve 0.86, 0.81 and 0.83 for 1-, 3- and 5-year risk, respectively). Additionally, thyroid hormone receptor beta regulon activity was identified as a critical suppressor of disease progression. SteatoSITE supports rational biomarker and drug development and facilitates precision medicine approaches for patients with MASLD.
Collapse
Affiliation(s)
- Timothy J Kendall
- Institute for Regeneration and Repair, University of Edinburgh, Edinburgh, UK
- Edinburgh Pathology, University of Edinburgh, Edinburgh, UK
| | - Maria Jimenez-Ramos
- Institute for Regeneration and Repair, University of Edinburgh, Edinburgh, UK
| | - Frances Turner
- Edinburgh Genomics (Bioinformatics), University of Edinburgh, Edinburgh, UK
| | | | - Jessica Minnier
- OHSU-PSU School of Public Health, Oregon Health & Sciences University, Portland, OR, USA
- Knight Cancer Institute Biostatistics Shared Resource, Oregon Health & Sciences University, Portland, OR, USA
| | - Michael D McColgan
- Precision Medicine Scotland-Innovation Centre (PMS-IC), University of Glasgow, Glasgow, UK
| | - Masood Alam
- Precision Medicine Scotland-Innovation Centre (PMS-IC), University of Glasgow, Glasgow, UK
| | - Harriet Ellis
- Precision Medicine Scotland-Innovation Centre (PMS-IC), University of Glasgow, Glasgow, UK
| | - Donald R Dunbar
- Edinburgh Genomics (Bioinformatics), University of Edinburgh, Edinburgh, UK
| | - Gabriele Kohnen
- Pathology Department, Queen Elizabeth University Hospital, Glasgow, UK
| | | | - Karin A Oien
- Pathology Department, Queen Elizabeth University Hospital, Glasgow, UK
| | - Lucia Bandiera
- School of Engineering, Institute of Bioengineering, University of Edinburgh, Edinburgh, UK
- Centre for Engineering Biology, University of Edinburgh, Edinburgh, UK
| | - Filippo Menolascina
- School of Engineering, Institute of Bioengineering, University of Edinburgh, Edinburgh, UK
- Centre for Engineering Biology, University of Edinburgh, Edinburgh, UK
| | | | | | - Charlie Mayor
- NHS Greater Glasgow and Clyde Safe Haven, Glasgow, UK
| | - Indra Neil Guha
- National Institute of Health Research (NIHR) Nottingham Biomedical Research Centre, Nottingham University Hospitals NHS Trust and University of Nottingham, Nottingham, UK
| | | |
Collapse
|
7
|
Vassy JL, Posner DC, Ho YL, Gagnon DR, Galloway A, Tanukonda V, Houghton SC, Madduri RK, McMahon BH, Tsao PS, Damrauer SM, O’Donnell CJ, Assimes TL, Casas JP, Gaziano JM, Pencina MJ, Sun YV, Cho K, Wilson PW. Cardiovascular Disease Risk Assessment Using Traditional Risk Factors and Polygenic Risk Scores in the Million Veteran Program. JAMA Cardiol 2023; 8:564-574. [PMID: 37133828 PMCID: PMC10157509 DOI: 10.1001/jamacardio.2023.0857] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 03/09/2023] [Indexed: 05/04/2023]
Abstract
Importance Primary prevention of atherosclerotic cardiovascular disease (ASCVD) relies on risk stratification. Genome-wide polygenic risk scores (PRSs) are proposed to improve ASCVD risk estimation. Objective To determine whether genome-wide PRSs for coronary artery disease (CAD) and acute ischemic stroke improve ASCVD risk estimation with traditional clinical risk factors in an ancestrally diverse midlife population. Design, Setting, and Participants This was a prognostic analysis of incident events in a retrospectively defined longitudinal cohort conducted from January 1, 2011, to December 31, 2018. Included in the study were adults free of ASCVD and statin naive at baseline from the Million Veteran Program (MVP), a mega biobank with genetic, survey, and electronic health record data from a large US health care system. Data were analyzed from March 15, 2021, to January 5, 2023. Exposures PRSs for CAD and ischemic stroke derived from cohorts of largely European descent and risk factors, including age, sex, systolic blood pressure, total cholesterol, high-density lipoprotein (HDL) cholesterol, smoking, and diabetes status. Main Outcomes and Measures Incident nonfatal myocardial infarction (MI), ischemic stroke, ASCVD death, and composite ASCVD events. Results A total of 79 151 participants (mean [SD] age, 57.8 [13.7] years; 68 503 male [86.5%]) were included in the study. The cohort included participants from the following harmonized genetic ancestry and race and ethnicity categories: 18 505 non-Hispanic Black (23.4%), 6785 Hispanic (8.6%), and 53 861 non-Hispanic White (68.0%) with a median (5th-95th percentile) follow-up of 4.3 (0.7-6.9) years. From 2011 to 2018, 3186 MIs (4.0%), 1933 ischemic strokes (2.4%), 867 ASCVD deaths (1.1%), and 5485 composite ASCVD events (6.9%) were observed. CAD PRS was associated with incident MI in non-Hispanic Black (hazard ratio [HR], 1.10; 95% CI, 1.02-1.19), Hispanic (HR, 1.26; 95% CI, 1.09-1.46), and non-Hispanic White (HR, 1.23; 95% CI, 1.18-1.29) participants. Stroke PRS was associated with incident stroke in non-Hispanic White participants (HR, 1.15; 95% CI, 1.08-1.21). A combined CAD plus stroke PRS was associated with ASCVD deaths among non-Hispanic Black (HR, 1.19; 95% CI, 1.03-1.17) and non-Hispanic (HR, 1.11; 95% CI, 1.03-1.21) participants. The combined PRS was also associated with composite ASCVD across all ancestry groups but greater among non-Hispanic White (HR, 1.20; 95% CI, 1.16-1.24) than non-Hispanic Black (HR, 1.11; 95% CI, 1.05-1.17) and Hispanic (HR, 1.12; 95% CI, 1.00-1.25) participants. Net reclassification improvement from adding PRS to a traditional risk model was modest for the intermediate risk group for composite CVD among men (5-year risk >3.75%, 0.38%; 95% CI, 0.07%-0.68%), among women, (6.79%; 95% CI, 3.01%-10.58%), for age older than 55 years (0.25%; 95% CI, 0.03%-0.47%), and for ages 40 to 55 years (1.61%; 95% CI, -0.07% to 3.30%). Conclusions and Relevance Study results suggest that PRSs derived predominantly in European samples were statistically significantly associated with ASCVD in the multiancestry midlife and older-age MVP cohort. Overall, modest improvement in discrimination metrics were observed with addition of PRSs to traditional risk factors with greater magnitude in women and younger age groups.
Collapse
Affiliation(s)
- Jason L. Vassy
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
- Department of Medicine, Brigham & Women’s Hospital and Harvard Medical School, Boston, Massachusetts
| | - Daniel C. Posner
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
| | - Yuk-Lam Ho
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
| | - David R. Gagnon
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
- Department of Biostatistics, Boston University School of Public Health, Boston, Massachusetts
| | - Ashley Galloway
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
| | | | | | - Ravi K. Madduri
- Data Science and Learning Division, Argonne National Laboratory, Lemont, Illinois
- University of Chicago Consortium for Advanced Science and Engineering, The University of Chicago, Chicago, Illinois
| | - Benjamin H. McMahon
- Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, New Mexico
| | - Philip S. Tsao
- Palo Alto VA Healthcare System, Palo Alto, California
- Stanford Cardiovascular Institute, Stanford University, Stanford, California
| | - Scott M. Damrauer
- Corporal Michael J. Crescenz VA Medical Center, Philadelphia, Pennsylvania
- Department of Surgery, University of Pennsylvania Perelman School of Medicine, Philadelphia
| | | | - Themistocles L. Assimes
- Palo Alto VA Healthcare System, Palo Alto, California
- Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, California
- Stanford Cardiovascular Institute, Stanford University, Stanford, California
| | - Juan P. Casas
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
- Department of Medicine, Brigham & Women’s Hospital and Harvard Medical School, Boston, Massachusetts
| | - J. Michael Gaziano
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
- Division of Aging, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts
| | - Michael J. Pencina
- Department of Biostatistics, Duke University Medical Center, Durham, North Carolina
| | - Yan V. Sun
- Veterans Affairs Atlanta Healthcare System, Decatur, Georgia
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, Georgia
| | - Kelly Cho
- Veterans Affairs Boston Healthcare System, Boston, Massachusetts
- Department of Medicine, Brigham & Women’s Hospital and Harvard Medical School, Boston, Massachusetts
| | - Peter W.F. Wilson
- Veterans Affairs Atlanta Healthcare System, Decatur, Georgia
- Division of Cardiology, Emory University School of Medicine, Atlanta, Georgia
- Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, Georgia
| |
Collapse
|
8
|
Herrera-Luis E, Forno E, Celedón JC, Pino-Yanes M. Asthma Exacerbations: The Genes Behind the Scenes. J Investig Allergol Clin Immunol 2023; 33:76-94. [PMID: 36420738 PMCID: PMC10638677 DOI: 10.18176/jiaci.0878] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
The clinical and socioeconomic burden of asthma exacerbations (AEs) constitutes a major public health problem. In the last 4 years, there has been an increase in ethnic diversity in candidate-gene and genome-wide association studies of AEs, which in the latter case led to the identification of novel genes and underlying pathobiological processes. Pharmacogenomics, admixture mapping analyses, and the combination of multiple "omics" layers have helped to prioritize genomic regions of interest and/or facilitated our understanding of the functional consequences of genetic variation. Nevertheless, the field still lags behind the genomics of asthma, where a vast compendium of genetic approaches has been used (eg, gene-environment nteractions, next-generation sequencing, and polygenic risk scores). Furthermore, the roles of the DNA methylome and histone modifications in AEs have received little attention, and microRNA findings remain to be validated in independent studies. Likewise, the most recent transcriptomic studies highlight the importance of the host-airway microbiome interaction in the modulation of risk of AEs. Leveraging -omics and deep-phenotyping data from subtypes or homogenous subgroups of patients will be crucial if we are to overcome the inherent heterogeneity of AEs, boost the identification of potential therapeutic targets, and implement precision medicine approaches to AEs in clinical practice.
Collapse
Affiliation(s)
- E Herrera-Luis
- Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna (ULL), La Laguna, Tenerife, Spain
| | - E Forno
- Division of Pediatric Pulmonary Medicine, UPMC Children´s Hospital of Pittsburgh, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - J C Celedón
- Division of Pediatric Pulmonary Medicine, UPMC Children´s Hospital of Pittsburgh, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - M Pino-Yanes
- Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna (ULL), La Laguna, Tenerife, Spain
- CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain 4 Instituto de Tecnologías Biomédicas (ITB), Universidad de La Laguna (ULL), La Laguna, Tenerife, Spain
- Instituto de Tecnologías Biomédicas (ITB), Universidad de La Laguna (ULL), La Laguna, Tenerife, Spain
| |
Collapse
|
9
|
Bhattacharya A, Hirbo JB, Zhou D, Zhou W, Zheng J, Kanai M, Pasaniuc B, Gamazon ER, Cox NJ. Best practices for multi-ancestry, meta-analytic transcriptome-wide association studies: Lessons from the Global Biobank Meta-analysis Initiative. CELL GENOMICS 2022; 2:100180. [PMID: 36341024 PMCID: PMC9631681 DOI: 10.1016/j.xgen.2022.100180] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The Global Biobank Meta-analysis Initiative (GBMI), through its diversity, provides a valuable opportunity to study population-wide and ancestry-specific genetic associations. However, with multiple ascertainment strategies and multi-ancestry study populations across biobanks, GBMI presents unique challenges in implementing statistical genetics methods. Transcriptome-wide association studies (TWASs) boost detection power for and provide biological context to genetic associations by integrating genetic variant-to-trait associations from genome-wide association studies (GWASs) with predictive models of gene expression. TWASs present unique challenges beyond GWASs, especially in a multi-biobank, meta-analytic setting. Here, we present the GBMI TWAS pipeline, outlining practical considerations for ancestry and tissue specificity, meta-analytic strategies, and open challenges at every step of the framework. We advise conducting ancestry-stratified TWASs using ancestry-specific expression models and meta-analyzing results using inverse-variance weighting, showing the least test statistic inflation. Our work provides a foundation for adding transcriptomic context to biobank-linked GWASs, allowing for ancestry-aware discovery to accelerate genomic medicine.
Collapse
Affiliation(s)
- Arjun Bhattacharya
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA,Institute of Quantitative and Computational Biosciences, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA,Corresponding author
| | - Jibril B. Hirbo
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA,Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Dan Zhou
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA,Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Wei Zhou
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA,Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA,Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Jie Zheng
- MRC Integrative Epidemiology Unit (IEU), Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol BS8 2BN, UK
| | - Masahiro Kanai
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA,Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA,Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA,Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita 565-0871, Japan
| | | | - Bogdan Pasaniuc
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA,Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA,Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Eric R. Gamazon
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA,Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA,MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Nancy J. Cox
- Department of Medicine, Division of Genetic Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA,Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| |
Collapse
|
10
|
Lu Z, Gopalan S, Yuan D, Conti DV, Pasaniuc B, Gusev A, Mancuso N. Multi-ancestry fine-mapping improves precision to identify causal genes in transcriptome-wide association studies. Am J Hum Genet 2022; 109:1388-1404. [PMID: 35931050 PMCID: PMC9388396 DOI: 10.1016/j.ajhg.2022.07.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 06/30/2022] [Indexed: 02/06/2023] Open
Abstract
Transcriptome-wide association studies (TWASs) are a powerful approach to identify genes whose expression is associated with complex disease risk. However, non-causal genes can exhibit association signals due to confounding by linkage disequilibrium (LD) patterns and eQTL pleiotropy at genomic risk regions, which necessitates fine-mapping of TWAS signals. Here, we present MA-FOCUS, a multi-ancestry framework for the improved identification of genes underlying traits of interest. We demonstrate that by leveraging differences in ancestry-specific patterns of LD and eQTL signals, MA-FOCUS consistently outperforms single-ancestry fine-mapping approaches with equivalent total sample sizes across multiple metrics. We perform TWASs for 15 blood traits using genome-wide summary statistics (average nEA = 511 k, nAA = 13 k) and lymphoblastoid cell line eQTL data from cohorts of primarily European and African continental ancestries. We recapitulate evidence demonstrating shared genetic architectures for eQTL and blood traits between the two ancestry groups and observe that gene-level effects correlate 20% more strongly across ancestries than SNP-level effects. Lastly, we perform fine-mapping using MA-FOCUS and find evidence that genes at TWAS risk regions are more likely to be shared across ancestries than they are to be ancestry specific. Using multiple lines of evidence to validate our findings, we find that gene sets produced by MA-FOCUS are more enriched in hematopoietic categories than alternative approaches (p = 2.36 × 10-15). Our work demonstrates that including and appropriately accounting for genetic diversity can drive more profound insights into the genetic architecture of complex traits.
Collapse
Affiliation(s)
- Zeyun Lu
- Biostatistics Division, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA.
| | - Shyamalika Gopalan
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Department of Evolutionary Anthropology, Duke University, Durham, NC, USA
| | - Dong Yuan
- Biostatistics Division, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA
| | - David V Conti
- Biostatistics Division, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Bogdan Pasaniuc
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Alexander Gusev
- Division of Population Sciences, Dana-Farber Cancer Institute & Harvard Medical School, Boston, MA, USA; Division of Genetics, Brigham & Women's Hospital, Boston, MA, USA; Program in Medical and Population Genetics, The Broad Institute, Cambridge, MA, USA
| | - Nicholas Mancuso
- Biostatistics Division, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|