1
|
Wang DC, Xu WD, Wang SN, Wang X, Leng W, Fu L, Liu XY, Qin Z, Huang AF. Lupus nephritis or not? A simple and clinically friendly machine learning pipeline to help diagnosis of lupus nephritis. Inflamm Res 2023:10.1007/s00011-023-01755-7. [PMID: 37300586 DOI: 10.1007/s00011-023-01755-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 05/17/2023] [Accepted: 05/30/2023] [Indexed: 06/12/2023] Open
Abstract
OBJECTIVE Diagnosis of lupus nephritis (LN) is a complex process, which usually requires renal biopsy. We aim to establish a machine learning pipeline to help diagnosis of LN. METHODS A cohort of 681 systemic lupus erythematosus (SLE) patients without LN and 786 SLE patients with LN was established, and a total of 95 clinical, laboratory data and 17 meteorological indicators were collected. After tenfold cross-validation, the patients were divided into training set and test set. The features selected by collective feature selection method of mutual information (MI) and multisurf were used to construct the models of logistic regression, decision tree, random forest, naive Bayes, support vector machine (SVM), light gradient boosting (LGB), extreme gradient boosting (XGB), and artificial neural network (ANN), the models were compared and verified in post-analysis. RESULTS Collective feature selection method screens out antistreptolysin (ASO), retinol binding protein (RBP), lupus anticoagulant 1 (LA1), LA2, proteinuria and other features, and the hyperparameter optimized XGB (ROC: AUC = 0.995; PRC: AUC = 1.000, APS = 1.000; balance accuracy: 0.990) has the best performance, followed by LGB (ROC: AUC = 0.992; PRC: AUC = 0.997, APS = 0.977; balance accuracy: 0.957). The worst performance is naive Bayes model (ROC: AUC = 0.799; PRC: AUC = 0.822, APS = 0.823; balance accuracy: 0.693). In the composite feature importance bar plots, ASO, RF, Up/Ucr, and other features play important roles in LN. CONCLUSION We developed and validated a new and simple machine learning pathway for diagnosis of LN, especially the XGB model based on ASO, LA1, LA2, proteinuria, and other features screened out by collective feature selection.
Collapse
Affiliation(s)
- Da-Cheng Wang
- Department of Evidence-Based Medicine, Southwest Medical University, 1 Xianglin Road, Luzhou, Sichuan, China
| | - Wang-Dong Xu
- Department of Evidence-Based Medicine, Southwest Medical University, 1 Xianglin Road, Luzhou, Sichuan, China
| | - Shen-Nan Wang
- Luzhou Meteorological Bureau, 3 Songshan Road, Luzhou, Sichuan, China
| | - Xiang Wang
- Luzhou Meteorological Bureau, 3 Songshan Road, Luzhou, Sichuan, China
| | - Wei Leng
- Luzhou Meteorological Bureau, 3 Songshan Road, Luzhou, Sichuan, China
| | - Lu Fu
- Laboratory Animal Center, Southwest Medical University, 1 Xianglin Road, Luzhou, Sichuan, China
| | - Xiao-Yan Liu
- Department of Evidence-Based Medicine, Southwest Medical University, 1 Xianglin Road, Luzhou, Sichuan, China
| | - Zhen Qin
- Department of Rheumatology and Immunology, Affiliated Hospital of Southwest Medical University, 25 Taiping Road, Luzhou, Sichuan, China
| | - An-Fang Huang
- Department of Rheumatology and Immunology, Affiliated Hospital of Southwest Medical University, 25 Taiping Road, Luzhou, Sichuan, China.
| |
Collapse
|
2
|
Zarzar TG, Lee B, Coughlin R, Kim D, Shen L, Hall MA. Sex Differences in the Metabolome of Alzheimer's Disease Progression. FRONTIERS IN RADIOLOGY 2022; 2:782864. [PMID: 35445209 PMCID: PMC9014653 DOI: 10.3389/fradi.2022.782864] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Alzheimer's disease (AD) is the leading cause of dementia; however, men and women face differential AD prevalence, presentation, and progression risks. Characterizing metabolomic profiles during AD progression is fundamental to understand the metabolic disruptions and the biological pathways involved. However, outstanding questions remain of whether peripheral metabolic changes occur equally in men and women with AD. Here, we evaluated differential effects of metabolomic and brain volume associations between sexes. We used three cohorts from the Alzheimer's Disease Neuroimaging Initiative (ADNI), evaluated 1,368 participants, two metabolomic platforms with 380 metabolites in total, and six brain segment volumes. Using dimension reduction techniques, we took advantage of the correlation structure of the brain volume phenotypes and the metabolite concentration values to reduce the number of tests while aggregating relevant biological structures. Using WGCNA, we aggregated modules of highly co-expressed metabolites. On the other hand, we used partial least squares regression-discriminant analysis (PLS-DA) to extract components of brain volumes that maximally co-vary with AD diagnosis as phenotypes. We tested for differences in effect sizes between sexes in the association between single metabolite and metabolite modules with the brain volume components. We found five metabolite modules and 125 single metabolites with significant differences between sexes. These results highlight a differential lipid disruption in AD progression between sexes. Men showed a greater negative association of phosphatidylcholines and sphingomyelins and a positive association of VLDL and large LDL with AD progression. In contrast, women showed a positive association of triglycerides in VLDL and small and medium LDL with AD progression. Explicitly identifying sex differences in metabolomics during AD progression can highlight particular metabolic disruptions in each sex. Our research study and strategy can lead to better-tailored studies and better-suited treatments that take sex differences into account.
Collapse
Affiliation(s)
- Tomás González Zarzar
- Department of Veterinary and Biomedical Sciences, College of Agricultural Sciences, The Pennsylvania State University, University Park, PA, United States.,Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States
| | - Brian Lee
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Rory Coughlin
- Department of Veterinary and Biomedical Sciences, College of Agricultural Sciences, The Pennsylvania State University, University Park, PA, United States
| | - Dokyoon Kim
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Li Shen
- Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Molly A Hall
- Department of Veterinary and Biomedical Sciences, College of Agricultural Sciences, The Pennsylvania State University, University Park, PA, United States.,Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA, United States.,Penn State Cancer Institute, The Pennsylvania State University, University Park, PA, United States
| |
Collapse
|
3
|
Escriba-Montagut X, Basagaña X, Vrijheid M, Gonzalez JR. Software Application Profile: exposomeShiny—a toolbox for exposome data analysis. Int J Epidemiol 2021. [PMCID: PMC8855999 DOI: 10.1093/ije/dyab220] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Motivation Studying the role of the exposome in human health and its impact on different omic layers requires advanced statistical methods. Many of these methods are implemented in different R and Bioconductor packages, but their use may require strong expertise in R, in writing pipelines and in using new R classes which may not be familiar to non-advanced users. ExposomeShiny provides a bridge between researchers and most of the state-of-the-art exposome analysis methodologies, without the need of advanced programming skills. Implementation ExposomeShiny is a standalone web application implemented in R. It is available as source files and can be installed in any server or computer avoiding problems with data confidentiality. It is executed in RStudio which opens a browser window with the web application. General features The presented implementation allows the conduct of: (i) data pre-processing: normalization and missing imputation (including limit of detection); (ii) descriptive analysis; (iii) exposome principal component analysis (PCA) and hierarchical clustering; (iv) exposome-wide association studies (ExWAS) and variable selection ExWAS; (v) omic data integration by single association and multi-omic analyses; and (vi) post-exposome data analyses to gain biological insight for the exposures, genes or using the Comparative Toxicogenomics Database (CTD) and pathway analysis. Availability The exposomeShiny source code is freely available on Github at [https://github.com/isglobal-brge/exposomeShiny], Git tag v1.4. The software is also available as a Docker image [https://hub.docker.com/r/brgelab/exposome-shiny], tag v1.4. A user guide with information about the analysis methodologies as well as information on how to use exposomeShiny is freely hosted at [https://isglobal-brge.github.io/exposome_bookdown/].
Collapse
Affiliation(s)
| | - Xavier Basagaña
- Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Martine Vrijheid
- Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Centro de Investigación Biomédica en Red en Epidemiología y Salud Pública (CIBERESP), Barcelona, Spain
- Department of Mathematics, Universitat Autònoma de Barcelona (UAB), Bellaterra (Barcelona), Spain
| | - Juan R Gonzalez
- Barcelona Institute for Global Health (ISGlobal), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Centro de Investigación Biomédica en Red en Epidemiología y Salud Pública (CIBERESP), Barcelona, Spain
- Department of Mathematics, Universitat Autònoma de Barcelona (UAB), Bellaterra (Barcelona), Spain
- Corresponding author. Barcelona Biomedical Research Park (PRBB), Doctor Aiguader, 88. 08003 Barcelona, Spain. E-mail:
| |
Collapse
|
4
|
Wang L, Zhang X, Meng X, Koskeridis F, Georgiou A, Yu L, Campbell H, Theodoratou E, Li X. Methodology in phenome-wide association studies: a systematic review. J Med Genet 2021; 58:720-728. [PMID: 34272311 DOI: 10.1136/jmedgenet-2021-107696] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Accepted: 05/27/2021] [Indexed: 11/04/2022]
Abstract
Phenome-wide association study (PheWAS) has been increasingly used to identify novel genetic associations across a wide spectrum of phenotypes. This systematic review aims to summarise the PheWAS methodology, discuss the advantages and challenges of PheWAS, and provide potential implications for future PheWAS studies. Medical Literature Analysis and Retrieval System Online (MEDLINE) and Excerpta Medica Database (EMBASE) databases were searched to identify all published PheWAS studies up until 24 April 2021. The PheWAS methodology incorporating how to perform PheWAS analysis and which software/tool could be used, were summarised based on the extracted information. A total of 1035 studies were identified and 195 eligible articles were finally included. Among them, 137 (77.0%) contained 10 000 or more study participants, 164 (92.1%) defined the phenome based on electronic medical records data, 140 (78.7%) used genetic variants as predictors, and 73 (41.0%) conducted replication analysis to validate PheWAS findings and almost all of them (94.5%) received consistent results. The methodology applied in these PheWAS studies was dissected into several critical steps, including quality control of the phenome, selecting predictors, phenotyping, statistical analysis, interpretation and visualisation of PheWAS results, and the workflow for performing a PheWAS was established with detailed instructions on each step. This study provides a comprehensive overview of PheWAS methodology to help practitioners achieve a better understanding of the PheWAS design, to detect understudied or overstudied outcomes, and to direct their research by applying the most appropriate software and online tools for their study data structure.
Collapse
Affiliation(s)
- Lijuan Wang
- School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Xiaomeng Zhang
- Centre for Global Health, The University of Edinburgh Usher Institute of Population Health Sciences and Informatics, Edinburgh, UK
| | - Xiangrui Meng
- Vanke School of Public Health, Tsinghua University, Beijing, China
| | - Fotios Koskeridis
- Department of Hygiene and Epidemiology, University of Ioannina, Ioannina, Epirus, Greece
| | - Andrea Georgiou
- Department of Hygiene and Epidemiology, University of Ioannina, Ioannina, Epirus, Greece
| | - Lili Yu
- School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Harry Campbell
- Centre for Global Health, The University of Edinburgh Usher Institute of Population Health Sciences and Informatics, Edinburgh, UK
| | - Evropi Theodoratou
- Centre for Global Health, The University of Edinburgh Usher Institute of Population Health Sciences and Informatics, Edinburgh, UK.,Cancer Research UK Edinburgh Centre, The University of Edinburgh MRC Institute of Genetics and Molecular Medicine, Edinburgh, UK
| | - Xue Li
- School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| |
Collapse
|
5
|
Li B, Veturi Y, Verma A, Bradford Y, Daar ES, Gulick RM, Riddler SA, Robbins GK, Lennox JL, Haas DW, Ritchie MD. Tissue specificity-aware TWAS (TSA-TWAS) framework identifies novel associations with metabolic, immunologic, and virologic traits in HIV-positive adults. PLoS Genet 2021; 17:e1009464. [PMID: 33901188 PMCID: PMC8102009 DOI: 10.1371/journal.pgen.1009464] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 05/06/2021] [Accepted: 03/03/2021] [Indexed: 01/01/2023] Open
Abstract
As a type of relatively new methodology, the transcriptome-wide association study (TWAS) has gained interest due to capacity for gene-level association testing. However, the development of TWAS has outpaced statistical evaluation of TWAS gene prioritization performance. Current TWAS methods vary in underlying biological assumptions about tissue specificity of transcriptional regulatory mechanisms. In a previous study from our group, this may have affected whether TWAS methods better identified associations in single tissues versus multiple tissues. We therefore designed simulation analyses to examine how the interplay between particular TWAS methods and tissue specificity of gene expression affects power and type I error rates for gene prioritization. We found that cross-tissue identification of expression quantitative trait loci (eQTLs) improved TWAS power. Single-tissue TWAS (i.e., PrediXcan) had robust power to identify genes expressed in single tissues, but, often found significant associations in the wrong tissues as well (therefore had high false positive rates). Cross-tissue TWAS (i.e., UTMOST) had overall equal or greater power and controlled type I error rates for genes expressed in multiple tissues. Based on these simulation results, we applied a tissue specificity-aware TWAS (TSA-TWAS) analytic framework to look for gene-based associations with pre-treatment laboratory values from AIDS Clinical Trial Group (ACTG) studies. We replicated several proof-of-concept transcriptionally regulated gene-trait associations, including UGT1A1 (encoding bilirubin uridine diphosphate glucuronosyltransferase enzyme) and total bilirubin levels (p = 3.59×10−12), and CETP (cholesteryl ester transfer protein) with high-density lipoprotein cholesterol (p = 4.49×10−12). We also identified several novel genes associated with metabolic and virologic traits, as well as pleiotropic genes that linked plasma viral load, absolute basophil count, and/or triglyceride levels. By highlighting the advantages of different TWAS methods, our simulation study promotes a tissue specificity-aware TWAS analytic framework that revealed novel aspects of HIV-related traits. Transcriptome-wide association studies (TWAS) are a type of bioinformatics methodology for identifying complex trait-associated genes. There have been various TWAS methods, each developed under distinct biological assumptions of how genes contribute to complex traits. It is unclear, however, how powerful different TWAS methods are under a variety of biological scenarios. Here, we design an unbiased simulation strategy to evaluate the performance of multiple representative TWAS methods. We find that no one method fits all. Different TWAS methods are advantageous at dealing with different biological scenarios and answering different research questions. Thus, we propose a novel TWAS analytic framework that integrates and maximizes the performance of multiple TWAS methods, and validate its capability using a well-studied real-world dataset. In summary, our study provides quantitative evaluation of method performance to aid future TWAS experimental design and understanding of genes underlying complex human traits. The TWAS evaluation tool is made publicly available.
Collapse
Affiliation(s)
- Binglan Li
- Department of Biomedical Data Science, Stanford University, Stanford, California, United States of America
| | - Yogasudha Veturi
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Anurag Verma
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Yuki Bradford
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Eric S. Daar
- Lundquist Institute at Harbor-UCLA Medical Center, Torrance, California, United States of America
| | - Roy M. Gulick
- Weill Cornell Medicine, New York City, New York, United States of America
| | - Sharon A. Riddler
- Department of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Gregory K. Robbins
- Division of Infectious Diseases, Massachusetts General Hospital, Boston, Massachusetts, United States of America
| | - Jeffrey L. Lennox
- Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - David W. Haas
- Departments of Medicine, Pharmacology, Pathology, Microbiology & Immunology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Internal Medicine, Meharry Medical College, Nashville, Tennessee, United States of America
| | - Marylyn D. Ritchie
- Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
6
|
Dennis JK, Sealock JM, Straub P, Lee YH, Hucks D, Actkins K, Faucon A, Feng YCA, Ge T, Goleva SB, Niarchou M, Singh K, Morley T, Smoller JW, Ruderfer DM, Mosley JD, Chen G, Davis LK. Clinical laboratory test-wide association scan of polygenic scores identifies biomarkers of complex disease. Genome Med 2021; 13:6. [PMID: 33441150 PMCID: PMC7807864 DOI: 10.1186/s13073-020-00820-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 12/08/2020] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Clinical laboratory (lab) tests are used in clinical practice to diagnose, treat, and monitor disease conditions. Test results are stored in electronic health records (EHRs), and a growing number of EHRs are linked to patient DNA, offering unprecedented opportunities to query relationships between genetic risk for complex disease and quantitative physiological measurements collected on large populations. METHODS A total of 3075 quantitative lab tests were extracted from Vanderbilt University Medical Center's (VUMC) EHR system and cleaned for population-level analysis according to our QualityLab protocol. Lab values extracted from BioVU were compared with previous population studies using heritability and genetic correlation analyses. We then tested the hypothesis that polygenic risk scores for biomarkers and complex disease are associated with biomarkers of disease extracted from the EHR. In a proof of concept analyses, we focused on lipids and coronary artery disease (CAD). We cleaned lab traits extracted from the EHR performed lab-wide association scans (LabWAS) of the lipids and CAD polygenic risk scores across 315 heritable lab tests then replicated the pipeline and analyses in the Massachusetts General Brigham Biobank. RESULTS Heritability estimates of lipid values (after cleaning with QualityLab) were comparable to previous reports and polygenic scores for lipids were strongly associated with their referent lipid in a LabWAS. LabWAS of the polygenic score for CAD recapitulated canonical heart disease biomarker profiles including decreased HDL, increased pre-medication LDL, triglycerides, blood glucose, and glycated hemoglobin (HgbA1C) in European and African descent populations. Notably, many of these associations remained even after adjusting for the presence of cardiovascular disease and were replicated in the MGBB. CONCLUSIONS Polygenic risk scores can be used to identify biomarkers of complex disease in large-scale EHR-based genomic analyses, providing new avenues for discovery of novel biomarkers and deeper understanding of disease trajectories in pre-symptomatic individuals. We present two methods and associated software, QualityLab and LabWAS, to clean and analyze EHR labs at scale and perform a Lab-Wide Association Scan.
Collapse
Affiliation(s)
- Jessica K Dennis
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, V5Z 4H4, Canada
| | - Julia M Sealock
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Peter Straub
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Younga H Lee
- Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, 02114, USA
- Department of Psychiatry, Harvard Medical School, Boston, MA, 02115, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Donald Hucks
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Ky'Era Actkins
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Department of Microbiology, Immunology, and Physiology, Meharry Medical College, Nashville, TN, 37232, USA
| | - Annika Faucon
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Yen-Chen Anne Feng
- Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, 02114, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Tian Ge
- Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, 02114, USA
- Department of Psychiatry, Harvard Medical School, Boston, MA, 02115, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Slavina B Goleva
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Maria Niarchou
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Kritika Singh
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Theodore Morley
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Jordan W Smoller
- Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, 02114, USA
- Department of Psychiatry, Harvard Medical School, Boston, MA, 02115, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - Douglas M Ruderfer
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
- Departments of Medicine and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Jonathan D Mosley
- Departments of Medicine and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
| | - Guanhua Chen
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Lea K Davis
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
- Department of Molecular Physiology and Biophysics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
- Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
- Departments of Medicine and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
- Division of Genetic Medicine, Department of Medicine, Vanderbilt Genetics Institute, Vanderbilt University, 511-A Light Hall, 2215 Garland Ave, Nashville, TN, 37232, USA.
| |
Collapse
|
7
|
Passero K, Setia-Verma S, McAllister K, Manrai A, Patel C, Hall M. What about the environment? Leveraging multi-omic datasets to characterize the environment's role in human health. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2021; 26:309-315. [PMID: 34409132 PMCID: PMC8323787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
The environment plays an important role in mediating human health. In this session we consider research addressing ways to overcome the challenges associated with studying the multifaceted and ever-changing environment. Environmental health research has a need for technological and methodological advances which will further our knowledge of how exposures precipitate complex phenotypes and exacerbate disease.
Collapse
Affiliation(s)
- Kristin Passero
- Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802
| | - Shefali Setia-Verma
- Department of Genetics and Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA 19104
| | - Kimberly McAllister
- Genes, Environment, and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, P.O. Box 12233 (MD EC-21), Research Triangle Park, NC 27709
| | - Arjun Manrai
- Computational Health Informatics Program, Boston Children’s Hospital, Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115
| | - Chirag Patel
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115
| | - Molly Hall
- Department of Veterinary and Biomedical Sciences, The Pennsylvania State University, University Park, PA 16802
| |
Collapse
|
8
|
Investigation of gene-gene interactions in cardiac traits and serum fatty acid levels in the LURIC Health Study. PLoS One 2020; 15:e0238304. [PMID: 32915819 PMCID: PMC7485803 DOI: 10.1371/journal.pone.0238304] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 08/13/2020] [Indexed: 01/25/2023] Open
Abstract
Epistasis analysis elucidates the effects of gene-gene interactions (G×G) between multiple loci for complex traits. However, the large computational demands and the high multiple testing burden impede their discoveries. Here, we illustrate the utilization of two methods, main effect filtering based on individual GWAS results and biological knowledge-based modeling through Biofilter software, to reduce the number of interactions tested among single nucleotide polymorphisms (SNPs) for 15 cardiac-related traits and 14 fatty acids. We performed interaction analyses using the two filtering methods, adjusting for age, sex, body mass index (BMI), waist-hip ratio, and the first three principal components from genetic data, among 2,824 samples from the Ludwigshafen Risk and Cardiovascular (LURIC) Health Study. Using Biofilter, one interaction nearly met Bonferroni significance: an interaction between rs7735781 in XRCC4 and rs10804247 in XRCC5 was identified for venous thrombosis with a Bonferroni-adjusted likelihood ratio test (LRT) p: 0.0627. A total of 57 interactions were identified from main effect filtering for the cardiac traits G×G (10) and fatty acids G×G (47) at Bonferroni-adjusted LRT p < 0.05. For cardiac traits, the top interaction involved SNPs rs1383819 in SNTG1 and rs1493939 (138kb from 5’ of SAMD12) with Bonferroni-adjusted LRT p: 0.0228 which was significantly associated with history of arterial hypertension. For fatty acids, the top interaction between rs4839193 in KCND3 and rs10829717 in LOC107984002 with Bonferroni-adjusted LRT p: 2.28×10−5 was associated with 9-trans 12-trans octadecanoic acid, an omega-6 trans fatty acid. The model inflation factor for the interactions under different filtering methods was evaluated from the standard median and the linear regression approach. Here, we applied filtering approaches to identify numerous genetic interactions related to cardiac-related outcomes as potential targets for therapy. The approaches described offer ways to detect epistasis in the complex traits and to improve precision medicine capability.
Collapse
|
9
|
O'Shea K, Misra BB. Software tools, databases and resources in metabolomics: updates from 2018 to 2019. Metabolomics 2020; 16:36. [PMID: 32146531 DOI: 10.1007/s11306-020-01657-3] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 03/01/2020] [Indexed: 12/24/2022]
Abstract
Metabolomics has evolved as a discipline from a discovery and functional genomics tool, and is now a cornerstone in the era of big data-driven precision medicine. Sample preparation strategies and analytical technologies have seen enormous growth, and keeping pace with data analytics is challenging, to say the least. This review introduces and briefly presents around 100 metabolomics software resources, tools, databases, and other utilities that have surfaced or have improved in 2019. Table 1 provides the computational dependencies of the tools, categorizes the resources based on utility and ease of use, and provides hyperlinks to webpages where the tools can be downloaded or used. This review intends to keep the community of metabolomics researchers up to date with all the software tools, resources, and databases developed in 2019, in one place.
Collapse
Affiliation(s)
- Keiron O'Shea
- Institute of Biological, Environmental, and Rural Studies, Aberystwyth University, Ceredigion, Wales, SY23 3DA, UK
| | - Biswapriya B Misra
- Center for Precision Medicine, Department of Internal Medicine, Section of Molecular Medicine, Wake Forest School of Medicine, Medical Center Boulevard, Winston-Salem, NC, 27157, USA.
| |
Collapse
|