1
|
Motsinger-Reif AA, Reif DM, Akhtari FS, House JS, Campbell CR, Messier KP, Fargo DC, Bowen TA, Nadadur SS, Schmitt CP, Pettibone KG, Balshaw DM, Lawler CP, Newton SA, Collman GW, Miller AK, Merrick BA, Cui Y, Anchang B, Harmon QE, McAllister KA, Woychik R. Gene-environment interactions within a precision environmental health framework. CELL GENOMICS 2024; 4:100591. [PMID: 38925123 PMCID: PMC11293590 DOI: 10.1016/j.xgen.2024.100591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 03/26/2024] [Accepted: 06/02/2024] [Indexed: 06/28/2024]
Abstract
Understanding the complex interplay of genetic and environmental factors in disease etiology and the role of gene-environment interactions (GEIs) across human development stages is important. We review the state of GEI research, including challenges in measuring environmental factors and advantages of GEI analysis in understanding disease mechanisms. We discuss the evolution of GEI studies from candidate gene-environment studies to genome-wide interaction studies (GWISs) and the role of multi-omics in mediating GEI effects. We review advancements in GEI analysis methods and the importance of large-scale datasets. We also address the translation of GEI findings into precision environmental health (PEH), showcasing real-world applications in healthcare and disease prevention. Additionally, we highlight societal considerations in GEI research, including environmental justice, the return of results to participants, and data privacy. Overall, we underscore the significance of GEI for disease prediction and prevention and advocate for integrating the exposome into PEH omics studies.
Collapse
Affiliation(s)
- Alison A Motsinger-Reif
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA.
| | - David M Reif
- Predictive Toxicology Branch, Division of Translational Toxicology, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Farida S Akhtari
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - John S House
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - C Ryan Campbell
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Kyle P Messier
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA; Predictive Toxicology Branch, Division of Translational Toxicology, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - David C Fargo
- Office of the Director, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Tiffany A Bowen
- Office of the Director, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Srikanth S Nadadur
- Exposure, Response, and Technology Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Charles P Schmitt
- Office of the Scientific Director, Office of Data Science, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Kristianna G Pettibone
- Program Analysis Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - David M Balshaw
- Office of the Director, National Institute of Environmental Health Sciences, Durham, NC, USA; Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Cindy P Lawler
- Genes, Environment, and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Shelia A Newton
- Office of Scientific Coordination, Planning and Evaluation, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Gwen W Collman
- Office of the Director, National Institute of Environmental Health Sciences, Durham, NC, USA; Office of Scientific Coordination, Planning and Evaluation, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Aubrey K Miller
- Office of Scientific Coordination, Planning and Evaluation, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - B Alex Merrick
- Mechanistic Toxicology Branch, Division of Translational Toxicology, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Yuxia Cui
- Exposure, Response, and Technology Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Benedict Anchang
- Biostatistics and Computational Biology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Quaker E Harmon
- Epidemiology Branch, Division of Intramural Research, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Kimberly A McAllister
- Genes, Environment, and Health Branch, Division of Extramural Research and Training, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Rick Woychik
- Office of the Director, National Institute of Environmental Health Sciences, Durham, NC, USA
| |
Collapse
|
2
|
Singhal P, Verma SS, Ritchie MD. Gene Interactions in Human Disease Studies-Evidence Is Mounting. Annu Rev Biomed Data Sci 2023; 6:377-395. [PMID: 37196359 DOI: 10.1146/annurev-biodatasci-102022-120818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
Despite monumental advances in molecular technology to generate genome sequence data at scale, there is still a considerable proportion of heritability in most complex diseases that remains unexplained. Because many of the discoveries have been single-nucleotide variants with small to moderate effects on disease, the functional implication of many of the variants is still unknown and, thus, we have limited new drug targets and therapeutics. We, and many others, posit that one primary factor that has limited our ability to identify novel drug targets from genome-wide association studies may be due to gene interactions (epistasis), gene-environment interactions, network/pathway effects, or multiomic relationships. We propose that many of these complex models explain much of the underlying genetic architecture of complex disease. In this review, we discuss the evidence from multiple research avenues, ranging from pairs of alleles to multiomic integration studies and pharmacogenomics, that supports the need for further investigation of gene interactions (or epistasis) in genetic and genomic studies of human disease. Our goal is to catalog the mounting evidence for epistasis in genetic studies and the connections between genetic interactions and human health and disease that could enable precision medicine of the future.
Collapse
Affiliation(s)
- Pankhuri Singhal
- Genetics and Epigenetics Graduate Group, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Shefali Setia Verma
- Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA;
- Penn Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
Collapse
|
3
|
Oliveira-Paula GH, Pereira SC, Tanus-Santos JE, Lacchini R. Pharmacogenomics And Hypertension: Current Insights. PHARMACOGENOMICS & PERSONALIZED MEDICINE 2019; 12:341-359. [PMID: 31819590 PMCID: PMC6878918 DOI: 10.2147/pgpm.s230201] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 09/07/2019] [Accepted: 11/05/2019] [Indexed: 11/23/2022]
Abstract
Hypertension is a multifactorial disease that affects approximately one billion subjects worldwide and is a major risk factor associated with cardiovascular events, including coronary heart disease and cerebrovascular accidents. Therefore, adequate blood pressure control is important to prevent these events, reducing premature mortality and disability. However, only one third of patients have the effective control of blood pressure, despite several classes of antihypertensive drugs available. These disappointing outcomes may be at least in part explained by interpatient variability in drug response due to genetic polymorphisms. To address the effects of genetic polymorphisms on blood pressure responses to the antihypertensive drug classes, studies have applied candidate genes and genome wide approaches. More recently, a third approach that considers gene-gene interactions has also been applied in hypertension pharmacogenomics. In this article, we carried out a comprehensive review of recent findings on the pharmacogenomics of antihypertensive drugs, including diuretics, β-blockers, angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers, and calcium channel blockers. We also discuss the limitations and inconsistences that have been found in hypertension pharmacogenomics and the challenges to implement this valuable approach in clinical practice.
Collapse
Affiliation(s)
- Gustavo H Oliveira-Paula
- Department of Medicine, Division of Cardiology, Wilf Family Cardiovascular Research Institute, Albert Einstein College of Medicine, New York, NY, USA.,Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| | - Sherliane C Pereira
- Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| | - Jose E Tanus-Santos
- Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| | - Riccardo Lacchini
- Department of Psychiatric Nursing and Human Sciences, Ribeirao Preto College of Nursing, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| |
Collapse
|
4
|
Lauschke VM, Zhou Y, Ingelman-Sundberg M. Novel genetic and epigenetic factors of importance for inter-individual differences in drug disposition, response and toxicity. Pharmacol Ther 2019; 197:122-152. [PMID: 30677473 PMCID: PMC6527860 DOI: 10.1016/j.pharmthera.2019.01.002] [Citation(s) in RCA: 65] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Individuals differ substantially in their response to pharmacological treatment. Personalized medicine aspires to embrace these inter-individual differences and customize therapy by taking a wealth of patient-specific data into account. Pharmacogenomic constitutes a cornerstone of personalized medicine that provides therapeutic guidance based on the genomic profile of a given patient. Pharmacogenomics already has applications in the clinics, particularly in oncology, whereas future development in this area is needed in order to establish pharmacogenomic biomarkers as useful clinical tools. In this review we present an updated overview of current and emerging pharmacogenomic biomarkers in different therapeutic areas and critically discuss their potential to transform clinical care. Furthermore, we discuss opportunities of technological, methodological and institutional advances to improve biomarker discovery. We also summarize recent progress in our understanding of epigenetic effects on drug disposition and response, including a discussion of the only few pharmacogenomic biomarkers implemented into routine care. We anticipate, in part due to exciting rapid developments in Next Generation Sequencing technologies, machine learning methods and national biobanks, that the field will make great advances in the upcoming years towards unlocking the full potential of genomic data.
Collapse
Affiliation(s)
- Volker M Lauschke
- Department of Physiology and Pharmacology, Section of Pharmacogenetics, Biomedicum 5B, Karolinska Institutet, SE-171 77 Stockholm, Sweden
| | - Yitian Zhou
- Department of Physiology and Pharmacology, Section of Pharmacogenetics, Biomedicum 5B, Karolinska Institutet, SE-171 77 Stockholm, Sweden
| | - Magnus Ingelman-Sundberg
- Department of Physiology and Pharmacology, Section of Pharmacogenetics, Biomedicum 5B, Karolinska Institutet, SE-171 77 Stockholm, Sweden.
| |
Collapse
|
5
|
Luizon MR, Pereira DA, Tanus-Santos JE. Pharmacogenetic relevance of endothelial nitric oxide synthase polymorphisms and gene interactions. Pharmacogenomics 2018; 19:1423-1435. [PMID: 30398085 DOI: 10.2217/pgs-2018-0098] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Endothelial nitric oxide synthase (NOS3) is a key enzyme responsible for nitric oxide (NO) generation in the vascular endothelium. Endothelial dysfunction is characterized by reduced NO production, and is a hallmark of cardiovascular diseases. Drugs with cardiovascular action may activate NOS3 and result in NO release and vasodilation. Moreover, genetic variations affect NOS3 expression and activity, and may partially explain the variability in the responses to cardiovascular drugs. We reviewed NO signaling and genetic effects on NO formation, and the effects of NOS3 polymorphisms, haplotypes and gene-gene interactions within NO signaling pathways on the responses to cardiovascular drugs. We discuss the role of rare NOS3 variants and further gene-gene interactions analysis for the development of novel therapies for cardiovascular diseases.
Collapse
Affiliation(s)
- Marcelo R Luizon
- Department of General Biology, Institute of Biological Sciences, Federal University of Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31270-901, Brazil.,UFMG Graduate Program in Genetics, Institute of Biological Sciences, Federal University of Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31270-901, Brazil
| | - Daniela A Pereira
- UFMG Graduate Program in Genetics, Institute of Biological Sciences, Federal University of Minas Gerais (UFMG), Belo Horizonte, Minas Gerais 31270-901, Brazil
| | - Jose E Tanus-Santos
- Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, Sao Paulo 14049-900, Brazil
| |
Collapse
|
6
|
Balik-Meisner M, Truong L, Scholl EH, La Du JK, Tanguay RL, Reif DM. Elucidating Gene-by-Environment Interactions Associated with Differential Susceptibility to Chemical Exposure. ENVIRONMENTAL HEALTH PERSPECTIVES 2018; 126:067010. [PMID: 29968567 PMCID: PMC6084885 DOI: 10.1289/ehp2662] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Revised: 05/14/2018] [Accepted: 05/17/2018] [Indexed: 05/04/2023]
Abstract
BACKGROUND Modern societies are exposed to vast numbers of potentially hazardous chemicals. Despite demonstrated linkages between chemical exposure and severe health effects, there are limited, often conflicting, data on how adverse health effects of exposure differ across individuals. OBJECTIVES We tested the hypothesis that population variability in response to certain chemicals could elucidate a role for gene-environment interactions (GxE) in differential susceptibility. METHODS High-throughput screening (HTS) data on thousands of chemicals in genetically heterogeneous zebrafish were leveraged to identify a candidate chemical (Abamectin) with response patterns indicative of population susceptibility differences. We tested the prediction by generating genome-wide sequence data for 276 individual zebrafish displaying susceptible (Affected) vs. resistant (Unaffected) phenotypes following identical chemical exposure. RESULTS We found GxE associated with differential susceptibility in the sox7 promoter region and then confirmed gene expression differences between phenotypic response classes. CONCLUSIONS The results for Abamectin in zebrafish demonstrate that GxE associated with naturally occurring, population genetic variation play a significant role in mediating individual response to chemical exposure. https://doi.org/10.1289/EHP2662.
Collapse
Affiliation(s)
- Michele Balik-Meisner
- Bioinformatics Research Center, Center for Human Health and the Environment, Dept. of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Lisa Truong
- Sinnhuber Aquatic Research Laboratory, Dept. of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, USA
| | - Elizabeth H Scholl
- Bioinformatics Research Center, Center for Human Health and the Environment, Dept. of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Jane K La Du
- Sinnhuber Aquatic Research Laboratory, Dept. of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, USA
| | - Robert L Tanguay
- Sinnhuber Aquatic Research Laboratory, Dept. of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, USA
| | - David M Reif
- Bioinformatics Research Center, Center for Human Health and the Environment, Dept. of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
7
|
Ritchie MD, Van Steen K. The search for gene-gene interactions in genome-wide association studies: challenges in abundance of methods, practical considerations, and biological interpretation. ANNALS OF TRANSLATIONAL MEDICINE 2018; 6:157. [PMID: 29862246 DOI: 10.21037/atm.2018.04.05] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
One of the primary goals in this era of precision medicine is to understand the biology of human diseases and their treatment, such that each individual patient receives the best possible treatment for their disease based on their genetic and environmental exposures. One way to work towards achieving this goal is to identify the environmental exposures and genetic variants that are relevant to each disease in question, as well as the complex interplay between genes and environment. Genome-wide association studies (GWAS) have allowed for a greater understanding of the genetic component of many complex traits. However, these genetic effects are largely small and thus, our ability to use these GWAS finding for precision medicine is limited. As more and more GWAS have been performed, rather than focusing only on common single nucleotide polymorphisms (SNPs) and additive genetic models, many researchers have begun to explore alternative heritable components of complex traits including rare variants, structural variants, epigenetics, and genetic interactions. While genetic interactions are a plausible reality that could explain some of the heritabliy that has not yet been identified, especially when one considers the identification of genetic interactions in model organisms as well as our understanding of biological complexity, still there are significant challenges and considerations in identifying these genetic interactions. Broadly, these can be summarized in three categories: abundance of methods, practical considerations, and biological interpretation. In this review, we will discuss these important elements in the search for genetic interactions along with some potential solutions. While genetic interactions are theoretically understood to be important for complex human disease, the body of evidence is still building to support this component of the underlying genetic architecture of complex human traits. Our hope is that more sophisticated modeling approaches and more robust computational techniques will enable the community to identify these important genetic interactions and improve our ability to implement precision medicine in the future.
Collapse
Affiliation(s)
- Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, USA
| | - Kristel Van Steen
- WELBIO, GIGA-R Medical Genomics Unit - BIO3, University of Liège, Liège, Belgium.,Department of Human Genetics, University of Leuven, Leuven, Belgium
| |
Collapse
|
8
|
Luizon MR, Pereira DA, Sandrim VC. Pharmacogenomics of Hypertension and Preeclampsia: Focus on Gene-Gene Interactions. Front Pharmacol 2018. [PMID: 29541029 PMCID: PMC5835759 DOI: 10.3389/fphar.2018.00168] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Hypertension is a leading cause of cardiovascular mortality, but only about half of patients on antihypertensive therapy achieve blood pressure control. Preeclampsia is defined as pregnancy-induced hypertension and proteinuria, and is associated with increased maternal and perinatal mortality and morbidity. Similarly, a large number of patients with preeclampsia are non-responsive to antihypertensive therapy. Pharmacogenomics may help to guide the personalized treatment for non-responsive hypertensive patients. There is evidence for the association of genetic variants with variable response to the most commonly used antihypertensive drugs. However, further replication is needed to confirm these associations in different populations. The failure to replicate findings from single-locus association studies has prompted the search for novel statistical methods for data analysis, which are required to detect the complex effects from multiple genes to drug response phenotypes. Notably, gene–gene interaction analyses have been applied to pharmacogenetic studies, including antihypertensive drug response. In this perspective article, we present advances of considering the interactions among genetic polymorphisms of different candidate genes within pathways relevant to antihypertensive drug response, and we highlight recent findings related to gene–gene interactions on pharmacogenetics of hypertension and preeclampsia. Finally, we discuss the future directions that are needed to unravel additional genes and variants involved in the responsiveness to antihypertensive drugs.
Collapse
Affiliation(s)
- Marcelo R Luizon
- Department of General Biology, Institute of Biological Sciences, Federal University of Minas Gerais (UFMG), Belo Horizonte, Brazil.,UFMG Graduate Program in Genetics, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Brazil
| | - Daniela A Pereira
- UFMG Graduate Program in Genetics, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Brazil
| | - Valeria C Sandrim
- Department of Pharmacology, Institute of Biosciences of Botucatu, Universidade Estadual Paulista, Botucatu, Brazil
| |
Collapse
|
9
|
Chen J, Akhtari FS, Wagner MJ, Suzuki O, Wiltshire T, Motsinger-Reif AA, Dumond JB. Pharmacogenetic Analysis of the Model-Based Pharmacokinetics of Five Anti-HIV Drugs: How Does This Influence the Effect of Aging? Clin Transl Sci 2017; 11:226-236. [PMID: 29205871 PMCID: PMC5866997 DOI: 10.1111/cts.12525] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Accepted: 10/31/2017] [Indexed: 12/27/2022] Open
Abstract
Analysis of aging and pharmacogenetics (PGx) on antiretroviral pharmacokinetics (PKs) could inform precision dosing for older human HIV‐infected patients. Seventy‐four participants receiving either atazanavir/ritonavir (ATV/RTV) or efavirenz (EFV) with tenofovir/emtricitabine (TFV/FTC) provided PK and PGx information. Aging‐PGx‐PK association and interaction analyses were conducted using one‐way analysis of variance (ANOVA), multiple linear regression, and Random Forest ensemble methods. Our analyses associated unbound ATV disposition with multidrug resistance protein (MRP)4, RTV with P‐glycoprotein (P‐gp), and EFV with cytochrome P450 (CYP)2B6 and MRP4 genetic variants. The clearance and cellular distribution of TFV were associated with P‐gp, MRP2, and concentrative nucleoside transporters (CNTs), and FTC parameters were associated with organic cation transporters (OCTs) and MRP2 genetic variants. Notably, p16INK4a expression, a cellular aging marker, predicted EFV and FTC PK when genetic factors were adjusted. Both age and p16INK4a expression interacted with PGx on ATV and TFV disposition, implying potential dose adjustment based on aging may depend on genetic background.
Collapse
Affiliation(s)
- Jingxian Chen
- University of North Carolina Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Farida S Akhtari
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA.,Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
| | - Michael J Wagner
- University of North Carolina Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Oscar Suzuki
- University of North Carolina Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Tim Wiltshire
- University of North Carolina Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alison A Motsinger-Reif
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.,Department of Statistics, North Carolina State University, Raleigh, NC, USA
| | - Julie B Dumond
- University of North Carolina Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
10
|
Moore JH, Andrews PC, Olson RS, Carlson SE, Larock CR, Bulhoes MJ, O'Connor JP, Greytak EM, Armentrout SL. Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases. BioData Min 2017; 10:19. [PMID: 28572842 PMCID: PMC5450417 DOI: 10.1186/s13040-017-0139-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 05/18/2017] [Indexed: 11/18/2022] Open
Abstract
Background Large-scale genetic studies of common human diseases have focused almost exclusively on the independent main effects of single-nucleotide polymorphisms (SNPs) on disease susceptibility. These studies have had some success, but much of the genetic architecture of common disease remains unexplained. Attention is now turning to detecting SNPs that impact disease susceptibility in the context of other genetic factors and environmental exposures. These context-dependent genetic effects can manifest themselves as non-additive interactions, which are more challenging to model using parametric statistical approaches. The dimensionality that results from a multitude of genotype combinations, which results from considering many SNPs simultaneously, renders these approaches underpowered. We previously developed the multifactor dimensionality reduction (MDR) approach as a nonparametric and genetic model-free machine learning alternative. Approaches such as MDR can improve the power to detect gene-gene interactions but are limited in their ability to exhaustively consider SNP combinations in genome-wide association studies (GWAS), due to the combinatorial explosion of the search space. We introduce here a stochastic search algorithm called Crush for the application of MDR to modeling high-order gene-gene interactions in genome-wide data. The Crush-MDR approach uses expert knowledge to guide probabilistic searches within a framework that capitalizes on the use of biological knowledge to filter gene sets prior to analysis. Here we evaluated the ability of Crush-MDR to detect hierarchical sets of interacting SNPs using a biology-based simulation strategy that assumes non-additive interactions within genes and additivity in genetic effects between sets of genes within a biochemical pathway. Results We show that Crush-MDR is able to identify genetic effects at the gene or pathway level significantly better than a baseline random search with the same number of model evaluations. We then applied the same methodology to a GWAS for Alzheimer’s disease and showed base level validation that Crush-MDR was able to identify a set of interacting genes with biological ties to Alzheimer’s disease. Conclusions We discuss the role of stochastic search and cloud computing for detecting complex genetic effects in genome-wide data.
Collapse
Affiliation(s)
- Jason H Moore
- Department of Biostatistics and Epidemiology, Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, 19104 PA USA
| | - Peter C Andrews
- Department of Biostatistics and Epidemiology, Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, 19104 PA USA
| | - Randal S Olson
- Department of Biostatistics and Epidemiology, Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, 19104 PA USA
| | | | | | | | | | | | | |
Collapse
|
11
|
Oliveira-Paula GH, Luizon MR, Lacchini R, Fontana V, Silva PS, Biagi C, Tanus-Santos JE. Gene-Gene Interactions Among PRKCA, NOS3 and BDKRB2 Polymorphisms Affect the Antihypertensive Effects of Enalapril. Basic Clin Pharmacol Toxicol 2016; 120:284-291. [PMID: 27696692 DOI: 10.1111/bcpt.12682] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 09/22/2016] [Indexed: 01/08/2023]
Abstract
Protein kinase C (PKC) signalling is critically involved in the control of blood pressure. Angiotensin-converting enzyme inhibitors (ACEi) affect PKC expression and activity, which are partially associated with the responses to ACEi. We examined whether PRKCA (protein kinase C, alpha) polymorphisms (rs887797 C>T, rs1010544 T>C and rs16960228 G>A), or haplotypes, and gene-gene interactions within the ACEi pathway affect the antihypertensive responses in 104 hypertensive patients treated with enalapril as monotherapy. Patients were classified as poor responders (PR) or good responders (GR) to enalapril if their changes in mean arterial pressure were lower or higher than the median value, respectively. Multi-factor dimensionality reduction was used to characterize interactions among PRKCA, NOS3 (nitric oxide synthase 3) and BDKRB2 (bradykinin receptor B2) polymorphisms. The TC+CC genotypes for the rs1010544 polymorphism were more frequent in GR than in PR (p = 0.037). Conversely, the GA+AA genotypes for the rs16960228 polymorphism, and the CTA haplotype, were more frequent in PR than in GR (p = 0.040 and p = 0.008, respectively). Moreover, the GG genotype for the PRKCA rs16960228 polymorphism was associated with PR or GR depending on the genotypes for the rs2070744 (NOS3) and rs1799722 (BDKRB2) polymorphisms (p = 0.012). Our results suggest that PRKCA polymorphisms and gene-gene interactions within the ACEi pathway affect the antihypertensive responses to enalapril.
Collapse
Affiliation(s)
- Gustavo H Oliveira-Paula
- Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| | - Marcelo R Luizon
- Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| | - Riccardo Lacchini
- Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| | - Vanessa Fontana
- Department of Pharmacology, State University of Campinas, Campinas, SP, Brazil
| | - Pamela S Silva
- Department of Pharmacology, State University of Campinas, Campinas, SP, Brazil
| | - Celso Biagi
- Santa Casa of Araçatuba, Araçatuba, SP, Brazil
| | - Jose E Tanus-Santos
- Department of Pharmacology, Ribeirao Preto Medical School, University of Sao Paulo, Ribeirao Preto, SP, Brazil
| |
Collapse
|
12
|
Gene–gene interactions in the NAMPT pathway, plasma visfatin/NAMPT levels, and antihypertensive therapy responsiveness in hypertensive disorders of pregnancy. THE PHARMACOGENOMICS JOURNAL 2016; 17:427-434. [DOI: 10.1038/tpj.2016.35] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2015] [Revised: 02/08/2016] [Accepted: 03/28/2016] [Indexed: 12/16/2022]
|
13
|
Hu XS, Hu Y. Genomic Scans of Zygotic Disequilibrium and Epistatic SNPs in HapMap Phase III Populations. PLoS One 2015; 10:e0131039. [PMID: 26126177 PMCID: PMC4488364 DOI: 10.1371/journal.pone.0131039] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2015] [Accepted: 05/27/2015] [Indexed: 11/19/2022] Open
Abstract
Previous theory indicates that zygotic linkage disequilibrium (LD) is more informative than gametic or composite digenic LD in revealing natural population history. Further, the difference between the composite digenic and maximum zygotic LDs can be used to detect epistatic selection for fitness. Here we corroborate the theory by investigating genome-wide zygotic LDs in HapMap phase III human populations. Results show that non-Africa populations have much more significant zygotic LDs than do Africa populations. Africa populations (ASW, LWK, MKK, and YRI) possess more significant zygotic LDs for the double-homozygotes (DAABB) than any other significant zygotic LDs (DAABb, DAaBB, and DAaBb), while non-Africa populations generally have more significant DAaBb’s than any other significant zygotic LDs (DAABB, DAABb, and DAaBB). Average r-squares for any significant zygotic LDs increase generally in an order of populations YRI, MKK, CEU, CHB, LWK, JPT, CHD, TSI, GIH, ASW, and MEX. Average r-squares are greater for DAABB and DAaBb than for DAaBB and DAABb in each population. YRI and MKK can be separated from LWK and ASW in terms of the pattern of average r-squares. All population divergences in zygotic LDs can be interpreted with the model of Out of Africa for modern human origins. We have also detected 19735-95921 SNP pairs exhibiting strong signals of epistatic selection in different populations. Gene-gene interactions for some epistatic SNP pairs are evident from empirical findings, but many more epistatic SNP pairs await evidence. Common epistatic SNP pairs rarely exist among all populations, but exist in distinct regions (Africa, Europe, and East Asia), which helps to understand geographical genomic medicine.
Collapse
Affiliation(s)
- Xin-Sheng Hu
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX13RB, United Kingdom
- * E-mail:
| | - Yang Hu
- Department of Computing Science, University of Alberta, Edmonton, AB T6G 2S4, Canada
| |
Collapse
|
14
|
Lee SH, Lee EB, Shin ES, Lee JE, Cho SH, Min KU, Park HW. The Interaction Between Allelic Variants of CD86 and CD40LG: A Common Risk Factor of Allergic Asthma and Rheumatoid Arthritis. ALLERGY, ASTHMA & IMMUNOLOGY RESEARCH 2014; 6:137-41. [PMID: 24587950 PMCID: PMC3936042 DOI: 10.4168/aair.2014.6.2.137] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 04/15/2013] [Accepted: 05/03/2013] [Indexed: 11/20/2022]
Abstract
PURPOSE Allergic asthma (AA) and rheumatoid arthritis (RA) are immune tolerance-related diseases, and immune tolerance is known to be influenced by costimulatory molecules. In this study, we sought to identify common genetic susceptibility in AA and RA. METHODS Two hundred cases of AA, 184 cases of RA, and 182 healthy controls were recruited at the Seoul National University Hospital, Seoul, Korea. Eight single nucleotide polymorphisms (SNPs) in five genes coding costimulatory molecules, namely, -318C>T, +49A>G, and 6230G>A in CTLA4, IVS3+17T>C in CD28, -3479T>G and I179V in CD86, -1C>T in CD40, and -3458A>G in CD40LG were scored, and genetic interactions were evaluated by multifactor dimensionality reduction (MDR) analysis. RESULTS MDR analysis revealed a significant gene-gene interaction between -3479T>G CD86 and -3458A>G CD40LG for AA. Subjects with the T/T genotype of -3479T>G CD86 and the A/A genotype of -3458A>G CD40LG were found to be significantly more likely to develop AA than those with the T/T genotype of -3479T>G CD86 and A/- genotype of -3458A>G CD40LG (adjusted OR, 6.09; 95% CI, 2.89-12.98; logistic regression analysis controlled by age). Similarly those subjects showed a significant risk of developing RA (adjusted OR, 39.35; 95% CI, 15.01-107.00, logistic regression analysis controlled by age). CONCLUSIONS Our findings suggest that a genetic interaction between CD86 and CD40LG favors the development of both AA and RA.
Collapse
Affiliation(s)
- So-Hee Lee
- Institute of Allergy and Clinical Immunology, Seoul National University College of Medicine, Seoul, Korea
- Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Korea
| | - Eun-Bong Lee
- Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Korea
| | | | | | - Sang-Heon Cho
- Institute of Allergy and Clinical Immunology, Seoul National University College of Medicine, Seoul, Korea
- Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Korea
| | - Kyung-Up Min
- Institute of Allergy and Clinical Immunology, Seoul National University College of Medicine, Seoul, Korea
- Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Korea
| | - Heung-Woo Park
- Institute of Allergy and Clinical Immunology, Seoul National University College of Medicine, Seoul, Korea
- Department of Internal Medicine, Seoul National University College of Medicine, Seoul, Korea
| |
Collapse
|
15
|
Meurice G. Apport de la bioinformatique dans le cadre de la médecine moléculaire. ONCOLOGIE 2014. [DOI: 10.1007/s10269-014-2375-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
16
|
Lou XY. Gene-Gene and Gene-Environment Interactions Underlying Complex Traits and their Detection. BIOMETRICS & BIOSTATISTICS INTERNATIONAL JOURNAL 2014; 1:00007. [PMID: 25584363 PMCID: PMC4288817 DOI: 10.15406/bbij.2014.01.00007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Affiliation(s)
- Xiang-Yang Lou
- Corresponding author: Xiang-Yang Lou, Department of Biostatistics, University of Alabama at Birmingham 1665 University Boulevard, RPHB 327, Birmingham, Alabama 35294-0022, USA, Tel: 205-975-9145; Fax: 205-975-2541;
| |
Collapse
|
17
|
Greene CS, Himmelstein DS, Nelson HH, Kelsey KT, Williams SM, Andrew AS, Karagas MR, Moore JH. Enabling personal genomics with an explicit test of epistasis. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2013:327-36. [PMID: 19908385 DOI: 10.1142/9789814295291_0035] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
One goal of personal genomics is to use information about genomic variation to predict who is at risk for various common diseases. Technological advances in genotyping have spawned several personal genetic testing services that market genotyping services directly to the consumer. An important goal of consumer genetic testing is to provide health information along with the genotyping results. This has the potential to integrate detailed personal genetic and genomic information into healthcare decision making. Despite the potential importance of these advances, there are some important limitations. One concern is that much of the literature that is used to formulate personal genetics reports is based on genetic association studies that consider each genetic variant independently of the others. It is our working hypothesis that the true value of personal genomics will only be realized when the complexity of the genotype-to-phenotype mapping relationship is embraced, rather than ignored. We focus here on complexity in genetic architecture due to epistasis or nonlinear gene-gene interaction. We have previously developed a multifactor dimensionality reduction (MDR) algorithm and software package for detecting nonlinear interactions in genetic association studies. In most prior MDR analyses, the permutation testing strategy used to assess statistical significance was unable to differentiate MDR models that captured only interaction effects from those that also detected independent main effects. Statistical interpretation of MDR models required post-hoc analysis using entropy-based measures of interaction information. We introduce here a novel permutation test that allows the effects of nonlinear interactions between multiple genetic variants to be specifically tested in a manner that is not confounded by linear additive effects. We show using simulated nonlinear interactions that the power using the explicit test of epistasis is no different than a standard permutation test. We also show that the test has the appropriate size or type I error rate of approximately 0.05. We then apply MDR with the new explicit test of epistasis to a large genetic study of bladder cancer and show that a previously reported nonlinear interaction between is indeed significant, even after considering the strong additive effect of smoking in the model. Finally, we evaluated the power of the explicit test of epistasis to detect the nonlinear interaction between two XPD gene polymorphisms by simulating data from the MDR model of bladder cancer susceptibility. The results of this study provide for the first time a simple method for explicitly testing epistasis or gene-gene interaction effects in genetic association studies. Although we demonstrated the method with MDR, an important advantage is that it can be combined with any modeling approach. The explicit test of epistasis brings us a step closer to the type of routine gene-gene interaction analysis that is needed if we are to enable personal genomics.
Collapse
Affiliation(s)
- Casey S Greene
- Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756, USA
| | | | | | | | | | | | | | | |
Collapse
|
18
|
Setsirichok D, Tienboon P, Jaroonruang N, Kittichaijaroen S, Wongseree W, Piroonratana T, Usavanarong T, Limwongse C, Aporntewan C, Phadoongsidhi M, Chaiyaratana N. An omnibus permutation test on ensembles of two-locus analyses can detect pure epistasis and genetic heterogeneity in genome-wide association studies. SPRINGERPLUS 2013; 2:230. [PMID: 24804170 PMCID: PMC4006521 DOI: 10.1186/2193-1801-2-230] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 04/24/2013] [Indexed: 01/20/2023]
Abstract
This article presents the ability of an omnibus permutation test on ensembles of two-locus analyses (2LOmb) to detect pure epistasis in the presence of genetic heterogeneity. The performance of 2LOmb is evaluated in various simulation scenarios covering two independent causes of complex disease where each cause is governed by a purely epistatic interaction. Different scenarios are set up by varying the number of available single nucleotide polymorphisms (SNPs) in data, number of causative SNPs and ratio of case samples from two affected groups. The simulation results indicate that 2LOmb outperforms multifactor dimensionality reduction (MDR) and random forest (RF) techniques in terms of a low number of output SNPs and a high number of correctly-identified causative SNPs. Moreover, 2LOmb is capable of identifying the number of independent interactions in tractable computational time and can be used in genome-wide association studies. 2LOmb is subsequently applied to a type 1 diabetes mellitus (T1D) data set, which is collected from a UK population by the Wellcome Trust Case Control Consortium (WTCCC). After screening for SNPs that locate within or near genes and exhibit no marginal single-locus effects, the T1D data set is reduced to 95,991 SNPs from 12,146 genes. The 2LOmb search in the reduced T1D data set reveals that 12 SNPs, which can be divided into two independent sets, are associated with the disease. The first SNP set consists of three SNPs from MUC21 (mucin 21, cell surface associated), three SNPs from MUC22 (mucin 22), two SNPs from PSORS1C1 (psoriasis susceptibility 1 candidate 1) and one SNP from TCF19 (transcription factor 19). A four-locus interaction between these four genes is also detected. The second SNP set consists of three SNPs from ATAD1 (ATPase family, AAA domain containing 1). Overall, the findings indicate the detection of pure epistasis in the presence of genetic heterogeneity and provide an alternative explanation for the aetiology of T1D in the UK population.
Collapse
Affiliation(s)
- Damrongrit Setsirichok
- Department of Electrical and Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Pracharat Sai 1 Road, Bangsue, Bangkok 10800, Thailand
| | - Phuwadej Tienboon
- Department of Electrical and Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Pracharat Sai 1 Road, Bangsue, Bangkok 10800, Thailand
| | - Nattapong Jaroonruang
- Department of Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology Thonburi, 126 Pracha-utid Road, Bangmod, Toongkru, Bangkok 10140, Thailand
| | - Somkit Kittichaijaroen
- Department of Electrical and Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Pracharat Sai 1 Road, Bangsue, Bangkok 10800, Thailand
| | - Waranyu Wongseree
- Division of Technology of Information System Management, Faculty of Engineering, Mahidol University, 25/25 Phuttamonthon 4 Road, Nakhon Pathom 73170, Salaya, Thailand
| | - Theera Piroonratana
- Department of Electrical and Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Pracharat Sai 1 Road, Bangsue, Bangkok 10800, Thailand
| | - Touchpong Usavanarong
- Department of Electrical and Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Pracharat Sai 1 Road, Bangsue, Bangkok 10800, Thailand
| | - Chanin Limwongse
- Division of Molecular Genetics, Department of Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Prannok Road, Bangkok 10700, Bangkoknoi, Thailand
| | - Chatchawit Aporntewan
- Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, 254 Phayathai Road, Pathumwan, Bangkok 10330, Thailand
| | - Marong Phadoongsidhi
- Department of Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology Thonburi, 126 Pracha-utid Road, Bangmod, Toongkru, Bangkok 10140, Thailand
| | - Nachol Chaiyaratana
- Department of Electrical and Computer Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Pracharat Sai 1 Road, Bangsue, Bangkok 10800, Thailand ; Division of Molecular Genetics, Department of Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Prannok Road, Bangkok 10700, Bangkoknoi, Thailand
| |
Collapse
|
19
|
Moore JH, Hill DP, Sulovari A, Kidd LC. Genetic Analysis of Prostate Cancer Using Computational Evolution, Pareto-Optimization and Post-processing. GENETIC AND EVOLUTIONARY COMPUTATION 2013. [DOI: 10.1007/978-1-4614-6846-2_7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
20
|
Howland RH. Future Prospects for Pharmacogenetics in the Quest for Personalized Medicine. J Psychosoc Nurs Ment Health Serv 2012; 50:13-6. [DOI: 10.3928/02793695-20121114-01] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
21
|
Abstract
The use of pharmacogenetics information is central to the concept of personalized medicine. Understanding pharmacogenetic differences in drug response and tolerability has been investigated mainly through the study of pharmacokinetic and pharmacodynamic processes. The hope and promise of pharmacogenetic testing has led to the commercial availability of several testing products. With the exception of the relationship between certain types of adverse drug reactions and immune response genes such as the human leukocyte antigen, a growing body of research has not yet established the clinical utility of pharmacogenetics testing. Variance in findings from pharmacogenetics studies conducted to date may be due to epistasis (gene-gene and gene-environment interactions), epigenetics (non-DNA sequence-related heredity), or other genetic factors, which have been largely unexplored in pharmacogenetics research.
Collapse
Affiliation(s)
- Robert H Howland
- University of Pittsburgh School of Medicine, Western Psychiatric School of Medicine, Western Psychiatric Institute and Clinic, Pittsburgh, Pennsylvania 15213, USA.
| |
Collapse
|
22
|
Ritchie MD. The success of pharmacogenomics in moving genetic association studies from bench to bedside: study design and implementation of precision medicine in the post-GWAS era. Hum Genet 2012; 131:1615-26. [PMID: 22923055 PMCID: PMC3432217 DOI: 10.1007/s00439-012-1221-z] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2012] [Accepted: 08/07/2012] [Indexed: 12/13/2022]
Abstract
Pharmacogenomics is emerging as a popular type of study for human genetics in recent years. This is primarily due to the many success stories and high potential for translation to clinical practice. In this review, the strengths and limitations of pharmacogenomics are discussed as well as the primary epidemiologic, clinical trial, and in vitro study designs implemented. A brief discussion of molecular and analytic approaches will be reviewed. Finally, several examples of bench-to-bedside clinical implementations of pharmacogenetic traits will be described. Pharmacogenomics continues to grow in popularity because of the important genetic associations identified that drive the possibility of precision medicine.
Collapse
Affiliation(s)
- Marylyn D Ritchie
- Department of Biochemistry and Molecular Biology, The Huck Institutes of the Life Sciences, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, PA 16802, USA.
| |
Collapse
|
23
|
Lane HY, Tsai GE, Lin E. Assessing Gene-Gene Interactions in Pharmacogenomics. Mol Diagn Ther 2012; 16:15-27. [DOI: 10.1007/bf03256426] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
24
|
Holzinger ER, Ritchie MD. Integrating heterogeneous high-throughput data for meta-dimensional pharmacogenomics and disease-related studies. Pharmacogenomics 2012; 13:213-22. [PMID: 22256870 DOI: 10.2217/pgs.11.145] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The current paradigm of human genetics research is to analyze variation of a single data type (i.e., DNA sequence or RNA levels) to detect genes and pathways that underlie complex traits such as disease state or drug response. While these studies have detected thousands of variations that associate with hundreds of complex phenotypes, much of the estimated heritability, or trait variability due to genetic factors, remain unexplained. We may be able to account for a portion of the missing heritability if we incorporate a systems biology approach into these analyses. Rapid technological advances will make it possible for scientists to explore this hypothesis via the generation of high-throughput omics data - transcriptomic, proteomic and methylomic to name a few. Analyzing this 'meta-dimensional' data will require clever statistical techniques that allow for the integration of qualitative and quantitative predictor variables. For this article, we examine two major categories of approaches for integrated data analysis, give examples of their use in experimental and in silico datasets, and assess the limitations of each method.
Collapse
Affiliation(s)
- Emily R Holzinger
- Center for Human Genetics Research, Vanderbilt University, Department of Molecular Physiology & Biophysics, Nashville, TN, USA
| | | |
Collapse
|
25
|
Dai H, Bhandary M, Becker M, Leeder JS, Gaedigk R, Motsinger-Reif AA. Global tests of P-values for multifactor dimensionality reduction models in selection of optimal number of target genes. BioData Min 2012; 5:3. [PMID: 22616673 PMCID: PMC3508622 DOI: 10.1186/1756-0381-5-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2012] [Accepted: 04/19/2012] [Indexed: 11/12/2022] Open
Abstract
Background Multifactor Dimensionality Reduction (MDR) is a popular and successful data mining method developed to characterize and detect nonlinear complex gene-gene interactions (epistasis) that are associated with disease susceptibility. Because MDR uses a combinatorial search strategy to detect interaction, several filtration techniques have been developed to remove genes (SNPs) that have no interactive effects prior to analysis. However, the cutoff values implemented for these filtration methods are arbitrary, therefore different choices of cutoff values will lead to different selections of genes (SNPs). Methods We suggest incorporating a global test of p-values to filtration procedures to identify the optimal number of genes/SNPs for further MDR analysis and demonstrate this approach using a ReliefF filter technique. We compare the performance of different global testing procedures in this context, including the Kolmogorov-Smirnov test, the inverse chi-square test, the inverse normal test, the logit test, the Wilcoxon test and Tippett’s test. Additionally we demonstrate the approach on a real data application with a candidate gene study of drug response in Juvenile Idiopathic Arthritis. Results Extensive simulation of correlated p-values show that the inverse chi-square test is the most appropriate approach to be incorporated with the screening approach to determine the optimal number of SNPs for the final MDR analysis. The Kolmogorov-Smirnov test has high inflation of Type I errors when p-values are highly correlated or when p-values peak near the center of histogram. Tippett’s test has very low power when the effect size of GxG interactions is small. Conclusions The proposed global tests can serve as a screening approach prior to individual tests to prevent false discovery. Strong power in small sample sizes and well controlled Type I error in absence of GxG interactions make global tests highly recommended in epistasis studies.
Collapse
Affiliation(s)
- Hongying Dai
- Department of Medical Research, Children's Mercy Hospital, 2401 Gillham Road, Kansas City, MO, 64108, USA.
| | | | | | | | | | | |
Collapse
|
26
|
Shang J, Zhang J, Sun Y, Liu D, Ye D, Yin Y. Performance analysis of novel methods for detecting epistasis. BMC Bioinformatics 2011; 12:475. [PMID: 22172045 PMCID: PMC3259123 DOI: 10.1186/1471-2105-12-475] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2011] [Accepted: 12/15/2011] [Indexed: 02/03/2023] Open
Abstract
Background Epistasis is recognized fundamentally important for understanding the mechanism of disease-causing genetic variation. Though many novel methods for detecting epistasis have been proposed, few studies focus on their comparison. Undertaking a comprehensive comparison study is an urgent task and a pathway of the methods to real applications. Results This paper aims at a comparison study of epistasis detection methods through applying related software packages on datasets. For this purpose, we categorize methods according to their search strategies, and select five representative methods (TEAM, BOOST, SNPRuler, AntEpiSeeker and epiMODE) originating from different underlying techniques for comparison. The methods are tested on simulated datasets with different size, various epistasis models, and with/without noise. The types of noise include missing data, genotyping error and phenocopy. Performance is evaluated by detection power (three forms are introduced), robustness, sensitivity and computational complexity. Conclusions None of selected methods is perfect in all scenarios and each has its own merits and limitations. In terms of detection power, AntEpiSeeker performs best on detecting epistasis displaying marginal effects (eME) and BOOST performs best on identifying epistasis displaying no marginal effects (eNME). In terms of robustness, AntEpiSeeker is robust to all types of noise on eME models, BOOST is robust to genotyping error and phenocopy on eNME models, and SNPRuler is robust to phenocopy on eME models and missing data on eNME models. In terms of sensitivity, AntEpiSeeker is the winner on eME models and both SNPRuler and BOOST perform well on eNME models. In terms of computational complexity, BOOST is the fastest among the methods. In terms of overall performance, AntEpiSeeker and BOOST are recommended as the efficient and effective methods. This comparison study may provide guidelines for applying the methods and further clues for epistasis detection.
Collapse
Affiliation(s)
- Junliang Shang
- School of Computer Science & Technology, Xidian University, Xi'an 710071, China.
| | | | | | | | | | | |
Collapse
|
27
|
Fernald GH, Capriotti E, Daneshjou R, Karczewski KJ, Altman RB. Bioinformatics challenges for personalized medicine. ACTA ACUST UNITED AC 2011; 27:1741-8. [PMID: 21596790 PMCID: PMC3117361 DOI: 10.1093/bioinformatics/btr295] [Citation(s) in RCA: 177] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
MOTIVATION Widespread availability of low-cost, full genome sequencing will introduce new challenges for bioinformatics. RESULTS This review outlines recent developments in sequencing technologies and genome analysis methods for application in personalized medicine. New methods are needed in four areas to realize the potential of personalized medicine: (i) processing large-scale robust genomic data; (ii) interpreting the functional effect and the impact of genomic variation; (iii) integrating systems data to relate complex genetic interactions with phenotypes; and (iv) translating these discoveries into medical practice. CONTACT russ.altman@stanford.edu
Collapse
Affiliation(s)
- Guy Haskin Fernald
- Biomedical Informatics Training Program, Stanford University School of Medicine, Department of Bioengineering, Stanford University, Stanford, CA, USA
| | | | | | | | | |
Collapse
|
28
|
Grady BJ, Ritchie MD. Statistical Optimization of Pharmacogenomics Association Studies: Key Considerations from Study Design to Analysis. CURRENT PHARMACOGENOMICS AND PERSONALIZED MEDICINE 2011; 9:41-66. [PMID: 21887206 PMCID: PMC3163263 DOI: 10.2174/187569211794728805] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Research in human genetics and genetic epidemiology has grown significantly over the previous decade, particularly in the field of pharmacogenomics. Pharmacogenomics presents an opportunity for rapid translation of associated genetic polymorphisms into diagnostic measures or tests to guide therapy as part of a move towards personalized medicine. Expansion in genotyping technology has cleared the way for widespread use of whole-genome genotyping in the effort to identify novel biology and new genetic markers associated with pharmacokinetic and pharmacodynamic endpoints. With new technology and methodology regularly becoming available for use in genetic studies, a discussion on the application of such tools becomes necessary. In particular, quality control criteria have evolved with the use of GWAS as we have come to understand potential systematic errors which can be introduced into the data during genotyping. There have been several replicated pharmacogenomic associations, some of which have moved to the clinic to enact change in treatment decisions. These examples of translation illustrate the strength of evidence necessary to successfully and effectively translate a genetic discovery. In this review, the design of pharmacogenomic association studies is examined with the goal of optimizing the impact and utility of this research. Issues of ascertainment, genotyping, quality control, analysis and interpretation are considered.
Collapse
Affiliation(s)
- Benjamin J. Grady
- Department of Molecular Physiology & Biophysics, Center for Human Genetics Research, Vanderbilt University, Nashville, TN, USA
| | - Marylyn D. Ritchie
- Department of Molecular Physiology & Biophysics, Center for Human Genetics Research, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
29
|
Gui J, Andrew AS, Andrews P, Nelson HM, Kelsey KT, Karagas MR, Moore JH. A robust multifactor dimensionality reduction method for detecting gene-gene interactions with application to the genetic analysis of bladder cancer susceptibility. Ann Hum Genet 2010; 75:20-8. [PMID: 21091664 DOI: 10.1111/j.1469-1809.2010.00624.x] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
A central goal of human genetics is to identify susceptibility genes for common human diseases. An important challenge is modelling gene-gene interaction or epistasis that can result in nonadditivity of genetic effects. The multifactor dimensionality reduction (MDR) method was developed as a machine learning alternative to parametric logistic regression for detecting interactions in the absence of significant marginal effects. The goal of MDR is to reduce the dimensionality inherent in modelling combinations of polymorphisms using a computational approach called constructive induction. Here, we propose a Robust Multifactor Dimensionality Reduction (RMDR) method that performs constructive induction using a Fisher's Exact Test rather than a predetermined threshold. The advantage of this approach is that only statistically significant genotype combinations are considered in the MDR analysis. We use simulation studies to demonstrate that this approach will increase the success rate of MDR when there are only a few genotype combinations that are significantly associated with case-control status. We show that there is no loss of success rate when this is not the case. We then apply the RMDR method to the detection of gene-gene interactions in genotype data from a population-based study of bladder cancer in New Hampshire.
Collapse
Affiliation(s)
- Jiang Gui
- Dartmouth Medical School, Lebanon, NH 03756, USA
| | | | | | | | | | | | | |
Collapse
|
30
|
Motsinger-Reif AA, Deodhar S, Winham SJ, Hardison NE. Grammatical evolution decision trees for detecting gene-gene interactions. BioData Min 2010; 3:8. [PMID: 21087514 PMCID: PMC3000379 DOI: 10.1186/1756-0381-3-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2010] [Accepted: 11/18/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A fundamental goal of human genetics is the discovery of polymorphisms that predict common, complex diseases. It is hypothesized that complex diseases are due to a myriad of factors including environmental exposures and complex genetic risk models, including gene-gene interactions. Such epistatic models present an important analytical challenge, requiring that methods perform not only statistical modeling, but also variable selection to generate testable genetic model hypotheses. This challenge is amplified by recent advances in genotyping technology, as the number of potential predictor variables is rapidly increasing. METHODS Decision trees are a highly successful, easily interpretable data-mining method that are typically optimized with a hierarchical model building approach, which limits their potential to identify interacting effects. To overcome this limitation, we utilize evolutionary computation, specifically grammatical evolution, to build decision trees to detect and model gene-gene interactions. In the current study, we introduce the Grammatical Evolution Decision Trees (GEDT) method and software and evaluate this approach on simulated data representing gene-gene interaction models of a range of effect sizes. We compare the performance of the method to a traditional decision tree algorithm and a random search approach and demonstrate the improved performance of the method to detect purely epistatic interactions. RESULTS The results of our simulations demonstrate that GEDT has high power to detect even very moderate genetic risk models. GEDT has high power to detect interactions with and without main effects. CONCLUSIONS GEDT, while still in its initial stages of development, is a promising new approach for identifying gene-gene interactions in genetic association studies.
Collapse
|
31
|
Seripa D, Pilotto A, Panza F, Matera MG, Pilotto A. Pharmacogenetics of cytochrome P450 (CYP) in the elderly. Ageing Res Rev 2010; 9:457-74. [PMID: 20601196 DOI: 10.1016/j.arr.2010.06.001] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2010] [Revised: 05/28/2010] [Accepted: 06/01/2010] [Indexed: 12/22/2022]
Abstract
The genetics of cytochrome P450 (CYP) is a very active area of multidisciplinary research, overlapping the interest of medicine, biology and pharmacology, being the CYP enzyme system responsible for the metabolism of more than 80% of the commercially available drugs. Variations in CYP encoding genes are responsible for inter-individual differences in CYP production or function, with severe clinical consequences as therapeutic failures (TFs) and adverse drug reactions (ADRs), being ADRs worldwide primary causes of morbidity and mortality in elderly people. In fact, the prevalence of both TFs and ADRs strongly increased in the presence of multiple pharmacological treatments, a common status in subjects aging 65 years and over. The present article explored some basic concepts of human genetics that have important implications in the genetics of CYP. An attempted to transfer these basic concepts to the genetic data reported by the Home Page of The Human Cytochrome P450 (CYP) Allele Nomenclature Committee was also made, focusing on the current knowledge of CYP genetics. The status of what we know and what we need to know is the base for the clinical applications of pharmacogenetics, in which personalized drug treatments constituted the main aim, in particular in patients attending a geriatric ward.
Collapse
Affiliation(s)
- Davide Seripa
- Geriatric Unit & Gerontology-Geriatrics Research Laboratory, Department of Medical Sciences, IRCCS Casa Sollievo della Sofferenza, Viale Cappuccini 1, 71013 San Giovanni Rotondo (FG), Italy.
| | | | | | | | | |
Collapse
|
32
|
Dexamethasone-induced FKBP51 expression in peripheral blood mononuclear cells could play a role in predicting the response of asthmatics to treatment with corticosteroids. J Clin Immunol 2010; 31:122-7. [PMID: 20853021 DOI: 10.1007/s10875-010-9463-9] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2010] [Accepted: 09/07/2010] [Indexed: 10/19/2022]
Abstract
BACKGROUND Corticosteroids (CSs) are the preferred anti-inflammatory therapy for the treatment of asthma, but the responses of asthmatics to CSs are known to vary. It has thus become important to discover reliable markers in predicting responses to CSs. METHODS We performed time-series microarrays using a murine model of asthma after a single dose of dexamethasone, based on the assumption that the gene showing a greater change in response to CSs can also be a potential marker for that finding. We then evaluated the clinical meaning of the gene discovered in the microarray experiments. RESULTS We found that the expression of FK506 binding protein 51 gene (FKBP51) in lung tissue markedly increased after dexamethasone treatment in a murine model of asthma. We then measured dexamethasone-induced FKBP51 expression in peripheral blood mononuclear cells (PBMCs) in asthmatics. Dexamethasone-induced FKBP51 expression in PBMCs was significantly higher in severe asthmatics compared with mild-to-moderate asthmatics treated with inhaled CSs. In addition, we found that dexamethasone-induced FKBP51 expression in PBMCs was inversely correlated with improvement in lung function after treatment with orally administered prednisolone in six steroid-naive asthmatics. CONCLUSION Dexamethasone-induced FKBP51 expression in PBMCs may be a reliable and practical biomarker in predicting the response to CSs in asthmatics.
Collapse
|
33
|
Cattaert T, Calle ML, Dudek SM, Mahachie John JM, Van Lishout F, Urrea V, Ritchie MD, Van Steen K. Model-based multifactor dimensionality reduction for detecting epistasis in case-control data in the presence of noise. Ann Hum Genet 2010; 75:78-89. [PMID: 21158747 DOI: 10.1111/j.1469-1809.2010.00604.x] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Analyzing the combined effects of genes and/or environmental factors on the development of complex diseases is a great challenge from both the statistical and computational perspective, even using a relatively small number of genetic and nongenetic exposures. Several data-mining methods have been proposed for interaction analysis, among them, the Multifactor Dimensionality Reduction Method (MDR) has proven its utility in a variety of theoretical and practical settings. Model-Based Multifactor Dimensionality Reduction (MB-MDR), a relatively new MDR-based technique that is able to unify the best of both nonparametric and parametric worlds, was developed to address some of the remaining concerns that go along with an MDR analysis. These include the restriction to univariate, dichotomous traits, the absence of flexible ways to adjust for lower order effects and important confounders, and the difficulty in highlighting epistatic effects when too many multilocus genotype cells are pooled into two new genotype groups. We investigate the empirical power of MB-MDR to detect gene-gene interactions in the absence of any noise and in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity. Power is generally higher for MB-MDR than for MDR, in particular in the presence of genetic heterogeneity, phenocopy, or low minor allele frequencies.
Collapse
Affiliation(s)
- Tom Cattaert
- Montefiore Institute, University of Liege, Belgium
| | | | | | | | | | | | | | | |
Collapse
|
34
|
Wakefield J, De Vocht F, Hung RJ. Bayesian mixture modeling of gene-environment and gene-gene interactions. Genet Epidemiol 2010; 34:16-25. [PMID: 19492346 DOI: 10.1002/gepi.20429] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
With the advent of rapid and relatively cheap genotyping technologies there is now the opportunity to attempt to identify gene-environment and gene-gene interactions when the number of genes and environmental factors is potentially large. Unfortunately the dimensionality of the parameter space leads to a computational explosion in the number of possible interactions that may be investigated. The full model that includes all interactions and main effects can be unstable, with wide confidence intervals arising from the large number of estimated parameters. We describe a hierarchical mixture model that allows all interactions to be investigated simultaneously, but assumes the effects come from a mixture prior with two components, one that reflects small null effects and the second for epidemiologically significant effects. Effects from the former are effectively set to zero, hence increasing the power for the detection of real signals. The prior framework is very flexible, which allows substantive information to be incorporated into the analysis. We illustrate the methods first using simulation, and then on data from a case-control study of lung cancer in Central and Eastern Europe.
Collapse
Affiliation(s)
- Jon Wakefield
- International Agency for Research on Cancer, Lyon, France.
| | | | | |
Collapse
|
35
|
Abstract
Motivation: The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has necessitated the development of new biostatistical methods for quality control, imputation and analysis issues including multiple testing. This work has been successful and has enabled the discovery of new associations that have been replicated in multiple studies. However, it is now recognized that most SNPs discovered via GWAS have small effects on disease susceptibility and thus may not be suitable for improving health care through genetic testing. One likely explanation for the mixed results of GWAS is that the current biostatistical analysis paradigm is by design agnostic or unbiased in that it ignores all prior knowledge about disease pathobiology. Further, the linear modeling framework that is employed in GWAS often considers only one SNP at a time thus ignoring their genomic and environmental context. There is now a shift away from the biostatistical approach toward a more holistic approach that recognizes the complexity of the genotype–phenotype relationship that is characterized by significant heterogeneity and gene–gene and gene–environment interaction. We argue here that bioinformatics has an important role to play in addressing the complexity of the underlying genetic basis of common human diseases. The goal of this review is to identify and discuss those GWAS challenges that will require computational methods. Contact:jason.h.moore@dartmouth.edu
Collapse
Affiliation(s)
- Jason H Moore
- Department of Genetics, Department of Community and Family Medicine, Dartmouth Medical School, Lebanon, NH 03756, USA.
| | | | | |
Collapse
|
36
|
Ritchie MD, Bush WS. Genome simulation approaches for synthesizing in silico datasets for human genomics. ADVANCES IN GENETICS 2010; 72:1-24. [PMID: 21029846 DOI: 10.1016/b978-0-12-380862-2.00001-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Simulated data is a necessary first step in the evaluation of new analytic methods because in simulated data the true effects are known. To successfully develop novel statistical and computational methods for genetic analysis, it is vital to simulate datasets consisting of single nucleotide polymorphisms (SNPs) spread throughout the genome at a density similar to that observed by new high-throughput molecular genomics studies. In addition, the simulation of environmental data and effects will be essential to properly formulate risk models for complex disorders. Data simulations are often criticized because they are much less noisy than natural biological data, as it is nearly impossible to simulate the multitude of possible sources of natural and experimental variability. However, simulating data in silico is the most straightforward way to test the true potential of new methods during development. Thus, advances that increase the complexity of data simulations will permit investigators to better assess new analytical methods. In this work, we will briefly describe some of the current approaches for the simulation of human genomics data describing the advantages and disadvantages of the various approaches. We will also include details on software packages available for data simulation. Finally, we will expand upon one particular approach for the creation of complex, human genomic datasets that uses a forward-time population simulation algorithm: genomeSIMLA. Many of the hallmark features of biological datasets can be synthesized in silico; still much research is needed to enhance our capabilities to create datasets that capture the natural complexity of biological datasets.
Collapse
Affiliation(s)
- Marylyn D Ritchie
- Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, USA
| | | |
Collapse
|
37
|
Kim SH, Lee JE, Kim SH, Jee YK, Kim YK, Park HS, Min KU, Park HW. Allelic variants of CD40 and CD40L genes interact to promote antibiotic-induced cutaneous allergic reactions. Clin Exp Allergy 2009; 39:1852-6. [DOI: 10.1111/j.1365-2222.2009.03336.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
38
|
Wongseree W, Assawamakin A, Piroonratana T, Sinsomros S, Limwongse C, Chaiyaratana N. Detecting purely epistatic multi-locus interactions by an omnibus permutation test on ensembles of two-locus analyses. BMC Bioinformatics 2009; 10:294. [PMID: 19761607 PMCID: PMC2759961 DOI: 10.1186/1471-2105-10-294] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2009] [Accepted: 09/17/2009] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND Purely epistatic multi-locus interactions cannot generally be detected via single-locus analysis in case-control studies of complex diseases. Recently, many two-locus and multi-locus analysis techniques have been shown to be promising for the epistasis detection. However, exhaustive multi-locus analysis requires prohibitively large computational efforts when problems involve large-scale or genome-wide data. Furthermore, there is no explicit proof that a combination of multiple two-locus analyses can lead to the correct identification of multi-locus interactions. RESULTS The proposed 2LOmb algorithm performs an omnibus permutation test on ensembles of two-locus analyses. The algorithm consists of four main steps: two-locus analysis, a permutation test, global p-value determination and a progressive search for the best ensemble. 2LOmb is benchmarked against an exhaustive two-locus analysis technique, a set association approach, a correlation-based feature selection (CFS) technique and a tuned ReliefF (TuRF) technique. The simulation results indicate that 2LOmb produces a low false-positive error. Moreover, 2LOmb has the best performance in terms of an ability to identify all causative single nucleotide polymorphisms (SNPs) and a low number of output SNPs in purely epistatic two-, three- and four-locus interaction problems. The interaction models constructed from the 2LOmb outputs via a multifactor dimensionality reduction (MDR) method are also included for the confirmation of epistasis detection. 2LOmb is subsequently applied to a type 2 diabetes mellitus (T2D) data set, which is obtained as a part of the UK genome-wide genetic epidemiology study by the Wellcome Trust Case Control Consortium (WTCCC). After primarily screening for SNPs that locate within or near 372 candidate genes and exhibit no marginal single-locus effects, the T2D data set is reduced to 7,065 SNPs from 370 genes. The 2LOmb search in the reduced T2D data reveals that four intronic SNPs in PGM1 (phosphoglucomutase 1), two intronic SNPs in LMX1A (LIM homeobox transcription factor 1, alpha), two intronic SNPs in PARK2 (Parkinson disease (autosomal recessive, juvenile) 2, parkin) and three intronic SNPs in GYS2 (glycogen synthase 2 (liver)) are associated with the disease. The 2LOmb result suggests that there is no interaction between each pair of the identified genes that can be described by purely epistatic two-locus interaction models. Moreover, there are no interactions between these four genes that can be described by purely epistatic multi-locus interaction models with marginal two-locus effects. The findings provide an alternative explanation for the aetiology of T2D in a UK population. CONCLUSION An omnibus permutation test on ensembles of two-locus analyses can detect purely epistatic multi-locus interactions with marginal two-locus effects. The study also reveals that SNPs from large-scale or genome-wide case-control data which are discarded after single-locus analysis detects no association can still be useful for genetic epidemiology studies.
Collapse
Affiliation(s)
- Waranyu Wongseree
- Department of Electrical Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Piboolsongkram Road, Bangsue, Bangkok 10800, Thailand
| | - Anunchai Assawamakin
- Division of Molecular Genetics, Department of Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Prannok Road, Bangkoknoi, Bangkok 10700, Thailand
| | - Theera Piroonratana
- Department of Electrical Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Piboolsongkram Road, Bangsue, Bangkok 10800, Thailand
| | - Saravudh Sinsomros
- Department of Electrical Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Piboolsongkram Road, Bangsue, Bangkok 10800, Thailand
| | - Chanin Limwongse
- Division of Molecular Genetics, Department of Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Prannok Road, Bangkoknoi, Bangkok 10700, Thailand
| | - Nachol Chaiyaratana
- Department of Electrical Engineering, Faculty of Engineering, King Mongkut's University of Technology North Bangkok, 1518 Piboolsongkram Road, Bangsue, Bangkok 10800, Thailand
- Division of Molecular Genetics, Department of Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Prannok Road, Bangkoknoi, Bangkok 10700, Thailand
| |
Collapse
|
39
|
Epistasis and its implications for personal genetics. Am J Hum Genet 2009; 85:309-20. [PMID: 19733727 DOI: 10.1016/j.ajhg.2009.08.006] [Citation(s) in RCA: 240] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Revised: 07/31/2009] [Accepted: 08/10/2009] [Indexed: 12/22/2022] Open
Abstract
The widespread availability of high-throughput genotyping technology has opened the door to the era of personal genetics, which brings to consumers the promise of using genetic variations to predict individual susceptibility to common diseases. Despite easy access to commercial personal genetics services, our knowledge of the genetic architecture of common diseases is still very limited and has not yet fulfilled the promise of accurately predicting most people at risk. This is partly because of the complexity of the mapping relationship between genotype and phenotype that is a consequence of epistasis (gene-gene interaction) and other phenomena such as gene-environment interaction and locus heterogeneity. Unfortunately, these aspects of genetic architecture have not been addressed in most of the genetic association studies that provide the knowledge base for interpreting large-scale genetic association results. We provide here an introductory review of how epistasis can affect human health and disease and how it can be detected in population-based studies. We provide some thoughts on the implications of epistasis for personal genetics and some recommendations for improving personal genetics in light of this complexity.
Collapse
|
40
|
Baranzini SE, Galwey NW, Wang J, Khankhanian P, Lindberg R, Pelletier D, Wu W, Uitdehaag BMJ, Kappos L, Polman CH, Matthews PM, Hauser SL, Gibson RA, Oksenberg JR, Barnes MR. Pathway and network-based analysis of genome-wide association studies in multiple sclerosis. Hum Mol Genet 2009; 18:2078-90. [PMID: 19286671 PMCID: PMC2678928 DOI: 10.1093/hmg/ddp120] [Citation(s) in RCA: 284] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Genome-wide association studies (GWAS) testing several hundred thousand SNPs have been performed in multiple sclerosis (MS) and other complex diseases. Typically, the number of markers in which the evidence for association exceeds the genome-wide significance threshold is very small, and markers that do not exceed this threshold are generally neglected. Classical statistical analysis of these datasets in MS revealed genes with known immunological functions. However, many of the markers showing modest association may represent false negatives. We hypothesize that certain combinations of genes flagged by these markers can be identified if they belong to a common biological pathway. Here we conduct a pathway-oriented analysis of two GWAS in MS that takes into account all SNPs with nominal evidence of association (P < 0.05). Gene-wise P-values were superimposed on a human protein interaction network and searches were conducted to identify sub-networks containing a higher proportion of genes associated with MS than expected by chance. These sub-networks, and others generated at random as a control, were categorized for membership of biological pathways. GWAS from eight other diseases were analyzed to assess the specificity of the pathways identified. In the MS datasets, we identified sub-networks of genes from several immunological pathways including cell adhesion, communication and signaling. Remarkably, neural pathways, namely axon-guidance and synaptic potentiation, were also over-represented in MS. In addition to the immunological pathways previously identified, we report here for the first time the potential involvement of neural pathways in MS susceptibility.
Collapse
Affiliation(s)
- Sergio E Baranzini
- Department of Neurology, School of Medicine, University of California San Francisco, 513 Parnassus Ave. Room S-256, San Francisco, CA 94143-0435, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
|
42
|
Motsinger-Reif AA, Fanelli TJ, Davis AC, Ritchie MD. Power of grammatical evolution neural networks to detect gene-gene interactions in the presence of error. BMC Res Notes 2008; 1:65. [PMID: 18710518 PMCID: PMC2531119 DOI: 10.1186/1756-0500-1-65] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2008] [Accepted: 08/13/2008] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND With the advent of increasingly efficient means to obtain genetic information, a great insurgence of data has resulted, leading to the need for methods for analyzing this data beyond that of traditional parametric statistical approaches. Recently we introduced Grammatical Evolution Neural Network (GENN), a machine-learning approach to detect gene-gene or gene-environment interactions, also known as epistasis, in high dimensional genetic epidemiological data. GENN has been shown to be highly successful in a range of simulated data, but the impact of error common to real data is unknown. In the current study, we examine the power of GENN to detect interesting interactions in the presence of noise due to genotyping error, missing data, phenocopy, and genetic heterogeneity. Additionally, we compare the performance of GENN to that of another computational method - Multifactor Dimensionality Reduction (MDR). FINDINGS GENN is extremely robust to missing data and genotyping error. Phenocopy in a dataset reduces the power of both GENN and MDR. GENN is reasonably robust to genetic heterogeneity and find that in some cases GENN has substantially higher power than MDR to detect functional loci in the presence of genetic heterogeneity. CONCLUSION GENN is a promising method to detect gene-gene interaction, even in the presence of common types of error found in real data.
Collapse
Affiliation(s)
- Alison A Motsinger-Reif
- Bioinformatics Research Center, Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
| | - Theresa J Fanelli
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Anna C Davis
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Marylyn D Ritchie
- Center for Human Genetics Research, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| |
Collapse
|