Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Fan R, Zhong M, Wang S, Zhang Y, Andrew A, Karagas M, Chen H, Amos CI, Xiong M, Moore JH. Entropy-based information gain approaches to detect and to characterize gene-gene and gene-environment interactions/correlations of complex diseases. Genet Epidemiol 2011;35:706-21. [PMID: 22009792 PMCID: PMC3384547 DOI: 10.1002/gepi.20621] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Indexed: 01/17/2023]

Cited by Other Article(s)

Ozminkowski S, Solís‐Lemus C. Identifying microbial drivers in biological phenotypes with a Bayesian network regression model. Ecol Evol 2024;14:e11039. [PMID: 38774136 PMCID: PMC11106058 DOI: 10.1002/ece3.11039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/04/2023] [Revised: 01/29/2024] [Accepted: 02/03/2024] [Indexed: 05/24/2024] Open

Yaldız B, Erdoğan O, Rafatov S, Iyigün C, Aydın Son Y. Revealing third-order interactions through the integration of machine learning and entropy methods in genomic studies. BioData Min 2024;17:3. [PMID: 38291454 PMCID: PMC10826120 DOI: 10.1186/s13040-024-00355-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2023] [Accepted: 01/16/2024] [Indexed: 02/01/2024] Open

Abstract

BACKGROUND

Non-linear relationships at the genotype level are essential in understanding the genetic interactions of complex disease traits. Genome-wide association studies (GWAS) have revealed statistical associations of SNPs with many complex diseases. As GWAS results could not thoroughly reveal the genetic background of these disorders, genome-wide interaction studies have started to gain importance. In recent years, various statistical approaches, such as entropy-based methods, have been suggested for revealing these non-additive interactions between variants. This study presents a novel prioritization workflow integrating two-step Random Forest (RF) modeling and entropy analysis after PLINK filtering. The PLINK-RF-RF workflow is followed by an entropy-based 3-way interaction information (3WII) method to capture the hidden patterns resulting from non-linear relationships between genotypes in Late-Onset Alzheimer Disease (LOAD) and to discover early and differential diagnosis markers.

RESULTS

Three models from different datasets are developed by integrating PLINK-RF-RF analysis with the entropy-based three-way interaction information (3WII) calculation method, which enables the detection of third-order interactions that are not typically considered in epistatic interaction studies. For all three datasets, a reduced SNP set is selected by 3WII analysis after PLINK filtering and RF-RF prioritization of SNPs, which is promising as a model minimization approach. Among the SNPs revealed by 3WII, 4 SNPs out of 19 from GenADA, 1 SNP out of 27 from ADNI, and 4 SNPs out of 106 from NCRAD are mapped to genes directly associated with Alzheimer Disease. Additionally, several SNPs are associated with other neurological disorders. The genes to which the variants map in all datasets are also significantly enriched in the calcium ion binding, extracellular matrix, external encapsulating structure, and "RUNX1 regulates estrogen receptor-mediated transcription" pathways. These functional pathways are therefore proposed for further examination for a possible LOAD association. In addition, all 3WII variants are proposed as candidate biomarkers for genotyping-based LOAD diagnosis.

CONCLUSION

The entropy approach performed in this study reveals the complex genetic interactions that significantly contribute to LOAD risk. We used the entropy-based 3WII as a model minimization step and determined the significant 3-way interactions between the SNPs prioritized by PLINK-RF-RF. This framework is a promising approach for disease association studies and can also be modified by integrating other machine learning and entropy-based interaction methods.
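As an illustration of the kind of quantity an entropy-based three-way interaction measure computes, the following is a minimal sketch in Python. It is not the 3WII implementation from the study above: it assumes one common definition of interaction information, I(A,B;C) - I(A;C) - I(B;C), whose sign convention varies between papers, and the function names `entropy` and `interaction_information` are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(*columns):
    """Shannon entropy (in bits) of the joint distribution of the given columns."""
    joint = list(zip(*columns))
    n = len(joint)
    return -sum((c / n) * log2(c / n) for c in Counter(joint).values())


def interaction_information(a, b, c):
    """Interaction information under the convention I(A,B;C) - I(A;C) - I(B;C).

    Expanded into entropies:
    H(A,B) + H(A,C) + H(B,C) - H(A) - H(B) - H(C) - H(A,B,C).
    Positive values indicate synergy, negative values redundancy
    (assumed convention; other papers flip the sign).
    """
    return (entropy(a, b) + entropy(a, c) + entropy(b, c)
            - entropy(a) - entropy(b) - entropy(c)
            - entropy(a, b, c))


# Toy data: c is the XOR of a and b, a purely synergistic relationship
# in which neither variable alone carries information about c.
a = [0, 0, 1, 1]
b = [0, 1, 0, 1]
c = [x ^ y for x, y in zip(a, b)]
print(interaction_information(a, b, c))
```

In the XOR toy case the measure is positive because the two inputs are informative only jointly; three identical columns give a negative value, reflecting redundancy.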

Ventresca C, Mohamed W, Russel WA, Ay A, Ingram KK. Machine learning analyses reveal circadian clock features predictive of anxiety among UK biobank participants. Sci Rep 2023;13:22304. [PMID: 38102312 PMCID: PMC10724169 DOI: 10.1038/s41598-023-49644-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/16/2023] [Accepted: 12/11/2023] [Indexed: 12/17/2023] Open

Abstract

Mood disorders, including depression and anxiety, affect almost one-fifth of the world's adult population and are becoming increasingly prevalent. Mutations in circadian clock genes have previously been associated with mood disorders both directly and indirectly through alterations in circadian phase, suggesting that the circadian clock influences multiple molecular pathways involved in mood. By targeting previously identified single nucleotide polymorphisms (SNPs) that have been implicated in anxiety and depressive disorders, we use a combination of statistical and machine learning techniques to investigate associations with the generalized anxiety disorder assessment (GAD-7) scores in a UK Biobank sample of 90,882 individuals. As in previous studies, we observed that females exhibited higher GAD-7 scores than males regardless of genotype. Interestingly, we found no significant effects on anxiety from individual circadian gene variants; only circadian genotypes with multiple SNP variants showed significant associations with anxiety. For both sexes, severe anxiety is associated with a 120-fold increase in odds for individuals with CRY2_AG(rs1083852)/ZBTB20_TT(rs1394593) genotypes and is associated with a near 40-fold reduction in odds for individuals with PER3-A_CG(rs228697)/ZBTB20_TT(rs1394593) genotypes. We also report several sex-specific associations with anxiety. In females, the CRY2/ZBTB20 genotype combination showed a >200-fold increase in odds of anxiety, and PER3/ZBTB20 and CRY1/PER3-A genotype combinations also appeared as female risk factors. In males, CRY1/PER3-A and PER3-B/ZBTB20 genotype combinations were associated with anxiety risk. Mediation analysis revealed direct associations of CRY2/ZBTB20 variant genotypes with moderate anxiety in females and CRY1/PER3-A variant genotypes with severe anxiety in males. The association of CRY1/PER3-A variant genotypes with severe anxiety in females was partially mediated by extreme evening chronotype.
Our results reinforce existing findings that females exhibit stronger anxiety outcomes than males, and provide evidence for circadian gene associations with anxiety, particularly in females. Our analyses only identified significant associations using two-gene combinations, underscoring the importance of combined gene effects on anxiety risk. We describe novel, robust associations between gene combinations involving the ZBTB20 SNP (rs1394593) and risk of anxiety symptoms in a large population sample. Our findings also support previous findings that the ZBTB20 SNP is an important factor in mood disorders, including seasonal affective disorder. Our results suggest that reduced expression of this gene significantly modulates the risk of anxiety symptoms through direct influences on mood-related pathways. Together, these observations provide novel links between the circadian clockwork and anxiety symptoms and identify potential molecular pathways through which clock genes may influence anxiety risk.

Wang Z, Zhu Y, Liu Z, Li H, Tang X, Jiang Y. Comparative analysis of tissue-specific genes in maize based on machine learning models: CNN performs technically best, LightGBM performs biologically soundest. Front Genet 2023;14:1190887. [PMID: 37229198 PMCID: PMC10203421 DOI: 10.3389/fgene.2023.1190887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2023] [Accepted: 04/17/2023] [Indexed: 05/27/2023] Open

Abstract

Introduction: With the advancement of RNA-seq technology and machine learning, training machine learning models on large-scale RNA-seq data from databases can generally identify genes with important regulatory roles that were previously missed by standard linear analytic methodologies. Finding tissue-specific genes could improve our comprehension of the relationship between tissues and genes. However, few machine learning models for transcriptome data have been deployed and compared to identify tissue-specific genes, particularly for plants. Methods: In this study, an expression matrix was processed with linear models (Limma), machine learning models (LightGBM), and deep learning models (CNN) with information gain and the SHAP strategy, based on 1,548 maize multi-tissue RNA-seq datasets obtained from a public database, to identify tissue-specific genes. For validation, V-measure values were computed based on k-means clustering of the gene sets to evaluate their technical complementarity. Furthermore, GO analysis and literature retrieval were used to validate the functions and research status of these genes. Results: Based on clustering validation, the convolutional neural network outperformed the others with a higher V-measure value of 0.647, indicating that its gene set could cover as many specific properties of various tissues as possible, whereas LightGBM discovered key transcription factors. The combination of the three gene sets produced 78 core tissue-specific genes that had previously been shown in the literature to be biologically significant. Discussion: Different tissue-specific gene sets were identified due to the distinct interpretation strategies of the machine learning models, and researchers may use multiple methodologies and strategies for tissue-specific gene sets based on their goals, types of data, and computational resources.
This study provided comparative insight for large-scale data mining of transcriptome datasets, shedding light on resolving high dimensions and bias difficulties in bioinformatics data processing.
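The V-measure mentioned in the abstract above scores a clustering against reference labels as the harmonic mean of homogeneity and completeness. The following is a minimal pure-Python sketch of that standard definition, not the study's pipeline; the function name `v_measure` and the toy tissue labels are assumptions for illustration.

```python
from collections import Counter
from math import log


def _entropy(labels):
    """Shannon entropy (natural log) of a list of hashable labels."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def _conditional_entropy(target, given):
    """H(target | given) = H(target, given) - H(given)."""
    return _entropy(list(zip(target, given))) - _entropy(given)


def v_measure(true_labels, cluster_labels):
    """Harmonic mean of homogeneity and completeness (equal weighting)."""
    h_true = _entropy(true_labels)
    h_clus = _entropy(cluster_labels)
    # Homogeneity: each cluster should contain members of a single class.
    homogeneity = (1.0 if h_true == 0
                   else 1.0 - _conditional_entropy(true_labels, cluster_labels) / h_true)
    # Completeness: each class should be assigned to a single cluster.
    completeness = (1.0 if h_clus == 0
                    else 1.0 - _conditional_entropy(cluster_labels, true_labels) / h_clus)
    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)


# Hypothetical example: tissue labels versus cluster assignments.
tissues = ["leaf", "leaf", "root", "root"]
clusters = [1, 1, 0, 0]  # same partition, different label names
print(v_measure(tissues, clusters))
```

Because the score depends only on the partition, relabeling the clusters does not change it, which is why it suits comparing k-means output with known tissue labels.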

Xiong W, Chen Y, Ma S. Unified model-free interaction screening via CV-entropy filter. Comput Stat Data Anal 2023;180:107684. [PMID: 36910335 PMCID: PMC9997997 DOI: 10.1016/j.csda.2022.107684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/29/2022]

Walakira A, Ocira J, Duroux D, Fouladi R, Moškon M, Rozman D, Van Steen K. Detecting gene-gene interactions from GWAS using diffusion kernel principal components. BMC Bioinformatics 2022;23:57. [PMID: 35105309 PMCID: PMC8805268 DOI: 10.1186/s12859-022-04580-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/30/2021] [Accepted: 01/18/2022] [Indexed: 11/10/2022] Open

Kunert-Graf JM, Sakhanenko NA, Galas DJ. Optimized permutation testing for information theoretic measures of multi-gene interactions. BMC Bioinformatics 2021;22:180. [PMID: 33827420 PMCID: PMC8028212 DOI: 10.1186/s12859-021-04107-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 10/01/2020] [Accepted: 03/29/2021] [Indexed: 11/17/2022] Open

Abstract

Background

Permutation testing is often considered the “gold standard” for multi-test significance analysis, as it is an exact test requiring few assumptions about the distribution being computed. However, it can be computationally very expensive, particularly in its naive form in which the full analysis pipeline is re-run after permuting the phenotype labels. This can become intractable in multi-locus genome-wide association studies (GWAS), in which the number of potential interactions to be tested is combinatorially large.

Results

In this paper, we develop an approach for permutation testing in multi-locus GWAS, specifically focusing on SNP–SNP-phenotype interactions using multivariable measures that can be computed from frequency count tables, such as those based on information theory. We find that the computational bottleneck in this process is the construction of the count tables themselves, and that this step can be eliminated at each iteration of the permutation testing by transforming the count tables directly. This leads to a speed-up by a factor of over 10³ for a typical permutation test compared to the naive approach. Additionally, this approach is insensitive to the number of samples, making it suitable for datasets with a large number of samples.

Conclusions

The proliferation of large-scale datasets with genotype data for hundreds of thousands of individuals enables new and more powerful approaches for the detection of multi-locus genotype-phenotype interactions. Our approach significantly improves the computational tractability of permutation testing for these studies. Moreover, our approach is insensitive to the large number of samples in these modern datasets. The code for performing these computations and replicating the figures in this paper is freely available at https://github.com/kunert/permute-counts.
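For orientation, the naive baseline that the abstract above contrasts itself with can be sketched as follows: the phenotype labels are shuffled and the count-based statistic is recomputed from scratch on every iteration. This sketch does not implement the count-table transformation proposed in the paper, and the function names and toy data are hypothetical.

```python
import random
from collections import Counter
from math import log2


def mutual_information(x, y):
    """Mutual information (bits) between two discrete sequences, from counts."""
    n = len(x)
    joint = Counter(zip(x, y))
    px, py = Counter(x), Counter(y)
    return sum((c / n) * log2(c * n / (px[a] * py[b]))
               for (a, b), c in joint.items())


def naive_permutation_test(genotype, phenotype, n_perm=1000, seed=0):
    """Naive permutation test: rebuild the count table for every shuffle.

    Returns the observed statistic and an empirical p-value with the usual
    +1 correction so the p-value is never exactly zero.
    """
    rng = random.Random(seed)
    observed = mutual_information(genotype, phenotype)
    labels = list(phenotype)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(labels)  # permute phenotype labels in place
        if mutual_information(genotype, labels) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)


# Toy data: genotype perfectly predicts phenotype.
genotype = [0] * 20 + [1] * 20
phenotype = [0] * 20 + [1] * 20
print(naive_permutation_test(genotype, phenotype))
```

Each iteration here costs time proportional to the number of samples, which is the step the paper reports eliminating by transforming the count tables directly.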

Manavalan R, Priya S. Genetic interactions effects for cancer disease identification using computational models: a review. Med Biol Eng Comput 2021;59:733-758. [PMID: 33839998 DOI: 10.1007/s11517-021-02343-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 10/20/2020] [Accepted: 03/10/2021] [Indexed: 11/29/2022]

Chanda P, Costa E, Hu J, Sukumar S, Van Hemert J, Walia R. Information Theory in Computational Biology: Where We Stand Today. Entropy (Basel) 2020;22:E627. [PMID: 33286399 PMCID: PMC7517167 DOI: 10.3390/e22060627] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Received: 04/30/2020] [Revised: 05/31/2020] [Accepted: 06/03/2020] [Indexed: 12/30/2022]

Malten J, König IR. Modified entropy-based procedure detects gene-gene-interactions in unconventional genetic models. BMC Med Genomics 2020;13:65. [PMID: 32326960 PMCID: PMC7181579 DOI: 10.1186/s12920-020-0703-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 02/19/2019] [Accepted: 03/13/2020] [Indexed: 11/10/2022] Open

Kim S. A miRNA- and mRNA-seq-Based Feature Selection Approach for Kidney Cancer Biomakers. Cancer Inform 2020;19:1176935120908301. [PMID: 32165847 PMCID: PMC7050029 DOI: 10.1177/1176935120908301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/17/2020] [Accepted: 02/01/2020] [Indexed: 11/15/2022] Open

Furxhi I, Murphy F, Poland CA, Sheehan B, Mullins M, Mantecca P. Application of Bayesian networks in determining nanoparticle-induced cellular outcomes using transcriptomics. Nanotoxicology 2019;13:827-848. [PMID: 31140895 DOI: 10.1080/17435390.2019.1595206] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Indexed: 12/12/2022]

Abstract

Inroads have been made in our understanding of the risks posed to human health and the environment by nanoparticles (NPs), but this area requires continuous research and monitoring. Machine learning techniques have been applied to nanotoxicology with very encouraging results. This study deals with bridging physicochemical properties of NPs, experimental exposure conditions and in vitro characteristics with biological effects of NPs on a molecular cellular level from transcriptomics studies. The bridging is done by developing and implementing Bayesian Networks (BNs) with or without data preprocessing. The BN structures are derived either automatically or methodologically and compared. Early stage nanotoxicity measurements represent a challenge, not least when attempting to predict adverse outcomes, and modeling is critical to understanding the biological effects of exposure to NPs. The preprocessed data-driven BN showed improved performance over the automatically structured BN and the BN with unprocessed datasets. The prestructured BN captures interrelationships between NP properties, exposure condition and in vitro characteristics and links those with cellular effects based on statistical correlation findings. Information gain analysis showed that exposure dose, NP and cell line variables were the most influential attributes in predicting the biological effects. The BN methodology proposed in this study successfully predicts a number of toxicologically relevant disrupted cellular biological processes, such as cell cycle and proliferation pathways, cell adhesion and extracellular matrix responses, and DNA damage and repair mechanisms, with a success rate >80%. The model validation from independent data shows a robust and promising methodology for incorporating transcriptomics outcomes in a hazard and, by extension, risk assessment modeling framework by predicting affected cellular functions from experimental conditions.

Kafaie S, Chen Y, Hu T. A network approach to prioritizing susceptibility genes for genome-wide association studies. Genet Epidemiol 2019;43:477-491. [PMID: 30859622 DOI: 10.1002/gepi.22198] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 10/29/2018] [Revised: 01/31/2019] [Accepted: 02/25/2019] [Indexed: 12/22/2022]

Ferrario PG, König IR. Transferring entropy to the realm of GxG interactions. Brief Bioinform 2018;19:136-147. [PMID: 27769993 PMCID: PMC5862307 DOI: 10.1093/bib/bbw086] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 06/28/2016] [Indexed: 01/18/2023] Open

ClusterMI: Detecting High-Order SNP Interactions Based on Clustering and Mutual Information. Int J Mol Sci 2018;19:ijms19082267. [PMID: 30072632 PMCID: PMC6121365 DOI: 10.3390/ijms19082267] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Received: 06/28/2018] [Revised: 07/23/2018] [Accepted: 07/30/2018] [Indexed: 01/14/2023] Open

Mielniczuk J, Teisseyre P. A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited. Genet Epidemiol 2017;42:187-200. [PMID: 29265411 DOI: 10.1002/gepi.22108] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Received: 09/11/2017] [Revised: 10/23/2017] [Accepted: 11/15/2017] [Indexed: 11/09/2022]

Bernardo M, Bioque M, Cabrera B, Lobo A, González-Pinto A, Pina L, Corripio I, Sanjuán J, Mané A, Castro-Fornieles J, Vieta E, Arango C, Mezquida G, Gassó P, Parellada M, Saiz-Ruiz J, Cuesta MJ, Mas S. Modelling gene-environment interaction in first episodes of psychosis. Schizophr Res 2017;189:181-189. [PMID: 28179063 DOI: 10.1016/j.schres.2017.01.058] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Received: 09/23/2016] [Revised: 01/24/2017] [Accepted: 01/30/2017] [Indexed: 12/20/2022]

Affiliation(s)

Miguel Bernardo: Barcelona Clínic Schizophrenia Unit, Hospital Clínic de Barcelona, CIBERSAM, Spain; Universitat de Barcelona, IDIBAPS, Barcelona, Spain.
Miquel Bioque: Barcelona Clínic Schizophrenia Unit, Hospital Clínic de Barcelona, CIBERSAM, Spain
Bibiana Cabrera: Barcelona Clínic Schizophrenia Unit, Hospital Clínic de Barcelona, CIBERSAM, Spain
Antonio Lobo: Instituto de Investigación Sanitaria Aragón (IIS Aragón), University of Zaragoza, Spain
Ana González-Pinto: Department of Psychiatry, Hospital Universitario de Alava, CIBERSAM, University of the Basque Country, Spain
Laura Pina: Child and Adolescent Psychiatry Department, Hospital General Universitario Gregorio Marañón, IiSGM, CIBERSAM, School of Medicine, Universidad Complutense, Madrid, Spain
Iluminada Corripio: Department of Psychiatry, Hospital de Sant Pau, CIBERSAM, Barcelona, Spain
Julio Sanjuán: Clinic Hospital Valencia, INCLIVA, CIBERSAM, Valencia University, Spain
Anna Mané: Department of Psychiatry, Hospital del Mar, Barcelona, IMIM, Barcelona, Spain
Josefina Castro-Fornieles: Department of Child and Adolescent Psychiatry and Psychology, SGR-489, Neurosciences Institute, Hospital Clínic of Barcelona, IDIBAPS, CIBERSAM, University of Barcelona, Spain
Eduard Vieta: Hospital Clínic de Barcelona, Universitat de Barcelona, IDIBAPS, CIBERSAM, Spain
Celso Arango: Child and Adolescent Psychiatry Department, Hospital General Universitario Gregorio Marañón, IiSGM, CIBERSAM, School of Medicine, Universidad Complutense, Madrid, Spain
Gisela Mezquida: Barcelona Clínic Schizophrenia Unit, Hospital Clínic de Barcelona, CIBERSAM, Spain
Patricia Gassó: Department of Pathological Anatomy, Pharmacology and Microbiology, University of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), CIBERSAM, Barcelona, Spain
Mara Parellada: Child and Adolescent Psychiatry Department, Hospital General Universitario Gregorio Marañón, IiSGM, CIBERSAM, School of Medicine, Universidad Complutense, Madrid, Spain
Jerónimo Saiz-Ruiz: Hospital Ramón y Cajal, Universidad de Alcalá, IRYCIS, CIBERSAM, Madrid, Spain
Manuel J Cuesta: Psychiatric Department, Complejo Hospitalario de Navarra, Pamplona (Spain), Instituto de Investigación Sanitaria de Navarra (IdiSNA), Spain
Sergi Mas: Department of Pathological Anatomy, Pharmacology and Microbiology, University of Barcelona, Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), CIBERSAM, Barcelona, Spain

Sauce B, Matzel LD. The paradox of intelligence: Heritability and malleability coexist in hidden gene-environment interplay. Psychol Bull 2017;144:26-47. [PMID: 29083200 DOI: 10.1037/bul0000131] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Indexed: 01/30/2023]

Yan W, Li J, Liu M, Bai X, Shao H. Data-based multiple criteria decision-making model and visualized monitoring of urban drinking water quality. Soft Comput 2017. [DOI: 10.1007/s00500-017-2809-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 12/01/2022]

Balestre M, de Souza CL. Bayesian reversible-jump for epistasis analysis in genomic studies. BMC Genomics 2016;17:1012. [PMID: 27938339 PMCID: PMC5148921 DOI: 10.1186/s12864-016-3342-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 03/12/2016] [Accepted: 11/25/2016] [Indexed: 12/03/2022] Open

Woo HJ, Yu C, Kumar K, Gold B, Reifman J. Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism. BMC Genomics 2016;17:695. [PMID: 27576376 PMCID: PMC5006276 DOI: 10.1186/s12864-016-2871-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Received: 11/18/2015] [Accepted: 07/01/2016] [Indexed: 12/18/2022] Open

Abstract

BACKGROUND

Genome-wide association studies provide important insights into the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods considering each SNP pair separately have provided insights, a large portion of expected heritability may reside in higher-order interaction effects.

RESULTS

We describe an inference approach (discrete discriminant analysis; DDA) designed to probe collective interactions while treating both genotypes and phenotypes as random variables. The genotype distributions in case and control groups are modeled separately based on empirical allele frequency and covariance data, whose differences yield disease risk parameters. We compared pairwise tests and collective inference methods, the latter based both on DDA and logistic regression. Analyses using simulated data demonstrated that significantly higher sensitivity and specificity can be achieved with collective inference in comparison to pairwise tests, and with DDA in comparison to logistic regression. Using age-related macular degeneration (AMD) data, we demonstrated two possible applications of DDA. In the first application, a genome-wide SNP set is reduced into a small number (∼100) of variants via filtering and SNP pairs with significant interactions are identified. We found that interactions between SNPs with highest AMD association were epigenetically active in the liver, adipocytes, and mesenchymal stem cells. In the other application, multiple groups of SNPs were formed from the genome-wide data and their relative strengths of association were compared using cross-validation. This analysis allowed us to discover novel collections of loci for which interactions between SNPs play significant roles in their disease association. In particular, we considered pathway-based groups of SNPs containing up to ∼10,000 variants in each group. In addition to pathways related to complement activation, our collective inference pointed to pathway groups involved in phospholipid synthesis, oxidative stress, and apoptosis, consistent with the AMD pathogenesis mechanism where the dysfunction of retinal pigment epithelium cells plays central roles.

CONCLUSIONS

The simultaneous inference of collective interaction effects within a set of SNPs has the potential to reveal novel aspects of disease association.

Sun L, Wang C, Hu YQ. Utilizing mutual information for detecting rare and common variants associated with a categorical trait. PeerJ 2016;4:e2139. [PMID: 27350900 PMCID: PMC4918222 DOI: 10.7717/peerj.2139] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 02/26/2016] [Accepted: 05/25/2016] [Indexed: 11/20/2022] Open

Abstract

Background. Genome-wide association studies have succeeded in detecting novel common variants associated with complex diseases. As a result of rapid changes in next-generation sequencing technology, a large amount of sequencing data is being generated, which offers great opportunities to identify rare variants that could explain a larger proportion of missing heritability. Many effective and powerful methods have been proposed, although they are usually limited to continuous, dichotomous or ordinal traits. Traits with nominal categorical features are commonly observed in complex diseases, especially in mental disorders, which motivates the incorporation of the characteristics of the categorical trait into association studies with rare and common variants.

Methods. We construct two simple and intuitive nonparametric tests, MIT and aMIT, based on mutual information for detecting association between genetic variants in a gene or region and a categorical trait. MIT and aMIT can gauge the difference among the distributions of rare and common variants across a region given every categorical trait value. If there is little association between variants and a categorical trait, MIT or aMIT approximately equals zero. The larger the difference in distributions, the greater the values of MIT and aMIT. Therefore, MIT and aMIT have the potential for detecting functional variants.

Results. We checked the validity of the proposed statistics and compared them to existing ones through extensive simulation studies with varied combinations of the numbers of rare causal, rare non-causal, common causal, and common non-causal variants, deleterious and protective effects, various minor allele frequencies, and different levels of linkage disequilibrium. The results show that our methods have higher statistical power than conventional ones, including the likelihood-based score test, in most cases: (1) there are multiple genetic variants in a gene or region; (2) both protective and deleterious variants are present; (3) there exist rare and common variants; and (4) more than half of the variants are neutral. The proposed tests are applied to the data from the Collaborative Studies on Genetics of Alcoholism, and a competent performance is exhibited therein.

Discussion. As a complement to the existing methods, which mainly focus on quantitative traits, this study provides the nonparametric tests MIT and aMIT for detecting variants associated with a categorical trait. Furthermore, we plan to investigate the association between rare variants and multiple categorical traits.

Lee W, Sjölander A, Pawitan Y. A Critical Look at Entropy-Based Gene-Gene Interaction Measures. Genet Epidemiol 2016;40:416-24. [PMID: 27229752 DOI: 10.1002/gepi.21974] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 08/03/2015] [Revised: 02/28/2015] [Accepted: 03/17/2016] [Indexed: 11/12/2022]

Mas S, Gassó P, Morer A, Calvo A, Bargalló N, Lafuente A, Lázaro L. Integrating Genetic, Neuropsychological and Neuroimaging Data to Model Early-Onset Obsessive Compulsive Disorder Severity. PLoS One 2016;11:e0153846. [PMID: 27093171 PMCID: PMC4836736 DOI: 10.1371/journal.pone.0153846] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 11/30/2015] [Accepted: 04/05/2016] [Indexed: 01/03/2023] Open

Affiliation(s)

Sergi Mas Dept. Anatomic Pathology, Pharmacology and Microbiology, University of Barcelona, Barcelona, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Barcelona, Spain; Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
Patricia Gassó Dept. Anatomic Pathology, Pharmacology and Microbiology, University of Barcelona, Barcelona, Spain; Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
Astrid Morer Department of Child and Adolescent Psychiatry and Psychology, Institute of Neurosciences, Hospital Clinic de Barcelona, Barcelona, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Barcelona, Spain; Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
Anna Calvo Magnetic Resonance Image Core Facility, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain; Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
Nuria Bargalló Department of Radiology, Centre de Diagnostic per la Imatge, Hospital Clínic, Barcelona, Spain; Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
Amalia Lafuente Dept. Anatomic Pathology, Pharmacology and Microbiology, University of Barcelona, Barcelona, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Barcelona, Spain; Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain
Luisa Lázaro Department of Child and Adolescent Psychiatry and Psychology, Institute of Neurosciences, Hospital Clinic de Barcelona, Barcelona, Spain; Dept. Psychiatry and Clinical Psychobiology, University of Barcelona, Barcelona, Spain; Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Barcelona, Spain; Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain


The application of information theory for the research of aging and aging-related diseases. Prog Neurobiol 2016;157:158-173. [PMID: 27004830 DOI: 10.1016/j.pneurobio.2016.03.005] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2015] [Revised: 03/13/2016] [Accepted: 03/19/2016] [Indexed: 11/23/2022]

Abstract

This article reviews the application of information-theoretical analysis, employing measures of entropy and mutual information, for the study of aging and aging-related diseases. The research of aging and aging-related diseases is particularly suitable for the application of information theory methods, as aging processes and related diseases are multi-parametric, with continuous parameters coexisting alongside discrete parameters, and with the relations between the parameters being as a rule non-linear. Information theory provides unique analytical capabilities for the solution of such problems, with unique advantages over common linear biostatistics. Among the age-related diseases, information theory has been used in the study of neurodegenerative diseases (particularly using EEG time series for diagnosis and prediction), cancer (particularly for establishing individual and combined cancer biomarkers), diabetes (mainly utilizing mutual information to characterize the diseased and aging states), and heart disease (mainly for the analysis of heart rate variability). Few works have employed information theory for the analysis of general aging processes and frailty, as underlying determinants and possible early preclinical diagnostic measures for aging-related diseases. Generally, the use of information-theoretical analysis permits not only establishing the (non-linear) correlations between diagnostic or therapeutic parameters of interest, but may also provide a theoretical insight into the nature of aging and related diseases by establishing the measures of variability, adaptation, regulation or homeostasis, within a system of interest. It may be hoped that the increased use of such measures in research may considerably increase diagnostic and therapeutic capabilities and the fundamental theoretical mathematical understanding of aging and disease.


Yu Z, Demetriou M, Gillen DL. Genome-Wide Analysis of Gene-Gene and Gene-Environment Interactions Using Closed-Form Wald Tests. Genet Epidemiol 2015;39:446-55. [PMID: 26095143 DOI: 10.1002/gepi.21907] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2014] [Revised: 02/25/2015] [Accepted: 05/06/2015] [Indexed: 01/31/2023]

Su L, Liu G, Wang H, Tian Y, Zhou Z, Han L, Yan L. Research on single nucleotide polymorphisms interaction detection from network perspective. PLoS One 2015;10:e0119146. [PMID: 25763929 PMCID: PMC4357495 DOI: 10.1371/journal.pone.0119146] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 01/09/2015] [Indexed: 12/02/2022] Open

Moore JH, Hill DP. Epistasis analysis using artificial intelligence. Methods Mol Biol 2015;1253:327-46. [PMID: 25403541 DOI: 10.1007/978-1-4939-2155-3_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

White MJ, Tacconelli A, Chen JS, Wejse C, Hill PC, Gomes VF, Velez-Edwards DR, Østergaard LJ, Hu T, Moore JH, Novelli G, Scott WK, Williams SM, Sirugo G. Epiregulin (EREG) and human V-ATPase (TCIRG1): genetic variation, ethnicity and pulmonary tuberculosis susceptibility in Guinea-Bissau and The Gambia. Genes Immun 2014;15:370-7. [PMID: 24898387 DOI: 10.1038/gene.2014.28] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Revised: 04/23/2014] [Accepted: 04/24/2014] [Indexed: 02/07/2023]

Affiliation(s)

M J White [1] Center for Human Genetics Research, Vanderbilt University, Nashville, TN, USA [2] Department of Genetics and Institute of Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH, USA
A Tacconelli Centro di Ricerca, Ospedale San Pietro Fatebenefratelli, Rome, Italy
J S Chen Department of Genetics and Institute of Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH, USA
C Wejse [1] Bandim Health Project, Danish Epidemiology Science Centre and Statens Serum Institute, Bissau, Guinea-Bissau [2] Department of Infectious Diseases, Aarhus University Hospital, Skejby, Denmark [3] Center for Global Health, School of Public Health, Aarhus University, Skejby, Denmark
P C Hill [1] Centre for International Health, University of Otago School of Medicine, Dunedin, New Zealand [2] MRC Laboratories, Fajara, The Gambia
V F Gomes Bandim Health Project, Danish Epidemiology Science Centre and Statens Serum Institute, Bissau, Guinea-Bissau
D R Velez-Edwards [1] Vanderbilt Epidemiology Center, Vanderbilt University, Nashville, TN, USA [2] Institute for Medicine and Public Health, Vanderbilt University, Nashville, TN, USA [3] Center for Human Genetics Research, Vanderbilt University, Nashville, TN, USA [4] Department of Obstetrics and Gynecology, Vanderbilt University, Nashville, TN, USA
L J Østergaard Department of Infectious Diseases, Aarhus University Hospital, Skejby, Denmark
T Hu Department of Genetics and Institute of Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH, USA
J H Moore Department of Genetics and Institute of Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH, USA
G Novelli [1] Centro di Ricerca, Ospedale San Pietro Fatebenefratelli, Rome, Italy [2] Dipartimento di Biomedicina e Prevenzione, Sezione di Genetica, Università di Roma 'Tor Vergata', Rome, Italy
W K Scott Dr John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, University of Miami Miller School of Medicine, Miami, FL, USA
S M Williams Department of Genetics and Institute of Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH, USA
G Sirugo Centro di Ricerca, Ospedale San Pietro Fatebenefratelli, Rome, Italy


El-Serag HB, Kanwal F, Davila JA, Kramer J, Richardson P. A new laboratory-based algorithm to predict development of hepatocellular carcinoma in patients with hepatitis C and cirrhosis. Gastroenterology 2014;146:1249-55.e1. [PMID: 24462733 PMCID: PMC3992177 DOI: 10.1053/j.gastro.2014.01.045] [Citation(s) in RCA: 127] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/02/2013] [Revised: 01/13/2014] [Accepted: 01/20/2014] [Indexed: 02/08/2023]

Abstract

BACKGROUND & AIMS

Serum levels of α-fetoprotein (AFP) are influenced not only by the presence of hepatocellular carcinoma (HCC), but also by the underlying severity and activity of liver disease, which is reflected by liver function tests. We constructed an AFP-based algorithm that included these factors to identify patients at risk for HCC, and tested its predictive ability in a large set of patients with cirrhosis.

METHODS

We used the national Department of Veterans Affairs Hepatitis C Virus Clinical Case Registry to identify patients with cirrhosis, results from at least 1 AFP test, and 6 months of follow-up. Our algorithm included data on age; levels of aspartate aminotransferase, alanine aminotransferase (ALT), alkaline phosphatase, total bilirubin, albumin, creatinine, and hemoglobin; prothrombin time; and numbers of platelets and white cells. We examined the operating characteristics (calibration, discrimination, predictive values) of several different algorithms for identification of patients who would develop HCC within 6 months of the AFP test. We assessed our final model in the development and validation subsets.

RESULTS

We identified 11,721 patients with hepatitis C virus-related cirrhosis in whom 35,494 AFP tests were performed, and 987 patients developed HCC. A predictive model that included data on levels of AFP, ALT, and platelets, along with age at time of AFP test (and interaction terms between AFP and ALT, and AFP and platelets), best discriminated between patients who did and did not develop HCC. Using this AFP-adjusted model, the predictive accuracy increased at different AFP cutoffs compared with AFP alone. At any given AFP value, low numbers of platelets and ALT and older age were associated with increased risk of HCC, and high levels of ALT and normal/high numbers of platelets were associated with low risk for HCC. For example, the probabilities of HCC, based only on 20 ng/mL and 120 ng/mL AFP, were 3.5% and 11.4%, respectively. However, patients with the same AFP values (20 ng/mL and 120 ng/mL) who were 70 years old, with ALT levels of 40 IU/mL and platelet counts of 100,000, had probabilities of developing HCC of 8.1% and 29.0%, respectively.

CONCLUSIONS

We developed and validated an algorithm based on levels of AFP, platelets, and ALT, along with age, which increased the predictive value for identifying patients with hepatitis C virus-associated cirrhosis likely to develop HCC within 6 months. If validated in other patient groups, this model would have immediate clinical applicability.


Vinga S. Information theory applications for biological sequence analysis. Brief Bioinform 2014;15:376-89. [PMID: 24058049 PMCID: PMC7109941 DOI: 10.1093/bib/bbt068] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 08/17/2013] [Indexed: 01/13/2023] Open

Hu T, Pan Q, Andrew AS, Langer JM, Cole MD, Tomlinson CR, Karagas MR, Moore JH. Functional genomics annotation of a statistical epistasis network associated with bladder cancer susceptibility. BioData Min 2014;7:5. [PMID: 24725556 PMCID: PMC3989783 DOI: 10.1186/1756-0381-7-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Accepted: 04/05/2014] [Indexed: 11/13/2022] Open

Abstract

BACKGROUND

Several different genetic and environmental factors have been identified as independent risk factors for bladder cancer in population-based studies. Recent studies have turned to understanding the role of gene-gene and gene-environment interactions in determining risk. We previously developed the bioinformatics framework of statistical epistasis networks (SEN) to characterize the global structure of interacting genetic factors associated with a particular disease or clinical outcome. By applying SEN to a population-based study of bladder cancer among Caucasians in New Hampshire, we were able to identify a set of connected genetic factors with strong and significant interaction effects on bladder cancer susceptibility.

FINDINGS

To support our statistical findings using networks, in the present study, we performed pathway enrichment analyses on the set of genes identified using SEN, and found that they are associated with the carcinogen benzo[a]pyrene, a component of tobacco smoke. We further carried out an mRNA expression microarray experiment to validate statistical genetic interactions, and to determine if the set of genes identified in the SEN were differentially expressed in a normal bladder cell line and a bladder cancer cell line in the presence or absence of benzo[a]pyrene. Significant nonrandom sets of genes from the SEN were found to be differentially expressed in response to benzo[a]pyrene in both the normal bladder cells and the bladder cancer cells. In addition, the patterns of gene expression were significantly different between these two cell types.

CONCLUSIONS

The enrichment analyses and the gene expression microarray results support the idea that SEN analysis of bladder cancer in population-based studies is able to identify biologically meaningful statistical patterns. These results bring us a step closer to a systems genetic approach to understanding cancer susceptibility that integrates population and laboratory-based studies.


Pan Q, Hu T, Malley JD, Andrew AS, Karagas MR, Moore JH. A system-level pathway-phenotype association analysis using synthetic feature random forest. Genet Epidemiol 2014;38:209-19. [PMID: 24535726 DOI: 10.1002/gepi.21794] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Revised: 11/21/2013] [Accepted: 01/02/2014] [Indexed: 11/07/2022]

Abstract

As the cost of genome-wide genotyping decreases, the number of genome-wide association studies (GWAS) has increased considerably. However, the transition from GWAS findings to the underlying biology of various phenotypes remains challenging. As a result, due to its system-level interpretability, pathway analysis has become a popular tool for gaining insights on the underlying biology from high-throughput genetic association data. In pathway analyses, gene sets representing particular biological processes are tested for significant associations with a given phenotype. Most existing pathway analysis approaches rely on single-marker statistics and assume that pathways are independent of each other. As biological systems are driven by complex biomolecular interactions, embracing the complex relationships between single-nucleotide polymorphisms (SNPs) and pathways needs to be addressed. To incorporate the complexity of gene-gene interactions and pathway-pathway relationships, we propose a system-level pathway analysis approach, synthetic feature random forest (SF-RF), which is designed to detect pathway-phenotype associations without making assumptions about the relationships among SNPs or pathways. In our approach, the genotypes of SNPs in a particular pathway are aggregated into a synthetic feature representing that pathway via Random Forest (RF). Multiple synthetic features are analyzed using RF simultaneously and the significance of a synthetic feature indicates the significance of the corresponding pathway. We further complement SF-RF with pathway-based Statistical Epistasis Network (SEN) analysis that evaluates interactions among pathways. By investigating the pathway SEN, we hope to gain additional insights into the genetic mechanisms contributing to the pathway-phenotype association. We apply SF-RF to a population-based genetic study of bladder cancer and further investigate the mechanisms that help explain the pathway-phenotype associations using SEN. The bladder cancer associated pathways we found are both consistent with existing biological knowledge and reveal novel and plausible hypotheses for future biological validations.


Hutter CM, Mechanic LE, Chatterjee N, Kraft P, Gillanders EM. Gene-environment interactions in cancer epidemiology: a National Cancer Institute Think Tank report. Genet Epidemiol 2013;37:643-57. [PMID: 24123198 PMCID: PMC4143122 DOI: 10.1002/gepi.21756] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Revised: 08/06/2013] [Accepted: 08/14/2013] [Indexed: 01/04/2023]

Gong M, Yi Q, Wang W. Association between NQO1 C609T polymorphism and bladder cancer susceptibility: a systemic review and meta-analysis. Tumour Biol 2013;34:2551-6. [PMID: 23749485 DOI: 10.1007/s13277-013-0799-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2013] [Accepted: 04/08/2013] [Indexed: 12/24/2022] Open

Abstract

There is growing evidence for the important roles of genetic factors in the host's susceptibility to bladder cancer. NAD(P)H:quinone oxidoreductase 1 (NQO1) is a cytosolic enzyme that catalyzes the two-electron reduction of quinoid compounds into hydroquinones. Since the NQO1 C609T polymorphism is linked to enzymatic activity of NQO1, it has also been hypothesized that the NQO1 C609T polymorphism may affect the host's susceptibility to bladder cancer by modifying the exposure to carcinogens. Many studies have assessed the association between the NQO1 C609T polymorphism and bladder cancer risk, but they reported contradictory results. We conducted a meta-analysis to examine the hypothesis that the NQO1 C609T polymorphism modifies the risk of bladder cancer. Eleven case-control studies with 2,937 bladder cancer cases and 3,008 controls were included in the meta-analysis. Overall, there was no obvious association between the NQO1 C609T polymorphism and bladder cancer susceptibility (for T versus C: odds ratio (OR) = 1.12, 95% confidence interval (95% CI) 0.99-1.26, P_OR = 0.069; for TT versus CC: OR = 1.31, 95% CI 0.95-1.81, P_OR = 0.100; for TT/CT versus CC: OR = 1.06, 95% CI 0.95-1.18, P_OR = 0.304; for TT versus CT/CC: OR = 1.29, 95% CI 0.94-1.77, P_OR = 0.112). After adjusting for heterogeneity, meta-analysis of the remaining 10 studies showed that there was an obvious association between the NQO1 C609T polymorphism and bladder cancer susceptibility (for T versus C: OR = 1.18, 95% CI 1.06-1.31, P_OR = 0.003; for TT versus CC: OR = 1.47, 95% CI 1.14-1.90, P_OR = 0.003; for TT/CT versus CC: OR = 1.16, 95% CI 1.01-1.34, P_OR = 0.036; for TT versus CT/CC: OR = 1.39, 95% CI 1.10-1.75, P_OR = 0.006). There was low risk of publication bias. Therefore, our meta-analysis suggests that the NQO1 C609T polymorphism is associated with bladder cancer susceptibility.


Hu T, Chen Y, Kiralis JW, Moore JH. ViSEN: methodology and software for visualization of statistical epistasis networks. Genet Epidemiol 2013;37:283-5. [PMID: 23468157 DOI: 10.1002/gepi.21718] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Revised: 12/20/2012] [Accepted: 02/05/2013] [Indexed: 11/06/2022]

Hu T, Chen Y, Kiralis JW, Collins RL, Wejse C, Sirugo G, Williams SM, Moore JH. An information-gain approach to detecting three-way epistatic interactions in genetic association studies. J Am Med Inform Assoc 2013;20:630-6. [PMID: 23396514 PMCID: PMC3721169 DOI: 10.1136/amiajnl-2012-001525] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open

Wu C, Li S, Cui Y. Genetic association studies: an information content perspective. Curr Genomics 2012;13:566-73. [PMID: 23633916 PMCID: PMC3468889 DOI: 10.2174/138920212803251382] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Revised: 06/04/2012] [Accepted: 06/18/2012] [Indexed: 01/02/2023] Open

Fan R, Albert PS, Schisterman EF. A discussion of gene-gene and gene-environment interactions and longitudinal genetic analysis of complex traits. Stat Med 2012;31:2565-8. [PMID: 22969024 PMCID: PMC3458189 DOI: 10.1002/sim.5495] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Aschard H, Lutz S, Maus B, Duell EJ, Fingerlin TE, Chatterjee N, Kraft P, Van Steen K. Challenges and opportunities in genome-wide environmental interaction (GWEI) studies. Hum Genet 2012;131:1591-613. [PMID: 22760307 DOI: 10.1007/s00439-012-1192-0] [Citation(s) in RCA: 110] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Accepted: 06/11/2012] [Indexed: 02/03/2023]

McKinney BA, Pajewski NM. Six Degrees of Epistasis: Statistical Network Models for GWAS. Front Genet 2012;2:109. [PMID: 22303403 PMCID: PMC3261632 DOI: 10.3389/fgene.2011.00109] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2011] [Accepted: 12/22/2011] [Indexed: 11/18/2022] Open