Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Nelson MR, Kardia SL, Ferrell RE, Sing CF. A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation. Genome Res 2001;11:458-70. [PMID: 11230170 PMCID: PMC311041 DOI: 10.1101/gr.172901] [Citation(s) in RCA: 257] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2000] [Accepted: 01/02/2001] [Indexed: 11/24/2022]

For:	Nelson MR, Kardia SL, Ferrell RE, Sing CF. A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation. Genome Res 2001;11:458-70. [PMID: 11230170 PMCID: PMC311041 DOI: 10.1101/gr.172901] [Citation(s) in RCA: 257] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2000] [Accepted: 01/02/2001] [Indexed: 11/24/2022]

Number

Cited by Other Article(s)

Sha Z, Freda PJ, Bhandary P, Ghosh A, Matsumoto N, Moore JH, Hu T. Distinct Network Patterns Emerge from Cartesian and XOR Epistasis Models: A Comparative Network Science Analysis. RESEARCH SQUARE 2024:rs.3.rs-4392123. [PMID: 38826481 PMCID: PMC11142370 DOI: 10.21203/rs.3.rs-4392123/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]

Abstract

Background

Epistasis, the phenomenon where the effect of one gene (or variant) is masked or modified by one or more other genes, can significantly contribute to the observed phenotypic variance of complex traits. To date, it has been generally assumed that genetic interactions can be detected using a Cartesian, or multiplicative, interaction model commonly utilized in standard regression approaches. However, a recent study investigating epistasis in obesity-related traits in rats and mice has identified potential limitations of the Cartesian model, revealing that it only detects some of the genetic interactions occurring in these systems. By applying an alternative approach, the exclusive-or (XOR) model, the researchers detected a greater number of epistatic interactions and identified more biologically relevant ontological terms associated with the interacting loci. This suggests that the XOR model may provide a more comprehensive understanding of epistasis in these species and phenotypes. To further explore these findings and determine if different interaction models also make up distinct epistatic networks, we leverage network science to provide a more comprehensive view into the genetic interactions underlying BMI in this system.

Results

Our comparative analysis of networks derived from Cartesian and XOR interaction models in rats (Rattus norvegicus) uncovers distinct topological characteristics for each model-derived network. Notably, we discover that networks based on the XOR model exhibit an enhanced sensitivity to epistatic interactions. This sensitivity enables the identification of network communities, revealing novel trait-related biological functions through enrichment analysis. Furthermore, we identify triangle network motifs in the XOR epistatic network, suggestive of higher-order epistasis, based on the topology of lower-order epistasis.

Conclusions

These findings highlight the XOR model's ability to uncover meaningful biological associations as well as higher-order epistasis from lower-order epistatic networks. Additionally, our results demonstrate that network approaches not only enhance epistasis detection capabilities but also provide more nuanced understandings of genetic architectures underlying complex traits. The identification of community structures and motifs within these distinct networks, especially in XOR, points to the potential for network science to aid in the discovery of novel genetic pathways and regulatory networks. Such insights are important for advancing our understanding of phenotype-genotype relationships.

Collapse

Ma J, Li J, Chen Y, Yang Z, He Y. Poor statistical power in population-based association study of gene interaction. BMC Med Genomics 2024;17:111. [PMID: 38678264 PMCID: PMC11055307 DOI: 10.1186/s12920-024-01884-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 04/19/2024] [Indexed: 04/29/2024] Open

Batista S, Madar VS, Freda PJ, Bhandary P, Ghosh A, Matsumoto N, Chitre AS, Palmer AA, Moore JH. Interaction models matter: an efficient, flexible computational framework for model-specific investigation of epistasis. BioData Min 2024;17:7. [PMID: 38419006 PMCID: PMC10900690 DOI: 10.1186/s13040-024-00358-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 02/20/2024] [Indexed: 03/02/2024] Open

Abstract

PURPOSE

Epistasis, the interaction between two or more genes, is integral to the study of genetics and is present throughout nature. Yet, it is seldom fully explored as most approaches primarily focus on single-locus effects, partly because analyzing all pairwise and higher-order interactions requires significant computational resources. Furthermore, existing methods for epistasis detection only consider a Cartesian (multiplicative) model for interaction terms. This is likely limiting as epistatic interactions can evolve to produce varied relationships between genetic loci, some complex and not linearly separable.

METHODS

We present new algorithms for the interaction coefficients for standard regression models for epistasis that permit many varied models for the interaction terms for loci and efficient memory usage. The algorithms are given for two-way and three-way epistasis and may be generalized to higher order epistasis. Statistical tests for the interaction coefficients are also provided. We also present an efficient matrix based algorithm for permutation testing for two-way epistasis. We offer a proof and experimental evidence that methods that look for epistasis only at loci that have main effects may not be justified. Given the computational efficiency of the algorithm, we applied the method to a rat data set and mouse data set, with at least 10,000 loci and 1,000 samples each, using the standard Cartesian model and the XOR model to explore body mass index.

RESULTS

This study reveals that although many of the loci found to exhibit significant statistical epistasis overlap between models in rats, the pairs are mostly distinct. Further, the XOR model found greater evidence for statistical epistasis in many more pairs of loci in both data sets with almost all significant epistasis in mice identified using XOR. In the rat data set, loci involved in epistasis under the XOR model are enriched for biologically relevant pathways.

CONCLUSION

Our results in both species show that many biologically relevant epistatic relationships would have been undetected if only one interaction model was applied, providing evidence that varied interaction models should be implemented to explore epistatic interactions that occur in living systems.

Collapse

Yang CH, Hou MF, Chuang LY, Yang CS, Lin YD. Dimensionality reduction approach for many-objective epistasis analysis. Brief Bioinform 2023;24:6858949. [PMID: 36458451 DOI: 10.1093/bib/bbac512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 10/07/2022] [Accepted: 10/26/2022] [Indexed: 12/04/2022] Open

Abstract

In epistasis analysis, single-nucleotide polymorphism-single-nucleotide polymorphism interactions (SSIs) among genes may, alongside other environmental factors, influence the risk of multifactorial diseases. To identify SSI between cases and controls (i.e. binary traits), the score for model quality is affected by different objective functions (i.e. measurements) because of potential disease model preferences and disease complexities. Our previous study proposed a multiobjective approach-based multifactor dimensionality reduction (MOMDR), with the results indicating that two objective functions could enhance SSI identification with weak marginal effects. However, SSI identification using MOMDR remains a challenge because the optimal measure combination of objective functions has yet to be investigated. This study extended MOMDR to the many-objective version (i.e. many-objective MDR, MaODR) by integrating various disease probability measures based on a two-way contingency table to improve the identification of SSI between cases and controls. We introduced an objective function selection approach to determine the optimal measure combination in MaODR among 10 well-known measures. In total, 6 disease models with and 40 disease models without marginal effects were used to evaluate the general algorithms, namely those based on multifactor dimensionality reduction, MOMDR and MaODR. Our results revealed that the MaODR-based three objective function model, correct classification rate, likelihood ratio and normalized mutual information (MaODR-CLN) exhibited the higher 6.47% detection success rates (Accuracy) than MOMDR and higher 17.23% detection success rates than MDR through the application of an objective function selection approach. In a Wellcome Trust Case Control Consortium, MaODR-CLN successfully identified the significant SSIs (P < 0.001) associated with coronary artery disease. We performed a systematic analysis to identify the optimal measure combination in MaODR among 10 objective functions. Our combination detected SSIs-based binary traits with weak marginal effects and thus reduced spurious variables in the score model. MOAI is freely available at https://sites.google.com/view/maodr/home.

Collapse

Pudjihartono N, Fadason T, Kempa-Liehr AW, O'Sullivan JM. A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction. FRONTIERS IN BIOINFORMATICS 2022;2:927312. [PMID: 36304293 PMCID: PMC9580915 DOI: 10.3389/fbinf.2022.927312] [Citation(s) in RCA: 75] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Accepted: 06/03/2022] [Indexed: 01/14/2023] Open

Martins J, Yusupov N, Binder EB, Brückl TM, Czamara D. Early adversity as the prototype gene × environment interaction in mental disorders? Pharmacol Biochem Behav 2022;215:173371. [PMID: 35271857 DOI: 10.1016/j.pbb.2022.173371] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Revised: 02/03/2022] [Accepted: 02/28/2022] [Indexed: 10/18/2022]

Yilmaz S, Fakhouri M, Koyutürk M, Çiçek AE, Tastan O. Uncovering complementary sets of variants for predicting quantitative phenotypes. Bioinformatics 2022;38:908-917. [PMID: 34864867 DOI: 10.1093/bioinformatics/btab803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Revised: 09/21/2021] [Accepted: 11/24/2021] [Indexed: 02/03/2023] Open

Lin YD, Lee YC, Chiang CP, Moi SH, Kan JY. MOAI: a multi-outcome interaction identification approach reveals an interaction between vaspin and carcinoembryonic antigen on colorectal cancer prognosis. Brief Bioinform 2021;23:6398687. [PMID: 34661627 DOI: 10.1093/bib/bbab427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 09/14/2021] [Accepted: 09/18/2021] [Indexed: 11/12/2022] Open

Dyson G. An Application of the Patient Rule-Induction Method to Detect Clinically Meaningful Subgroups from Failed Phase III Clinical Trials. INTERNATIONAL JOURNAL OF CLINICAL BIOSTATISTICS AND BIOMETRICS 2021;7. [PMID: 34632463 PMCID: PMC8496893 DOI: 10.23937/2469-5831/1510038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

MIDESP: Mutual Information-Based Detection of Epistatic SNP Pairs for Qualitative and Quantitative Phenotypes. BIOLOGY 2021;10:biology10090921. [PMID: 34571798 PMCID: PMC8469369 DOI: 10.3390/biology10090921] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 09/09/2021] [Accepted: 09/13/2021] [Indexed: 11/17/2022]

Abstract

Simple Summary

The interactions between SNPs, which are known as epistasis, can strongly influence the phenotype. Their detection is still a challenge, which is made even more difficult through the existence of background associations that can hide correct epistatic interactions. To address the limitations of existing methods, we present in this study our novel method MIDESP for the detection of epistatic SNP pairs. It is the first mutual information-based method that can be applied to both qualitative and quantitative phenotypes and which explicitly accounts for background associations in the dataset.

Abstract

The interactions between SNPs result in a complex interplay with the phenotype, known as epistasis. The knowledge of epistasis is a crucial part of understanding genetic causes of complex traits. However, due to the enormous number of SNP pairs and their complex relationship to the phenotype, identification still remains a challenging problem. Many approaches for the detection of epistasis have been developed using mutual information (MI) as an association measure. However, these methods have mainly been restricted to case–control phenotypes and are therefore of limited applicability for quantitative traits. To overcome this limitation of MI-based methods, here, we present an MI-based novel algorithm, MIDESP, to detect epistasis between SNPs for qualitative as well as quantitative phenotypes. Moreover, by incorporating a dataset-dependent correction technique, we deal with the effect of background associations in a genotypic dataset to separate correct epistatic interaction signals from those of false positive interactions resulting from the effect of single SNP×phenotype associations. To demonstrate the effectiveness of MIDESP, we apply it on two real datasets with qualitative and quantitative phenotypes, respectively. Our results suggest that by eliminating the background associations, MIDESP can identify important genes, which play essential roles for bovine tuberculosis or the egg weight of chickens.

Collapse

Okazaki A, Horpaopan S, Zhang Q, Randesi M, Ott J. Genotype Pattern Mining for Pairs of Interacting Variants Underlying Digenic Traits. Genes (Basel) 2021;12:1160. [PMID: 34440333 PMCID: PMC8391494 DOI: 10.3390/genes12081160] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 07/23/2021] [Accepted: 07/27/2021] [Indexed: 12/15/2022] Open

Yilmaz S, Tastan O, Cicek AE. SPADIS: An Algorithm for Selecting Predictive and Diverse SNPs in GWAS. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:1208-1216. [PMID: 31443041 DOI: 10.1109/tcbb.2019.2935437] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

Wu Q, Nasoz F, Jung J, Bhattarai B, Han MV, Greenes RA, Saag KG. Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men. Sci Rep 2021;11:4482. [PMID: 33627720 PMCID: PMC7904941 DOI: 10.1038/s41598-021-83828-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 02/09/2021] [Indexed: 02/07/2023] Open

Guo X. JS-MA: A Jensen-Shannon Divergence Based Method for Mapping Genome-Wide Associations on Multiple Diseases. Front Genet 2020;11:507038. [PMID: 33193597 PMCID: PMC7662082 DOI: 10.3389/fgene.2020.507038] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Accepted: 09/21/2020] [Indexed: 12/14/2022] Open

Zhou X, Chan KCC, Huang Z, Wang J. Determining dependency and redundancy for identifying gene-gene interaction associated with complex disease. J Bioinform Comput Biol 2020;18:2050035. [PMID: 33064052 DOI: 10.1142/s0219720020500353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Wen J, Ford CT, Janies D, Shi X. A parallelized strategy for epistasis analysis based on Empirical Bayesian Elastic Net models. Bioinformatics 2020;36:3803-3810. [PMID: 32227194 DOI: 10.1093/bioinformatics/btaa216] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Revised: 03/05/2020] [Accepted: 03/26/2020] [Indexed: 11/14/2022] Open

Testing the Significance of Interactions in Genetic Studies Using Interaction Information and Resampling Technique. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7304020 DOI: 10.1007/978-3-030-50420-5_38] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]

Application of simulation-based CYP26 SNP-environment barcodes for evaluating the occurrence of oral malignant disorders by odds ratio-based binary particle swarm optimization: A case-control study in the Taiwanese population. PLoS One 2019;14:e0220719. [PMID: 31465460 PMCID: PMC6715230 DOI: 10.1371/journal.pone.0220719] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Accepted: 07/22/2019] [Indexed: 12/15/2022] Open

Lawania S, Singh A, Sharma S, Singh N, Behera D. The multi-faceted high order polymorphic synergistic interactions among nucleotide excision repair genes increase the risk of lung cancer in North Indians. Mutat Res 2019;816-818:111673. [PMID: 31195348 DOI: 10.1016/j.mrfmmm.2019.111673] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2019] [Revised: 05/08/2019] [Accepted: 06/04/2019] [Indexed: 11/25/2022]

Guan B, Zhao Y, Sun W. Ant colony optimization with an automatic adjustment mechanism for detecting epistatic interactions. Comput Biol Chem 2018;77:354-362. [DOI: 10.1016/j.compbiolchem.2018.11.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Revised: 10/01/2018] [Accepted: 11/05/2018] [Indexed: 12/13/2022]

Hou TT, Lin F, Bai S, Cleves MA, Xu HM, Lou XY. Generalized multifactor dimensionality reduction approaches to identification of genetic interactions underlying ordinal traits. Genet Epidemiol 2018;43:24-36. [PMID: 30387901 DOI: 10.1002/gepi.22169] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2018] [Revised: 08/31/2018] [Accepted: 09/21/2018] [Indexed: 12/11/2022]

Zhou X, Chan KCC. Detecting gene-gene interactions for complex quantitative traits using generalized fuzzy classification. BMC Bioinformatics 2018;19:329. [PMID: 30227829 PMCID: PMC6145205 DOI: 10.1186/s12859-018-2361-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 09/09/2018] [Indexed: 11/10/2022] Open

Cole BS, Hall MA, Urbanowicz RJ, Gilbert‐Diamond D, Moore JH. Analysis of Gene‐Gene Interactions. ACTA ACUST UNITED AC 2018;95:1.14.1-1.14.10. [DOI: 10.1002/cphg.45] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Xu Y, Wu Y, Wu J. Capturing pair-wise epistatic effects associated with three agronomic traits in barley. Genetica 2018;146:161-170. [PMID: 29349538 DOI: 10.1007/s10709-018-0008-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 01/11/2018] [Indexed: 11/25/2022]

Mielniczuk J, Teisseyre P. A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited. Genet Epidemiol 2017;42:187-200. [PMID: 29265411 DOI: 10.1002/gepi.22108] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Revised: 10/23/2017] [Accepted: 11/15/2017] [Indexed: 11/09/2022]

Hall MA, Moore JH, Ritchie MD. Embracing Complex Associations in Common Traits: Critical Considerations for Precision Medicine. Trends Genet 2017;32:470-484. [PMID: 27392675 DOI: 10.1016/j.tig.2016.06.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2016] [Revised: 06/01/2016] [Accepted: 06/02/2016] [Indexed: 10/21/2022]

Wen J, Quitadamo A, Hall B, Shi X. Epistasis analysis of microRNAs on pathological stages in colon cancer based on an Empirical Bayesian Elastic Net method. BMC Genomics 2017. [PMID: 29513198 PMCID: PMC5657052 DOI: 10.1186/s12864-017-4130-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Abstract

Background

Colon cancer is a leading cause of worldwide cancer death. It has become clear that microRNAs (miRNAs) play a role in the progress of colon cancer and understanding the effect of miRNAs on tumorigenesis could lead to better prognosis and improved treatment. However, most studies have focused on studying differentially expressed miRNAs between tumor and non-tumor samples or between stages in tumor tissue. Limited work has conducted to study the interactions or epistasis between miRNAs and how the epistasis brings about effect on tumor progression. In this study, we investigate the main and pair-wise epistatic effects of miRNAs on the pathological stages of colon cancer using datasets from The Cancer Genome Atlas.

Results

We develop a workflow composed of multiple steps for feature selection based on the Empirical Bayesian Elastic Net (EBEN) method. First, we identify the main effects using a model with only main effect on the phenotype. Second, a corrected phenotype is calculated by removing the significant main effect from the original phenotype. Third, we select features with epistatic effect on the corrected phenotype. Finally, we run the full model with main and epistatic effects on the previously selected main and epistatic features. Using the multi-step workflow, we identify a set of miRNAs with main and epistatic effect on the pathological stages of colon cancer. Many of miRNAs with main effect on colon cancer have been previously reported to be associated with colon cancer, and the majority of the epistatic miRNAs share common target genes that could explain their epistasis effect on the pathological stages of colon cancer. We also find many of the target genes of detected miRNAs are associated with colon cancer. Go Ontology Enrichment Analysis of the experimentally validates targets of main and epistatic miRNAs, shows that these target genes are enriched for biological processes associated with cancer progression.

Conclusion

Our results provide a set of candidate miRNAs associated with colon cancer progression that could have potential translational and therapeutic utility. Our analysis workflow offers a new opportunity to efficiently explore epistatic interactions among genetic and epigenetic factors that could be associated with human diseases. Furthermore, our workflow is flexible and can be applied to analyze the main and epistatic effect of various genetic and epigenetic factors on a wide range of phenotypes.

Electronic supplementary material

The online version of this article (10.1186/s12864-017-4130-7) contains supplementary material, which is available to authorized users.

Collapse

Kim G, Lai CQ, Arnett DK, Parnell LD, Ordovas JM, Kim Y, Kim J. Detection of gene-environment interactions in a family-based population using SCAD. Stat Med 2017;36:3547-3559. [PMID: 28707299 DOI: 10.1002/sim.7382] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2016] [Revised: 05/19/2017] [Accepted: 06/02/2017] [Indexed: 11/07/2022]

Moore JH, Andrews PC, Olson RS, Carlson SE, Larock CR, Bulhoes MJ, O'Connor JP, Greytak EM, Armentrout SL. Grid-based stochastic search for hierarchical gene-gene interactions in population-based genetic studies of common human diseases. BioData Min 2017;10:19. [PMID: 28572842 PMCID: PMC5450417 DOI: 10.1186/s13040-017-0139-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2016] [Accepted: 05/18/2017] [Indexed: 11/18/2022] Open

Abstract

Background

Large-scale genetic studies of common human diseases have focused almost exclusively on the independent main effects of single-nucleotide polymorphisms (SNPs) on disease susceptibility. These studies have had some success, but much of the genetic architecture of common disease remains unexplained. Attention is now turning to detecting SNPs that impact disease susceptibility in the context of other genetic factors and environmental exposures. These context-dependent genetic effects can manifest themselves as non-additive interactions, which are more challenging to model using parametric statistical approaches. The dimensionality that results from a multitude of genotype combinations, which results from considering many SNPs simultaneously, renders these approaches underpowered. We previously developed the multifactor dimensionality reduction (MDR) approach as a nonparametric and genetic model-free machine learning alternative. Approaches such as MDR can improve the power to detect gene-gene interactions but are limited in their ability to exhaustively consider SNP combinations in genome-wide association studies (GWAS), due to the combinatorial explosion of the search space. We introduce here a stochastic search algorithm called Crush for the application of MDR to modeling high-order gene-gene interactions in genome-wide data. The Crush-MDR approach uses expert knowledge to guide probabilistic searches within a framework that capitalizes on the use of biological knowledge to filter gene sets prior to analysis. Here we evaluated the ability of Crush-MDR to detect hierarchical sets of interacting SNPs using a biology-based simulation strategy that assumes non-additive interactions within genes and additivity in genetic effects between sets of genes within a biochemical pathway.

Results

We show that Crush-MDR is able to identify genetic effects at the gene or pathway level significantly better than a baseline random search with the same number of model evaluations. We then applied the same methodology to a GWAS for Alzheimer’s disease and showed base level validation that Crush-MDR was able to identify a set of interacting genes with biological ties to Alzheimer’s disease.

Conclusions

We discuss the role of stochastic search and cloud computing for detecting complex genetic effects in genome-wide data.

Collapse

Ultra-Fast Detection of Higher-Order Epistatic Interactions on GPUs. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/978-3-319-58943-5_34] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

Guo X, Zhang J, Cai Z, Du DZ, Pan Y. Searching Genome-Wide Multi-Locus Associations for Multiple Diseases Based on Bayesian Inference. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017;14:600-610. [PMID: 26887006 DOI: 10.1109/tcbb.2016.2527648] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Leem S, Park T. An empirical fuzzy multifactor dimensionality reduction method for detecting gene-gene interactions. BMC Genomics 2017;18:115. [PMID: 28361694 PMCID: PMC5374597 DOI: 10.1186/s12864-017-3496-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Abstract

BACKGROUND

Detection of gene-gene interaction (GGI) is a key challenge towards solving the problem of missing heritability in genetics. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGIs. MDR reduces the dimensionality of multi-factor by means of binary classification into high-risk (H) or low-risk (L) groups. Unfortunately, this simple binary classification does not reflect the uncertainty of H/L classification. Thus, we proposed Fuzzy MDR to overcome limitations of binary classification by introducing the degree of membership of two fuzzy sets H/L. While Fuzzy MDR demonstrated higher power than that of MDR, its performance is highly dependent on the several tuning parameters. In real applications, it is not easy to choose appropriate tuning parameter values.

RESULT

In this work, we propose an empirical fuzzy MDR (EF-MDR) which does not require specifying tuning parameters values. Here, we propose an empirical approach to estimating the membership degree that can be directly estimated from the data. In EF-MDR, the membership degree is estimated by the maximum likelihood estimator of the proportion of cases(controls) in each genotype combination. We also show that the balanced accuracy measure derived from this new membership function is a linear function of the standard chi-square statistics. This relationship allows us to perform the standard significance test using p-values in the MDR framework without permutation. Through two simulation studies, the power of the proposed EF-MDR is shown to be higher than those of MDR and Fuzzy MDR. We illustrate the proposed EF-MDR by analyzing Crohn's disease (CD) and bipolar disorder (BD) in the Wellcome Trust Case Control Consortium (WTCCC) dataset.

CONCLUSION

We propose an empirical Fuzzy MDR for detecting GGI using the maximum likelihood of the proportion of cases(controls) as the membership degree of the genotype combination. The program written in R for EF-MDR is available at http://statgen.snu.ac.kr/software/EF-MDR .

Collapse

Lin H, Mueller-Nurasyid M, Smith AV, Arking DE, Barnard J, Bartz TM, Lunetta KL, Lohman K, Kleber ME, Lubitz SA, Geelhoed B, Trompet S, Niemeijer MN, Kacprowski T, Chasman DI, Klarin D, Sinner MF, Waldenberger M, Meitinger T, Harris TB, Launer LJ, Soliman EZ, Chen LY, Smith JD, Van Wagoner DR, Rotter JI, Psaty BM, Xie Z, Hendricks AE, Ding J, Delgado GE, Verweij N, van der Harst P, Macfarlane PW, Ford I, Hofman A, Uitterlinden A, Heeringa J, Franco OH, Kors JA, Weiss S, Völzke H, Rose LM, Natarajan P, Kathiresan S, Kääb S, Gudnason V, Alonso A, Chung MK, Heckbert SR, Benjamin EJ, Liu Y, März W, Rienstra M, Jukema JW, Stricker BH, Dörr M, Albert CM, Ellinor PT. Gene-gene Interaction Analyses for Atrial Fibrillation. Sci Rep 2016;6:35371. [PMID: 27824142 PMCID: PMC5099695 DOI: 10.1038/srep35371] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Accepted: 09/28/2016] [Indexed: 11/29/2022] Open

Affiliation(s)

Honghuang Lin National Heart Lung and Blood Institute's and Boston University's Framingham Heart Study, Framingham, MA, USA.,Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
Martina Mueller-Nurasyid Institute of Genetic Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.,Department of Medicine I, Ludwig-Maximilians-University Munich, Munich, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany
Albert V Smith Icelandic Heart Association, Kopavogur, Iceland.,Faculty of Medicine, University of Iceland, Reykjavik, Iceland
Dan E Arking McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
John Barnard Cleveland Clinic, Cleveland, OH, USA
Traci M Bartz Department of Biostatistics, University of Washington, Seattle, WA, USA
Kathryn L Lunetta National Heart Lung and Blood Institute's and Boston University's Framingham Heart Study, Framingham, MA, USA.,Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Kurt Lohman Department of Biostatistical Sciences, Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
Marcus E Kleber Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany
Steven A Lubitz Cardiac Arrhythmia Service, Massachusetts General Hospital, Boston, MA, USA.,Harvard Medical School, Boston, MA, USA
Bastiaan Geelhoed Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
Stella Trompet Department of Cardiology, Leiden University Medical Center, the Netherlands.,Department of Gerontology and Geriatrics, Leiden University Medical Center, Leiden, the Netherlands
Maartje N Niemeijer Department of Epidemiology, Erasmus MC - University Medical Center Rotterdam, Rotterdam, the Netherlands
Tim Kacprowski Department of Functional Genomics, Interfaculty Institute for Genetics and Functional Genomics, University Medicine and Ernst-Moritz-Arndt University Greifswald, Greifswald, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany
Daniel I Chasman Division of Preventive Medicine, Brigham and Women's Hospital, Boston MA, USA
Derek Klarin Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA.,Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA, USA.,Department of Surgery, Massachusetts General Hospital, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
Moritz F Sinner Department of Medicine I, Ludwig-Maximilians-University Munich, Munich, Germany
Melanie Waldenberger DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany.,Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany.,Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Thomas Meitinger DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany.,Institute of Human Genetics, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany.,Institute of Human Genetics, Technische Universität München, Munich, Germany
Tamara B Harris National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
Lenore J Launer National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
Elsayed Z Soliman Epidemiological Cardiology Research Center, Wake Forest School of Medicine, Winston Salem, NC, USA
Lin Y Chen Cardiovascular Division, Department of Medicine, University of Minnesota Medical School, Minneapolis, MN, USA
Jonathan D Smith Cleveland Clinic, Cleveland, OH, USA
David R Van Wagoner Cleveland Clinic, Cleveland, OH, USA
Jerome I Rotter Institute for Translational Genomics and Population Sciences (J.I.R.), Departments of Pediatrics and Medicine, LABioMed at Harbor-UCLA Medical Center, Torrance, CA, USA
Bruce M Psaty Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology and Health Services, University of Washington, Seattle, WA, USA.,Group Health Research Institute, Group Health Cooperative, Seattle, WA, USA
Zhijun Xie Section of Computational Biomedicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA
Audrey E Hendricks National Heart Lung and Blood Institute's and Boston University's Framingham Heart Study, Framingham, MA, USA.,Mathematical and Statistical Sciences, University of Colorado, Denver, Denver, CO, USA
Jingzhong Ding Department of Gerontology and Geriatric Medicine, Wake Forest School of Medicine, Winston-Salem, NC, USA
Graciela E Delgado Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany
Niek Verweij Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
Pim van der Harst Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
Peter W Macfarlane Institute of Health and Wellbeing, College of Veterinary, Medical and Life Sciences, University of Glasgow, United Kingdom
Ian Ford Robertson Center for Biostatistics, University of Glasgow, United Kingdom
Albert Hofman Department of Epidemiology, Erasmus MC - University Medical Center Rotterdam, Rotterdam, the Netherlands
André Uitterlinden Department of Epidemiology &Internal Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, the Netherlands
Jan Heeringa Department of Epidemiology, Erasmus MC - University Medical Center Rotterdam, Rotterdam, the Netherlands
Oscar H Franco Department of Epidemiology, Erasmus MC - University Medical Center Rotterdam, Rotterdam, the Netherlands
Jan A Kors Department of Medical Informatics, Erasmus MC - University Medical Center Rotterdam, the Netherlands
Stefan Weiss Department of Functional Genomics, Interfaculty Institute for Genetics and Functional Genomics, University Medicine and Ernst-Moritz-Arndt University Greifswald, Greifswald, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany
Henry Völzke DZHK (German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany.,Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
Lynda M Rose Division of Preventive Medicine, Brigham and Women's Hospital, Boston MA, USA
Pradeep Natarajan Harvard Medical School, Boston, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA.,Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
Sekar Kathiresan Harvard Medical School, Boston, MA, USA.,Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA.,Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute, Cambridge, MA, USA
Stefan Kääb Department of Medicine I, Ludwig-Maximilians-University Munich, Munich, Germany.,DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany
Vilmundur Gudnason Icelandic Heart Association, Kopavogur, Iceland.,Faculty of Medicine, University of Iceland, Reykjavik, Iceland
Alvaro Alonso Department of Epidemiology, Rollins School of Public Health, Emory University, Atlanta, GA, USA
Mina K Chung Cleveland Clinic, Cleveland, OH, USA
Susan R Heckbert Group Health Research Institute, Group Health Cooperative, Seattle, WA, USA.,Department of Epidemiology, Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
Emelia J Benjamin National Heart Lung and Blood Institute's and Boston University's Framingham Heart Study, Framingham, MA, USA.,Section of Cardiovascular Medicine and Preventive Medicine, Department of Medicine, Boston University School of Medicine, Boston, MA, USA.,Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA
Yongmei Liu Department of Epidemiology &Prevention, Public Health Sciences, Wake Forest School of Medicine, Winston-Salem, NC, USA
Winfried März Synlab Academy, Synlab Services, GmbH P5,7, 68161 Mannheim, Germany.,Clinical Institute of Medical and Chemical Laboratory Diagnostics, Medical University of Graz, Graz, Austria.,Medical Clinic V (Nephrology, Hypertensiology, Rheumatology, Endocrinology, Diabetology), Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany
Michiel Rienstra Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
J Wouter Jukema Department of Cardiology, Leiden University Medical Center, the Netherlands
Bruno H Stricker Department of Epidemiology &Internal Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, the Netherlands.,Inspectorate of Health Care, Utrecht, the Netherlands
Marcus Dörr DZHK (German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany.,Department of Internal Medicine B, University Medicine Greifswald, Greifswald, Germany
Christine M Albert Division of Preventive Medicine, Brigham and Women's Hospital, Boston MA, USA
Patrick T Ellinor Harvard Medical School, Boston, MA, USA

Collapse

Kodama K, Saigo H. KDSNP: A kernel-based approach to detecting high-order SNP interactions. J Bioinform Comput Biol 2016;14:1644003. [DOI: 10.1142/s0219720016440030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Li M, Wei C, Wen Y, Wang T, Lu Q. Detecting Gene-Gene Interactions Associated with Multiple Complex Traits with U-Statistics. Curr Genomics 2016;17:403-415. [PMID: 28479869 PMCID: PMC5320542 DOI: 10.2174/1389202917666160513100946] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Revised: 05/26/2015] [Accepted: 06/06/2015] [Indexed: 12/02/2022] Open

Chen Q, Mao X, Zhang Z, Zhu R, Yin Z, Leng Y, Yu H, Jia H, Jiang S, Ni Z, Jiang H, Han X, Liu C, Hu Z, Wu X, Hu G, Xin D, Qi Z. SNP-SNP Interaction Analysis on Soybean Oil Content under Multi-Environments. PLoS One 2016;11:e0163692. [PMID: 27668866 PMCID: PMC5036806 DOI: 10.1371/journal.pone.0163692] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 09/13/2016] [Indexed: 11/22/2022] Open

Affiliation(s)

Qingshan Chen College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Xinrui Mao College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Zhanguo Zhang College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Rongsheng Zhu College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Zhengong Yin College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China Crop Breeding Institute, Heilongjiang Academy of Agricultural Sciences, Harbin, 150086, Heilongjiang, People’s Republic of China
Yue Leng College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Hongxiao Yu College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Huiying Jia College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Shanshan Jiang College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Zhongqiu Ni College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Hongwei Jiang The Crop Research and Breeding Center of Land-Reclamation of Heilongjiang Province, Harbin, 150090, Heilongjiang, People’s Republic of China
Xue Han The Crop Research and Breeding Center of Land-Reclamation of Heilongjiang Province, Harbin, 150090, Heilongjiang, People’s Republic of China
Chunyan Liu The Crop Research and Breeding Center of Land-Reclamation of Heilongjiang Province, Harbin, 150090, Heilongjiang, People’s Republic of China
Zhenbang Hu College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Xiaoxia Wu College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China
Guohua Hu The Crop Research and Breeding Center of Land-Reclamation of Heilongjiang Province, Harbin, 150090, Heilongjiang, People’s Republic of China
Dawei Xin College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China * E-mail: (DX); (ZQ)
Zhaoming Qi College of Agriculture, Soybean biology Key Laboratory of the Ministry of Education, Northeast Agricultural University, Harbin, 150030, Heilongjiang, People’s Republic of China * E-mail: (DX); (ZQ)

Collapse

Simon PHG, Sylvestre MP, Tremblay J, Hamet P. Key Considerations and Methods in the Study of Gene-Environment Interactions. Am J Hypertens 2016;29:891-9. [PMID: 27037711 DOI: 10.1093/ajh/hpw021] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 02/08/2016] [Indexed: 12/16/2022] Open

Evaluation of associative classification-based multifactor dimensionality reduction in the presence of noise. ACTA ACUST UNITED AC 2016. [DOI: 10.1007/s13721-016-0114-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

A forest-based feature screening approach for large-scale genome data with complex structures. BMC Genet 2015;16:148. [PMID: 26698561 PMCID: PMC4690313 DOI: 10.1186/s12863-015-0294-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2015] [Accepted: 11/13/2015] [Indexed: 01/06/2023] Open

Abstract

Background

Genome-wide association studies (GWAS) interrogate large-scale whole genome to characterize the complex genetic architecture for biomedical traits. When the number of SNPs dramatically increases to half million but the sample size is still limited to thousands, the traditional p-value based statistical approaches suffer from unprecedented limitations. Feature screening has proved to be an effective and powerful approach to handle ultrahigh dimensional data statistically, yet it has not received much attention in GWAS. Feature screening reduces the feature space from millions to hundreds by removing non-informative noise. However, the univariate measures used to rank features are mainly based on individual effect without considering the mutual interactions with other features. In this article, we explore the performance of a random forest (RF) based feature screening procedure to emphasize the SNPs that have complex effects for a continuous phenotype.

Results

Both simulation and real data analysis are conducted to examine the power of the forest-based feature screening. We compare it with five other popular feature screening approaches via simulation and conclude that RF can serve as a decent feature screening tool to accommodate complex genetic effects such as nonlinear, interactive, correlative, and joint effects. Unlike the traditional p-value based Manhattan plot, we use the Permutation Variable Importance Measure (PVIM) to display the relative significance and believe that it will provide as much useful information as the traditional plot.

Conclusion

Most complex traits are found to be regulated by epistatic and polygenic variants. The forest-based feature screening is proven to be an efficient, easily implemented, and accurate approach to cope whole genome data with complex structures. Our explorations should add to a growing body of enlargement of feature screening better serving the demands of contemporary genome data.

Collapse

Sapin E, Keedwell E, Frayling T. Ant colony optimisation of decision tree and contingency table models for the discovery of gene-gene interactions. IET Syst Biol 2015;9:218-25. [PMID: 26577156 PMCID: PMC8687348 DOI: 10.1049/iet-syb.2015.0017] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Revised: 05/15/2015] [Accepted: 05/31/2015] [Indexed: 11/20/2022] Open

Kullo IJ, Leeper NJ. The genetic basis of peripheral arterial disease: current knowledge, challenges, and future directions. Circ Res 2015;116:1551-60. [PMID: 25908728 DOI: 10.1161/circresaha.116.303518] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Rule-based analysis for detecting epistasis using associative classification mining. ACTA ACUST UNITED AC 2015. [DOI: 10.1007/s13721-015-0084-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Gao H, Wu Y, Li J, Li H, Li J, Yang R. Forward LASSO analysis for high-order interactions in genome-wide association study. Brief Bioinform 2015;15:552-61. [PMID: 23775311 DOI: 10.1093/bib/bbt037] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Ding X, Wang J, Zelikovsky A, Guo X, Xie M, Pan Y. Searching High-Order SNP Combinations for Complex Diseases Based on Energy Distribution Difference. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015;12:695-704. [PMID: 26357280 DOI: 10.1109/tcbb.2014.2363459] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Su L, Liu G, Wang H, Tian Y, Zhou Z, Han L, Yan L. Research on single nucleotide polymorphisms interaction detection from network perspective. PLoS One 2015;10:e0119146. [PMID: 25763929 PMCID: PMC4357495 DOI: 10.1371/journal.pone.0119146] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 01/09/2015] [Indexed: 12/02/2022] Open

Broer L, Buchman AS, Deelen J, Evans DS, Faul JD, Lunetta KL, Sebastiani P, Smith JA, Smith AV, Tanaka T, Yu L, Arnold AM, Aspelund T, Benjamin EJ, De Jager PL, Eirkisdottir G, Evans DA, Garcia ME, Hofman A, Kaplan RC, Kardia SLR, Kiel DP, Oostra BA, Orwoll ES, Parimi N, Psaty BM, Rivadeneira F, Rotter JI, Seshadri S, Singleton A, Tiemeier H, Uitterlinden AG, Zhao W, Bandinelli S, Bennett DA, Ferrucci L, Gudnason V, Harris TB, Karasik D, Launer LJ, Perls TT, Slagboom PE, Tranah GJ, Weir DR, Newman AB, van Duijn CM, Murabito JM. GWAS of longevity in CHARGE consortium confirms APOE and FOXO3 candidacy. J Gerontol A Biol Sci Med Sci 2015;70:110-8. [PMID: 25199915 PMCID: PMC4296168 DOI: 10.1093/gerona/glu166] [Citation(s) in RCA: 204] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2014] [Accepted: 08/07/2014] [Indexed: 01/08/2023] Open

Affiliation(s)

Linda Broer Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands. Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands. Department of Internal Medicine, Erasmus MC, Rotterdam, The Netherlands
Aron S Buchman Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, Illinois
Joris Deelen Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands. Department of Molecular Epidemiology, Leiden University Medical Center, The Netherlands
Daniel S Evans California Pacific Medical Center Research Institute, San Francisco
Jessica D Faul Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor
Kathryn L Lunetta Department of Biostatistics, Boston University School of Public Health, Massachusetts. NHLBI's and Boston Univesity's Framingham Heart Study, Massachusetts
Paola Sebastiani Department of Biostatistics, Boston University School of Public Health, Massachusetts
Jennifer A Smith Department of Epidemiology, University of Michigan, Ann Arbor
Albert V Smith Icelandic Heart Association, Kopavogur, Iceland. Department of Medicine, University of Iceland, Reykjavik
Toshiko Tanaka Translational Gerontology Branch, National Institute on Aging, Baltimore, Maryland
Lei Yu Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, Illinois
Alice M Arnold Department of Biostatistics, University of Washington, Seattle
Thor Aspelund Icelandic Heart Association, Kopavogur, Iceland. Department of Medicine, University of Iceland, Reykjavik
Emelia J Benjamin NHLBI's and Boston Univesity's Framingham Heart Study, Massachusetts. Department of Medicine, Sections of Preventive Medicine and Cardiology, Boston University School of Medicine, Massachusetts. Department of Epidemiology, Boston University School of Public Health, Massachusetts
Philip L De Jager Program in Translational NeuroPsychiatric Genomics, Institute for the Neurosciences, Departments of Neurology and Psychiatry, Brigham and Women's Hospital, Boston, Massachusetts. Harvard Medical School, Boston, Massachusetts. Program in Medical and Population Genetics, Broad Institute, Cambridge, Massachusetts
Gudny Eirkisdottir Icelandic Heart Association, Kopavogur, Iceland
Denis A Evans Rush Institute for Healthy Aging and Department of Internal Medicine, Rush University Medical Center, Chicago, Illinois
Melissa E Garcia Laboratory of Epidemiology and Population Sciences, National Institute on Aging, Bethesda, Maryland
Albert Hofman Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands. Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands
Robert C Kaplan Department of Epidemiology and Population Health, Albert Einstein College, Bronx, New York
Sharon L R Kardia Department of Epidemiology, University of Michigan, Ann Arbor
Douglas P Kiel Harvard Medical School, Boston, Massachusetts. Institute for Aging Research, Hebrew SeniorLife, Harvard Medical School Department of Medicine, Boston, Massachusetts. Division of Gerontology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts
Ben A Oostra Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands. Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands
Eric S Orwoll School of Medicine, Oregon Health and Science University, Portland
Neeta Parimi California Pacific Medical Center Research Institute, San Francisco
Bruce M Psaty Department of Medicine, University of Washington, Seattle. Deparment of Epidemiology, University of Washington, Seattle. Department of Health Services, University of Washington, Seattle. Group Health Research Institute, Group Health Cooperative, Seattle, Washington
Fernando Rivadeneira Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands. Institute for Aging Research, Hebrew SeniorLife, Harvard Medical School Department of Medicine, Boston, Massachusetts
Jerome I Rotter Institute for Translational Genomics and Population Sciences, Los Angeles Biomedical Research Institute and Department of Pediatrics, Harbor-UCLA Medical Center, Torrance, California
Sudha Seshadri Department of Biostatistics, Boston University School of Public Health, Massachusetts. Department of Neurology, Boston University School of Medicine, Massachusetts
Andrew Singleton Laboratory of Neurogenetics, National Institute on Aging, Bethesda, Maryland
Henning Tiemeier Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands. Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands. Department of Child and Adolescent Psychiatry, Erasmus MC and Sophia Children's Hospital, Rotterdam, The Netherlands
André G Uitterlinden Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands. Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands. Department of Internal Medicine, Erasmus MC, Rotterdam, The Netherlands
Wei Zhao Department of Epidemiology, University of Michigan, Ann Arbor
Stefania Bandinelli Geriatric Unit, Azienda Sanitaria Firenze, Florence, Italy
David A Bennett Rush Alzheimer's Disease Center, Rush University Medical Center, Chicago, Illinois
Luigi Ferrucci Translational Gerontology Branch, National Institute on Aging, Baltimore, Maryland
Vilmundur Gudnason Icelandic Heart Association, Kopavogur, Iceland. Department of Medicine, University of Iceland, Reykjavik
Tamara B Harris Laboratory of Epidemiology and Population Sciences, National Institute on Aging, Bethesda, Maryland
David Karasik Institute for Aging Research, Hebrew SeniorLife, Harvard Medical School Department of Medicine, Boston, Massachusetts. Faculty of Medicine in The Galilee, Bar-Ilan University, Safed, Israel
Lenore J Launer Laboratory of Epidemiology and Population Sciences, National Institute on Aging, Bethesda, Maryland
Thomas T Perls Section of Geriatrics, Boston University School of Medicine and Boston Medical Center, Massachusetts
P Eline Slagboom Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands. Department of Molecular Epidemiology, Leiden University Medical Center, The Netherlands
Gregory J Tranah California Pacific Medical Center Research Institute, San Francisco. Department of Epidemiology and Biostatistics, University of California, San Francisco
David R Weir Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor
Anne B Newman Department of Epidemiology, University of Pittsburgh, Pennsylvania. *These authors contributed equally to this work
Cornelia M van Duijn Department of Epidemiology, Erasmus MC, Rotterdam, The Netherlands. Netherlands Consortium for Healthy Ageing, Leiden University Medical Center, The Netherlands. *These authors contributed equally to this work
Joanne M Murabito NHLBI's and Boston Univesity's Framingham Heart Study, Massachusetts. Department of Medicine, Section of General Internal Medicine, Boston University School of Medicine, Massachusetts. *These authors contributed equally to this work.

Collapse

Xu HM, Sun XW, Qi T, Lin WY, Liu N, Lou XY. Multivariate dimensionality reduction approaches to identify gene-gene and gene-environment interactions underlying multiple complex traits. PLoS One 2014;9:e108103. [PMID: 25259584 PMCID: PMC4178067 DOI: 10.1371/journal.pone.0108103] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Accepted: 08/18/2014] [Indexed: 11/30/2022] Open

Gusareva ES, Van Steen K. Practical aspects of genome-wide association interaction analysis. Hum Genet 2014;133:1343-58. [DOI: 10.1007/s00439-014-1480-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 08/18/2014] [Indexed: 12/31/2022]

Detecting epistatic interactions in metagenome-wide association studies by metaBOOST. BIOMED RESEARCH INTERNATIONAL 2014;2014:398147. [PMID: 25165702 PMCID: PMC4131565 DOI: 10.1155/2014/398147] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2014] [Accepted: 07/14/2014] [Indexed: 01/27/2023]

Zhang Q, Long Q, Ott J. AprioriGWAS, a new pattern mining strategy for detecting genetic variants associated with disease through interaction effects. PLoS Comput Biol 2014;10:e1003627. [PMID: 24901472 PMCID: PMC4046917 DOI: 10.1371/journal.pcbi.1003627] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 04/01/2014] [Indexed: 12/11/2022] Open

Abstract

Identifying gene-gene interaction is a hot topic in genome wide association studies. Two fundamental challenges are: (1) how to smartly identify combinations of variants that may be associated with the trait from astronomical number of all possible combinations; and (2) how to test epistatic interaction when all potential combinations are available. We developed AprioriGWAS, which brings two innovations. (1) Based on Apriori, a successful method in field of Frequent Itemset Mining (FIM) in which a pattern growth strategy is leveraged to effectively and accurately reduce search space, AprioriGWAS can efficiently identify genetically associated genotype patterns. (2) To test the hypotheses of epistasis, we adopt a new conditional permutation procedure to obtain reliable statistical inference of Pearson's chi-square test for the contingency table generated by associated variants. By applying AprioriGWAS to age-related macular degeneration (AMD) data, we found that: (1) angiopoietin 1 (ANGPT1) and four retinal genes interact with Complement Factor H (CFH). (2) GO term “glycosaminoglycan biosynthetic process” was enriched in AMD interacting genes. The epistatic interactions newly found by AprioriGWAS on AMD data are likely true interactions, since genes interacting with CFH are retinal genes, and GO term enrichment also verified that interaction between glycosaminoglycans (GAGs) and CFH plays an important role in disease pathology of AMD. By applying AprioriGWAS on Bipolar disorder in WTCCC data, we found variants without marginal effect show significant interactions. For example, multiple-SNP genotype patterns inside gene GABRB2 and GRIA1 (AMPA subunit 1 receptor gene). AMPARs are found in many parts of the brain and are the most commonly found receptor in the nervous system. The GABRB2 mediates the fastest inhibitory synaptic transmission in the central nervous system. GRIA1 and GABRB2 are relevant to mental disorders supported by multiple evidences.

Genes do not operate in vacuum. They interact with each other in many ways. Therefore, to figure out genetic causes of disease by case-control association studies, it is important to take interactions into account. There are two fundamental challenges in interaction-focused analysis. The first is the number of possible combinations of genetic variants easily goes to astronomic which is beyond current computational facility, which is referred as “the curse of dimensionality” in field of computer science. The other is, even if all potential combinations could be exhaustively checked, genuine signals are likely to be buried by false positives that are composed of single variant with large main effect and some other irrelevant variant. In this work, we propose AprioriGWAS that employees Apriori, an algorithm that pioneers the branch of “Frequent Itemset Mining” in computer science to cope with daunting numbers of combinations, and conditional permutation, to enable real signals standing out. By applying AprioriGWAS to age-related macular degeneration (AMD) data and bipolar disorder (BD) in WTCCC data, we found interesting interactions between sensible genes in terms of disease. Consequently, AprioriGWAS could be a good tool to find epistasis interaction from GWA data.

Collapse