Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cattaert T, Calle ML, Dudek SM, Mahachie John JM, Van Lishout F, Urrea V, Ritchie MD, Van Steen K. Model-based multifactor dimensionality reduction for detecting epistasis in case-control data in the presence of noise. Ann Hum Genet 2010;75:78-89. [PMID: 21158747 DOI: 10.1111/j.1469-1809.2010.00604.x] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Cited by Other Article(s)

Veyssiere M, Rodriguez Ordonez MDP, Chalabi S, Michou L, Cornelis F, Boland A, Olaso R, Deleuze JF, Petit-Teixeira E, Chaudru V. MYLK*FLNB and DOCK1*LAMA2 gene-gene interactions associated with rheumatoid arthritis in the focal adhesion pathway. Front Genet 2024;15:1375036. [PMID: 38803542 PMCID: PMC11128622 DOI: 10.3389/fgene.2024.1375036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 04/18/2024] [Indexed: 05/29/2024] Open

Balunathan N, Rani G U, Perumal V, Kumarasamy P. Single nucleotide polymorphisms of Interleukin - 4, Interleukin-18, FCRL3 and sPLA2IIa genes and their association in pathogenesis of endometriosis. Mol Biol Rep 2023;50:4239-4252. [PMID: 36905404 DOI: 10.1007/s11033-023-08316-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Accepted: 01/31/2023] [Indexed: 03/12/2023]

Abstract

BACKGROUND

Endometriosis is a complex gynaecological disorder that contributes to infertility, dysmenorrhea, dyspareunia, and other chronic issues. It is a multifactorial disease involving genetic, hormonal, immunological and environmental components. Endometriosis's pathogenesis remains unclear.

AIM OF THE STUDY

The aim was to analyse polymorphisms in the Interleukin-4, Interleukin-18, FCRL3 and sPLA2IIa genes and to identify any significant association with the risk of endometriosis.

MATERIAL AND METHODS

This study evaluated the -590 C/T polymorphism in the interleukin-4 (IL-4) gene, C607A in the interleukin-18 (IL-18) gene, -169T > C in the FCRL3 gene and 763 C > G in the sPLA2IIa gene in women with endometriosis. The case-control study included 150 women with endometriosis and 150 apparently healthy women as control subjects. DNA was extracted from peripheral blood leukocytes and endometriotic tissue of cases and from blood samples of controls, amplified by PCR and sequenced to determine the alleles and genotypes of the subjects. The relationship between the gene polymorphisms and endometriosis was then analysed. To evaluate the association of the different genotypes, 95% confidence intervals (CI) were calculated.

RESULTS

Interleukin-18 and FCRL3 gene polymorphisms in endometriotic tissue and blood samples of endometriosis cases showed significant associations (OR = 4.88 [95% CI = 2.31-10.30], P < 0.0001; OR = 4.00 [95% CI = 2.2-7.33], P < 0.0001) when compared with normal blood samples. However, there was no significant difference in Interleukin-4 and sPLA2IIa gene polymorphisms between control women and patients with endometriosis.

CONCLUSIONS

The present study suggests that the IL-18 and FCRL3 gene polymorphisms are associated with a higher risk for endometriosis, which delivers valuable knowledge of endometriosis's pathogenesis. However, a larger sample size of patients from various ethnic backgrounds is necessary to evaluate whether these alleles have a direct effect on disease susceptibility.
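As a rough plausibility check on results of this kind, a two-sided P value can be approximately recovered from a reported odds ratio and its 95% confidence interval, assuming the interval is a standard Wald interval on the log-odds scale. The sketch below is illustrative only; the function name `p_from_or_ci` is hypothetical and the abstract does not state which test the authors used.

```python
import math

def p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Approximate Wald z statistic and two-sided P value from an OR and its CI.

    Assumes the CI was built as exp(log(OR) +/- z_crit * SE), which is an
    assumption about the reporting convention, not a fact from the abstract.
    """
    log_or = math.log(odds_ratio)
    # Width of the CI on the log scale is 2 * z_crit * SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = log_or / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p

# The two odds ratios quoted in the RESULTS paragraph of the abstract.
z1, p1 = p_from_or_ci(4.88, 2.31, 10.30)
z2, p2 = p_from_or_ci(4.00, 2.2, 7.33)
print(f"OR 4.88: z = {z1:.2f}, p = {p1:.2g}")
print(f"OR 4.00: z = {z2:.2f}, p = {p2:.2g}")
```

Because the reported interval limits are rounded, the recovered values are approximate; a confidence interval that excludes 1 nevertheless always corresponds to P < 0.05 under this assumption.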

Sha Z, Chen Y, Hu T. NSPA: characterizing the disease association of multiple genetic interactions at single-subject resolution. BIOINFORMATICS ADVANCES 2023;3:vbad010. [PMID: 36818729 PMCID: PMC9927570 DOI: 10.1093/bioadv/vbad010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 01/02/2023] [Accepted: 02/02/2023] [Indexed: 02/10/2023]

Wang H, Wu X. IPP: An Intelligent Privacy-Preserving Scheme for Detecting Interactions in Genome Association Studies. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:455-464. [PMID: 35239492 DOI: 10.1109/tcbb.2022.3155774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Saha S, Perrin L, Röder L, Brun C, Spinelli L. Epi-MEIF: detecting higher order epistatic interactions for complex traits using mixed effect conditional inference forests. Nucleic Acids Res 2022;50:e114. [PMID: 36107776 PMCID: PMC9639209 DOI: 10.1093/nar/gkac715] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 07/29/2022] [Accepted: 09/12/2022] [Indexed: 12/04/2022] Open

Missing Causality and Heritability of Autoimmune Hepatitis. Dig Dis Sci 2022;68:1585-1604. [PMID: 36261672 DOI: 10.1007/s10620-022-07728-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/27/2022] [Accepted: 10/10/2022] [Indexed: 12/09/2022]

Abstract

BACKGROUND

Autoimmune hepatitis has an unknown cause and genetic associations that are not disease-specific or always present. Clarification of its missing causality and heritability could improve prevention and management strategies.

AIMS

Describe the key epigenetic and genetic mechanisms that could account for missing causality and heritability in autoimmune hepatitis; indicate the prospects of these mechanisms as pivotal factors; and encourage investigations of their pathogenic role and therapeutic potential.

METHODS

English abstracts were identified in PubMed using multiple key search phrases. Several hundred abstracts and 210 full-length articles were reviewed.

RESULTS

Environmental induction of epigenetic changes is the prime candidate for explaining the missing causality of autoimmune hepatitis. Environmental factors (diet, toxic exposures) can alter chromatin structure and the production of micro-ribonucleic acids that affect gene expression. Epistatic interaction between unsuspected genes is the prime candidate for explaining the missing heritability. The non-additive, interactive effects of multiple genes could enhance their impact on the propensity and phenotype of autoimmune hepatitis. Transgenerational inheritance of acquired epigenetic marks constitutes another mechanism of transmitting parental adaptations that could affect susceptibility. Management strategies could range from lifestyle adjustments and nutritional supplements to precision editing of the epigenetic landscape.

CONCLUSIONS

Autoimmune hepatitis has a missing causality that might be explained by epigenetic changes induced by environmental factors and a missing heritability that might reflect epistatic gene interactions or transgenerational transmission of acquired epigenetic marks. These unassessed or under-evaluated areas warrant investigation.

Walakira A, Ocira J, Duroux D, Fouladi R, Moškon M, Rozman D, Van Steen K. Detecting gene-gene interactions from GWAS using diffusion kernel principal components. BMC Bioinformatics 2022;23:57. [PMID: 35105309 PMCID: PMC8805268 DOI: 10.1186/s12859-022-04580-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 01/18/2022] [Indexed: 11/10/2022] Open

Association between gene expression levels of GDF9 and BMP15 and clinicopathological factors in the prognosis of female infertility in northeast Indian populations. Meta Gene 2021. [DOI: 10.1016/j.mgene.2021.100964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Abegaz F, Van Lishout F, Mahachie John JM, Chiachoompu K, Bhardwaj A, Duroux D, Gusareva ES, Wei Z, Hakonarson H, Van Steen K. Performance of model-based multifactor dimensionality reduction methods for epistasis detection by controlling population structure. BioData Min 2021;14:16. [PMID: 33608043 PMCID: PMC7893746 DOI: 10.1186/s13040-021-00247-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Accepted: 02/07/2021] [Indexed: 12/15/2022] Open

Abegaz F, Chaichoompu K, Génin E, Fardo DW, König IR, Mahachie John JM, Van Steen K. Principals about principal components in statistical genetics. Brief Bioinform 2020;20:2200-2216. [PMID: 30219892 DOI: 10.1093/bib/bby081] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 07/21/2018] [Accepted: 08/12/2018] [Indexed: 12/13/2022] Open

Riahi P, Kazemnejad A, Mostafaei S, Meguro A, Mizuki N, Ashraf-Ganjouei A, Javinani A, Faezi ST, Shahram F, Mahmoudi M. ERAP1 polymorphisms interactions and their association with Behçet's disease susceptibly: Application of Model-Based Multifactor Dimension Reduction Algorithm (MB-MDR). PLoS One 2020;15:e0227997. [PMID: 32023277 PMCID: PMC7001967 DOI: 10.1371/journal.pone.0227997] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Accepted: 01/03/2020] [Indexed: 12/15/2022] Open

Chattopadhyay A, Lu TP. Gene-gene interaction: the curse of dimensionality. ANNALS OF TRANSLATIONAL MEDICINE 2019;7:813. [PMID: 32042829 DOI: 10.21037/atm.2019.12.87] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Abstract

Identified genetic variants from genome wide association studies frequently show only modest effects on the disease risk, leading to the "missing heritability" problem. One avenue to account for part of this "missingness" is to evaluate gene-gene interactions (epistasis), thereby elucidating their effect on complex diseases. This can potentially help with identifying gene functions, pathways, and drug targets. However, the exhaustive evaluation of all possible genetic interactions among millions of single nucleotide polymorphisms (SNPs) raises several issues, otherwise known as the "curse of dimensionality". The dimensionality involved in the epistatic analysis of such exponentially growing SNPs diminishes the usefulness of traditional, parametric statistical methods. The immense popularity of multifactor dimensionality reduction (MDR), a non-parametric method proposed in 2001 that classifies multi-dimensional genotypes into one-dimensional binary approaches, led to the emergence of a fast-growing collection of methods based on the MDR approach. Moreover, machine-learning (ML) methods such as random forests and neural networks (NNs), deep-learning (DL) approaches, and hybrid approaches have also been applied profusely in recent years to tackle this dimensionality issue associated with whole genome gene-gene interaction studies. However, exhaustive searching in MDR based approaches or variable selection in ML methods still poses the risk of missing out on relevant SNPs. Furthermore, interpretability issues are a major hindrance for DL methods. To minimize this loss of information, Python based tools such as PySpark can potentially take advantage of distributed computing resources in the cloud, to bring back smaller subsets of data for further local analysis. Parallel computing can be a powerful resource that stands to fight this "curse". PySpark supports all standard Python libraries and C extensions, thus making it convenient to write code that delivers dramatic improvements in processing speed for extraordinarily large sets of data.
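The core MDR step described above, collapsing multi-locus genotype cells into a single binary high-risk/low-risk attribute, can be sketched in a few lines. This is a minimal illustration of the classic pooling rule for one SNP pair, assuming genotypes coded 0/1/2 and a case:control ratio threshold; the function name and toy data are hypothetical, and cross-validation and permutation testing are omitted.

```python
from collections import Counter

def mdr_risk_labels(snp_a, snp_b, is_case, threshold=None):
    """Label each two-locus genotype cell 'H' (high risk) or 'L' (low risk).

    A cell is 'H' when its case:control ratio exceeds the threshold, which
    defaults to the overall case:control ratio (1.0 for balanced data).
    """
    n_cases = sum(1 for c in is_case if c)
    n_controls = len(is_case) - n_cases
    if n_cases == 0 or n_controls == 0:
        raise ValueError("need at least one case and one control")
    if threshold is None:
        threshold = n_cases / n_controls

    cases, controls = Counter(), Counter()
    for a, b, c in zip(snp_a, snp_b, is_case):
        (cases if c else controls)[(a, b)] += 1

    labels = {}
    for cell in set(cases) | set(controls):
        # A cell with cases but no controls has an unbounded ratio.
        ratio = cases[cell] / controls[cell] if controls[cell] else float("inf")
        labels[cell] = "H" if ratio > threshold else "L"
    return labels

# Toy data (hypothetical): 8 subjects, 4 cases and 4 controls.
snp_a = [0, 0, 0, 1, 1, 1, 2, 2]
snp_b = [0, 0, 0, 1, 1, 1, 2, 2]
is_case = [1, 1, 0, 0, 0, 1, 1, 0]
labels = mdr_risk_labels(snp_a, snp_b, is_case)
print(sorted(labels.items()))
```

Each subject is then replaced by the label of its cell, which is the one-dimensional attribute that MDR evaluates. An exhaustive scan repeats this for every SNP pair, which is where the dimensionality problem arises.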

Abegaz F, Van Lishout F, Mahachie John JM, Chiachoompu K, Bhardwaj A, Gusareva ES, Wei Z, Hakonarson H, Van Steen K. Epistasis Detection in Genome-Wide Screening for Complex Human Diseases in Structured Populations. SYSTEMS MEDICINE 2019. [DOI: 10.1089/sysm.2019.0003] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Joiret M, Mahachie John JM, Gusareva ES, Van Steen K. Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies. BioData Min 2019;12:11. [PMID: 31198442 PMCID: PMC6558841 DOI: 10.1186/s13040-019-0199-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Accepted: 05/09/2019] [Indexed: 01/07/2023] Open

Abstract

Background

In Genome-Wide Association Studies (GWAS), the concept of linkage disequilibrium is important as it allows identifying genetic markers that tag the actual causal variants. In Genome-Wide Association Interaction Studies (GWAIS), similar principles hold for pairs of causal variants. However, Linkage Disequilibrium (LD) may also interfere with the detection of genuine epistasis signals in that there may be complete confounding between Gametic Phase Disequilibrium (GPD) and interaction. GPD may involve unlinked genetic markers, even residing on different chromosomes. Often GPD is eliminated in GWAIS, via feature selection schemes or so-called pruning algorithms, to obtain unconfounded epistasis results. However, little is known about the optimal degree of GPD/LD-pruning that gives a balance between false positive control and sufficient power of epistasis detection statistics. Here, we focus on Model-Based Multifactor Dimensionality Reduction as one large-scale epistasis detection tool. Its performance has been thoroughly investigated in terms of false positive control and power, under a variety of scenarios involving different trait types and study designs, as well as error-free and noisy data, but never with respect to multicollinear SNPs.

Results

Using real-life human LD patterns from a homogeneous subpopulation of British ancestry, we investigated the impact of LD-pruning on the statistical sensitivity of MB-MDR. We considered three different non-fully penetrant epistasis models with varying effect sizes. There is a clear advantage in pre-analysis pruning using sliding windows at r² of 0.75 or lower, but using a threshold of 0.20 has a detrimental effect on the power to detect a functional interactive SNP pair (power < 25%). Signal sensitivity, directly using LD-block information to determine whether an epistasis signal is present or not, benefits from LD-pruning as well (average power across scenarios: 87%), but is largely hampered by functional loci residing at the boundaries of an LD-block.

Conclusions

Our results confirm that LD patterns and the position of causal variants in LD blocks do have an impact on epistasis detection, and that pruning strategies and LD-blocks definitions combined need careful attention, if we wish to maximize the power of large-scale epistasis screenings.
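To make the pruning idea concrete, the sketch below greedily drops any SNP whose r² with an already retained SNP inside a sliding window exceeds a threshold. It is an illustration under stated assumptions, not the authors' pipeline: r² is approximated by the squared Pearson correlation of 0/1/2 genotype dosages, the window is counted in SNP indices, and the names `r_squared` and `ld_prune` are hypothetical.

```python
def r_squared(x, y):
    """Squared Pearson correlation between two genotype dosage vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a monomorphic SNP carries no LD information
    return (sxy * sxy) / (sxx * syy)

def ld_prune(snps, window=50, r2_threshold=0.75):
    """Return indices of SNPs kept after greedy sliding-window pruning.

    snps is a list of dosage vectors ordered by genomic position. A SNP is
    kept only if its r^2 with every kept SNP fewer than `window` positions
    back is at or below the threshold.
    """
    kept = []
    for i, genotypes in enumerate(snps):
        recent = [j for j in kept if i - j < window]
        if all(r_squared(genotypes, snps[j]) <= r2_threshold for j in recent):
            kept.append(i)
    return kept

# Toy data (hypothetical): SNP 1 duplicates SNP 0; SNP 2 is uncorrelated with SNP 0.
snps = [
    [0, 1, 2, 0, 1, 2],
    [0, 1, 2, 0, 1, 2],
    [2, 0, 1, 1, 0, 2],
]
kept = ld_prune(snps, window=50, r2_threshold=0.75)
print(kept)
```

Lowering `r2_threshold` removes more SNPs. According to the abstract, thresholds around 0.75 or lower helped, whereas a threshold as strict as 0.20 reduced the power to detect a functional interacting pair.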

Van Steen K, Moore JH. How to increase our belief in discovered statistical interactions via large-scale association studies? Hum Genet 2019;138:293-305. [PMID: 30840129 PMCID: PMC6483943 DOI: 10.1007/s00439-019-01987-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Accepted: 02/20/2019] [Indexed: 12/31/2022]

Statistical methods for genome-wide association studies. Semin Cancer Biol 2019;55:53-60. [DOI: 10.1016/j.semcancer.2018.04.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2017] [Revised: 04/27/2018] [Accepted: 04/28/2018] [Indexed: 12/12/2022]

Statistical Modeling of Trivariate Static Systems: Isotonic Models. DATA 2019. [DOI: 10.3390/data4010017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Lee S, Son D, Kim Y, Yu W, Park T. Unified Cox model based multifactor dimensionality reduction method for gene-gene interaction analysis of the survival phenotype. BioData Min 2018;11:27. [PMID: 30564286 PMCID: PMC6295107 DOI: 10.1186/s13040-018-0189-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 11/26/2018] [Indexed: 12/04/2022] Open

Abstract

Background

One strategy for addressing missing heritability in genome-wide association studies is gene-gene interaction analysis, which, unlike a single gene approach, involves high dimensionality. The multifactor dimensionality reduction method (MDR) has been widely applied to reduce multi-levels of genotypes into high or low risk groups. The Cox-MDR method has been proposed to detect gene-gene interactions associated with the survival phenotype by using the martingale residuals from a Cox model. However, this method requires a cross-validation procedure to find the best SNP pair among all possible pairs, and a permutation procedure must follow to assess the significance of gene-gene interactions. Recently, the unified model based multifactor dimensionality reduction method (UM-MDR) has been proposed to unify the significance testing with the MDR algorithm within the regression model framework, in which neither cross-validation nor permutation testing is needed. In this paper, we propose a simple approach, called Cox UM-MDR, which combines Cox-MDR with the key procedure of UM-MDR to identify gene-gene interactions associated with the survival phenotype.

Results

A simulation study was performed to compare Cox UM-MDR with Cox-MDR with and without the marginal effects of SNPs. We found that Cox UM-MDR has similar power to Cox-MDR without marginal effects, whereas it outperforms Cox-MDR with marginal effects and is more robust to heavy censoring. We also applied Cox UM-MDR to a dataset of leukemia patients and detected gene-gene interactions with regard to the survival time.

Conclusion

Cox UM-MDR is easily implemented by combining Cox-MDR with UM-MDR to detect the significant gene-gene interactions associated with the survival time without cross-validation and permutation testing. The simulation results are shown to demonstrate the utility of the proposed method, which achieves at least the same power as Cox-MDR in most scenarios, and outperforms Cox-MDR when some SNPs having only marginal effects might mask the detection of the causal epistasis.

Male-specific epistasis between WWC1 and TLN2 genes is associated with Alzheimer's disease. Neurobiol Aging 2018;72:188.e3-188.e12. [PMID: 30201328 PMCID: PMC6769421 DOI: 10.1016/j.neurobiolaging.2018.08.001] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Revised: 07/05/2018] [Accepted: 08/01/2018] [Indexed: 12/19/2022]

Yadav RP, Ghatak S, Chakraborty P, Lalrohlui F, Kannan R, Kumar R, Pautu JL, Zomingthanga J, Chenkual S, Muthukumaran R, Senthil Kumar N. Lifestyle chemical carcinogens associated with mutations in cell cycle regulatory genes increases the susceptibility to gastric cancer risk. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2018;25:31691-31704. [PMID: 30209766 DOI: 10.1007/s11356-018-3080-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 08/27/2018] [Indexed: 06/08/2023]

Kundu S, Ramshankar V, Verma AK, Thangaraj SV, Krishnamurthy A, Kumar R, Kannan R, Ghosh SK. Association of DFNA5, SYK, and NELL1 variants along with HPV infection in oral cancer among the prolonged tobacco-chewers. Tumour Biol 2018;40:1010428318793023. [PMID: 30091681 DOI: 10.1177/1010428318793023] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Abstract

Southeast Asia, especially India, is well known for the highest use of smokeless tobacco. These products are known to induce oral squamous cell carcinoma. However, not all long-term tobacco-chewers develop oral squamous cell carcinoma. In addition, germline variants play a crucial role in susceptibility, prognosis, development, and progression of the disease. These prompted us to study the genetic susceptibility to oral squamous cell carcinoma among the long-term tobacco-chewers. Here, we presented a retrospective study on prolonged tobacco-chewers of Northeast India to identify the potential protective or risk-associated germline variants in tobacco-related oral squamous cell carcinoma along with HPV infection. Targeted re-sequencing (n = 60) of 170 genetic regions from 75 genes was carried out in Ion-PGM™ and validation (n = 116) of the observed variants was done using Sequenom iPLEX MassARRAY™ platform followed by polymerase chain reaction-based HPV genotyping and p16-immunohistochemistry study. Subsequently, estimation of population structure, different statistical and in silico approaches were undertaken. We identified one nonsense-mediated mRNA decay transcript variant in the DFNA5 region (rs2237306), associated with Benzo(a)pyrene, as a protective factor (odds ratio = 0.33; p = 0.009) and four harmful (odds ratio > 2.5; p < 0.05) intronic variants, rs182361, rs290974, and rs169724 in SYK and rs1670661 in NELL1 region, involved in genetic susceptibility to tobacco- and HPV-mediated oral oncogenesis. Among the oral squamous cell carcinoma patients, 12.6% (11/87) were HPV positive, out of which 45.5% (5/11) were HPV16-infected, 27.3% (3/11) were HPV18-infected, and 27.3% (3/11) had an infection of both subtypes. Multifactor dimensionality reduction analysis showed that the interactions among HPV and NELL1 variant rs1670661 with age and gender augmented the risk of both non-tobacco- and tobacco-related oral squamous cell carcinoma, respectively. 
These findings suggest that HPV infection may be one of the important risk factors for oral squamous cell carcinoma in this population. Finally, we newly report a DFNA5 variant probably conferring protection via the nonsense-mediated mRNA decay pathway against tobacco-related oral squamous cell carcinoma. Thus, the analytical approach used here can be useful in predicting the population-specific significant variants associated with oral squamous cell carcinoma in any heterogeneous population.

García-González I, López-Díaz RI, Canché-Pech JR, Solís-Cárdenas ADJ, Flores-Ocampo JA, Mendoza-Alcocer R, Herrera-Sánchez LF, Jiménez-Rico MA, Ceballos-López AA, López-Novelo ME. Epistasis analysis of metabolic genes polymorphisms associated with ischemic heart disease in Yucatan. CLINICA E INVESTIGACION EN ARTERIOSCLEROSIS : PUBLICACION OFICIAL DE LA SOCIEDAD ESPANOLA DE ARTERIOSCLEROSIS 2018;30:102-111. [PMID: 29395491 DOI: 10.1016/j.arteri.2017.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Revised: 11/27/2017] [Accepted: 11/29/2017] [Indexed: 06/07/2023]

Jung HY, Leem S, Park T. Fuzzy set-based generalized multifactor dimensionality reduction analysis of gene-gene interactions. BMC Med Genomics 2018;11:32. [PMID: 29697366 PMCID: PMC5918459 DOI: 10.1186/s12920-018-0343-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open

Ritchie MD, Van Steen K. The search for gene-gene interactions in genome-wide association studies: challenges in abundance of methods, practical considerations, and biological interpretation. ANNALS OF TRANSLATIONAL MEDICINE 2018;6:157. [PMID: 29862246 DOI: 10.21037/atm.2018.04.05] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Abstract

One of the primary goals in this era of precision medicine is to understand the biology of human diseases and their treatment, such that each individual patient receives the best possible treatment for their disease based on their genetic and environmental exposures. One way to work towards achieving this goal is to identify the environmental exposures and genetic variants that are relevant to each disease in question, as well as the complex interplay between genes and environment. Genome-wide association studies (GWAS) have allowed for a greater understanding of the genetic component of many complex traits. However, these genetic effects are largely small and thus, our ability to use these GWAS findings for precision medicine is limited. As more and more GWAS have been performed, rather than focusing only on common single nucleotide polymorphisms (SNPs) and additive genetic models, many researchers have begun to explore alternative heritable components of complex traits including rare variants, structural variants, epigenetics, and genetic interactions. While genetic interactions are a plausible reality that could explain some of the heritability that has not yet been identified, especially when one considers the identification of genetic interactions in model organisms as well as our understanding of biological complexity, there are still significant challenges and considerations in identifying these genetic interactions. Broadly, these can be summarized in three categories: abundance of methods, practical considerations, and biological interpretation. In this review, we will discuss these important elements in the search for genetic interactions along with some potential solutions. While genetic interactions are theoretically understood to be important for complex human disease, the body of evidence is still building to support this component of the underlying genetic architecture of complex human traits. Our hope is that more sophisticated modeling approaches and more robust computational techniques will enable the community to identify these important genetic interactions and improve our ability to implement precision medicine in the future.

Uppu S, Krishna A, Gopalan RP. A Review on Methods for Detecting SNP Interactions in High-Dimensional Genomic Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:599-612. [PMID: 28060710 DOI: 10.1109/tcbb.2016.2635125] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Yu W, Lee S, Park T. A unified model based multifactor dimensionality reduction framework for detecting gene-gene interactions. Bioinformatics 2017;32:i605-i610. [PMID: 27587680 DOI: 10.1093/bioinformatics/btw424] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Abstract

MOTIVATION

Gene-gene interaction (GGI) is one of the most popular approaches for finding and explaining the missing heritability of common complex traits in genome-wide association studies. The multifactor dimensionality reduction (MDR) method has been widely studied for detecting GGI effects. However, there are several disadvantages of the existing MDR-based approaches, such as the lack of an efficient way of evaluating the significance of multi-locus models and the high computational burden due to intensive permutation. Furthermore, the MDR method does not distinguish marginal effects from pure interaction effects.

METHODS

We propose a two-step unified model based MDR approach (UM-MDR), in which the significance of a multi-locus model, even a high-order model, can be easily obtained through a regression framework with a semi-parametric correction procedure for controlling Type I error rates. In comparison to the conventional permutation approach, the proposed semi-parametric correction procedure avoids heavy computation in assessing the significance of a multi-locus model. The proposed UM-MDR approach is flexible in the sense that it is able to incorporate different types of traits and evaluate the significance of the existing MDR extensions.

RESULTS

Simulation studies and the analysis of a real example are provided to demonstrate the utility of the proposed method. UM-MDR can achieve at least the same power as MDR for most scenarios, and it outperforms MDR especially when there are some single nucleotide polymorphisms that only have marginal effects, which mask the detection of causal epistasis for the existing MDR approaches.

CONCLUSIONS

UM-MDR provides a very good supplement to existing MDR methods due to its efficiency in assessing significance for every multi-locus model, its power and its flexibility in handling different types of traits.

AVAILABILITY AND IMPLEMENTATION

An R package "umMDR" and other source codes are freely available at http://statgen.snu.ac.kr/software/umMDR/

CONTACT

tspark@stats.snu.ac.kr

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Abo Alchamlat S, Farnir F. KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies. BMC Bioinformatics 2017;18:184. [PMID: 28327091 PMCID: PMC5361736 DOI: 10.1186/s12859-017-1599-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 03/11/2017] [Indexed: 12/30/2022] Open

Gupta U, Mir SS, Garg N, Agarwal SK, Pande S, Mittal B. Association study of inflammatory genes with rheumatic heart disease in North Indian population: A multi-analytical approach. Immunol Lett 2016;174:53-62. [PMID: 27118427 DOI: 10.1016/j.imlet.2016.04.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Revised: 04/13/2016] [Accepted: 04/13/2016] [Indexed: 10/21/2022]

Abstract

Rheumatic heart disease (RHD) is an inflammatory autoimmune disease that occurs as a consequence of group A streptococcal infection complicated by rheumatic fever (RF). An inappropriate immune response is central to the complex pathogenesis of RHD. However, only some of those infected develop RHD, and genetic host susceptibility factors are thought to play a key role in disease development. Therefore, the present study was designed to explore the role of genetic variants in inflammatory genes in conferring risk of RHD. The study recruited a total of 700 subjects, including 400 RHD patients and 300 healthy controls. We examined the associations of 8 selected polymorphisms in seven inflammatory genes: IL-6 [rs1800795G/C], IL-10 [rs1800896G/A], TNF-A [rs1800629G/A], IL-1β [rs2853550C/T], IL-1VNTR [rs2234663], TGF-β1 [rs1800469C/T]; [rs1982073T/C], and CTLA-4 [rs5742909C/T] with RHD risk. Genotyping for all the polymorphisms was done using PCR-ARMS/PCR/RFLP methods. Multifactor dimensionality reduction and classification and regression tree approaches were combined with logistic regression to discover high-order gene-gene interactions in the studied genes involved in RHD susceptibility. In univariate logistic regression analysis, we found a significant association of variant-containing genotypes (CT & TT) of TGF-β1 869T/C [rs1982073]; [p=0.004 & 0.001, OR (95% CI)=1.65 (1.2-2.3) & 2.25 (1.4-3.6) respectively], variant genotype (CC) of IL-1β -511C/T [rs2853550]; [p=0.001, OR (95% CI)=2.33 (1.4-3.8)] and IL-1 VNTR [rs2234663]; [p=0.03, OR (95% CI)=5.25 (1.2-23.4)] SNPs with RHD risk. CART analysis revealed that individuals with the combined genotypes of TGF-β1T/C_rs1982073 (CT/TT) and IL-1β_rs2853550 (CC) had significantly higher susceptibility for RHD [p=0.0005, OR (95% CI)=5.91 (2.9-12.5)]. In MDR analysis, TGF-β1 869T>C yielded the highest testing accuracy of 0.562. In conclusion, using multi-analytical approaches, our study revealed an important role of TGF-β1 869T/C [rs1982073] in RHD susceptibility.

Lishout FV, Gadaleta F, Moore JH, Wehenkel L, Steen KV. gammaMAXT: a fast multiple-testing correction algorithm. BioData Min 2015;8:36. [PMID: 26594243 PMCID: PMC4654922 DOI: 10.1186/s13040-015-0069-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2015] [Accepted: 11/08/2015] [Indexed: 02/07/2023] Open

Abstract

BACKGROUND

The purpose of the MaxT algorithm is to provide a significance test that controls the family-wise error rate (FWER) during simultaneous hypothesis testing. However, the computing time and memory requirements of this procedure are proportional to the number of investigated hypotheses. The memory issue was solved in 2013 by Van Lishout's implementation of MaxT, which makes the memory usage independent of the size of the dataset. This algorithm is implemented in MBMDR-3.0.3, software that can effectively identify genetic interactions for a variety of SNP-SNP based epistasis models. On the other hand, that implementation turned out to be less suitable for genome-wide interaction analysis studies, due to the prohibitive computational burden.
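As an illustration of the permutation idea behind MaxT, the following is a minimal Python sketch of a single-step maxT adjustment, under stated assumptions. It is not the MBMDR-3.0.3 implementation: the function names, the mean-difference test statistic, and the default number of permutations are all hypothetical choices made for the example. Each adjusted p-value is the fraction of trait permutations whose largest statistic reaches the observed statistic, with the observed data counted as one permutation.

```python
import random


def make_stat_fn(features):
    """Hypothetical helper: absolute case/control mean difference per feature.

    features is a list of columns, each holding one numeric value per individual.
    """
    def stat_fn(labels):
        stats = []
        for col in features:
            cases = [x for x, y in zip(col, labels) if y == 1]
            controls = [x for x, y in zip(col, labels) if y == 0]
            stats.append(abs(sum(cases) / len(cases) - sum(controls) / len(controls)))
        return stats
    return stat_fn


def maxt_adjusted_pvalues(stat_fn, labels, n_perm=999, seed=0):
    """Single-step maxT: compare each observed statistic with the per-permutation maximum."""
    rng = random.Random(seed)
    observed = stat_fn(labels)
    # Start every counter at 1 so the observed data count as one permutation.
    exceed = [1] * len(observed)
    shuffled = list(labels)
    for _ in range(n_perm):
        rng.shuffle(shuffled)              # permute the trait, keep genotypes fixed
        max_stat = max(stat_fn(shuffled))  # only the maximum is needed per permutation
        for j, obs in enumerate(observed):
            if max_stat >= obs:
                exceed[j] += 1
    return [count / (n_perm + 1) for count in exceed]
```

Because only the maximum statistic of each permutation is compared against the observed values, this sketch keeps one counter per hypothesis rather than a full permutation-by-hypothesis table. The computing time still grows with the number of hypotheses, which is the burden the abstract describes.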

RESULTS

In this work we introduce gammaMAXT, a novel implementation of the maxT algorithm for multiple testing correction. The algorithm was implemented in the software MBMDR-4.2.2, as part of the MB-MDR framework to screen for SNP-SNP, SNP-environment or SNP-SNP-environment interactions at a genome-wide level. We show that, in the absence of interaction effects, test statistics produced by the MB-MDR methodology follow a mixture distribution with a point mass at zero and a shifted gamma distribution for the top 10% of the strictly positive values. We show that the gammaMAXT algorithm has power comparable to MaxT and maintains the FWER, but requires fewer computational resources and less time. We analyze a dataset composed of 10^6 SNPs and 1000 individuals within one day on a 256-core computer cluster. The same analysis would take about 10^4 times longer with MBMDR-3.0.3.

CONCLUSIONS

These results are promising for future GWAIs. However, the proposed gammaMAXT algorithm offers a general significance assessment and multiple testing approach, applicable to any context that requires performing hundreds of thousands of tests. It offers new perspectives for fast and efficient permutation-based significance assessment in large-scale (integrated) omics studies.

Kim Y, Park T. Robust Gene-Gene Interaction Analysis in Genome Wide Association Studies. PLoS One 2015;10:e0135016. [PMID: 26267341 PMCID: PMC4534386 DOI: 10.1371/journal.pone.0135016] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Accepted: 07/17/2015] [Indexed: 11/19/2022] Open

Choudhury JH, Singh SA, Kundu S, Choudhury B, Talukdar FR, Srivasta S, Laskar RS, Dhar B, Das R, Laskar S, Kumar M, Kapfo W, Mondal R, Ghosh SK. Tobacco carcinogen-metabolizing genes CYP1A1, GSTM1, and GSTT1 polymorphisms and their interaction with tobacco exposure influence the risk of head and neck cancer in Northeast Indian population. Tumour Biol 2015;36:5773-83. [PMID: 25724184 DOI: 10.1007/s13277-015-3246-0] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2014] [Accepted: 02/10/2015] [Indexed: 11/29/2022] Open

Yu W, Kwon MS, Park T. Multivariate Quantitative Multifactor Dimensionality Reduction for Detecting Gene-Gene Interactions. Hum Hered 2015. [PMID: 26201702 DOI: 10.1159/000377723] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Fouladi R, Bessonov K, Van Lishout F, Van Steen K. Model-Based Multifactor Dimensionality Reduction for Rare Variant Association Analysis. Hum Hered 2015. [PMID: 26201701 DOI: 10.1159/000381286] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Gola D, Mahachie John JM, van Steen K, König IR. A roadmap to multifactor dimensionality reduction methods. Brief Bioinform 2015;17:293-308. [PMID: 26108231 PMCID: PMC4793893 DOI: 10.1093/bib/bbv038] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2015] [Indexed: 02/02/2023] Open

Rule-based analysis for detecting epistasis using associative classification mining. ACTA ACUST UNITED AC 2015. [DOI: 10.1007/s13721-015-0084-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Bridging the gap between statistical and biological epistasis in Alzheimer's disease. BIOMED RESEARCH INTERNATIONAL 2015;2015:870123. [PMID: 26075270 PMCID: PMC4449899 DOI: 10.1155/2015/870123] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 03/06/2015] [Accepted: 05/05/2015] [Indexed: 12/17/2022]

Bessonov K, Gusareva ES, Van Steen K. A cautionary note on the impact of protocol changes for genome-wide association SNP × SNP interaction studies: an example on ankylosing spondylitis. Hum Genet 2015;134:761-73. [DOI: 10.1007/s00439-015-1560-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2015] [Accepted: 04/26/2015] [Indexed: 12/11/2022]

Grange L, Bureau JF, Nikolayeva I, Paul R, Van Steen K, Schwikowski B, Sakuntabhai A. Filter-free exhaustive odds ratio-based genome-wide interaction approach pinpoints evidence for interaction in the HLA region in psoriasis. BMC Genet 2015;16:11. [PMID: 25655172 PMCID: PMC4341885 DOI: 10.1186/s12863-015-0174-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2014] [Accepted: 01/23/2015] [Indexed: 12/02/2022] Open

Abstract

Background

Deciphering the genetic architecture of complex traits is still a major challenge for human genetics. In most cases, genome-wide association studies have only partially explained the heritability of traits and diseases. Epistasis, one potentially important cause of this missing heritability, is difficult to explore at the genome-wide level. Here, we develop and assess a tool based on interactive odds ratios (I_OR), Fast Odds Ratio-based sCan for Epistasis (FORCE), as a novel approach for exhaustive genome-wide epistasis search. I_OR is the ratio of the product of the odds ratios (ORs) of having each variant to the OR of having both of them. By definition, an I_OR that significantly deviates from 1 suggests the occurrence of an interaction (epistasis). As the I_OR is fast to calculate, we used the I_OR to rank and select pairs of interacting polymorphisms for P value estimation, which is more time consuming.
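The I_OR definition above can be made concrete with a small Python sketch. This is one reading of the abstract, not the FORCE code: it assumes binary carrier status for two variants A and B, takes individuals carrying neither variant as the reference group for all three odds ratios, and uses hypothetical function and key names.

```python
def odds_ratio(case_exposed, control_exposed, case_reference, control_reference):
    """Odds ratio of disease for an exposed group relative to a reference group."""
    return (case_exposed * control_reference) / (control_exposed * case_reference)


def interactive_odds_ratio(cases, controls):
    """I_OR = (OR_A * OR_B) / OR_AB, following the wording of the abstract.

    cases and controls are dicts of counts keyed by carrier group:
    'none' (neither variant), 'a' (A only), 'b' (B only), 'ab' (both).
    The choice of 'none' as the reference group is an assumption of this sketch.
    """
    or_a = odds_ratio(cases["a"], controls["a"], cases["none"], controls["none"])
    or_b = odds_ratio(cases["b"], controls["b"], cases["none"], controls["none"])
    or_ab = odds_ratio(cases["ab"], controls["ab"], cases["none"], controls["none"])
    return (or_a * or_b) / or_ab
```

With case counts of 100, 200, 300 and 600 and 100 controls in every group, the individual odds ratios are 2 and 3 and the joint odds ratio is 6, so the I_OR equals 1 and there is no deviation from a multiplicative model. Empty cells raise a ZeroDivisionError in this sketch; handling them would need a continuity correction or a filter, which is not covered here.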

Results

FORCE displayed power and accuracy similar to existing parametric and non-parametric methods, and is fast enough to complete a filter-free genome-wide epistasis search in a few days on a standard computer. Analysis of psoriasis data uncovered novel epistatic interactions in the HLA region, corroborating the known major and complex role of the HLA region in psoriasis susceptibility.

Conclusions

Our systematic study revealed the ability of FORCE to uncover novel interactions, highlighted the importance of exhaustiveness, as well as its specificity for certain types of interactions that were not detected by existing approaches. We therefore believe that FORCE is a valuable new tool for decoding the genetic basis of complex diseases.

Electronic supplementary material

The online version of this article (doi:10.1186/s12863-015-0174-3) contains supplementary material, which is available to authorized users.

Gusareva ES, Carrasquillo MM, Bellenguez C, Cuyvers E, Colon S, Graff-Radford NR, Petersen RC, Dickson DW, Mahachie John JM, Bessonov K, Van Broeckhoven C, Harold D, Williams J, Amouyel P, Sleegers K, Ertekin-Taner N, Lambert JC, Van Steen K. Genome-wide association interaction analysis for Alzheimer's disease. Neurobiol Aging 2014;35:2436-2443. [PMID: 24958192 PMCID: PMC4370231 DOI: 10.1016/j.neurobiolaging.2014.05.014] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Revised: 05/19/2014] [Accepted: 05/21/2014] [Indexed: 12/23/2022]

Affiliation(s)

Elena S Gusareva Systems and Modeling Unit, Montefiore Institute, University of Liege, Belgium; Bioinformatics and Modeling, GIGA-R, University of Liege, Belgium.
Minerva M Carrasquillo Department of Neuroscience, Mayo Clinic Florida, Jacksonville, FL, USA
Céline Bellenguez INSERM U744, Lille, France; Department of Public Health and Molecular Epidemiology of Aging Related Diseases, Institut Pasteur de Lille, Lille, France; Universite de Lille Nord de France, Lille, France
Elise Cuyvers Department of Molecular Genetics, VIB, Antwerp, Belgium; Department of Neurology, Institute Born-Bunge, University of Antwerp, Antwerp, Belgium
Samuel Colon Department of Neuroscience, Mayo Clinic Florida, Jacksonville, FL, USA
Neill R Graff-Radford Department of Neurology, Mayo Clinic Florida, Jacksonville, FL, USA
Ronald C Petersen Department of Neurology, Mayo Clinic Florida, Rochester, MN, USA
Dennis W Dickson Department of Neuroscience, Mayo Clinic Florida, Jacksonville, FL, USA
Jestinah M Mahachie John Systems and Modeling Unit, Montefiore Institute, University of Liege, Belgium; Bioinformatics and Modeling, GIGA-R, University of Liege, Belgium
Kyrylo Bessonov Systems and Modeling Unit, Montefiore Institute, University of Liege, Belgium; Bioinformatics and Modeling, GIGA-R, University of Liege, Belgium
Christine Van Broeckhoven Department of Molecular Genetics, VIB, Antwerp, Belgium; Department of Neurology, Institute Born-Bunge, University of Antwerp, Antwerp, Belgium
Denise Harold Medical Research Council Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University School of Medicine, Cardiff, UK
Julie Williams Medical Research Council Centre for Neuropsychiatric Genetics and Genomics, Institute of Psychological Medicine and Clinical Neurosciences, Cardiff University School of Medicine, Cardiff, UK
Philippe Amouyel INSERM U744, Lille, France; Department of Public Health and Molecular Epidemiology of Aging Related Diseases, Institut Pasteur de Lille, Lille, France; Universite de Lille Nord de France, Lille, France
Kristel Sleegers Department of Molecular Genetics, VIB, Antwerp, Belgium; Department of Neurology, Institute Born-Bunge, University of Antwerp, Antwerp, Belgium
Nilüfer Ertekin-Taner Department of Neuroscience, Mayo Clinic Florida, Jacksonville, FL, USA; Department of Neurology, Mayo Clinic Florida, Jacksonville, FL, USA
Jean-Charles Lambert INSERM U744, Lille, France; Department of Public Health and Molecular Epidemiology of Aging Related Diseases, Institut Pasteur de Lille, Lille, France; Universite de Lille Nord de France, Lille, France
Kristel Van Steen Systems and Modeling Unit, Montefiore Institute, University of Liege, Belgium; Bioinformatics and Modeling, GIGA-R, University of Liege, Belgium

Gusareva ES, Van Steen K. Practical aspects of genome-wide association interaction analysis. Hum Genet 2014;133:1343-58. [DOI: 10.1007/s00439-014-1480-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 08/18/2014] [Indexed: 12/31/2022]

Guo X, Meng Y, Yu N, Pan Y. Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering. BMC Bioinformatics 2014;15:102. [PMID: 24717145 PMCID: PMC4021249 DOI: 10.1186/1471-2105-15-102] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2013] [Accepted: 03/17/2014] [Indexed: 11/25/2022] Open

Abstract

Background

Taking advantage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions involving multiple loci, which are widespread in complex traits. In addition, statistical tests for high-order epistatic interactions with more than 2 SNPs pose computational and analytical challenges because the computation increases exponentially as the cardinality of SNP combinations gets larger.

Results

In this paper, we provide a simple, fast and powerful method using dynamic clustering and cloud computing to detect genome-wide multi-locus epistatic interactions. We have constructed systematic experiments to compare its power against some recently proposed algorithms, including TEAM, SNPRuler, EDCF and BOOST. Furthermore, we have applied our method to two real GWAS datasets, the Age-related macular degeneration (AMD) and Rheumatoid arthritis (RA) datasets, where we find some novel potential disease-related genetic factors that do not show up when detecting 2-locus epistatic interactions.

Conclusions

Experimental results on simulated data demonstrate that our method is more powerful than some recently proposed methods on both two- and three-locus disease models. Our method has discovered many novel high-order associations that are significantly enriched in cases from two real GWAS datasets. Moreover, the running times of the cloud implementation of our method for detecting two-locus interactions on the AMD dataset and the RA dataset are roughly 2 hours and 50 hours, respectively, on a cluster with forty small virtual machines. Therefore, we believe that our method is suitable and effective for the full-scale analysis of multiple-locus epistatic interactions in GWAS.

Liu J, Calhoun VD. A review of multivariate analyses in imaging genetics. Front Neuroinform 2014;8:29. [PMID: 24723883 PMCID: PMC3972473 DOI: 10.3389/fninf.2014.00029] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Accepted: 03/04/2014] [Indexed: 12/13/2022] Open

Abstract

Recent advances in neuroimaging technology and molecular genetics provide the unique opportunity to investigate genetic influence on the variation of brain attributes. Since the year 2000, when the initial publication on brain imaging and genetics was released, imaging genetics has been a rapidly growing research approach with increasing publications every year. Several reviews have been offered to the research community focusing on various study designs. In addition to study design, analytic tools and their proper implementation are also critical to the success of a study. In this review, we survey recent publications using data from neuroimaging and genetics, focusing on methods capturing multivariate effects accommodating the large number of variables from both imaging data and genetic data. We group the analyses of genetic or genomic data into either a priori driven or data driven approach, including gene-set enrichment analysis, multifactor dimensionality reduction, principal component analysis, independent component analysis (ICA), and clustering. For the analyses of imaging data, ICA and extensions of ICA are the most widely used multivariate methods. Given detailed reviews of multivariate analyses of imaging data available elsewhere, we provide a brief summary here that includes a recently proposed method known as independent vector analysis. Finally, we review methods focused on bridging the imaging and genetic data by establishing multivariate and multiple genotype-phenotype-associations, including sparse partial least squares, sparse canonical correlation analysis, sparse reduced rank regression and parallel ICA. These methods are designed to extract latent variables from both genetic and imaging data, which become new genotypes and phenotypes, and the links between the new genotype-phenotype pairs are maximized using different cost functions. The relationship between these methods along with their assumptions, advantages, and limitations are discussed.

Hoefkens E, Nys K, John JM, Van Steen K, Arijs I, Van der Goten J, Van Assche G, Agostinis P, Rutgeerts P, Vermeire S, Cleynen I. Genetic association and functional role of Crohn disease risk alleles involved in microbial sensing, autophagy, and endoplasmic reticulum (ER) stress. Autophagy 2013;9:2046-55. [PMID: 24247223 DOI: 10.4161/auto.26337] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Mahachie John JM, Van Lishout F, Gusareva ES, Van Steen K. A robustness study of parametric and non-parametric tests in model-based multifactor dimensionality reduction for epistasis detection. BioData Min 2013;6:9. [PMID: 23618370 PMCID: PMC3668290 DOI: 10.1186/1756-0381-6-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Accepted: 04/20/2013] [Indexed: 11/10/2022] Open

Abstract

Background

Applying a statistical method implies identifying underlying (model) assumptions and checking their validity in the particular context. One of these contexts is association modeling for epistasis detection. Here, depending on the technique used, violation of model assumptions may result in increased type I error, power loss, or biased parameter estimates. Remedial measures for violated underlying conditions or assumptions include data transformation or selecting a more relaxed modeling or testing strategy. Model-Based Multifactor Dimensionality Reduction (MB-MDR) for epistasis detection relies on association testing between a trait and a factor consisting of multilocus genotype information. For quantitative traits, the framework is essentially Analysis of Variance (ANOVA) that decomposes the variability in the trait amongst the different factors. In this study, we assess through simulations, the cumulative effect of deviations from normality and homoscedasticity on the overall performance of quantitative Model-Based Multifactor Dimensionality Reduction (MB-MDR) to detect 2-locus epistasis signals in the absence of main effects.

Methodology

Our simulation study focuses on pure epistasis models with varying degrees of genetic influence on a quantitative trait. Conditional on a multilocus genotype, we consider quantitative trait distributions that are normal, chi-square or Student’s t with constant or non-constant phenotypic variances. All data are analyzed with MB-MDR using the built-in Student’s t-test for association, as well as a novel MB-MDR implementation based on Welch’s t-test. Traits are either left untransformed or are transformed into new traits via logarithmic, standardization or rank-based transformations, prior to MB-MDR modeling.

Results

Our simulation results show that MB-MDR controls type I error and false positive rates irrespective of the association test considered. Empirically-based MB-MDR power estimates for MB-MDR with Welch’s t-tests are generally lower than those for MB-MDR with Student’s t-tests. Trait transformations involving ranks tend to lead to increased power compared to the other considered data transformations.

Conclusions

When performing MB-MDR screening for gene-gene interactions with quantitative traits, we recommend first rank-transforming traits to normality and then applying MB-MDR modeling with Student’s t-tests as internal tests for association.
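One common way to rank-transform a trait to normality is the rank-based inverse normal transformation, sketched below in Python. The abstract does not state which variant the authors used, so the 0.5 offset, the average-rank handling of ties, and the function name are assumptions of this example.

```python
from statistics import NormalDist


def rank_to_normal(values, offset=0.5):
    """Map values to standard normal quantiles via their (tie-averaged) ranks."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Find the run of tied values starting at sorted position i.
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1  # 1-based average rank for the tied run
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    standard_normal = NormalDist()
    # (rank - offset) / n lies strictly between 0 and 1, so inv_cdf is defined.
    return [standard_normal.inv_cdf((r - offset) / n) for r in ranks]
```

The transformed trait keeps the ordering of the original values while discarding their scale, so it could then be passed to a Student's t-test in place of the raw trait.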

Van Lishout F, Mahachie John JM, Gusareva ES, Urrea V, Cleynen I, Théâtre E, Charloteaux B, Calle ML, Wehenkel L, Van Steen K. An efficient algorithm to perform multiple testing in epistasis screening. BMC Bioinformatics 2013;14:138. [PMID: 23617239 PMCID: PMC3648350 DOI: 10.1186/1471-2105-14-138] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2012] [Accepted: 04/12/2013] [Indexed: 12/22/2022] Open

Abstract

BACKGROUND

Research in epistasis or gene-gene interaction detection for human complex traits has grown over the last few years. It has been marked by promising methodological developments, improved translation efforts of statistical epistasis to biological epistasis and attempts to integrate different omics information sources into the epistasis screening to enhance power. The quest for gene-gene interactions poses severe multiple-testing problems. In this context, the maxT algorithm is one technique to control the false-positive rate. However, the memory needed by this algorithm rises linearly with the number of hypothesis tests. Gene-gene interaction studies will require memory proportional to the squared number of SNPs. A genome-wide epistasis search would therefore require terabytes of memory. Hence, cache problems are likely to occur, increasing the computation time. In this work we present a new version of maxT, requiring an amount of memory independent of the number of genetic effects to be investigated. This algorithm was implemented in C++ in our epistasis screening software MBMDR-3.0.3. We evaluate the new implementation in terms of memory efficiency and speed using simulated data. The software is illustrated on real-life data for Crohn's disease.
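The quadratic memory growth described above can be checked with a short back-of-the-envelope calculation in Python. The figures are illustrative assumptions, not values from the paper: a genome-wide panel of 10^6 SNPs and one 8-byte value stored per pairwise test.

```python
def pairwise_tests(n_snps):
    """Number of distinct SNP-SNP pairs among n_snps markers."""
    return n_snps * (n_snps - 1) // 2


def memory_terabytes(n_snps, bytes_per_test=8):
    """Memory needed to hold one value per pairwise test, in terabytes (10^12 bytes)."""
    return pairwise_tests(n_snps) * bytes_per_test / 1e12
```

Under these assumptions, 100,000 SNPs give 4,999,950,000 pairs, and 10^6 SNPs give 499,999,500,000 pairs, or roughly 4 TB when each test stores a single 8-byte value.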

RESULTS

In the case of a binary (affected/unaffected) trait, the parallel workflow of MBMDR-3.0.3 analyzes all gene-gene interactions in a dataset of 100,000 SNPs typed on 1000 individuals within 4 days and 9 hours, using 999 permutations of the trait to assess statistical significance, on a cluster composed of 10 blades, each containing four Quad-Core AMD Opteron(tm) Processor 2352 2.1 GHz. In the case of a continuous trait, a similar run takes 9 days. Our program found 14 SNP-SNP interactions with a multiple-testing corrected p-value of less than 0.05 on real-life Crohn's disease (CD) data.

CONCLUSIONS

Our software is the first implementation of the MB-MDR methodology able to solve large-scale SNP-SNP interaction problems within a few days, without using much memory, while adequately controlling the type I error rates. A new implementation to reach genome-wide epistasis screening is under construction. In the context of Crohn's disease, MBMDR-3.0.3 could identify epistasis involving regions that are well known in the field and could be explained from a biological point of view. This demonstrates the power of our software to find relevant phenotype-genotype higher-order associations.

Pan Q, Hu T, Moore JH. Epistasis, complexity, and multifactor dimensionality reduction. Methods Mol Biol 2013;1019:465-477. [PMID: 23756906 DOI: 10.1007/978-1-62703-447-0_22] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Applications of multifactor dimensionality reduction to genome-wide data using the R package 'MDR'. Methods Mol Biol 2013;1019:479-98. [PMID: 23756907 DOI: 10.1007/978-1-62703-447-0_23] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Upstill-Goddard R, Eccles D, Fliege J, Collins A. Machine learning approaches for the discovery of gene-gene interactions in disease data. Brief Bioinform 2012;14:251-60. [DOI: 10.1093/bib/bbs024] [Citation(s) in RCA: 69] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open

Kazma R, Bailey JN. Population-based and family-based designs to analyze rare variants in complex diseases. Genet Epidemiol 2012;35 Suppl 1:S41-7. [PMID: 22128057 DOI: 10.1002/gepi.20648] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Rodin AS, Gogoshin G, Boerwinkle E. Systems biology data analysis methodology in pharmacogenomics. Pharmacogenomics 2012;12:1349-60. [PMID: 21919609 DOI: 10.2217/pgs.11.76] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open