1
|
SEEI: spherical evolution with feedback mechanism for identifying epistatic interactions. BMC Genomics 2024; 25:462. [PMID: 38735952 DOI: 10.1186/s12864-024-10373-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Accepted: 05/03/2024] [Indexed: 05/14/2024] Open
Abstract
BACKGROUND Detecting epistatic interactions (EIs) involves the exploration of associations among single nucleotide polymorphisms (SNPs) and complex diseases, which is an important task in genome-wide association studies. The EI detection problem is dependent on epistasis models and corresponding optimization methods. Although various models and methods have been proposed to detect EIs, identifying EIs efficiently and accurately is still a challenge. RESULTS Here, we propose a linear mixed statistical epistasis model (LMSE) and a spherical evolution approach with a feedback mechanism (named SEEI). The LMSE model expands the existing single epistasis models such as LR-Score, K2-Score, Mutual information, and Gini index. The SEEI includes an adaptive spherical search strategy and population updating strategy, which ensures that the algorithm is not easily trapped in local optima. We analyzed the performances of 8 random disease models, 12 disease models with marginal effects, 30 disease models without marginal effects, and 10 high-order disease models. The 60 simulated disease models and a real breast cancer dataset were used to evaluate eight algorithms (SEEI, EACO, EpiACO, FDHEIW, MP-HS-DHSI, NHSA-DHSC, SNPHarvester, CSE). Three evaluation criteria (pow1, pow2, pow3), a T-test, and a Friedman test were used to compare the performances of these algorithms. The results show that the SEEI algorithm (order 1, averages ranks = 13.125) outperformed the other algorithms in detecting EIs. CONCLUSIONS Here, we propose an LMSE model and an evolutionary computing method (SEEI) to solve the optimization problem of the LMSE model. The proposed method performed better than the other seven algorithms tested in its ability to identify EIs in genome-wide association datasets. We identified new SNP-SNP combinations in the real breast cancer dataset and verified the results. Our findings provide new insights for the diagnosis and treatment of breast cancer. AVAILABILITY AND IMPLEMENTATION https://github.com/scutdy/SSO/blob/master/SEEI.zip .
Collapse
|
2
|
Genome-wide association studies for a comprehensive understanding of the genetic architecture of culm strength and yield traits in rice. FRONTIERS IN PLANT SCIENCE 2024; 14:1298083. [PMID: 38317832 PMCID: PMC10839031 DOI: 10.3389/fpls.2023.1298083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 12/14/2023] [Indexed: 02/07/2024]
Abstract
Lodging resistance in rice is a complex trait determined by culm morphological and culm physical strength traits, and these traits are a major determinant of yield. We made a detailed analysis of various component traits with the aim of deriving optimized parameters for measuring culm strength. Genotyping by sequencing (GBS)-based genome-wide association study (GWAS) was employed among 181 genotypes for dissecting the genetic control of culm strength traits. The VanRaden kinship algorithm using 6,822 filtered single-nucleotide polymorphisms (SNPs) revealed the presence of two sub-groups within the association panel with kinship values concentrated at<0.5 level, indicating greater diversity among the genotypes. A wide range of phenotypic variation and high heritability for culm strength and yield traits were observed over two seasons, as reflected in best linear unbiased prediction (BLUP) estimates. The multi-locus model for GWAS resulted in the identification of 15 highly significant associations (p< 0.0001) for culm strength traits. Two novel major effect marker-trait associations (MTAs) for section modulus and bending stress were identified on chromosomes 2 and 12 with a phenotypic variance of 21.87% and 10.14%, respectively. Other MTAs were also noted in the vicinity of previously reported putative candidate genes for lodging resistance, providing an opportunity for further research on the biochemical basis of culm strength. The quantitative trait locus (QTL) hotspot identified on chromosome 12 with the synergistic association for culm strength trait (section modulus, bending stress, and internode breaking weight) and grain number can be considered a novel genomic region that can serve a dual purpose of enhancing culm strength and grain yield. Elite donors in the indica background with beneficial alleles of the identified major QTLs could be a valuable resource with greater significance in practical plant breeding programs focusing on improving lodging resistance in rice.
Collapse
|
3
|
Mapping of QTLs and meta-QTLs for Heterodera avenae Woll. resistance in common wheat (Triticum aestivum L.). BMC PLANT BIOLOGY 2023; 23:529. [PMID: 37904124 PMCID: PMC10617160 DOI: 10.1186/s12870-023-04526-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 10/14/2023] [Indexed: 11/01/2023]
Abstract
BACKGROUND In hexaploid wheat, quantitative trait loci (QTL) and meta-QTL (MQTL) analyses were conducted to identify genomic regions controlling resistance to cereal cyst nematode (CCN), Heterodera avenae. A mapping population comprising 149 RILs derived from the cross HUW 468 × C 306 was used for composite interval mapping (CIM) and inclusive composite interval mapping (ICIM). RESULTS Eight main effect QTLs on three chromosomes (1B, 2A and 3A) were identified using two repeat experiments. One of these QTLs was co-localized with a previously reported wheat gene Cre5 for resistance to CCN. Seven important digenic epistatic interactions (PVE = 5% or more) were also identified, each involving one main effect QTL and another novel E-QTL. Using QTLs earlier reported in literature, two meta-QTLs were also identified, which were also used for identification of 57 candidate genes (CGs). Out of these, 29 CGs have high expression in roots and encoded the following proteins having a role in resistance to plant parasitic nematodes (PPNs): (i) NB-ARC,P-loop containing NTP hydrolase, (ii) Protein Kinase, (iii) serine-threonine/tyrosine-PK, (iv) protein with leucine-rich repeat, (v) virus X resistance protein-like, (vi) zinc finger protein, (vii) RING/FYVE/PHD-type, (viii) glycosyl transferase, family 8 (GT8), (ix) rubisco protein with small subunit domain, (x) protein with SANT/Myb domain and (xi) a protein with a homeobox. CONCLUSION Identification and selection of resistance loci with additive and epistatic effect along with two MQTL and associated CGs, identified in the present study may prove useful for understanding the molecular basis of resistance against H. avenae in wheat and for marker-assisted selection (MAS) for breeding CCN resistant wheat cultivars.
Collapse
|
4
|
Development and marker-trait relationships of functional markers for glutamine synthetase GS1 and GS2 homoeogenes in bread wheat. MOLECULAR BREEDING : NEW STRATEGIES IN PLANT IMPROVEMENT 2023; 43:8. [PMID: 37309364 PMCID: PMC10248667 DOI: 10.1007/s11032-022-01354-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 12/28/2022] [Indexed: 06/14/2023]
Abstract
GS1 and GS2 genes encode, respectively, the main cytosolic and the plastidic isoforms of glutamine synthetase (GS). In the present study, the wheat GS1 and GS2 homoeogenes located in the A, B and D genome chromosomes have been sequenced in a group of 15 bread wheat varieties including landraces, old commercial varieties and modern cultivars. Phenotypic characterization by multi-environment field trials detected significant effects of specific GS homoeogenes on three of the seven agronomic and grain quality traits analyzed. Based on the gene sequence polymorphisms found, biallelic molecular markers that could facilitate marker-assisted breeding were developed for genes GS1A, GS2A and GS2D. The remaining genes encoding main wheat GS were excluded because of being monomorphic (GS1D) or too polymorphic (GS1B and GS2B) in the sequencing panel varieties. A collection of 187 Spanish bread wheat landraces was genotyped for these gene-based molecular markers. Data analyses conducted with phenotypic records reported for this germplasm collection in López-Fernández et al. (Plants-Basel 10: 620, 2021) have revealed the beneficial influence of some individual alleles on thousand-kernel weight (TKW), kernels per spike (KS) and grain protein content. Furthermore, genetic interactions between GS1A, a cytosolic GS isoform coding gene, and GS2A or GS2D, plastidic GS enzyme coding genes, were found to affect TKW and KS. The finding that some alleles at one locus may mask the effect of positive alleles at hypostatic GS loci should be kept in mind if gene pyramiding strategies are attempted for the improvement of N-use efficiency-related traits. Supplementary Information The online version contains supplementary material available at 10.1007/s11032-022-01354-0.
Collapse
|
5
|
Abstract
With the advancements in next-generation sequencing technologies, leading to millions of single nucleotide polymorphisms in all crop species including wheat, genome-wide association study (GWAS) has become a leading approach for trait dissection. In wheat, GWAS has been conducted for a plethora of traits and more and more studies are being conducted and reported in journals. While application of GWAS has become a routine in wheat using the standardized approaches, there has been a great leap forward using newer models and combination of GWAS with other sets of data. This chapter has reviewed all these latest advancements in GWAS in wheat by citing the most important studies and their outputs. Specially, we have focused on studies that conducted meta-GWAS, multilocus GWAS, haplotype-based GWAS, Environmental- and Eigen-GWAS, and/or GWAS combined with gene regulatory network and pathway analyses or epistatic interactions analyses; all these have taken the association mapping approach to new heights in wheat.
Collapse
|
6
|
Genome-wide association study of flowering time reveals complex genetic heterogeneity and epistatic interactions in rice. Gene 2020; 770:145353. [PMID: 33333227 DOI: 10.1016/j.gene.2020.145353] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 10/24/2020] [Accepted: 12/01/2020] [Indexed: 11/22/2022]
Abstract
Since domestication, rice has cultivated in a wide range of latitudes with different day lengths. Selection of diverse natural variations in heading date and photoperiod sensitivity is critical for adaptation of rice to different geographical environments. To unravel the genetic architecture underlying natural variation of rice flowering time, we conducted a genome wide association study (GWAS) using several association analysis strategies with a diverse worldwide collection of 529 O. sativa accessions. Heading date was investigated in three environments under long-day or short-day conditions, and photosensitivity was evaluated. By dividing the whole association panel into subpopulations and performing GWAS with both linear mixed models and multi-locus mixed-models, we revealed hundreds of significant loci harboring novel candidate genes as well as most of the known flowering time genes. In total, 127 hotspots were detected in at least two GWAS. Universal genetic heterogeneity was found across subpopulations. We further detected abundant interactions between GWAS loci, especially in indica. Functional gene families were revealed from enrichment analysis of the 127 hotspots. The results demonstrated a rich of genetic interactions in rice flowering time genes and such epistatic interactions contributed to the large portions of missing heritability in GWAS. It suggests the increased complexity of genetic heterogeneity might discount the power of increasing the sample sizes in GWAS.
Collapse
|
7
|
Genetic factors affecting Fusarium head blight resistance improvement from introgression of exotic Sumai 3 alleles (including Fhb1, Fhb2, and Fhb5) in hard red spring wheat. BMC PLANT BIOLOGY 2019; 19:179. [PMID: 31053089 PMCID: PMC6499950 DOI: 10.1186/s12870-019-1782-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2018] [Accepted: 04/16/2019] [Indexed: 05/09/2023]
Abstract
BACKGROUND Fusarium head blight resistance genes, Fhb1 (for Type-II resistance), Fhb2 (Type-II), and Fhb5 (Type-I plus some Type-II), which originate from Sumai 3, are among the most important that confer resistance in hexaploid wheat. Near-isogenic lines (NILs), in the CDC Alsask (susceptible; n = 32) and CDC Go (moderately susceptible; n = 38) backgrounds, carrying these genes in all possible combinations were developed using flanking microsatellite markers and evaluated for their response to FHB and deoxynivalenol (DON) accumulation in eight environments. NILs were haplotyped with wheat 90 K iSelect assay to elucidate the genomic composition and confirm alleles' presence. Other than evaluating the effects of three major genes in common genetic background, the study elucidated the epistatic gene interactions as they influence FHB measurements; identified loci other than Fhb1, Fhb2, and Fhb5, in both recurrent and donor parents and examined annotated proteins in gene intervals. RESULTS Genotyping using 81,857 single nucleotide polymorphism (SNP) markers revealed polymorphism on all chromosomes and that the NILs carried < 3% of alleles from the resistant donor. Significant improvement in field resistance (Type-I + Type-II) resulted only among the CDC Alsask NILs, not the CDC Go NILs. The phenotypic response of NILs carrying combinations of Sumai 3 derived genes suggested non-additive responses and Fhb5 was as good as Fhb1 in conferring field resistance in both populations. In addition to Fhb1, Fhb2, and Fhb5, four to five resistance improving alleles in both populations were identified and three of five in CDC Go were contributed by the susceptible parent. The introgressed chromosome regions carried genes encoding disease resistance proteins, protein kinases, nucleotide-binding and leucine rich repeats' domains. Complex epistatic gene-gene interactions among marker loci (including Fhb1, Fhb2, Fhb5) explained > 20% of the phenotypic variation in FHB measurements. CONCLUSIONS Immediate Sumai 3 derivatives carry a number of resistance improving minor effect alleles, other than Fhb1, Fhb2, Fhb5. Results verified that marker-assisted selection is possible for the introgression of exotic FHB resistance genes, however, the genetic background of the recipient line and epistatic interactions can have a strong influence on expression and penetrance of any given gene.
Collapse
|
8
|
mtProtEvol: the resource presenting molecular evolution analysis of proteins involved in the function of Vertebrate mitochondria. BMC Evol Biol 2019; 19:47. [PMID: 30813887 PMCID: PMC6391778 DOI: 10.1186/s12862-019-1371-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Heterotachy is the variation in the evolutionary rate of aligned sites in different parts of the phylogenetic tree. It occurs mainly due to epistatic interactions among the substitutions, which are highly complex and make it difficult to study protein evolution. The vast majority of computational evolutionary approaches for studying these epistatic interactions or their evolutionary consequences in proteins require high computational time. However, recently, it has been shown that the evolution of residue solvent accessibility (RSA) is tightly linked with changes in protein fitness and intra-protein epistatic interactions. This provides a computationally fast alternative, based on comparison of evolutionary rates of amino acid replacements with the rates of RSA evolutionary changes in order to recognize any shifts in epistatic interaction. RESULTS Based on RSA information, data randomization and phylogenetic approaches, we constructed a software pipeline, which can be used to analyze the evolutionary consequences of intra-protein epistatic interactions with relatively low computational time. We analyzed the evolution of 512 protein families tightly linked to mitochondrial function in Vertebrates and created "mtProtEvol", the web resource with data on protein evolution. In strict agreement with lifespan and metabolic rate data, we demonstrated that different functional categories of mitochondria-related proteins subjected to selection on accelerated and decelerated RSA rates in rodents and primates. For example, accelerated RSA evolution in rodents has been shown for Krebs cycle enzymes, respiratory chain and reactive oxygen species metabolism, while in primates these functions are stress-response, translation and mtDNA integrity. Decelerated RSA evolution in rodents has been demonstrated for translational machinery and oxidative stress response components. CONCLUSIONS mtProtEvol is an interactive resource focused on evolutionary analysis of epistatic interactions in protein families involved in Vertebrata mitochondria function and available at http://bioinfodbs.kantiana.ru/mtProtEvol /. This resource and the devised software pipeline may be useful tool for researchers in area of protein evolution.
Collapse
|
9
|
Novel Complex Interactions between Mitochondrial and Nuclear DNA in Schizophrenia and Bipolar Disorder. MOLECULAR NEUROPSYCHIATRY 2019; 5:13-27. [PMID: 31019915 DOI: 10.1159/000495658] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 11/20/2018] [Indexed: 12/21/2022]
Abstract
Mitochondrial dysfunction has been associated with schizophrenia (SZ) and bipolar disorder (BD). This review examines recent publications and novel associations between mitochondrial genes and SZ and BD. Associations of nuclear-encoded mitochondrial variants with SZ were found using gene- and pathway-based approaches. Two control region mitochondrial DNA (mtDNA) SNPs, T16519C and T195C, both showed an association with SZ and BD. A review of 4 studies of A15218G located in the cytochrome B oxidase gene (CYTB, SZ = 11,311, control = 35,735) shows a moderate association with SZ (p = 2.15E-03). Another mtDNA allele A12308G was nominally associated with psychosis in BD type I subjects and SZ. The first published study testing the epistatic interaction between nuclear-encoded and mitochondria-encoded genes demonstrated evidence for potential interactions between mtDNA and the nuclear genome for BD. A similar analysis for the risk of SZ revealed significant joint effects (34 nuclear-mitochondria SNP pairs with joint effect p ≤ 5E-07) and significant enrichment of projection neurons. The mitochondria-encoded gene CYTB was found in both the epistatic interactions for SZ and BD and the single SNP association of SZ. Future efforts considering population stratification and polygenic risk scores will test the role of mitochondrial variants in psychiatric disorders.
Collapse
|
10
|
HC-HDSD: A method of hypergraph construction and high-density subgraph detection for inferring high-order epistatic interactions. Comput Biol Chem 2018; 78:440-447. [PMID: 30595466 DOI: 10.1016/j.compbiolchem.2018.11.031] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 11/26/2018] [Indexed: 01/08/2023]
Abstract
Detecting epistatic interactions, or nonlinear interactive effects of Single Nucleotide Polymorphisms (SNPs), has gained increasing attention in explaining the "missing heritability" of complex diseases. Though much work has been done in mapping SNPs underlying diseases, most of them constrain to 2-order epistatic interactions. In this paper, a method of hypergraph construction and high-density subgraph detection, named HC-HDSD, is proposed for detecting high-order epistatic interactions. The hypergraph is constructed by low-order epistatic interactions that identified using the normalized co-information measure and the exhaustive search. The hypergraph consists of two types of vertices: real ones representing main effects of SNPs and virtual ones denoting interactive effects of epistatic interactions. Then, both maximal clique centrality algorithm and near-clique mining algorithm are employed to detect high-density subgraphs from the constructed hypergraph. These high-density subgraphs are inferred as high-order epistatic interactions in the HC-HDSD. Experiments are performed on several simulation data sets, results of which show that HC-HDSD is promising in inferring high-order epistatic interactions while substantially reducing the computation cost. In addition, the application of HC-HDSD on a real Age-related Macular Degeneration (AMD) data set provides several new clues for the exploration of causative factors of AMD.
Collapse
|
11
|
ENHO, RXRA, and LXRA polymorphisms and dyslipidaemia, related comorbidities and survival in haemodialysis patients. BMC MEDICAL GENETICS 2018; 19:194. [PMID: 30413149 PMCID: PMC6234788 DOI: 10.1186/s12881-018-0708-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2018] [Accepted: 10/23/2018] [Indexed: 12/26/2022]
Abstract
BACKGROUND The energy homeostasis-associated gene (ENHO), retinoid X receptor alpha gene (RXRA), and liver X receptor alpha gene (LXRA) are involved in adipogenic/lipogenic regulation. We investigated whether single-nucleotide polymorphisms in these genes (ENHO rs2281997, rs72735260; RXRA rs749759, rs10776909, rs10881578; LXRA rs2279238, rs7120118, rs11039155) are associated with dyslipidaemia, related comorbidities and survival of haemodialysis (HD) patients also tested for T-helper (Th) cell interleukin genes (IL). METHODS The study was carried out in 873 HD patients. Dyslipidaemia was diagnosed by the recommendations of the Kidney Disease Outcomes Quality Initiative (K/DOQI) guidelines (2003); atherogenic dyslipidaemia was referred to if the TG/HDL cholesterol ratio was equal to or higher than 3.8. Genotyping of ENHO SNPs, LXRA SNPs, and IL12A rs568408 was carried out using HRM analysis. RXRA SNPs, IL12B rs3212227, and IL18 rs360719 were genotyped using PCR-RFLP analysis. The circulating adropin concentration was determined in 126 patients by enzyme-linked immunosorbent assay. Survival probability was analysed using the Kaplan-Meier method in 440 patients followed through 7.5 years. RESULTS Dyslipidaemia by K/DOQI was diagnosed in 459 patients (91% revealed hyper-LDL- cholesterolaemia), atherogenic dyslipidaemia was diagnosed in 454 patients, and 231 patients were free of dyslipidaemia by both criteria. The variant allele (T) of ENHO rs2281997 was associated with the hyper-LDL cholesterolaemic pattern of dyslipidaemia by K/DOQI. The frequency of atherogenic dyslipidaemia was lower in T-allele bearers than in CC-genotype patients. The rs2281997 T allele was associated with lower cardiovascular mortality in HD patients showing atherogenic dyslipidaemia. ENHO, RXRA, and LXRA showed epistatic interactions in dyslipidaemia. Circulating adropin was lower in atherogenic dyslipidaemia than in non-atherogenic conditions. RXRA rs10776909 was associated with myocardial infarction. Bearers of LXRA rs2279238, rs7120118 or rs11039155 minor alleles showed higher mortality. ENHO SNP positions fell within the same DNase 1 hypersensitivity site expressed in the Th1 cell line. Epistatic interactions occurred between rs2281997 and Th1 IL SNPs (rs360719, rs568408). CONCLUSIONS Atherogenic dyslipidaemia occurs in HD patients in whom ENHO encodes less adropin. ENHO, RXRA, and LXRA SNPs, separately or jointly, are associated with dyslipidaemia, myocardial infarction, and survival in HD patients. Differences in the availability of transcription binding sites may contribute to these associations.
Collapse
|
12
|
Genetic Variations of Oxidative Stress Related Genes ALOX5, ALOX5AP and MPO Modulate Ischemic Stroke Susceptibility Through Main Effects and Epistatic Interactions in a Chinese Population. Cell Physiol Biochem 2017; 43:1588-1602. [PMID: 29041000 DOI: 10.1159/000482023] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 09/15/2017] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND/AIMS To investigate the roles of the oxidative stress related-genes ALOX5, ALOX5AP and MPO in ischemic stroke susceptibility in the Han Chinese population. METHODS A total of 351 ischemic stroke patients and 417 controls were recruited. The ALOX5 rs10900213, ALOX5AP rs4293222 and MPO rs2107545 gene polymorphisms were genotyped. RESULTS We identified that rs2107545 of MPO gene was significantly associated with ischemic stroke susceptibility after adjusting for covariates. Furthermore, we also considered the likely complexity of oxidative stress and inflammatory process in stroke by assessing the combined effects of multiple genes. Generalized multifactor dimensionality reduction (GMDR) analysis revealed that the combination of ALOX5 rs10900213, ALOX5AP rs4293222 and MPO rs2107545 was significantly associated with increased risk of ischemic stroke (P=0.0040, OR (95% CI) =1.991 (1.241 to 3.195)). Additionally, the MPO rs2107545 genotype was significantly associated with clinical outcomes at 6 months after discharge from the hospital. CONCLUSION Our study revealed that epistatic interaction among the ALOX5, ALOX5AP and MPO genes played a significant role in vulnerability to ischemic stroke. Furthermore, these results also suggest that the rs2107545 of MPO gene can be used as a biomarker for the susceptibility and prognosis of ischemic stroke patients.
Collapse
|
13
|
epiACO - a method for identifying epistasis based on ant Colony optimization algorithm. BioData Min 2017; 10:23. [PMID: 28694848 PMCID: PMC5500974 DOI: 10.1186/s13040-017-0143-7] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2016] [Accepted: 06/29/2017] [Indexed: 11/23/2022] Open
Abstract
Background Identifying epistasis or epistatic interactions, which refer to nonlinear interaction effects of single nucleotide polymorphisms (SNPs), is essential to understand disease susceptibility and to detect genetic architectures underlying complex diseases. Though many works have been done for identifying epistatic interactions, due to their methodological and computational challenges, the algorithmic development is still ongoing. Results In this study, a method epiACO is proposed to identify epistatic interactions, which based on ant colony optimization algorithm. Highlights of epiACO are the introduced fitness function Svalue, path selection strategies, and a memory based strategy. The Svalue leverages the advantages of both mutual information and Bayesian network to effectively and efficiently measure associations between SNP combinations and the phenotype. Two path selection strategies, i.e., probabilistic path selection strategy and stochastic path selection strategy, are provided to adaptively guide ant behaviors of exploration and exploitation. The memory based strategy is designed to retain candidate solutions found in the previous iterations, and compare them to solutions of the current iteration to generate new candidate solutions, yielding a more accurate way for identifying epistasis. Conclusions Experiments of epiACO and its comparison with other recent methods epiMODE, TEAM, BOOST, SNPRuler, AntEpiSeeker, AntMiner, MACOED, and IACO are performed on both simulation data sets and a real data set of age-related macular degeneration. Results show that epiACO is promising in identifying epistasis and might be an alternative to existing methods.
Collapse
|
14
|
CINOEDV: a co-information based method for detecting and visualizing n-order epistatic interactions. BMC Bioinformatics 2016; 17:214. [PMID: 27184783 PMCID: PMC4869388 DOI: 10.1186/s12859-016-1076-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 05/07/2016] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Detecting and visualizing nonlinear interaction effects of single nucleotide polymorphisms (SNPs) or epistatic interactions are important topics in bioinformatics since they play an important role in unraveling the mystery of "missing heritability". However, related studies are almost limited to pairwise epistatic interactions due to their methodological and computational challenges. RESULTS We develop CINOEDV (Co-Information based N-Order Epistasis Detector and Visualizer) for the detection and visualization of epistatic interactions of their orders from 1 to n (n ≥ 2). CINOEDV is composed of two stages, namely, detecting stage and visualizing stage. In detecting stage, co-information based measures are employed to quantify association effects of n-order SNP combinations to the phenotype, and two types of search strategies are introduced to identify n-order epistatic interactions: an exhaustive search and a particle swarm optimization based search. In visualizing stage, all detected n-order epistatic interactions are used to construct a hypergraph, where a real vertex represents the main effect of a SNP and a virtual vertex denotes the interaction effect of an n-order epistatic interaction. By deeply analyzing the constructed hypergraph, some hidden clues for better understanding the underlying genetic architecture of complex diseases could be revealed. CONCLUSIONS Experiments of CINOEDV and its comparison with existing state-of-the-art methods are performed on both simulation data sets and a real data set of age-related macular degeneration. Results demonstrate that CINOEDV is promising in detecting and visualizing n-order epistatic interactions. CINOEDV is implemented in R and is freely available from R CRAN: http://cran.r-project.org and https://sourceforge.net/projects/cinoedv/files/ .
Collapse
|
15
|
Exploring mitochondrial system properties of neurodegenerative diseases through interactome mapping. J Proteomics 2013; 100:8-24. [PMID: 24262152 DOI: 10.1016/j.jprot.2013.11.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2013] [Revised: 10/08/2013] [Accepted: 11/06/2013] [Indexed: 12/20/2022]
Abstract
UNLABELLED Mitochondria are double membraned, dynamic organelles that are required for a large number of cellular processes, and defects in their function have emerged as causative factors for a growing number of human disorders and are highly associated with cancer, metabolic, and neurodegenerative (ND) diseases. Biochemical and genetic investigations have uncovered small numbers of candidate mitochondrial proteins (MPs) involved in ND disease, but given the diversity of processes affected by MP function and the difficulty of detecting interactions involving these proteins, many more likely remain unknown. However, high-throughput proteomic and genomic approaches developed in genetically tractable model prokaryotes and lower eukaryotes have proven to be effective tools for querying the physical (protein-protein) and functional (gene-gene) relationships between diverse types of proteins, including cytosolic and membrane proteins. In this review, we highlight how experimental and computational approaches developed recently by our group and others can be effectively used towards elucidating the mitochondrial interactome in an unbiased and systematic manner to uncover network-based connections. We discuss how the knowledge from the resulting interaction networks can effectively contribute towards the identification of new mitochondrial disease gene candidates, and thus further clarify the role of mitochondrial biology and the complex etiologies of ND disease. BIOLOGICAL SIGNIFICANCE Biochemical and genetic investigations have uncovered small numbers of candidate mitochondrial proteins (MPs) involved in neurodegenerative (ND) diseases, but given the diversity of processes affected by MP function and the difficulty of detecting interactions involving these proteins, many more likely remain unknown. Large-scale proteomic and genomic approaches developed in model prokaryotes and lower eukaryotes have proven to be effective tools for querying the physical (protein-protein) and functional (gene-gene) relationships between diverse types of proteins. Extension of this new framework to the mitochondrial sub-system in human will likewise provide a universally informative systems-level view of the physical and functional landscape for exploring the evolutionary principles underlying mitochondrial function. In this review, we highlight how experimental and computational approaches developed recently by our group and others can be effectively used towards elucidating the mitochondrial interactome in an unbiased and systematic manner to uncover network-based connections. We anticipate that the knowledge from these resulting interaction networks can effectively contribute towards the identification of new mitochondrial disease gene candidates, and thus foster a deeper molecular understanding of mitochondrial biology as well as the etiology of mitochondrial diseases. This article is part of a Special Issue: Can Proteomics Fill the Gap Between Genomics and Phenotypes?
Collapse
|