Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles

882
(from Reference Citation Analysis)

Article PDFs (348)

Cited by > 0 (609)

Searched Name

epistasis

Results Analysis

Number	Citation Analysis
401	Ge W, Steber CM. Positive and negative regulation of seed germination by the Arabidopsis GA hormone receptors, GID1a, b, and c. PLANT DIRECT 2018;2:e00083. [PMID: 31245748 PMCID: PMC6508844 DOI: 10.1002/pld3.83] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 05/25/2018] [Revised: 07/16/2018] [Accepted: 08/23/2018] [Indexed: 05/25/2023] Abstract: Epistasis analysis of gid1 single and double mutants revealed that GID1c is a key positive regulator of seed germination, whereas the GID1b receptor can negatively regulate germination in dormant seeds and in the dark. The GID1 GA receptors were expected to positively regulate germination because the plant hormone gibberellin (GA) is required for seed germination in Arabidopsis thaliana. The three GA hormone receptors, GID1a, GID1b, and GID1c, positively regulate GA responses via GA/GID1-stimulated destruction of DELLA (Asp-Glu-Leu-Leu-Ala) repressors of GA responses. The fact that the gid1abc triple mutant, but not gid1 double mutants, fails to germinate indicates that all three GA receptors can positively regulate non-dormant seed germination in the light. It was known that the gid1abc triple mutant fails to lose dormancy through the dormancy breaking treatments of cold stratification (moist chilling of seeds) and dry after-ripening (a period of dry storage). Previous work suggested that there may be some specialization of GID1 gene function during germination because GID1b mRNA expression was more highly induced by after-ripening, whereas GID1a and GID1c mRNA levels were more highly induced by cold stratification. In light-germinated dormant seeds, the gid1b mutation can partly rescue the germination efficiency of gid1a but not of gid1c seeds. Thus, GID1b can function as an upstream negative regulator of GID1c, a positive regulator of dormant seed germination.
Further experiments showed that GID1b can negatively regulate dark germination. Wild-type Arabidopsis seeds do not germinate well in the dark. The gid1b and gid1ab double mutants germinated much more efficiently than wild type, gid1c, or gid1ac mutants in the dark. The observation that the gid1ab double mutant also shows increased dark germination suggests that GID1b, and to some extent GID1a, can act as upstream negative regulators of GID1c. Since the gid1abc triple mutant failed to germinate in the dark, it appears that GID1c is a key downstream positive regulator of dark germination. This genetic analysis indicates that the three GID1 receptors have partially specialized functions in GA signaling. Key Words: GID1; epistasis; germination; gibberellin; seed dormancy.
402	Cvijović I, Nguyen Ba AN, Desai MM. Experimental Studies of Evolutionary Dynamics in Microbes. Trends Genet 2018;34:693-703. [PMID: 30025666 PMCID: PMC6467257 DOI: 10.1016/j.tig.2018.06.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Received: 05/03/2018] [Revised: 06/18/2018] [Accepted: 06/22/2018] [Indexed: 11/16/2022] Abstract: Evolutionary dynamics in laboratory microbial evolution experiments can be surprisingly complex. In the past two decades, observations of these dynamics have challenged simple models of adaptation and have shown that clonal interference, hitchhiking, ecological diversification, and contingency are widespread. In recent years, advances in high-throughput strain maintenance and phenotypic assays, the dramatically reduced cost of genome sequencing, and emerging methods for lineage barcoding have made it possible to observe evolutionary dynamics at unprecedented resolution. These new methods can now begin to provide detailed measurements of key aspects of fitness landscapes and of evolutionary outcomes across a range of systems. These measurements can highlight challenges to existing theoretical models and guide new theoretical work towards the complications that are most widely important. Key Words: clonal interference; contingency; ecological diversification; epistasis; pleiotropy. MESH Headings: Adaptation, Physiological/genetics; Bacteria/genetics; Bacteria/growth & development; Directed Molecular Evolution; Genetic Fitness/genetics; High-Throughput Nucleotide Sequencing. Grants: R01 GM104239 NIGMS NIH HHS.
403	Posfai A, Zhou J, Plotkin JB, Kinney JB, McCandlish DM. Selection for Protein Stability Enriches for Epistatic Interactions. Genes (Basel) 2018;9:E423. [PMID: 30134605 PMCID: PMC6162820 DOI: 10.3390/genes9090423] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 06/04/2018] [Revised: 07/30/2018] [Accepted: 08/14/2018] [Indexed: 12/15/2022] Abstract: A now classical argument for the marginal thermodynamic stability of proteins explains the distribution of observed protein stabilities as a consequence of an entropic pull in protein sequence space. In particular, most sequences that are sufficiently stable to fold will have stabilities near the folding threshold. Here, we extend this argument to consider its predictions for epistatic interactions for the effects of mutations on the free energy of folding. Although there is abundant evidence to indicate that the effects of mutations on the free energy of folding are nearly additive and conserved over evolutionary time, we show that these observations are compatible with the hypothesis that a non-additive contribution to the folding free energy is essential for observed proteins to maintain their native structure. In particular, through both simulations and analytical results, we show that even very small departures from additivity are sufficient to drive this effect. Key Words: epistasis; molecular evolution; purifying selection; thermodynamic stability. Grants: P30 CA045508 NCI NIH HHS.
404	Wittenburg D, Liebscher V. An approximate Bayesian significance test for genomic evaluations. Biom J 2018;60:1096-1109. [PMID: 30101421 PMCID: PMC6282823 DOI: 10.1002/bimj.201700219] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 11/01/2017] [Revised: 03/06/2018] [Accepted: 04/10/2018] [Indexed: 11/12/2022] Abstract: Genomic information can be used to study the genetic architecture of some trait. Not only the size of the genetic effect captured by molecular markers and their position on the genome but also the mode of inheritance, which might be additive or dominant, and the presence of interactions are interesting parameters. When searching for interacting loci, estimating the effect size and determining the significant marker pairs increases the computational burden in terms of speed and memory allocation dramatically. This study revisits a rapid Bayesian approach (fastbayes). As a novel contribution, a measure of evidence is derived to select markers with effect significantly different from zero. It is based on the credibility of the highest posterior density interval next to zero in a marginalized manner. This methodology is applied to simulated data resembling a dairy cattle population in order to verify the sensitivity of testing for a given range of type-I error levels. A real data application complements this study. Sensitivity and specificity of fastbayes were similar to a variational Bayesian method, and a further reduction of computing time could be achieved. More than 50% of the simulated causative variants were identified. The most complex model containing different kinds of genetic effects and their pairwise interactions yielded the best outcome over a range of type-I error levels.
The validation study showed that fastbayes is a dual-purpose tool for genomic inferences: it is applicable to predict future outcome of not-yet phenotyped individuals with high precision as well as to estimate and test single-marker effects. Furthermore, it allows the estimation of billions of interaction effects. Key Words: SNP; conditional expectation; dominance; epistasis; genetic architecture.
405	Pervasive Modulation of Obesity Risk by the Environment and Genomic Background. Genes (Basel) 2018;9:genes9080411. [PMID: 30110940 PMCID: PMC6115725 DOI: 10.3390/genes9080411] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 06/01/2018] [Revised: 07/30/2018] [Accepted: 07/30/2018] [Indexed: 12/22/2022] Abstract: The prevalence of the so-called diseases of affluence, such as type 2 diabetes or hypertension, has increased dramatically in the last two generations. Although genome-wide association studies (GWAS) have discovered hundreds of genes involved in disease etiology, the sudden increase in disease incidence suggests a major role for environmental risk factors. Obesity constitutes a case example of a modern trait shaped by contemporary environment, although with considerable debates about the extent to which gene-by-environment (G×E) interactions accentuate obesity risk in individuals following obesogenic lifestyles. Although interaction effects have been robustly confirmed at the FTO locus, accumulating evidence at the genome-wide level implicates a role for polygenic risk-by-environment interactions. Through a variety of analyses using the UK Biobank, we confirm that the genomic background plays a major role in shaping the expressivity of alleles that increase body mass index (BMI). Key Words: UK Biobank; allele expressivity; body mass index; diseases of affluence; epistasis; gene-by-environment interactions; genome-wide association studies (GWAS); polygenic scores (PGS).
406	Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants. Proc Natl Acad Sci U S A 2018;115:E8276-E8285. [PMID: 30104379 PMCID: PMC6126756 DOI: 10.1073/pnas.1806133115] [Citation(s) in RCA: 113] [Impact Index Per Article: 18.8] [Indexed: 01/31/2023] Abstract: A key goal in the study of influenza virus evolution is to forecast which viral strains will persist and which ones will die out. Here we experimentally measure the effects of all amino acid mutations to the hemagglutinin protein from a human H3N2 influenza strain on viral growth in cell culture. We show that these measurements have utility for distinguishing among viral strains that do and do not succeed in nature. Overall, our work suggests that new high-throughput experimental approaches may be useful for understanding virus evolution in nature. Human influenza virus rapidly accumulates mutations in its major surface protein hemagglutinin (HA). The evolutionary success of influenza virus lineages depends on how these mutations affect HA’s functionality and antigenicity. Here we experimentally measure the effects on viral growth in cell culture of all single amino acid mutations to the HA from a recent human H3N2 influenza virus strain. We show that mutations that are measured to be more favorable for viral growth are enriched in evolutionarily successful H3N2 viral lineages relative to mutations that are measured to be less favorable for viral growth. Therefore, despite the well-known caveats about cell-culture measurements of viral fitness, such measurements can still be informative for understanding evolution in nature.
We also compare our measurements for H3 HA to similar data previously generated for a distantly related H1 HA and find substantial differences in which amino acids are preferred at many sites. For instance, the H3 HA has less disparity in mutational tolerance between the head and stalk domains than the H1 HA. Overall, our work suggests that experimental measurements of mutational effects can be leveraged to help understand the evolutionary fates of viral lineages in nature, but only when the measurements are made on a viral strain similar to the ones being studied in nature. Key Words: deep mutational scanning; epistasis; hemagglutinin; influenza virus; mutational shifts.
407	Azithromycin Resistance through Interspecific Acquisition of an Epistasis-Dependent Efflux Pump Component and Transcriptional Regulator in Neisseria gonorrhoeae. mBio 2018;9:mBio.01419-18. [PMID: 30087172 PMCID: PMC6083905 DOI: 10.1128/mbio.01419-18] [Citation(s) in RCA: 113] [Impact Index Per Article: 18.8] [Indexed: 12/27/2022] Abstract: Mosaic interspecifically acquired alleles of the multiple transferable resistance (mtr) efflux pump operon correlate with increased resistance to azithromycin in Neisseria gonorrhoeae in epidemiological studies. However, whether and how these alleles cause resistance is unclear. Here, we use population genomics, transformations, and transcriptional analyses to dissect the relationship between variant mtr alleles and azithromycin resistance. We find that the locus encompassing the mtrR transcriptional repressor and the mtrCDE pump is a hot spot of interspecific recombination introducing alleles from Neisseria meningitidis and Neisseria lactamica into N. gonorrhoeae, with multiple rare haplotypes in linkage disequilibrium at mtrD and the mtr promoter region. Transformations demonstrate that resistance to azithromycin, as well as to other antimicrobial compounds such as polymyxin B and crystal violet, is mediated through epistasis between these two loci and that the full-length mosaic mtrD allele is required. Gene expression profiling reveals the mechanism of resistance in mosaics couples novel mtrD alleles with promoter mutations that increase expression of the pump.
Overall, our results demonstrate that epistatic interactions at mtr gained from multiple neisserial species have contributed to increased gonococcal resistance to diverse antimicrobial agents. IMPORTANCE: Neisseria gonorrhoeae is the sexually transmitted bacterial pathogen responsible for more than 100 million cases of gonorrhea worldwide each year. The incidence of resistance to the macrolide azithromycin has increased in the past decade; however, a large proportion of the genetic basis of resistance remains unexplained. This study is the first to conclusively demonstrate the acquisition of macrolide resistance through mtr alleles from other Neisseria species, demonstrating that commensal Neisseria bacteria are a reservoir for antibiotic resistance to macrolides, extending the role of interspecies mosaicism in resistance beyond what has been previously described for cephalosporins. Ultimately, our results emphasize that future fine-mapping of genome-wide interspecies mosaicism may be valuable in understanding the pathways to antimicrobial resistance. Our results also have implications for diagnostics and public health surveillance and control, as they can be used to inform the development of sequence-based tools to monitor and control the spread of antibiotic-resistant gonorrhea. Key Words: Neisseria gonorrhoeae; antibiotic resistance; efflux pump; epistasis; gonorrhea; macrolide.
408	Lyons DM, Lauring AS. Mutation and Epistasis in Influenza Virus Evolution. Viruses 2018;10:E407. [PMID: 30081492 PMCID: PMC6115771 DOI: 10.3390/v10080407] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Received: 07/17/2018] [Revised: 07/30/2018] [Accepted: 07/30/2018] [Indexed: 12/25/2022] Abstract: Influenza remains a persistent public health challenge, because the rapid evolution of influenza viruses has led to marginal vaccine efficacy, antiviral resistance, and the annual emergence of novel strains. This evolvability is driven, in part, by the virus's capacity to generate diversity through mutation and reassortment. Because many new traits require multiple mutations and mutations are frequently combined by reassortment, epistatic interactions between mutations play an important role in influenza virus evolution. While mutation and epistasis are fundamental to the adaptability of influenza viruses, they also constrain the evolutionary process in important ways. Here, we review recent work on mutational effects and epistasis in influenza viruses. Key Words: epistasis; evolution; influenza; mutation; reassortment. MESH Headings: Animals; Epistasis, Genetic; Evolution, Molecular; Genetic Fitness; Genome, Viral; Hemagglutinin Glycoproteins, Influenza Virus/genetics; Humans; Influenza A Virus, H1N1 Subtype/genetics; Influenza A Virus, H3N2 Subtype/genetics; Influenza, Human/virology; Mutation; Orthomyxoviridae/genetics; Reassortant Viruses/genetics. Grants: F31 AI140618 NIAID NIH HHS; R01 AI118886 NIAID NIH HHS; T32 GM007863 NIGMS NIH HHS.
409	On the Relationship Between High-Order Linkage Disequilibrium and Epistasis. G3-GENES GENOMES GENETICS 2018;8:2817-2824. [PMID: 29945968 PMCID: PMC6071592 DOI: 10.1534/g3.118.200513] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Indexed: 12/17/2022] Abstract: A plausible explanation for statistical epistasis revealed in genome wide association analyses is the presence of high order linkage disequilibrium (LD) between the genotyped markers tested for interactions and unobserved functional polymorphisms. Based on findings in experimental data, it has been suggested that high order LD might be a common explanation for statistical epistasis inferred between local polymorphisms in the same genomic region. Here, we empirically evaluate how prevalent high order LD is between local, as well as distal, polymorphisms in the genome. This could provide insights into whether we should account for this when interpreting results from genome wide scans for statistical epistasis. An extensive and strong genome wide high order LD was revealed between pairs of markers on the high density 250k SNP-chip and individual markers revealed by whole genome sequencing in the Arabidopsis thaliana 1001-genomes collection. The high order LD was found to be more prevalent in smaller populations, but present also in samples including several hundred individuals. An empirical example illustrates that high order LD might be an even greater challenge in cases when the genetic architecture is more complex than the common assumption of bi-allelic loci. The example shows how significant statistical epistasis is detected for a pair of markers in high order LD with a complex multi allelic locus.
Overall, our study illustrates the importance of also considering explanations other than functional genetic interactions when genome wide statistical epistasis is detected, in particular when the results are obtained in small populations of inbred individuals. Key Words: Arabidopsis thaliana; epistasis; high order linkage disequilibrium; leaf molybdenum.
410	Horlbeck MA, Xu A, Wang M, Bennett NK, Park CY, Bogdanoff D, Adamson B, Chow ED, Kampmann M, Peterson TR, Nakamura K, Fischbach MA, Weissman JS, Gilbert LA. Mapping the Genetic Landscape of Human Cells. Cell 2018;174:953-967.e22. [PMID: 30033366 DOI: 10.1016/j.cell.2018.06.010] [Citation(s) in RCA: 165] [Impact Index Per Article: 27.5] [Received: 09/07/2017] [Revised: 03/08/2018] [Accepted: 06/05/2018] [Indexed: 12/31/2022] Abstract: Seminal yeast studies have established the value of comprehensively mapping genetic interactions (GIs) for inferring gene function. Efforts in human cells using focused gene sets underscore the utility of this approach, but the feasibility of generating large-scale, diverse human GI maps remains unresolved. We developed a CRISPR interference platform for large-scale quantitative mapping of human GIs. We systematically perturbed 222,784 gene pairs in two cancer cell lines. The resultant maps cluster functionally related genes, assigning function to poorly characterized genes, including TMEM261, a new electron transport chain component. Individual GIs pinpoint unexpected relationships between pathways, exemplified by a specific cholesterol biosynthesis intermediate whose accumulation induces deoxynucleotide depletion, causing replicative DNA damage and a synthetic-lethal interaction with the ATR/9-1-1 DNA repair pathway. Our map provides a broad resource, establishes GI maps as a high-resolution tool for dissecting gene function, and serves as a blueprint for mapping the genetic landscape of human cells. Key Words: CRISPR; CRISPRi; epistasis; functional genomics; genetic interactions.
411	Martínez H, Barrachina S, Castillo M, Quintana-Ortí ES, Rambla de Argila J, Farré X, Navarro A. FaST-LMM for Two-Way Epistasis Tests on High-Performance Clusters. J Comput Biol 2018;25:862-870. [PMID: 30020811 DOI: 10.1089/cmb.2018.0087] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Indexed: 01/13/2023] Abstract: We introduce a version of the epistasis test in FaST-LMM for clusters of multithreaded processors. This new software maintains the sensitivity of the original FaST-LMM while delivering acceleration that is close to linear on 12-16 nodes of two recent platforms, with respect to the improved implementation of FaST-LMM presented in an earlier work. This efficiency is attained through several enhancements on the original single-node version of FaST-LMM, together with the development of a message passing interface (MPI)-based version that ensures a balanced distribution of the workload as well as a multi-graphics processing unit (GPU) module that can exploit the presence of multiple GPUs per node. Key Words: FaST-LMM; GPUs; clusters of computers; epistasis; genome-wide association studies (GWAS); multicore processors.
412	Knops E, Sierra S, Kalaghatgi P, Heger E, Kaiser R, Kalinina OV. Epistatic Interactions in NS5A of Hepatitis C Virus Suggest Drug Resistance Mechanisms. Genes (Basel) 2018;9:E343. [PMID: 29986475 PMCID: PMC6071292 DOI: 10.3390/genes9070343] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 03/27/2018] [Accepted: 06/29/2018] [Indexed: 02/06/2023] Abstract: Hepatitis C virus (HCV) causes a major health burden and can be effectively treated by direct-acting antivirals (DAAs). The non-structural protein 5A (NS5A), which plays a role in the viral genome replication, is one of the DAAs’ targets. Resistance-associated viruses (RAVs) harbouring NS5A resistance-associated mutations (RAMs) have been described at baseline and after therapy failure. A mutation from glutamine to arginine at position 30 (Q30R) is a characteristic RAM for the HCV sub/genotype (GT) 1a, but arginine corresponds to the wild type in the GT-1b; still, GT-1b strains are susceptible to NS5A-inhibitors. In this study, we show that GT-1b strains with R30Q often display other specific NS5A substitutions, particularly in positions 24 and 34. We demonstrate that in GT-1b secondary substitutions usually happen after initial R30Q development in the phylogeny, and that the chemical properties of the corresponding amino acids serve to restore the positive charge in this region, acting as compensatory mutations. These findings may have implications for RAVs treatment. Key Words: NS5A; drug resistance; epistasis; hepatitis C virus; protein structure.
413	Monir MM, Zhu J. Dominance and Epistasis Interactions Revealed as Important Variants for Leaf Traits of Maize NAM Population. FRONTIERS IN PLANT SCIENCE 2018;9:627. [PMID: 29967625 PMCID: PMC6015889 DOI: 10.3389/fpls.2018.00627] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Received: 02/21/2018] [Accepted: 04/20/2018] [Indexed: 05/26/2023] Abstract: Leaf orientation traits of maize (Zea mays) are complex traits controlled by multiple loci with additive, dominance, epistasis, and environmental interaction effects. In this study, an attempt was made to identify the causal loci and to estimate the additive, non-additive, and environment-specific genetic effects underpinning leaf traits (leaf length, leaf width, and upper leaf angle) of the maize NAM population. Leaf traits were analyzed by using a full genetic model and an additive model of multiple loci. Analysis with the full genetic model identified 38∼47 highly significant loci (-log₁₀P_EW > 5), while the estimated total heritability was 64.32∼79.06%, with large contributions due to dominance and dominance-related epistasis effects (16.00∼56.91%). Analysis with the additive model obtained smaller total heritability (h_T² ≙ 18.68∼29.56%) and detected fewer loci (30∼36) as compared to the full genetic model. There were 12 pleiotropic loci identified for the three leaf traits: eight loci for leaf length and leaf width, and four loci for leaf length and leaf angle. Optimal genotype combinations of superior line (SL) and superior hybrid (SH) were predicted for each of the traits under four different environments based on estimated genotypic effects to facilitate marker-assisted selection for the leaf traits. Key Words: GWAS; dominance; epistasis; genomic prediction; maize leaf traits.
414	Dato S, Soerensen M, De Rango F, Rose G, Christensen K, Christiansen L, Passarino G. The genetic component of human longevity: New insights from the analysis of pathway-based SNP-SNP interactions. Aging Cell 2018;17:e12755. [PMID: 29577582 PMCID: PMC5946073 DOI: 10.1111/acel.12755] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Accepted: 02/13/2018] [Indexed: 01/24/2023] Abstract: In human longevity studies, single nucleotide polymorphism (SNP) analysis identified a large number of genetic variants with small effects, yet not easily replicable in different populations. New insights may come from the combined analysis of different SNPs, especially when grouped by metabolic pathway. We applied this approach to study the joint effect on longevity of SNPs belonging to three candidate pathways, the insulin/insulin‐like growth factor signalling (IIS), DNA repair and pro/antioxidant. We analysed data from 1,058 tagging SNPs in 140 genes, collected in 1,825 subjects (1,089 unrelated nonagenarians from the Danish 1905 Birth Cohort Study and 736 Danish controls aged 46–55 years) for evaluating synergic interactions by SNPsyn. Synergies were further tested by the multidimensional reduction (MDR) approach, both intra‐ and interpathways. The best combinations (FDR<0.0001) were those encompassing IGF1R‐rs12437963 and PTPN1‐rs6067484, TP53‐rs2078486 and ERCC2‐rs50871, TXNRD1‐rs17202060 and TP53‐rs2078486, the latter two supporting a central role of TP53 in mediating the concerted activation of the DNA repair and pro‐antioxidant pathways in human longevity. Results were consistently replicated with both approaches, and a significant effect on longevity was also found for the GHSR gene, which interacts with partners belonging to both IIS and DNA repair pathways (PAPPA, PTPN1, PARK7, MRE11A).
The combination GHSR‐MRE11A, positively associated with longevity by MDR, was further found to influence longitudinal survival in nonagenarian females (p = .026). The results presented here highlight the validity of SNP‐SNP interaction analyses for investigating the genetics of human longevity, confirming previously identified markers but also pointing to novel genes as central nodes of additional networks involved in human longevity. Key Words: SNP; aging; epistasis; genetic component of human longevity; pathway-based analysis; synergic interaction.
415	Zhang Y, He Q, Zhang R, Zhang H, Zhong W, Xia H. Large-scale replication study identified multiple independent SNPs in RET synergistically associated with Hirschsprung disease in Southern Chinese population. Aging (Albany NY) 2018;9:1996-2009. [PMID: 28930629 PMCID: PMC5636671 DOI: 10.18632/aging.101294] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Received: 07/22/2017] [Accepted: 09/17/2017] [Indexed: 01/04/2023] Abstract: Hirschsprung disease (HSCR) is an intestinal disorder with strong genetic components. RET is considered the strongest contributor. Multiple single nucleotide polymorphisms (SNPs) have been shown to be associated with HSCR in different populations. However, whether the associations of the reported SNPs derive from one causal variant or from congregations of multiple variants is still not clear. In this study, we successfully genotyped 16 SNPs in RET in the largest case-control study to date, totaling 1470 HSCR and 1473 control subjects in a South Chinese population. Multiple independent contributors were identified through pairwise and stepwise logistic regression. The intragenic synergistic effects among these SNPs were further explored and cross-validated by logistic regression and multifactor dimensionality reduction (MDR). Notably, in further subclinical manifestation analysis, the six potential independent contributors in RET were more essential for patients with short-segment aganglionosis (S-HSCR). Although functional evaluations are required, our comprehensive analysis of the RET gene, integrating detailed disease subphenotypes, might facilitate improved understanding of the genetic etiology of HSCR. Key Words: Hirschsprung disease; association; epistasis; subclinical stratification.
416	Puranen S, Pesonen M, Pensar J, Xu YY, Lees JA, Bentley SD, Croucher NJ, Corander J. SuperDCA for genome-wide epistasis analysis. Microb Genom 2018;4. [PMID: 29813016 PMCID: PMC6096938 DOI: 10.1099/mgen.0.000184] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Indexed: 11/18/2022] Abstract: The potential for genome-wide modelling of epistasis has recently surfaced given the possibility of sequencing densely sampled populations and the emerging families of statistical interaction models. Direct coupling analysis (DCA) has previously been shown to yield valuable predictions for single protein structures, and has recently been extended to genome-wide analysis of bacteria, identifying novel interactions in the co-evolution between resistance, virulence and core genome elements. However, earlier computational DCA methods have not been scalable to enable model fitting simultaneously to 10⁴-10⁵ polymorphisms, representing the amount of core genomic variation observed in analyses of many bacterial species. Here, we introduce a novel inference method (SuperDCA) that employs a new scoring principle, efficient parallelization, optimization and filtering on phylogenetic information to achieve scalability for up to 10⁵ polymorphisms. Using two large population samples of Streptococcus pneumoniae, we demonstrate the ability of SuperDCA to make additional significant biological findings about this major human pathogen. We also show that our method can uncover signals of selection that are not detectable by genome-wide association analysis, even though our analysis does not require phenotypic measurements. SuperDCA, thus, holds considerable potential in building understanding about numerous organisms at a systems biological level.
Key Words: epistasis; linkage disequilibrium; population genomics
417	Dutta S, Eckmann JP, Libchaber A, Tlusty T. Green function of correlated genes in a minimal mechanical model of protein evolution. Proc Natl Acad Sci U S A 2018;115:E4559-E4568. [PMID: 29712824 PMCID: PMC5960285 DOI: 10.1073/pnas.1716215115] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Indexed: 11/18/2022] Open Abstract The function of proteins arises from cooperative interactions and rearrangements of their amino acids, which exhibit large-scale dynamical modes. Long-range correlations have also been revealed in protein sequences, and this has motivated the search for physical links between the observed genetic and dynamic cooperativity. We outline here a simplified theory of protein, which relates sequence correlations to physical interactions and to the emergence of mechanical function. Our protein is modeled as a strongly coupled amino acid network with interactions and motions that are captured by the mechanical propagator, the Green function. The propagator describes how the gene determines the connectivity of the amino acids and thereby, the transmission of forces. Mutations introduce localized perturbations to the propagator that scatter the force field. The emergence of function is manifested by a topological transition when a band of such perturbations divides the protein into subdomains. We find that epistasis (the interaction among mutations in the gene) is related to the nonlinearity of the Green function, which can be interpreted as a sum over multiple scattering paths. We apply this mechanical framework to simulations of protein evolution and observe long-range epistasis, which facilitates collective functional modes.
Key Words: Green function; dimensional reduction; epistasis; genotype-to-phenotype map; protein evolution
MESH Headings: Computational Biology/methods; Epistasis, Genetic; Evolution, Molecular; Humans; Mutation; Phenotype; Proteins/chemistry; Proteins/genetics; Proteins/metabolism
418	Predictable Phenotypes of Antibiotic Resistance Mutations. mBio 2018;9:mBio.00770-18. [PMID: 29764951 PMCID: PMC5954217 DOI: 10.1128/mbio.00770-18] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Indexed: 12/03/2022] Open Abstract Antibiotic-resistant bacteria represent a major threat to our ability to treat bacterial infections. Two factors that determine the evolutionary success of antibiotic resistance mutations are their impact on resistance level and the fitness cost. Recent studies suggest that resistance mutations commonly show epistatic interactions, which would complicate predictions of their stability in bacterial populations. We analyzed 13 different chromosomal resistance mutations and 10 host strains of Salmonella enterica and Escherichia coli to address two main questions. (i) Are there epistatic interactions between different chromosomal resistance mutations? (ii) How does the strain background and genetic distance influence the effect of chromosomal resistance mutations on resistance and fitness? Our results show that the effects of combined resistance mutations on resistance and fitness are largely predictable and that epistasis remains rare even when up to four mutations were combined. Furthermore, a majority of the mutations, especially target alteration mutations, demonstrate strain-independent phenotypes across different species. This study extends our understanding of epistasis among resistance mutations and shows that interactions between different resistance mutations are often predictable from the characteristics of the individual mutations. The spread of antibiotic-resistant bacteria imposes an urgent threat to public health. The ability to forecast the evolutionary success of resistant mutants would help to combat dissemination of antibiotic resistance.
Previous studies have shown that the phenotypic effects (fitness and resistance level) of resistance mutations can vary substantially depending on the genetic context in which they occur. We conducted a broad screen using many different resistance mutations and host strains to identify potential epistatic interactions between various types of resistance mutations and to determine the effect of strain background on resistance phenotypes. Combinations of several different mutations showed a large amount of phenotypic predictability, and the majority of the mutations displayed strain-independent phenotypes. However, we also identified a few outliers from these patterns, illustrating that the choice of host organism can be critically important when studying antibiotic resistance mutations. Key Words: Escherichia coli; Salmonella; Strain specificity; antibiotic resistance; epistasis; fitness
419	Mallik S, Basu S, Hait S, Kundu S. Translational regulation of ribosomal protein S15 drives characteristic patterns of protein-mRNA epistasis. Proteins 2018;86:827-832. [PMID: 29679401 DOI: 10.1002/prot.25518] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 01/31/2018] [Revised: 04/09/2018] [Accepted: 04/17/2018] [Indexed: 01/15/2023] Abstract Do coding and regulatory segments of a gene co-evolve with each other? Seeking answers to this question, here we analyze the case of Escherichia coli ribosomal protein S15, which represses its own translation by specifically binding its messenger RNA (rpsO mRNA) and stabilizing a pseudoknot structure at the upstream untranslated region, thus trapping the ribosome into an incomplete translation initiation complex. In the absence of S15, ribosomal protein S1 recognizes rpsO and promotes translation by melting this very pseudoknot. We employ a robust statistical method to detect signatures of positive epistasis between residue site pairs and find that biophysical constraints of translational regulation (S15-rpsO and S1-rpsO recognition, S15-mediated rpsO structural rearrangement, and S1-mediated melting) are strong predictors of positive epistasis. Transforming the epistatic pairs into a network, we find that signatures of two different, but interconnected regulatory cascades are imprinted in the sequence-space and can be captured in terms of two dense network modules that are sparsely connected to each other. This network topology further reflects a general principle of how functionally coupled components of biological networks are interconnected. These results depict a model case, where translational regulation drives characteristic residue-level epistasis, not only between a protein and its own mRNA but also between a protein and the mRNA of an entirely different protein.
Key Words: epistasis; promoter-protein coevolution; protein-mRNA interaction; ribosomal protein; translational regulation
420	Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers. G3-GENES GENOMES GENETICS 2018;8:1687-1699. [PMID: 29549092 PMCID: PMC5940160 DOI: 10.1534/g3.117.300548] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Indexed: 01/06/2023] Abstract Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit.
Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders: due to the tight linkage, a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles that of additive effects. Key Words: GenPred; Genomic Selection; Shared Data Resources; epistasis; genome-wide prediction; haplotype; local epistatic effect
421	Manduchi E, Williams SM, Chesi A, Johnson ME, Wells AD, Grant SFA, Moore JH. Leveraging epigenomics and contactomics data to investigate SNP pairs in GWAS. Hum Genet 2018;137:413-425. [PMID: 29797095 PMCID: PMC5996751 DOI: 10.1007/s00439-018-1893-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 02/06/2018] [Accepted: 05/20/2018] [Indexed: 12/29/2022] Abstract Although Genome Wide Association Studies (GWAS) have led to many valuable insights into the genetic bases of common diseases over the past decade, the issue of missing heritability has surfaced, as the discovered main effect genetic variants found to date do not account for much of a trait's predicted genetic component. We present a workflow, integrating epigenomics and topologically associating domain data, aimed at discovering trait-associated SNP pairs from GWAS where neither SNP achieved independent genome-wide significance. Each analyzed SNP pair consists of one SNP in a putative active enhancer and another SNP in a putative physically interacting gene promoter in a trait-relevant tissue. As a proof-of-principle case study, we used this approach to identify focused collections of SNP pairs that we analyzed in three independent Type 2 diabetes (T2D) GWAS. This approach led us to discover 35 significant SNP pairs, encompassing both novel signals and signals for which we have found orthogonal support from other sources. Nine of these pairs are consistent with eQTL results, two are consistent with our own capture C experiments, and seven involve signals supported by recent T2D literature.
Key Words: gwas; epistasis; epigenomics; contactomics; topologically associating domain; type 2 diabetes
MESH Headings: Diabetes Mellitus, Type 2/genetics; Diabetes Mellitus, Type 2/physiopathology; Epigenomics; Genome-Wide Association Study/statistics & numerical data; Genotype; Humans; Phenotype; Polymorphism, Single Nucleotide/genetics; Quantitative Trait Loci/genetics
Grants: R21 HD089824 NIH HHS; U01 DK112217 NIDDK NIH HHS; R21 HD089824 NICHD NIH HHS; P30 ES013508 NIEHS NIH HHS; R01 LM010098 NLM NIH HHS; Center for Spatial and Functional Genomics The Children's Hospital of Philadelphia; ES013508 NIH HHS; UC4 DK112217 NIDDK NIH HHS; DK112217 NIH HHS; LM010098 NIH HHS
422	Wilkins JF, Cannataro VL, Shuch B, Townsend JP. Analysis of mutation, selection, and epistasis: an informed approach to cancer clinical trials. Oncotarget 2018;9:22243-22253. [PMID: 29854275 PMCID: PMC5976461 DOI: 10.18632/oncotarget.25155] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Received: 02/19/2018] [Accepted: 04/02/2018] [Indexed: 12/30/2022] Open Abstract Currently, drug development efforts and clinical trials to test them are often prioritized by targeting genes with high frequencies of somatic variants among tumors. However, differences in oncogenic mutation rate, not necessarily the effect the variant has on tumor growth, contribute enormously to somatic variant frequency. We argue that decoupling the contributions of mutation and cancer lineage selection to the frequency of somatic variants among tumors is critical to understanding, and predicting, the therapeutic potential of different interventions. To provide an indicator of the strength of selection and therapeutic potential, the frequency at which we observe a given variant across patients must be modulated by our expectation given the mutation rate and target size. Additionally, antagonistic and synergistic epistasis among mutations also impacts the potential therapeutic benefit of targeted drug development. Quantitative approaches should be fostered that use the known genetic architectures of cancer types, decouple mutation rate, and provide rigorous guidance regarding investment in targeted drug development. By integrating evolutionary principles and detailed mechanistic knowledge into those approaches, we can maximize our ability to identify those targeted therapies most likely to yield substantial clinical benefit.
Key Words: cancer; epistasis; evolution; mutation; natural selection
Grants: T32 CA193200 NCI NIH HHS
423	Chakravorty S, Hegde M. Inferring the effect of genomic variation in the new era of genomics. Hum Mutat 2018;39:756-773. [PMID: 29633501 DOI: 10.1002/humu.23427] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Received: 11/13/2017] [Revised: 03/20/2018] [Accepted: 03/28/2018] [Indexed: 12/11/2022] Abstract Accurate and detailed understanding of the effects of variants in the coding and noncoding regions of the genome is the next big challenge in the new genomic era of personalized medicine, especially to tackle newer findings of genetic and phenotypic heterogeneity of diseases. This is necessary to resolve the gene-variant-disease relationship, the pathogenic variant spectrum of genes, pathogenic variants with variable clinical consequences, and multiloci diseases. In turn, this will facilitate patient recruitment for relevant clinical trials. In this review, we describe the trends in research at the intersection of basic and clinical genomics aiming to (a) overcome molecular diagnostic challenges and increase the clinical utility of next-generation sequencing (NGS) platforms, (b) elucidate variants associated with disease, and (c) determine overall genomic complexity including epistasis, complex inheritance patterns such as "synergistic heterozygosity," digenic/multigenic inheritance, modifier effect, and rare variant load. We describe the newly emerging field of integrated functional genomics, in vivo or in vitro large-scale functional approaches, and statistical bioinformatics algorithms that support NGS genomics data to interpret variants for timely clinical diagnostics and disease management. This, in turn, facilitates the discovery of new therapeutic or biomarker options and their roles in the future of personalized medicine.
Key Words: NGS; digenic inheritance; epistasis; functional genomics; molecular diagnostics; multigenic inheritance; newborn screening; personalized medicine; synergistic heterozygosity; variant annotation; whole genome sequencing
424	Diss G, Lehner B. The genetic landscape of a physical interaction. eLife 2018;7:32472. [PMID: 29638215 PMCID: PMC5896888 DOI: 10.7554/elife.32472] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Received: 10/02/2017] [Accepted: 03/02/2018] [Indexed: 12/26/2022] Open Abstract A key question in human genetics and evolutionary biology is how mutations in different genes combine to alter phenotypes. Efforts to systematically map genetic interactions have mostly made use of gene deletions. However, most genetic variation consists of point mutations of diverse and difficult to predict effects. Here, by developing a new sequencing-based protein interaction assay – deepPCA – we quantified the effects of >120,000 pairs of point mutations on the formation of the AP-1 transcription factor complex between the products of the FOS and JUN proto-oncogenes. Genetic interactions are abundant both in cis (within one protein) and trans (between the two molecules) and consist of two classes – interactions driven by thermodynamics that can be predicted using a three-parameter global model, and structural interactions between proximally located residues. These results reveal how physical interactions generate quantitatively predictable genetic interactions. Proteins, the molecular workhorses of the cell, are made of small units called amino acids attached together like the links of a chain. Each protein is composed of a unique combination of amino acids, which is determined by a specific sequence of DNA called a gene. A change in a gene – a mutation – can create a variation in the protein it codes for, for instance by swapping a type of amino acid for another. Different mutations in the same gene can alter a protein in different ways. Some of these changes are harmless, but others can hinder how the protein performs its role.
For example, a small change in the structure of a protein could affect how it will bind to other molecules. It is possible for people to have identical mutations in the same genes, but experience different consequences. For instance, two persons could carry the same disease-inducing mutation, but one has a severe version of the condition and the other only mild symptoms. One reason is that changes in other genes cancel out or enhance the effect of a mutation. This phenomenon is known as a genetic interaction and it remains poorly understood, especially at the molecular level. Here, Diss and Lehner developed a method, called deepPCA, to study the consequences of mutations in proteins in the laboratory. The experiments focused on two human genes which code for two proteins that normally attach to each other. Two mutations were artificially created, either one in each gene, or two in one of them. Diss and Lehner then examined how strongly the two mutated proteins could still attach to each other. By repeating this process with over 120,000 different pairs of mutations, it became possible to study how one mutation can have different effects depending on the presence of other mutations in the same protein or in the binding partner. Overall, Diss and Lehner found that genetic interactions are the result of two mechanisms. In the first one, the two mutations together cause specific structural changes that modify how proteins bind to each other. In the second one, the changes solely depend on the magnitude of the initial, thermodynamic effects of individual mutations, but not on their specific physical and chemical properties. To predict the consequences of this second type of genetic interaction, knowing the identity or the exact effects of the two mutations is not necessary. Understanding and predicting genetic interactions is important to develop personalized medicine, where treatments are tailored based on the genetic makeup of an individual.
This knowledge will also help to study how genes have evolved together. Key Words: S. cerevisiae; computational biology; deep mutagenesis; epistasis; genetic interaction; human; protein interactions; systems biology; transcription factors
425	Morgunova E, Yin Y, Das PK, Jolma A, Zhu F, Popov A, Xu Y, Nilsson L, Taipale J. Two distinct DNA sequences recognized by transcription factors represent enthalpy and entropy optima. eLife 2018;7:32963. [PMID: 29638214 PMCID: PMC5896879 DOI: 10.7554/elife.32963] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Received: 10/20/2017] [Accepted: 02/12/2018] [Indexed: 11/17/2022] Open Abstract Most transcription factors (TFs) can bind to a population of sequences closely related to a single optimal site. However, some TFs can bind to two distinct sequences that represent two local optima in the Gibbs free energy of binding (ΔG). To determine the molecular mechanism behind this effect, we solved the structures of human HOXB13 and CDX2 bound to their two optimal DNA sequences, CAATAAA and TCGTAAA. Thermodynamic analyses by isothermal titration calorimetry revealed that both sites were bound with similar ΔG. However, the interaction with the CAA sequence was driven by change in enthalpy (ΔH), whereas the TCG site was bound with similar affinity due to smaller loss of entropy (ΔS). This thermodynamic mechanism that leads to at least two local optima likely affects many macromolecular interactions, as ΔG depends on two partially independent variables ΔH and ΔS according to the central equation of thermodynamics, ΔG = ΔH - TΔS. Genes are sections of DNA that carry the instructions needed to build other molecules including all the proteins that the cell needs to fulfill its role. The information in the DNA is stored as a code consisting of four chemical bases, often referred to simply as “A”, “C”, “G” and “T”. The order or sequence of these bases determines the role of a protein. Many organisms – including humans – are built of many different types of cells that perform unique roles.
Almost all cells carry the same genetic information, but proteins called transcription factors can regulate the activity of genes so that only a relevant subset of genes is switched on at a particular time. Transcription factors glide along DNA and bind to short DNA sequences by attaching to the DNA bases directly or through bridges made up of water molecules. Two physical concepts known as enthalpy and entropy determine the strength of the connection. Enthalpy relates to how strong the chemical bonds that form between the transcription factors and the DNA bases are, compared to a situation where the transcription factor and DNA do not form a complex and bind to water molecules around them. Entropy measures the disorder of the system – the more disordered the solvent and protein-DNA complex are compared to solvent-containing free DNA and protein, the stronger the binding. A water molecule that bridges a DNA base with an amino-acid of a protein contributes to enthalpy, but results in loss of entropy, because the system becomes more ordered since the water molecule can no longer move freely. Most transcription factors can only bind to DNA sequences that are very similar to each other, but some transcription factors can recognize several different kinds of sequences, and until now it was not clear how they could do this. Morgunova et al. studied four different human transcription factors that can each bind to two distinct DNA sequences. The results showed that the transcription factors bound to both DNA sequences with similar strength, but via different mechanisms. For one DNA sequence, an enthalpy-based mechanism essentially ‘froze’ the transcription factor to the DNA through rigid water bridges. The other DNA sequence was bound equally strongly but through moving water molecules, because this increased the entropy of the system. It is possible that these mechanisms could also apply to many other molecules that interact with each other through water-molecule bridges. 
A better knowledge of the chemical bonds between transcription factors and DNA bases may in future help efforts to develop new treatments that depend on molecules being able to bind to other molecules. In addition, these findings may one day help scientists to predict how strongly two molecules will interact simply by knowing the structures of the molecules involved. Key Words: DNA recognition; biochemistry; chemical biology; enthalpy; entropy; epistasis; human; molecular biophysics; structural biology; thermodynamics; transcription factors