1
|
Katoh-Kurasawa M, Hrovatin K, Hirose S, Webb A, Ho HI, Zupan B, Shaulsky G. Transcriptional milestones in Dictyostelium development. Genome Res 2021; 31:1498-1511. [PMID: 34183452 PMCID: PMC8327917 DOI: 10.1101/gr.275496.121] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 06/23/2021] [Indexed: 02/02/2023]
Abstract
Dictyostelium development begins with single-cell starvation and ends with multicellular fruiting bodies. Developmental morphogenesis is accompanied by sweeping transcriptional changes, encompassing nearly half of the 13,000 genes in the genome. We performed time-series RNA-sequencing analyses of the wild type and 20 mutants to explore the relationships between transcription and morphogenesis. These strains show developmental arrest at different stages, accelerated development, or atypical morphologies. Considering eight major morphological transitions, we identified 1371 milestone genes whose expression changes sharply between consecutive transitions. We also identified 1099 genes as members of 21 regulons, which are groups of genes that remain coordinately regulated despite the genetic, temporal, and developmental perturbations. The gene annotations in these groups validate known transitions and reveal new developmental events. For example, DNA replication genes are tightly coregulated with cell division genes, so they are expressed in mid-development although chromosomal DNA is not replicated. Our data set includes 486 transcriptional profiles that can help identify new relationships between transcription and development and improve gene annotations. We show its utility by showing that cycles of aggregation and disaggregation in allorecognition-defective mutants involve dedifferentiation. We also show sensitivity to genetic and developmental conditions in two commonly used actin genes, act6 and act15, and robustness of the coaA gene. Finally, we propose that gpdA is a better mRNA quantitation standard because it is less sensitive to external conditions than commonly used standards. The data set is available for democratized exploration through the web application dictyExpress and the data mining environment Orange.
Collapse
Affiliation(s)
- Mariko Katoh-Kurasawa
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Karin Hrovatin
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | - Shigenori Hirose
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Amanda Webb
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Hsing-I Ho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Blaž Zupan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia
| | - Gad Shaulsky
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
2
|
Collet J, Fellous S. Do traits separated by metamorphosis evolve independently? Concepts and methods. Proc Biol Sci 2020; 286:20190445. [PMID: 30966980 DOI: 10.1098/rspb.2019.0445] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Despite the ubiquity of complex life cycles, we know little of the evolutionary constraints exerted by metamorphosis. Here, we present pitfalls and methods to answer whether animals with a complex life cycle can independently adapt to the environments encountered at each life stage, with a specific focus on the microevolution of quantitative characters. We first discuss challenges associated with study traits and populations. We further emphasize the benefits of using a combination of approaches. We then develop how multivariate methods can limit several issues by revealing genetic patterns that are invisible when only considering trait-by-trait genetic correlations. Finally, we detail how Lande's work on sexual dimorphism can be applied in measuring G matrices across life stages. The methods and tools described here will contribute towards building a predictive framework for trait evolution across life stages.
Collapse
Affiliation(s)
- Julie Collet
- 1 CBGP, INRA, CIRAD, IRD, Montpellier SupAgro, Univ. Montpellier , Montpellier , France.,2 CEFE, CNRS, Univ. Montpellier, Univ. Paul Valéry Montpellier 3, EPHE, IRD , Montpellier , France
| | - Simon Fellous
- 1 CBGP, INRA, CIRAD, IRD, Montpellier SupAgro, Univ. Montpellier , Montpellier , France
| |
Collapse
|
3
|
Norman TM, Horlbeck MA, Replogle JM, Ge AY, Xu A, Jost M, Gilbert LA, Weissman JS. Exploring genetic interaction manifolds constructed from rich single-cell phenotypes. Science 2019; 365:786-793. [PMID: 31395745 PMCID: PMC6746554 DOI: 10.1126/science.aax4438] [Citation(s) in RCA: 125] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 07/29/2019] [Indexed: 12/18/2022]
Abstract
How cellular and organismal complexity emerges from combinatorial expression of genes is a central question in biology. High-content phenotyping approaches such as Perturb-seq (single-cell RNA-sequencing pooled CRISPR screens) present an opportunity for exploring such genetic interactions (GIs) at scale. Here, we present an analytical framework for interpreting high-dimensional landscapes of cell states (manifolds) constructed from transcriptional phenotypes. We applied this approach to Perturb-seq profiling of strong GIs mined from a growth-based, gain-of-function GI map. Exploration of this manifold enabled ordering of regulatory pathways, principled classification of GIs (e.g., identifying suppressors), and mechanistic elucidation of synergistic interactions, including an unexpected synergy between CBL and CNN1 driving erythroid differentiation. Finally, we applied recommender system machine learning to predict interactions, facilitating exploration of vastly larger GI manifolds.
Collapse
Affiliation(s)
- Thomas M Norman
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158, USA.
- Howard Hughes Medical Institute, University of California, San Francisco, CA 94158, USA
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| | - Max A Horlbeck
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158, USA
- Howard Hughes Medical Institute, University of California, San Francisco, CA 94158, USA
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| | - Joseph M Replogle
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158, USA
- Howard Hughes Medical Institute, University of California, San Francisco, CA 94158, USA
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| | - Alex Y Ge
- Department of Urology, University of California, San Francisco, CA 94158, USA
- Helen Diller Family Comprehensive Cancer Center, San Francisco, CA 94158, USA
| | - Albert Xu
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158, USA
- Howard Hughes Medical Institute, University of California, San Francisco, CA 94158, USA
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| | - Marco Jost
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158, USA
- Howard Hughes Medical Institute, University of California, San Francisco, CA 94158, USA
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| | - Luke A Gilbert
- Department of Urology, University of California, San Francisco, CA 94158, USA.
- Helen Diller Family Comprehensive Cancer Center, San Francisco, CA 94158, USA
| | - Jonathan S Weissman
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158, USA.
- Howard Hughes Medical Institute, University of California, San Francisco, CA 94158, USA
- California Institute for Quantitative Biomedical Research, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
4
|
Awdeh A, Phenix H, Karn M, Perkins TJ. Dynamics in Epistasis Analysis. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:878-891. [PMID: 28092574 DOI: 10.1109/tcbb.2017.2653110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Finding regulatory relationships between genes, including the direction and nature of influence between them, is a fundamental challenge in the field of molecular genetics. One classical approach to this problem is epistasis analysis. Broadly speaking, epistasis analysis infers the regulatory relationships between a pair of genes in a genetic pathway by considering the patterns of change in an observable trait resulting from single and double deletion of genes. While classical epistasis analysis has yielded deep insights on numerous genetic pathways, it is not without limitations. Here, we explore the possibility of dynamic epistasis analysis, in which, in addition to performing genetic perturbations of a pathway, we drive the pathway by a time-varying upstream signal. We explore the theoretical power of dynamical epistasis analysis by conducting an identifiability analysis of Boolean models of genetic pathways, comparing static and dynamic approaches. We find that even relatively simple input dynamics greatly increases the power of epistasis analysis to discriminate alternative network structures. Further, we explore the question of experiment design, and show that a subset of short time-varying signals, which we call dynamic primitives, allow maximum discriminative power with a reduced number of experiments.
Collapse
|
5
|
Angeles-Albores D, Puckett Robinson C, Williams BA, Wold BJ, Sternberg PW. Reconstructing a metazoan genetic pathway with transcriptome-wide epistasis measurements. Proc Natl Acad Sci U S A 2018; 115:E2930-E2939. [PMID: 29531064 PMCID: PMC5879656 DOI: 10.1073/pnas.1712387115] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
RNA-sequencing (RNA-seq) is commonly used to identify genetic modules that respond to perturbations. In single cells, transcriptomes have been used as phenotypes, but this concept has not been applied to whole-organism RNA-seq. Also, quantifying and interpreting epistatic effects using expression profiles remains a challenge. We developed a single coefficient to quantify transcriptome-wide epistasis that reflects the underlying interactions and which can be interpreted intuitively. To demonstrate our approach, we sequenced four single and two double mutants of Caenorhabditis elegans From these mutants, we reconstructed the known hypoxia pathway. In addition, we uncovered a class of 56 genes with HIF-1-dependent expression that have opposite changes in expression in mutants of two genes that cooperate to negatively regulate HIF-1 abundance; however, the double mutant of these genes exhibits suppression epistasis. This class violates the classical model of HIF-1 regulation but can be explained by postulating a role of hydroxylated HIF-1 in transcriptional control.
Collapse
Affiliation(s)
- David Angeles-Albores
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125
| | - Carmie Puckett Robinson
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
- Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125
- Department of Neurology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033
| | - Brian A Williams
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Barbara J Wold
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125
| | - Paul W Sternberg
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125;
- Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125
| |
Collapse
|
6
|
Curcumin affects gene expression and reactive oxygen species via a PKA dependent mechanism in Dictyostelium discoideum. PLoS One 2017; 12:e0187562. [PMID: 29135990 PMCID: PMC5685611 DOI: 10.1371/journal.pone.0187562] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2017] [Accepted: 10/04/2017] [Indexed: 01/05/2023] Open
Abstract
Botanicals are widely used as dietary supplements and for the prevention and treatment of disease. Despite a long history of use, there is generally little evidence supporting the efficacy and safety of these preparations. Curcumin has been used to treat a myriad of human diseases and is widely advertised and marketed for its ability to improve health, but there is no clear understanding how curcumin interacts with cells and affects cell physiology. D. discoideum is a simple eukaryotic lead system that allows both tractable genetic and biochemical studies. The studies reported here show novel effects of curcumin on cell proliferation and physiology, and a pleiotropic effect on gene transcription. Transcriptome analysis showed that the effect is two-phased with an early transient effect on the transcription of approximately 5% of the genome, and demonstrates that cells respond to curcumin through a variety of previously unknown molecular pathways. This is followed by later unique transcriptional changes and a protein kinase A dependent decrease in catalase A and three superoxide dismutase enzymes. Although this results in an increase in reactive oxygen species (ROS; superoxide and H2O2), the effects of curcumin on transcription do not appear to be the direct result of oxidation. This study opens the door to future explorations of the effect of curcumin on cell physiology.
Collapse
|
7
|
Chen X, Chang JT. Planning bioinformatics workflows using an expert system. Bioinformatics 2017; 33:1210-1215. [PMID: 28052928 DOI: 10.1093/bioinformatics/btw817] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2016] [Accepted: 12/21/2016] [Indexed: 12/30/2022] Open
Abstract
Motivation Bioinformatic analyses are becoming formidably more complex due to the increasing number of steps required to process the data, as well as the proliferation of methods that can be used in each step. To alleviate this difficulty, pipelines are commonly employed. However, pipelines are typically implemented to automate a specific analysis, and thus are difficult to use for exploratory analyses requiring systematic changes to the software or parameters used. Results To automate the development of pipelines, we have investigated expert systems. We created the Bioinformatics ExperT SYstem (BETSY) that includes a knowledge base where the capabilities of bioinformatics software is explicitly and formally encoded. BETSY is a backwards-chaining rule-based expert system comprised of a data model that can capture the richness of biological data, and an inference engine that reasons on the knowledge base to produce workflows. Currently, the knowledge base is populated with rules to analyze microarray and next generation sequencing data. We evaluated BETSY and found that it could generate workflows that reproduce and go beyond previously published bioinformatics results. Finally, a meta-investigation of the workflows generated from the knowledge base produced a quantitative measure of the technical burden imposed by each step of bioinformatics analyses, revealing the large number of steps devoted to the pre-processing of data. In sum, an expert system approach can facilitate exploratory bioinformatic analysis by automating the development of workflows, a task that requires significant domain expertise. Availability and Implementation https://github.com/jefftc/changlab. Contact jeffrey.t.chang@uth.tmc.edu.
Collapse
Affiliation(s)
| | - Jeffrey T Chang
- School of Biomedical Informatics.,Department of Integrative Biology & Pharmacology, University of Texas Health Science Center at Houston, Houston, TX 77030, USA.,Department of Bioinformatics and Computational Biology, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| |
Collapse
|
8
|
The Long Noncoding RNA Transcriptome of Dictyostelium discoideum Development. G3-GENES GENOMES GENETICS 2017; 7:387-398. [PMID: 27932387 PMCID: PMC5295588 DOI: 10.1534/g3.116.037150] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Dictyostelium discoideum live in the soil as single cells, engulfing bacteria and growing vegetatively. Upon starvation, tens of thousands of amoebae enter a developmental program that includes aggregation, multicellular differentiation, and sporulation. Major shifts across the protein-coding transcriptome accompany these developmental changes. However, no study has presented a global survey of long noncoding RNAs (ncRNAs) in D. discoideum To characterize the antisense and long intergenic noncoding RNA (lncRNA) transcriptome, we analyzed previously published developmental time course samples using an RNA-sequencing (RNA-seq) library preparation method that selectively depletes ribosomal RNAs (rRNAs). We detected the accumulation of transcripts for 9833 protein-coding messenger RNAs (mRNAs), 621 lncRNAs, and 162 putative antisense RNAs (asRNAs). The noncoding RNAs were interspersed throughout the genome, and were distinct in expression level, length, and nucleotide composition. The noncoding transcriptome displayed a temporal profile similar to the coding transcriptome, with stages of gradual change interspersed with larger leaps. The transcription profiles of some noncoding RNAs were strongly correlated with known differentially expressed coding RNAs, hinting at a functional role for these molecules during development. Examining the mitochondrial transcriptome, we modeled two novel antisense transcripts. We applied yet another ribosomal depletion method to a subset of the samples to better retain transfer RNA (tRNA) transcripts. We observed polymorphisms in tRNA anticodons that suggested a post-transcriptional means by which D. discoideum compensates for codons missing in the genomic complement of tRNAs. We concluded that the prevalence and characteristics of long ncRNAs indicate that these molecules are relevant to the progression of molecular and cellular phenotypes during development.
Collapse
|
9
|
Expectation propagation for large scale Bayesian inference of non-linear molecular networks from perturbation data. PLoS One 2017; 12:e0171240. [PMID: 28166542 PMCID: PMC5293552 DOI: 10.1371/journal.pone.0171240] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 01/17/2017] [Indexed: 11/19/2022] Open
Abstract
Inferring the structure of molecular networks from time series protein or gene expression data provides valuable information about the complex biological processes of the cell. Causal network structure inference has been approached using different methods in the past. Most causal network inference techniques, such as Dynamic Bayesian Networks and ordinary differential equations, are limited by their computational complexity and thus make large scale inference infeasible. This is specifically true if a Bayesian framework is applied in order to deal with the unavoidable uncertainty about the correct model. We devise a novel Bayesian network reverse engineering approach using ordinary differential equations with the ability to include non-linearity. Besides modeling arbitrary, possibly combinatorial and time dependent perturbations with unknown targets, one of our main contributions is the use of Expectation Propagation, an algorithm for approximate Bayesian inference over large scale network structures in short computation time. We further explore the possibility of integrating prior knowledge into network inference. We evaluate the proposed model on DREAM4 and DREAM8 data and find it competitive against several state-of-the-art existing network inference methods.
Collapse
|
10
|
Abstract
Genes encode components of coevolved and interconnected networks. The effect of genotype on phenotype therefore depends on genotypic context through gene interactions known as epistasis. Epistasis is important in predicting phenotype from genotype for an individual. It is also examined in population studies to identify genetic risk factors in complex traits and to predict evolution under selection. Paradoxically, the effects of genotypic context in individuals and populations are distinct and sometimes contradictory. We argue that predicting genotype from phenotype for individuals based on population studies is difficult and, especially in human genetics, likely to result in underestimating the effects of genotypic context.
Collapse
Affiliation(s)
- Timothy B Sackton
- Informatics Group, 38 Oxford Street, Harvard University, Cambridge, MA 02138, USA
| | - Daniel L Hartl
- Department of Organismic and Evolutionary Biology, 16 Divinity Avenue, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
11
|
Sameith K, Amini S, Groot Koerkamp MJA, van Leenen D, Brok M, Brabers N, Lijnzaad P, van Hooff SR, Benschop JJ, Lenstra TL, Apweiler E, van Wageningen S, Snel B, Holstege FCP, Kemmeren P. A high-resolution gene expression atlas of epistasis between gene-specific transcription factors exposes potential mechanisms for genetic interactions. BMC Biol 2015; 13:112. [PMID: 26700642 PMCID: PMC4690272 DOI: 10.1186/s12915-015-0222-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Accepted: 12/14/2015] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Genetic interactions, or non-additive effects between genes, play a crucial role in many cellular processes and disease. Which mechanisms underlie these genetic interactions has hardly been characterized. Understanding the molecular basis of genetic interactions is crucial in deciphering pathway organization and understanding the relationship between genotype, phenotype and disease. RESULTS To investigate the nature of genetic interactions between gene-specific transcription factors (GSTFs) in Saccharomyces cerevisiae, we systematically analyzed 72 GSTF pairs by gene expression profiling double and single deletion mutants. These pairs were selected through previously published growth-based genetic interactions as well as through similarity in DNA binding properties. The result is a high-resolution atlas of gene expression-based genetic interactions that provides systems-level insight into GSTF epistasis. The atlas confirms known genetic interactions and exposes new ones. Importantly, the data can be used to investigate mechanisms that underlie individual genetic interactions. Two molecular mechanisms are proposed, "buffering by induced dependency" and "alleviation by derepression". CONCLUSIONS These mechanisms indicate how negative genetic interactions can occur between seemingly unrelated parallel pathways and how positive genetic interactions can indirectly expose parallel rather than same-pathway relationships. The focus on GSTFs is important for understanding the transcription regulatory network of yeast as it uncovers details behind many redundancy relationships, some of which are completely new. In addition, the study provides general insight into the complex nature of epistasis and proposes mechanistic models for genetic interactions, the majority of which do not fall into easily recognizable within- or between-pathway relationships.
Collapse
Affiliation(s)
- Katrin Sameith
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Saman Amini
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Marian J A Groot Koerkamp
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Dik van Leenen
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Mariel Brok
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Nathalie Brabers
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Philip Lijnzaad
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Sander R van Hooff
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Joris J Benschop
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Tineke L Lenstra
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Eva Apweiler
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Sake van Wageningen
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Berend Snel
- Theoretical Biology and Bioinformatics, Department of Biology, Utrecht University, Padualaan 8, Utrecht, The Netherlands
| | - Frank C P Holstege
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands
| | - Patrick Kemmeren
- Molecular Cancer Research, University Medical Centre Utrecht, Universiteitsweg 100, Utrecht, The Netherlands.
| |
Collapse
|
12
|
Rosengarten RD, Santhanam B, Fuller D, Katoh-Kurasawa M, Loomis WF, Zupan B, Shaulsky G. Leaps and lulls in the developmental transcriptome of Dictyostelium discoideum. BMC Genomics 2015; 16:294. [PMID: 25887420 PMCID: PMC4403905 DOI: 10.1186/s12864-015-1491-7] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Accepted: 03/26/2015] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND Development of the soil amoeba Dictyostelium discoideum is triggered by starvation. When placed on a solid substrate, the starving solitary amoebae cease growth, communicate via extracellular cAMP, aggregate by tens of thousands and develop into multicellular organisms. Early phases of the developmental program are often studied in cells starved in suspension while cAMP is provided exogenously. Previous studies revealed massive shifts in the transcriptome under both developmental conditions and a close relationship between gene expression and morphogenesis, but were limited by the sampling frequency and the resolution of the methods. RESULTS Here, we combine the superior depth and specificity of RNA-seq-based analysis of mRNA abundance with high frequency sampling during filter development and cAMP pulsing in suspension. We found that the developmental transcriptome exhibits mostly gradual changes interspersed by a few instances of large shifts. For each time point we treated the entire transcriptome as single phenotype, and were able to characterize development as groups of similar time points separated by gaps. The grouped time points represented gradual changes in mRNA abundance, or molecular phenotype, and the gaps represented times during which many genes are differentially expressed rapidly, and thus the phenotype changes dramatically. Comparing developmental experiments revealed that gene expression in filter developed cells lagged behind those treated with exogenous cAMP in suspension. The high sampling frequency revealed many genes whose regulation is reproducibly more complex than indicated by previous studies. Gene Ontology enrichment analysis suggested that the transition to multicellularity coincided with rapid accumulation of transcripts associated with DNA processes and mitosis. Later development included the up-regulation of organic signaling molecules and co-factor biosynthesis. Our analysis also demonstrated a high level of synchrony among the developing structures throughout development. CONCLUSIONS Our data describe D. discoideum development as a series of coordinated cellular and multicellular activities. Coordination occurred within fields of aggregating cells and among multicellular bodies, such as mounds or migratory slugs that experience both cell-cell contact and various soluble signaling regimes. These time courses, sampled at the highest temporal resolution to date in this system, provide a comprehensive resource for studies of developmental gene expression.
Collapse
Affiliation(s)
- Rafael David Rosengarten
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
| | - Balaji Santhanam
- Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
| | - Danny Fuller
- Section of Cell and Developmental Biology, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
| | - Mariko Katoh-Kurasawa
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
| | - William F Loomis
- Section of Cell and Developmental Biology, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093, USA.
| | - Blaz Zupan
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
- Faculty of Computer and Information Science, University of Ljubljana, Trzaska cesta 25, Ljubljana, SI-1001, Slovenia.
| | - Gad Shaulsky
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
- Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, One Baylor Plaza, Houston, TX, 77030, USA.
| |
Collapse
|
13
|
Zitnik M, Zupan B. Gene network inference by probabilistic scoring of relationships from a factorized model of interactions. ACTA ACUST UNITED AC 2014; 30:i246-i254. [PMID: 24931990 PMCID: PMC4229904 DOI: 10.1093/bioinformatics/btu287] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Motivation: Epistasis analysis is an essential tool of classical genetics for inferring the order of function of genes in a common pathway. Typically, it considers single and double mutant phenotypes and for a pair of genes observes whether a change in the first gene masks the effects of the mutation in the second gene. Despite the recent emergence of biotechnology techniques that can provide gene interaction data on a large, possibly genomic scale, few methods are available for quantitative epistasis analysis and epistasis-based network reconstruction. Results: We here propose a conceptually new probabilistic approach to gene network inference from quantitative interaction data. The approach is founded on epistasis analysis. Its features are joint treatment of the mutant phenotype data with a factorized model and probabilistic scoring of pairwise gene relationships that are inferred from the latent gene representation. The resulting gene network is assembled from scored pairwise relationships. In an experimental study, we show that the proposed approach can accurately reconstruct several known pathways and that it surpasses the accuracy of current approaches. Availability and implementation: Source code is available at http://github.com/biolab/red. Contact:blaz.zupan@fri.uni-lj.si Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Marinka Zitnik
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Blaž Zupan
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USAFaculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia and Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
14
|
Mackay TFC. Epistasis and quantitative traits: using model organisms to study gene-gene interactions. Nat Rev Genet 2014; 15:22-33. [PMID: 24296533 PMCID: PMC3918431 DOI: 10.1038/nrg3627] [Citation(s) in RCA: 500] [Impact Index Per Article: 50.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The role of epistasis in the genetic architecture of quantitative traits is controversial, despite the biological plausibility that nonlinear molecular interactions underpin the genotype-phenotype map. This controversy arises because most genetic variation for quantitative traits is additive. However, additive variance is consistent with pervasive epistasis. In this Review, I discuss experimental designs to detect the contribution of epistasis to quantitative trait phenotypes in model organisms. These studies indicate that epistasis is common, and that additivity can be an emergent property of underlying genetic interaction networks. Epistasis causes hidden quantitative genetic variation in natural populations and could be responsible for the small additive effects, missing heritability and the lack of replication that are typically observed for human complex traits.
Collapse
Affiliation(s)
- Trudy F C Mackay
- Department of Biological Sciences, Campus Box 7614, North Carolina State University, Raleigh, North Carolina 27695-7614, USA
| |
Collapse
|
15
|
Boucher B, Jenna S. Genetic interaction networks: better understand to better predict. Front Genet 2013; 4:290. [PMID: 24381582 PMCID: PMC3865423 DOI: 10.3389/fgene.2013.00290] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2013] [Accepted: 11/28/2013] [Indexed: 12/21/2022] Open
Abstract
A genetic interaction (GI) between two genes generally indicates that the phenotype of a double mutant differs from what is expected from each individual mutant. In the last decade, genome scale studies of quantitative GIs were completed using mainly synthetic genetic array technology and RNA interference in yeast and Caenorhabditis elegans. These studies raised questions regarding the functional interpretation of GIs, the relationship of genetic and molecular interaction networks, the usefulness of GI networks to infer gene function and co-functionality, the evolutionary conservation of GI, etc. While GIs have been used for decades to dissect signaling pathways in genetic models, their functional interpretations are still not trivial. The existence of a GI between two genes does not necessarily imply that these two genes code for interacting proteins or that the two genes are even expressed in the same cell. In fact, a GI only implies that the two genes share a functional relationship. These two genes may be involved in the same biological process or pathway; or they may also be involved in compensatory pathways with unrelated apparent function. Considering the powerful opportunity to better understand gene function, genetic relationship, robustness and evolution, provided by a genome-wide mapping of GIs, several in silico approaches have been employed to predict GIs in unicellular and multicellular organisms. Most of these methods used weighted data integration. In this article, we will review the later knowledge acquired on GI networks in metazoans by looking more closely into their relationship with pathways, biological processes and molecular complexes but also into their modularity and organization. We will also review the different in silico methods developed to predict GIs and will discuss how the knowledge acquired on GI networks can be used to design predictive tools with higher performances.
Collapse
Affiliation(s)
- Benjamin Boucher
- Laboratory of Integrative Genomics and Cell Signalling, Pharmaqam, Biomed, Department of Chemistry, Université du Québec à Montréal Montréal, QC, Canada
| | - Sarah Jenna
- Laboratory of Integrative Genomics and Cell Signalling, Pharmaqam, Biomed, Department of Chemistry, Université du Québec à Montréal Montréal, QC, Canada
| |
Collapse
|
16
|
Zhang J. Epistatic Clustering: A Model-Based Approach for Identifying Links Between Clusters. J Am Stat Assoc 2013. [DOI: 10.1080/01621459.2013.835661] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
17
|
Dümcke S, Bräuer J, Anchang B, Spang R, Beerenwinkel N, Tresch A. Exact likelihood computation in Boolean networks with probabilistic time delays, and its application in signal network reconstruction. ACTA ACUST UNITED AC 2013; 30:414-9. [PMID: 24292937 DOI: 10.1093/bioinformatics/btt696] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
MOTIVATION For biological pathways, it is common to measure a gene expression time series after various knockdowns of genes that are putatively involved in the process of interest. These interventional time-resolved data are most suitable for the elucidation of dynamic causal relationships in signaling networks. Even with this kind of data it is still a major and largely unsolved challenge to infer the topology and interaction logic of the underlying regulatory network. RESULTS In this work, we present a novel model-based approach involving Boolean networks to reconstruct small to medium-sized regulatory networks. In particular, we solve the problem of exact likelihood computation in Boolean networks with probabilistic exponential time delays. Simulations demonstrate the high accuracy of our approach. We apply our method to data of Ivanova et al. (2006), where RNA interference knockdown experiments were used to build a network of the key regulatory genes governing mouse stem cell maintenance and differentiation. In contrast to previous analyses of that data set, our method can identify feedback loops and provides new insights into the interplay of some master regulators in embryonic stem cell development. AVAILABILITY AND IMPLEMENTATION The algorithm is implemented in the statistical language R. Code and documentation are available at Bioinformatics online. CONTACT duemcke@mpipz.mpg.de or tresch@mpipz.mpg.de SUPPLEMENTARY INFORMATION Supplementary Materials are available at Bioinfomatics online.
Collapse
Affiliation(s)
- Sebastian Dümcke
- Institute for Genetics, University of Cologne, 50674 Cologne, Max Planck Institute for Plant Breeding Research, 50829 Cologne, Gene Center, Department of Biochemistry, Ludwig-Maximilians University, 81379 Munich, Germany, Insitute for Functional Genomics, 93053 Regensburg, Germany, Department of Radiology, Center for Cancer Systems Biology, Stanford University, Stanford, CA 94305-5488, USA and ETH Zürich, Department of Biosystems Science and Engineering, Mattenstrasse 26 4058 Basel, Switzerland
| | | | | | | | | | | |
Collapse
|
18
|
Functional dissection of regulatory models using gene expression data of deletion mutants. PLoS Genet 2013; 9:e1003757. [PMID: 24039601 PMCID: PMC3764135 DOI: 10.1371/journal.pgen.1003757] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Accepted: 07/13/2013] [Indexed: 11/19/2022] Open
Abstract
Genome-wide gene expression profiles accumulate at an alarming rate, how to integrate these expression profiles generated by different laboratories to reverse engineer the cellular regulatory network has been a major challenge. To automatically infer gene regulatory pathways from genome-wide mRNA expression profiles before and after genetic perturbations, we introduced a new Bayesian network algorithm: Deletion Mutant Bayesian Network (DM_BN). We applied DM_BN to the expression profiles of 544 yeast single or double deletion mutants of transcription factors, chromatin remodeling machinery components, protein kinases and phosphatases in S. cerevisiae. The network inferred by this method identified causal regulatory and non-causal concurrent interactions among these regulators (genetically perturbed genes) that are strongly supported by the experimental evidence, and generated many new testable hypotheses. Compared to networks reconstructed by routine similarity measures or by alternative Bayesian network algorithms, the network inferred by DM_BN excels in both precision and recall. To facilitate its application in other systems, we packaged the algorithm into a user-friendly analysis tool that can be downloaded at http://www.picb.ac.cn/hanlab/DM_BN.html.
Collapse
|
19
|
Miranda ER, Zhuchenko O, Toplak M, Santhanam B, Zupan B, Kuspa A, Shaulsky G. ABC transporters in Dictyostelium discoideum development. PLoS One 2013; 8:e70040. [PMID: 23967067 PMCID: PMC3743828 DOI: 10.1371/journal.pone.0070040] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2013] [Accepted: 06/13/2013] [Indexed: 12/15/2022] Open
Abstract
ATP-binding cassette (ABC) transporters can translocate a broad spectrum of molecules across the cell membrane including physiological cargo and toxins. ABC transporters are known for the role they play in resistance towards anticancer agents in chemotherapy of cancer patients. There are 68 ABC transporters annotated in the genome of the social amoeba Dictyostelium discoideum. We have characterized more than half of these ABC transporters through a systematic study of mutations in their genes. We have analyzed morphological and transcriptional phenotypes for these mutants during growth and development and found that most of the mutants exhibited rather subtle phenotypes. A few of the genes may share physiological functions, as reflected in their transcriptional phenotypes. Since most of the abc-transporter mutants showed subtle morphological phenotypes, we utilized these transcriptional phenotypes to identify genes that are important for development by looking for transcripts whose abundance was unperturbed in most of the mutants. We found a set of 668 genes that includes many validated D. discoideum developmental genes. We have also found that abcG6 and abcG18 may have potential roles in intercellular signaling during terminal differentiation of spores and stalks.
Collapse
Affiliation(s)
- Edward Roshan Miranda
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Graduate Program in Developmental Biology, Baylor College of Medicine, Houston, Texas, United States of America
| | - Olga Zhuchenko
- Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas, United States of America
| | - Marko Toplak
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia
| | - Balaji Santhanam
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, Texas, United States of America
| | - Blaz Zupan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Faculty of Computer and Information Science, University of Ljubljana, Ljubljana, Slovenia
| | - Adam Kuspa
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Graduate Program in Developmental Biology, Baylor College of Medicine, Houston, Texas, United States of America
- Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas, United States of America
| | - Gad Shaulsky
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Graduate Program in Developmental Biology, Baylor College of Medicine, Houston, Texas, United States of America
- Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, Texas, United States of America
- * E-mail:
| |
Collapse
|
20
|
Systematic profiling of Caenorhabditis elegans locomotive behaviors reveals additional components in G-protein Gαq signaling. Proc Natl Acad Sci U S A 2013; 110:11940-5. [PMID: 23818641 DOI: 10.1073/pnas.1310468110] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Genetic screens have been widely applied to uncover genetic mechanisms of movement disorders. However, most screens rely on human observations of qualitative differences. Here we demonstrate the application of an automatic imaging system to conduct a quantitative screen for genes regulating the locomotive behavior in Caenorhabditis elegans. Two hundred twenty-seven neuronal signaling genes with viable homozygous mutants were selected for this study. We tracked and recorded each animal for 4 min and analyzed over 4,400 animals of 239 genotypes to obtain a quantitative, 10-parameter behavioral profile for each genotype. We discovered 87 genes whose inactivation causes movement defects, including 50 genes that had never been associated with locomotive defects. Computational analysis of the high-content behavioral profiles predicted 370 genetic interactions among these genes. Network partition revealed several functional modules regulating locomotive behaviors, including sensory genes that detect environmental conditions, genes that function in multiple types of excitable cells, and genes in the signaling pathway of the G protein Gαq, a protein that is essential for animal life and behavior. We developed quantitative epistasis analysis methods to analyze the locomotive profiles and validated the prediction of the γ isoform of phospholipase C as a component in the Gαq pathway. These results provided a system-level understanding of how neuronal signaling genes coordinate locomotive behaviors. This study also demonstrated the power of quantitative approaches in genetic studies.
Collapse
|
21
|
Jaehnig EJ, Kuo D, Hombauer H, Ideker TG, Kolodner RD. Checkpoint kinases regulate a global network of transcription factors in response to DNA damage. Cell Rep 2013; 4:174-88. [PMID: 23810556 DOI: 10.1016/j.celrep.2013.05.041] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Revised: 04/04/2013] [Accepted: 05/24/2013] [Indexed: 01/01/2023] Open
Abstract
DNA damage activates checkpoint kinases that induce several downstream events, including widespread changes in transcription. However, the specific connections between the checkpoint kinases and downstream transcription factors (TFs) are not well understood. Here, we integrate kinase mutant expression profiles, transcriptional regulatory interactions, and phosphoproteomics to map kinases and downstream TFs to transcriptional regulatory networks. Specifically, we investigate the role of the Saccharomyces cerevisiae checkpoint kinases (Mec1, Tel1, Chk1, Rad53, and Dun1) in the transcriptional response to DNA damage caused by methyl methanesulfonate. The result is a global kinase-TF regulatory network in which Mec1 and Tel1 signal through Rad53 to synergistically regulate the expression of more than 600 genes. This network involves at least nine TFs, many of which have Rad53-dependent phosphorylation sites, as regulators of checkpoint-kinase-dependent genes. We also identify a major DNA damage-induced transcriptional network that regulates stress response genes independently of the checkpoint kinases.
Collapse
Affiliation(s)
- Eric J Jaehnig
- Ludwig Institute for Cancer Research, University of California School of Medicine, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
| | | | | | | | | |
Collapse
|
22
|
Nasser W, Santhanam B, Miranda ER, Parikh A, Juneja K, Rot G, Dinh C, Chen R, Zupan B, Shaulsky G, Kuspa A. Bacterial discrimination by dictyostelid amoebae reveals the complexity of ancient interspecies interactions. Curr Biol 2013; 23:862-72. [PMID: 23664307 DOI: 10.1016/j.cub.2013.04.034] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2013] [Revised: 03/12/2013] [Accepted: 04/11/2013] [Indexed: 10/26/2022]
Abstract
BACKGROUND Amoebae and bacteria interact within predator-prey and host-pathogen relationships, but the general response of amoeba to bacteria is not well understood. The amoeba Dictyostelium discoideum feeds on, and is colonized by, diverse bacterial species, including Gram-positive [Gram(+)] and Gram-negative [Gram(-)] bacteria, two major groups of bacteria that differ in structure and macromolecular composition. RESULTS Transcriptional profiling of D. discoideum revealed sets of genes whose expression is enriched in amoebae interacting with different species of bacteria, including sets that appear specific to amoebae interacting with Gram(+) or with Gram(-) bacteria. In a genetic screen utilizing the growth of mutant amoebae on a variety of bacteria as a phenotypic readout, we identified amoebal genes that are only required for growth on Gram(+) bacteria, including one that encodes the cell-surface protein gp130, as well as several genes that are only required for growth on Gram(-) bacteria, including one that encodes a putative lysozyme, AlyL. These genes are required for parts of the transcriptional response of wild-type amoebae, and this allowed their classification into potential response pathways. CONCLUSIONS We have defined genes that are critical for amoebal survival during feeding on Gram(+), or Gram(-), bacteria that we propose form part of a regulatory network that allows D. discoideum to elicit specific cellular responses to different species of bacteria in order to optimize survival.
Collapse
Affiliation(s)
- Waleed Nasser
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Failmezger H, Praveen P, Tresch A, Fröhlich H. Learning gene network structure from time laps cell imaging in RNAi Knock downs. Bioinformatics 2013; 29:1534-40. [PMID: 23595660 DOI: 10.1093/bioinformatics/btt179] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION As RNA interference is becoming a standard method for targeted gene perturbation, computational approaches to reverse engineer parts of biological networks based on measurable effects of RNAi become increasingly relevant. The vast majority of these methods use gene expression data, but little attention has been paid so far to other data types. RESULTS Here we present a method, which can infer gene networks from high-dimensional phenotypic perturbation effects on single cells recorded by time-lapse microscopy. We use data from the Mitocheck project to extract multiple shape, intensity and texture features at each frame. Features from different cells and movies are then aligned along the cell cycle time. Subsequently we use Dynamic Nested Effects Models (dynoNEMs) to estimate parts of the network structure between perturbed genes via a Markov Chain Monte Carlo approach. Our simulation results indicate a high reconstruction quality of this method. A reconstruction based on 22 gene knock downs yielded a network, where all edges could be explained via the biological literature. AVAILABILITY The implementation of dynoNEMs is part of the Bioconductor R-package nem.
Collapse
Affiliation(s)
- Henrik Failmezger
- Computational Biology and Regulatory Networks, Max-Planck Institute for Plant Breeding Research, Carl-von-Linne-Weg 10, 50829 Cologne, Germany
| | | | | | | |
Collapse
|
24
|
Flynn KM, Cooper TF, Moore FBG, Cooper VS. The environment affects epistatic interactions to alter the topology of an empirical fitness landscape. PLoS Genet 2013; 9:e1003426. [PMID: 23593024 PMCID: PMC3616912 DOI: 10.1371/journal.pgen.1003426] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2012] [Accepted: 02/14/2013] [Indexed: 11/27/2022] Open
Abstract
The fitness effect of mutations can be influenced by their interactions with the environment, other mutations, or both. Previously, we constructed 32 ( = 25) genotypes that comprise all possible combinations of the first five beneficial mutations to fix in a laboratory-evolved population of Escherichia coli. We found that (i) all five mutations were beneficial for the background on which they occurred; (ii) interactions between mutations drove a diminishing returns type epistasis, whereby epistasis became increasingly antagonistic as the expected fitness of a genotype increased; and (iii) the adaptive landscape revealed by the mutation combinations was smooth, having a single global fitness peak. Here we examine how the environment influences epistasis by determining the interactions between the same mutations in two alternative environments, selected from among 1,920 screened environments, that produced the largest increase or decrease in fitness of the most derived genotype. Some general features of the interactions were consistent: mutations tended to remain beneficial and the overall pattern of epistasis was of diminishing returns. Other features depended on the environment; in particular, several mutations were deleterious when added to specific genotypes, indicating the presence of antagonistic interactions that were absent in the original selection environment. Antagonism was not caused by consistent pleiotropic effects of individual mutations but rather by changing interactions between mutations. Our results demonstrate that understanding adaptation in changing environments will require consideration of the combined effect of epistasis and pleiotropy across environments. The fitness effect of beneficial mutations can depend on how they interact with their genetic and external environment. The form of these interactions is important because it can alter adaptive outcomes, selecting for or against certain combinations of beneficial mutations. Here, we examine how interactions between beneficial mutations favored during adaptation of a lab strain of Escherichia coli to one simple environment are altered when the strain is grown in two novel environments. We found that fitness effects were greatly influenced by both the genetic and external environments. In several instances a change in environment reversed the effect of a mutation from beneficial to deleterious or caused combinations of beneficial mutations to become deleterious. Our results suggest that a complex or fluctuating environment may favor combinations of mutations whose interactions may be less sensitive to external conditions.
Collapse
Affiliation(s)
- Kenneth M. Flynn
- Department of Molecular, Cellular, and Biomedical Sciences, University of New Hampshire, Durham, New Hampshire, United States of America
| | - Tim F. Cooper
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Francisco B-G. Moore
- Integrated Bioscience Program, University of Akron, Akron, Ohio, United States of America
| | - Vaughn S. Cooper
- Department of Molecular, Cellular, and Biomedical Sciences, University of New Hampshire, Durham, New Hampshire, United States of America
- * E-mail:
| |
Collapse
|
25
|
Ying BW, Seno S, Kaneko F, Matsuda H, Yomo T. Multilevel comparative analysis of the contributions of genome reduction and heat shock to the Escherichia coli transcriptome. BMC Genomics 2013; 14:25. [PMID: 23324527 PMCID: PMC3553035 DOI: 10.1186/1471-2164-14-25] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2012] [Accepted: 12/29/2012] [Indexed: 12/24/2022] Open
Abstract
Background Both large deletions in genome and heat shock stress would lead to alterations in the gene expression profile; however, whether there is any potential linkage between these disturbances to the transcriptome have not been discovered. Here, the relationship between the genomic and environmental contributions to the transcriptome was analyzed by comparing the transcriptomes of the bacterium Escherichia coli (strain MG1655 and its extensive genomic deletion derivative, MDS42) grown in regular and transient heat shock conditions. Results The transcriptome analysis showed the following: (i) there was a reorganization of the transcriptome in accordance with preferred chromosomal periodicity upon genomic or heat shock perturbation; (ii) there was a considerable overlap between the perturbed regulatory networks and the categories enriched for differentially expressed genes (DEGs) following genome reduction and heat shock; (iii) the genes sensitive to genome reduction tended to be located close to genomic scars, and some were also highly responsive to heat shock; and (iv) the genomic and environmental contributions to the transcriptome displayed not only a positive correlation but also a negatively compensated relationship (i.e., antagonistic epistasis). Conclusion The contributions of genome reduction and heat shock to the Escherichia coli transcriptome were evaluated at multiple levels. The observations of overlapping perturbed networks, directional similarity in transcriptional changes, positive correlation and epistatic nature linked the two contributions and suggest somehow a crosstalk guiding transcriptional reorganization in response to both genetic and environmental disturbances in bacterium E. coli.
Collapse
Affiliation(s)
- Bei-Wen Ying
- Graduate School of Information Science and Technology, Osaka University, 1-5 Yamadaoka, Suita, Osaka, Japan
| | | | | | | | | |
Collapse
|
26
|
Abstract
Transcriptional profiling methods have been utilized in the analysis of various biological processes in Dictyostelium. Recent advances in high-throughput sequencing have increased the resolution and the dynamic range of transcriptional profiling. Here we describe the utility of RNA sequencing with the Illumina technology for production of transcriptional profiles. We also describe methods for data mapping and storage as well as common and specialized tools for data analysis, both online and offline.
Collapse
|
27
|
Phenix H, Morin K, Batenchuk C, Parker J, Abedi V, Yang L, Tepliakova L, Perkins TJ, Kærn M. Quantitative epistasis analysis and pathway inference from genetic interaction data. PLoS Comput Biol 2011; 7:e1002048. [PMID: 21589890 PMCID: PMC3093353 DOI: 10.1371/journal.pcbi.1002048] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2010] [Accepted: 03/28/2011] [Indexed: 11/20/2022] Open
Abstract
Inferring regulatory and metabolic network models from quantitative genetic interaction data remains a major challenge in systems biology. Here, we present a novel quantitative model for interpreting epistasis within pathways responding to an external signal. The model provides the basis of an experimental method to determine the architecture of such pathways, and establishes a new set of rules to infer the order of genes within them. The method also allows the extraction of quantitative parameters enabling a new level of information to be added to genetic network models. It is applicable to any system where the impact of combinatorial loss-of-function mutations can be quantified with sufficient accuracy. We test the method by conducting a systematic analysis of a thoroughly characterized eukaryotic gene network, the galactose utilization pathway in Saccharomyces cerevisiae. For this purpose, we quantify the effects of single and double gene deletions on two phenotypic traits, fitness and reporter gene expression. We show that applying our method to fitness traits reveals the order of metabolic enzymes and the effects of accumulating metabolic intermediates. Conversely, the analysis of expression traits reveals the order of transcriptional regulatory genes, secondary regulatory signals and their relative strength. Strikingly, when the analyses of the two traits are combined, the method correctly infers ~80% of the known relationships without any false positives.
Collapse
Affiliation(s)
- Hilary Phenix
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Katy Morin
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
- Department of Biochemistry, Immunology and Microbiology, University of Ottawa, Ottawa, Ontario, Canada
| | - Cory Batenchuk
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Jacob Parker
- Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
- Department of Applied Mathematics, University of Waterloo, Waterloo, Ontario, Canada
| | - Vida Abedi
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Liu Yang
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada
- Department of Applied Mathematics, University of Waterloo, Waterloo, Ontario, Canada
| | - Lioudmila Tepliakova
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Theodore J. Perkins
- Department of Biochemistry, Immunology and Microbiology, University of Ottawa, Ottawa, Ontario, Canada
- Ottawa Hospital Research Institute, Ottawa, Ontario, Canada
| | - Mads Kærn
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Ontario, Canada
- Department of Physics, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
28
|
Lindén RO, Eronen VP, Aittokallio T. Quantitative maps of genetic interactions in yeast - comparative evaluation and integrative analysis. BMC SYSTEMS BIOLOGY 2011; 5:45. [PMID: 21435228 PMCID: PMC3079637 DOI: 10.1186/1752-0509-5-45] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2010] [Accepted: 03/24/2011] [Indexed: 01/08/2023]
Abstract
Background High-throughput genetic screening approaches have enabled systematic means to study how interactions among gene mutations contribute to quantitative fitness phenotypes, with the aim of providing insights into the functional wiring diagrams of genetic interaction networks on a global scale. However, it is poorly known how well these quantitative interaction measurements agree across the screening approaches, which hinders their integrated use toward improving the coverage and quality of the genetic interaction maps in yeast and other organisms. Results Using large-scale data matrices from epistatic miniarray profiling (E-MAP), genetic interaction mapping (GIM), and synthetic genetic array (SGA) approaches, we carried out here a systematic comparative evaluation among these quantitative maps of genetic interactions in yeast. The relatively low association between the original interaction measurements or their customized scores could be improved using a matrix-based modelling framework, which enables the use of single- and double-mutant fitness estimates and measurements, respectively, when scoring genetic interactions. Toward an integrative analysis, we show how the detections from the different screening approaches can be combined to suggest novel positive and negative interactions which are complementary to those obtained using any single screening approach alone. The matrix approximation procedure has been made available to support the design and analysis of the future screening studies. Conclusions We have shown here that even if the correlation between the currently available quantitative genetic interaction maps in yeast is relatively low, their comparability can be improved by means of our computational matrix approximation procedure, which will enable integrative analysis and detection of a wider spectrum of genetic interactions using data from the complementary screening approaches.
Collapse
Affiliation(s)
- Rolf O Lindén
- Biomathematics Research Group, Department of Mathematics, University of Turku, Turku, Finland
| | | | | |
Collapse
|
29
|
Abstract
Dictyostelium discoideum belongs to a group of multicellular life forms that can also exist for long periods as single cells. This ability to shift between uni- and multicellularity makes the group ideal for studying the genetic changes that occurred at the crossroads between uni- and multicellular life. In this Primer, I discuss the mechanisms that control multicellular development in Dictyostelium discoideum and reconstruct how some of these mechanisms evolved from a stress response in the unicellular ancestor.
Collapse
Affiliation(s)
- Pauline Schaap
- College of Life Sciences, University of Dundee, Dundee, UK.
| |
Collapse
|
30
|
Baryshnikova A, Costanzo M, Kim Y, Ding H, Koh J, Toufighi K, Youn JY, Ou J, San Luis BJ, Bandyopadhyay S, Hibbs M, Hess D, Gingras AC, Bader GD, Troyanskaya OG, Brown GW, Andrews B, Boone C, Myers CL. Quantitative analysis of fitness and genetic interactions in yeast on a genome scale. Nat Methods 2010; 7:1017-24. [PMID: 21076421 PMCID: PMC3117325 DOI: 10.1038/nmeth.1534] [Citation(s) in RCA: 287] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2010] [Accepted: 10/14/2010] [Indexed: 12/27/2022]
Abstract
Global quantitative analysis of genetic interactions is a powerful approach for deciphering the roles of genes and mapping functional relationships among pathways. Using colony size as a proxy for fitness, we developed a method for measuring fitness-based genetic interactions from high-density arrays of yeast double mutants generated by synthetic genetic array (SGA) analysis. We identified several experimental sources of systematic variation and developed normalization strategies to obtain accurate single- and double-mutant fitness measurements, which rival the accuracy of other high-resolution studies. We applied the SGA score to examine the relationship between physical and genetic interaction networks, and we found that positive genetic interactions connect across functionally distinct protein complexes revealing a network of genetic suppression among loss-of-function alleles.
Collapse
Affiliation(s)
- Anastasia Baryshnikova
- Banting and Best Department of Medical Research, Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Costanzo M, Baryshnikova A, Myers CL, Andrews B, Boone C. Charting the genetic interaction map of a cell. Curr Opin Biotechnol 2010; 22:66-74. [PMID: 21111604 DOI: 10.1016/j.copbio.2010.11.001] [Citation(s) in RCA: 94] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2010] [Accepted: 11/01/2010] [Indexed: 12/23/2022]
Abstract
Genome sequencing projects have revealed a massive catalog of genes and astounding genetic diversity in a variety of organisms. We are now faced with the formidable challenge of assigning functions to thousands of genes, and how to use this information to understand how genes interact and coordinate cell function. Studies indicate that the majority of eukaryotic genes are dispensable, highlighting the extensive buffering of genomes against genetic and environmental perturbations. Such robustness poses a significant challenge to those seeking to understand the wiring diagram of the cell. Genome-scale screens for genetic interactions are an effective means to chart the network that underlies this functional redundancy. A complete atlas of genetic interactions offers the potential to assign functions to most genes identified by whole genome sequencing projects and to delineate a functional wiring diagram of the cell. Perhaps more importantly, mapping genetic networks on a large-scale will shed light on the general principles and rules governing genetic networks and provide valuable information regarding the important but elusive relationship between genotype and phenotype.
Collapse
Affiliation(s)
- Michael Costanzo
- Banting and Best Department of Medical Research and Department of Molecular Genetics, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | | | | | | | | |
Collapse
|
32
|
Schmid AK, Pan M, Sharma K, Baliga NS. Two transcription factors are necessary for iron homeostasis in a salt-dwelling archaeon. Nucleic Acids Res 2010; 39:2519-33. [PMID: 21109526 PMCID: PMC3074139 DOI: 10.1093/nar/gkq1211] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Because iron toxicity and deficiency are equally life threatening, maintaining intracellular iron levels within a narrow optimal range is critical for nearly all known organisms. However, regulatory mechanisms that establish homeostasis are not well understood in organisms that dwell in environments at the extremes of pH, temperature, and salinity. Under conditions of limited iron, the extremophile Halobacterium salinarum, a salt-loving archaeon, mounts a specific response to scavenge iron for growth. We have identified and characterized the role of two transcription factors (TFs), Idr1 and Idr2, in regulating this important response. An integrated systems analysis of TF knockout gene expression profiles and genome-wide binding locations in the presence and absence of iron has revealed that these TFs operate collaboratively to maintain iron homeostasis. In the presence of iron, Idr1 and Idr2 bind near each other at 24 loci in the genome, where they are both required to repress some genes. By contrast, Idr1 and Idr2 are both necessary to activate other genes in a putative a feed forward loop. Even at loci bound independently, the two TFs target different genes with similar functions in iron homeostasis. We discuss conserved and unique features of the Idr1-Idr2 system in the context of similar systems in organisms from other domains of life.
Collapse
Affiliation(s)
- Amy K Schmid
- Duke University, Department of Biology and Institute for Genome Sciences and Policy, Center for Systems Biology, Durham, NC 27708, USA.
| | | | | | | |
Collapse
|
33
|
Fröhlich H, Praveen P, Tresch A. Fast and efficient dynamic nested effects models. Bioinformatics 2010; 27:238-44. [DOI: 10.1093/bioinformatics/btq631] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
34
|
Plato's cave algorithm: inferring functional signaling networks from early gene expression shadows. PLoS Comput Biol 2010; 6:e1000828. [PMID: 20585619 PMCID: PMC2891706 DOI: 10.1371/journal.pcbi.1000828] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2009] [Accepted: 05/21/2010] [Indexed: 11/19/2022] Open
Abstract
Improving the ability to reverse engineer biochemical networks is a major goal of systems biology. Lesions in signaling networks lead to alterations in gene expression, which in principle should allow network reconstruction. However, the information about the activity levels of signaling proteins conveyed in overall gene expression is limited by the complexity of gene expression dynamics and of regulatory network topology. Two observations provide the basis for overcoming this limitation: a. genes induced without de-novo protein synthesis (early genes) show a linear accumulation of product in the first hour after the change in the cell's state; b. The signaling components in the network largely function in the linear range of their stimulus-response curves. Therefore, unlike most genes or most time points, expression profiles of early genes at an early time point provide direct biochemical assays that represent the activity levels of upstream signaling components. Such expression data provide the basis for an efficient algorithm (Plato's Cave algorithm; PLACA) to reverse engineer functional signaling networks. Unlike conventional reverse engineering algorithms that use steady state values, PLACA uses stimulated early gene expression measurements associated with systematic perturbations of signaling components, without measuring the signaling components themselves. Besides the reverse engineered network, PLACA also identifies the genes detecting the functional interaction, thereby facilitating validation of the predicted functional network. Using simulated datasets, the algorithm is shown to be robust to experimental noise. Using experimental data obtained from gonadotropes, PLACA reverse engineered the interaction network of six perturbed signaling components. The network recapitulated many known interactions and identified novel functional interactions that were validated by further experiment. PLACA uses the results of experiments that are feasible for any signaling network to predict the functional topology of the network and to identify novel relationships.
Collapse
|
35
|
Boj SF, Petrov D, Ferrer J. Epistasis of transcriptomes reveals synergism between transcriptional activators Hnf1alpha and Hnf4alpha. PLoS Genet 2010; 6:e1000970. [PMID: 20523905 PMCID: PMC2877749 DOI: 10.1371/journal.pgen.1000970] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2009] [Accepted: 04/23/2010] [Indexed: 01/16/2023] Open
Abstract
The transcription of individual genes is determined by combinatorial interactions between DNA-binding transcription factors. The current challenge is to understand how such combinatorial interactions regulate broad genetic programs that underlie cellular functions and disease. The transcription factors Hnf1alpha and Hnf4alpha control pancreatic islet beta-cell function and growth, and mutations in their genes cause closely related forms of diabetes. We have now exploited genetic epistasis to examine how Hnf1alpha and Hnf4alpha functionally interact in pancreatic islets. Expression profiling in islets from either Hnf1a(+/-) or pancreas-specific Hnf4a mutant mice showed that the two transcription factors regulate a strikingly similar set of genes. We integrated expression and genomic binding studies and show that the shared transcriptional phenotype of these two mutant models is linked to common direct targets, rather than to known effects of Hnf1alpha on Hnf4a gene transcription. Epistasis analysis with transcriptomes of single- and double-mutant islets revealed that Hnf1alpha and Hnf4alpha regulate common targets synergistically. Hnf1alpha binding in Hnf4a-deficient islets was decreased in selected targets, but remained unaltered in others, thus suggesting that the mechanisms for synergistic regulation are gene-specific. These findings provide an in vivo strategy to study combinatorial gene regulation and reveal how Hnf1alpha and Hnf4alpha control a common islet-cell regulatory program that is defective in human monogenic diabetes.
Collapse
Affiliation(s)
- Sylvia F. Boj
- Genomic Programming of Beta-Cells Laboratory, Institut d'Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
| | - Dimitri Petrov
- Genomic Programming of Beta-Cells Laboratory, Institut d'Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
| | - Jorge Ferrer
- Genomic Programming of Beta-Cells Laboratory, Institut d'Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
- Endocrinology Department, Hospital Clínic de Barcelona, Barcelona, Spain
- Centro de Investigación Biomédica en Red de Diabetes y Enfermedades Metabólicas Asociadas, Barcelona, Spain
- * E-mail:
| |
Collapse
|
36
|
Parikh A, Huang E, Dinh C, Zupan B, Kuspa A, Subramanian D, Shaulsky G. New components of the Dictyostelium PKA pathway revealed by Bayesian analysis of expression data. BMC Bioinformatics 2010; 11:163. [PMID: 20356373 PMCID: PMC2873529 DOI: 10.1186/1471-2105-11-163] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2009] [Accepted: 03/31/2010] [Indexed: 11/30/2022] Open
Abstract
Background Identifying candidate genes in genetic networks is important for understanding regulation and biological function. Large gene expression datasets contain relevant information about genetic networks, but mining the data is not a trivial task. Algorithms that infer Bayesian networks from expression data are powerful tools for learning complex genetic networks, since they can incorporate prior knowledge and uncover higher-order dependencies among genes. However, these algorithms are computationally demanding, so novel techniques that allow targeted exploration for discovering new members of known pathways are essential. Results Here we describe a Bayesian network approach that addresses a specific network within a large dataset to discover new components. Our algorithm draws individual genes from a large gene-expression repository, and ranks them as potential members of a known pathway. We apply this method to discover new components of the cAMP-dependent protein kinase (PKA) pathway, a central regulator of Dictyostelium discoideum development. The PKA network is well studied in D. discoideum but the transcriptional networks that regulate PKA activity and the transcriptional outcomes of PKA function are largely unknown. Most of the genes highly ranked by our method encode either known components of the PKA pathway or are good candidates. We tested 5 uncharacterized highly ranked genes by creating mutant strains and identified a candidate cAMP-response element-binding protein, yet undiscovered in D. discoideum, and a histidine kinase, a candidate upstream regulator of PKA activity. Conclusions The single-gene expansion method is useful in identifying new components of known pathways. The method takes advantage of the Bayesian framework to incorporate prior biological knowledge and discovers higher-order dependencies among genes while greatly reducing the computational resources required to process high-throughput datasets.
Collapse
Affiliation(s)
- Anup Parikh
- Graduate Program in Structural Computational Biology and Molecular Biophysics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
| | | | | | | | | | | | | |
Collapse
|
37
|
Dixon SJ, Costanzo M, Baryshnikova A, Andrews B, Boone C. Systematic Mapping of Genetic Interaction Networks. Annu Rev Genet 2009; 43:601-25. [DOI: 10.1146/annurev.genet.39.073003.114751] [Citation(s) in RCA: 216] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Scott J. Dixon
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 1A7, Canada;
- Department of Biological Sciences, Columbia University, New York, New York 10027
| | - Michael Costanzo
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 1A7, Canada;
| | - Anastasia Baryshnikova
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 1A7, Canada;
| | - Brenda Andrews
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 1A7, Canada;
| | - Charles Boone
- Banting and Best Department of Medical Research, Terrence Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 1A7, Canada;
| |
Collapse
|
38
|
Tanaka H, Yi TM. Reverse engineering a signaling network using alternative inputs. PLoS One 2009; 4:e7622. [PMID: 19898612 PMCID: PMC2764141 DOI: 10.1371/journal.pone.0007622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2009] [Accepted: 10/06/2009] [Indexed: 11/19/2022] Open
Abstract
One of the goals of systems biology is to reverse engineer in a comprehensive fashion the arrow diagrams of signal transduction systems. An important tool for ordering pathway components is genetic epistasis analysis, and here we present a strategy termed Alternative Inputs (AIs) to perform systematic epistasis analysis. An alternative input is defined as any genetic manipulation that can activate the signaling pathway instead of the natural input. We introduced the concept of an "AIs-Deletions matrix" that summarizes the outputs of all combinations of alternative inputs and deletions. We developed the theory and algorithms to construct a pairwise relationship graph from the AIs-Deletions matrix capturing both functional ordering (upstream, downstream) and logical relationships (AND, OR), and then interpreting these relationships into a standard arrow diagram. As a proof-of-principle, we applied this methodology to a subset of genes involved in yeast mating signaling. This experimental pilot study highlights the robustness of the approach and important technical challenges. In summary, this research formalizes and extends classical epistasis analysis from linear pathways to more complex networks, facilitating computational analysis and reconstruction of signaling arrow diagrams.
Collapse
Affiliation(s)
- Hiromasa Tanaka
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, California, United States of America
- Center for Complex Biological Systems, University of California Irvine, Irvine, California, United States of America
| | - Tau-Mu Yi
- Department of Developmental and Cell Biology, University of California Irvine, Irvine, California, United States of America
- Center for Complex Biological Systems, University of California Irvine, Irvine, California, United States of America
- * E-mail:
| |
Collapse
|
39
|
Fröhlich H, Sahin O, Arlt D, Bender C, Beissbarth T. Deterministic Effects Propagation Networks for reconstructing protein signaling networks from multiple interventions. BMC Bioinformatics 2009; 10:322. [PMID: 19814779 PMCID: PMC2770070 DOI: 10.1186/1471-2105-10-322] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2009] [Accepted: 10/08/2009] [Indexed: 11/10/2022] Open
Abstract
Background Modern gene perturbation techniques, like RNA interference (RNAi), enable us to study effects of targeted interventions in cells efficiently. In combination with mRNA or protein expression data this allows to gain insights into the behavior of complex biological systems. Results In this paper, we propose Deterministic Effects Propagation Networks (DEPNs) as a special Bayesian Network approach to reverse engineer signaling networks from a combination of protein expression and perturbation data. DEPNs allow to reconstruct protein networks based on combinatorial intervention effects, which are monitored via changes of the protein expression or activation over one or a few time points. Our implementation of DEPNs allows for latent network nodes (i.e. proteins without measurements) and has a built in mechanism to impute missing data. The robustness of our approach was tested on simulated data. We applied DEPNs to reconstruct the ERBB signaling network in de novo trastuzumab resistant human breast cancer cells, where protein expression was monitored on Reverse Phase Protein Arrays (RPPAs) after knockdown of network proteins using RNAi. Conclusion DEPNs offer a robust, efficient and simple approach to infer protein signaling networks from multiple interventions. The method as well as the data have been made part of the latest version of the R package "nem" available as a supplement to this paper and via the Bioconductor repository.
Collapse
Affiliation(s)
- Holger Fröhlich
- German Cancer Research Center, Molecular Genome Analysis, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany.
| | | | | | | | | |
Collapse
|
40
|
A developmental systems perspective on epistasis: computational exploration of mutational interactions in model developmental regulatory networks. PLoS One 2009; 4:e6823. [PMID: 19738908 PMCID: PMC2734181 DOI: 10.1371/journal.pone.0006823] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2009] [Accepted: 07/31/2009] [Indexed: 11/19/2022] Open
Abstract
The way in which the information contained in genotypes is translated into complex phenotypic traits (i.e. embryonic expression patterns) depends on its decoding by a multilayered hierarchy of biomolecular systems (regulatory networks). Each layer of this hierarchy displays its own regulatory schemes (i.e. operational rules such as +/− feedback) and associated control parameters, resulting in characteristic variational constraints. This process can be conceptualized as a mapping issue, and in the context of highly-dimensional genotype-phenotype mappings (GPMs) epistatic events have been shown to be ubiquitous, manifested in non-linear correspondences between changes in the genotype and their phenotypic effects. In this study I concentrate on epistatic phenomena pervading levels of biological organization above the genetic material, more specifically the realm of molecular networks. At this level, systems approaches to studying GPMs are specially suitable to shed light on the mechanistic basis of epistatic phenomena. To this aim, I constructed and analyzed ensembles of highly-modular (fully interconnected) networks with distinctive topologies, each displaying dynamic behaviors that were categorized as either arbitrary or functional according to early patterning processes in the Drosophila embryo. Spatio-temporal expression trajectories in virtual syncytial embryos were simulated via reaction-diffusion models. My in silico mutational experiments show that: 1) the average fitness decay tendency to successively accumulated mutations in ensembles of functional networks indicates the prevalence of positive epistasis, whereas in ensembles of arbitrary networks negative epistasis is the dominant tendency; and 2) the evaluation of epistatic coefficients of diverse interaction orders indicates that, both positive and negative epistasis are more prevalent in functional networks than in arbitrary ones. Overall, I conclude that the phenotypic and fitness effects of multiple perturbations are strongly conditioned by both the regulatory architecture (i.e. pattern of coupled feedback structures) and the dynamic nature of the spatio-temporal expression trajectories displayed by the simulated networks.
Collapse
|
41
|
Rot G, Parikh A, Curk T, Kuspa A, Shaulsky G, Zupan B. dictyExpress: a Dictyostelium discoideum gene expression database with an explorative data analysis web-based interface. BMC Bioinformatics 2009; 10:265. [PMID: 19706156 PMCID: PMC2738683 DOI: 10.1186/1471-2105-10-265] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2009] [Accepted: 08/25/2009] [Indexed: 11/25/2022] Open
Abstract
Background Bioinformatics often leverages on recent advancements in computer science to support biologists in their scientific discovery process. Such efforts include the development of easy-to-use web interfaces to biomedical databases. Recent advancements in interactive web technologies require us to rethink the standard submit-and-wait paradigm, and craft bioinformatics web applications that share analytical and interactive power with their desktop relatives, while retaining simplicity and availability. Results We have developed dictyExpress, a web application that features a graphical, highly interactive explorative interface to our database that consists of more than 1000 Dictyostelium discoideum gene expression experiments. In dictyExpress, the user can select experiments and genes, perform gene clustering, view gene expression profiles across time, view gene co-expression networks, perform analyses of Gene Ontology term enrichment, and simultaneously display expression profiles for a selected gene in various experiments. Most importantly, these tasks are achieved through web applications whose components are seamlessly interlinked and immediately respond to events triggered by the user, thus providing a powerful explorative data analysis environment. Conclusion dictyExpress is a precursor for a new generation of web-based bioinformatics applications with simple but powerful interactive interfaces that resemble that of the modern desktop. While dictyExpress serves mainly the Dictyostelium research community, it is relatively easy to adapt it to other datasets. We propose that the design ideas behind dictyExpress will influence the development of similar applications for other model organisms.
Collapse
Affiliation(s)
- Gregor Rot
- Faculty of Computer and Information Science, University of Ljubljana, SI-1000 Ljubljana, Slovenia.
| | | | | | | | | | | |
Collapse
|
42
|
Fröhlich H, Tresch A, Beissbarth T. Nested effects models for learning signaling networks from perturbation data. Biom J 2009; 51:304-23. [PMID: 19358219 DOI: 10.1002/bimj.200800185] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Targeted gene perturbations have become a major tool to gain insight into complex cellular processes. In combination with the measurement of downstream effects via DNA microarrays, this approach can be used to gain insight into signaling pathways. Nested Effects Models were first introduced by Markowetz et al. as a probabilistic method to reverse engineer signaling cascades based on the nested structure of downstream perturbation effects. The basic framework was substantially extended later on by Fröhlich et al., Markowetz et al., and Tresch and Markowetz. In this paper, we present a review of the complete methodology with a detailed comparison of so far proposed algorithms on a qualitative and quantitative level. As an application, we present results on estimating the signaling network between 13 genes in the ER-alpha pathway of human MCF-7 breast cancer cells. Comparison with the literature shows a substantial overlap.
Collapse
Affiliation(s)
- Holger Fröhlich
- German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany.
| | | | | |
Collapse
|
43
|
Lee SI, Dudley AM, Drubin D, Silver PA, Krogan NJ, Pe'er D, Koller D. Learning a prior on regulatory potential from eQTL data. PLoS Genet 2009; 5:e1000358. [PMID: 19180192 PMCID: PMC2627940 DOI: 10.1371/journal.pgen.1000358] [Citation(s) in RCA: 144] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2008] [Accepted: 12/29/2008] [Indexed: 11/19/2022] Open
Abstract
Genome-wide RNA expression data provide a detailed view of an organism's biological state; hence, a dataset measuring expression variation between genetically diverse individuals (eQTL data) may provide important insights into the genetics of complex traits. However, with data from a relatively small number of individuals, it is difficult to distinguish true causal polymorphisms from the large number of possibilities. The problem is particularly challenging in populations with significant linkage disequilibrium, where traits are often linked to large chromosomal regions containing many genes. Here, we present a novel method, Lirnet, that automatically learns a regulatory potential for each sequence polymorphism, estimating how likely it is to have a significant effect on gene expression. This regulatory potential is defined in terms of “regulatory features”—including the function of the gene and the conservation, type, and position of genetic polymorphisms—that are available for any organism. The extent to which the different features influence the regulatory potential is learned automatically, making Lirnet readily applicable to different datasets, organisms, and feature sets. We apply Lirnet both to the human HapMap eQTL dataset and to a yeast eQTL dataset and provide statistical and biological results demonstrating that Lirnet produces significantly better regulatory programs than other recent approaches. We demonstrate in the yeast data that Lirnet can correctly suggest a specific causal sequence variation within a large, linked chromosomal region. In one example, Lirnet uncovered a novel, experimentally validated connection between Puf3—a sequence-specific RNA binding protein—and P-bodies—cytoplasmic structures that regulate translation and RNA stability—as well as the particular causative polymorphism, a SNP in Mkt1, that induces the variation in the pathway. Gene expression data of genetically diverse individuals (eQTL data) provide a unique perspective on the effect of genetic variation on cellular pathways. However, the burden of multiple hypotheses, combined with the challenges of linkage disequilibrium, makes it difficult to correctly identify causal polymorphisms. Researchers traditionally apply heuristics for selecting among plausible hypotheses, favoring polymorphisms that are more conserved, that lead to significant amino acid change, or that reside in genes whose function is related to that of the targets. But how do we know how much weight to attribute to different regulatory features? We describe Lirnet, which learns from eQTL data how to weight regulatory features and induce a regulatory potential for sequence variations. Lirnet assesses these weights simultaneously to learning a regulatory network, finding weights that lead to a more predictive network. We show that Lirnet constructs high-accuracy regulatory programs and demonstrate its ability to correctly identify causative polymorphisms. Lirnet can flexibly use any regulatory features, including sequence features that are available for any sequenced organism, and automatically learn their weights in a dataset-specific way. This feature makes it especially advantageous for mammalian systems, where many forms of prior knowledge used in simple model organisms are incomplete or unavailable.
Collapse
Affiliation(s)
- Su-In Lee
- Computer Science Department, Stanford University, Stanford, California, United States of America
| | - Aimée M. Dudley
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - David Drubin
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Pamela A. Silver
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Nevan J. Krogan
- Department of Cellular and Molecular Pharmacology, University of California San Francisco, San Francisco, California, United States of America
| | - Dana Pe'er
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
| | - Daphne Koller
- Computer Science Department, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
44
|
Sultana H, Neelakanta G, Eichinger L, Rivero F, Noegel AA. Microarray phenotyping places cyclase associated protein CAP at the crossroad of signaling pathways reorganizing the actin cytoskeleton in Dictyostelium. Exp Cell Res 2009; 315:127-40. [DOI: 10.1016/j.yexcr.2008.10.023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2008] [Revised: 08/29/2008] [Accepted: 10/14/2008] [Indexed: 01/31/2023]
|
45
|
Structure and function of a transcriptional network activated by the MAPK Hog1. Nat Genet 2008; 40:1300-6. [PMID: 18931682 PMCID: PMC2825711 DOI: 10.1038/ng.235] [Citation(s) in RCA: 171] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2008] [Accepted: 08/13/2008] [Indexed: 11/09/2022]
Abstract
Cells regulate gene expression using a complex network of signaling pathways, transcription factors and promoters. To gain insight into the structure and function of these networks, we analyzed gene expression in single- and multiple-mutant strains to build a quantitative model of the Hog1 MAPK-dependent osmotic stress response in budding yeast. Our model reveals that the Hog1 and general stress (Msn2/4) pathways interact, at both the signaling and promoter level, to integrate information and create a context-dependent response. This study lays out a path to identifying and characterizing the role of signal integration and processing in other gene regulatory networks.
Collapse
|
46
|
uvsF RFC1, the large subunit of replication factor C in Aspergillus nidulans, is essential for DNA replication, functions in UV repair and is upregulated in response to MMS-induced DNA damage. Fungal Genet Biol 2008; 45:1227-34. [PMID: 18655840 DOI: 10.1016/j.fgb.2008.06.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2007] [Revised: 06/26/2008] [Accepted: 06/26/2008] [Indexed: 11/22/2022]
Abstract
uvsF201 was the first highly UV-sensitive repair-defective mutation isolated in Aspergillus nidulans. It showed epistasis only with postreplication repair mutations, but caused lethal interactions with many other repair-defective strains. Unexpectedly, closest homology of uvsF was found to the large subunit of human DNA replication factor RFC that is essential for DNA replication. Sequencing of the uvsF201 region identified changes at two close base pairs and the corresponding amino acids in the 5'-region of uvsF(RFC1). This viable mutant represents a novel and possibly important type. Additional sequencing of the uvsF region confirmed a mitochondrial ribosomal protein gene, mrpA(L16), closely adjacent, head-to-head with a 0.2kb joint promoter region. MMS-induced transcription of both the genes, but especially uvsF(RFC1), providing evidence for a function in DNA damage response.
Collapse
|
47
|
Expression profiles reveal parallel evolution of epistatic interactions involving the CRP regulon in Escherichia coli. PLoS Genet 2008; 4:e35. [PMID: 18282111 PMCID: PMC2242816 DOI: 10.1371/journal.pgen.0040035] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2007] [Accepted: 12/26/2007] [Indexed: 12/15/2022] Open
Abstract
The extent and nature of epistatic interactions between mutations are issues of fundamental importance in evolutionary biology. However, they are difficult to study and their influence on adaptation remains poorly understood. Here, we use a systems-level approach to examine epistatic interactions that arose during the evolution of Escherichia coli in a defined environment. We used expression arrays to compare the effect on global patterns of gene expression of deleting a central regulatory gene, crp. Effects were measured in two lineages that had independently evolved for 20,000 generations and in their common ancestor. We found that deleting crp had a much more dramatic effect on the expression profile of the two evolved lines than on the ancestor. Because the sequence of the crp gene was unchanged during evolution, these differences indicate epistatic interactions between crp and mutations at other loci that accumulated during evolution. Moreover, a striking degree of parallelism was observed between the two independently evolved lines; 115 genes that were not crp-dependent in the ancestor became dependent on crp in both evolved lines. An analysis of changes in crp dependence of well-characterized regulons identified a number of regulatory genes as candidates for harboring beneficial mutations that could account for these parallel expression changes. Mutations within three of these genes have previously been found and shown to contribute to fitness. Overall, these findings indicate that epistasis has been important in the adaptive evolution of these lines, and they provide new insight into the types of genetic changes through which epistasis can evolve. More generally, we demonstrate that expression profiles can be profitably used to investigate epistatic interactions. The effect of a genetic mutation can depend on the genotype of the organism in which it occurs. For example, a mutation that is beneficial in one genetic background might be neutral or even deleterious in another. The interactions between genes that cause this dependence—known as epistasis—play an important role in many evolutionary theories. However, they are difficult to study and remain poorly understood. We used a novel approach to examine the evolution of interactions arising between a key regulatory gene, crp, and mutations that occurred during the adaptation of a bacterium, Escherichia coli, to a laboratory environment. To do this, we measured the effect of deleting crp on the expression of all genes in the organism, providing a sensitive measure to identify new interactions involving this gene. We found that deleting crp had a dramatic and parallel effect on gene expression in two independently evolved populations, but much less effect in their ancestor. An analysis of these changes identified a number of regulatory genes as candidates for harboring beneficial mutations that could account for the parallel changes. These findings indicate that epistasis has played an important role in the evolution of these populations, and they provide insight into the types of genetic changes through which epistasis can evolve.
Collapse
|
48
|
Kim J, Bates DG, Postlethwaite I, Heslop-Harrison P, Cho KH. Linear time-varying models can reveal non-linear interactions of biomolecular regulatory networks using multiple time-series data. Bioinformatics 2008; 24:1286-92. [PMID: 18367478 DOI: 10.1093/bioinformatics/btn107] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2024] Open
Abstract
MOTIVATION Inherent non-linearities in biomolecular interactions make the identification of network interactions difficult. One of the principal problems is that all methods based on the use of linear time-invariant models will have fundamental limitations in their capability to infer certain non-linear network interactions. Another difficulty is the multiplicity of possible solutions, since, for a given dataset, there may be many different possible networks which generate the same time-series expression profiles. RESULTS A novel algorithm for the inference of biomolecular interaction networks from temporal expression data is presented. Linear time-varying models, which can represent a much wider class of time-series data than linear time-invariant models, are employed in the algorithm. From time-series expression profiles, the model parameters are identified by solving a non-linear optimization problem. In order to systematically reduce the set of possible solutions for the optimization problem, a filtering process is performed using a phase-portrait analysis with random numerical perturbations. The proposed approach has the advantages of not requiring the system to be in a stable steady state, of using time-series profiles which have been generated by a single experiment, and of allowing non-linear network interactions to be identified. The ability of the proposed algorithm to correctly infer network interactions is illustrated by its application to three examples: a non-linear model for cAMP oscillations in Dictyostelium discoideum, the cell-cycle data for Saccharomyces cerevisiae and a large-scale non-linear model of a group of synchronized Dictyostelium cells. AVAILABILITY The software used in this article is available from http://sbie.kaist.ac.kr/software
Collapse
Affiliation(s)
- Jongrae Kim
- Department of Aerospace Engineering, University of Glasgow, Glasgow, UK
| | | | | | | | | |
Collapse
|
49
|
Tresch A, Beissbarth T, Sültmann H, Kuner R, Poustka A, Buness A. Discrimination of direct and indirect interactions in a network of regulatory effects. J Comput Biol 2008; 14:1217-28. [PMID: 17990974 DOI: 10.1089/cmb.2007.0085] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The matter of concern are algorithms for the discrimination of direct from indirect regulatory effects from an interaction graph built up by error-prone measurements. Many of these algorithms can be cast as a rule for the removal of a single edge of the graph, such that the remaining graph is still consistent with the data. A set of mild conditions is given under which iterated application of such a rule leads to a unique minimal consistent graph. We show that three of the common methods for direct interactions search fulfill these conditions, thus providing a justification of their use. The main issues a reconstruction algorithm has to deal with, are the noise in the data, the presence of regulatory cycles, and the direction of the regulatory effects. We introduce a novel rule that, in contrast to the previously mentioned methods, simultaneously takes into account all these aspects. An efficient algorithm for the computation of the minimal graph is given, whose time complexity is cubic in the number of vertices of the graph. Finally, we demonstrate the utility of our method in a simulation study.
Collapse
Affiliation(s)
- Achim Tresch
- Institute for Medical Biometry, Epidemiology and Informatics, Mainz, Germany.
| | | | | | | | | | | |
Collapse
|
50
|
Aylor DL, Zeng ZB. From classical genetics to quantitative genetics to systems biology: modeling epistasis. PLoS Genet 2008; 4:e1000029. [PMID: 18369448 PMCID: PMC2265472 DOI: 10.1371/journal.pgen.1000029] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2007] [Accepted: 02/08/2008] [Indexed: 11/17/2022] Open
Abstract
Gene expression data has been used in lieu of phenotype in both classical and quantitative genetic settings. These two disciplines have separate approaches to measuring and interpreting epistasis, which is the interaction between alleles at different loci. We propose a framework for estimating and interpreting epistasis from a classical experiment that combines the strengths of each approach. A regression analysis step accommodates the quantitative nature of expression measurements by estimating the effect of gene deletions plus any interaction. Effects are selected by significance such that a reduced model describes each expression trait. We show how the resulting models correspond to specific hierarchical relationships between two regulator genes and a target gene. These relationships are the basic units of genetic pathways and genomic system diagrams. Our approach can be extended to analyze data from a variety of experiments, multiple loci, and multiple environments. Epistasis has long had two slightly different meanings depending on the context in which it is discussed. The classical definition describes an allele at one locus completely masking the effect of an allele at a second locus. Such relationships can be interpreted as hierarchical, and they can be combined to infer genetic pathways. In quantitative genetics, epistasis encompasses a wide range of interactions and can be extended to more than two loci. These two definitions coexist because they are typically applied to different types of study populations and different types of traits. The current trend is to treat gene expression as a trait in a variety of genetic backgrounds. This provides reason to revisit epistasis in this new context. We accommodate the continuous nature of gene expression using ideas from quantitative genetics, but retain the hierarchical interpretation of the classical experiment. These hierarchical relationships are the building blocks of systems diagrams and genetic pathways. This framework can serve as a foundation for future epistasis analyses based on genomic data.
Collapse
Affiliation(s)
- David L Aylor
- Bioinformatics Research Center and Program in Bioinformatics, North Carolina State University, Raleigh, North Carolina, United States of America
| | | |
Collapse
|