1
|
Jayathilaka C, Araujo R, Nguyen L, Flegg M. Two wrongs do not make a right: the assumption that an inhibitor acts as an inverse activator. J Math Biol 2024; 89:26. [PMID: 38967811 PMCID: PMC11226533 DOI: 10.1007/s00285-024-02118-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 05/10/2024] [Accepted: 06/09/2024] [Indexed: 07/06/2024]
Abstract
Models of biochemical networks are often large intractable sets of differential equations. To make sense of the complexity, relationships between genes/proteins are presented as connected graphs, the edges of which are drawn to indicate activation or inhibition relationships. These diagrams are useful for drawing qualitative conclusions in many cases by the identifying recurring of topological motifs, for example positive and negative feedback loops. These topological features are usually classified under the presumption that activation and inhibition are inverse relationships. For example, inhibition of an inhibitor is often classified the same as activation of an activator within a motif classification, effectively treating them as equivalent. Whilst in many contexts this may not lead to catastrophic errors, drawing conclusions about the behavior of motifs, pathways or networks from these broad classes of topological feature without adequate mathematical descriptions can lead to obverse outcomes. We investigate the extent to which a biochemical pathway/network will behave quantitatively dissimilar to pathway/ networks with similar typologies formed by swapping inhibitors as the inverse of activators. The purpose of the study is to determine under what circumstances rudimentary qualitative assessment of network structure can provide reliable conclusions as to the quantitative behaviour of the network. Whilst there are others, We focus on two main mathematical qualities which may cause a divergence in the behaviour of two pathways/networks which would otherwise be classified as similar; (i) a modelling feature we label 'bias' and (ii) the precise positioning of activators and inhibitors within simple pathways/motifs.
Collapse
Affiliation(s)
| | - Robyn Araujo
- School of Mathematics and Statistics, The University of Melbourne, Victoria, 3010, Australia
- ARC Centre of Excellence for the Mathematical Analysis of Cellular Systems (MACSYS), Parkville, VIC, 3010, Australia
| | - Lan Nguyen
- Monash Biomedicine Discovery Institute, Monash University, Clayton, VIC, Australia
- ARC Centre of Excellence for the Mathematical Analysis of Cellular Systems (MACSYS), Parkville, VIC, 3010, Australia
| | - Mark Flegg
- Department of Mathematics, Monash University, Clayton, VIC, Australia.
| |
Collapse
|
2
|
Paczkó M, Vörös D, Szabó P, Jékely G, Szathmáry E, Szilágyi A. A neural network-based model framework for cell-fate decisions and development. Commun Biol 2024; 7:323. [PMID: 38486083 PMCID: PMC10940658 DOI: 10.1038/s42003-024-05985-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 02/28/2024] [Indexed: 03/18/2024] Open
Abstract
Gene regulatory networks (GRNs) fulfill the essential function of maintaining the stability of cellular differentiation states by sustaining lineage-specific gene expression, while driving the progression of development. However, accounting for the relative stability of intermediate differentiation stages and their divergent trajectories remains a major challenge for models of developmental biology. Here, we develop an empirical data-based associative GRN model (AGRN) in which regulatory networks store multilineage stage-specific gene expression profiles as associative memory patterns. These networks are capable of responding to multiple instructive signals and, depending on signal timing and identity, can dynamically drive the differentiation of multipotent cells toward different cell state attractors. The AGRN dynamics can thus generate diverse lineage-committed cell populations in a robust yet flexible manner, providing an attractor-based explanation for signal-driven cell fate decisions during differentiation and offering a readily generalizable modelling tool that can be applied to a wide variety of cell specification systems.
Collapse
Affiliation(s)
- Mátyás Paczkó
- Institute of Evolution, HUN-REN Centre for Ecological Research, Konkoly-Thege M. út 29-33, 1121, Budapest, Hungary
- Doctoral School of Biology, Institute of Biology, ELTE Eötvös Loránd University, Pázmány Péter sétány 1/C, 1117, Budapest, Hungary
| | - Dániel Vörös
- Institute of Evolution, HUN-REN Centre for Ecological Research, Konkoly-Thege M. út 29-33, 1121, Budapest, Hungary
- Doctoral School of Biology, Institute of Biology, ELTE Eötvös Loránd University, Pázmány Péter sétány 1/C, 1117, Budapest, Hungary
| | - Péter Szabó
- Institute of Evolution, HUN-REN Centre for Ecological Research, Konkoly-Thege M. út 29-33, 1121, Budapest, Hungary
| | - Gáspár Jékely
- Living Systems Institute, University of Exeter, Stocker Road 4QD, EX4, Exeter, UK
| | - Eörs Szathmáry
- Institute of Evolution, HUN-REN Centre for Ecological Research, Konkoly-Thege M. út 29-33, 1121, Budapest, Hungary.
- Center for the Conceptual Foundations of Science, Parmenides Foundation, Hindenburgstr. 15, 82343, Pöcking, Germany.
- Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány 1/C, 1117, Budapest, Hungary.
| | - András Szilágyi
- Institute of Evolution, HUN-REN Centre for Ecological Research, Konkoly-Thege M. út 29-33, 1121, Budapest, Hungary
| |
Collapse
|
3
|
Chanchal DK, Chaudhary JS, Kumar P, Agnihotri N, Porwal P. CRISPR-Based Therapies: Revolutionizing Drug Development and Precision Medicine. Curr Gene Ther 2024; 24:193-207. [PMID: 38310456 DOI: 10.2174/0115665232275754231204072320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/26/2023] [Accepted: 11/15/2023] [Indexed: 02/05/2024]
Abstract
With the discovery of CRISPR-Cas9, drug development and precision medicine have undergone a major change. This review article looks at the new ways that CRISPR-based therapies are being used and how they are changing the way medicine is done. CRISPR technology's ability to precisely and flexibly edit genes has opened up new ways to find, validate, and develop drug targets. Also, it has made way for personalized gene therapies, precise gene editing, and advanced screening techniques, all of which hold great promise for treating a wide range of diseases. In this article, we look at the latest research and clinical trials that show how CRISPR could be used to treat genetic diseases, cancer, infectious diseases, and other hard-to-treat conditions. However, ethical issues and problems with regulations are also discussed in relation to CRISPR-based therapies, which shows how important it is to use them safely and responsibly. As CRISPR continues to change how drugs are made and used, this review shines a light on the amazing things that have been done and what the future might hold in this rapidly changing field.
Collapse
Affiliation(s)
- Dilip Kumar Chanchal
- Department of Pharmacy, Smt. Vidyawati College of Pharmacy, Jhansi, Uttar Pradesh, India
- Glocal School of Pharmacy, Glocal University Mirzapur Pole, Saharanpur - 247121, Uttar Pradesh, India
| | | | - Pushpendra Kumar
- Faculty of Pharmacy, Uttar Pradesh University of Medical Sciences, Saifai, Etawah 206130, Uttar Pradesh, India
| | - Neha Agnihotri
- Department of Pharmacy, Maharana Pratap College of Pharmacy, Kothi, Mandhana, Kanpur-209217, Uttar Pradesh, India
| | - Prateek Porwal
- Glocal School of Pharmacy, Glocal University Mirzapur Pole, Saharanpur - 247121, Uttar Pradesh, India
| |
Collapse
|
4
|
Zhao H, Shao C, Shi Z, He S, Gong Z. The Intrinsic Similarity of Topological Structure in Biological Neural Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:3292-3305. [PMID: 37224366 DOI: 10.1109/tcbb.2023.3279443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Most previous studies mainly have focused on the analysis of structural properties of individual neuronal networks from C. elegans. In recent years, an increasing number of synapse-level neural maps, also known as biological neural networks, have been reconstructed. However, it is not clear whether there are intrinsic similarities of structural properties of biological neural networks from different brain compartments or species. To explore this issue, we collected nine connectomes at synaptic resolution including C. elegans, and analyzed their structural properties. We found that these biological neural networks possess small-world properties and modules. Excluding the Drosophila larval visual system, these networks have rich clubs. The distributions of synaptic connection strength for these networks can be fitted by the truncated pow-law distributions. Additionally, compared with the power-law model, a log-normal distribution is a better model to fit the complementary cumulative distribution function (CCDF) of degree for these neuronal networks. Moreover, we also observed that these neural networks belong to the same superfamily based on the significance profile (SP) of small subgraphs in the network. Taken together, these findings suggest that biological neural networks share intrinsic similarities in their topological structure, revealing some principles underlying the formation of biological neural networks within and across species.
Collapse
|
5
|
Santos MVC, Feltrin AS, Costa-Amaral IC, Teixeira LR, Perini JA, Martins DC, Larentis AL. Network Analysis of Biomarkers Associated with Occupational Exposure to Benzene and Malathion. Int J Mol Sci 2023; 24:ijms24119415. [PMID: 37298367 DOI: 10.3390/ijms24119415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 04/21/2023] [Accepted: 05/03/2023] [Indexed: 06/12/2023] Open
Abstract
Complex diseases are associated with the effects of multiple genes, proteins, and biological pathways. In this context, the tools of Network Medicine are compatible as a platform to systematically explore not only the molecular complexity of a specific disease but may also lead to the identification of disease modules and pathways. Such an approach enables us to gain a better understanding of how environmental chemical exposures affect the function of human cells, providing better perceptions about the mechanisms involved and helping to monitor/prevent exposure and disease to chemicals such as benzene and malathion. We selected differentially expressed genes for exposure to benzene and malathion. The construction of interaction networks was carried out using GeneMANIA and STRING. Topological properties were calculated using MCODE, BiNGO, and CentiScaPe, and a Benzene network composed of 114 genes and 2415 interactions was obtained. After topological analysis, five networks were identified. In these subnets, the most interconnected nodes were identified as: IL-8, KLF6, KLF4, JUN, SERTAD1, and MT1H. In the Malathion network, composed of 67 proteins and 134 interactions, HRAS and STAT3 were the most interconnected nodes. Path analysis, combined with various types of high-throughput data, reflects biological processes more clearly and comprehensively than analyses involving the evaluation of individual genes. We emphasize the central roles played by several important hub genes obtained by exposure to benzene and malathion.
Collapse
Affiliation(s)
- Marcus Vinicius C Santos
- Studies Center of Worker's Health and Human Ecology (CESTEH), Sergio Arouca National School of Public Health (ENSP), Oswaldo Cruz Foundation (FIOCRUZ), Rio de Janeiro 21041-210, RJ, Brazil
| | - Arthur S Feltrin
- Center for Mathematics, Computation and Cognition, Federal University of ABC, Santo André 09210-580, SP, Brazil
| | - Isabele C Costa-Amaral
- Studies Center of Worker's Health and Human Ecology (CESTEH), Sergio Arouca National School of Public Health (ENSP), Oswaldo Cruz Foundation (FIOCRUZ), Rio de Janeiro 21041-210, RJ, Brazil
| | - Liliane R Teixeira
- Studies Center of Worker's Health and Human Ecology (CESTEH), Sergio Arouca National School of Public Health (ENSP), Oswaldo Cruz Foundation (FIOCRUZ), Rio de Janeiro 21041-210, RJ, Brazil
| | - Jamila A Perini
- Research Laboratory of Pharmaceutical Sciences (LAPESF), State University of Rio de Janeiro (West Zone-UERJ-ZO), Rio de Janeiro 23070-200, RJ, Brazil
| | - David C Martins
- Center for Mathematics, Computation and Cognition, Federal University of ABC, Santo André 09210-580, SP, Brazil
| | - Ariane L Larentis
- Studies Center of Worker's Health and Human Ecology (CESTEH), Sergio Arouca National School of Public Health (ENSP), Oswaldo Cruz Foundation (FIOCRUZ), Rio de Janeiro 21041-210, RJ, Brazil
| |
Collapse
|
6
|
Yu K, Xie W, Wang L, Zhang S, Li W. Determination of biomarkers from microarray data using graph neural network and spectral clustering. Sci Rep 2021; 11:23828. [PMID: 34903818 PMCID: PMC8668890 DOI: 10.1038/s41598-021-03316-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Accepted: 12/02/2021] [Indexed: 11/26/2022] Open
Abstract
In bioinformatics, the rapid development of gene sequencing technology has produced an increasing amount of microarray data. This type of data shares the typical characteristics of small sample size and high feature dimensions. Searching for biomarkers from microarray data, which expression features of various diseases, is essential for the disease classification. feature selection has therefore became fundemental for the analysis of microarray data, which designs to remove irrelevant and redundant features. There are a large number of redundant features and irrelevant features in microarray data, which severely degrade the classification effectiveness. We propose an innovative feature selection method with the goal of obtaining feature dependencies from a priori knowledge and removing redundant features using spectral clustering. In this paper, the graph structure is firstly constructed by using the gene interaction network as a priori knowledge, and then a link prediction method based on graph neural network is proposed to enhance the graph structure data. Finally, a feature selection method based on spectral clustering is proposed to determine biomarkers. The classification accuracy on DLBCL and Prostate can be improved by 10.90% and 16.22% compared to traditional methods. Link prediction provides an average classification accuracy improvement of 1.96% and 1.31%, and is up to 16.98% higher than the published method. The results show that the proposed method can have full use of a priori knowledge to effectively select disease prediction biomarkers with high classification accuracy.
Collapse
Affiliation(s)
- Kun Yu
- College of Medicine and Bioinformation Engineering, Northeastern University, Shenyang, China
| | - Weidong Xie
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| | - Linjie Wang
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| | - Shoujia Zhang
- School of Computer Science and Engineering, Northeastern University, Shenyang, China
| | - Wei Li
- Key Laboratory of Intelligent Computing in Medical Image MIIC, Northeastern University, Ministry of Education, Shenyang, China.
| |
Collapse
|
7
|
Carianopol CS, Chan AL, Dong S, Provart NJ, Lumba S, Gazzarrini S. An abscisic acid-responsive protein interaction network for sucrose non-fermenting related kinase1 in abiotic stress response. Commun Biol 2020; 3:145. [PMID: 32218501 PMCID: PMC7099082 DOI: 10.1038/s42003-020-0866-8] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Accepted: 02/24/2020] [Indexed: 12/13/2022] Open
Abstract
Yeast Snf1 (Sucrose non-fermenting1), mammalian AMPK (5′ AMP-activated protein kinase) and plant SnRK1 (Snf1-Related Kinase1) are conserved heterotrimeric kinase complexes that re-establish energy homeostasis following stress. The hormone abscisic acid (ABA) plays a crucial role in plant stress response. Activation of SnRK1 or ABA signaling results in overlapping transcriptional changes, suggesting these stress pathways share common targets. To investigate how SnRK1 and ABA interact during stress response in Arabidopsis thaliana, we screened the SnRK1 complex by yeast two-hybrid against a library of proteins encoded by 258 ABA-regulated genes. Here, we identify 125 SnRK1- interacting proteins (SnIPs). Network analysis indicates that a subset of SnIPs form signaling modules in response to abiotic stress. Functional studies show the involvement of SnRK1 and select SnIPs in abiotic stress responses. This targeted study uncovers the largest set of SnRK1 interactors, which can be used to further characterize SnRK1 role in plant survival under stress. Carianopol et al. construct a detailed protein interaction network for the SnRK1 kinase complex to investigate the interaction of SnRK1 and ABA during stress response. They identify 125 proteins that interact with SnRK1, which can be used further to characterise the role of SnRK1 in plant survival under stress.
Collapse
Affiliation(s)
- Carina Steliana Carianopol
- Department of Biological Sciences, University of Toronto Scarborough, 1265 Military Trail, Toronto, ON, M1C 1A4, Canada.,Department of Cell and Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, ON, M5S 3B2, Canada
| | - Aaron Lorheed Chan
- Department of Biological Sciences, University of Toronto Scarborough, 1265 Military Trail, Toronto, ON, M1C 1A4, Canada.,Department of Cell and Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, ON, M5S 3B2, Canada
| | - Shaowei Dong
- Department of Cell and Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, ON, M5S 3B2, Canada
| | - Nicholas J Provart
- Department of Cell and Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, ON, M5S 3B2, Canada.,Centre for the Analysis of Genome Evolution and Function, 25 Willcocks Street, Toronto, ON, M5S 3B2, Canada
| | - Shelley Lumba
- Department of Cell and Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, ON, M5S 3B2, Canada
| | - Sonia Gazzarrini
- Department of Biological Sciences, University of Toronto Scarborough, 1265 Military Trail, Toronto, ON, M1C 1A4, Canada. .,Department of Cell and Systems Biology, University of Toronto, 25 Willcocks Street, Toronto, ON, M5S 3B2, Canada.
| |
Collapse
|
8
|
Rivera-Mulia JC, Kim S, Gabr H, Chakraborty A, Ay F, Kahveci T, Gilbert DM. Replication timing networks reveal a link between transcription regulatory circuits and replication timing control. Genome Res 2019; 29:1415-1428. [PMID: 31434679 PMCID: PMC6724675 DOI: 10.1101/gr.247049.118] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2018] [Accepted: 08/05/2019] [Indexed: 12/11/2022]
Abstract
DNA replication occurs in a defined temporal order known as the replication timing (RT) program and is regulated during development, coordinated with 3D genome organization and transcriptional activity. However, transcription and RT are not sufficiently coordinated to predict each other, suggesting an indirect relationship. Here, we exploit genome-wide RT profiles from 15 human cell types and intermediate differentiation stages derived from human embryonic stem cells to construct different types of RT regulatory networks. First, we constructed networks based on the coordinated RT changes during cell fate commitment to create highly complex RT networks composed of thousands of interactions that form specific functional subnetwork communities. We also constructed directional regulatory networks based on the order of RT changes within cell lineages, and identified master regulators of differentiation pathways. Finally, we explored relationships between RT networks and transcriptional regulatory networks (TRNs) by combining them into more complex circuitries of composite and bipartite networks. Results identified novel trans interactions linking transcription factors that are core to the regulatory circuitry of each cell type to RT changes occurring in those cell types. These core transcription factors were found to bind cooperatively to sites in the affected replication domains, providing provocative evidence that they constitute biologically significant directional interactions. Our findings suggest a regulatory link between the establishment of cell-type-specific TRNs and RT control during lineage specification.
Collapse
Affiliation(s)
- Juan Carlos Rivera-Mulia
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota Medical School, Minneapolis, Minnesota 55455, USA
| | - Sebo Kim
- Department of Computer and Information Sciences and Engineering, University of Florida, Gainesville, Florida 32611, USA
| | - Haitham Gabr
- Department of Computer and Information Sciences and Engineering, University of Florida, Gainesville, Florida 32611, USA
| | - Abhijit Chakraborty
- La Jolla Institute for Allergy and Immunology, La Jolla, California 92037, USA
| | - Ferhat Ay
- La Jolla Institute for Allergy and Immunology, La Jolla, California 92037, USA
- School of Medicine, University of California San Diego, La Jolla, California 92093, USA
| | - Tamer Kahveci
- Department of Computer and Information Sciences and Engineering, University of Florida, Gainesville, Florida 32611, USA
| | - David M Gilbert
- Department of Biological Science, Florida State University, Tallahassee, Florida, 32306-4295, USA
- Center for Genomics and Personalized Medicine, Florida State University, Tallahassee, Florida 32306, USA
| |
Collapse
|
9
|
Defoort J, Van de Peer Y, Vermeirssen V. Function, dynamics and evolution of network motif modules in integrated gene regulatory networks of worm and plant. Nucleic Acids Res 2019; 46:6480-6503. [PMID: 29873777 PMCID: PMC6061849 DOI: 10.1093/nar/gky468] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Accepted: 05/14/2018] [Indexed: 12/29/2022] Open
Abstract
Gene regulatory networks (GRNs) consist of different molecular interactions that closely work together to establish proper gene expression in time and space. Especially in higher eukaryotes, many questions remain on how these interactions collectively coordinate gene regulation. We study high quality GRNs consisting of undirected protein–protein, genetic and homologous interactions, and directed protein–DNA, regulatory and miRNA–mRNA interactions in the worm Caenorhabditis elegans and the plant Arabidopsis thaliana. Our data-integration framework integrates interactions in composite network motifs, clusters these in biologically relevant, higher-order topological network motif modules, overlays these with gene expression profiles and discovers novel connections between modules and regulators. Similar modules exist in the integrated GRNs of worm and plant. We show how experimental or computational methodologies underlying a certain data type impact network topology. Through phylogenetic decomposition, we found that proteins of worm and plant tend to functionally interact with proteins of a similar age, while at the regulatory level TFs favor same age, but also older target genes. Despite some influence of the duplication mode difference, we also observe at the motif and module level for both species a preference for age homogeneity for undirected and age heterogeneity for directed interactions. This leads to a model where novel genes are added together to the GRNs in a specific biological functional context, regulated by one or more TFs that also target older genes in the GRNs. Overall, we detected topological, functional and evolutionary properties of GRNs that are potentially universal in all species.
Collapse
Affiliation(s)
- Jonas Defoort
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium.,Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria 0028, South Africa
| | - Vanessa Vermeirssen
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium.,VIB Center for Plant Systems Biology, 9052 Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, 9052 Ghent, Belgium
| |
Collapse
|
10
|
Systematic Complex Haploinsufficiency-Based Genetic Analysis of Candida albicans Transcription Factors: Tools and Applications to Virulence-Associated Phenotypes. G3-GENES GENOMES GENETICS 2018; 8:1299-1314. [PMID: 29472308 PMCID: PMC5873919 DOI: 10.1534/g3.117.300515] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Genetic interaction analysis is a powerful approach to the study of complex biological processes that are dependent on multiple genes. Because of the largely diploid nature of the human fungal pathogen Candida albicans, genetic interaction analysis has been limited to a small number of large-scale screens and a handful for gene-by-gene studies. Complex haploinsufficiency, which occurs when a strain containing two heterozygous mutations at distinct loci shows a phenotype that is distinct from either of the corresponding single heterozygous mutants, is an expedient approach to genetic interactions analysis in diploid organisms. Here, we describe the construction of a barcoded-library of 133 heterozygous TF deletion mutants and deletion cassettes for designed to facilitate complex haploinsufficiency-based genetic interaction studies of the TF networks in C. albicans. We have characterized the phenotypes of these heterozygous mutants under a broad range of in vitro conditions using both agar-plate and pooled signature tag-based assays. Consistent with previous studies, haploinsufficiency is relative uncommon. In contrast, a set of 12 TFs enriched in mutants with a role in adhesion were found to have altered competitive fitness at early time points in a murine model of disseminated candidiasis. Finally, we characterized the genetic interactions of a set of biofilm related TFs in the first two steps of biofilm formation, adherence and filamentation of adherent cells. The genetic interaction networks at each stage of biofilm formation are significantly different indicating that the network is not static but dynamic.
Collapse
|
11
|
Ignatius Pang CN, Goel A, Wilkins MR. Investigating the Network Basis of Negative Genetic Interactions in Saccharomyces cerevisiae with Integrated Biological Networks and Triplet Motif Analysis. J Proteome Res 2018; 17:1014-1030. [DOI: 10.1021/acs.jproteome.7b00649] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Chi Nam Ignatius Pang
- Systems
Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Apurv Goel
- Systems
Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Marc R. Wilkins
- Systems
Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| |
Collapse
|
12
|
Frenkel-Morgenstern M, Gorohovski A, Tagore S, Sekar V, Vazquez M, Valencia A. ChiPPI: a novel method for mapping chimeric protein-protein interactions uncovers selection principles of protein fusion events in cancer. Nucleic Acids Res 2017; 45:7094-7105. [PMID: 28549153 PMCID: PMC5499553 DOI: 10.1093/nar/gkx423] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Accepted: 05/07/2017] [Indexed: 12/20/2022] Open
Abstract
Fusion proteins, comprising peptides deriving from the translation of two parental genes, are produced in cancer by chromosomal aberrations. The expressed fusion protein incorporates domains of both parental proteins. Using a methodology that treats discrete protein domains as binding sites for specific domains of interacting proteins, we have cataloged the protein interaction networks for 11 528 cancer fusions (ChiTaRS-3.1). Here, we present our novel method, chimeric protein–protein interactions (ChiPPI) that uses the domain–domain co-occurrence scores in order to identify preserved interactors of chimeric proteins. Mapping the influence of fusion proteins on cell metabolism and pathways reveals that ChiPPI networks often lose tumor suppressor proteins and gain oncoproteins. Furthermore, fusions often induce novel connections between non-interactors skewing interaction networks and signaling pathways. We compared fusion protein PPI networks in leukemia/lymphoma, sarcoma and solid tumors finding distinct enrichment patterns for each disease type. While certain pathways are enriched in all three diseases (Wnt, Notch and TGF β), there are distinct patterns for leukemia (EGFR signaling, DNA replication and CCKR signaling), for sarcoma (p53 pathway and CCKR signaling) and solid tumors (FGFR and EGFR signaling). Thus, the ChiPPI method represents a comprehensive tool for studying the anomaly of skewed cellular networks produced by fusion proteins in cancer.
Collapse
Affiliation(s)
| | | | - Somnath Tagore
- Faculty of Medicine, Bar-Ilan-University, Henrietta Szold 8, Safed 1311502, Israel
| | - Vaishnovi Sekar
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), M.F.Almagro 3, 28029 Madrid, Spain
| | - Miguel Vazquez
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), M.F.Almagro 3, 28029 Madrid, Spain
| | - Alfonso Valencia
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), M.F.Almagro 3, 28029 Madrid, Spain
| |
Collapse
|
13
|
Kim W, Haukap L. NemoProfile as an efficient approach to network motif analysis with instance collection. BMC Bioinformatics 2017; 18:423. [PMID: 29072139 PMCID: PMC5657038 DOI: 10.1186/s12859-017-1822-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Background A network motif is defined as a statistically significant and recurring subgraph pattern within a network. Most existing instance collection methods are not feasible due to high memory usage issues and provision of limited network motif information. They require a two-step process that requires network motif identification prior to instance collection. Due to the impracticality in obtaining motif instances, the significance of their contribution to problem solving is debated within the field of biology. Results This paper presents NemoProfile, an efficient new network motif data model. NemoProfile simplifies instance collection by resolving memory overhead issues and is seamlessly generated, thus eliminating the need for costly two-step processing. Additionally, a case study was conducted to demonstrate the application of network motifs to existing problems in the field of biology. Conclusion NemoProfile comprises network motifs and their instances, thereby facilitating network motifs usage in real biological problems.
Collapse
|
14
|
Takaguchi T, Yoshida Y. Cycle and flow trusses in directed networks. ROYAL SOCIETY OPEN SCIENCE 2016; 3:160270. [PMID: 28018610 PMCID: PMC5180108 DOI: 10.1098/rsos.160270] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/19/2016] [Accepted: 10/31/2016] [Indexed: 06/06/2023]
Abstract
When we represent real-world systems as networks, the directions of links often convey valuable information. Finding module structures that respect link directions is one of the most important tasks for analysing directed networks. Although many notions of a directed module have been proposed, no consensus has been reached. This lack of consensus results partly because there might exist distinct types of modules in a single directed network, whereas most previous studies focused on an independent criterion for modules. To address this issue, we propose a generic notion of the so-called truss structures in directed networks. Our definition of truss is able to extract two distinct types of trusses, named the cycle truss and the flow truss, from a unified framework. By applying the method for finding trusses to empirical networks obtained from a wide range of research fields, we find that most real networks contain both cycle and flow trusses. In addition, the abundance of (and the overlap between) the two types of trusses may be useful to characterize module structures in a wide variety of empirical networks. Our findings shed light on the importance of simultaneously considering different types of modules in directed networks.
Collapse
Affiliation(s)
- Taro Takaguchi
- National Institute of Informatics, ERATO, Kawarabayashi Large Graph Project, 2-1-2 Hitotsubashi, Chiyoda-ku, 101-8430 Tokyo, Japan
- JST, ERATO, Kawarabayashi Large Graph Project, 2-1-2 Hitotsubashi, Chiyoda-ku, 101-8430 Tokyo, Japan
| | - Yuichi Yoshida
- National Institute of Informatics, ERATO, Kawarabayashi Large Graph Project, 2-1-2 Hitotsubashi, Chiyoda-ku, 101-8430 Tokyo, Japan
- Preferred Infrastructure, 1-6-1 Otemachi, Chiyoda-ku, 100-0004 Tokyo, Japan
| |
Collapse
|
15
|
Gazestani VH, Nikpour N, Mehta V, Najafabadi HS, Moshiri H, Jardim A, Salavati R. A Protein Complex Map of Trypanosoma brucei. PLoS Negl Trop Dis 2016; 10:e0004533. [PMID: 26991453 PMCID: PMC4798371 DOI: 10.1371/journal.pntd.0004533] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Accepted: 02/20/2016] [Indexed: 12/27/2022] Open
Abstract
The functions of the majority of trypanosomatid-specific proteins are unknown, hindering our understanding of the biology and pathogenesis of Trypanosomatida. While protein-protein interactions are highly informative about protein function, a global map of protein interactions and complexes is still lacking for these important human parasites. Here, benefiting from in-depth biochemical fractionation, we systematically interrogated the co-complex interactions of more than 3354 protein groups in procyclic life stage of Trypanosoma brucei, the protozoan parasite responsible for human African trypanosomiasis. Using a rigorous methodology, our analysis led to identification of 128 high-confidence complexes encompassing 716 protein groups, including 635 protein groups that lacked experimental annotation. These complexes correlate well with known pathways as well as for proteins co-expressed across the T. brucei life cycle, and provide potential functions for a large number of previously uncharacterized proteins. We validated the functions of several novel proteins associated with the RNA-editing machinery, identifying a candidate potentially involved in the mitochondrial post-transcriptional regulation of T. brucei. Our data provide an unprecedented view of the protein complex map of T. brucei, and serve as a reliable resource for further characterization of trypanosomatid proteins. The presented results in this study are available at: www.TrypsNetDB.org. Due to high evolutionary divergence of trypanosomatid pathogens from other eukaryotes, accurate prediction of functional roles for most of their proteins is not feasible based on homology-based approaches. Although protein co-complex maps provide a compelling tool for the functional annotation of proteins, as subunits of a complex are expected to be involved in similar biological processes, the current knowledge about these maps is still rudimentary. Here, we systematically examined the protein co-complex membership of more than one third of T. brucei proteome using two orthogonal fractionation approaches. A high-confidence network of co-complex relationships predicts the network context of 866 proteins, including many hypothetical and experimentally unannotated proteins. To our knowledge, this study presents the largest proteomics-based interaction map of trypanosomatid parasites to date, providing a useful resource for formulating new biological hypothesises and further experimental leads.
Collapse
Affiliation(s)
- Vahid H. Gazestani
- Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
| | - Najmeh Nikpour
- Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
| | - Vaibhav Mehta
- Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
- Department of Biochemistry, McGill University, Montreal, Quebec, Canada
| | - Hamed S. Najafabadi
- Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
- McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada
| | - Houtan Moshiri
- Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
- Department of Biochemistry, McGill University, Montreal, Quebec, Canada
| | - Armando Jardim
- Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
- Centre for Host-Parasite Interactions, Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
| | - Reza Salavati
- Institute of Parasitology, McGill University, Ste. Anne de Bellevue, Quebec, Canada
- Department of Biochemistry, McGill University, Montreal, Quebec, Canada
- McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada
- * E-mail:
| |
Collapse
|
16
|
Modeling Gene Networks in Saccharomyces cerevisiae Based on Gene Expression Profiles. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2015; 2015:621264. [PMID: 26839582 PMCID: PMC4709922 DOI: 10.1155/2015/621264] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Revised: 10/14/2015] [Accepted: 11/16/2015] [Indexed: 11/30/2022]
Abstract
Detailed and innovative analysis of gene regulatory network structures may reveal novel insights to biological mechanisms. Here we study how gene regulatory network in Saccharomyces cerevisiae can differ under aerobic and anaerobic conditions. To achieve this, we discretized the gene expression profiles and calculated the self-entropy of down- and upregulation of gene expression as well as joint entropy. Based on these quantities the uncertainty coefficient was calculated for each gene triplet, following which, separate gene logic networks were constructed for the aerobic and anaerobic conditions. Four structural parameters such as average degree, average clustering coefficient, average shortest path, and average betweenness were used to compare the structure of the corresponding aerobic and anaerobic logic networks. Five genes were identified to be putative key components of the two energy metabolisms. Furthermore, community analysis using the Newman fast algorithm revealed two significant communities for the aerobic but only one for the anaerobic network. David Gene Functional Classification suggests that, under aerobic conditions, one such community reflects the cell cycle and cell replication, while the other one is linked to the mitochondrial respiratory chain function.
Collapse
|
17
|
Kim WY, Kurmar S. Sensible method for updating motif instances in an increased biological network. Methods 2015; 83:71-9. [PMID: 25869675 DOI: 10.1016/j.ymeth.2015.04.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2015] [Revised: 04/04/2015] [Accepted: 04/06/2015] [Indexed: 11/20/2022] Open
Abstract
A network motif is defined as an over-represented subgraph pattern in a network. Network motif based techniques have been widely applied in analyses of biological networks such as transcription regulation networks (TRNs), protein-protein interaction networks (PPIs), and metabolic networks. The detection of network motifs involves the computationally expensive enumeration of subgraphs, NP-complete graph isomorphism testing, and significance testing through the generation of many random graphs to determine the statistical uniqueness of a given subgraph. These computational obstacles make network motif analysis unfeasible for many real-world applications. We observe that the fast growth of biotechnology has led to the rapid accretion of molecules (vertices) and interactions (edges) to existing biological network databases. Even with a small percentage of additions, revised networks can have a large number of differing motif instances. Currently, no existing algorithms recalculate motif instances in 'updated' networks in a practical manner. In this paper, we introduce a sensible method for efficiently recalculating motif instances by performing motif enumeration from only updated vertices and edges. Preliminary experimental results indicate that our method greatly reduces computational time by eliminating the repeated enumeration of overlapped subgraph instances detected in earlier versions of the network. The software program implementing this algorithm, defined as SUNMI (Sensible Update of Network Motif Instances), is currently a stand-alone java program and we plan to upgrade it as a web-interactive program that will be available through http://faculty.washington.edu/kimw6/research.htm in near future. Meanwhile it is recommended to contact authors to obtain the stand-alone SUNMI program.
Collapse
Affiliation(s)
- W Y Kim
- Computing and Software Systems, School of Science, Technology, Engineering, and Mathematics, University of Washington Bothell, Bothell, WA 98011-8246, United States.
| | - S Kurmar
- Computing and Software Systems, School of Science, Technology, Engineering, and Mathematics, University of Washington Bothell, Bothell, WA 98011-8246, United States.
| |
Collapse
|
18
|
Nitzan M, Shimoni Y, Rosolio O, Margalit H, Biham O. Stochastic analysis of bistability in coherent mixed feedback loops combining transcriptional and posttranscriptional regulations. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2015; 91:052706. [PMID: 26066198 DOI: 10.1103/physreve.91.052706] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2014] [Indexed: 06/04/2023]
Abstract
Mixed feedback loops combining transcriptional and posttranscriptional regulations are common in cellular regulatory networks. They consist of two genes, encoding a transcription factor and a small noncoding RNA (sRNA), which mutually regulate each other's expression. We present a theoretical and numerical study of coherent mixed feedback loops of this type, in which both regulations are negative. Under suitable conditions, these feedback loops are expected to exhibit bistability, namely, two stable states, one dominated by the transcriptional repressor and the other dominated by the sRNA. We use deterministic methods based on rate equation models, in order to identify the range of parameters in which bistability takes place. However, the deterministic models do not account for the finite lifetimes of the bistable states and the spontaneous, fluctuation-driven transitions between them. Therefore, we use stochastic methods to calculate the average lifetimes of the two states. It is found that these lifetimes strongly depend on rate coefficients such as the transcription rates of the transcriptional repressor and the sRNA. In particular, we show that the fraction of time the system spends in the sRNA-dominated state follows a monotonically decreasing sigmoid function of the transcriptional repressor transcription rate. The biological relevance of these results is discussed in the context of such mixed feedback loops in Escherichia coli. It is shown that the fluctuation-driven transitions and the dependence of some rate coefficients on the biological conditions enable the cells to switch to the state which is better suited for the existing conditions and to remain in that state as long as these conditions persist.
Collapse
Affiliation(s)
- Mor Nitzan
- Racah Institute of Physics, Hebrew University, Jerusalem 91904, Israel
- Department of Microbiology and Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University, Jerusalem 91120, Israel
| | - Yishai Shimoni
- Racah Institute of Physics, Hebrew University, Jerusalem 91904, Israel
- Department of Microbiology and Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University, Jerusalem 91120, Israel
- Center for Computational Biology and Bioinformatics (C2B2), Columbia University, New York, New York 10027, USA
| | - Oded Rosolio
- Racah Institute of Physics, Hebrew University, Jerusalem 91904, Israel
| | - Hanah Margalit
- Department of Microbiology and Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University, Jerusalem 91120, Israel
| | - Ofer Biham
- Racah Institute of Physics, Hebrew University, Jerusalem 91904, Israel
| |
Collapse
|
19
|
Network motifs that recur across species, including gene regulatory and protein-protein interaction networks. Arch Toxicol 2014; 89:489-99. [PMID: 24847787 DOI: 10.1007/s00204-014-1274-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2014] [Accepted: 05/13/2014] [Indexed: 10/25/2022]
Abstract
Cellular molecules interact in complex ways, giving rise to a cell's functional outcomes. Conscientious efforts have been made in recent years to better characterize these patterns of interactions. It has been learned that many of these interactions can be represented abstractly as a network and within a network there in many instances are network motifs. Network motifs are subgraphs that are statistically overrepresented within networks. To date, specific network motifs have been experimentally identified across various species and also within specific, intracellular networks; however, motifs that recur across species and major network types have not been systematically characterized. We reason that recurring network motifs could potentially have important implications and applications for toxicology and, in particular, toxicity testing. Therefore, the goal of this study was to determine the set of intracellular, network motifs found to recur across species of both gene regulatory and protein-protein interaction networks. We report the recurrence of 13 intracellular, network motifs across species. Ten recurring motifs were found across both protein-protein interaction networks and gene regulatory networks. The significant pair motif was found to recur only in gene regulatory networks. The diamond and one-way cycle reversible step motifs were found to recur only in protein-protein interaction networks. This study is the first formal review of recurring, intracellular network motifs across species. Within toxicology, combining our understanding of recurring motifs with mechanism and mode of action knowledge could result in more robust and efficient toxicity testing models. We are sure that our results will support research in applying network motifs to toxicity testing.
Collapse
|
20
|
Tohge T, Fernie AR. Lignin, mitochondrial family, and photorespiratory transporter classification as case studies in using co-expression, co-response, and protein locations to aid in identifying transport functions. FRONTIERS IN PLANT SCIENCE 2014; 5:75. [PMID: 24672529 PMCID: PMC3955873 DOI: 10.3389/fpls.2014.00075] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2013] [Accepted: 02/17/2014] [Indexed: 06/03/2023]
Abstract
Whole genome sequencing and the relative ease of transcript profiling have facilitated the collection and data warehousing of immense quantities of expression data. However, a substantial proportion of genes are not yet functionally annotated a problem which is particularly acute for transport proteins. In Arabidopsis, for example, only a minor fraction of the estimated 700 intracellular transporters have been identified at the molecular genetic level. Furthermore it is only within the last couple of years that critical genes such as those encoding the final transport step required for the long distance transport of sucrose and the first transporter of the core photorespiratory pathway have been identified. Here we will describe how transcriptional coordination between genes of known function and non-annotated genes allows the identification of putative transporters on the premise that such co-expressed genes tend to be functionally related. We will additionally extend this to include the expansion of this approach to include phenotypic information from other levels of cellular organization such as proteomic and metabolomic data and provide case studies wherein this approach has successfully been used to fill knowledge gaps in important metabolic pathways and physiological processes.
Collapse
Affiliation(s)
- Takayuki Tohge
- *Correspondence: Takayuki Tohge, Department 1 (Willmitzer), Central Metabolism, Max Planck Institute for Plant Physiology, Am Mühlenberg 1, 14476 Potsdam, Germany e-mail:
| | | |
Collapse
|
21
|
Silberberg Y, Kupiec M, Sharan R. A method for predicting protein-protein interaction types. PLoS One 2014; 9:e90904. [PMID: 24625764 PMCID: PMC3953217 DOI: 10.1371/journal.pone.0090904] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2013] [Accepted: 02/07/2014] [Indexed: 11/19/2022] Open
Abstract
Protein-protein interactions (PPIs) govern basic cellular processes through signal transduction and complex formation. The diversity of those processes gives rise to a remarkable diversity of interactions types, ranging from transient phosphorylation interactions to stable covalent bonding. Despite our increasing knowledge on PPIs in humans and other species, their types remain relatively unexplored and few annotations of types exist in public databases. Here, we propose the first method for systematic prediction of PPI type based solely on the techniques by which the interaction was detected. We show that different detection methods are better suited for detecting specific types. We apply our method to ten interaction types on a large scale human PPI dataset. We evaluate the performance of the method using both internal cross validation and external data sources. In cross validation, we obtain an area under receiver operating characteristic (ROC) curve ranging from 0.65 to 0.97 with an average of 0.84 across the predicted types. Comparing the predicted interaction types to external data sources, we obtained significant agreements for phosphorylation and ubiquitination interactions, with hypergeometric p-value = 2.3e(-54) and 5.6e(-28) respectively. We examine the biological relevance of our predictions using known signaling pathways and chart the abundance of interaction types in cell processes. Finally, we investigate the cross-relations between different interaction types within the network and characterize the discovered patterns, or motifs. We expect the resulting annotated network to facilitate the reconstruction of process-specific subnetworks and assist in predicting protein function or interaction.
Collapse
Affiliation(s)
- Yael Silberberg
- Department of Molecular Microbiology and Biotechnology, Tel-Aviv University, Tel Aviv, Israel
| | - Martin Kupiec
- Department of Molecular Microbiology and Biotechnology, Tel-Aviv University, Tel Aviv, Israel
| | - Roded Sharan
- The Blavatnik School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel
- * E-mail:
| |
Collapse
|
22
|
Integrative approaches for finding modular structure in biological networks. Nat Rev Genet 2013; 14:719-32. [PMID: 24045689 DOI: 10.1038/nrg3552] [Citation(s) in RCA: 351] [Impact Index Per Article: 31.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
A central goal of systems biology is to elucidate the structural and functional architecture of the cell. To this end, large and complex networks of molecular interactions are being rapidly generated for humans and model organisms. A recent focus of bioinformatics research has been to integrate these networks with each other and with diverse molecular profiles to identify sets of molecules and interactions that participate in a common biological function - that is, 'modules'. Here, we classify such integrative approaches into four broad categories, describe their bioinformatic principles and review their applications.
Collapse
|
23
|
Jafari M, Sadeghi M, Mirzaie M, Marashi SA, Rezaei-Tavirani M. Evolutionarily conserved motifs and modules in mitochondrial protein–protein interaction networks. Mitochondrion 2013; 13:668-75. [DOI: 10.1016/j.mito.2013.09.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2013] [Revised: 08/18/2013] [Accepted: 09/23/2013] [Indexed: 10/26/2022]
|
24
|
Van Landeghem S, De Bodt S, Drebert ZJ, Inzé D, Van de Peer Y. The potential of text mining in data integration and network biology for plant research: a case study on Arabidopsis. THE PLANT CELL 2013; 25:794-807. [PMID: 23532071 PMCID: PMC3634689 DOI: 10.1105/tpc.112.108753] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Revised: 02/27/2013] [Accepted: 03/08/2013] [Indexed: 05/21/2023]
Abstract
Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein-protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies.
Collapse
Affiliation(s)
- Sofie Van Landeghem
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Stefanie De Bodt
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Zuzanna J. Drebert
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Dirk Inzé
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
- Address correspondence to
| |
Collapse
|
25
|
Michoel T, Nachtergaele B. Alignment and integration of complex networks by hypergraph-based spectral clustering. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2012; 86:056111. [PMID: 23214847 DOI: 10.1103/physreve.86.056111] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2012] [Indexed: 06/01/2023]
Abstract
Complex networks possess a rich, multiscale structure reflecting the dynamical and functional organization of the systems they model. Often there is a need to analyze multiple networks simultaneously, to model a system by more than one type of interaction, or to go beyond simple pairwise interactions, but currently there is a lack of theoretical and computational methods to address these problems. Here we introduce a framework for clustering and community detection in such systems using hypergraph representations. Our main result is a generalization of the Perron-Frobenius theorem from which we derive spectral clustering algorithms for directed and undirected hypergraphs. We illustrate our approach with applications for local and global alignment of protein-protein interaction networks between multiple species, for tripartite community detection in folksonomies, and for detecting clusters of overlapping regulatory pathways in directed networks.
Collapse
Affiliation(s)
- Tom Michoel
- Freiburg Institute for Advanced Studies (FRIAS), University of Freiburg, Albertstrasse 19, D-79104 Freiburg, Germany.
| | | |
Collapse
|
26
|
Sun J, Gong X, Purow B, Zhao Z. Uncovering MicroRNA and Transcription Factor Mediated Regulatory Networks in Glioblastoma. PLoS Comput Biol 2012; 8:e1002488. [PMID: 22829753 PMCID: PMC3400583 DOI: 10.1371/journal.pcbi.1002488] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2011] [Accepted: 03/05/2012] [Indexed: 12/12/2022] Open
Abstract
Glioblastoma multiforme (GBM) is the most common and lethal brain tumor in humans. Recent studies revealed that patterns of microRNA (miRNA) expression in GBM tissue samples are different from those in normal brain tissues, suggesting that a number of miRNAs play critical roles in the pathogenesis of GBM. However, little is yet known about which miRNAs play central roles in the pathology of GBM and their regulatory mechanisms of action. To address this issue, in this study, we systematically explored the main regulation format (feed-forward loops, FFLs) consisting of miRNAs, transcription factors (TFs) and their impacting GBM-related genes, and developed a computational approach to construct a miRNA-TF regulatory network. First, we compiled GBM-related miRNAs, GBM-related genes, and known human TFs. We then identified 1,128 3-node FFLs and 805 4-node FFLs with statistical significance. By merging these FFLs together, we constructed a comprehensive GBM-specific miRNA-TF mediated regulatory network. Then, from the network, we extracted a composite GBM-specific regulatory network. To illustrate the GBM-specific regulatory network is promising for identification of critical miRNA components, we specifically examined a Notch signaling pathway subnetwork. Our follow up topological and functional analyses of the subnetwork revealed that six miRNAs (miR-124, miR-137, miR-219-5p, miR-34a, miR-9, and miR-92b) might play important roles in GBM, including some results that are supported by previous studies. In this study, we have developed a computational framework to construct a miRNA-TF regulatory network and generated the first miRNA-TF regulatory network for GBM, providing a valuable resource for further understanding the complex regulatory mechanisms in GBM. The observation of critical miRNAs in the Notch signaling pathway, with partial verification from previous studies, demonstrates that our network-based approach is promising for the identification of new and important miRNAs in GBM and, potentially, other cancers. Several recent studies have implicated the critical role of microRNAs (miRNAs) in the pathogenesis of glioblastoma (GBM), the most common and lethal brain tumor in humans, suggesting that miRNAs may be clinically useful as biomarkers for brain tumors and other cancers. However, to date, the regulatory mechanisms of miRNAs in GBM are unclear. In this study, we have systematically constructed miRNA and transcription factor (TF) mediated regulatory networks specific to GBM. To demonstrate that the GBM-specific regulatory network contains functional modules that may composite of critical miRNA components, we extracted a subnetwork including GBM-related genes involved in the Notch signaling pathway. Through network topological and functional analyses of the Notch signaling pathway subnetwork, several critical miRNAs have been identified, some of which have been reinforced by previous studies. This study not only provides novel miRNAs for further experimental design but also develops a novel computational framework to construct a miRNA-TF combinatory regulatory network for a specific disease.
Collapse
Affiliation(s)
- Jingchun Sun
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Xue Gong
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Benjamin Purow
- Division of Neuro-Oncology, Neurology Department, University of Virginia Health System, Charlottesville, Virginia, United States of America
| | - Zhongming Zhao
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Cancer Biology, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
27
|
Tohge T, Fernie AR. Co-expression and co-responses: within and beyond transcription. FRONTIERS IN PLANT SCIENCE 2012; 3:248. [PMID: 23162560 PMCID: PMC3492870 DOI: 10.3389/fpls.2012.00248] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Accepted: 10/20/2012] [Indexed: 05/04/2023]
Abstract
Whole genome sequencing, the relative ease of transcript profiling by the use of microarrays and latterly RNA sequencing approaches have facilitated the capture of vast amounts of transcript data. However, despite the enormous progress made in gene annotation a substantial proportion of genes remain to be annotated at the functional level. Considerable progress has, however, been made by searching for transcriptional coordination between genes of known function and non-annotated genes on the premise that such co-expressed genes tend to be functionally related. Here we review progress made following this approach as well as its expansion to include phenotypic information from other levels of cellular organization such as proteomic and metabolomic data as well as physiological and developmental phenotypes.
Collapse
Affiliation(s)
- Takayuki Tohge
- *Correspondence: Takayuki Tohge, Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany. e-mail:
| | | |
Collapse
|
28
|
|
29
|
Casey F, Krogan N, Shields DC, Cagney G. Distinct configurations of protein complexes and biochemical pathways revealed by epistatic interaction network motifs. BMC SYSTEMS BIOLOGY 2011; 5:133. [PMID: 21859460 PMCID: PMC3176491 DOI: 10.1186/1752-0509-5-133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/20/2011] [Accepted: 08/22/2011] [Indexed: 11/24/2022]
Abstract
Background Gene and protein interactions are commonly represented as networks, with the genes or proteins comprising the nodes and the relationship between them as edges. Motifs, or small local configurations of edges and nodes that arise repeatedly, can be used to simplify the interpretation of networks. Results We examined triplet motifs in a network of quantitative epistatic genetic relationships, and found a non-random distribution of particular motif classes. Individual motif classes were found to be associated with different functional properties, suggestive of an underlying biological significance. These associations were apparent not only for motif classes, but for individual positions within the motifs. As expected, NNN (all negative) motifs were strongly associated with previously reported genetic (i.e. synthetic lethal) interactions, while PPP (all positive) motifs were associated with protein complexes. The two other motif classes (NNP: a positive interaction spanned by two negative interactions, and NPP: a negative spanned by two positives) showed very distinct functional associations, with physical interactions dominating for the former but alternative enrichments, typical of biochemical pathways, dominating for the latter. Conclusion We present a model showing how NNP motifs can be used to recognize supportive relationships between protein complexes, while NPP motifs often identify opposing or regulatory behaviour between a gene and an associated pathway. The ability to use motifs to point toward underlying biological organizational themes is likely to be increasingly important as more extensive epistasis mapping projects in higher organisms begin.
Collapse
|
30
|
Gunsalus KC, Rhrissorrakrai K. Networks in Caenorhabditis elegans. Curr Opin Genet Dev 2011; 21:787-98. [PMID: 22054717 DOI: 10.1016/j.gde.2011.10.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Accepted: 10/11/2011] [Indexed: 10/15/2022]
Abstract
The network paradigm has become a pervasive theme in biology over the last decade, as increasingly large functional genomic datasets are being collected to interrogate regulatory influences, physical interactions, and genetic dependencies between genes, transcripts, and proteins. These 'molecular interaction' networks can be analyzed collectively and individually to define their global architecture and local patterns of connectivity. These structural features ultimately underlie functional properties such as robustness, modularity, component circuitry (e.g. feedback loops), dynamics, and responses to perturbations. This review focuses on recent progress in elucidating molecular interaction networks using different kinds of functional assays in the classical genetic model for animal development, the roundworm Caenorhabditis elegans, with representative examples to illustrate current directions in different areas of network biology.
Collapse
Affiliation(s)
- Kristin C Gunsalus
- Center for Genomics and Systems Biology and Department of Biology, New York University, 12 Waverly Place, 8th floor, New York, NY 10012, USA.
| | | |
Collapse
|
31
|
Anvar SY, Tucker A, Vinciotti V, Venema A, van Ommen GJB, van der Maarel SM, Raz V, 't Hoen PAC. Interspecies translation of disease networks increases robustness and predictive accuracy. PLoS Comput Biol 2011; 7:e1002258. [PMID: 22072955 PMCID: PMC3207951 DOI: 10.1371/journal.pcbi.1002258] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Accepted: 09/16/2011] [Indexed: 02/03/2023] Open
Abstract
Gene regulatory networks give important insights into the mechanisms underlying physiology and pathophysiology. The derivation of gene regulatory networks from high-throughput expression data via machine learning strategies is problematic as the reliability of these models is often compromised by limited and highly variable samples, heterogeneity in transcript isoforms, noise, and other artifacts. Here, we develop a novel algorithm, dubbed Dandelion, in which we construct and train intraspecies Bayesian networks that are translated and assessed on independent test sets from other species in a reiterative procedure. The interspecies disease networks are subjected to multi-layers of analysis and evaluation, leading to the identification of the most consistent relationships within the network structure. In this study, we demonstrate the performance of our algorithms on datasets from animal models of oculopharyngeal muscular dystrophy (OPMD) and patient materials. We show that the interspecies network of genes coding for the proteasome provide highly accurate predictions on gene expression levels and disease phenotype. Moreover, the cross-species translation increases the stability and robustness of these networks. Unlike existing modeling approaches, our algorithms do not require assumptions on notoriously difficult one-to-one mapping of protein orthologues or alternative transcripts and can deal with missing data. We show that the identified key components of the OPMD disease network can be confirmed in an unseen and independent disease model. This study presents a state-of-the-art strategy in constructing interspecies disease networks that provide crucial information on regulatory relationships among genes, leading to better understanding of the disease molecular mechanisms.
Collapse
Affiliation(s)
- Seyed Yahya Anvar
- Center for Human and Clinical Genetics, Leiden University Medical Center, The Netherlands.
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Axelsson E, Sandmann T, Horn T, Boutros M, Huber W, Fischer B. Extracting quantitative genetic interaction phenotypes from matrix combinatorial RNAi. BMC Bioinformatics 2011; 12:342. [PMID: 21849035 PMCID: PMC3230910 DOI: 10.1186/1471-2105-12-342] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2011] [Accepted: 08/17/2011] [Indexed: 01/07/2023] Open
Abstract
Background Systematic measurement of genetic interactions by combinatorial RNAi (co-RNAi) is a powerful tool for mapping functional modules and discovering components. It also provides insights into the role of epistasis on the way from genotype to phenotype. The interpretation of co-RNAi data requires computational and statistical analysis in order to detect interactions reliably and sensitively. Results We present a comprehensive approach to the analysis of univariate phenotype measurements, such as cell growth. The method is based on a quantitative model and is demonstrated on two example Drosophila cell culture data sets. We discuss adjustments for technical variability, data quality assessment, model parameter fitting and fit diagnostics, choice of scale, and assessment of statistical significance. Conclusions As a result, we obtain quantitative genetic interactions and interaction networks reflecting known biological relationships between target genes. The reliable extraction of presence, absence, and strength of interactions provides insights into molecular mechanisms.
Collapse
Affiliation(s)
- Elin Axelsson
- EMBL European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| | | | | | | | | | | |
Collapse
|
33
|
Srivas R, Hannum G, Ruscheinski J, Ono K, Wang PL, Smoot M, Ideker T. Assembling global maps of cellular function through integrative analysis of physical and genetic networks. Nat Protoc 2011; 6:1308-23. [PMID: 21886098 DOI: 10.1038/nprot.2011.368] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
To take full advantage of high-throughput genetic and physical interaction mapping projects, the raw interactions must first be assembled into models of cell structure and function. PanGIA (for physical and genetic interaction alignment) is a plug-in for the bioinformatics platform Cytoscape, designed to integrate physical and genetic interactions into hierarchical module maps. PanGIA identifies 'modules' as sets of proteins whose physical and genetic interaction data matches that of known protein complexes. Higher-order functional cooperativity and redundancy is identified by enrichment for genetic interactions across modules. This protocol begins with importing interaction networks into Cytoscape, followed by filtering and basic network visualization. Next, PanGIA is used to infer a set of modules and their functional inter-relationships. This module map is visualized in a number of intuitive ways, and modules are tested for functional enrichment and overlap with known complexes. The full protocol can be completed between 10 and 30 min, depending on the size of the data set being analyzed.
Collapse
Affiliation(s)
- Rohith Srivas
- Department of Bioengineering, University of California, San Diego, La Jolla, California, USA
| | | | | | | | | | | | | |
Collapse
|
34
|
Retrieval, alignment, and clustering of computational models based on semantic annotations. Mol Syst Biol 2011; 7:512. [PMID: 21772260 PMCID: PMC3159965 DOI: 10.1038/msb.2011.41] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2010] [Accepted: 05/31/2011] [Indexed: 01/17/2023] Open
Abstract
As the number of computational systems biology models increases, new methods are needed to explore their content and build connections with experimental data. In this Perspective article, the authors propose a flexible semantic framework that can help achieve these aims. The exploding number of computational models produced by Systems Biologists over the last years is an invitation to structure and exploit this new wealth of information. Researchers would like to trace models relevant to specific scientific questions, to explore their biological content, to align and combine them, and to match them with experimental data. To automate these processes, it is essential to consider semantic annotations, which describe their biological meaning. As a prerequisite for a wide range of computational methods, we propose general and flexible similarity measures for Systems Biology models computed from semantic annotations. By using these measures and a large extensible ontology, we implement a platform that can retrieve, cluster, and align Systems Biology models and experimental data sets. At present, its major application is the search for relevant models in the BioModels Database, starting from initial models, data sets, or lists of biological concepts. Beyond similarity searches, the representation of models by semantic feature vectors may pave the way for visualisation, exploration, and statistical analysis of large collections of models and corresponding data.
Collapse
|
35
|
Vidal M, Cusick ME, Barabási AL. Interactome networks and human disease. Cell 2011; 144:986-98. [PMID: 21414488 DOI: 10.1016/j.cell.2011.02.016] [Citation(s) in RCA: 1121] [Impact Index Per Article: 86.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2010] [Revised: 02/07/2011] [Accepted: 02/09/2011] [Indexed: 02/06/2023]
Abstract
Complex biological systems and cellular networks may underlie most genotype to phenotype relationships. Here, we review basic concepts in network biology, discussing different types of interactome networks and the insights that can come from analyzing them. We elaborate on why interactome networks are important to consider in biology, how they can be mapped and integrated with each other, what global properties are starting to emerge from interactome network models, and how these properties may relate to human disease.
Collapse
Affiliation(s)
- Marc Vidal
- Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA.
| | | | | |
Collapse
|
36
|
Mirzasoleiman B, Jalili M. Failure tolerance of motif structure in biological networks. PLoS One 2011; 6:e20512. [PMID: 21637829 PMCID: PMC3102726 DOI: 10.1371/journal.pone.0020512] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2011] [Accepted: 04/28/2011] [Indexed: 01/21/2023] Open
Abstract
Complex networks serve as generic models for many biological systems that have been shown to share a number of common structural properties such as power-law degree distribution and small-worldness. Real-world networks are composed of building blocks called motifs that are indeed specific subgraphs of (usually) small number of nodes. Network motifs are important in the functionality of complex networks, and the role of some motifs such as feed-forward loop in many biological networks has been heavily studied. On the other hand, many biological networks have shown some degrees of robustness in terms of their efficiency and connectedness against failures in their components. In this paper we investigated how random and systematic failures in the edges of biological networks influenced their motif structure. We considered two biological networks, namely, protein structure network and human brain functional network. Furthermore, we considered random failures as well as systematic failures based on different strategies for choosing candidate edges for removal. Failure in the edges tipping to high degree nodes had the most destructive role in the motif structure of the networks by decreasing their significance level, while removing edges that were connected to nodes with high values of betweenness centrality had the least effect on the significance profiles. In some cases, the latter caused increase in the significance levels of the motifs.
Collapse
Affiliation(s)
| | - Mahdi Jalili
- Department of Computer Engineering, Sharif University of Technology,
Tehran, Iran
| |
Collapse
|
37
|
Audenaert P, Van Parys T, Brondel F, Pickavet M, Demeester P, Van de Peer Y, Michoel T. CyClus3D: a Cytoscape plugin for clustering network motifs in integrated networks. Bioinformatics 2011; 27:1587-8. [DOI: 10.1093/bioinformatics/btr182] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
38
|
Michoel T, Joshi A, Nachtergaele B, Van de Peer Y. Enrichment and aggregation of topological motifs are independent organizational principles of integrated interaction networks. MOLECULAR BIOSYSTEMS 2011; 7:2769-78. [DOI: 10.1039/c1mb05241a] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
39
|
Abstract
AbstractThe mechanisms underlying life machinery are still not completely understood. Something is known, something is “probably” known, other things are still unknown. Scientists all over the world are working very hard to clarify the processes regulating the cell life cycle and bioinformaticians try to support them by developing specialized automated tools. Within the plethora of applications devoted to the study of life mechanisms, tools for the analysis and comparison of biological networks are catching the attention of many researchers. It is interesting to investigate why.
Collapse
|
40
|
Luo F, Liu J, Li J. Discovering conditional co-regulated protein complexes by integrating diverse data sources. BMC SYSTEMS BIOLOGY 2010; 4 Suppl 2:S4. [PMID: 20840731 PMCID: PMC2982691 DOI: 10.1186/1752-0509-4-s2-s4] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Abstract
Collapse
Affiliation(s)
- Fei Luo
- School of Computer, Wuhan University, Wuhan, Hubei, China.
| | | | | |
Collapse
|
41
|
Eronen VP, Lindén RO, Lindroos A, Kanerva M, Aittokallio T. Genome-wide scoring of positive and negative epistasis through decomposition of quantitative genetic interaction fitness matrices. PLoS One 2010; 5:e11611. [PMID: 20657656 PMCID: PMC2904709 DOI: 10.1371/journal.pone.0011611] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2010] [Accepted: 06/22/2010] [Indexed: 01/07/2023] Open
Abstract
Recent technological developments in genetic screening approaches have offered the means to start exploring quantitative genotype-phenotype relationships on a large-scale. What remains unclear is the extent to which the quantitative genetic interaction datasets can distinguish the broad spectrum of interaction classes, as compared to existing information on mutation pairs associated with both positive and negative interactions, and whether the scoring of varying degrees of such epistatic effects could be improved by computational means. To address these questions, we introduce here a computational approach for improving the quantitative discrimination power encoded in the genetic interaction screening data. Our matrix approximation model decomposes the original double-mutant fitness matrix into separate components, representing variability across the array and query mutants, which can be utilized for estimating and correcting the single-mutant fitness effects, respectively. When applied to three large-scale quantitative interaction datasets in yeast, we could improve the accuracy of scoring various interaction classes beyond that obtained with the original fitness data, especially in synthetic genetic array (SGA) and in genetic interaction mapping (GIM) datasets. In addition to the known pairs of interactions used in the evaluation of the computational approach, a number of novel interaction pairs were also predicted, along with underlying biological mechanisms, which remained undetected by the original datasets. It was shown that the optimal choice of the scoring function depends heavily on the screening approach and on the interaction class under analysis. Moreover, a simple preprocessing of the fitness matrix could further enhance the discrimination power of the epistatic miniarray profiling (E-MAP) dataset. These systematic evaluation results provide in-depth information on the optimal analysis of the future, large-scale screening experiments. In general, the modeling framework, enabling accurate identification and classification of genetic interactions, provides a solid basis for completing and mining the genetic interaction networks in yeast and other organisms.
Collapse
Affiliation(s)
- Ville-Pekka Eronen
- Biomathematics Research Group, Department of Mathematics, University of Turku, Turku, Finland
| | - Rolf O. Lindén
- Biomathematics Research Group, Department of Mathematics, University of Turku, Turku, Finland
- Data Mining and Modeling Group, Turku Centre for Biotechnology, University of Turku, Turku, Finland
| | - Anna Lindroos
- Division of Genetics and Physiology, Department of Biology, University of Turku, Turku, Finland
| | - Mirella Kanerva
- Division of Genetics and Physiology, Department of Biology, University of Turku, Turku, Finland
| | - Tero Aittokallio
- Biomathematics Research Group, Department of Mathematics, University of Turku, Turku, Finland
- Data Mining and Modeling Group, Turku Centre for Biotechnology, University of Turku, Turku, Finland
- * E-mail:
| |
Collapse
|
42
|
Wu C, Zhang F, Li X, Zhang S, Li J, Su F, Li K, Yan Y. Composite functional module inference: detecting cooperation between transcriptional regulation and protein interaction by mantel test. BMC SYSTEMS BIOLOGY 2010; 4:82. [PMID: 20534172 PMCID: PMC2901225 DOI: 10.1186/1752-0509-4-82] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2009] [Accepted: 06/10/2010] [Indexed: 11/22/2022]
Abstract
Background Functional modules are basic units of cell function, and exploring them is important for understanding the organization, regulation and execution of cell processes. Functional modules in single biological networks (e.g., the protein-protein interaction network), have been the focus of recent studies. Functional modules in the integrated network are composite functional modules, which imply the complex relationships involving multiple biological interaction types, and detect them will help us understand the complexity of cell processes. Results We aimed to detect composite functional modules containing co-transcriptional regulation interaction, and protein-protein interaction, in our pre-constructed integrated network of Saccharomyces cerevisiae. We computationally extracted 15 composite functional modules, and found structural consistency between co-transcriptional regulation interaction sub-network and protein-protein interaction sub-network that was well correlated with their functional hierarchy. This type of composite functional modules was compact in structure, and was found to participate in essential cell processes such as oxidative phosphorylation and RNA splicing. Conclusions The structure of composite functional modules containing co-transcriptional regulation interaction, and protein-protein interaction reflected the cooperation of transcriptional regulation and protein function implementation, and was indicative of their important roles in essential cell functions. In addition, their structural and functional characteristics were closely related, and suggesting the complexity of the cell regulatory system.
Collapse
Affiliation(s)
- Chao Wu
- Department of Bioinformatics and Bio-pharmaceutical Key Laboratory of Heilongjiang Province and State, Harbin Medical University, Harbin, Heilongjiang 150086, China
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Joshi A, Van Parys T, Van de Peer Y, Michoel T. Characterizing regulatory path motifs in integrated networks using perturbational data. Genome Biol 2010; 11:R32. [PMID: 20230615 PMCID: PMC2864572 DOI: 10.1186/gb-2010-11-3-r32] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2009] [Revised: 10/01/2009] [Accepted: 03/11/2010] [Indexed: 01/12/2023] Open
Abstract
Pathicular – a Cytoscape plugin for analysing cellular responses to transcription factor perturbations is presented We introduce Pathicular http://bioinformatics.psb.ugent.be/software/details/Pathicular, a Cytoscape plugin for studying the cellular response to perturbations of transcription factors by integrating perturbational expression data with transcriptional, protein-protein and phosphorylation networks. Pathicular searches for 'regulatory path motifs', short paths in the integrated physical networks which occur significantly more often than expected between transcription factors and their targets in the perturbational data. A case study in Saccharomyces cerevisiae identifies eight regulatory path motifs and demonstrates their biological significance.
Collapse
Affiliation(s)
- Anagha Joshi
- Department of Plant Systems Biology, VIB, Technologiepark 927, Gent, Belgium.
| | | | | | | |
Collapse
|
44
|
Stark C, Su TC, Breitkreutz A, Lourenco P, Dahabieh M, Breitkreutz BJ, Tyers M, Sadowski I. PhosphoGRID: a database of experimentally verified in vivo protein phosphorylation sites from the budding yeast Saccharomyces cerevisiae. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010; 2010:bap026. [PMID: 20428315 PMCID: PMC2860897 DOI: 10.1093/database/bap026] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Revised: 12/07/2009] [Accepted: 12/23/2009] [Indexed: 12/17/2022]
Abstract
Protein phosphorylation plays a central role in cellular regulation. Recent proteomics strategies for identifying phosphopeptides have been developed using the model organism Saccharomyces cerevisiae, and consequently, when combined with studies of individual gene products, the number of reported specific phosphorylation sites for this organism has expanded enormously. In order to systematically document and integrate these various data types, we have developed a database of experimentally verified in vivo phosphorylation sites curated from the S. cerevisiae primary literature. PhosphoGRID (www.phosphogrid.org) records the positions of over 5000 specific phosphorylated residues on 1495 gene products. Nearly 900 phosphorylated residues are reported from detailed studies of individual proteins; these in vivo phosphorylation sites are documented by a hierarchy of experimental evidence codes. Where available for specific sites, we have also noted the relevant protein kinases and/or phosphatases, the specific condition(s) under which phosphorylation occurs, and the effect(s) that phosphorylation has on protein function. The unique features of PhosphoGRID that assign both function and specific physiological conditions to each phosphorylated residue will provide a valuable benchmark for proteome-level studies and will facilitate bioinformatic analysis of cellular signal transduction networks. Database URL: http://phosphogrid.org/
Collapse
Affiliation(s)
- Chris Stark
- Centre for Systems Biology, Samuel Lunenfeld Research Institute, 600 University Avenue, Toronto, Ontario M5G 1X5, Canada
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Ulitsky I, Krogan NJ, Shamir R. Towards accurate imputation of quantitative genetic interactions. Genome Biol 2009; 10:R140. [PMID: 20003301 PMCID: PMC2812947 DOI: 10.1186/gb-2009-10-12-r140] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Revised: 11/08/2009] [Accepted: 12/10/2009] [Indexed: 12/25/2022] Open
Abstract
A new method for calculating quantitative genetic interactions allows for the inference of 190,000 new genetic interactions in Saccharomyces cerevisae. Recent technological breakthroughs have enabled high-throughput quantitative measurements of hundreds of thousands of genetic interactions among hundreds of genes in Saccharomyces cerevisiae. However, these assays often fail to measure the genetic interactions among up to 40% of the studied gene pairs. Here we present a novel method, which combines genetic interaction data together with diverse genomic data, to quantitatively impute these missing interactions. We also present data on almost 190,000 novel interactions.
Collapse
Affiliation(s)
- Igor Ulitsky
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel.
| | | | | |
Collapse
|
46
|
|
47
|
Siegal-Gaskins D, Grotewold E, Smith GD. The capacity for multistability in small gene regulatory networks. BMC SYSTEMS BIOLOGY 2009; 3:96. [PMID: 19772572 PMCID: PMC2759935 DOI: 10.1186/1752-0509-3-96] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/13/2009] [Accepted: 09/21/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND Recent years have seen a dramatic increase in the use of mathematical modeling to gain insight into gene regulatory network behavior across many different organisms. In particular, there has been considerable interest in using mathematical tools to understand how multistable regulatory networks may contribute to developmental processes such as cell fate determination. Indeed, such a network may subserve the formation of unicellular leaf hairs (trichomes) in the model plant Arabidopsis thaliana. RESULTS In order to investigate the capacity of small gene regulatory networks to generate multiple equilibria, we present a chemical reaction network (CRN)-based modeling formalism and describe a number of methods for CRN analysis in a parameter-free context. These methods are compared and applied to a full set of one-component subnetworks, as well as a large random sample from 40,680 similarly constructed two-component subnetworks. We find that positive feedback and cooperativity mediated by transcription factor (TF) dimerization is a requirement for one-component subnetwork bistability. For subnetworks with two components, the presence of these processes increases the probability that a randomly sampled subnetwork will exhibit multiple equilibria, although we find several examples of bistable two-component subnetworks that do not involve cooperative TF-promoter binding. In the specific case of epidermal differentiation in Arabidopsis, dimerization of the GL3-GL1 complex and cooperative sequential binding of GL3-GL1 to the CPC promoter are each independently sufficient for bistability. CONCLUSION Computational methods utilizing CRN-specific theorems to rule out bistability in small gene regulatory networks are far superior to techniques generally applicable to deterministic ODE systems. Using these methods to conduct an unbiased survey of parameter-free deterministic models of small networks, and the Arabidopsis epidermal cell differentiation subnetwork in particular, we illustrate how future experimental research may be guided by network structure analysis.
Collapse
Affiliation(s)
- Dan Siegal-Gaskins
- Mathematical Bioscience Institute, The Ohio State University, Columbus, OH 43210, USA
- Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center, The Ohio State University, Columbus, OH 43210, USA
| | - Erich Grotewold
- Mathematical Bioscience Institute, The Ohio State University, Columbus, OH 43210, USA
- Department of Plant Cellular and Molecular Biology and Plant Biotechnology Center, The Ohio State University, Columbus, OH 43210, USA
| | - Gregory D Smith
- Mathematical Bioscience Institute, The Ohio State University, Columbus, OH 43210, USA
- Department of Applied Science, The College of William and Mary, Williamsburg, VA 23187, USA
| |
Collapse
|
48
|
Breker M, Schuldiner M. Explorations in topology-delving underneath the surface of genetic interaction maps. MOLECULAR BIOSYSTEMS 2009; 5:1473-81. [PMID: 19763324 DOI: 10.1039/b907076c] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
High throughput assays, as well as advances in computational approaches, have recently allowed the acquisition of vast amounts of genetic interaction (GI) data in several organisms. Since GIs are a functional measure that reports on the effect of a mutation in one gene on the phenotype of a mutation in another, they can serve as a powerful tool to study both the function of individual genes and the wiring of biological networks. Therefore, these data hold much promise for advancing our understanding of cellular systems. In this review we focus on the methodologies currently available for using and interpreting large datasets of GIs for functional gene groups (GI maps), and elaborate on the challenges ahead. In addition, we discuss potential applications for the study of evolution and disease mechanisms, and highlight the need for comprehensive integrative analysis to extract the wealth of information found in these maps.
Collapse
Affiliation(s)
- Michal Breker
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, Israel.
| | | |
Collapse
|
49
|
Mutwil M, Ruprecht C, Giorgi FM, Bringmann M, Usadel B, Persson S. Transcriptional wiring of cell wall-related genes in Arabidopsis. MOLECULAR PLANT 2009; 2:1015-24. [PMID: 19825676 DOI: 10.1093/mp/ssp055] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Transcriptional coordination, or co-expression, of genes may signify functional relatedness of the corresponding proteins. For example, several genes involved in secondary cell wall cellulose biosynthesis are co-expressed with genes engaged in the synthesis of xylan, which is a major component of the secondary cell wall. To extend these types of analyses, we investigated the co-expression relationships of all Carbohydrate-Active enZYmes (CAZy)-related genes for Arabidopsis thaliana. Thus, the intention was to transcriptionally link different cell wall-related processes to each other, and also to other biological functions. To facilitate easy manual inspection, we have displayed these interactions as networks and matrices, and created a web-based interface (http://aranet.mpimp-golm.mpg.de/corecarb) containing downloadable files for all the transcriptional associations.
Collapse
Affiliation(s)
- Marek Mutwil
- Max-Planck-Institute for Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam, Germany
| | | | | | | | | | | |
Collapse
|
50
|
Flannick J, Novak A, Do CB, Srinivasan BS, Batzoglou S. Automatic parameter learning for multiple local network alignment. J Comput Biol 2009; 16:1001-22. [PMID: 19645599 PMCID: PMC3154456 DOI: 10.1089/cmb.2009.0099] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We developed Graemlin 2.0, a new multiple network aligner with (1) a new multi-stage approach to local network alignment; (2) a novel scoring function that can use arbitrary features of a multiple network alignment, such as protein deletions, protein duplications, protein mutations, and interaction losses; (3) a parameter learning algorithm that uses a training set of known network alignments to learn parameters for our scoring function and thereby adapt it to any set of networks; and (4) an algorithm that uses our scoring function to find approximate multiple network alignments in linear time. We tested Graemlin 2.0's accuracy on protein interaction networks from IntAct, DIP, and the Stanford Network Database. We show that, on each of these datasets, Graemlin 2.0 has higher sensitivity and specificity than existing network aligners. Graemlin 2.0 is available under the GNU public license at http://graemlin.stanford.edu .
Collapse
Affiliation(s)
- Jason Flannick
- Department of Computer Science, Stanford University , Stanford, CA 94305, USA.
| | | | | | | | | |
Collapse
|