1
|
Hou L, Li J, Wang H, Chen Q, Su JQ, Gad M, Ahmed W, Yu CP, Hu A. Storm promotes the dissemination of antibiotic resistome in an urban lagoon through enhancing bio-interactions. ENVIRONMENT INTERNATIONAL 2022; 168:107457. [PMID: 35963060 DOI: 10.1016/j.envint.2022.107457] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 07/07/2022] [Accepted: 08/03/2022] [Indexed: 06/15/2023]
Abstract
Antibiotic-resistance genes (ARGs) and resistant bacteria (ARB) are abundant in stormwater that could cause serious infections, posing a potential threat to public health. However, there is no inference about how stormwater contributes to ARG profiles as well as the dynamic interplay between ARGs and bacteria via vertical gene transfer (VGT) or horizontal gene transfer (HGT) in urban water ecosystems. In this study, the distribution of ARGs, their host communities, and the source and community assembly process of ARGs were investigated in Yundang Lagoon (China) via high-throughput quantitative PCR, 16S rRNA gene amplicon sequencing, and application of SourceTracker before, after and recovering from an extreme precipitation event (132.1 mm). The abundance of ARGs and mobile genetic elements (MGEs) was the highest one day after precipitation and then decreased 2 days after precipitation and so on. Based on SourceTracker and NMDS analysis, the ARG and bacterial communities in lagoon surface water from one day after precipitation were mainly contributed by the wastewater treatment plant (WWTP) influent and effluent. However, the contribution of WWTP to ARG communities was minor 11 days after the precipitation, suggesting that the storm promoted the ARG levels by introducing the input of ARGs, MGEs, and ARB from point and non-point sources, such as sewer overflow and land-applied manure. Based on a novel microbial network analysis framework, the contribution of positive biological interactions between ARGs and MGEs or bacteria was the highest one day after precipitation, indicating a promoted VGT and HGT for ARG dissemination. The microbial networks deconstructed 11 days after precipitation, suggesting the stormwater practices (e.g., tide gate opening, diversion channels, and pumping) alleviated the spread of ARGs. These results advanced our understanding of the distribution and transport of ARGs associated with their source in urban stormwater runoff.
Collapse
Affiliation(s)
- Liyuan Hou
- CAS Key Laboratory of Urban Pollutant Conversion, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China; Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Jiangwei Li
- CAS Key Laboratory of Urban Pollutant Conversion, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China; University of Chinese Academy of Sciences, Beijing 100049, China; Fujian Key Laboratory of Watershed Ecology, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China
| | - Hongjie Wang
- CAS Key Laboratory of Urban Pollutant Conversion, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China; University of Chinese Academy of Sciences, Beijing 100049, China; Fujian Key Laboratory of Watershed Ecology, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China
| | - Qingfu Chen
- Yundang Lake Management Center, Xiamen, Fujian 361004, China
| | - Jian-Qiang Su
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, Fujian 361021, China
| | - Mahmoud Gad
- CAS Key Laboratory of Urban Pollutant Conversion, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China; Fujian Key Laboratory of Watershed Ecology, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China; Water Pollution Research Department, National Research Centre, Giza 12622, Egypt
| | - Warish Ahmed
- CSIRO Land and Water, Ecosciences Precinct, 41 Boggo Road, Qld 4102, Australia
| | - Chang-Ping Yu
- CAS Key Laboratory of Urban Pollutant Conversion, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China; Graduate Institute of Environmental Engineering, National Taiwan University, Taipei 106, Taiwan
| | - Anyi Hu
- CAS Key Laboratory of Urban Pollutant Conversion, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China; Fujian Key Laboratory of Watershed Ecology, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China.
| |
Collapse
|
2
|
Three topological features of regulatory networks control life-essential and specialized subsystems. Sci Rep 2021; 11:24209. [PMID: 34930908 PMCID: PMC8688434 DOI: 10.1038/s41598-021-03625-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 12/07/2021] [Indexed: 11/08/2022] Open
Abstract
Gene regulatory networks (GRNs) play key roles in development, phenotype plasticity, and evolution. Although graph theory has been used to explore GRNs, associations amongst topological features, transcription factors (TFs), and systems essentiality are poorly understood. Here we sought the relationship amongst the main GRN topological features that influence the control of essential and specific subsystems. We found that the Knn, page rank, and degree are the most relevant GRN features: the ones are conserved along the evolution and are also relevant in pluripotent cells. Interestingly, life-essential subsystems are governed mainly by TFs with intermediary Knn and high page rank or degree, whereas specialized subsystems are mainly regulated by TFs with low Knn. Hence, we suggest that the high probability of TFs be toured by a random signal, and the high probability of the signal propagation to target genes ensures the life-essential subsystems' robustness. Gene/genome duplication is the main evolutionary process to rise Knn as the most relevant feature. Herein, we shed light on unexplored topological GRN features to assess how they are related to subsystems and how the duplications shaped the regulatory systems along the evolution. The classification model generated can be found here: https://github.com/ivanrwolf/NoC/ .
Collapse
|
3
|
Chagoyen M, Ranea JAG, Pazos F. Applications of molecular networks in biomedicine. Biol Methods Protoc 2019; 4:bpz012. [PMID: 32395629 PMCID: PMC7200821 DOI: 10.1093/biomethods/bpz012] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Revised: 08/20/2019] [Accepted: 08/28/2019] [Indexed: 12/12/2022] Open
Abstract
Due to the large interdependence between the molecular components of living systems, many phenomena, including those related to pathologies, cannot be explained in terms of a single gene or a small number of genes. Molecular networks, representing different types of relationships between molecular entities, embody these large sets of interdependences in a framework that allow their mining from a systemic point of view to obtain information. These networks, often generated from high-throughput omics datasets, are used to study the complex phenomena of human pathologies from a systemic point of view. Complementing the reductionist approach of molecular biology, based on the detailed study of a small number of genes, systemic approaches to human diseases consider that these are better reflected in large and intricate networks of relationships between genes. These networks, and not the single genes, provide both better markers for diagnosing diseases and targets for treating them. Network approaches are being used to gain insight into the molecular basis of complex diseases and interpret the large datasets associated with them, such as genomic variants. Network formalism is also suitable for integrating large, heterogeneous and multilevel datasets associated with diseases from the molecular level to organismal and epidemiological scales. Many of these approaches are available to nonexpert users through standard software packages.
Collapse
Affiliation(s)
- Monica Chagoyen
- Computational Systems Biology Group, Systems Biology Program, National Centre for Biotechnology (CNB-CSIC), Madrid, Spain
| | - Juan A G Ranea
- Department of Molecular Biology and Biochemistry, University of Malaga, Malaga, Spain
- CIBER de Enfermedades Raras, Instituto de Salud Carlos III, Madrid, Spain
| | - Florencio Pazos
- Computational Systems Biology Group, Systems Biology Program, National Centre for Biotechnology (CNB-CSIC), Madrid, Spain
| |
Collapse
|
4
|
Lysenko A, Boroevich KA, Tsunoda T. Arete - candidate gene prioritization using biological network topology with additional evidence types. BioData Min 2017; 10:22. [PMID: 28694847 PMCID: PMC5501438 DOI: 10.1186/s13040-017-0141-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Accepted: 06/12/2017] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Refinement of candidate gene lists to select the most promising candidates for further experimental verification remains an essential step between high-throughput exploratory analysis and the discovery of specific causal genes. Given the qualitative and semantic complexity of biological data, successfully addressing this challenge requires development of flexible and interoperable solutions for making the best possible use of the largest possible fraction of all available data. RESULTS We have developed an easily accessible framework that links two established network-based gene prioritization approaches with a supporting isolation forest-based integrative ranking method. The defining feature of the method is that both topological information of the biological networks and additional sources of evidence can be considered at the same time. The implementation was realized as an app extension for the Cytoscape graph analysis suite, and therefore can further benefit from the synergy with other analysis methods available as part of this system. CONCLUSIONS We provide efficient reference implementations of two popular gene prioritization algorithms - DIAMOnD and random walk with restart for the Cytoscape system. An extension of those methods was also developed that allows outputs of these algorithms to be combined with additional data. To demonstrate the utility of our software, we present two example disease gene prioritization application cases and show how our tool can be used to evaluate these different approaches.
Collapse
Affiliation(s)
- Artem Lysenko
- Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi, Yokohama, 230-0045 Japan
| | - Keith Anthony Boroevich
- Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi, Yokohama, 230-0045 Japan
| | - Tatsuhiko Tsunoda
- Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi, Yokohama, 230-0045 Japan.,Department of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University, 1-5-45 Yushima, Bunkyo-ku, Tokyo, 113-8510 Japan.,CREST, JST, Tokyo, 113-8510 Japan
| |
Collapse
|
5
|
Nguyen TTT, Chua JKK, Seah KS, Koo SH, Yee JY, Yang EG, Lim KK, Pang SYW, Yuen A, Zhang L, Ang WH, Dymock B, Lee EJD, Chen ES. Predicting chemotherapeutic drug combinations through gene network profiling. Sci Rep 2016; 6:18658. [PMID: 26791325 PMCID: PMC4726371 DOI: 10.1038/srep18658] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 11/23/2015] [Indexed: 12/29/2022] Open
Abstract
Contemporary chemotherapeutic treatments incorporate the use of several agents in combination. However, selecting the most appropriate drugs for such therapy is not necessarily an easy or straightforward task. Here, we describe a targeted approach that can facilitate the reliable selection of chemotherapeutic drug combinations through the interrogation of drug-resistance gene networks. Our method employed single-cell eukaryote fission yeast (Schizosaccharomyces pombe) as a model of proliferating cells to delineate a drug resistance gene network using a synthetic lethality workflow. Using the results of a previous unbiased screen, we assessed the genetic overlap of doxorubicin with six other drugs harboring varied mechanisms of action. Using this fission yeast model, drug-specific ontological sub-classifications were identified through the computation of relative hypersensitivities. We found that human gastric adenocarcinoma cells can be sensitized to doxorubicin by concomitant treatment with cisplatin, an intra-DNA strand crosslinking agent, and suberoylanilide hydroxamic acid, a histone deacetylase inhibitor. Our findings point to the utility of fission yeast as a model and the differential targeting of a conserved gene interaction network when screening for successful chemotherapeutic drug combinations for human cells.
Collapse
Affiliation(s)
- Thi Thuy Trang Nguyen
- Department of Biochemistry, National University of Singapore, Singapore.,National University Health System (NUHS), Singapore
| | - Jacqueline Kia Kee Chua
- Department of Biochemistry, National University of Singapore, Singapore.,Department of Chemistry, Faculty of Science, National University of Singapore, Singapore
| | - Kwi Shan Seah
- Department of Biochemistry, National University of Singapore, Singapore.,National University Health System (NUHS), Singapore
| | - Seok Hwee Koo
- Department of Pharmacology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore.,Changi General Hospital, Ministry of Health, Singapore
| | - Jie Yin Yee
- National University Health System (NUHS), Singapore.,Department of Pharmacology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | - Eugene Guorong Yang
- Department of Pharmacy, Faculty of Science, National University of Singapore, Singapore
| | - Kim Kiat Lim
- Department of Biochemistry, National University of Singapore, Singapore.,National University Health System (NUHS), Singapore
| | | | - Audrey Yuen
- School of Chemical and Life Sciences, Singapore Polytechnic, Singapore
| | - Louxin Zhang
- Department of Mathematics, Faculty of Science, National University of Singapore, Singapore
| | - Wee Han Ang
- NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore, Singapore.,Department of Chemistry, Faculty of Science, National University of Singapore, Singapore
| | - Brian Dymock
- Department of Pharmacy, Faculty of Science, National University of Singapore, Singapore
| | - Edmund Jon Deoon Lee
- National University Health System (NUHS), Singapore.,Department of Pharmacology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | - Ee Sin Chen
- Department of Biochemistry, National University of Singapore, Singapore.,National University Health System (NUHS), Singapore.,NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), Life Sciences Institute, National University of Singapore.,NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore, Singapore
| |
Collapse
|
6
|
Pritykin Y, Ghersi D, Singh M. Genome-Wide Detection and Analysis of Multifunctional Genes. PLoS Comput Biol 2015; 11:e1004467. [PMID: 26436655 PMCID: PMC4593560 DOI: 10.1371/journal.pcbi.1004467] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 07/19/2015] [Indexed: 12/25/2022] Open
Abstract
Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. Almost every aspect of cellular function depends on protein activity. In spite of being fine-tuned to carry out highly specific functions, proteins can also multitask. Experimental studies have identified genes and proteins endowed with more than one molecular function, or participating in very different biological processes. These studies suggest that the degree of functional plasticity exhibited by proteins might go well beyond a simple “one protein—one function” relationship. However, systematic studies of the properties of multifunctional genes (and their encoded proteins) have been limited. Here we present a computational framework to identify putative multifunctional genes, and compare their properties with those of other genes. We find that multifunctional genes are significantly different from other genes with respect to their physicochemical properties, expression profiles, and interaction properties. We also observe that multifunctional genes tend to be more conserved, and that a greater fraction of them are associated with human disorders. Taken together, these results represent a step towards a more complete understanding of the role multifunctional genes play in the functional organization of the cell.
Collapse
Affiliation(s)
- Yuri Pritykin
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
- Lewis–Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Dario Ghersi
- Lewis–Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- School of Interdisciplinary Informatics, University of Nebraska at Omaha, Omaha, Nebraska, United States of America
- * E-mail: (DG); (MS)
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
- Lewis–Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- * E-mail: (DG); (MS)
| |
Collapse
|
7
|
Hulsman M, Dimitrakopoulos C, de Ridder J. Scale-space measures for graph topology link protein network architecture to function. ACTA ACUST UNITED AC 2014; 30:i237-45. [PMID: 24931989 PMCID: PMC4058939 DOI: 10.1093/bioinformatics/btu283] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
Motivation: The network architecture of physical protein interactions is an important determinant for the molecular functions that are carried out within each cell. To study this relation, the network architecture can be characterized by graph topological characteristics such as shortest paths and network hubs. These characteristics have an important shortcoming: they do not take into account that interactions occur across different scales. This is important because some cellular functions may involve a single direct protein interaction (small scale), whereas others require more and/or indirect interactions, such as protein complexes (medium scale) and interactions between large modules of proteins (large scale). Results: In this work, we derive generalized scale-aware versions of known graph topological measures based on diffusion kernels. We apply these to characterize the topology of networks across all scales simultaneously, generating a so-called graph topological scale-space. The comprehensive physical interaction network in yeast is used to show that scale-space based measures consistently give superior performance when distinguishing protein functional categories and three major types of functional interactions—genetic interaction, co-expression and perturbation interactions. Moreover, we demonstrate that graph topological scale spaces capture biologically meaningful features that provide new insights into the link between function and protein network architecture. Availability and implementation: MatlabTM code to calculate the scale-aware topological measures (STMs) is available at http://bioinformatics.tudelft.nl/TSSA Contact:j.deridder@tudelft.nl Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Marc Hulsman
- Delft Bioinformatics Lab, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, 2628CD Delft, The Netherlands
| | - Christos Dimitrakopoulos
- Delft Bioinformatics Lab, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, 2628CD Delft, The Netherlands
| | - Jeroen de Ridder
- Delft Bioinformatics Lab, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, 2628CD Delft, The Netherlands
| |
Collapse
|
8
|
Suratanee A, Plaimas K. Identification of inflammatory bowel disease-related proteins using a reverse k-nearest neighbor search. J Bioinform Comput Biol 2014; 12:1450017. [DOI: 10.1142/s0219720014500176] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Inflammatory bowel disease (IBD) is a chronic disease whose incidence and prevalence increase every year; however, the pathogenesis of IBD is still unclear. Thus, identifying IBD-related proteins is important for understanding its complex disease mechanism. Here, we propose a new and simple network-based approach using a reverse k-nearest neighbor ( R k NN ) search to identify novel IBD-related proteins. Protein–protein interactions (PPI) and Genome-Wide Association Studies (GWAS) were used in this study. After constructing the PPI network, the R k NN search was applied to all of the proteins to identify sets of influenced proteins among their k-nearest neighbors ( R k NNs ). An observed protein whose influenced proteins were mostly known IBD-related proteins was statistically identified as a novel IBD-related protein. Our method outperformed a random aspect, k NN search, and centrality measures based on the network topology. A total of 39 proteins were identified as IBD-related proteins. Of these proteins, 71% were reported at least once in the literature as related to IBD. Additionally, these proteins were found over-represented in the IBD pathway and enriched in importantly functional pathways in IBD. In conclusion, the R k NN search with the statistical enrichment test is a great tool to identify IBD-related proteins to better understand its complex disease mechanism.
Collapse
Affiliation(s)
- Apichat Suratanee
- Department of Mathematics, Faculty of Applied Science, King Mongkut's University of Technology North Bangkok, 1518 Pracharat 1 Road, Wongsawang, Bangsue, Bangkok 10800, Thailand
| | - Kitiporn Plaimas
- Integrative Bioinformatics and Systems Biology Group, Advanced Virtual and Intelligent Computing Research Center (AVIC), Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Phyathai Road, Patumwan, Bangkok 10330, Thailand
| |
Collapse
|
9
|
Otasek D, Pastrello C, Holzinger A, Jurisica I. Visual Data Mining: Effective Exploration of the Biological Universe. INTERACTIVE KNOWLEDGE DISCOVERY AND DATA MINING IN BIOMEDICAL INFORMATICS 2014. [DOI: 10.1007/978-3-662-43968-5_2] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
10
|
Ghersi D, Singh M. Interaction-based discovery of functionally important genes in cancers. Nucleic Acids Res 2013; 42:e18. [PMID: 24362839 PMCID: PMC3919581 DOI: 10.1093/nar/gkt1305] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
A major challenge in cancer genomics is uncovering genes with an active role in tumorigenesis from a potentially large pool of mutated genes across patient samples. Here we focus on the interactions that proteins make with nucleic acids, small molecules, ions and peptides, and show that residues within proteins that are involved in these interactions are more frequently affected by mutations observed in large-scale cancer genomic data than are other residues. We leverage this observation to predict genes that play a functionally important role in cancers by introducing a computational pipeline (http://canbind.princeton.edu) for mapping large-scale cancer exome data across patients onto protein structures, and automatically extracting proteins with an enriched number of mutations affecting their nucleic acid, small molecule, ion or peptide binding sites. Using this computational approach, we show that many previously known genes implicated in cancers are enriched in mutations within the binding sites of their encoded proteins. By focusing on functionally relevant portions of proteins--specifically those known to be involved in molecular interactions--our approach is particularly well suited to detect infrequent mutations that may nonetheless be important in cancer, and should aid in expanding our functional understanding of the genomic landscape of cancer.
Collapse
Affiliation(s)
- Dario Ghersi
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA and Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
| | | |
Collapse
|
11
|
Li P, Hua X, Zhang Z, Li J, Wang J. Characterization of regulatory features of housekeeping and tissue-specific regulators within tissue regulatory networks. BMC SYSTEMS BIOLOGY 2013; 7:112. [PMID: 24172660 PMCID: PMC3843562 DOI: 10.1186/1752-0509-7-112] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Accepted: 10/28/2013] [Indexed: 01/10/2023]
Abstract
Background Transcription factors (TFs) and miRNAs are essential for the regulation of gene expression; however, the global view of human gene regulatory networks remains poorly understood. For example, how is the expression of so many genes regulated by limited cohorts of regulators and how are genes differentially expressed in different tissues despite the genetic code being the same in all tissues? Results We analyzed the network properties of housekeeping and tissue-specific genes in gene regulatory networks from seven human tissues. Our results show that different classes of genes behave quite differently in these networks. Tissue-specific miRNAs show a higher average target number compared with non-tissue specific miRNAs, which indicates that tissue-specific miRNAs tend to regulate different sets of targets. Tissue-specific TFs exhibit higher in-degree, out-degree, cluster coefficient and betweenness values, indicating that they occupy central positions in the regulatory network and that they transfer genetic information from upstream genes to downstream genes more quickly than other TFs. Housekeeping TFs tend to have higher cluster coefficients compared with other genes that are neither housekeeping nor tissue specific, indicating that housekeeping TFs tend to regulate their targets synergistically. Several topological properties of disease-associated miRNAs and genes were found to be significantly different from those of non-disease-associated miRNAs and genes. Conclusions Tissue-specific miRNAs, TFs and disease genes have particular topological properties within the transcriptional regulatory networks of the seven human tissues examined. The tendency of tissue-specific miRNAs to regulate different sets of genes shows that a particular tissue-specific miRNA and its target gene set may form a regulatory module to execute particular functions in the process of tissue differentiation. The regulatory patterns of tissue-specific TFs reflect their vital role in regulatory networks and their importance to biological functions in their respective tissues. The topological differences between disease and non-disease genes may aid the discovery of new disease genes or drug targets. Determining the network properties of these regulatory factors will help define the basic principles of human gene regulation and the molecular mechanisms of disease.
Collapse
Affiliation(s)
| | | | | | - Jie Li
- The State Key Laboratory of Pharmaceutical Biotechnology, Jiangsu Engineering Research Center for MicroRNA Biology and Biotechnology, School of Life Science, Nanjing University, Nanjing, China.
| | | |
Collapse
|
12
|
Simple topological features reflect dynamics and modularity in protein interaction networks. PLoS Comput Biol 2013; 9:e1003243. [PMID: 24130468 PMCID: PMC3794914 DOI: 10.1371/journal.pcbi.1003243] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2013] [Accepted: 08/14/2013] [Indexed: 11/30/2022] Open
Abstract
The availability of large-scale protein-protein interaction networks for numerous organisms provides an opportunity to comprehensively analyze whether simple properties of proteins are predictive of the roles they play in the functional organization of the cell. We begin by re-examining an influential but controversial characterization of the dynamic modularity of the S. cerevisiae interactome that incorporated gene expression data into network analysis. We analyse the protein-protein interaction networks of five organisms, S. cerevisiae, H. sapiens, D. melanogaster, A. thaliana, and E. coli, and confirm significant and consistent functional and structural differences between hub proteins that are co-expressed with their interacting partners and those that are not, and support the view that the former tend to be intramodular whereas the latter tend to be intermodular. However, we also demonstrate that in each of these organisms, simple topological measures are significantly correlated with the average co-expression of a hub with its partners, independent of any classification, and therefore also reflect protein intra- and inter- modularity. Further, cross-interactomic analysis demonstrates that these simple topological characteristics of hub proteins tend to be conserved across organisms. Overall, we give evidence that purely topological features of static interaction networks reflect aspects of the dynamics and modularity of interactomes as well as previous measures incorporating expression data, and are a powerful means for understanding the dynamic roles of hubs in interactomes. A better understanding of protein interaction networks would be a great aid in furthering our knowledge of the molecular biology of the cell. Towards this end, large-scale protein-protein physical interaction data have been determined for organisms across the evolutionary spectrum. However, the resulting networks give a static view of interactomes, and our knowledge about protein interactions is rarely time or context specific. A previous prominent but controversial attempt to characterize the dynamic modularity of the interactome was based on integrating physical interaction data with gene activity measurements from transcript expression data. This analysis distinguished between proteins that are co-expressed with their interacting partners and those that are not, and argued that the former are intramodular and the latter are intermodular. By analyzing the interactomes of five organisms, we largely confirm the biological significance of this characterization through a variety of statistical tests and computational experiments. Surprisingly, however, we find that similar results can be obtained using just network information without additionally integrating expression data, suggesting that purely topological characteristics of interaction networks strongly reflect certain aspects of the dynamics and modularity of interactomes.
Collapse
|
13
|
Klapa MI, Tsafou K, Theodoridis E, Tsakalidis A, Moschonas NK. Reconstruction of the experimentally supported human protein interactome: what can we learn? BMC SYSTEMS BIOLOGY 2013; 7:96. [PMID: 24088582 PMCID: PMC4015887 DOI: 10.1186/1752-0509-7-96] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2013] [Accepted: 09/25/2013] [Indexed: 02/02/2023]
Abstract
BACKGROUND Understanding the topology and dynamics of the human protein-protein interaction (PPI) network will significantly contribute to biomedical research, therefore its systematic reconstruction is required. Several meta-databases integrate source PPI datasets, but the protein node sets of their networks vary depending on the PPI data combined. Due to this inherent heterogeneity, the way in which the human PPI network expands via multiple dataset integration has not been comprehensively analyzed. We aim at assembling the human interactome in a global structured way and exploring it to gain insights of biological relevance. RESULTS First, we defined the UniProtKB manually reviewed human "complete" proteome as the reference protein-node set and then we mined five major source PPI datasets for direct PPIs exclusively between the reference proteins. We updated the protein and publication identifiers and normalized all PPIs to the UniProt identifier level. The reconstructed interactome covers approximately 60% of the human proteome and has a scale-free structure. No apparent differentiating gene functional classification characteristics were identified for the unrepresented proteins. The source dataset integration augments the network mainly in PPIs. Polyubiquitin emerged as the highest-degree node, but the inclusion of most of its identified PPIs may be reconsidered. The high number (>300) of connections of the subsequent fifteen proteins correlates well with their essential biological role. According to the power-law network structure, the unrepresented proteins should mainly have up to four connections with equally poorly-connected interactors. CONCLUSIONS Reconstructing the human interactome based on the a priori definition of the protein nodes enabled us to identify the currently included part of the human "complete" proteome, and discuss the role of the proteins within the network topology with respect to their function. As the network expansion has to comply with the scale-free theory, we suggest that the core of the human interactome has essentially emerged. Thus, it could be employed in systems biology and biomedical research, despite the considerable number of currently unrepresented proteins. The latter are probably involved in specialized physiological conditions, justifying the scarcity of related PPI information, and their identification can assist in designing relevant functional experiments and targeted text mining algorithms.
Collapse
Affiliation(s)
- Maria I Klapa
- Department of General Biology, School of Medicine, University of Patras, Rio, Patras, Greece.
| | | | | | | | | |
Collapse
|