101
|
Zhou X, Shi T, Li B, Zhang Y, Shen X, Li H, Hong G, Liu C, Guo Z. Genes dysregulated to different extent or oppositely in estrogen receptor-positive and estrogen receptor-negative breast cancers. PLoS One 2013; 8:e70017. [PMID: 23875016 PMCID: PMC3715479 DOI: 10.1371/journal.pone.0070017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2012] [Accepted: 06/14/2013] [Indexed: 12/22/2022] Open
Abstract
Background Directly comparing gene expression profiles of estrogen receptor-positive (ER+) and estrogen receptor-negative (ER−) breast cancers cannot determine whether differentially expressed genes between these two subtypes result from dysregulated expression in ER+ cancer or ER− cancer versus normal controls, and thus would miss critical information for elucidating the transcriptomic difference between the two subtypes. Principal Findings Using microarray datasets from TCGA, we classified the genes dysregulated in both ER+ and ER− cancers versus normal controls into two classes: (i) genes dysregulated in the same direction but to a different extent, and (ii) genes dysregulated to opposite directions, and then validated the two classes in RNA-sequencing datasets of independent cohorts. We showed that the genes dysregulated to a larger extent in ER+ cancers than in ER− cancers enriched in glycerophospholipid and polysaccharide metabolic processes, while the genes dysregulated to a larger extent in ER− cancers than in ER+ cancers enriched in cell proliferation. Phosphorylase kinase and enzymes of glycosylphosphatidylinositol (GPI) anchor biosynthesis were upregulated to a larger extent in ER+ cancers than in ER− cancers, whereas glycogen synthase and phospholipase A2 were downregulated to a larger extent in ER+ cancers than in ER− cancers. We also found that the genes oppositely dysregulated in the two subtypes significantly enriched with known cancer genes and tended to closely collaborate with the cancer genes. Furthermore, we showed the possibility that these oppositely dysregulated genes could contribute to carcinogenesis of ER+ and ER− cancers through rewiring different subpathways. Conclusions GPI-anchor biosynthesis and glycogenolysis were elevated and hydrolysis of phospholipids was depleted to a larger extent in ER+ cancers than in ER− cancers. Our findings indicate that the genes oppositely dysregulated in the two subtypes are potential cancer genes which could contribute to carcinogenesis of both ER+ and ER− cancers through rewiring different subpathways.
Collapse
Affiliation(s)
- Xianxiao Zhou
- Bioinformatics Centre and Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Tongwei Shi
- Bioinformatics Centre and Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Bailiang Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- Genomics Research Center, Harbin Medical University, Harbin, China
| | - Yuannv Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Xiaopei Shen
- Bioinformatics Centre and Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hongdong Li
- Bioinformatics Centre and Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Guini Hong
- Bioinformatics Centre and Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Chunyang Liu
- Department of Bioinformatics, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
| | - Zheng Guo
- Bioinformatics Centre and Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
- Department of Bioinformatics, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China
- * E-mail:
| |
Collapse
|
102
|
Predicting potential cancer genes by integrating network properties, sequence features and functional annotations. SCIENCE CHINA-LIFE SCIENCES 2013; 56:751-7. [PMID: 23838808 DOI: 10.1007/s11427-013-4500-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2012] [Accepted: 05/14/2013] [Indexed: 10/26/2022]
Abstract
The discovery of novel cancer genes is one of the main goals in cancer research. Bioinformatics methods can be used to accelerate cancer gene discovery, which may help in the understanding of cancer and the development of drug targets. In this paper, we describe a classifier to predict potential cancer genes that we have developed by integrating multiple biological evidence, including protein-protein interaction network properties, and sequence and functional features. We detected 55 features that were significantly different between cancer genes and non-cancer genes. Fourteen cancer-associated features were chosen to train the classifier. Four machine learning methods, logistic regression, support vector machines (SVMs), BayesNet and decision tree, were explored in the classifier models to distinguish cancer genes from non-cancer genes. The prediction power of the different models was evaluated by 5-fold cross-validation. The area under the receiver operating characteristic curve for logistic regression, SVM, Baysnet and J48 tree models was 0.834, 0.740, 0.800 and 0.782, respectively. Finally, the logistic regression classifier with multiple biological features was applied to the genes in the Entrez database, and 1976 cancer gene candidates were identified. We found that the integrated prediction model performed much better than the models based on the individual biological evidence, and the network and functional features had stronger powers than the sequence features in predicting cancer genes.
Collapse
|
103
|
Laenen G, Thorrez L, Börnigen D, Moreau Y. Finding the targets of a drug by integration of gene expression data with a protein interaction network. MOLECULAR BIOSYSTEMS 2013; 9:1676-1685. [PMID: 23443074 DOI: 10.1039/c3mb25438k] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
Polypharmacology, which focuses on designing drugs that bind efficiently to multiple targets, has emerged as a new strategic trend in today's drug discovery research. Many successful drugs achieve their effects via multi-target interactions. However, these targets are largely unknown for both marketed drugs and drugs in development. A better knowledge of a drug's mode of action could be of substantial value to future drug development, in particular for side effect prediction and drug repositioning. We propose a network-based computational method for drug target prediction, applicable on a genome-wide scale. Our approach relies on the analysis of gene expression following drug treatment in the context of a functional protein association network. By diffusing differential expression signals to neighboring or correlated nodes in the network, genes are prioritized as potential targets based on the transcriptional response of functionally related genes. Different diffusion strategies were evaluated on 235 publicly available gene expression datasets for treatment with bioactive molecules having a known target. AUC values of up to more than 90% demonstrate the effectiveness of our approach and indicate the predictive power of integrating experimental gene expression data with prior knowledge from protein association networks.
Collapse
Affiliation(s)
- Griet Laenen
- KU Leuven, Department of Electrical Engineering-ESAT, SCD-SISTA, Leuven, Belgium
| | | | | | | |
Collapse
|
104
|
Ruan P, Hayashida M, Maruyama O, Akutsu T. Prediction of heterodimeric protein complexes from weighted protein-protein interaction networks using novel features and kernel functions. PLoS One 2013; 8:e65265. [PMID: 23776458 PMCID: PMC3679142 DOI: 10.1371/journal.pone.0065265] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Accepted: 04/23/2013] [Indexed: 12/30/2022] Open
Abstract
Since many proteins express their functional activity by interacting with other proteins and forming protein complexes, it is very useful to identify sets of proteins that form complexes. For that purpose, many prediction methods for protein complexes from protein-protein interactions have been developed such as MCL, MCODE, RNSC, PCP, RRW, and NWE. These methods have dealt with only complexes with size of more than three because the methods often are based on some density of subgraphs. However, heterodimeric protein complexes that consist of two distinct proteins occupy a large part according to several comprehensive databases of known complexes. In this paper, we propose several feature space mappings from protein-protein interaction data, in which each interaction is weighted based on reliability. Furthermore, we make use of prior knowledge on protein domains to develop feature space mappings, domain composition kernel and its combination kernel with our proposed features. We perform ten-fold cross-validation computational experiments. These results suggest that our proposed kernel considerably outperforms the naive Bayes-based method, which is the best existing method for predicting heterodimeric protein complexes.
Collapse
Affiliation(s)
- Peiying Ruan
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
| | - Morihiro Hayashida
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
| | - Osamu Maruyama
- Institute of Mathematics for Industry, Kyushu University, 744 Motooka, Nishi-ku, Fukuoka, Japan
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
| |
Collapse
|
105
|
Huang CH, Chou SY, Ng KL. Improving protein complex classification accuracy using amino acid composition profile. Comput Biol Med 2013; 43:1196-204. [PMID: 23930814 DOI: 10.1016/j.compbiomed.2013.05.026] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2012] [Revised: 05/29/2013] [Accepted: 05/30/2013] [Indexed: 11/18/2022]
Abstract
Protein complex prediction approaches are based on the assumptions that complexes have dense protein-protein interactions and high functional similarity between their subunits. We investigated those assumptions by studying the subunits' interaction topology, sequence similarity and molecular function for human and yeast protein complexes. Inclusion of amino acids' physicochemical properties can provide better understanding of protein complex properties. Principal component analysis is carried out to determine the major features. Adopting amino acid composition profile information with the SVM classifier serves as an effective post-processing step for complexes classification. Improvement is based on primary sequence information only, which is easy to obtain.
Collapse
Affiliation(s)
- Chien-Hung Huang
- Department of Computer Science and Information Engineering, National Formosa University, 64, Wen-Hwa Road, Hu-wei, Yun-Lin 632, Taiwan
| | | | | |
Collapse
|
106
|
Kundu N, Dozier U, Deslandes L, Somssich IE, Ullah H. Arabidopsis scaffold protein RACK1A interacts with diverse environmental stress and photosynthesis related proteins. PLANT SIGNALING & BEHAVIOR 2013; 8:e24012. [PMID: 23435172 PMCID: PMC3906143 DOI: 10.4161/psb.24012] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Revised: 02/14/2013] [Accepted: 02/14/2013] [Indexed: 05/20/2023]
Abstract
Scaffold proteins are known to regulate important cellular processes by interacting with multiple proteins to modulate molecular responses. RACK1 (Receptor for Activated C Kinase 1) is a WD-40 type scaffold protein, conserved in eukaryotes, from Chlamydymonas to plants and humans, expresses ubiquitously and plays regulatory roles in diverse signal transduction and stress response pathways. Here we present the use of Arabidopsis RACK1A, the predominant isoform of a 3-member family, as a bait to screen a split-ubiquitin based cDNA library. In total 97 proteins from dehydration, salt stress, ribosomal and photosynthesis pathways are found to potentially interact with RACK1A. False positive interactions were eliminated following extensive selection based growth potentials. Confirmation of a sub-set of selected interactions is demonstrated through the co-transformation with individual plasmid containing cDNA and the respective bait. Interaction of diverse proteins points to a regulatory role of RACK1A in the cross-talk between signaling pathways. Promoter analysis of the stress and photosynthetic pathway genes revealed conserved transcription factor binding sites. RACK1A is known to be a multifunctional protein and the current identification of potential interacting proteins and future in vivo elucidations of the physiological basis of such interactions will shed light on the possible molecular mechanisms that RACK1A uses to regulate diverse signaling pathways.
Collapse
Affiliation(s)
- Nabanita Kundu
- Department of Biology; Howard University; Washington, DC USA
| | - Uvetta Dozier
- Department of Biology; Howard University; Washington, DC USA
| | - Laurent Deslandes
- Department of Plant Developmental Biology; Max Planck Institute for Plant Breeding Research; Köln, Germany
| | - Imre E. Somssich
- Department of Plant Developmental Biology; Max Planck Institute for Plant Breeding Research; Köln, Germany
| | - Hemayet Ullah
- Department of Biology; Howard University; Washington, DC USA
- Correspondence to: Hemayet Ullah,
| |
Collapse
|
107
|
Zhang T, Xie N, He W, Liu R, Lei Y, Chen Y, Tang H, Liu B, Huang C, Wei Y. An integrated proteomics and bioinformatics analyses of hepatitis B virus X interacting proteins and identification of a novel interactor apoA-I. J Proteomics 2013; 84:92-105. [PMID: 23568022 DOI: 10.1016/j.jprot.2013.03.028] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2013] [Revised: 03/01/2013] [Accepted: 03/12/2013] [Indexed: 02/05/2023]
Abstract
UNLABELLED HBx is well-known to be a multifunctional protein encoded by HBV and its biological functions are mainly dependent on pleiotropic protein-protein interactions (PPIs); however, the global mapping of HBx-interactome has not been established so far. Thus, in this study, we have identified 127 HBx-interacting proteins by a profound GST pull-down assay coupled with mass spectrometry, and constructed an HBx-interactome network and core apoA-I pathways with a series of bioinformatics approaches. One of the identified HBx-binding partners is apolipoprotein A-I (apoA-I), which has a specific role in lipid and cholesterol metabolism. The HBx-apoA-I protein interaction was confirmed by both GST pull-down and co-immunoprecipitation. The ectopic overexpression of apoA-I can lead to a significant inhibition on HBV secretion concomitant with the reduction of cellular cholesterol level. In addition, HBV can modulate the function of apoA-I through HBx which might interact with the 44-189 residues of apoA-I and result in dysfunction of apoA-I such as decreased self-association ability, increased carbonyl level and impaired lipid-binding ability. Our results demonstrate an integrated physical association of HBx and host proteins, especially a novel interactor apoA-I that may influence the HBV secretion, which would shed new light on exploring the complicated mechanisms of HBV manipulation on host cellular functions. BIOLOGICAL SIGNIFICANCE HBx is well-known to be a multifunctional protein encoded by HBV and its biological functions are mainly dependent on pleiotropic protein-protein interactions. Although a series of HBx-interacting proteins have been identified, a global characterization of HBx interactome has not been reported. In this study, we have identified a total of 127 HBx-interacting proteins by a profound GST pull-down assay coupled with mass spectrometry, and constructed an HBx-interactome network with a series of bioinformatics approaches. Our results demonstrate an integrated physical association of HBx and host proteins which may help us explore the complicated mechanisms of HBV manipulation on host cellular functions. In addition, we validated one of the identified HBx-binding partners, apolipoprotein A-I (apoA-I), which played a significant inhibitory effect on HBV secretion, indicating a crucial role of the HBx-apoA-I axis in HBV life cycle.
Collapse
Affiliation(s)
- Tao Zhang
- The State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, Chengdu 610041, PR China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
108
|
Paparountas T, Nikolaidou-Katsaridou MN, Rustici G, Aidinis V. Data Mining and Meta-Analysis on DNA Microarray Data. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Microarray technology enables high-throughput parallel gene expression analysis, and use has grown exponentially thanks to the development of a variety of applications for expression, genetics and epigenetic studies. A wealth of data is now available from public repositories, providing unprecedented opportunities for meta-analysis approaches, which could generate new biological information, unrelated to the original scope of individual studies. This study provides a guideline for identification of biological significance of the statistically-selected differentially-expressed genes derived from gene expression arrays as well as to suggest further analysis pathways. The authors review the prerequisites for data-mining and meta-analysis, summarize the conceptual methods to derive biological information from microarray data and suggest software for each category of data mining or meta-analysis.
Collapse
Affiliation(s)
| | | | - Gabriella Rustici
- European Molecular Biology Laboratory-European Bioinformatics Institute, UK
| | - Vasilis Aidinis
- Biomedical Sciences Research Center “Alexander Fleming”, Greece
| |
Collapse
|
109
|
Dimitrakopoulos CM, Theofilatos KA, Georgopoulos EF, Likothanassis S, Tsakalidis A, Mavroudi SP. Efficient Computational Construction of Weighted Protein-Protein Interaction Networks Using Adaptive Filtering Techniques Combined with Natural Selection-Based Heuristic Algorithms. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
The analysis of protein-protein interactions (PPIs) is crucial to the understanding of cellular processes. In recent years, a variety of computational methods have been developed to supplement the interactions that have been detected experimentally. The article’s main objective is to present a novel classification framework for predicting PPIs combining the advantages of two algorithmic methods’ categories. State-of-the-art adaptive filtering techniques were combined with the most contemporary heuristic methods which are based in the natural selection process. The authors’ goal is to find a simple mathematical equation that governs the best classifier enabling the extraction of biological knowledge. The proposed methodology assigns a confidence score to each protein pair and as a result a weighted PPI network is constructed. All possible combinations of the selected adaptive filtering and heuristic techniques were used and comparisons were made to explore the classifiers with the highest performance and interpretability.
Collapse
|
110
|
Leung MCK, Rooney JP, Ryde IT, Bernal AJ, Bess AS, Crocker TL, Ji AQ, Meyer JN. Effects of early life exposure to ultraviolet C radiation on mitochondrial DNA content, transcription, ATP production, and oxygen consumption in developing Caenorhabditis elegans. BMC Pharmacol Toxicol 2013; 14:9. [PMID: 23374645 PMCID: PMC3621653 DOI: 10.1186/2050-6511-14-9] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2012] [Accepted: 01/14/2013] [Indexed: 11/25/2022] Open
Abstract
Background Mitochondrial DNA (mtDNA) is present in multiple copies per cell and undergoes dramatic amplification during development. The impacts of mtDNA damage incurred early in development are not well understood, especially in the case of types of mtDNA damage that are irreparable, such as ultraviolet C radiation (UVC)-induced photodimers. Methods We exposed first larval stage nematodes to UVC using a protocol that results in accumulated mtDNA damage but permits nuclear DNA (nDNA) repair. We then measured the transcriptional response, as well as oxygen consumption, ATP levels, and mtDNA copy number through adulthood. Results Although the mtDNA damage persisted to the fourth larval stage, we observed only a relatively minor ~40% decrease in mtDNA copy number. Transcriptomic analysis suggested an inhibition of aerobic metabolism and developmental processes; mRNA levels for mtDNA-encoded genes were reduced ~50% at 3 hours post-treatment, but recovered and, in some cases, were upregulated at 24 and 48 hours post-exposure. The mtDNA polymerase γ was also induced ~8-fold at 48 hours post-exposure. Moreover, ATP levels and oxygen consumption were reduced in response to UVC exposure, with marked reductions of ~50% at the later larval stages. Conclusions These results support the hypothesis that early life exposure to mitochondrial genotoxicants could result in mitochondrial dysfunction at later stages of life, thereby highlighting the potential health hazards of time-delayed effects of these genotoxicants in the environment.
Collapse
Affiliation(s)
- Maxwell C K Leung
- Nicholas School of the Environment, Duke University, Durham, NC, USA
| | | | | | | | | | | | | | | |
Collapse
|
111
|
Miteva YV, Budayeva HG, Cristea IM. Proteomics-based methods for discovery, quantification, and validation of protein-protein interactions. Anal Chem 2013; 85:749-68. [PMID: 23157382 PMCID: PMC3666915 DOI: 10.1021/ac3033257] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
| | | | - Ileana M. Cristea
- Corresponding author: Ileana M. Cristea 210 Lewis Thomas Laboratory Department of Molecular Biology Princeton University Princeton, NJ 08544 Tel: 6092589417 Fax: 6092584575
| |
Collapse
|
112
|
Fu LL, Yang Y, Xu HL, Cheng Y, Wen X, Ouyang L, Bao JK, Wei YQ, Liu B. Identification of novel caspase/autophagy-related gene switch to cell fate decisions in breast cancers. Cell Prolif 2013; 46:67-75. [PMID: 23289893 DOI: 10.1111/cpr.12005] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2012] [Revised: 08/24/2012] [Accepted: 08/24/2012] [Indexed: 02/05/2023] Open
Abstract
OBJECTIVES Caspases, a family of cysteine proteases with unique substrate specificities, contribute to apoptosis, whereas autophagy-related genes (ATGs) regulate cytoprotective autophagy or autophagic cell death in cancer. Accumulating evidence has recently revealed underlying mechanisms of apoptosis and autophagy; however, their intricate relationships still remain to be clarified. Identification of caspase/ATG switches between apoptosis and autophagy may address this problem. MATERIALS AND METHODS Identification of caspase/ATG switches was carried out using a series of elegant systems biology & bioinformatics approaches, such as network construction, hub protein identification, microarray analyses, targeted microRNA prediction and molecular docking. RESULTS We computationally constructed the global human network from several online databases and further modified it into the basic caspase/ATG network. On the basis of apoptotic or autophagic gene differential expressions, we identified three molecular switches [including androgen receptor, serine/threonine-protein kinase PAK-1 (PAK-1) and mitogen-activated protein kinase-3 (MAPK-3)] between certain caspases and ATGs in human breast carcinoma MCF-7 cells. Subsequently, we identified microRNAs (miRNAs) able to target androgen receptor, PAK-1 and MAPK-3, respectively. Ultimately, we screened a range of small molecule compounds from DrugBank, able to target the three above-mentioned molecular switches in breast cancer cells. CONCLUSIONS We have systematically identified novel caspase/ATG switches involved in miRNA regulation, and predicted targeted anti-cancer drugs. These findings may uncover intricate relationships between apoptosis and autophagy and thus provide further new clues towards possible cancer drug discovery.
Collapse
Affiliation(s)
- L-L Fu
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | | | | | | | | | | | | | | | | |
Collapse
|
113
|
Quantitative analysis of murine terminal erythroid differentiation in vivo: novel method to study normal and disordered erythropoiesis. Blood 2013; 121:e43-9. [PMID: 23287863 DOI: 10.1182/blood-2012-09-456079] [Citation(s) in RCA: 190] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Terminal erythroid differentiation is the process during which proerythroblasts differentiate to produce enucleated reticulocytes. Although it is well established that during murine erythropoiesis in vivo, 1 proerythroblast undergoes 3 mitosis to generate sequentially 2 basophilic, 4 polychromatic, and 8 orthochromatic erythroblasts, currently there is no method to quantitatively monitor this highly regulated process. Here we outline a method that distinguishes each distinct stage of erythroid differentiation in cells from mouse bone marrow and spleen based on expression levels of TER119, CD44, and cell size. Quantitative analysis revealed that the ratio of proerythroblasts:basophilic:polychromatic:orthromatic erythroblasts follows the expected 1:2:4:8 ratio, reflecting the physiologic progression of terminal erythroid differentiation in normal mice. Moreover, in 2 stress erythropoiesis mouse models, phlebotomy-induced acute anemia and chronic hemolytic anemia because of 4.1R deficiency, the ratio of these erythroblast populations remains the same as that of wild-type bone marrow. In contrast, in anemic β-thalassemia intermedia mice, there is altered progression which is restored to normal by transferrin treatment which was previously shown to ameliorate the anemic phenotype. The means to quantitate in vivo murine erythropoiesis using our approach will probably have broad application in the study of altered erythropoiesis in various red cell disorders.
Collapse
|
114
|
Durmuş Tekir SD, Ülgen KÖ. Systems biology of pathogen-host interaction: networks of protein-protein interaction within pathogens and pathogen-human interactions in the post-genomic era. Biotechnol J 2013; 8:85-96. [PMID: 23193100 PMCID: PMC7161785 DOI: 10.1002/biot.201200110] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2012] [Revised: 09/17/2012] [Accepted: 10/11/2012] [Indexed: 12/13/2022]
Abstract
Infectious diseases comprise some of the leading causes of death and disability worldwide. Interactions between pathogen and host proteins underlie the process of infection. Improved understanding of pathogen-host molecular interactions will increase our knowledge of the mechanisms involved in infection, and allow novel therapeutic solutions to be devised. Complete genome sequences for a number of pathogenic microorganisms, as well as the human host, has led to the revelation of their protein-protein interaction (PPI) networks. In this post-genomic era, pathogen-host interactions (PHIs) operating during infection can also be mapped. Detailed systematic analyses of PPI and PHI data together are required for a complete understanding of pathogenesis of infections. Here we review the striking results recently obtained during the construction and investigation of these networks. Emphasis is placed on studies producing large-scale interaction data by high-throughput experimental techniques.
Collapse
Affiliation(s)
| | - Kutlu Ö. Ülgen
- Department of Chemical Engineering, Boǧaziçi University, Istanbul, Turkey
| |
Collapse
|
115
|
Zhi W, Minturn J, Rappaport E, Brodeur G, Li H. Network-based analysis of multivariate gene expression data. Methods Mol Biol 2013; 972:121-39. [PMID: 23385535 PMCID: PMC3692268 DOI: 10.1007/978-1-60327-337-4_8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Multivariate microarray gene expression data are commonly collected to study the genomic responses under ordered conditions such as over increasing/decreasing dose levels or over time during biological processes, where the expression levels of a give gene are expected to be dependent. One important question from such multivariate gene expression experiments is to identify genes that show different expression patterns over treatment dosages or over time; these genes can also point to the pathways that are perturbed during a given biological process. Several empirical Bayes approaches have been developed for identifying the differentially expressed genes in order to account for the parallel structure of the data and to borrow information across all the genes. However, these methods assume that the genes are independent. In this paper, we introduce an alternative empirical Bayes approach for analysis of multivariate gene expression data by assuming a discrete Markov random field (MRF) prior, where the dependency of the differential expression patterns of genes on the networks are modeled by a Markov random field. Simulation studies indicated that the method is quite effective in identifying genes and the modified subnetworks and has higher sensitivity than the commonly used procedures that do not use the pathway information, with similar observed false discovery rates. We applied the proposed methods for analysis of a microarray time course gene expression study of TrkA- and TrkB-transfected neuroblastoma cell lines and identified genes and subnetworks on MAPK, focal adhesion, and prion disease pathways that may explain cell differentiation in TrkA-transfected cell lines.
Collapse
Affiliation(s)
- Wei Zhi
- Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | - Jane Minturn
- Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Eric Rappaport
- Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Garrett Brodeur
- Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Hongzhe Li
- Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA,Address correspondence to: Hongzhe Li Department of Biostatistics and Epidemiology University of Pennsylvania School of Medicine Philadelphia, PA 19104, USA. Tel: (215) 573-5038
| |
Collapse
|
116
|
Abstract
High-throughput methods for screening of physical and functional interactions now provide the means to study virus-host interactions on a genome scale. The limited coverage of these methods and the large size and uncertain quality of the identified interaction sets, however, require sophisticated computational approaches to obtain novel insights and hypotheses on virus infection processes from these interactions. Here, we describe the central steps of bioinformatics methods applied most commonly for this task and highlight important aspects that need to be considered and potential pitfalls that should be avoided.
Collapse
Affiliation(s)
- Susanne M. Bailer
- University of Stuttgart Institute of Interfacial Process, Stuttgart, Germany
| | - Diana Lieber
- Ulm University Medical Center Institute of Virology, Ulm, Germany
| |
Collapse
|
117
|
Wang X, Brunetti P, Mauri PL. Processing of Mass Spectrometry Data in Clinical Applications. BIOINFORMATICS OF HUMAN PROTEOMICS 2012; 3. [PMCID: PMC7123949 DOI: 10.1007/978-94-007-5811-7_9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Mass spectrometry-based proteomics has become the leading approach for analyzing complex biological samples at a large-scale level. Its importance for clinical applications is more and more increasing, thanks to the development of high-performing instruments which allow the discovery of disease-specific biomarkers and an automated and rapid protein profiling of the analyzed samples. In this scenario, the large-scale production of proteomic data has driven the development of specific bioinformatic tools to assist researchers during the discovery processes. Here, we discuss the main methods, algorithms, and procedures to identify and use biomarkers for clinical and research purposes. In particular, we have been focused on quantitative approaches, the identification of proteotypic peptides, and the classification of samples, using proteomic data. Finally, this chapter is concluded by reporting the integration of experimental data with network datasets, as valuable instrument for identifying alterations that underline the emergence of specific phenotypes. Based on our experience, we show some examples taking into consideration experimental data obtained by multidimensional protein identification technology (MudPIT) approach.
Collapse
Affiliation(s)
- Xiangdong Wang
- , Medicine, Biomedical Research Center, Fudan University Zhongshan Hospital, Shang Hai, China, People's Republic
| | | | | |
Collapse
|
118
|
Zhu C, Kushwaha A, Berman K, Jegga AG. A vertex similarity-based framework to discover and rank orphan disease-related genes. BMC SYSTEMS BIOLOGY 2012; 6 Suppl 3:S8. [PMID: 23281592 PMCID: PMC3524320 DOI: 10.1186/1752-0509-6-s3-s8] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Background A rare or orphan disease (OD) is any disease that affects a small percentage of the population. While opportunities now exist to accelerate progress toward understanding the basis for many more ODs, the prioritization of candidate genes is still a critical step for disease-gene identification. Several network-based frameworks have been developed to address this problem with varied results. Result We have developed a novel vertex similarity (VS) based parameter-free prioritizing framework to identify and rank orphan disease candidate genes. We validate our approach by using 1598 known orphan disease-causing genes (ODGs) representing 172 orphan diseases (ODs). We compare our approach with a state-of-art parameter-based approach (PageRank with Priors or PRP) and with another parameter-free method (Interconnectedness or ICN). Our results show that VS-based approach outperforms ICN and is comparable to PRP. We further apply VS-based ranking to identify and rank potential novel candidate genes for several ODs. Conclusion We demonstrate that VS-based parameter-free ranking approach can be successfully used for disease candidate gene prioritization and can complement other network-based methods for candidate disease gene ranking. Importantly, our VS-ranked top candidate genes for the ODs match the known literature, suggesting several novel causal relationships for further investigation.
Collapse
Affiliation(s)
- Cheng Zhu
- Department of Computer Science, University of Cincinnati, Cincinnati, Ohio 45229, USA
| | | | | | | |
Collapse
|
119
|
Hale PJ, López-Yunez AM, Chen JY. Genome-wide meta-analysis of genetic susceptible genes for Type 2 Diabetes. BMC SYSTEMS BIOLOGY 2012; 6 Suppl 3:S16. [PMID: 23281828 PMCID: PMC3524015 DOI: 10.1186/1752-0509-6-s3-s16] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Background Many genetic studies, including single gene studies and Genome-wide association studies (GWAS), aim to identify risk alleles for genetic diseases such as Type II Diabetes (T2D). However, in T2D studies, there is a significant amount of the hereditary risk that cannot be simply explained by individual risk genes. There is a need for developing systems biology approaches to integrate comprehensive genetic information and provide new insight on T2D biology. Methods We performed comprehensive integrative analysis of Single Nucleotide Polymorphisms (SNP's) individually curated from T2D GWAS results and mapped them to T2D candidate risk genes. Using protein-protein interaction data, we constructed a T2D-specific molecular interaction network consisting of T2D genetic risk genes and their interacting gene partners. We then studied the relationship between these T2D genes and curated gene sets. Results We determined that T2D candidate risk genes are concentrated in certain parts of the genome, specifically in chromosome 20. Using the T2D genetic network, we identified highly-interconnected network "hub" genes. By incorporating T2D GWAS results, T2D pathways, and T2D genes' functional category information, we further ranked T2D risk genes, T2D-related pathways, and T2D-related functional categories. We found that highly-interconnected T2D disease network “hub” genes most highly associated to T2D genetic risks to be PI3KR1, ESR1, and ENPP1. The well-characterized TCF7L2, contractor to our expectation, was not among the highest-ranked T2D gene list. Many interacted pathways play a role in T2D genetic risks, which includes insulin signalling pathway, type II diabetes pathway, maturity onset diabetes of the young, adipocytokine signalling pathway, and pathways in cancer. We also observed significant crosstalk among T2D gene subnetworks which include insulin secretion, regulation of insulin secretion, response to peptide hormone stimulus, response to insulin stimulus, peptide secretion, glucose homeostasis, and hormone transport. Overview maps involving T2D genes, gene sets, pathways, and their interactions are all reported. Conclusions Large-scale systems biology meta-analyses of GWAS results can improve interpretations of genetic variations and genetic risk factors. T2D genetic risks can be attributable to the summative genetic effects of many genes involved in a broad range of signalling pathways and functional networks. The framework developed for T2D studies may serve as a guide for studying other complex diseases.
Collapse
Affiliation(s)
- Paul J Hale
- School of Informatics, Indiana University-Purdue University, Indianapolis, IN, USA
| | | | | |
Collapse
|
120
|
Hsu CL, Yang UC. Discovering pathway cross-talks based on functional relations between pathways. BMC Genomics 2012; 13 Suppl 7:S25. [PMID: 23282018 PMCID: PMC3521217 DOI: 10.1186/1471-2164-13-s7-s25] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background In biological systems, pathways coordinate or interact with one another to achieve a complex biological process. Studying how they influence each other is essential for understanding the intricacies of a biological system. However, current methods rely on statistical tests to determine pathway relations, and may lose numerous biologically significant relations. Results This study proposes a method that identifies the pathway relations by measuring the functional relations between pathways based on the Gene Ontology (GO) annotations. This approach identified 4,661 pathway relations among 166 pathways from Pathway Interaction Database (PID). Using 143 pathway interactions from PID as testing data, the function-based approach (FBA) is able to identify 93% of pathway interactions, better than the existing methods based on the shared components and protein-protein interactions. Many well-known pathway cross-talks are only identified by FBA. In addition, the false positive rate of FBA is significantly lower than others via pathway co-expression analysis. Conclusions This function-based approach appears to be more sensitive and able to infer more biologically significant and explainable pathway relations.
Collapse
Affiliation(s)
- Chia-Lang Hsu
- Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan
| | | |
Collapse
|
121
|
Fennell H, Olawin A, Mizanur RM, Izumori K, Chen JG, Ullah H. Arabidopsis scaffold protein RACK1A modulates rare sugar D-allose regulated gibberellin signaling. PLANT SIGNALING & BEHAVIOR 2012; 7:1407-10. [PMID: 22951405 PMCID: PMC3548859 DOI: 10.4161/psb.21995] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
As energy sources and structural components, sugars are the central regulators of plant growth and development. In addition to the abundant natural sugars in plants, more than 50 different kinds of rare sugars exist in nature, several of which show distinct roles in plant growth and development. Recently, one of the rare sugars, D-allose, an epimer of D-glucose at C3, is found to suppress plant hormone gibberellin (GA) signaling in rice. Scaffold protein RACK1A in the model plant Arabidopsis is implicated in the GA pathway as rack1a knockout mutants show insensitivity to GA in GA-induced seed germination. Using genetic knockout lines and a reporter gene, the functional role of RACK1A in the D-allose pathway was investigated. It was found that the rack1a knockout seeds showed hypersensitivity to D-allose-induced inhibition of seed germination, implicating a role for RACK1A in the D-allose mediated suppression of seed germination. On the other hand, a functional RACK1A in the background of the double knockout mutations in the other two RACK1 isoforms, rack1b/rack1c, showed significant resistance to the D-allose induced inhibition of seed germination. The collective results implicate the RACK1A in the D-allose mediated seed germination inhibition pathway. Elucidation of the rare sugar signaling mechanism will help to advance understanding of this less studied but important cellular signaling pathway.
Collapse
Affiliation(s)
- Herman Fennell
- Department of Biology; Howard University; Washington, DC USA
| | | | - Rahman M. Mizanur
- US Army Medical Research Institute of Infectious Diseases (USAMRIID); Fort Detrick; Frederick, MD USA
| | - Ken Izumori
- Faculty of Agriculture; Kagawa University; Kagawa, Japan
| | - Jin-Gui Chen
- Biosciences Division; Oak Ridge National Laboratory; Oak Ridge, TN USA
| | - Hemayet Ullah
- Department of Biology; Howard University; Washington, DC USA
- Correspondence to: Hemayet Ullah,
| |
Collapse
|
122
|
Abstract
Molecular interaction databases are playing an ever more important role in our understanding of the biology of the cell. An increasing number of resources exist to provide these data and many of these have adopted the controlled vocabularies and agreed-upon standardised data formats produced by the Molecular Interaction workgroup of the Human Proteome Organization Proteomics Standards Initiative (HUPO PSI-MI). Use of these standards allows each resource to establish PSI Common QUery InterfaCe (PSICQUIC) service, making data from multiple resources available to the user in response to a single query. This cooperation between databases has been taken a stage further, with the establishment of the International Molecular Exchange (IMEx) consortium which aims to maximise the curation power of numerous data resources, and provide the user with a non-redundant, consistently annotated set of interaction data.
Collapse
Affiliation(s)
- Sandra Orchard
- EMBL Outstation - European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| |
Collapse
|
123
|
Camargo M, Lopes PI, Del Giudice PT, Carvalho VM, Cardozo KHM, Andreoni C, Fraietta R, Bertolla RP. Unbiased label-free quantitative proteomic profiling and enriched proteomic pathways in seminal plasma of adult men before and after varicocelectomy. Hum Reprod 2012; 28:33-46. [DOI: 10.1093/humrep/des357] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
|
124
|
Fu L, Zhao X, Xu H, Wen X, Wang S, Liu B, Bao J, Wei Y. Identification of microRNA-regulated autophagic pathways in plant lectin-induced cancer cell death. Cell Prolif 2012; 45:477-85. [PMID: 22882626 PMCID: PMC6496687 DOI: 10.1111/j.1365-2184.2012.00840.x] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2012] [Accepted: 05/14/2012] [Indexed: 02/05/2023] Open
Abstract
OBJECTIVES Plant lectins, carbohydrate-binding proteins of non-immune origin, have recently been reported to induce programmed cell death (including apoptosis and autophagy) in many types of cancer cells. MicroRNAs (miRNAs), small, non-coding endogenous RNAs, ~22 nucleotides (nt) in length, have been well characterized to play essential roles in regulation of the autophagy process in cancer; however, how these miRNAs regulate autophagic pathways in plant lectin-induced cancer cells, still remains an enigma. MATERIALS AND METHODS Identification of microRNA-regulated autophagic pathways was carried out using a series of elegant systems - biology and bioinformatics approaches, such as network construction, hub protein identification, targeted microRNA prediction, microarray analyses and molecular docking. RESULTS We computationally constructed the human autophagic protein-protein interaction (PPI) network, and further modified this network into a plant lectin-induced network. Subsequently, we identified 9 autophagic hub proteins and 13 relevant oncogenic and tumour suppressive miRNAs, that could regulate these aforementioned targeted autophagic hub proteins, in human breast carcinoma MCF-7 cells. In addition, we confirmed that plant lectins could block the sugar-containing receptor EGFR-mediated survival pathways, involved in autophagic hub proteins and relevant miRNAs, thereby ultimately culminating in autophagic cell death. CONCLUSIONS These results demonstrate that network-based identification of microRNAs modulate autophagic pathways in plant lectin-treated cancer cells, which may shed new light on the discovery of plant lectins as potent autophagic inducers, for cancer drug discovery.
Collapse
Affiliation(s)
- L.‐L. Fu
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| | - X. Zhao
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| | - H.‐L. Xu
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| | - X. Wen
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| | - S.‐Y. Wang
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| | - B. Liu
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| | - J.‐K. Bao
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| | - Y.‐Q. Wei
- School of Life SciencesState Key Laboratory of Biotherapy and Cancer CenterWest China HospitalSichuan UniversityChengduChina
| |
Collapse
|
125
|
Bassel GW, Gaudinier A, Brady SM, Hennig L, Rhee SY, De Smet I. Systems analysis of plant functional, transcriptional, physical interaction, and metabolic networks. THE PLANT CELL 2012; 24:3859-75. [PMID: 23110892 PMCID: PMC3517224 DOI: 10.1105/tpc.112.100776] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2012] [Revised: 08/21/2012] [Accepted: 10/11/2012] [Indexed: 05/19/2023]
Abstract
Physiological responses, developmental programs, and cellular functions rely on complex networks of interactions at different levels and scales. Systems biology brings together high-throughput biochemical, genetic, and molecular approaches to generate omics data that can be analyzed and used in mathematical and computational models toward uncovering these networks on a global scale. Various approaches, including transcriptomics, proteomics, interactomics, and metabolomics, have been employed to obtain these data on the cellular, tissue, organ, and whole-plant level. We summarize progress on gene regulatory, cofunction, protein interaction, and metabolic networks. We also illustrate the main approaches that have been used to obtain these networks, with specific examples from Arabidopsis thaliana, and describe the pros and cons of each approach.
Collapse
Affiliation(s)
- George W. Bassel
- School of Biosciences, University of Birmingham, Birmingham B15 2TT, United Kingdom
- Division of Plant and Crop Sciences, School of Biosciences and Centre for Plant Integrative Biology, University of Nottingham, Loughborough LE12 5RD, United Kingdom
| | - Allison Gaudinier
- Department of Plant Biology and Genome Center, University of California, Davis, California 95616
| | - Siobhan M. Brady
- Department of Plant Biology and Genome Center, University of California, Davis, California 95616
| | - Lars Hennig
- Department of Plant Biology and Forest Genetics, Uppsala BioCenter, Swedish University of Agricultural Sciences and Linnean Center for Plant Biology, SE-75007 Uppsala, Sweden
| | - Seung Y. Rhee
- Department of Plant Biology, Carnegie Institution for Science, Stanford, California 94305
| | - Ive De Smet
- Division of Plant and Crop Sciences, School of Biosciences and Centre for Plant Integrative Biology, University of Nottingham, Loughborough LE12 5RD, United Kingdom
| |
Collapse
|
126
|
Guan Y, Gorenshteyn D, Burmeister M, Wong AK, Schimenti JC, Handel MA, Bult CJ, Hibbs MA, Troyanskaya OG. Tissue-specific functional networks for prioritizing phenotype and disease genes. PLoS Comput Biol 2012; 8:e1002694. [PMID: 23028291 PMCID: PMC3459891 DOI: 10.1371/journal.pcbi.1002694] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2012] [Accepted: 08/02/2012] [Indexed: 12/16/2022] Open
Abstract
Integrated analyses of functional genomics data have enormous potential for identifying phenotype-associated genes. Tissue-specificity is an important aspect of many genetic diseases, reflecting the potentially different roles of proteins and pathways in diverse cell lineages. Accounting for tissue specificity in global integration of functional genomics data is challenging, as “functionality” and “functional relationships” are often not resolved for specific tissue types. We address this challenge by generating tissue-specific functional networks, which can effectively represent the diversity of protein function for more accurate identification of phenotype-associated genes in the laboratory mouse. Specifically, we created 107 tissue-specific functional relationship networks through integration of genomic data utilizing knowledge of tissue-specific gene expression patterns. Cross-network comparison revealed significantly changed genes enriched for functions related to specific tissue development. We then utilized these tissue-specific networks to predict genes associated with different phenotypes. Our results demonstrate that prediction performance is significantly improved through using the tissue-specific networks as compared to the global functional network. We used a testis-specific functional relationship network to predict genes associated with male fertility and spermatogenesis phenotypes, and experimentally confirmed one top prediction, Mbyl1. We then focused on a less-common genetic disease, ataxia, and identified candidates uniquely predicted by the cerebellum network, which are supported by both literature and experimental evidence. Our systems-level, tissue-specific scheme advances over traditional global integration and analyses and establishes a prototype to address the tissue-specific effects of genetic perturbations, diseases and drugs. Tissue specificity is an important aspect of many genetic diseases, reflecting the potentially different roles of proteins and pathways in diverse cell lineages. We propose an effective strategy to model tissue-specific functional relationship networks in the laboratory mouse. We integrated large scale genomics datasets as well as low-throughput tissue-specific expression profiles to estimate the probability that two proteins are co-functioning in the tissue under study. These networks can accurately reflect the diversity of protein functions across different organs and tissue compartments. By computationally exploring the tissue-specific networks, we can accurately predict novel phenotype-related gene candidates. We experimentally confirmed a top candidate gene, Mybl1, to affect several male fertility phenotypes, predicted based on male-reproductive system-specific networks and we predicted candidates related to a rare genetic disease ataxia, which are supported by experimental and literature evidence. The above results demonstrate the power of modeling tissue-specific dynamics of co-functionality through computational approaches.
Collapse
Affiliation(s)
- Yuanfang Guan
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Dmitriy Gorenshteyn
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, United States of America
| | - Margit Burmeister
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Molecular & Behavioral Neuroscience Institution, Department of Psychiatry, and Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Aaron K. Wong
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - John C. Schimenti
- Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, New York, United States of America
| | - Mary Ann Handel
- The Jackson Laboratory, Bar Harbor, Maine, United States of America
| | - Carol J. Bult
- The Jackson Laboratory, Bar Harbor, Maine, United States of America
| | - Matthew A. Hibbs
- The Jackson Laboratory, Bar Harbor, Maine, United States of America
- Trinity University, Computer Science Department, San Antonio, Texas, United States of America
- * E-mail: (MAH); (OGT)
| | - Olga G. Troyanskaya
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
- * E-mail: (MAH); (OGT)
| |
Collapse
|
127
|
Villaamil VM, Gallego GA, Valladares-Ayerbes M, Caínzos IS, Aparicio LMA. Multiple biomarker tissue arrays: A computational approach to identifying protein-protein interactions in the EGFR/ERK signalling pathway. J Mol Signal 2012; 7:14. [PMID: 22937740 PMCID: PMC3493339 DOI: 10.1186/1750-2187-7-14] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 08/15/2012] [Indexed: 11/18/2022] Open
Abstract
Background Many studies have demonstrated genetic and environmental factors that lead to renal cell carcinoma (RCC) and that occur during a protracted period of tumourigenesis. It appears suitable to identify and characterise potential molecular markers that appear during tumourigenesis and that might provide rapid and effective possibilities for the early detection of RCC. EGFR activation induces cell cycle progression, inhibition of apoptosis and angiogenesis, promotion of invasion/metastasis, and other tumour promoting activities. Over-expression of EGFR is thought to play an important role in tumour initiation and progression of RCC because up-regulation of EGFR has been associated with high grade cancers and a worse prognosis. Methods Characterisation of the protein profile interacting with EGFR was performed using the following: an immunohistochemical (IHC) study of EGFR, a comprehensive computational study of EGFR protein-protein interactions, an analysis correlating the expression levels of EGFR with other significant markers in the tumourigenicity of RCC, and finally, an analysis of the utility of EGFR for prognosis in a cohort of patients with renal cell carcinoma. Results The cases that showed a higher level of this protein fell within the clear cell histological subtype (p = 0.001). The EGFR significance statistic was found with respect to a worse prognosis. In vivo significant correlations were found with PDGFR-β, Flk-1, Hif1-α, proteins related to differentiation (such as DLL3 and DLL4 ligands), and certain metabolic proteins such as Glut5. In silico significant associations gave us a panel of 32 EGFR-interacting proteins (EIP) using the APID and STRING databases. Conclusions This work summarises the multifaceted role of EGFR in the pathology of RCC, and it identifies EIPs that could help to provide mechanistic explanations for the different behaviours observed in tumours.
Collapse
|
128
|
Medina Villaamil V, Aparicio Gallego G, Santamarina Caínzos I, Valladares-Ayerbes M, Antón Aparicio LM. Searching for Hif1-α interacting proteins in renal cell carcinoma. Clin Transl Oncol 2012; 14:698-708. [PMID: 22926943 DOI: 10.1007/s12094-012-0857-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2011] [Accepted: 12/08/2011] [Indexed: 12/16/2022]
Abstract
INTRODUCTION Kidney tumours are frequently characterised by hypoxic conditions due to a local imbalance between oxygen (O2) supply and consumption. Hif1-α regulates angiogenesis, tumour growth, tumour progression, metastatic spread, and glucose metabolism by acting as a transcription factor for relevant genes. Here, we describe an immunohistochemical study of Hif1-α, a comprehensive computational study of Hif1-α interacting proteins (HIPs), an analysis correlating expression levels of Hif1-α with upstream and downstream proteins, and an analysis of the utility of Hif1-α for prognosis in a cohort of patients with renal cell carcinoma. MATERIALS AND METHODS The patient cohort included 80 patients. For immunohistochemistry evaluation, tissue microarrays were constructed. The IntAct, MINT, and BOND databases were used for the HIP approach. The Kruskal-Wallis test was used for comparing protein expression with pathology measurements. Correlation was expressed as the Pearson coefficient. RESULTS Hif1-α expression correlates significantly with the "clear" histological subtype of renal cell carcinoma (p < 0.01). The samples with the worst prognoses related to the pathological variables analysed showed the highest levels of Hif1-α expression. Significant correlations were found with Bcl-2, CAIX, C-kit, EGFR, TGF-β, proteins of the VEGF family, proteins related to differentiation (such as Notch1 and Notch3) and certain metabolic enzymes. Bioinformatic analysis suggested 45 evidence-based HIPs and 4 complexes involving protein Hif1-α. CONCLUSIONS This work summarises the multifaceted role of Hif1-α in the pathology of renal cell carcinomas, and it identifies HIPs that could help provide mechanistic explanations for the different behaviours seen in tumours.
Collapse
|
129
|
Davis MJ, Shin CJ, Jing N, Ragan MA. Rewiring the dynamic interactome. MOLECULAR BIOSYSTEMS 2012; 8:2054-2013. [PMID: 22729145 DOI: 10.1039/c2mb25050k] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
Transcriptomics continues to provide ever-more evidence that in morphologically complex eukaryotes, each protein-coding genetic locus can give rise to multiple transcripts that differ in length, exon content and/or other sequence features. In humans, more than 60% of loci give rise to multiple transcripts in this way. Motifs that mediate protein-protein interactions can be present or absent in these transcripts. Analysis of protein interaction networks has been a valuable development in systems biology. Interactions are typically recorded for representative proteins or even genes, although exploratory transcriptomics has revealed great spatiotemporal diversity in the output of genes at both the transcript and protein-isoform levels. The increasing availability of high-resolution protein structures has made it possible to identify the domain-domain interactions that underpin many protein interactions. To explore the impact of transcript and isoform diversity we use full-length human cDNAs to interrogate the protein-coding transcriptional output of genes, identifying variation in the inclusion of protein interaction domains. We map these data to a set of high-quality protein interactions, and characterise the variation in network connectivity likely to result. We find strong evidence for altered interaction potential in nearly 20% of genes, suggesting that transcriptional variation can significantly rewire the human interactome.
Collapse
Affiliation(s)
- Melissa J Davis
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland 4072, Australia
| | | | | | | |
Collapse
|
130
|
Subnetwork-based analysis of chronic lymphocytic leukemia identifies pathways that associate with disease progression. Blood 2012; 120:2639-49. [PMID: 22837534 DOI: 10.1182/blood-2012-03-416461] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
The clinical course of patients with chronic lymphocytic leukemia (CLL) is heterogeneous. Several prognostic factors have been identified that can stratify patients into groups that differ in their relative tendency for disease progression and/or survival. Here, we pursued a subnetwork-based analysis of gene expression profiles to discriminate between groups of patients with disparate risks for CLL progression. From an initial cohort of 130 patients, we identified 38 prognostic subnetworks that could predict the relative risk for disease progression requiring therapy from the time of sample collection, more accurately than established markers. The prognostic power of these subnetworks then was validated on 2 other cohorts of patients. We noted reduced divergence in gene expression between leukemia cells of CLL patients classified at diagnosis with aggressive versus indolent disease over time. The predictive subnetworks vary in levels of expression over time but exhibit increased similarity at later time points before therapy, suggesting that degenerate pathways apparently converge into common pathways that are associated with disease progression. As such, these results have implications for understanding cancer evolution and for the development of novel treatment strategies for patients with CLL.
Collapse
|
131
|
Bockhorst JP, Conroy JM, Agarwal S, O’Leary DP, Yu H. Beyond captions: linking figures with abstract sentences in biomedical articles. PLoS One 2012; 7:e39618. [PMID: 22815711 PMCID: PMC3399876 DOI: 10.1371/journal.pone.0039618] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2011] [Accepted: 05/23/2012] [Indexed: 11/18/2022] Open
Abstract
Although figures in scientific articles have high information content and concisely communicate many key research findings, they are currently under utilized by literature search and retrieval systems. Many systems ignore figures, and those that do not typically only consider caption text. This study describes and evaluates a fully automated approach for associating figures in the body of a biomedical article with sentences in its abstract. We use supervised methods to learn probabilistic language models, hidden Markov models, and conditional random fields for predicting associations between abstract sentences and figures. Three kinds of evidence are used: text in abstract sentences and figures, relative positions of sentences and figures, and the patterns of sentence/figure associations across an article. Each information source is shown to have predictive value, and models that use all kinds of evidence are more accurate than models that do not. Our most accurate method has an -score of 69% on a cross-validation experiment, is competitive with the accuracy of human experts, has significantly better predictive accuracy than state-of-the-art methods and enables users to access figures associated with an abstract sentence with an average of 1.82 fewer mouse clicks. A user evaluation shows that human users find our system beneficial. The system is available at http://FigureItOut.askHERMES.org.
Collapse
Affiliation(s)
- Joseph P. Bockhorst
- Department of Computer Science, University of Wisconsin–Milwaukee, Milwaukee, Wisconsin, United States of America
- * E-mail: (JPB); (HY)
| | - John M. Conroy
- IDA/Center for Computing Sciences, Bowie, Maryland, United States of America
| | - Shashank Agarwal
- Department of Health Sciences, University of Wisconsin–Milwaukee, Milwaukee, Wisconsin, United States of America
| | - Dianne P. O’Leary
- Computer Science Department and UMIACS, University of Maryland, College Park, Maryland, United States of America
| | - Hong Yu
- Department of Computer Science, University of Wisconsin–Milwaukee, Milwaukee, Wisconsin, United States of America
- Department of Health Sciences, University of Wisconsin–Milwaukee, Milwaukee, Wisconsin, United States of America
- * E-mail: (JPB); (HY)
| |
Collapse
|
132
|
Kundrotas PJ, Zhu Z, Vakser IA. GWIDD: a comprehensive resource for genome-wide structural modeling of protein-protein interactions. Hum Genomics 2012; 6:7. [PMID: 23245398 PMCID: PMC3500202 DOI: 10.1186/1479-7364-6-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2012] [Accepted: 07/11/2012] [Indexed: 11/10/2022] Open
Abstract
Protein-protein interactions are a key component of life processes. The knowledge of the three-dimensional structure of these interactions is important for understanding protein function. Genome-Wide Docking Database (http://gwidd.bioinformatics.ku.edu) offers an extensive source of data for structural studies of protein-protein complexes on genome scale. The current release of the database combines the available experimental data on the structure and characteristics of protein interactions with structural modeling of protein complexes for 771 organisms spanned over the entire universe of life from viruses to humans. The interactions are stored in a relational database with user-friendly interface that includes various search options. The search results can be interactively previewed; the structures, downloaded, along with the interaction characteristics.
Collapse
|
133
|
Dannenfelser R, Clark NR, Ma'ayan A. Genes2FANs: connecting genes through functional association networks. BMC Bioinformatics 2012; 13:156. [PMID: 22748121 PMCID: PMC3472228 DOI: 10.1186/1471-2105-13-156] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2011] [Accepted: 05/25/2012] [Indexed: 01/04/2023] Open
Abstract
Background Protein-protein, cell signaling, metabolic, and transcriptional interaction networks are useful for identifying connections between lists of experimentally identified genes/proteins. However, besides physical or co-expression interactions there are many ways in which pairs of genes, or their protein products, can be associated. By systematically incorporating knowledge on shared properties of genes from diverse sources to build functional association networks (FANs), researchers may be able to identify additional functional interactions between groups of genes that are not readily apparent. Results Genes2FANs is a web based tool and a database that utilizes 14 carefully constructed FANs and a large-scale protein-protein interaction (PPI) network to build subnetworks that connect lists of human and mouse genes. The FANs are created from mammalian gene set libraries where mouse genes are converted to their human orthologs. The tool takes as input a list of human or mouse Entrez gene symbols to produce a subnetwork and a ranked list of intermediate genes that are used to connect the query input list. In addition, users can enter any PubMed search term and then the system automatically converts the returned results to gene lists using GeneRIF. This gene list is then used as input to generate a subnetwork from the user’s PubMed query. As a case study, we applied Genes2FANs to connect disease genes from 90 well-studied disorders. We find an inverse correlation between the counts of links connecting disease genes through PPI and links connecting diseases genes through FANs, separating diseases into two categories. Conclusions Genes2FANs is a useful tool for interpreting the relationships between gene/protein lists in the context of their various functions and networks. Combining functional association interactions with physical PPIs can be useful for revealing new biology and help form hypotheses for further experimentation. Our finding that disease genes in many cancers are mostly connected through PPIs whereas other complex diseases, such as autism and type-2 diabetes, are mostly connected through FANs without PPIs, can guide better strategies for disease gene discovery. Genes2FANs is available at:
http://actin.pharm.mssm.edu/genes2FANs.
Collapse
Affiliation(s)
- Ruth Dannenfelser
- Department of Pharmacology and Systems Therapeutics, Systems Biology Center of New York, Mount Sinai School of Medicine, New York, NY 10029, USA
| | | | | |
Collapse
|
134
|
Orchard S, Kerrien S, Abbani S, Aranda B, Bhate J, Bidwell S, Bridge A, Briganti L, Brinkman FSL, Brinkman F, Cesareni G, Chatr-aryamontri A, Chautard E, Chen C, Dumousseau M, Goll J, Hancock REW, Hancock R, Hannick LI, Jurisica I, Khadake J, Lynn DJ, Mahadevan U, Perfetto L, Raghunath A, Ricard-Blum S, Roechert B, Salwinski L, Stümpflen V, Tyers M, Uetz P, Xenarios I, Hermjakob H. Protein interaction data curation: the International Molecular Exchange (IMEx) consortium. Nat Methods 2012; 9:345-50. [PMID: 22453911 DOI: 10.1038/nmeth.1931] [Citation(s) in RCA: 402] [Impact Index Per Article: 30.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
The International Molecular Exchange (IMEx) consortium is an international collaboration between major public interaction data providers to share literature-curation efforts and make a nonredundant set of protein interactions available in a single search interface on a common website (http://www.imexconsortium.org/). Common curation rules have been developed, and a central registry is used to manage the selection of articles to enter into the dataset. We discuss the advantages of such a service to the user, our quality-control measures and our data-distribution practices.
Collapse
Affiliation(s)
- Sandra Orchard
- European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
135
|
Sanz-Pamplona R, García-García J, Franco S, Messeguer X, Driouch K, Oliva B, Sierra A. A taxonomy of organ-specific breast cancer metastases based on a protein-protein interaction network. MOLECULAR BIOSYSTEMS 2012; 8:2085-96. [PMID: 22710377 DOI: 10.1039/c2mb25104c] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
We carried out a systems-level study of the mechanisms underlying organ-specific metastases of breast cancer. We followed a network-based approach using microarray expression data from human breast cancer metastases to select organ-specific proteins that exert a range of functions allowing cell survival and growth in the microenvironment of distant organs. MinerProt, a home-made software application, was used to group organ-specific signatures of brain (1191 genes), bone (1623 genes), liver (977 genes) and lung (254 genes) metastases by function and select the most differentially expressed gene in each function. As a result, we obtained 19 functional representative proteins in brain, 23 in bone, 15 in liver and 9 in lung, with which we constructed four organ-specific protein-protein interaction networks. The network taxonomy included seven proteins that interacted in brain metastasis, which were mainly associated with signal transduction. Proteins related to immune response functions were bone specific, while those involved in proteolysis, signal transduction and hepatic glucose metabolism were found in liver metastasis. No experimental protein-protein interaction was found in lung metastasis; thus, computationally determined interactions were included in this network. Moreover, three of these selected genes (CXCL12, DSC2 and TFDP2) were associated with progression to specific organs when tested in an independent dataset. In conclusion, we present a network-based approach to filter information by selecting key protein functions as metastatic markers or therapeutic targets.
Collapse
Affiliation(s)
- Rebeca Sanz-Pamplona
- Biological Clues of the Invasive and Metastatic Phenotype Group, Bellvitge Biomedical Research Institute-IDIBELL, Gran Via de L'Hospitalet, 199 L'Hospitalet de Llobregat, E-08908 Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|
136
|
Sánchez Claros C, Tramontano A. Detecting mutually exclusive interactions in protein-protein interaction maps. PLoS One 2012; 7:e38765. [PMID: 22715412 PMCID: PMC3370996 DOI: 10.1371/journal.pone.0038765] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2012] [Accepted: 05/10/2012] [Indexed: 01/09/2023] Open
Abstract
Comprehensive protein interaction maps can complement genetic and biochemical experiments and allow the formulation of new hypotheses to be tested in the system of interest. The computational analysis of the maps may help to focus on interesting cases and thereby to appropriately prioritize the validation experiments. We show here that, by automatically comparing and analyzing structurally similar regions of proteins of known structure interacting with a common partner, it is possible to identify mutually exclusive interactions present in the maps with a sensitivity of 70% and a specificity higher than 85% and that, in about three fourth of the correctly identified complexes, we also correctly recognize at least one residue (five on average) belonging to the interaction interface. Given the present and continuously increasing number of proteins of known structure, the requirement of the knowledge of the structure of the interacting proteins does not substantially impact on the coverage of our strategy that can be estimated to be around 25%. We also introduce here the Estrella server that embodies this strategy, is designed for users interested in validating specific hypotheses about the functional role of a protein-protein interaction and it also allows access to pre-computed data for seven organisms.
Collapse
Affiliation(s)
| | - Anna Tramontano
- Department of Physics, Sapienza University of Rome, Rome, Italy
- Institut Pasteur Fondazione Cenci-Bolognetti, Rome, Italy
- Center for Life Nano Science @Sapienza, Istituto Italiano di Tecnologia, Rome, Italy
- * E-mail:
| |
Collapse
|
137
|
Templates are available to model nearly all complexes of structurally characterized proteins. Proc Natl Acad Sci U S A 2012; 109:9438-41. [PMID: 22645367 DOI: 10.1073/pnas.1200678109] [Citation(s) in RCA: 150] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Traditional approaches to protein-protein docking sample the binding modes with no regard to similar experimentally determined structures (templates) of protein-protein complexes. Emerging template-based docking approaches utilize such similar complexes to determine the docking predictions. The docking problem assumes the knowledge of the participating proteins' structures. Thus, it provides the possibility of aligning the structures of the proteins and the template complexes. The progress in the development of template-based docking and the vast experience in template-based modeling of individual proteins show that, generally, such approaches are more reliable than the free modeling. The key aspect of this modeling paradigm is the availability of the templates. The current common perception is that due to the difficulties in experimental structure determination of protein-protein complexes, the pool of docking templates is insignificant, and thus a broad application of template-based docking is possible only at some future time. The results of our large scale, systematic study show that, surprisingly, in spite of the limited number of protein-protein complexes in the Protein Data Bank, docking templates can be found for complexes representing almost all the known protein-protein interactions, provided the components themselves have a known structure or can be homology-built. About one-third of the templates are of good quality when they are compared to experimental structures in test sets extracted from the Protein Data Bank and would be useful starting points in modeling the complexes. This finding dramatically expands our ability to model protein interactions, and has far-reaching implications for the protein docking field in general.
Collapse
|
138
|
From genome-wide association studies to disease mechanisms: celiac disease as a model for autoimmune diseases. Semin Immunopathol 2012. [PMID: 22580835 DOI: 10.107/s00281-012-0312-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Celiac disease is characterized by a chronic inflammatory reaction in the intestine and is triggered by gluten, a constituent derived from grains which is present in the common daily diet in the Western world. Despite decades of research, the mechanisms behind celiac disease etiology are still not fully understood, although it is clear that both genetic and environmental factors are involved. To improve the understanding of the disease, the genetic component has been extensively studied by genome-wide association studies. These have uncovered a wealth of information that still needs further investigation to clarify its importance. In this review, we summarize and discuss the results of the genetic studies in celiac disease, focusing on the "non-HLA" genes. We also present novel approaches to identifying the causal variants in complex susceptibility loci and disease mechanisms.
Collapse
|
139
|
Kumar V, Wijmenga C, Withoff S. From genome-wide association studies to disease mechanisms: celiac disease as a model for autoimmune diseases. Semin Immunopathol 2012; 34:567-80. [PMID: 22580835 PMCID: PMC3410018 DOI: 10.1007/s00281-012-0312-1] [Citation(s) in RCA: 97] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Accepted: 04/10/2012] [Indexed: 02/06/2023]
Abstract
Celiac disease is characterized by a chronic inflammatory reaction in the intestine and is triggered by gluten, a constituent derived from grains which is present in the common daily diet in the Western world. Despite decades of research, the mechanisms behind celiac disease etiology are still not fully understood, although it is clear that both genetic and environmental factors are involved. To improve the understanding of the disease, the genetic component has been extensively studied by genome-wide association studies. These have uncovered a wealth of information that still needs further investigation to clarify its importance. In this review, we summarize and discuss the results of the genetic studies in celiac disease, focusing on the “non-HLA” genes. We also present novel approaches to identifying the causal variants in complex susceptibility loci and disease mechanisms.
Collapse
Affiliation(s)
- Vinod Kumar
- Department of Genetics, University Medical Hospital Groningen, University of Groningen, PO Box 30001, 9700 RB, Groningen, the Netherlands
| | | | | |
Collapse
|
140
|
Wang N, Xu HL, Zhao X, Wen X, Wang FT, Wang SY, Fu LL, Liu B, Bao JK. Network-based identification of novel connections among apoptotic signaling pathways in cancer. Appl Biochem Biotechnol 2012; 167:621-31. [PMID: 22581077 DOI: 10.1007/s12010-012-9704-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2011] [Accepted: 04/23/2012] [Indexed: 02/05/2023]
Abstract
MicroRNAs (miRNAs), highly conserved, non-coding endogenous RNA and nearly ~22 nucleotides (nt) in length, are well-known to regulate several apoptotic pathways in cancer. In this study, we computationally constructed the initial human apoptotic PPI network by several online databases, and further integrated these high-throughput datasets into a Naïve Bayesian model to predict protein functional connections. Based on the modified apoptotic network, we identified several apoptotic hub proteins such as TP53, SRC, M3K3/5/8, cyclin-dependent kinase2/6, TNFR16/19, and TGF-β receptor 1/2. Subsequently, we identified some microRNAs that could target the aforementioned apoptotic hub proteins by using TargetScan, PicTar, and Diana-MicroH. In conclusion, these results demonstrate the PPI network-based identification of new connections amongst apoptotic pathways in cancer, which may shed new light on the intricate relationships between core apoptotic pathways and some targeted miRNAs in human cancers.
Collapse
Affiliation(s)
- Nan Wang
- School of Life Sciences and The State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu 610064, China
| | | | | | | | | | | | | | | | | |
Collapse
|
141
|
Liu ZP, Wang J, Qiu YQ, Leung RKK, Zhang XS, Tsui SKW, Chen L. Inferring a protein interaction map of Mycobacterium tuberculosis based on sequences and interologs. BMC Bioinformatics 2012; 13 Suppl 7:S6. [PMID: 22595003 PMCID: PMC3348046 DOI: 10.1186/1471-2105-13-s7-s6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mycobacterium tuberculosis is an infectious bacterium posing serious threats to human health. Due to the difficulty in performing molecular biology experiments to detect protein interactions, reconstruction of a protein interaction map of M. tuberculosis by computational methods will provide crucial information to understand the biological processes in the pathogenic microorganism, as well as provide the framework upon which new therapeutic approaches can be developed. RESULTS In this paper, we constructed an integrated M. tuberculosis protein interaction network by machine learning and ortholog-based methods. Firstly, we built a support vector machine (SVM) method to infer the protein interactions of M. tuberculosis H37Rv by gene sequence information. We tested our predictors in Escherichia coli and mapped the genetic codon features underlying its protein interactions to M. tuberculosis. Moreover, the documented interactions of 14 other species were mapped to the interactome of M. tuberculosis by the interolog method. The ensemble protein interactions were validated by various functional relationships, i.e., gene coexpression, evolutionary relationship and functional similarity, extracted from heterogeneous data sources. The accuracy and validation demonstrate the effectiveness and efficiency of our framework. CONCLUSIONS A protein interaction map of M. tuberculosis is inferred from genetic codons and interologs. The prediction accuracy and numerically experimental validation demonstrate the effectiveness and efficiency of our method. Furthermore, our methods can be straightforwardly extended to infer the protein interactions of other bacterial species.
Collapse
Affiliation(s)
- Zhi-Ping Liu
- Key Laboratory of Systems Biology, SIBS-Novo Nordisk Translational Research Centre for PreDiabetes, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.
| | | | | | | | | | | | | |
Collapse
|
142
|
Zhao Y, Wang J, Wang Y, Huang J. A comparative analysis of protein targets of withdrawn cardiovascular drugs in human and mouse. J Clin Bioinforma 2012; 2:10. [PMID: 22548699 PMCID: PMC3413526 DOI: 10.1186/2043-9113-2-10] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2011] [Accepted: 05/01/2012] [Indexed: 12/12/2022] Open
Abstract
Background Mouse is widely used in animal testing of cardiovascular disease. However, a large number of cardiovascular drugs that have been experimentally proved to work well on mouse were withdrawn because they caused adverse side effects in human. Methods In this study, we investigate whether binding patterns of withdrawn cardiovascular drugs are conserved between mouse and human through computational dockings and molecular dynamic simulations. In addition, we also measured the level of conservation of gene expression patterns of the drug targets and their interacting partners by analyzing the microarray data. Results The results show that target proteins of withdrawn cardiovascular drugs are functionally conserved between human and mouse. However, all the binding patterns of withdrawn drugs we retrieved show striking difference due to sequence divergence in drug-binding pocket, mainly through loss or gain of hydrogen bond donors and distinct drug-binding pockets. The binding affinities of withdrawn drugs to their receptors tend to be reduced from mouse to human. In contrast, the FDA-approved and best-selling drugs are little affected. Conclusions Our analysis suggests that sequence divergence in drug-binding pocket may be a reasonable explanation for the discrepancy of drug effects between animal models and human.
Collapse
Affiliation(s)
- Yuqi Zhao
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, 32, Eastern Jiaochang Road, Kunming, Yunnan, 650223, China.
| | | | | | | |
Collapse
|
143
|
Chen YA, Wen YC, Chang WC. AtPAN: an integrated system for reconstructing transcriptional regulatory networks in Arabidopsis thaliana. BMC Genomics 2012; 13:85. [PMID: 22397531 PMCID: PMC3314555 DOI: 10.1186/1471-2164-13-85] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2011] [Accepted: 03/08/2012] [Indexed: 11/25/2022] Open
Abstract
Background Construction of transcriptional regulatory networks (TRNs) is of priority concern in systems biology. Numerous high-throughput approaches, including microarray and next-generation sequencing, are extensively adopted to examine transcriptional expression patterns on the whole-genome scale; those data are helpful in reconstructing TRNs. Identifying transcription factor binding sites (TFBSs) in a gene promoter is the initial step in elucidating the transcriptional regulation mechanism. Since transcription factors usually co-regulate a common group of genes by forming regulatory modules with similar TFBSs. Therefore, the combinatorial interactions of transcription factors must be modeled to reconstruct the gene regulatory networks. Description For systems biology applications, this work develops a novel database called Arabidopsis thaliana Promoter Analysis Net (AtPAN), capable of detecting TFBSs and their corresponding transcription factors (TFs) in a promoter or a set of promoters in Arabidopsis. For further analysis, according to the microarray expression data and literature, the co-expressed TFs and their target genes can be retrieved from AtPAN. Additionally, proteins interacting with the co-expressed TFs are also incorporated to reconstruct co-expressed TRNs. Moreover, combinatorial TFs can be detected by the frequency of TFBSs co-occurrence in a group of gene promoters. In addition, TFBSs in the conserved regions between the two input sequences or homologous genes in Arabidopsis and rice are also provided in AtPAN. The output results also suggest conducting wet experiments in the future. Conclusions The AtPAN, which has a user-friendly input/output interface and provide graphical view of the TRNs. This novel and creative resource is freely available online at http://AtPAN.itps.ncku.edu.tw/.
Collapse
Affiliation(s)
- Yi-An Chen
- Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan
| | | | | |
Collapse
|
144
|
Planas-Iglesias J, Guney E, García-García J, Robertson KA, Raza S, Freeman TC, Ghazal P, Oliva B. Extending signaling pathways with protein-interaction networks. Application to apoptosis. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2012; 16:245-56. [PMID: 22385281 DOI: 10.1089/omi.2011.0130] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Cells exploit signaling pathways during responses to environmental changes, and these processes are often modulated during disease. Particularly, relevant human pathologies such as cancer or viral infections require downregulating apoptosis signaling pathways to progress. As a result, the identification of proteins responsible for these changes is essential for the diagnostics and development of therapeutics. Transferring functional annotation within protein interaction networks has proven useful to identify such proteins, although this is not a trivial task. Here, we used different scoring methods to transfer annotation from 53 well-studied members of the human apoptosis pathways (as known by 2005) to their protein interactors. All scoring methods produced significant predictions (compared to a random negative model), but its number was too large to be useful. Thus, we made a final prediction using specific combinations of scoring methods and compared it to the proteins related to apoptosis signaling pathways during the last 5 years. We propose 273 candidate proteins that may be relevant in apoptosis signaling pathways. Although some of them have known functions consistent with their proposed apoptotsis involvement, the majority have not been annotated yet, leaving room for further experimental studies. We provide our predictions at http://sbi.imim.es/web/Apoptosis.php.
Collapse
Affiliation(s)
- Joan Planas-Iglesias
- Structural Bioinformatics Group, GRIB, Universitat Pompeu Fabra, Barcelona, Catalonia, Spain
| | | | | | | | | | | | | | | |
Collapse
|
145
|
Koh GCKW, Porras P, Aranda B, Hermjakob H, Orchard SE. Analyzing protein-protein interaction networks. J Proteome Res 2012; 11:2014-31. [PMID: 22385417 DOI: 10.1021/pr201211w] [Citation(s) in RCA: 128] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The advent of the "omics" era in biology research has brought new challenges and requires the development of novel strategies to answer previously intractable questions. Molecular interaction networks provide a framework to visualize cellular processes, but their complexity often makes their interpretation an overwhelming task. The inherently artificial nature of interaction detection methods and the incompleteness of currently available interaction maps call for a careful and well-informed utilization of this valuable data. In this tutorial, we aim to give an overview of the key aspects that any researcher needs to consider when working with molecular interaction data sets and we outline an example for interactome analysis. Using the molecular interaction database IntAct, the software platform Cytoscape, and its plugins BiNGO and clusterMaker, and taking as a starting point a list of proteins identified in a mass spectrometry-based proteomics experiment, we show how to build, visualize, and analyze a protein-protein interaction network.
Collapse
Affiliation(s)
- Gavin C K W Koh
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | | | | | | |
Collapse
|
146
|
Abstract
MOTIVATION Protein-protein interactions play vital functional roles in various biological phenomena. Physical contacts between proteins have been revealed using experimental approaches that have solved the structures of protein complexes at atomic resolution. To examine the huge number of protein complexes available in the Protein Data Bank, an efficient automated method that compares protein complexes is required. RESULTS We have developed Structural Comparison of Protein Complexes (SCPC), a novel method to structurally compare protein complexes. SCPC compares the spatial arrangements of subunits in a complex with those in another complex using secondary structure elements. Similar substructures are detected in two protein complexes and the similarity is scored. SCPC was applied to dimers, homo-oligomers and haemoglobins. SCPC properly estimated structural similarities between the dimers examined as well as an existing method, MM-align. Conserved substructures were detected in a homo-tetramer and a homo-hexamer composed of homologous proteins. Classification of quaternary structures of haemoglobins using SCPC was consistent with the conventional classification. The results demonstrate that SCPC is a valuable tool to investigate the structures of protein complexes. AVAILABILITY SCPC is available at http://idp1.force.cs.is.nagoya-u.ac.jp/scpc/. CONTACT rkoike@is.nagoya-u.ac.jp SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ryotaro Koike
- Department of Complex Systems Science, Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan.
| | | |
Collapse
|
147
|
Desai K, Brott D, Hu X, Christianson A. A systems biology approach for detecting toxicity-related hotspots inside protein interaction networks. J Bioinform Comput Biol 2012; 9:647-62. [PMID: 21976381 DOI: 10.1142/s0219720011005707] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2011] [Accepted: 08/08/2011] [Indexed: 12/19/2022]
Abstract
Drug-induced neutropenia can be fatal when severe and therefore requires an improved understanding of its mechanism(s) of toxicity. Systems biology provides an opportunity to understand adverse events after drug administration using analysis of biomolecular networks. In this study, a human protein interaction network was analyzed to identify proteins that are most central to topological paths connecting a drug's target proteins to hematopoiesis-related proteins. For a set of non-immune neutropenia inducing drugs, 9 proteins were found to be common to putative signaling paths across all drugs evaluated. All 9 proteins showed relevance to neutrophil biology. Geneset enrichment analysis showed that proteins associated with cancer-related processes such as apoptosis provide topological linkages between drug targets and proteins involved in neutrophil production. The algorithm can be applied towards analysis of any toxicity where the drugs and the physiological processes involved in the toxic mechanism are known.
Collapse
|
148
|
Balaji S, Mcclendon C, Chowdhary R, Liu JS, Zhang J. IMID: integrated molecular interaction database. Bioinformatics 2012; 28:747-9. [PMID: 22238258 DOI: 10.1093/bioinformatics/bts010] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
MOTIVATION Molecular interaction information, such as protein-protein interactions and protein-small molecule interactions, is indispensable for understanding the mechanism of biological processes and discovering treatments for diseases. Many databases have been built by manual annotation of literature to organize such information into structured form. However, most databases focus on only one type of interactions, which are often not well annotated and integrated with related functional information. RESULTS In this study, we integrate molecular interaction information from literature by automatic information extraction and from manually annotated databases. We further integrate the relationships between protein/gene and other bio-entity terms including gene ontology terms, pathways, species and diseases to build an integrated molecular interaction database (IMID). Interactions can be selected by their associated probabilities. IMID allows complex and versatile queries for context-specific molecular interactions, which are not available currently in other molecular interaction databases. AVAILABILITY The database is located at www.integrativebiology.org.
Collapse
Affiliation(s)
- Sentil Balaji
- Department of Statistics, Florida State University, Tallahassee, FL 32306, USA
| | | | | | | | | |
Collapse
|
149
|
Wang F, Liu M, Song B, Li D, Pei H, Guo Y, Huang J, Zhang D. Prediction and characterization of protein-protein interaction networks in swine. Proteome Sci 2012; 10:2. [PMID: 22230699 PMCID: PMC3306829 DOI: 10.1186/1477-5956-10-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2011] [Accepted: 01/10/2012] [Indexed: 11/13/2022] Open
Abstract
Background Studying the large-scale protein-protein interaction (PPI) network is important in understanding biological processes. The current research presents the first PPI map of swine, which aims to give new insights into understanding their biological processes. Results We used three methods, Interolog-based prediction of porcine PPI network, domain-motif interactions from structural topology-based prediction of porcine PPI network and motif-motif interactions from structural topology-based prediction of porcine PPI network, to predict porcine protein interactions among 25,767 porcine proteins. We predicted 20,213, 331,484, and 218,705 porcine PPIs respectively, merged the three results into 567,441 PPIs, constructed four PPI networks, and analyzed the topological properties of the porcine PPI networks. Our predictions were validated with Pfam domain annotations and GO annotations. Averages of 70, 10,495, and 863 interactions were related to the Pfam domain-interacting pairs in iPfam database. For comparison, randomized networks were generated, and averages of only 4.24, 66.79, and 44.26 interactions were associated with Pfam domain-interacting pairs in iPfam database. In GO annotations, we found 52.68%, 75.54%, 27.20% of the predicted PPIs sharing GO terms respectively. However, the number of PPI pairs sharing GO terms in the 10,000 randomized networks reached 52.68%, 75.54%, 27.20% is 0. Finally, we determined the accuracy and precision of the methods. The methods yielded accuracies of 0.92, 0.53, and 0.50 at precisions of about 0.93, 0.74, and 0.75, respectively. Conclusion The results reveal that the predicted PPI networks are considerably reliable. The present research is an important pioneering work on protein function research. The porcine PPI data set, the confidence score of each interaction and a list of related data are available at (http://pppid.biositemap.com/).
Collapse
Affiliation(s)
- Fen Wang
- College of Veterinary Medicine, Northwest A&F University, Yangling, Shaanxi 712100, China.
| | | | | | | | | | | | | | | |
Collapse
|
150
|
Harris TJ. Adherens Junction Assembly and Function in the Drosophila Embryo. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2012; 293:45-83. [DOI: 10.1016/b978-0-12-394304-0.00007-5] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|