1
|
Alachram H, Chereda H, Beißbarth T, Wingender E, Stegmaier P. Text mining-based word representations for biomedical data analysis and protein-protein interaction networks in machine learning tasks. PLoS One 2021; 16:e0258623. [PMID: 34653224 PMCID: PMC8519453 DOI: 10.1371/journal.pone.0258623] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Accepted: 10/01/2021] [Indexed: 11/18/2022] Open
Abstract
Biomedical and life science literature is an essential way to publish experimental results. With the rapid growth of the number of new publications, the amount of scientific knowledge represented in free text is increasing remarkably. There has been much interest in developing techniques that can extract this knowledge and make it accessible to aid scientists in discovering new relationships between biological entities and answering biological questions. Making use of the word2vec approach, we generated word vector representations based on a corpus consisting of over 16 million PubMed abstracts. We developed a text mining pipeline to produce word2vec embeddings with different properties and performed validation experiments to assess their utility for biomedical analysis. An important pre-processing step consisted in the substitution of synonymous terms by their preferred terms in biomedical databases. Furthermore, we extracted gene-gene networks from two embedding versions and used them as prior knowledge to train Graph-Convolutional Neural Networks (CNNs) on large breast cancer gene expression data and on other cancer datasets. Performances of resulting models were compared to Graph-CNNs trained with protein-protein interaction (PPI) networks or with networks derived using other word embedding algorithms. We also assessed the effect of corpus size on the variability of word representations. Finally, we created a web service with a graphical and a RESTful interface to extract and explore relations between biomedical terms using annotated embeddings. Comparisons to biological databases showed that relations between entities such as known PPIs, signaling pathways and cellular functions, or narrower disease ontology groups correlated with higher cosine similarity. Graph-CNNs trained with word2vec-embedding-derived networks performed sufficiently good for the metastatic event prediction tasks compared to other networks. Such performance was good enough to validate the utility of our generated word embeddings in constructing biological networks. Word representations as produced by text mining algorithms like word2vec, therefore are able to capture biologically meaningful relations between entities. Our generated embeddings are publicly available at https://github.com/genexplain/Word2vec-based-Networks/blob/main/README.md.
Collapse
Affiliation(s)
- Halima Alachram
- Department of Medical Bioinformatics, University Medical Center, Göttingen, Lower Saxony, Germany
| | - Hryhorii Chereda
- Department of Medical Bioinformatics, University Medical Center, Göttingen, Lower Saxony, Germany
| | - Tim Beißbarth
- Department of Medical Bioinformatics, University Medical Center, Göttingen, Lower Saxony, Germany
| | | | | |
Collapse
|
2
|
Sukhorukov V, Nikiforov N, Kubekina M, Sobenin I, Foxx K, Stegmaier P, Pintus S, Stelmashenko D, Kel A, Manabe I, Oishi Y, Orekhov A. Signaling pathways potentially responsible for foam cell formation: Cholesterol accumulation or inflammatory response - What is primary? Atherosclerosis 2020. [DOI: 10.1016/j.atherosclerosis.2020.10.179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
3
|
Lloyd K, Papoutsopoulou S, Smith E, Stegmaier P, Bergey F, Morris L, Kittner M, England H, Spiller D, White MHR, Duckworth CA, Campbell BJ, Poroikov V, Martins Dos Santos VAP, Kel A, Muller W, Pritchard DM, Probert C, Burkitt MD. Using systems medicine to identify a therapeutic agent with potential for repurposing in inflammatory bowel disease. Dis Model Mech 2020; 13:dmm044040. [PMID: 32958515 PMCID: PMC7710021 DOI: 10.1242/dmm.044040] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Accepted: 09/08/2020] [Indexed: 12/11/2022] Open
Abstract
Inflammatory bowel diseases (IBDs) cause significant morbidity and mortality. Aberrant NF-κB signalling is strongly associated with these conditions, and several established drugs influence the NF-κB signalling network to exert their effect. This study aimed to identify drugs that alter NF-κB signalling and could be repositioned for use in IBD. The SysmedIBD Consortium established a novel drug-repurposing pipeline based on a combination of in silico drug discovery and biological assays targeted at demonstrating an impact on NF-κB signalling, and a murine model of IBD. The drug discovery algorithm identified several drugs already established in IBD, including corticosteroids. The highest-ranked drug was the macrolide antibiotic clarithromycin, which has previously been reported to have anti-inflammatory effects in aseptic conditions. The effects of clarithromycin effects were validated in several experiments: it influenced NF-κB-mediated transcription in murine peritoneal macrophages and intestinal enteroids; it suppressed NF-κB protein shuttling in murine reporter enteroids; it suppressed NF-κB (p65) DNA binding in the small intestine of mice exposed to lipopolysaccharide; and it reduced the severity of dextran sulphate sodium-induced colitis in C57BL/6 mice. Clarithromycin also suppressed NF-κB (p65) nuclear translocation in human intestinal enteroids. These findings demonstrate that in silico drug repositioning algorithms can viably be allied to laboratory validation assays in the context of IBD, and that further clinical assessment of clarithromycin in the management of IBD is required.This article has an associated First Person interview with the joint first authors of the paper.
Collapse
Affiliation(s)
- Katie Lloyd
- Department of Cellular and Molecular Physiology, University of Liverpool, Liverpool L69 3GE, UK
| | - Stamatia Papoutsopoulou
- Department of Cellular and Molecular Physiology, University of Liverpool, Liverpool L69 3GE, UK
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK
| | - Emily Smith
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK
| | | | | | | | | | - Hazel England
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK
| | - Dave Spiller
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK
| | - Mike H R White
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK
| | - Carrie A Duckworth
- Department of Cellular and Molecular Physiology, University of Liverpool, Liverpool L69 3GE, UK
| | - Barry J Campbell
- Department of Cellular and Molecular Physiology, University of Liverpool, Liverpool L69 3GE, UK
| | | | | | | | - Werner Muller
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK
| | - D Mark Pritchard
- Department of Cellular and Molecular Physiology, University of Liverpool, Liverpool L69 3GE, UK
| | - Chris Probert
- Department of Cellular and Molecular Physiology, University of Liverpool, Liverpool L69 3GE, UK
| | - Michael D Burkitt
- Department of Cellular and Molecular Physiology, University of Liverpool, Liverpool L69 3GE, UK
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester M13 9PL, UK
| |
Collapse
|
4
|
Orekhov AN, Nikiforov NG, Sukhorukov VN, Kubekina MV, Sobenin IA, Wu WK, Foxx KK, Pintus S, Stegmaier P, Stelmashenko D, Kel A, Gratchev AN, Melnichenko AA, Wetzker R, Summerhill VI, Manabe I, Oishi Y. Role of Phagocytosis in the Pro-Inflammatory Response in LDL-Induced Foam Cell Formation; a Transcriptome Analysis. Int J Mol Sci 2020; 21:ijms21030817. [PMID: 32012706 PMCID: PMC7037225 DOI: 10.3390/ijms21030817] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 01/14/2020] [Accepted: 01/23/2020] [Indexed: 12/25/2022] Open
Abstract
Excessive accumulation of lipid inclusions in the arterial wall cells (foam cell formation) caused by modified low-density lipoprotein (LDL) is the earliest and most noticeable manifestation of atherosclerosis. The mechanisms of foam cell formation are not fully understood and can involve altered lipid uptake, impaired lipid metabolism, or both. Recently, we have identified the top 10 master regulators that were involved in the accumulation of cholesterol in cultured macrophages induced by the incubation with modified LDL. It was found that most of the identified master regulators were related to the regulation of the inflammatory immune response, but not to lipid metabolism. A possible explanation for this unexpected result is a stimulation of the phagocytic activity of macrophages by modified LDL particle associates that have a relatively large size. In the current study, we investigated gene regulation in macrophages using transcriptome analysis to test the hypothesis that the primary event occurring upon the interaction of modified LDL and macrophages is the stimulation of phagocytosis, which subsequently triggers the pro-inflammatory immune response. We identified genes that were up- or downregulated following the exposure of cultured cells to modified LDL or latex beads (inert phagocytosis stimulators). Most of the identified master regulators were involved in the innate immune response, and some of them were encoding major pro-inflammatory proteins. The obtained results indicated that pro-inflammatory response to phagocytosis stimulation precedes the accumulation of intracellular lipids and possibly contributes to the formation of foam cells. In this way, the currently recognized hypothesis that the accumulation of lipids triggers the pro-inflammatory response was not confirmed. Comparative analysis of master regulators revealed similarities in the genetic regulation of the interaction of macrophages with naturally occurring LDL and desialylated LDL. Oxidized and desialylated LDL affected a different spectrum of genes than naturally occurring LDL. These observations suggest that desialylation is the most important modification of LDL occurring in vivo. Thus, modified LDL caused the gene regulation characteristic of the stimulation of phagocytosis. Additionally, the knock-down effect of five master regulators, such as IL15, EIF2AK3, F2RL1, TSPYL2, and ANXA1, on intracellular lipid accumulation was tested. We knocked down these genes in primary macrophages derived from human monocytes. The addition of atherogenic naturally occurring LDL caused a significant accumulation of cholesterol in the control cells. The knock-down of the EIF2AK3 and IL15 genes completely prevented cholesterol accumulation in cultured macrophages. The knock-down of the ANXA1 gene caused a further decrease in cholesterol content in cultured macrophages. At the same time, knock-down of F2RL1 and TSPYL2 did not cause an effect. The results obtained allowed us to explain in which way the inflammatory response and the accumulation of cholesterol are related confirming our hypothesis of atherogenesis development based on the following viewpoints: LDL particles undergo atherogenic modifications that, in turn, accompanied by the formation of self-associates; large LDL associates stimulate phagocytosis; as a result of phagocytosis stimulation, pro-inflammatory molecules are secreted; these molecules cause or at least contribute to the accumulation of intracellular cholesterol. Therefore, it became obvious that the primary event in this sequence is not the accumulation of cholesterol but an inflammatory response.
Collapse
Affiliation(s)
- Alexander N. Orekhov
- Laboratory of Angiopathology, Institute of General Pathology and Pathophysiology, 8 Baltiiskaya Street, 125315 Moscow, Russia
- Laboratory of Infection Pathology and Molecular Microecology, Institute of Human Morphology, 3 Tsyurupa Street, 117418 Moscow, Russia
- Correspondence: (A.N.O.); (V.I.S.)
| | - Nikita G. Nikiforov
- Laboratory of Angiopathology, Institute of General Pathology and Pathophysiology, 8 Baltiiskaya Street, 125315 Moscow, Russia
- Laboratory of Medical Genetics, Institute of Experimental Cardiology, National Medical Research Center of Cardiology, 15A 3-rd Cherepkovskaya Street, 121552 Moscow, Russia
- Centre of Collective Usage, Institute of Gene Biology, Russian Academy of Sciences, 34/5 Vavilova Street, 119334 Moscow, Russia
| | - Vasily N. Sukhorukov
- Laboratory of Angiopathology, Institute of General Pathology and Pathophysiology, 8 Baltiiskaya Street, 125315 Moscow, Russia
- Laboratory of Infection Pathology and Molecular Microecology, Institute of Human Morphology, 3 Tsyurupa Street, 117418 Moscow, Russia
| | - Marina V. Kubekina
- Centre of Collective Usage, Institute of Gene Biology, Russian Academy of Sciences, 34/5 Vavilova Street, 119334 Moscow, Russia
| | - Igor A. Sobenin
- Laboratory of Angiopathology, Institute of General Pathology and Pathophysiology, 8 Baltiiskaya Street, 125315 Moscow, Russia
- Laboratory of Medical Genetics, Institute of Experimental Cardiology, National Medical Research Center of Cardiology, 15A 3-rd Cherepkovskaya Street, 121552 Moscow, Russia
| | - Wei-Kai Wu
- Department of Internal Medicine, National Taiwan University Hospital, Bei-Hu Branch, Taipei 10002, Taiwan
| | - Kathy K. Foxx
- Kalen Biomedical, LLC, Montgomery Village, MD 20886, USA
| | - Sergey Pintus
- BIOSOFT.RU, LLC, 630090 Novosibirsk, Russia
- Institute of Computational Technologies, 630090 Novosibirsk, Russia
| | | | - Daria Stelmashenko
- BIOSOFT.RU, LLC, 630090 Novosibirsk, Russia
- geneXplain GmbH, 38302 Wolfenbüttel, Germany
| | - Alexander Kel
- BIOSOFT.RU, LLC, 630090 Novosibirsk, Russia
- geneXplain GmbH, 38302 Wolfenbüttel, Germany
- Institute of Chemical Biology and Fundamental Medicine, 630090 Novosibirsk, Russia
| | - Alexei N. Gratchev
- N. N. Blokhin National Medical Research Center of Oncology, 24 Kashirskoye sh., 115478 Moscow, Russia
| | - Alexandra A. Melnichenko
- Laboratory of Angiopathology, Institute of General Pathology and Pathophysiology, 8 Baltiiskaya Street, 125315 Moscow, Russia
- Laboratory of Medical Genetics, Institute of Experimental Cardiology, National Medical Research Center of Cardiology, 15A 3-rd Cherepkovskaya Street, 121552 Moscow, Russia
| | - Reinhard Wetzker
- Department of Anaesthesiology and Intensive Care Medicine, University Hospital Jena, Am Klinikum 1, D-07747 Jena, Germany
| | - Volha I. Summerhill
- Department of Basic Research, Institute for Atherosclerosis Research, 121609 Moscow, Russia
- Correspondence: (A.N.O.); (V.I.S.)
| | - Ichiro Manabe
- Department of Aging Research, Graduate School of Medicine, Chiba University, Chiba 263-8522, Japan
| | - Yumiko Oishi
- Department of Biochemistry & Molecular Biology, Nippon Medical School, Tokyo 113-8602, Japan
| |
Collapse
|
5
|
|
6
|
Kel A, Boyarskikh U, Stegmaier P, Leskov LS, Sokolov AV, Yevshin I, Mandrik N, Stelmashenko D, Koschmann J, Kel-Margoulis O, Krull M, Martínez-Cardús A, Moran S, Esteller M, Kolpakov F, Filipenko M, Wingender E. Walking pathways with positive feedback loops reveal DNA methylation biomarkers of colorectal cancer. BMC Bioinformatics 2019; 20:119. [PMID: 30999858 PMCID: PMC6471696 DOI: 10.1186/s12859-019-2687-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND The search for molecular biomarkers of early-onset colorectal cancer (CRC) is an important but still quite challenging and unsolved task. Detection of CpG methylation in human DNA obtained from blood or stool has been proposed as a promising approach to a noninvasive early diagnosis of CRC. Thousands of abnormally methylated CpG positions in CRC genomes are often located in non-coding parts of genes. Novel bioinformatic methods are thus urgently needed for multi-omics data analysis to reveal causative biomarkers with a potential driver role in early stages of cancer. METHODS We have developed a method for finding potential causal relationships between epigenetic changes (DNA methylations) in gene regulatory regions that affect transcription factor binding sites (TFBS) and gene expression changes. This method also considers the topology of the involved signal transduction pathways and searches for positive feedback loops that may cause the carcinogenic aberrations in gene expression. We call this method "Walking pathways", since it searches for potential rewiring mechanisms in cancer pathways due to dynamic changes in the DNA methylation status of important gene regulatory regions ("epigenomic walking"). RESULTS In this paper, we analysed an extensive collection of full genome gene-expression data (RNA-seq) and DNA methylation data of genomic CpG islands (using Illumina methylation arrays) generated from a sample of tumor and normal gut epithelial tissues of 300 patients with colorectal cancer (at different stages of the disease) (data generated in the EU-supported SysCol project). Identification of potential epigenetic biomarkers of DNA methylation was performed using the fully automatic multi-omics analysis web service "My Genome Enhancer" (MGE) (my-genome-enhancer.com). MGE uses the database on gene regulation TRANSFAC®, the signal transduction pathways database TRANSPATH®, and software that employs AI (artificial intelligence) methods for the analysis of cancer-specific enhancers. CONCLUSIONS The identified biomarkers underwent experimental testing on an independent set of blood samples from patients with colorectal cancer. As a result, using advanced methods of statistics and machine learning, a minimum set of 6 biomarkers was selected, which together achieve the best cancer detection potential. The markers include hypermethylated positions in regulatory regions of the following genes: CALCA, ENO1, MYC, PDX1, TCF7, ZNF43.
Collapse
Affiliation(s)
- Alexander Kel
- Institute of Chemical Biology and Fundamental Medicine, SBRAN, Novosibirsk, 630090, Russia. .,Biosoft.ru, Ltd, Novosibirsk, 630090, Russia. .,geneXplain GmbH, 38302, Wolfenbüttel, Germany.
| | - Ulyana Boyarskikh
- Institute of Chemical Biology and Fundamental Medicine, SBRAN, Novosibirsk, 630090, Russia
| | | | | | | | | | | | | | | | | | | | - Anna Martínez-Cardús
- Cancer Epigenetics and Biology Program (PEBC), Bellvitge Biomedical Research Institute (IDIBELL), 08908, Barcelona, Spain
| | - Sebastian Moran
- Cancer Epigenetics and Biology Program (PEBC), Bellvitge Biomedical Research Institute (IDIBELL), 08908, Barcelona, Spain
| | - Manel Esteller
- Cancer Epigenetics and Biology Program (PEBC), Bellvitge Biomedical Research Institute (IDIBELL), 08908, Barcelona, Spain.,Centro de Investigacion Biomedica en Red Cancer (CIBERONC), 28029, Madrid, Spain.,Physiological Sciences Department, School of Medicine and Health Sciences, University of Barcelona (UB), 08010, Barcelona, Spain.,Institucio Catalana de Recerca i Estudis Avançats (ICREA), 08010, Barcelona, Spain
| | - Fedor Kolpakov
- Biosoft.ru, Ltd, Novosibirsk, 630090, Russia.,Institute of Computational Technologies SB RAS, Novosibirsk, 630090, Russia
| | - Maxim Filipenko
- Institute of Chemical Biology and Fundamental Medicine, SBRAN, Novosibirsk, 630090, Russia
| | - Edgar Wingender
- geneXplain GmbH, 38302, Wolfenbüttel, Germany.,Institute of Bioinformatics, University Medical Center Göttingen (UMG), Göttingen, 37077, Germany
| |
Collapse
|
7
|
Smetanina MA, Kel AE, Sevost'ianova KS, Maiborodin IV, Shevela AI, Zolotukhin IA, Stegmaier P, Filipenko ML. DNA methylation and gene expression profiling reveal MFAP5 as a regulatory driver of extracellular matrix remodeling in varicose vein disease. Epigenomics 2018; 10:1103-1119. [PMID: 30070582 DOI: 10.2217/epi-2018-0001] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
AIM To integrate transcriptomic and DNA-methylomic measurements on varicose versus normal veins using a systems biological analysis to shed light on the interplay between genetic and epigenetic factors. MATERIALS & METHODS Differential expression and methylation were measured using microarrays, supported by real-time quantitative PCR and immunohistochemistry confirmation for relevant gene products. A systems biological 'upstream analysis' was further applied. RESULTS We identified several potential key players contributing to extracellular matrix remodeling in varicose veins. Specifically, our analysis suggests MFAP5 acting as a master regulator, upstream of integrins, of the cellular network affecting the varicose vein condition. Possible mechanism and pathogenic model were outlined. CONCLUSION A coherent model proposed incorporates the relevant signaling networks and will hopefully aid further studies on varicose vein pathogenesis.
Collapse
Affiliation(s)
- Mariya A Smetanina
- Laboratory of Pharmacogenomics, Institute of Chemical Biology & Fundamental Medicine, Novosibirsk 630090, Russia.,Department of Fundamental Medicine, Novosibirsk State University, Novosibirsk 630090, Russia
| | - Alexander E Kel
- Laboratory of Pharmacogenomics, Institute of Chemical Biology & Fundamental Medicine, Novosibirsk 630090, Russia.,Department of Research & Development, geneXplain GmbH, Wolfenbüttel D-38302, Germany
| | - Ksenia S Sevost'ianova
- Department of Fundamental Medicine, Novosibirsk State University, Novosibirsk 630090, Russia.,Center of New Medical Technologies, Institute of Chemical Biology & Fundamental Medicine, Novosibirsk 630090, Russia
| | - Igor V Maiborodin
- Stem Cell Laboratory, Institute of Chemical Biology & Fundamental Medicine, Novosibirsk 630090, Russia
| | - Andrey I Shevela
- Department of Fundamental Medicine, Novosibirsk State University, Novosibirsk 630090, Russia.,Center of New Medical Technologies, Institute of Chemical Biology & Fundamental Medicine, Novosibirsk 630090, Russia
| | - Igor A Zolotukhin
- Laboratory of Pharmacogenomics, Institute of Chemical Biology & Fundamental Medicine, Novosibirsk 630090, Russia.,Chair of Faculty Surgery of the Medical Department, Pirogov Russian National Research Medical University, Moscow 117997, Russia
| | - Philip Stegmaier
- Department of Research & Development, geneXplain GmbH, Wolfenbüttel D-38302, Germany
| | - Maxim L Filipenko
- Laboratory of Pharmacogenomics, Institute of Chemical Biology & Fundamental Medicine, Novosibirsk 630090, Russia.,Department of Fundamental Medicine, Novosibirsk State University, Novosibirsk 630090, Russia
| |
Collapse
|
8
|
Boyarskikh U, Pintus S, Mandrik N, Stelmashenko D, Kiselev I, Evshin I, Sharipov R, Stegmaier P, Kolpakov F, Filipenko M, Kel A. Computational master-regulator search reveals mTOR and PI3K pathways responsible for low sensitivity of NCI-H292 and A427 lung cancer cell lines to cytotoxic action of p53 activator Nutlin-3. BMC Med Genomics 2018; 11:12. [PMID: 29504919 PMCID: PMC5836833 DOI: 10.1186/s12920-018-0330-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Small molecule Nutlin-3 reactivates p53 in cancer cells by interacting with the complex between p53 and its repressor Mdm-2 and causing an increase in cancer cell apoptosis. Therefore, Nutlin-3 has potent anticancer properties. Clinical and experimental studies of Nutlin-3 showed that some cancer cells may lose sensitivity to this compound. Here we analyze possible mechanisms for insensitivity of cancer cells to Nutlin-3. METHODS We applied upstream analysis approach implemented in geneXplain platform ( genexplain.com ) using TRANSFAC® database of transcription factors and their binding sites in genome and using TRANSPATH® database of signal transduction network with associated software such as Match™ and Composite Module Analyst (CMA). RESULTS Using genome-wide gene expression profiling we compared several lung cancer cell lines and showed that expression programs executed in Nutlin-3 insensitive cell lines significantly differ from that of Nutlin-3 sensitive cell lines. Using artificial intelligence approach embed in CMA software, we identified a set of transcription factors cooperatively binding to the promoters of genes up-regulated in the Nutlin-3 insensitive cell lines. Graph analysis of signal transduction network upstream of these transcription factors allowed us to identify potential master-regulators responsible for maintaining such low sensitivity to Nutlin-3 with the most promising candidate mTOR, which acts in the context of activated PI3K pathway. These finding were validated experimentally using an array of chemical inhibitors. CONCLUSIONS We showed that the Nutlin-3 insensitive cell lines are actually highly sensitive to the dual PI3K/mTOR inhibitor NVP-BEZ235, while no responding to either PI3K -specific LY294002 nor Bcl-XL specific 2,3-DCPE compounds.
Collapse
Affiliation(s)
- Ulyana Boyarskikh
- Institute of Chemical Biology and Fundamental Medicine, SBRAN, Novosibirsk, Russia
| | | | | | | | | | | | | | | | | | - Maxim Filipenko
- Institute of Chemical Biology and Fundamental Medicine, SBRAN, Novosibirsk, Russia
| | - Alexander Kel
- Institute of Chemical Biology and Fundamental Medicine, SBRAN, Novosibirsk, Russia.
- Biosoft.ru, Ltd, Novosibirsk, Russia.
- geneXplain GmbH, D-38302, Wolfenbüttel, Germany.
| |
Collapse
|
9
|
Kel AE, Stegmaier P, Valeev T, Koschmann J, Poroikov V, Kel-Margoulis OV, Wingender E. Multi-omics "upstream analysis" of regulatory genomic regions helps identifying targets against methotrexate resistance of colon cancer. EuPA Open Proteom 2016; 13:1-13. [PMID: 29900117 PMCID: PMC5988513 DOI: 10.1016/j.euprot.2016.09.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Revised: 09/05/2016] [Accepted: 09/08/2016] [Indexed: 11/25/2022]
Abstract
Upstream analysis strategy for multi-omics data is proposed. Drug targets are predicted by search for TFBS and analysis of signaling network. Methotrexate resistance data include transcriptomics, proteomics and epigenomics. Predicted targets are: TGFalpha, IGFBP7, alpha9-integrin. Predicted drugs are: zardaverine, divalproex and human metabolite nicotinamide N-oxide.
We present an “upstream analysis” strategy for causal analysis of multiple “-omics” data. It analyzes promoters using the TRANSFAC database, combines it with an analysis of the upstream signal transduction pathways and identifies master regulators as potential drug targets for a pathological process. We applied this approach to a complex multi-omics data set that contains transcriptomics, proteomics and epigenomics data. We identified the following potential drug targets against induced resistance of cancer cells towards chemotherapy by methotrexate (MTX): TGFalpha, IGFBP7, alpha9-integrin, and the following chemical compounds: zardaverine and divalproex as well as human metabolites such as nicotinamide N-oxide.
Collapse
Affiliation(s)
- Alexander E Kel
- Institute of Chemical Biology and Fundamental Medicine, SBRAS, Novosibirsk, Russia.,Biosoft.ru, Ltd, Novosibirsk, Russia.,geneXplain GmbH, D-38302 Wolfenbüttel, Germany
| | | | - Tagir Valeev
- Biosoft.ru, Ltd, Novosibirsk, Russia.,A.P. Ershov Institute of Informatics Systems, SB RAS, Novosibirsk, Russia
| | | | | | | | - Edgar Wingender
- geneXplain GmbH, D-38302 Wolfenbüttel, Germany.,Institute of Bioinformatics, University Medical Center Göttingen, D-37077 Göttingen, Germany
| |
Collapse
|
10
|
Kirste S, Bell E, Fleming J, Stegmaier P, Drendel V, Mo X, Ling S, Fabian D, Jilg C, Schultze-Seemann W, Zynger D, Martin D, White J, Werner M, Chakravarti A, Grosu A. PO-0731: A miRNA-based predictive model in prostate cancer patients. Radiother Oncol 2015. [DOI: 10.1016/s0167-8140(15)40723-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
11
|
Matuschek C, Boelke E, Belka C, Ganswindt U, Henke M, Stegmaier P, Bamberg M, Welz S, Debus J, Budach W. Feasibility of 6 Months Maintenance Cetuximab After Adjuvant Concurrent Chemoradiation Plus Cetuximab in Squamous Cell Carcinoma of the Head and Neck. Int J Radiat Oncol Biol Phys 2013. [DOI: 10.1016/j.ijrobp.2013.06.1222] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
12
|
Stegmaier P, Kel A, Wingender E, Borlak J. A discriminative approach for unsupervised clustering of DNA sequence motifs. PLoS Comput Biol 2013; 9:e1002958. [PMID: 23555204 PMCID: PMC3605052 DOI: 10.1371/journal.pcbi.1002958] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2012] [Accepted: 01/15/2013] [Indexed: 12/03/2022] Open
Abstract
Algorithmic comparison of DNA sequence motifs is a problem in bioinformatics that has received increased attention during the last years. Its main applications concern characterization of potentially novel motifs and clustering of a motif collection in order to remove redundancy. Despite growing interest in motif clustering, the question which motif clusters to aim at has so far not been systematically addressed. Here we analyzed motif similarities in a comprehensive set of vertebrate transcription factor classes. For this we developed enhanced similarity scores by inclusion of the information coverage (IC) criterion, which evaluates the fraction of information an alignment covers in aligned motifs. A network-based method enabled us to identify motif clusters with high correspondence to DNA-binding domain phylogenies and prior experimental findings. Based on this analysis we derived a set of motif families representing distinct binding specificities. These motif families were used to train a classifier which was further integrated into a novel algorithm for unsupervised motif clustering. Application of the new algorithm demonstrated its superiority to previously published methods and its ability to reproduce entrained motif families. As a result, our work proposes a probabilistic approach to decide whether two motifs represent common or distinct binding specificities. Transcription factors play a central role in the regulation of gene expression. Their interaction with specific elements in the DNA mediates dynamic changes in transcriptional activity. Databases store a growing number of known DNA sequence patterns, also denoted as DNA sequence motifs that are recognized by transcription factors. Such databases can be searched to find a match for a newly discovered pattern and that way identify the potential binding factor. It is also of interest to cluster motifs in order to examine which transcription factors have similar binding properties and, thus, may promiscuously bind to each other's sites, or how many distinct specificities have been described. To gain deeper insight into the similarities between DNA sequence motifs, we analyzed a comprehensive set of known motifs. For this purpose we devised a network-based approach that enabled us to identify clusters of related motifs that largely coincided with grouping of related TFs on the basis of protein similarity. On the basis of these results, we were able to predict whether two motifs belong to the same subgroup and constructed a novel, fully-automated method for motif clustering, which enables users to assess the similarity of a newly found motif with all known motifs in the collection.
Collapse
|
13
|
Stegmaier P, Momm F, Duncker-Rohr V, Schimek-Jasch T, Grosu A, Henke M. Xerostomia: Association Between Saliva Production and Patient-reported Grading. Int J Radiat Oncol Biol Phys 2012. [DOI: 10.1016/j.ijrobp.2012.07.1336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
14
|
Gabdoulline R, Eckweiler D, Kel A, Stegmaier P. 3DTF: a web server for predicting transcription factor PWMs using 3D structure-based energy calculations. Nucleic Acids Res 2012; 40:W180-5. [PMID: 22693215 PMCID: PMC3394331 DOI: 10.1093/nar/gks551] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
We present the webserver 3D transcription factor (3DTF) to compute position-specific weight matrices (PWMs) of transcription factors using a knowledge-based statistical potential derived from crystallographic data on protein–DNA complexes. Analysis of available structures that can be used to construct PWMs shows that there are hundreds of 3D structures from which PWMs could be derived, as well as thousands of proteins homologous to these. Therefore, we created 3DTF, which delivers binding matrices given the experimental or modeled protein–DNA complex. The webserver can be used by biologists to derive novel PWMs for transcription factors lacking known binding sites and is freely accessible at http://www.gene-regulation.com/pub/programs/3dtf/.
Collapse
Affiliation(s)
- R Gabdoulline
- Heinrich-Heine University of Duesseldorf, Universitaetstr. 1, 40225 Duesseldorf, Germany
| | | | | | | |
Collapse
|
15
|
Stegmaier P, Voss N, Meier T, Kel A, Wingender E, Borlak J. Advanced computational biology methods identify molecular switches for malignancy in an EGF mouse model of liver cancer. PLoS One 2011; 6:e17738. [PMID: 21464922 PMCID: PMC3065454 DOI: 10.1371/journal.pone.0017738] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2010] [Accepted: 02/09/2011] [Indexed: 01/04/2023] Open
Abstract
The molecular causes by which the epidermal growth factor receptor tyrosine kinase induces malignant transformation are largely unknown. To better understand EGFs' transforming capacity whole genome scans were applied to a transgenic mouse model of liver cancer and subjected to advanced methods of computational analysis to construct de novo gene regulatory networks based on a combination of sequence analysis and entrained graph-topological algorithms. Here we identified transcription factors, processes, key nodes and molecules to connect as yet unknown interacting partners at the level of protein-DNA interaction. Many of those could be confirmed by electromobility band shift assay at recognition sites of gene specific promoters and by western blotting of nuclear proteins. A novel cellular regulatory circuitry could therefore be proposed that connects cell cycle regulated genes with components of the EGF signaling pathway. Promoter analysis of differentially expressed genes suggested the majority of regulated transcription factors to display specificity to either the pre-tumor or the tumor state. Subsequent search for signal transduction key nodes upstream of the identified transcription factors and their targets suggested the insulin-like growth factor pathway to render the tumor cells independent of EGF receptor activity. Notably, expression of IGF2 in addition to many components of this pathway was highly upregulated in tumors. Together, we propose a switch in autocrine signaling to foster tumor growth that was initially triggered by EGF and demonstrate the knowledge gain form promoter analysis combined with upstream key node identification.
Collapse
Affiliation(s)
| | - Nico Voss
- BIOBASE GmbH, Wolfenbuettel, Germany
| | - Tatiana Meier
- Department Molecular Medicine and Medical Biotechnology, Fraunhofer Institute of Toxicology and Experimental Medicine, Hannover, Germany
- Centre for Pharmacology and Toxicology, Hannover Medical School, Hannover, Germany
| | - Alexander Kel
- BIOBASE GmbH, Wolfenbuettel, Germany
- GeneXplain GmbH, Wolfenbuettel, Germany
- Institute of Chemical Biology and Fundamental Medicine, Novosibirsk, Russia
| | - Edgar Wingender
- BIOBASE GmbH, Wolfenbuettel, Germany
- GeneXplain GmbH, Wolfenbuettel, Germany
- Department of Bioinformatics, University of Goettingen, Goettingen, Germany
| | - Juergen Borlak
- Department Molecular Medicine and Medical Biotechnology, Fraunhofer Institute of Toxicology and Experimental Medicine, Hannover, Germany
- Centre for Pharmacology and Toxicology, Hannover Medical School, Hannover, Germany
- * E-mail:
| |
Collapse
|
16
|
Stegmaier P, Krull M, Voss N, Kel AE, Wingender E. Molecular mechanistic associations of human diseases. BMC Syst Biol 2010; 4:124. [PMID: 20815942 PMCID: PMC2946303 DOI: 10.1186/1752-0509-4-124] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2009] [Accepted: 09/06/2010] [Indexed: 01/05/2023]
Abstract
Background The study of relationships between human diseases provides new possibilities for biomedical research. Recent achievements on human genetic diseases have stimulated interest to derive methods to identify disease associations in order to gain further insight into the network of human diseases and to predict disease genes. Results Using about 10000 manually collected causal disease/gene associations, we developed a statistical approach to infer meaningful associations between human morbidities. The derived method clustered cardiometabolic and endocrine disorders, immune system-related diseases, solid tissue neoplasms and neurodegenerative pathologies into prominent disease groups. Analysis of biological functions confirmed characteristic features of corresponding disease clusters. Inference of disease associations was further employed as a starting point for prediction of disease genes. Efforts were made to underpin the validity of results by relevant literature evidence. Interestingly, many inferred disease relationships correspond to known clinical associations and comorbidities, and several predicted disease genes were subjects of therapeutic target research. Conclusions Causal molecular mechanisms present a unifying principle to derive methods for disease classification, analysis of clinical disorder associations, and prediction of disease genes. According to the definition of causal disease genes applied in this study, these results are not restricted to genetic disease/gene relationships. This may be particularly useful for the study of long-term or chronic illnesses, where pathological derangement due to environmental or as part of sequel conditions is of importance and may not be fully explained by genetic background.
Collapse
Affiliation(s)
- Philip Stegmaier
- BIOBASE GmbH, Halchtersche Strasse 33, D-38304 Wolfenbüttel, Germany.
| | | | | | | | | |
Collapse
|
17
|
Alamanova D, Stegmaier P, Kel A. Creating PWMs of transcription factors using 3D structure-based computation of protein-DNA free binding energies. BMC Bioinformatics 2010; 11:225. [PMID: 20438625 PMCID: PMC2879287 DOI: 10.1186/1471-2105-11-225] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2009] [Accepted: 05/03/2010] [Indexed: 12/03/2022] Open
Abstract
Background Knowledge of transcription factor-DNA binding patterns is crucial for understanding gene transcription. Numerous DNA-binding proteins are annotated as transcription factors in the literature, however, for many of them the corresponding DNA-binding motifs remain uncharacterized. Results The position weight matrices (PWMs) of transcription factors from different structural classes have been determined using a knowledge-based statistical potential. The scoring function calibrated against crystallographic data on protein-DNA contacts recovered PWMs of various members of widely studied transcription factor families such as p53 and NF-κB. Where it was possible, extensive comparison to experimental binding affinity data and other physical models was made. Although the p50p50, p50RelB, and p50p65 dimers belong to the same family, particular differences in their PWMs were detected, thereby suggesting possibly different in vivo binding modes. The PWMs of p63 and p73 were computed on the basis of homology modeling and their performance was studied using upstream sequences of 85 p53/p73-regulated human genes. Interestingly, about half of the p63 and p73 hits reported by the Match algorithm in the altogether 126 promoters lay more than 2 kb upstream of the corresponding transcription start sites, which deviates from the common assumption that most regulatory sites are located more proximal to the TSS. The fact that in most of the cases the binding sites of p63 and p73 did not overlap with the p53 sites suggests that p63 and p73 could influence the p53 transcriptional activity cooperatively. The newly computed p50p50 PWM recovered 5 more experimental binding sites than the corresponding TRANSFAC matrix, while both PWMs showed comparable receiver operator characteristics. Conclusions A novel algorithm was developed to calculate position weight matrices from protein-DNA complex structures. The proposed algorithm was extensively validated against experimental data. The method was further combined with Homology Modeling to obtain PWMs of factors for which crystallographic complexes with DNA are not yet available. The performance of PWMs obtained in this work in comparison to traditionally constructed matrices demonstrates that the structure-based approach presents a promising alternative to experimental determination of transcription factor binding properties.
Collapse
Affiliation(s)
- Denitsa Alamanova
- BIOBASE GmbH, Halchtersche Strasse 33, D-38304 Wolfenbuettel, Germany.
| | | | | |
Collapse
|
18
|
Kel A, Voss N, Valeev T, Stegmaier P, Kel-Margoulis O, Wingender E. ExPlain: finding upstream drug targets in disease gene regulatory networks. SAR QSAR Environ Res 2008; 19:481-494. [PMID: 18853298 DOI: 10.1080/10629360802083806] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Different signal transduction pathways leading to the activation of transcription factors (TFs) converge at key molecules that master the regulation of many cellular processes. Such crossroads of signalling networks often appear as "Achilles Heels" causing a disease when not functioning properly. Novel computational tools are needed for analysis of the gene expression data in the context of signal transduction and gene regulatory pathways and for identification of the key nodes in the networks. An integrated computational system, ExPlain (www.biobase.de) was developed for causal interpretation of gene expression data and identification of key signalling molecules. The system utilizes data from two databases (TRANSFAC and TRANSPATH) and integrates two programs: (1) Composite Module Analyst (CMA) analyses 5'-upstream regions of co-expressed genes and applies a genetic algorithm to reveal composite modules (CMs) consisting of co-occurring single TF binding sites and composite elements; (2) ArrayAnalyzer is a fast network search engine that analyses signal transduction networks controlling the activities of the corresponding TFs and seeks key molecules responsible for the observed concerted gene activation. ExPlain system was applied to microarray data on inflammatory bowel diseases (IBD). The results obtained suggest a number of highly interesting biological hypotheses about molecular mechanisms of pathological genetic disregulation.
Collapse
Affiliation(s)
- A Kel
- BIOBASE GmbH, Wolfenbüttel, Germany.
| | | | | | | | | | | |
Collapse
|
19
|
Minovitsky S, Stegmaier P, Kel A, Kondrashov AS, Dubchak I. Short sequence motifs, overrepresented in mammalian conserved non-coding sequences. BMC Genomics 2007; 8:378. [PMID: 17945028 PMCID: PMC2176071 DOI: 10.1186/1471-2164-8-378] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2007] [Accepted: 10/18/2007] [Indexed: 12/22/2022] Open
Abstract
Background A substantial fraction of non-coding DNA sequences of multicellular eukaryotes is under selective constraint. In particular, ~5% of the human genome consists of conserved non-coding sequences (CNSs). CNSs differ from other genomic sequences in their nucleotide composition and must play important functional roles, which mostly remain obscure. Results We investigated relative abundances of short sequence motifs in all human CNSs present in the human/mouse whole-genome alignments vs. three background sets of sequences: (i) weakly conserved or unconserved non-coding sequences (non-CNSs); (ii) near-promoter sequences (located between nucleotides -500 and -1500, relative to a start of transcription); and (iii) random sequences with the same nucleotide composition as that of CNSs. When compared to non-CNSs and near-promoter sequences, CNSs possess an excess of AT-rich motifs, often containing runs of identical nucleotides. In contrast, when compared to random sequences, CNSs contain an excess of GC-rich motifs which, however, lack CpG dinucleotides. Thus, abundance of short sequence motifs in human CNSs, taken as a whole, is mostly determined by their overall compositional properties and not by overrepresentation of any specific short motifs. These properties are: (i) high AT-content of CNSs, (ii) a tendency, probably due to context-dependent mutation, of A's and T's to clump, (iii) presence of short GC-rich regions, and (iv) avoidance of CpG contexts, due to their hypermutability. Only a small number of short motifs, overrepresented in all human CNSs are similar to binding sites of transcription factors from the FOX family. Conclusion Human CNSs as a whole appear to be too broad a class of sequences to possess strong footprints of any short sequence-specific functions. Such footprints should be studied at the level of functional subclasses of CNSs, such as those which flank genes with a particular pattern of expression. Overall properties of CNSs are affected by patterns in mutation, suggesting that selection which causes their conservation is not always very strong.
Collapse
Affiliation(s)
- Simon Minovitsky
- Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| | | | | | | | | |
Collapse
|
20
|
Waleev T, Shtokalo D, Konovalova T, Voss N, Cheremushkin E, Stegmaier P, Kel-Margoulis O, Wingender E, Kel A. Composite Module Analyst: identification of transcription factor binding site combinations using genetic algorithm. Nucleic Acids Res 2006; 34:W541-5. [PMID: 16845066 PMCID: PMC1538785 DOI: 10.1093/nar/gkl342] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Composite Module Analyst (CMA) is a novel software tool aiming to identify promoter-enhancer models based on the composition of transcription factor (TF) binding sites and their pairs. CMA is closely interconnected with the TRANSFAC database. In particular, CMA uses the positional weight matrix (PWM) library collected in TRANSFAC and therefore provides the possibility to search for a large variety of different TF binding sites. We model the structure of the long gene regulatory regions by a Boolean function that joins several local modules, each consisting of co-localized TF binding sites. Having as an input a set of co-regulated genes, CMA builds the promoter model and optimizes the parameters of the model automatically by applying a genetic-regression algorithm. We use a multicomponent fitness function of the algorithm which includes several statistical criteria in a weighted linear function. We show examples of successful application of CMA to a microarray data on transcription profiling of TNF-alpha stimulated primary human endothelial cells. The CMA web server is freely accessible at http://www.gene-regulation.com/pub/programs/cma/CMA.html. An advanced version of CMA is also a part of the commercial system ExPlaintrade mark (www.biobase.de) designed for causal analysis of gene expression data.
Collapse
Affiliation(s)
- T. Waleev
- A.P. Ershov's Institute of Informatics Systems6, Lavrentiev avenue, 630090 Novosibirsk, Russia
| | - D. Shtokalo
- A.P. Ershov's Institute of Informatics Systems6, Lavrentiev avenue, 630090 Novosibirsk, Russia
| | - T. Konovalova
- Institute of Cytology and GeneticsNovosibirsk, Russia
| | - N. Voss
- BIOBASE GmbHHalchtersche Strasse 33, D-38304 Wolfenbüttel, Germany
| | - E. Cheremushkin
- A.P. Ershov's Institute of Informatics Systems6, Lavrentiev avenue, 630090 Novosibirsk, Russia
| | - P. Stegmaier
- BIOBASE GmbHHalchtersche Strasse 33, D-38304 Wolfenbüttel, Germany
| | - O. Kel-Margoulis
- BIOBASE GmbHHalchtersche Strasse 33, D-38304 Wolfenbüttel, Germany
| | - E. Wingender
- BIOBASE GmbHHalchtersche Strasse 33, D-38304 Wolfenbüttel, Germany
- Department Bioinformatics, UKG/University GöttingenGoldschmidtstr. 1, D-37077 Göttingen, Germany
| | - A. Kel
- BIOBASE GmbHHalchtersche Strasse 33, D-38304 Wolfenbüttel, Germany
- To whom correspondence should be addressed. Tel: +49-5331-858441; Fax: +49-5331-858470;
| |
Collapse
|
21
|
Matys V, Kel-Margoulis OV, Fricke E, Liebich I, Land S, Barre-Dirrie A, Reuter I, Chekmenev D, Krull M, Hornischer K, Voss N, Stegmaier P, Lewicki-Potapov B, Saxel H, Kel AE, Wingender E. TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res 2006; 34:D108-10. [PMID: 16381825 PMCID: PMC1347505 DOI: 10.1093/nar/gkj143] [Citation(s) in RCA: 1660] [Impact Index Per Article: 92.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2005] [Revised: 10/27/2005] [Accepted: 10/27/2005] [Indexed: 02/06/2023] Open
Abstract
The TRANSFAC database on transcription factors, their binding sites, nucleotide distribution matrices and regulated genes as well as the complementing database TRANSCompel on composite elements have been further enhanced on various levels. A new web interface with different search options and integrated versions of Match and Patch provides increased functionality for TRANSFAC. The list of databases which are linked to the common GENE table of TRANSFAC and TRANSCompel has been extended by: Ensembl, UniGene, EntrezGene, HumanPSD and TRANSPRO. Standard gene names from HGNC, MGI and RGD, are included for human, mouse and rat genes, respectively. With the help of InterProScan, Pfam, SMART and PROSITE domains are assigned automatically to the protein sequences of the transcription factors. TRANSCompel contains now, in addition to the COMPEL table, a separate table for detailed information on the experimental EVIDENCE on which the composite elements are based. Finally, for TRANSFAC, in respect of data growth, in particular the gain of Drosophila transcription factor binding sites (by courtesy of the Drosophila DNase I footprint database) and of Arabidopsis factors (by courtesy of DATF, Database of Arabidopsis Transcription Factors) has to be stressed. The here described public releases, TRANSFAC 7.0 and TRANSCompel 7.0, are accessible under http://www.gene-regulation.com/pub/databases.html.
Collapse
Affiliation(s)
- V Matys
- BIOBASE GmbH, Halchtersche Strasse 33, D-38304 Wolfenbüttel, Germany.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Stegmaier P, Kel AE, Wingender E. Systematic DNA-binding domain classification of transcription factors. Genome Inform 2004; 15:276-86. [PMID: 15706513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]
Abstract
Based on the manual annotation of transcription factors stored in the TRANSFAC database, we developed a library of hidden Markov models (HMM) to represent their DNA-binding domains and used it for a comprehensive classification. The models constructed were applied on the UniProt/Swiss-Prot database, leading to a systematic classification of further DNA-binding protein entries. The HMM library obtained can be used to classify any newly discovered transcription factor according to its DNA-binding domain and, thus, to generate hypotheses about its DNA-binding specificity.
Collapse
Affiliation(s)
- Philip Stegmaier
- BIOBASE GmbH, Halchtersche Str. 33, D-38304 Wolfenbüttel, Germany.
| | | | | |
Collapse
|
23
|
Hille K, Draeger J, Eggers T, Stegmaier P. [Technical construction, calibration and results with a new intraocular pressure sensor with telemetric transmission]. Klin Monbl Augenheilkd 2001; 218:376-80. [PMID: 11417341 DOI: 10.1055/s-2001-15905] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
BACKGROUND Presently intraocular pressure is measured indirectly by applanation or impression of the cornea. Only isolated values are available with this method. We present a new implantable system for direct and continuos measurement of the intraocular pressure. METHODS An implantable system consisting of a miniaturized sensor and a telemetric unit was integrated in an intraocular lens. The eye pressure is determined by the sensor, modulated and transduced by the telemetric system. By an extracorporal receiver the signal is demodulated. The electric supply of the intraocular system is achieved by external electromagnetic induction. RESULTS The telemetric transmission of the intraocular pressure can be achieved with an accuracy of +/- 1 mm Hg and a frequency of registration of 3 values per second. CONCLUSION Clinical application necessitates further animal trials in vivo.
Collapse
Affiliation(s)
- K Hille
- Augenklinik der Universität des Saarlandes, 66421 Homburg/Saar.
| | | | | | | |
Collapse
|
24
|
Stegmaier P, Maurer U. [Successful irradiation of a pronounced thoracic metastasis from breast cancer]. Rontgenpraxis 1999; 52:78-80. [PMID: 10431573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
Affiliation(s)
- P Stegmaier
- Institut für Klinische Radiologie, Klinikum der Stadt Mannheim
| | | |
Collapse
|
25
|
Maurer U, Stegmaier P, Bolte R, Jungius K, Mueller D, Georgi M. 2114 Gemicitabine (G) in combination radiationtherapy (RT) in stage III–IV pancreatic cancer: First results of a current phase I study. Int J Radiat Oncol Biol Phys 1999. [DOI: 10.1016/s0360-3016(99)90384-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|