1
|
Tabas-Madrid D, Méndez-Vigo B, Arteaga N, Marcer A, Pascual-Montano A, Weigel D, Xavier Picó F, Alonso-Blanco C. Genome-wide signatures of flowering adaptation to climate temperature: Regional analyses in a highly diverse native range of Arabidopsis thaliana. Plant Cell Environ 2018; 41:1806-1820. [PMID: 29520809 DOI: 10.1111/pce.13189] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 03/02/2018] [Accepted: 03/02/2018] [Indexed: 05/25/2023]
Abstract
Current global change is fueling an interest to understand the genetic and molecular mechanisms of plant adaptation to climate. In particular, altered flowering time is a common strategy for escape from unfavourable climate temperature. In order to determine the genomic bases underlying flowering time adaptation to this climatic factor, we have systematically analysed a collection of 174 highly diverse Arabidopsis thaliana accessions from the Iberian Peninsula. Analyses of 1.88 million single nucleotide polymorphisms provide evidence for a spatially heterogeneous contribution of demographic and adaptive processes to geographic patterns of genetic variation. Mountains appear to be allele dispersal barriers, whereas the relationship between flowering time and temperature depended on the precise temperature range. Environmental genome-wide associations supported an overall genome adaptation to temperature, with 9.4% of the genes showing significant associations. Furthermore, phenotypic genome-wide associations provided a catalogue of candidate genes underlying flowering time variation. Finally, comparison of environmental and phenotypic genome-wide associations identified known (Twin Sister of FT, FRIGIDA-like 1, and Casein Kinase II Beta chain 1) and new (Epithiospecifer Modifier 1 and Voltage-Dependent Anion Channel 5) genes as candidates for adaptation to climate temperature by altered flowering time. Thus, this regional collection provides an excellent resource to address the spatial complexity of climate adaptation in annual plants.
Collapse
Affiliation(s)
- Daniel Tabas-Madrid
- Departamento de Genética Molecular de Plantas, Centro Nacional de Biotecnología (CNB), Consejo Superior de Investigaciones Científicas (CSIC), 28049, Madrid, Spain
| | - Belén Méndez-Vigo
- Departamento de Genética Molecular de Plantas, Centro Nacional de Biotecnología (CNB), Consejo Superior de Investigaciones Científicas (CSIC), 28049, Madrid, Spain
| | - Noelia Arteaga
- Departamento de Genética Molecular de Plantas, Centro Nacional de Biotecnología (CNB), Consejo Superior de Investigaciones Científicas (CSIC), 28049, Madrid, Spain
| | - Arnald Marcer
- CREAF, 08193, Cerdanyola del Vallès, Spain
- Universitat Autònoma de Barcelona, 08193, Cerdanyola del Vallès, Spain
| | - Alberto Pascual-Montano
- Departamento de Genética Molecular de Plantas, Centro Nacional de Biotecnología (CNB), Consejo Superior de Investigaciones Científicas (CSIC), 28049, Madrid, Spain
| | - Detlef Weigel
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, 72076, Tübingen, Germany
| | - F Xavier Picó
- Departamento de Ecología Integrativa, Estación Biológica de Doñana (EBD), Consejo Superior de Investigaciones Científicas (CSIC), 41092, Sevilla, Spain
| | - Carlos Alonso-Blanco
- Departamento de Genética Molecular de Plantas, Centro Nacional de Biotecnología (CNB), Consejo Superior de Investigaciones Científicas (CSIC), 28049, Madrid, Spain
| |
Collapse
|
2
|
Gutiérrez-Vázquez C, Rodríguez-Galán A, Fernández-Alfara M, Mittelbrunn M, Sánchez-Cabo F, Martínez-Herrera DJ, Ramírez-Huesca M, Pascual-Montano A, Sánchez-Madrid F. miRNA profiling during antigen-dependent T cell activation: A role for miR-132-3p. Sci Rep 2017; 7:3508. [PMID: 28615644 PMCID: PMC5471249 DOI: 10.1038/s41598-017-03689-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 05/04/2017] [Indexed: 02/07/2023] Open
Abstract
microRNAs (miRNAs) are tightly regulated during T lymphocyte activation to enable the establishment of precise immune responses. Here, we analyzed the changes of the miRNA profiles of T cells in response to activation by cognate interaction with dendritic cells. We also studied mRNA targets common to miRNAs regulated in T cell activation. pik3r1 gene, which encodes the regulatory subunits of PI3K p50, p55 and p85, was identified as target of miRNAs upregulated after T cell activation. Using 3′UTR luciferase reporter-based and biochemical assays, we showed the inhibitory relationship between miR-132-3p upregulation and expression of the pik3r1 gene. Our results indicate that specific miRNAs whose expression is modulated during T cell activation might regulate PI3K signaling in T cells.
Collapse
Affiliation(s)
- Cristina Gutiérrez-Vázquez
- Instituto de Investigación Sanitaria Princesa, Hospital Universitario de la Princesa, Universidad Autónoma de Madrid, Madrid, Spain.,Fundación Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | - Ana Rodríguez-Galán
- Instituto de Investigación Sanitaria Princesa, Hospital Universitario de la Princesa, Universidad Autónoma de Madrid, Madrid, Spain.,Fundación Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | - Marcos Fernández-Alfara
- Fundación Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | - María Mittelbrunn
- Fundación Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | - Fátima Sánchez-Cabo
- Fundación Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | | | - Marta Ramírez-Huesca
- Fundación Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | | | - Francisco Sánchez-Madrid
- Instituto de Investigación Sanitaria Princesa, Hospital Universitario de la Princesa, Universidad Autónoma de Madrid, Madrid, Spain. .,Fundación Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain. .,CIBER Cardiovascular, Madrid, Spain.
| |
Collapse
|
3
|
Martínez M, Sorzano COS, Pascual-Montano A, Carazo JM. Gene signature associated with benign neurofibroma transformation to malignant peripheral nerve sheath tumors. PLoS One 2017; 12:e0178316. [PMID: 28542306 PMCID: PMC5443557 DOI: 10.1371/journal.pone.0178316] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Accepted: 05/11/2017] [Indexed: 11/19/2022] Open
Abstract
Benign neurofibromas, the main phenotypic manifestations of the rare neurological disorder neurofibromatosis type 1, degenerate to malignant tumors associated to poor prognosis in about 10% of patients. Despite efforts in the field of (epi)genomics, the lack of prognostic biomarkers with which to predict disease evolution frustrates the adoption of appropriate early therapeutic measures. To identify potential biomarkers of malignant neurofibroma transformation, we integrated four human experimental studies and one for mouse, using a gene score-based meta-analysis method, from which we obtained a score-ranked signature of 579 genes. Genes with the highest absolute scores were classified as promising disease biomarkers. By grouping genes with similar neurofibromatosis-related profiles, we derived panels of potential biomarkers. The addition of promoter methylation data to gene profiles indicated a panel of genes probably silenced by hypermethylation. To identify possible therapeutic treatments, we used the gene signature to query drug expression databases. Trichostatin A and other histone deacetylase inhibitors, as well as cantharidin and tamoxifen, were retrieved as putative therapeutic means to reverse the aberrant regulation that drives to malignant cell proliferation and metastasis. This in silico prediction corroborated reported experimental results that suggested the inclusion of these compounds in clinical trials. This experimental validation supported the suitability of the meta-analysis method used to integrate several sources of public genomic information, and the reliability of the gene signature associated to the malignant evolution of neurofibromas to generate working hypotheses for prognostic and drug-responsive biomarkers or therapeutic measures, thus showing the potential of this in silico approach for biomarker discovery.
Collapse
Affiliation(s)
- Marta Martínez
- Biocomputing Unit, Nacional Center for Biotechnology (CSIC), Campus Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain
- * E-mail:
| | - Carlos O. S. Sorzano
- Biocomputing Unit, Nacional Center for Biotechnology (CSIC), Campus Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain
- Bioengineering Lab., Universidad CEU San Pablo, Campus Urb. Montepríncipe, Boadilla del Monte, Madrid, Spain
| | - Alberto Pascual-Montano
- Biocomputing Unit, Nacional Center for Biotechnology (CSIC), Campus Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain
| | - Jose M. Carazo
- Biocomputing Unit, Nacional Center for Biotechnology (CSIC), Campus Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain
| |
Collapse
|
4
|
Manesia JK, Franch M, Tabas-Madrid D, Nogales-Cadenas R, Vanwelden T, Van Den Bosch E, Xu Z, Pascual-Montano A, Khurana S, Verfaillie CM. Distinct Molecular Signature of Murine Fetal Liver and Adult Hematopoietic Stem Cells Identify Novel Regulators of Hematopoietic Stem Cell Function. Stem Cells Dev 2017; 26:573-584. [PMID: 27958775 DOI: 10.1089/scd.2016.0294] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
During ontogeny, fetal liver (FL) acts as a major site for hematopoietic stem cell (HSC) maturation and expansion, whereas HSCs in the adult bone marrow (ABM) are largely quiescent. HSCs in the FL possess faster repopulation capacity as compared with ABM HSCs. However, the molecular mechanism regulating the greater self-renewal potential of FL HSCs has not yet extensively been assessed. Recently, we published RNA sequencing-based gene expression analysis on FL HSCs from 14.5-day mouse embryo (E14.5) in comparison to the ABM HSCs. We reanalyzed these data to identify key transcriptional regulators that play important roles in the expansion of HSCs during development. The comparison of FL E14.5 with ABM HSCs identified more than 1,400 differentially expressed genes. More than 200 genes were shortlisted based on the gene ontology (GO) annotation term "transcription." By morpholino-based knockdown studies in zebrafish, we assessed the function of 18 of these regulators, previously not associated with HSC proliferation. Our studies identified a previously unknown role for tdg, uhrf1, uchl5, and ncoa1 in the emergence of definitive hematopoiesis in zebrafish. In conclusion, we demonstrate that identification of genes involved in transcriptional regulation differentially expressed between expanding FL HSCs and quiescent ABM HSCs, uncovers novel regulators of HSC function.
Collapse
Affiliation(s)
- Javed K Manesia
- 1 Inter-Departmental Stem Cell Institute, KU Leuven , Leuven, Belgium .,2 Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven , Leuven, Belgium
| | - Monica Franch
- 3 Functional Bioinformatics Group, National Center for Biotechnology-CSIC , Madrid, Spain
| | - Daniel Tabas-Madrid
- 3 Functional Bioinformatics Group, National Center for Biotechnology-CSIC , Madrid, Spain
| | - Ruben Nogales-Cadenas
- 3 Functional Bioinformatics Group, National Center for Biotechnology-CSIC , Madrid, Spain
| | - Thomas Vanwelden
- 1 Inter-Departmental Stem Cell Institute, KU Leuven , Leuven, Belgium .,2 Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven , Leuven, Belgium
| | - Elisa Van Den Bosch
- 1 Inter-Departmental Stem Cell Institute, KU Leuven , Leuven, Belgium .,2 Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven , Leuven, Belgium
| | - Zhuofei Xu
- 1 Inter-Departmental Stem Cell Institute, KU Leuven , Leuven, Belgium .,2 Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven , Leuven, Belgium
| | | | - Satish Khurana
- 1 Inter-Departmental Stem Cell Institute, KU Leuven , Leuven, Belgium .,2 Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven , Leuven, Belgium .,4 Indian Institute of Science Education and Research , Thiruvananthapuram, India
| | - Catherine M Verfaillie
- 1 Inter-Departmental Stem Cell Institute, KU Leuven , Leuven, Belgium .,2 Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven , Leuven, Belgium
| |
Collapse
|
5
|
Ogando J, Tardáguila M, Díaz-Alderete A, Usategui A, Miranda-Ramos V, Martínez-Herrera DJ, de la Fuente L, García-León MJ, Moreno MC, Escudero S, Cañete JD, Toribio ML, Cases I, Pascual-Montano A, Pablos JL, Mañes S. Notch-regulated miR-223 targets the aryl hydrocarbon receptor pathway and increases cytokine production in macrophages from rheumatoid arthritis patients. Sci Rep 2016; 6:20223. [PMID: 26838552 PMCID: PMC4738320 DOI: 10.1038/srep20223] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2015] [Accepted: 12/23/2015] [Indexed: 12/28/2022] Open
Abstract
Evidence links aryl hydrocarbon receptor (AHR) activation to rheumatoid arthritis (RA) pathogenesis, although results are inconsistent. AHR agonists inhibit pro-inflammatory cytokine expression in macrophages, pivotal cells in RA aetiopathogenesis, which hints at specific circuits that regulate the AHR pathway in RA macrophages. We compared microRNA (miR) expression in CD14+ cells from patients with active RA or with osteoarthritis (OA). Seven miR were downregulated and one (miR-223) upregulated in RA compared to OA cells. miR-223 upregulation correlated with reduced Notch3 and Notch effector expression in RA patients. Overexpression of the Notch-induced repressor HEY-1 and co-culture of healthy donor monocytes with Notch ligand-expressing cells showed direct Notch-mediated downregulation of miR-223. Bioinformatics predicted the AHR regulator ARNT (AHR nuclear translocator) as a miR-223 target. Pre-miR-223 overexpression silenced ARNT 3’UTR-driven reporter expression, reduced ARNT (but not AHR) protein levels and prevented AHR/ARNT-mediated inhibition of pro-inflammatory cytokine expression. miR-223 counteracted AHR/ARNT-induced Notch3 upregulation in monocytes. Levels of ARNT and of CYP1B1, an AHR/ARNT signalling effector, were reduced in RA compared to OA synovial tissue, which correlated with miR-223 levels. Our results associate Notch signalling to miR-223 downregulation in RA macrophages, and identify miR-223 as a negative regulator of the AHR/ARNT pathway through ARNT targeting.
Collapse
Affiliation(s)
- Jesús Ogando
- Department of Immunology and Oncology, Centro Nacional de Biotecnología/CSIC, Madrid
| | - Manuel Tardáguila
- Department of Immunology and Oncology, Centro Nacional de Biotecnología/CSIC, Madrid
| | - Andrea Díaz-Alderete
- Department of Immunology and Oncology, Centro Nacional de Biotecnología/CSIC, Madrid
| | - Alicia Usategui
- Servicio de Reumatología, Instituto de Investigación Hospital 12 de Octubre, Madrid
| | | | | | | | | | - María C Moreno
- Flow Cytometry Unit, Centro Nacional de Biotecnología/CSIC, Madrid
| | - Sara Escudero
- Flow Cytometry Unit, Centro Nacional de Biotecnología/CSIC, Madrid
| | - Juan D Cañete
- Unitat d'Artritis, Hospital Clínic de Barcelona and Institut d'Investigacions Biomèdiques August Pí i Sunyer (IDIBAPS), Barcelona
| | | | - Ildefonso Cases
- Institut de Medicina Predictiva i Personalitzada del Càncer, Badalona, Barcelona, Spain
| | | | - José Luis Pablos
- Servicio de Reumatología, Instituto de Investigación Hospital 12 de Octubre, Madrid
| | - Santos Mañes
- Department of Immunology and Oncology, Centro Nacional de Biotecnología/CSIC, Madrid
| |
Collapse
|
6
|
Manesia JK, Xu Z, Broekaert D, Boon R, van Vliet A, Eelen G, Vanwelden T, Stegen S, Van Gastel N, Pascual-Montano A, Fendt SM, Carmeliet G, Carmeliet P, Khurana S, Verfaillie CM. Highly proliferative primitive fetal liver hematopoietic stem cells are fueled by oxidative metabolic pathways. Stem Cell Res 2015; 15:715-721. [PMID: 26599326 DOI: 10.1016/j.scr.2015.11.001] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Revised: 10/16/2015] [Accepted: 11/02/2015] [Indexed: 12/21/2022] Open
Abstract
Hematopoietic stem cells (HSCs) in the fetal liver (FL) unlike adult bone marrow (BM) proliferate extensively, posing different metabolic demands. However, metabolic pathways responsible for the production of energy and cellular building blocks in FL HSCs have not been described. Here, we report that FL HSCs use oxygen dependent energy generating pathways significantly more than their BM counterparts. RNA-Seq analysis of E14.5 FL versus BM derived HSCs identified increased expression levels of genes involved in oxidative phosphorylation (OxPhos) and the citric acid cycle (TCA). We demonstrated that FL HSCs contain more mitochondria than BM HSCs, which resulted in increased levels of oxygen consumption and reactive oxygen species (ROS) production. Higher levels of DNA repair and antioxidant pathway gene expression may prevent ROS-mediated (geno)toxicity in FL HSCs. Thus, we here for the first time highlight the underestimated importance of oxygen dependent pathways for generating energy and building blocks in FL HSCs.
Collapse
Affiliation(s)
- Javed K Manesia
- Inter-departmental Stem Cell Institute, KU Leuven, Leuven, Belgium; Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven, Leuven, Belgium
| | - Zhuofei Xu
- Inter-departmental Stem Cell Institute, KU Leuven, Leuven, Belgium; Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven, Leuven, Belgium
| | - Dorien Broekaert
- Inter-departmental Stem Cell Institute, KU Leuven, Leuven, Belgium; Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven, Leuven, Belgium
| | - Ruben Boon
- Inter-departmental Stem Cell Institute, KU Leuven, Leuven, Belgium; Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven, Leuven, Belgium
| | - Alex van Vliet
- Laboratory of Cell Death Research and Therapy, KU Leuven, Leuven, Belgium
| | - Guy Eelen
- Laboratory of Angiogenesis and Neurovascular Link, KU Leuven, Leuven, Belgium; Laboratory of Angiogenesis and Neurovascular Link, Leuven, Belgium
| | - Thomas Vanwelden
- Inter-departmental Stem Cell Institute, KU Leuven, Leuven, Belgium; Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven, Leuven, Belgium
| | - Steve Stegen
- Clinical and Experimental Endocrinology, KU Leuven, Leuven, Belgium
| | - Nick Van Gastel
- Clinical and Experimental Endocrinology, KU Leuven, Leuven, Belgium
| | | | - Sarah-Maria Fendt
- Laboratory of Cellular Metabolism and Metabolic Regulation, KU Leuven, Leuven, Belgium
| | - Geert Carmeliet
- Clinical and Experimental Endocrinology, KU Leuven, Leuven, Belgium
| | - Peter Carmeliet
- Laboratory of Angiogenesis and Neurovascular Link, KU Leuven, Leuven, Belgium; Laboratory of Angiogenesis and Neurovascular Link, Leuven, Belgium
| | - Satish Khurana
- Inter-departmental Stem Cell Institute, KU Leuven, Leuven, Belgium; Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven, Leuven, Belgium.
| | - Catherine M Verfaillie
- Inter-departmental Stem Cell Institute, KU Leuven, Leuven, Belgium; Department of Development and Regeneration, Stem Cell Biology and Embryology, KU Leuven, Leuven, Belgium.
| |
Collapse
|
7
|
Horvatovich P, Lundberg EK, Chen YJ, Sung TY, He F, Nice EC, Goode RJ, Yu S, Ranganathan S, Baker MS, Domont GB, Velasquez E, Li D, Liu S, Wang Q, He QY, Menon R, Guan Y, Corrales FJ, Segura V, Casal JI, Pascual-Montano A, Albar JP, Fuentes M, Gonzalez-Gonzalez M, Diez P, Ibarrola N, Degano RM, Mohammed Y, Borchers CH, Urbani A, Soggiu A, Yamamoto T, Salekdeh GH, Archakov A, Ponomarenko E, Lisitsa A, Lichti CF, Mostovenko E, Kroes RA, Rezeli M, Végvári Á, Fehniger TE, Bischoff R, Vizcaíno JA, Deutsch EW, Lane L, Nilsson CL, Marko-Varga G, Omenn GS, Jeong SK, Lim JS, Paik YK, Hancock WS. Quest for Missing Proteins: Update 2015 on Chromosome-Centric Human Proteome Project. J Proteome Res 2015; 14:3415-31. [DOI: 10.1021/pr5013009] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Péter Horvatovich
- Analytical
Biochemistry, Department of Pharmacy, University of Groningen, A. Deusinglaan
1, 9713 AV Groningen, The Netherlands
| | - Emma K. Lundberg
- Science
for Life Laboratory, KTH - Royal Institute of Technology, SE-171 21 Stockholm, Sweden
| | - Yu-Ju Chen
- Institute
of Chemistry, Academia Sinica, 128 Academia Road Sec. 2, Taipei 115, Taiwan
| | - Ting-Yi Sung
- Institute
of Information Science, Academia Sinica, 128 Academia Road Sec. 2, Taipei 115, Taiwan
| | - Fuchu He
- The State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - Edouard C. Nice
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Robert J. Goode
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Simon Yu
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Shoba Ranganathan
- Department
of Chemistry and Biomolecular Sciences and ARC Centre of Excellence
in Bioinformatics, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Mark S. Baker
- Australian
School of Advanced Medicine, Macquarie University, Sydney, NSW 2109, Australia
| | - Gilberto B. Domont
- Proteomics Unit, Institute of Chemistry, Federal University of Rio de Janeiro, Cidade Universitária, Av Athos da Silveira Ramos 149, CT-A542, 21941-909 Rio de Janeriro, Rj, Brazil
| | - Erika Velasquez
- Proteomics Unit, Institute of Chemistry, Federal University of Rio de Janeiro, Cidade Universitária, Av Athos da Silveira Ramos 149, CT-A542, 21941-909 Rio de Janeriro, Rj, Brazil
| | - Dong Li
- The State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - Siqi Liu
- Beijing Institute of Genomics and BGI Shenzhen, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- BGI Shenzhen, Beishan Road, Yantian District, Shenzhen, 518083, China
| | - Quanhui Wang
- Beijing Institute of Genomics and BGI Shenzhen, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
| | - Qing-Yu He
- Key Laboratory of Functional Protein
Research of Guangdong
Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Rajasree Menon
- Department of Computational Medicine & Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Yuanfang Guan
- Departments of Computational Medicine & Bioinformatics and Computer Sciences, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Fernando J. Corrales
- ProteoRed-ISCIII,
Biomolecular and Bioinformatics Resources Platform (PRB2), Spanish
Consortium of C-HPP (Chr-16), CIMA, University of Navarra, 31008 Pamplona, Spain
- Chr16 SpHPP Consortium, CIMA, University of Navarra, 31008 Pamplona, Spain
| | - Victor Segura
- ProteoRed-ISCIII,
Biomolecular and Bioinformatics Resources Platform (PRB2), Spanish
Consortium of C-HPP (Chr-16), CIMA, University of Navarra, 31008 Pamplona, Spain
- Chr16 SpHPP Consortium, CIMA, University of Navarra, 31008 Pamplona, Spain
| | - J. Ignacio Casal
- Department
of Cellular and Molecular Medicine, Centro de Investigaciones Biológicas (CIB-CSIC), 28040 Madrid, Spain
| | | | - Juan P. Albar
- Centro Nacional de Biotecnologia (CNB-CSIC), Cantoblanco, 28049 Madrid, Spain
| | - Manuel Fuentes
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Maria Gonzalez-Gonzalez
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Paula Diez
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Nieves Ibarrola
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Rosa M. Degano
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Yassene Mohammed
- University of Victoria-Genome British Columbia Proteomics
Centre, Vancouver Island
Technology Park, #3101−4464 Markham Street, Victoria, British Columbia V8Z 7X8, Canada
- Center
for Proteomics and Metabolomics, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands
| | - Christoph H. Borchers
- University of Victoria-Genome British Columbia Proteomics
Centre, Vancouver Island
Technology Park, #3101−4464 Markham Street, Victoria, British Columbia V8Z 7X8, Canada
| | - Andrea Urbani
- Proteomics
and Metabonomic, Laboratory, Fondazione Santa Lucia, Rome, Italy
- Department
of Experimental Medicine and Surgery, University of Rome “Tor Vergata”, Rome, Italy
| | - Alessio Soggiu
- Department
of Veterinary Science and Public Health (DIVET), University of Milano, via Celoria 10, 20133 Milano, Italy
| | - Tadashi Yamamoto
- Institute
of Nephrology, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan
| | - Ghasem Hosseini Salekdeh
- Department of Molecular Systems Biology at Cell Science Research Center, Royan Institute for Stem Cell Biology and Technology, ACECR, Tehran, Iran
- Department of Systems Biology, Agricultural Biotechnology Research Institute of Iran, Karaj, Iran
| | | | | | - Andrey Lisitsa
- Orechovich Institute of Biomedical Chemistry, Moscow, Russia
| | - Cheryl F. Lichti
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - Ekaterina Mostovenko
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - Roger A. Kroes
- Falk Center for Molecular Therapeutics, Department of Biomedical Engineering, Northwestern University, 1801 Maple Ave., Suite 4300, Evanston, Illinois 60201, United States
| | - Melinda Rezeli
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Ákos Végvári
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Thomas E. Fehniger
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Rainer Bischoff
- Analytical
Biochemistry, Department of Pharmacy, University of Groningen, A. Deusinglaan
1, 9713 AV Groningen, The Netherlands
| | - Juan Antonio Vizcaíno
- European Molecular
Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom
| | - Eric W. Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109, United States
| | - Lydie Lane
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- Department
of Human Protein Science, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Carol L. Nilsson
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - György Marko-Varga
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Gilbert S. Omenn
- Departments of Computational Medicine & Bioinformatics, Internal Medicine, Human Genetics and School of Public Health, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Seul-Ki Jeong
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - Jong-Sun Lim
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - Young-Ki Paik
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - William S. Hancock
- The
Barnett Institute of Chemical and Biological Analysis, Northeastern University, 140 The Fenway, Boston, Massachusetts 02115, United States
| |
Collapse
|
8
|
Tabas-Madrid D, Alves-Cruzeiro J, Segura V, Guruceaga E, Vialas V, Prieto G, García C, Corrales FJ, Albar JP, Pascual-Montano A. Proteogenomics Dashboard for the Human Proteome Project. J Proteome Res 2015; 14:3738-49. [DOI: 10.1021/acs.jproteome.5b00466] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Daniel Tabas-Madrid
- ProteoRed-ISCIII, National Center for Biotechnology-CSIC (CNB), C/Darwin 3, Madrid 28049, Spain
| | - Joao Alves-Cruzeiro
- ProteoRed-ISCIII, National Center for Biotechnology-CSIC (CNB), C/Darwin 3, Madrid 28049, Spain
| | - Victor Segura
- ProteoRed-ISCIII,
Center for Applied Medical Research (CIMA), University of Navarra, Avda. Pío XII, 55, Pamplona E-31008, Spain
| | - Elizabeth Guruceaga
- ProteoRed-ISCIII,
Center for Applied Medical Research (CIMA), University of Navarra, Avda. Pío XII, 55, Pamplona E-31008, Spain
| | - Vital Vialas
- ProteoRed-ISCIII, National Center for Biotechnology-CSIC (CNB), C/Darwin 3, Madrid 28049, Spain
| | - Gorka Prieto
- Department
of Communication Engineering E.T.S. Ingenierı́a de Bilbao, University of the Basque Country (UPV/EHU), Alda. Urquijo, s/n, Bilbao 48013, Spain
| | - Carlos García
- Computer
Science Faculty, Complutense University of Madrid (UCM), C/ Jose
Garcı́á Santesmases 9, Madrid 28040, Spain
| | - Fernando J. Corrales
- ProteoRed-ISCIII,
Center for Applied Medical Research (CIMA), University of Navarra, Avda. Pío XII, 55, Pamplona E-31008, Spain
| | - Juan Pablo Albar
- ProteoRed-ISCIII, National Center for Biotechnology-CSIC (CNB), C/Darwin 3, Madrid 28049, Spain
| | - Alberto Pascual-Montano
- ProteoRed-ISCIII, National Center for Biotechnology-CSIC (CNB), C/Darwin 3, Madrid 28049, Spain
| |
Collapse
|
9
|
Setoain J, Franch M, Martínez M, Tabas-Madrid D, Sorzano COS, Bakker A, Gonzalez-Couto E, Elvira J, Pascual-Montano A. NFFinder: an online bioinformatics tool for searching similar transcriptomics experiments in the context of drug repositioning. Nucleic Acids Res 2015; 43:W193-9. [PMID: 25940629 PMCID: PMC4489258 DOI: 10.1093/nar/gkv445] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2015] [Accepted: 04/24/2015] [Indexed: 12/18/2022] Open
Abstract
Drug repositioning, using known drugs for treating conditions different from those the drug was originally designed to treat, is an important drug discovery tool that allows for a faster and cheaper development process by using drugs that are already approved or in an advanced trial stage for another purpose. This is especially relevant for orphan diseases because they affect too few people to make drug research de novo economically viable. In this paper we present NFFinder, a bioinformatics tool for identifying potential useful drugs in the context of orphan diseases. NFFinder uses transcriptomic data to find relationships between drugs, diseases and a phenotype of interest, as well as identifying experts having published on that domain. The application shows in a dashboard a series of graphics and tables designed to help researchers formulate repositioning hypotheses and identify potential biological relationships between drugs and diseases. NFFinder is freely available at http://nffinder.cnb.csic.es.
Collapse
Affiliation(s)
| | - Mònica Franch
- National Center for Biotechnology-CSIC, Madrid, Spain
| | | | | | | | | | | | | | - Alberto Pascual-Montano
- National Center for Biotechnology-CSIC, Madrid, Spain Perkin Elmer España, S.L., Madrid, Spain
| |
Collapse
|
10
|
Barthlott T, Bosch AJT, Berkemeier C, Nogales-Cadenas R, Jeker LT, Keller MP, Pascual-Montano A, Holländer GA. A subpopulation of CD103(pos) ICOS(pos) Treg cells occurs at high frequency in lymphopenic mice and represents a lymph node specific differentiation stage. Eur J Immunol 2015; 45:1760-71. [PMID: 25752506 DOI: 10.1002/eji.201445235] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2014] [Revised: 01/16/2015] [Accepted: 03/04/2015] [Indexed: 01/09/2023]
Abstract
Regulatory T (Treg) cells are pivotal for the maintenance of peripheral tolerance by controlling self-reactive, chronic, and homeostatic T-cell responses. Here, we report that the increase in Treg-cell suppressive function observed in lymphopenic mice correlates with the degree of lymphopenia and is caused by a higher frequency of a novel subpopulation of CD103(pos) ICOS(pos) Treg cells. Though present in the thymus, CD103(pos) ICOS(pos) Treg cells are not generated there but recirculate from the periphery to that site. The acquisition and maintenance of this distinctive phenotype requires the LN microenvironment and the in situ availability of antigen. Contrary to conventional effector and other Treg cells, the cellularity of CD103(pos) ICOS(pos) Treg cells is not affected by the absence of IL-7 and thymic stroma lymphopoetin. Given their increased frequency in lymphopenia, the absolute number of CD103(pos) ICOS(pos) Treg cells remains unchanged in the periphery irrespective of a paucity of total Treg cells. We furthermore demonstrate, with cell transfers in mice, that the CD103(pos) ICOS(pos) phenotype represents a LN-specific differentiation stage arrived at by several other Treg-cell subsets. Thus, tissue-specific cues determine the overall potency of the peripheral Treg-cell pool by shaping its subset composition.
Collapse
Affiliation(s)
- Thomas Barthlott
- Pediatric Immunology, Department of Biomedicine, University Children's Hospital of Basel, Basel, Switzerland
| | - Angela J T Bosch
- Pediatric Immunology, Department of Biomedicine, University Children's Hospital of Basel, Basel, Switzerland
| | - Caroline Berkemeier
- Pediatric Immunology, Department of Biomedicine, University Children's Hospital of Basel, Basel, Switzerland
| | - Rubén Nogales-Cadenas
- Functional Bioinformatics Group, National Center for Biotechnology-CSIC, Madrid, Spain
| | - Lukas T Jeker
- Pediatric Immunology, Department of Biomedicine, University Children's Hospital of Basel, Basel, Switzerland
| | - Marcel P Keller
- Pediatric Immunology, Department of Biomedicine, University Children's Hospital of Basel, Basel, Switzerland
| | | | - Georg A Holländer
- Pediatric Immunology, Department of Biomedicine, University Children's Hospital of Basel, Basel, Switzerland.,Department of Paediatrics and the Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| |
Collapse
|
11
|
Mejía-Roa E, Tabas-Madrid D, Setoain J, García C, Tirado F, Pascual-Montano A. NMF-mGPU: non-negative matrix factorization on multi-GPU systems. BMC Bioinformatics 2015; 16:43. [PMID: 25887585 PMCID: PMC4339678 DOI: 10.1186/s12859-015-0485-4] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 01/30/2015] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND In the last few years, the Non-negative Matrix Factorization ( NMF ) technique has gained a great interest among the Bioinformatics community, since it is able to extract interpretable parts from high-dimensional datasets. However, the computing time required to process large data matrices may become impractical, even for a parallel application running on a multiprocessors cluster. In this paper, we present NMF-mGPU, an efficient and easy-to-use implementation of the NMF algorithm that takes advantage of the high computing performance delivered by Graphics-Processing Units ( GPUs ). Driven by the ever-growing demands from the video-games industry, graphics cards usually provided in PCs and laptops have evolved from simple graphics-drawing platforms into high-performance programmable systems that can be used as coprocessors for linear-algebra operations. However, these devices may have a limited amount of on-board memory, which is not considered by other NMF implementations on GPU. RESULTS NMF-mGPU is based on CUDA ( Compute Unified Device Architecture ), the NVIDIA's framework for GPU computing. On devices with low memory available, large input matrices are blockwise transferred from the system's main memory to the GPU's memory, and processed accordingly. In addition, NMF-mGPU has been explicitly optimized for the different CUDA architectures. Finally, platforms with multiple GPUs can be synchronized through MPI ( Message Passing Interface ). In a four-GPU system, this implementation is about 120 times faster than a single conventional processor, and more than four times faster than a single GPU device (i.e., a super-linear speedup). CONCLUSIONS Applications of GPUs in Bioinformatics are getting more and more attention due to their outstanding performance when compared to traditional processors. In addition, their relatively low price represents a highly cost-effective alternative to conventional clusters. In life sciences, this results in an excellent opportunity to facilitate the daily work of bioinformaticians that are trying to extract biological meaning out of hundreds of gigabytes of experimental information. NMF-mGPU can be used "out of the box" by researchers with little or no expertise in GPU programming in a variety of platforms, such as PCs, laptops, or high-end GPU clusters. NMF-mGPU is freely available at https://github.com/bioinfo-cnb/bionmf-gpu .
Collapse
Affiliation(s)
- Edgardo Mejía-Roa
- ArTeCS Group, Department of Computer Architecture, Complutense University of Madrid (UCM), Madrid, 28040, Spain.
| | - Daniel Tabas-Madrid
- Functional Bioinformatics Group, Biocomputing Unit, National Center for Biotechnology-CSIC, UAM, Madrid, 28049, Spain.
| | - Javier Setoain
- Functional Bioinformatics Group, Biocomputing Unit, National Center for Biotechnology-CSIC, UAM, Madrid, 28049, Spain.
| | - Carlos García
- ArTeCS Group, Department of Computer Architecture, Complutense University of Madrid (UCM), Madrid, 28040, Spain.
| | - Francisco Tirado
- ArTeCS Group, Department of Computer Architecture, Complutense University of Madrid (UCM), Madrid, 28040, Spain.
| | - Alberto Pascual-Montano
- Functional Bioinformatics Group, Biocomputing Unit, National Center for Biotechnology-CSIC, UAM, Madrid, 28049, Spain.
| |
Collapse
|
12
|
Tabas-Madrid D, Muniategui A, Sánchez-Caballero I, Martínez-Herrera DJ, Sorzano COS, Rubio A, Pascual-Montano A. Improving miRNA-mRNA interaction predictions. BMC Genomics 2014; 15 Suppl 10:S2. [PMID: 25559987 PMCID: PMC4304206 DOI: 10.1186/1471-2164-15-s10-s2] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Background MicroRNAs are short RNA molecules that post-transcriptionally regulate gene expression. Today, microRNA target prediction remains challenging since very few have been experimentally validated and sequence-based predictions have large numbers of false positives. Furthermore, due to the different measuring rules used in each database of predicted interactions, the selection of the most reliable ones requires extensive knowledge about each algorithm. Results Here we propose two methods to measure the confidence of predicted interactions based on experimentally validated information. The output of the methods is a combined database where new scores and statistical confidences are re-assigned to each predicted interaction. The new scores allow the robust combination of several databases without the effect of low-performing algorithms dragging down good-performing ones. The combined databases obtained using both algorithms described in this paper outperform each of the existing predictive algorithms that were considered for the combination. Conclusions Our approaches are a useful way to integrate predicted interactions from different databases. They reduce the selection of interactions to a unique database based on an intuitive score and allow comparing databases between them.
Collapse
|
13
|
Perez-Villamil B, Paz-Cabezas M, Pascual-Montano A, Fuentes M, Sastre J, Díaz-Rubio E. Association Between Colon Cancer Molecular Subtypes Obtained By Expression Profiling and By Microrna (Mir) Profiling. Ann Oncol 2014. [DOI: 10.1093/annonc/mdu333.57] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
14
|
Villarroya-Beltri C, Gutiérrez-Vázquez C, Sánchez-Cabo F, Pérez-Hernández D, Vázquez J, Martin-Cofreces N, Martinez-Herrera DJ, Pascual-Montano A, Mittelbrunn M, Sánchez-Madrid F. Sumoylated hnRNPA2B1 controls the sorting of miRNAs into exosomes through binding to specific motifs. Nat Commun 2014; 4:2980. [PMID: 24356509 PMCID: PMC3905700 DOI: 10.1038/ncomms3980] [Citation(s) in RCA: 1342] [Impact Index Per Article: 134.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 11/21/2013] [Indexed: 12/14/2022] Open
Abstract
Exosomes are released by most cells to the extracellular environment and are involved in cell-to-cell communication. Exosomes contain specific repertoires of mRNAs, microRNAs (miRNAs) and other non-coding RNAs that can be functionally transferred to recipient cells. However, the mechanisms that control the specific loading of RNA species into exosomes remain unknown. Here we describe sequence motifs present in miRNAs that control their localization into exosomes. The protein heterogeneous nuclear ribonucleoprotein A2B1 (hnRNPA2B1) specifically binds exosomal miRNAs through the recognition of these motifs and controls their loading into exosomes. Moreover, hnRNPA2B1 in exosomes is sumoylated, and sumoylation controls the binding of hnRNPA2B1 to miRNAs. The loading of miRNAs into exosomes can be modulated by mutagenesis of the identified motifs or changes in hnRNPA2B1 expression levels. These findings identify hnRNPA2B1 as a key player in miRNA sorting into exosomes and provide potential tools for the packaging of selected regulatory RNAs into exosomes and their use in biomedical applications. Cells secrete micro-RNAs by packaging them into exosomes; however, the mechanisms by which this packaging occurs are unclear. Here, the authors identify a sequence motif that confers exosomal targeting to micro-RNAs and identify a ribonucleoprotein complex that plays a role in this process.
Collapse
Affiliation(s)
- Carolina Villarroya-Beltri
- 1] Department of Vascular Biology and Inflammation, Centro Nacional de Investigaciones Cardiovasculares, Madrid 28029, Spain [2] Servicio de Inmunología, Hospital de la Princesa, Madrid 28006, Spain
| | - Cristina Gutiérrez-Vázquez
- Department of Vascular Biology and Inflammation, Centro Nacional de Investigaciones Cardiovasculares, Madrid 28029, Spain
| | - Fátima Sánchez-Cabo
- Department of Vascular Biology and Inflammation, Centro Nacional de Investigaciones Cardiovasculares, Madrid 28029, Spain
| | - Daniel Pérez-Hernández
- Department of Vascular Biology and Inflammation, Centro Nacional de Investigaciones Cardiovasculares, Madrid 28029, Spain
| | - Jesús Vázquez
- Department of Vascular Biology and Inflammation, Centro Nacional de Investigaciones Cardiovasculares, Madrid 28029, Spain
| | | | | | | | - María Mittelbrunn
- 1] Department of Vascular Biology and Inflammation, Centro Nacional de Investigaciones Cardiovasculares, Madrid 28029, Spain [2]
| | - Francisco Sánchez-Madrid
- 1] Department of Vascular Biology and Inflammation, Centro Nacional de Investigaciones Cardiovasculares, Madrid 28029, Spain [2] Servicio de Inmunología, Hospital de la Princesa, Madrid 28006, Spain [3]
| |
Collapse
|
15
|
Segura V, Medina-Aunon JA, Mora MI, Martínez-Bartolomé S, Abian J, Aloria K, Antúnez O, Arizmendi JM, Azkargorta M, Barceló-Batllori S, Beaskoetxea J, Bech-Serra JJ, Blanco F, Monteiro MB, Cáceres D, Canals F, Carrascal M, Casal JI, Clemente F, Colomé N, Dasilva N, Díaz P, Elortza F, Fernández-Puente P, Fuentes M, Gallardo O, Gharbi SI, Gil C, González-Tejedo C, Hernáez ML, Lombardía M, Lopez-Lucendo M, Marcilla M, Mato JM, Mendes M, Oliveira E, Orera I, Pascual-Montano A, Prieto G, Ruiz-Romero C, Sánchez del Pino MM, Tabas-Madrid D, Valero ML, Vialas V, Villanueva J, Albar JP, Corrales FJ. Surfing transcriptomic landscapes. A step beyond the annotation of chromosome 16 proteome. J Proteome Res 2013; 13:158-72. [PMID: 24138474 DOI: 10.1021/pr400721r] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
The Spanish team of the Human Proteome Project (SpHPP) marked the annotation of Chr16 and data analysis as one of its priorities. Precise annotation of Chromosome 16 proteins according to C-HPP criteria is presented. Moreover, Human Body Map 2.0 RNA-Seq and Encyclopedia of DNA Elements (ENCODE) data sets were used to obtain further information relative to cell/tissue specific chromosome 16 coding gene expression patterns and to infer the presence of missing proteins. Twenty-four shotgun 2D-LC-MS/MS and gel/LC-MS/MS MIAPE compliant experiments, representing 41% coverage of chromosome 16 proteins, were performed. Furthermore, mapping of large-scale multicenter mass spectrometry data sets from CCD18, MCF7, Jurkat, and Ramos cell lines into RNA-Seq data allowed further insights relative to correlation of chromosome 16 transcripts and proteins. Detection and quantification of chromosome 16 proteins in biological matrices by SRM procedures are also primary goals of the SpHPP. Two strategies were undertaken: one focused on known proteins, taking advantage of MS data already available, and the second, aimed at the detection of the missing proteins, is based on the expression of recombinant proteins to gather MS information and optimize SRM methods that will be used in real biological samples. SRM methods for 49 known proteins and for recombinant forms of 24 missing proteins are reported in this study.
Collapse
Affiliation(s)
- Víctor Segura
- ProteoRed-ISCIII, Center for Applied Medical Research (CIMA), University of Navarra , Pío XII, 55; Ed. CIMA, 31008 Pamplona, Spain
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Sorzano COS, Pascual-Montano A, Sánchez de Diego A, Martínez-A C, van Wely KHM. Chromothripsis: breakage-fusion-bridge over and over again. Cell Cycle 2013; 12:2016-23. [PMID: 23759584 DOI: 10.4161/cc.25266] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The acquisition of massive but localized chromosome translocations, a phenomenon termed chromothripsis, has received widespread attention since its discovery over a year ago. Until recently, chromothripsis was believed to originate from a single catastrophic event, but the molecular mechanisms leading to this event are yet to be uncovered. Because a thorough interpretation of the data are missing, the phenomenon itself has wrongly acquired the status of a mechanism used to justify many kinds of complex rearrangements. Although the assumption that all translocations in chromothripsis originate from a single event has met with criticism, satisfactory explanations for the intense but localized nature of this phenomenon are still missing. Here, we show why the data used to describe massive catastrophic rearrangements are incompatible with a model comprising a single event only and propose a molecular mechanism in which a combination of known cellular pathways accounts for chromothripsis. Instead of a single traumatic event, the protection of undamaged chromosomes by telomeres can limit repetitive breakage-fusion-bridge events to a single chromosome arm. Ultimately, common properties of chromosomal instability, such as aneuploidy and centromere fission, might establish the complex genetic pattern observed in this genomic state.
Collapse
|
17
|
Nogales-Cadenas R, Jonic S, Tama F, Arteni AA, Tabas-Madrid D, Vázquez M, Pascual-Montano A, Sorzano COS. 3DEM Loupe: Analysis of macromolecular dynamics using structures from electron microscopy. Nucleic Acids Res 2013; 41:W363-7. [PMID: 23671335 PMCID: PMC3692114 DOI: 10.1093/nar/gkt385] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Electron microscopy (EM) provides access to structural information of macromolecular complexes in the 3-20 Å resolution range. Normal mode analysis has been extensively used with atomic resolution structures and successfully applied to EM structures. The major application of normal modes is the identification of possible conformational changes in proteins. The analysis can throw light on the mechanism following ligand binding, protein-protein interactions, channel opening and other functional macromolecular movements. In this article, we present a new web server, 3DEM Loupe, which allows normal mode analysis of any uploaded EM volume using a user-friendly interface and an intuitive workflow. Results can be fully explored in 3D through animations and movies generated by the server. The application is freely available at http://3demloupe.cnb.csic.es.
Collapse
|
18
|
Zuklys S, Mayer CE, Zhanybekova S, Stefanski HE, Nusspaumer G, Gill J, Barthlott T, Chappaz S, Nitta T, Dooley J, Nogales-Cadenas R, Takahama Y, Finke D, Liston A, Blazar BR, Pascual-Montano A, Holländer GA. MicroRNAs control the maintenance of thymic epithelia and their competence for T lineage commitment and thymocyte selection. J Immunol 2012; 189:3894-904. [PMID: 22972926 DOI: 10.4049/jimmunol.1200783] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Thymic epithelial cells provide unique cues for the lifelong selection and differentiation of a repertoire of functionally diverse T cells. Rendered microRNA (miRNA) deficient, these stromal cells in the mouse lose their capacity to instruct the commitment of hematopoietic precursors to a T cell fate, to effect thymocyte positive selection, and to achieve promiscuous gene expression required for central tolerance induction. Over time, the microenvironment created by miRNA-deficient thymic epithelia assumes the cellular composition and structure of peripheral lymphoid tissue, where thympoiesis fails to be supported. These findings emphasize a global role for miRNA in the maintenance and function of the thymic epithelial cell scaffold and establish a novel mechanism how these cells control peripheral tissue Ag expression to prompt central immunological tolerance.
Collapse
Affiliation(s)
- Saulius Zuklys
- Laboratory of Pediatric Immunology, Department of Biomedicine, University of Basel and Basel University Children's Hospital, Basel 4031, Switzerland
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Tabas-Madrid D, Nogales-Cadenas R, Pascual-Montano A. GeneCodis3: a non-redundant and modular enrichment analysis tool for functional genomics. Nucleic Acids Res 2012; 40:W478-83. [PMID: 22573175 PMCID: PMC3394297 DOI: 10.1093/nar/gks402] [Citation(s) in RCA: 457] [Impact Index Per Article: 38.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Since its first release in 2007, GeneCodis has become a valuable tool to functionally interpret results from experimental techniques in genomics. This web-based application integrates different sources of information to finding groups of genes with similar biological meaning. This process, known as enrichment analysis, is essential in the interpretation of high-throughput experiments. The frequent feedbacks and the natural evolution of genomics and bioinformatics have allowed the growth of the tool and the development of this third release. In this version, a special effort has been made to remove noisy and redundant output from the enrichment results with the inclusion of a recently reported algorithm that summarizes significantly enriched terms and generates functionally coherent modules of genes and terms. A new comparative analysis has been added to allow the differential analysis of gene sets. To expand the scope of the application, new sources of biological information have been included, such as genetic diseases, drugs-genes interactions and Pubmed information among others. Finally, the graphic section has been renewed with the inclusion of new interactive graphics and filtering options. The application is freely available at http://genecodis.cnb.csic.es.
Collapse
Affiliation(s)
- Daniel Tabas-Madrid
- Functional Bioinformatics Group, National Center for Biotechnology (CNB-CSIC), Madrid, Spain
| | | | | |
Collapse
|
20
|
Muniategui A, Nogales-Cadenas R, Vázquez M, L. Aranguren X, Agirre X, Luttun A, Prosper F, Pascual-Montano A, Rubio A. Quantification of miRNA-mRNA interactions. PLoS One 2012; 7:e30766. [PMID: 22348024 PMCID: PMC3279346 DOI: 10.1371/journal.pone.0030766] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2011] [Accepted: 12/21/2011] [Indexed: 02/07/2023] Open
Abstract
miRNAs are small RNA molecules (′ 22nt) that interact with their corresponding target mRNAs inhibiting the translation of the mRNA into proteins and cleaving the target mRNA. This second effect diminishes the overall expression of the target mRNA. Several miRNA-mRNA relationship databases have been deployed, most of them based on sequence complementarities. However, the number of false positives in these databases is large and they do not overlap completely. Recently, it has been proposed to combine expression measurement from both miRNA and mRNA and sequence based predictions to achieve more accurate relationships. In our work, we use LASSO regression with non-positive constraints to integrate both sources of information. LASSO enforces the sparseness of the solution and the non-positive constraints restrict the search of miRNA targets to those with down-regulation effects on the mRNA expression. We named this method TaLasso (miRNA-Target LASSO). We used TaLasso on two public datasets that have paired expression levels of human miRNAs and mRNAs. The top ranked interactions recovered by TaLasso are especially enriched (more than using any other algorithm) in experimentally validated targets. The functions of the genes with mRNA transcripts in the top-ranked interactions are meaningful. This is not the case using other algorithms. TaLasso is available as Matlab or R code. There is also a web-based tool for human miRNAs at http://talasso.cnb.csic.es/.
Collapse
Affiliation(s)
- Ander Muniategui
- Group of Bioinformatics, CEIT and TECNUN, University of Navarra, San Sebastian, Spain
| | | | - Miguél Vázquez
- Structural Biology and Biocomputing Programme, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Xabier L. Aranguren
- Center for Molecular and Vascular Biology, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Xabier Agirre
- Hematology Department and Area of Cell Therapy, Clínica Universidad de Navarra, Foundation for Applied Medical Research, University of Navarra, Pamplona, Spain
| | - Aernout Luttun
- Center for Molecular and Vascular Biology, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Felipe Prosper
- Hematology Department and Area of Cell Therapy, Clínica Universidad de Navarra, Foundation for Applied Medical Research, University of Navarra, Pamplona, Spain
| | | | - Angel Rubio
- Group of Bioinformatics, CEIT and TECNUN, University of Navarra, San Sebastian, Spain
- * E-mail:
| |
Collapse
|
21
|
Rebholz-Schuhmann D, Jimeno Yepes A, Li C, Kafkas S, Lewin I, Kang N, Corbett P, Milward D, Buyko E, Beisswanger E, Hornbostel K, Kouznetsov A, Witte R, Laurila JB, Baker CJ, Kuo CJ, Clematide S, Rinaldi F, Farkas R, Móra G, Hara K, Furlong LI, Rautschka M, Neves ML, Pascual-Montano A, Wei Q, Collier N, Chowdhury MFM, Lavelli A, Berlanga R, Morante R, Van Asch V, Daelemans W, Marina JL, van Mulligen E, Kors J, Hahn U. Assessment of NER solutions against the first and second CALBC Silver Standard Corpus. J Biomed Semantics 2011; 2 Suppl 5:S11. [PMID: 22166494 PMCID: PMC3239301 DOI: 10.1186/2041-1480-2-s5-s11] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Background Competitions in text mining have been used to measure the performance of automatic text processing solutions against a manually annotated gold standard corpus (GSC). The preparation of the GSC is time-consuming and costly and the final corpus consists at the most of a few thousand documents annotated with a limited set of semantic groups. To overcome these shortcomings, the CALBC project partners (PPs) have produced a large-scale annotated biomedical corpus with four different semantic groups through the harmonisation of annotations from automatic text mining solutions, the first version of the Silver Standard Corpus (SSC-I). The four semantic groups are chemical entities and drugs (CHED), genes and proteins (PRGE), diseases and disorders (DISO) and species (SPE). This corpus has been used for the First CALBC Challenge asking the participants to annotate the corpus with their text processing solutions. Results All four PPs from the CALBC project and in addition, 12 challenge participants (CPs) contributed annotated data sets for an evaluation against the SSC-I. CPs could ignore the training data and deliver the annotations from their genuine annotation system, or could train a machine-learning approach on the provided pre-annotated data. In general, the performances of the annotation solutions were lower for entities from the categories CHED and PRGE in comparison to the identification of entities categorized as DISO and SPE. The best performance over all semantic groups were achieved from two annotation solutions that have been trained on the SSC-I. The data sets from participants were used to generate the harmonised Silver Standard Corpus II (SSC-II), if the participant did not make use of the annotated data set from the SSC-I for training purposes. The performances of the participants’ solutions were again measured against the SSC-II. The performances of the annotation solutions showed again better results for DISO and SPE in comparison to CHED and PRGE. Conclusions The SSC-I delivers a large set of annotations (1,121,705) for a large number of documents (100,000 Medline abstracts). The annotations cover four different semantic groups and are sufficiently homogeneous to be reproduced with a trained classifier leading to an average F-measure of 85%. Benchmarking the annotation solutions against the SSC-II leads to better performance for the CPs’ annotation solutions in comparison to the SSC-I.
Collapse
|
22
|
Fontanillo C, Nogales-Cadenas R, Pascual-Montano A, De Las Rivas J. Functional analysis beyond enrichment: non-redundant reciprocal linkage of genes and biological terms. PLoS One 2011; 6:e24289. [PMID: 21949701 PMCID: PMC3174934 DOI: 10.1371/journal.pone.0024289] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2011] [Accepted: 08/03/2011] [Indexed: 11/30/2022] Open
Abstract
Functional analysis of large sets of genes and proteins is becoming more and more necessary with the increase of experimental biomolecular data at omic-scale. Enrichment analysis is by far the most popular available methodology to derive functional implications of sets of cooperating genes. The problem with these techniques relies in the redundancy of resulting information, that in most cases generate lots of trivial results with high risk to mask the reality of key biological events. We present and describe a computational method, called GeneTerm Linker, that filters and links enriched output data identifying sets of associated genes and terms, producing metagroups of coherent biological significance. The method uses fuzzy reciprocal linkage between genes and terms to unravel their functional convergence and associations. The algorithm is tested with a small set of well known interacting proteins from yeast and with a large collection of reference sets from three heterogeneous resources: multiprotein complexes (CORUM), cellular pathways (SGD) and human diseases (OMIM). Statistical Precision, Recall and balanced F-score are calculated showing robust results, even when different levels of random noise are included in the test sets. Although we could not find an equivalent method, we present a comparative analysis with a widely used method that combines enrichment and functional annotation clustering. A web application to use the method here proposed is provided at http://gtlinker.cnb.csic.es.
Collapse
Affiliation(s)
- Celia Fontanillo
- Cancer Research Center (CiC-IBMCC, CSIC/USAL), Campus Miguel de Unamuno, Salamanca, Spain
| | - Ruben Nogales-Cadenas
- National Center of Biotechnology (CNB, CSIC), Campus de Cantoblanco UAM, Madrid, Spain
| | | | - Javier De Las Rivas
- Cancer Research Center (CiC-IBMCC, CSIC/USAL), Campus Miguel de Unamuno, Salamanca, Spain
- * E-mail:
| |
Collapse
|
23
|
Vazquez M, Nogales-Cadenas R, Arroyo J, Botías P, García R, Carazo JM, Tirado F, Pascual-Montano A, Carmona-Saez P. MARQ: an online tool to mine GEO for experiments with similar or opposite gene expression signatures. Nucleic Acids Res 2010; 38:W228-32. [PMID: 20513648 PMCID: PMC2896165 DOI: 10.1093/nar/gkq476] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The enormous amount of data available in public gene expression repositories such as Gene Expression Omnibus (GEO) offers an inestimable resource to explore gene expression programs across several organisms and conditions. This information can be used to discover experiments that induce similar or opposite gene expression patterns to a given query, which in turn may lead to the discovery of new relationships among diseases, drugs or pathways, as well as the generation of new hypotheses. In this work, we present MARQ, a web-based application that allows researchers to compare a query set of genes, e.g. a set of over- and under-expressed genes, against a signature database built from GEO datasets for different organisms and platforms. MARQ offers an easy-to-use and integrated environment to mine GEO, in order to identify conditions that induce similar or opposite gene expression patterns to a given experimental condition. MARQ also includes additional functionalities for the exploration of the results, including a meta-analysis pipeline to find genes that are differentially expressed across different experiments. The application is freely available at http://marq.dacya.ucm.es.
Collapse
Affiliation(s)
- Miguel Vazquez
- Software Engineering Department, Facultad de Informatica, Universidad Complutense de Madrid, Madrid, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Medina I, Carbonell J, Pulido L, Madeira SC, Goetz S, Conesa A, Tárraga J, Pascual-Montano A, Nogales-Cadenas R, Santoyo J, García F, Marbà M, Montaner D, Dopazo J. Babelomics: an integrative platform for the analysis of transcriptomics, proteomics and genomic data with advanced functional profiling. Nucleic Acids Res 2010; 38:W210-3. [PMID: 20478823 PMCID: PMC2896184 DOI: 10.1093/nar/gkq388] [Citation(s) in RCA: 265] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Babelomics is a response to the growing necessity of integrating and analyzing different types of genomic data in an environment that allows an easy functional interpretation of the results. Babelomics includes a complete suite of methods for the analysis of gene expression data that include normalization (covering most commercial platforms), pre-processing, differential gene expression (case-controls, multiclass, survival or continuous values), predictors, clustering; large-scale genotyping assays (case controls and TDTs, and allows population stratification analysis and correction). All these genomic data analysis facilities are integrated and connected to multiple options for the functional interpretation of the experiments. Different methods of functional enrichment or gene set enrichment can be used to understand the functional basis of the experiment analyzed. Many sources of biological information, which include functional (GO, KEGG, Biocarta, Reactome, etc.), regulatory (Transfac, Jaspar, ORegAnno, miRNAs, etc.), text-mining or protein–protein interaction modules can be used for this purpose. Finally a tool for the de novo functional annotation of sequences has been included in the system. This provides support for the functional analysis of non-model species. Mirrors of Babelomics or command line execution of their individual components are now possible. Babelomics is available at http://www.babelomics.org.
Collapse
Affiliation(s)
- Ignacio Medina
- Bioinformatics Department, Centro de Investigación Príncipe Felipe, Autopista del Saler 16, Valencia, Spain
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Neves ML, Carazo JM, Pascual-Montano A. Moara: a Java library for extracting and normalizing gene and protein mentions. BMC Bioinformatics 2010; 11:157. [PMID: 20346105 PMCID: PMC2851609 DOI: 10.1186/1471-2105-11-157] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2009] [Accepted: 03/26/2010] [Indexed: 11/10/2022] Open
Abstract
Background Gene/protein recognition and normalization are important preliminary steps for many biological text mining tasks, such as information retrieval, protein-protein interactions, and extraction of semantic information, among others. Despite dedication to these problems and effective solutions being reported, easily integrated tools to perform these tasks are not readily available. Results This study proposes a versatile and trainable Java library that implements gene/protein tagger and normalization steps based on machine learning approaches. The system has been trained for several model organisms and corpora but can be expanded to support new organisms and documents. Conclusions Moara is a flexible, trainable and open-source system that is not specifically orientated to any organism and therefore does not requires specific tuning in the algorithms or dictionaries utilized. Moara can be used as a stand-alone application or can be incorporated in the workflow of a more general text mining system.
Collapse
Affiliation(s)
- Mariana L Neves
- BioComputing Unit, National Center of Biotechnology (CNB-CSIC), Madrid, Spain
| | | | | |
Collapse
|
26
|
Vialás V, Nogales-Cadenas R, Nombela C, Pascual-Montano A, Gil C. Proteopathogen, a protein database for studyingCandida albicans- host interaction. Proteomics 2009; 9:4664-8. [DOI: 10.1002/pmic.200900023] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
27
|
Nogales-Cadenas R, Carmona-Saez P, Vazquez M, Vicente C, Yang X, Tirado F, Carazo JM, Pascual-Montano A. GeneCodis: interpreting gene lists through enrichment analysis and integration of diverse biological information. Nucleic Acids Res 2009; 37:W317-22. [PMID: 19465387 PMCID: PMC2703901 DOI: 10.1093/nar/gkp416] [Citation(s) in RCA: 347] [Impact Index Per Article: 23.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
GeneCodis is a web server application for functional analysis of gene lists that integrates different sources of information and finds modular patterns of interrelated annotations. This integrative approach has proved to be useful for the interpretation of high-throughput experiments and therefore a new version of the system has been developed to expand its functionality and scope. GeneCodis now expands the functional information with regulatory patterns and user-defined annotations, offering the possibility of integrating all sources of information in the same analysis. Traditional singular enrichment is now permitted and more organisms and gene identifiers have been added to the database. The application has been re-engineered to improve performance, accessibility and scalability. In addition, GeneCodis can now be accessed through a public SOAP web services interface, enabling users to perform analysis from their own scripts and workflows. The application is freely available at http://genecodis.dacya.ucm.es
Collapse
|
28
|
Abstract
We present SENT (semantic features in text), a functional interpretation tool based on literature analysis. SENT uses Non-negative Matrix Factorization to identify topics in the scientific articles related to a collection of genes or their products, and use them to group and summarize these genes. In addition, the application allows users to rank and explore the articles that best relate to the topics found, helping put the analysis results into context. This approach is useful as an exploratory step in the workflow of interpreting and understanding experimental data, shedding some light into the complex underlying biological mechanisms. This tool provides a user-friendly interface via a web site, and a programmatic access via a SOAP web server. SENT is freely accessible at http://sent.dacya.ucm.es.
Collapse
Affiliation(s)
- Miguel Vazquez
- Software Engineering Department, Complutense University and Biocomputing Unit, National Center for Biotechnology, CNB-CSIC, Madrid, Spain
| | | | | | | | | | | | | |
Collapse
|
29
|
Velázquez-Muriel JA, Rueda M, Cuesta I, Pascual-Montano A, Orozco M, Carazo JM. Comparison of molecular dynamics and superfamily spaces of protein domain deformation. BMC Struct Biol 2009; 9:6. [PMID: 19220918 PMCID: PMC2666742 DOI: 10.1186/1472-6807-9-6] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2008] [Accepted: 02/17/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND It is well known the strong relationship between protein structure and flexibility, on one hand, and biological protein function, on the other hand. Technically, protein flexibility exploration is an essential task in many applications, such as protein structure prediction and modeling. In this contribution we have compared two different approaches to explore the flexibility space of protein domains: i) molecular dynamics (MD-space), and ii) the study of the structural changes within superfamily (SF-space). RESULTS Our analysis indicates that the MD-space and the SF-space display a significant overlap, but are still different enough to be considered as complementary. The SF-space space is wider but less complex than the MD-space, irrespective of the number of members in the superfamily. Also, the SF-space does not sample all possibilities offered by the MD-space, but often introduces very large changes along just a few deformation modes, whose number tend to a plateau as the number of related folds in the superfamily increases. CONCLUSION Theoretically, we obtained two conclusions. First, that function restricts the access to some flexibility patterns to evolution, as we observe that when a superfamily member changes to become another, the path does not completely overlap with the physical deformability. Second, that conformational changes from variation in a superfamily are larger and much simpler than those allowed by physical deformability. Methodologically, the conclusion is that both spaces studied are complementary, and have different size and complexity. We expect this fact to have application in fields as 3D-EM/X-ray hybrid models or ab initio protein folding.
Collapse
|
30
|
Abstract
Active research on the biology of the centrosome during the past decades has allowed the identification and characterization of many centrosomal proteins. Unfortunately, the accumulated data is still dispersed among heterogeneous sources of information. Here we present centrosome:db, which intends to compile and integrate relevant information related to the human centrosome. We have compiled a set of 383 likely human centrosomal genes and recorded the associated supporting evidences. Centrosome:db offers several perspectives to study the human centrosome including evolution, function and structure. The database contains information on the orthology relationships with other species, including fungi, nematodes, arthropods, urochordates and vertebrates. Predictions of the domain organization of centrosome:db proteins are graphically represented at different sections of the database, including sets of alternative protein isoforms, interacting proteins, groups of orthologs and the homologs identified with blast. Centrosome:db also contains information related to function, gene–disease associations, SNPs and the 3D structure of proteins. Apart from important differences in the coverage of the set of centrosomal genes, our database differentiates from other similar initiatives in the way information is treated and analyzed. Centrosome:db is publicly available at http://centrosome.dacya.ucm.es.
Collapse
|
31
|
Chagoyen M, Carazo JM, Pascual-Montano A. Assessment of protein set coherence using functional annotations. BMC Bioinformatics 2008; 9:444. [PMID: 18937846 PMCID: PMC2588600 DOI: 10.1186/1471-2105-9-444] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2008] [Accepted: 10/20/2008] [Indexed: 11/23/2022] Open
Abstract
Background Analysis of large-scale experimental datasets frequently produces one or more sets of proteins that are subsequently mined for functional interpretation and validation. To this end, a number of computational methods have been devised that rely on the analysis of functional annotations. Although current methods provide valuable information (e.g. significantly enriched annotations, pairwise functional similarities), they do not specifically measure the degree of homogeneity of a protein set. Results In this work we present a method that scores the degree of functional homogeneity, or coherence, of a set of proteins on the basis of the global similarity of their functional annotations. The method uses statistical hypothesis testing to assess the significance of the set in the context of the functional space of a reference set. As such, it can be used as a first step in the validation of sets expected to be homogeneous prior to further functional interpretation. Conclusion We evaluate our method by analysing known biologically relevant sets as well as random ones. The known relevant sets comprise macromolecular complexes, cellular components and pathways described for Saccharomyces cerevisiae, which are mostly significantly coherent. Finally, we illustrate the usefulness of our approach for validating 'functional modules' obtained from computational analysis of protein-protein interaction networks. Matlab code and supplementary data are available at
Collapse
|
32
|
Mejía-Roa E, Carmona-Saez P, Nogales R, Vicente C, Vázquez M, Yang XY, García C, Tirado F, Pascual-Montano A. bioNMF: a web-based tool for nonnegative matrix factorization in biology. Nucleic Acids Res 2008; 36:W523-8. [PMID: 18515346 PMCID: PMC2447803 DOI: 10.1093/nar/gkn335] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In the last few years, advances in high-throughput technologies are generating large amounts of biological data that require analysis and interpretation. Nonnegative matrix factorization (NMF) has been established as a very effective method to reveal information about the complex latent relationships in experimental data sets. Using this method as part of the exploratory data analysis, workflow would certainly help in the process of interpreting and understanding the complex biology mechanisms that are underlying experimental data. We have developed bioNMF, a web-based tool that implements the NMF methodology in different analysis contexts to support some of the most important reported applications in biology. This online tool provides a user-friendly interface, combined with a computational efficient parallel implementation of the NMF methods to explore the data in different analysis scenarios. In addition to the online access, bioNMF also provides the same functionality included in the website as a public web services interface, enabling users with more computer expertise to launch jobs into bioNMF server from their own scripts and workflows. bioNMF application is freely available at http://bionmf.dacya.ucm.es.
Collapse
Affiliation(s)
- E Mejía-Roa
- Computer Architecture Department, Complutense University, Madrid, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Abascal F, Carmona-Saez P, Carazo JM, Pascual-Montano A. ChIPCodis: mining complex regulatory systems in yeast by concurrent enrichment analysis of chip-on-chip data. Bioinformatics 2008; 24:1208-9. [PMID: 18339638 DOI: 10.1093/bioinformatics/btn094] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Eukaryotic genes are often regulated by multiple transcription factors (TFs). Depending on the interactions among different TFs the expression of a gene can be tuned to respond to diverse environmental conditions. Chip-on-chip experiments provide a snapshot of which TF are in vivo bound to which genes in a particular condition, and have been applied to characterize the regulatory code of yeast under several experimental settings. ChIPCodis mines this data to provide new insights about how the expression of a particular group of genes is regulated. For a given list of yeast genes ChIPCodis determines which combinations of TFs are significantly over-represented in a series of environmental conditions. AVAILABILITY http://chipcodis.dacya.ucm.es
Collapse
Affiliation(s)
- Federico Abascal
- BioComputing Unit, National Center of Biotechnology (CSIC), Madrid, Spain
| | | | | | | |
Collapse
|
34
|
Liang W, Pascual-Montano A, Silva AJ, Benitez JA. The cyclic AMP receptor protein modulates quorum sensing, motility and multiple genes that affect intestinal colonization in Vibrio cholerae. Microbiology (Reading) 2007; 153:2964-2975. [PMID: 17768239 DOI: 10.1099/mic.0.2007/006668-0] [Citation(s) in RCA: 127] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Vibrio cholerae is the causative agent of cholera, which continues to be a major public health concern in Asia, Africa and Latin America. The bacterium can persist outside the human host and alternates between planktonic and biofilm community lifestyles. Transition between the different lifestyles is mediated by multiple signal transduction pathways including quorum sensing. Expression of the Zn-metalloprotease haemagglutinin (HA)/protease is subject to a dual regulation which involves the quorum-sensing regulator HapR and the cAMP receptor protein. In a previous study, we observed that a mutant defective in the cAMP-receptor protein (CRP) expressed lower levels of HapR. To further investigate the role of CRP in modulating HapR and other signal transduction pathways, we performed global gene expression profiling of a Deltacrp mutant of El Tor biotype V. cholerae. Here we show that CRP is required for the biosynthesis of cholera autoinducer 1 (CAI-1) and affects the expression of multiple HapR-regulated genes. As expected, the Deltacrp mutant produced more cholera toxin and enhanced biofilm. Expression of flagellar genes, reported to be affected in DeltahapR mutants, was diminished in the Deltacrp mutant. However, an epistasis analysis indicated that cAMP-CRP affects motility by a mechanism independent of HapR. Inactivation of crp inhibited the expression of multiple genes reported to be strongly induced in vivo and to affect the ability of V. cholerae to colonize the small intestine and cause disease. These genes included ompU, ompT and ompW encoding outer-membrane proteins, the alternative sigma factor sigma(E) required for intestinal colonization, and genes involved in anaerobic energy metabolism. Our results indicate that CRP plays a crucial role in the V. cholerae life cycle by affecting quorum sensing and multiple genes required for survival of V. cholerae in the human host and the environment.
Collapse
Affiliation(s)
- Weili Liang
- Morehouse School of Medicine, Department of Microbiology, Biochemistry and Immunology, 720 Westview Dr. SW, Atlanta, GA, USA
| | - Alberto Pascual-Montano
- Computer Architecture Department, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, Madrid 28040, Spain
| | - Anisia J Silva
- Morehouse School of Medicine, Department of Microbiology, Biochemistry and Immunology, 720 Westview Dr. SW, Atlanta, GA, USA
| | - Jorge A Benitez
- Morehouse School of Medicine, Department of Microbiology, Biochemistry and Immunology, 720 Westview Dr. SW, Atlanta, GA, USA
| |
Collapse
|
35
|
Carmona-Saez P, Chagoyen M, Tirado F, Carazo JM, Pascual-Montano A. GENECODIS: a web-based tool for finding significant concurrent annotations in gene lists. Genome Biol 2007; 8:R3. [PMID: 17204154 PMCID: PMC1839127 DOI: 10.1186/gb-2007-8-1-r3] [Citation(s) in RCA: 510] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2006] [Revised: 09/29/2006] [Accepted: 01/04/2007] [Indexed: 12/01/2022] Open
Abstract
GENECODIS, a web-based tool for finding annotations that frequently co-occur in a set of genes and ranking them by their statistical significance, is presented. We present GENECODIS, a web-based tool that integrates different sources of information to search for annotations that frequently co-occur in a set of genes and rank them by statistical significance. The analysis of concurrent annotations provides significant information for the biologic interpretation of high-throughput experiments and may outperform the results of standard methods for the functional analysis of gene lists. GENECODIS is publicly available at .
Collapse
Affiliation(s)
- Pedro Carmona-Saez
- BioComputing Unit, National Center of Biotechnology (CNB-CSIC), C/Darwin 3, Campus Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Monica Chagoyen
- BioComputing Unit, National Center of Biotechnology (CNB-CSIC), C/Darwin 3, Campus Universidad Autónoma de Madrid, 28049 Madrid, Spain
- Computer Architecture Department, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, C/Avenida Complutense S/N, 28040 Madrid, Spain
| | - Francisco Tirado
- Computer Architecture Department, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, C/Avenida Complutense S/N, 28040 Madrid, Spain
| | - Jose M Carazo
- BioComputing Unit, National Center of Biotechnology (CNB-CSIC), C/Darwin 3, Campus Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Alberto Pascual-Montano
- Computer Architecture Department, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, C/Avenida Complutense S/N, 28040 Madrid, Spain
| |
Collapse
|
36
|
Pascual-Montano A, Carmona-Saez P, Chagoyen M, Tirado F, Carazo JM, Pascual-Marqui RD. bioNMF: a versatile tool for non-negative matrix factorization in biology. BMC Bioinformatics 2006; 7:366. [PMID: 16875499 PMCID: PMC1550731 DOI: 10.1186/1471-2105-7-366] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2006] [Accepted: 07/28/2006] [Indexed: 12/02/2022] Open
Abstract
Background In the Bioinformatics field, a great deal of interest has been given to Non-negative matrix factorization technique (NMF), due to its capability of providing new insights and relevant information about the complex latent relationships in experimental data sets. This method, and some of its variants, has been successfully applied to gene expression, sequence analysis, functional characterization of genes and text mining. Even if the interest on this technique by the bioinformatics community has been increased during the last few years, there are not many available simple standalone tools to specifically perform these types of data analysis in an integrated environment. Results In this work we propose a versatile and user-friendly tool that implements the NMF methodology in different analysis contexts to support some of the most important reported applications of this new methodology. This includes clustering and biclustering gene expression data, protein sequence analysis, text mining of biomedical literature and sample classification using gene expression. The tool, which is named bioNMF, also contains a user-friendly graphical interface to explore results in an interactive manner and facilitate in this way the exploratory data analysis process. Conclusion bioNMF is a standalone versatile application which does not require any special installation or libraries. It can be used for most of the multiple applications proposed in the bioinformatics field or to support new research using this method. This tool is publicly available at .
Collapse
Affiliation(s)
- Alberto Pascual-Montano
- Computer Architecture Department, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, 28040, Spain
| | - Pedro Carmona-Saez
- BioComputing Unit, National Center of Biotechnology, Campus Universidad Autónoma de Madrid, 28049, Spain
| | - Monica Chagoyen
- Computer Architecture Department, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, 28040, Spain
- BioComputing Unit, National Center of Biotechnology, Campus Universidad Autónoma de Madrid, 28049, Spain
| | - Francisco Tirado
- Computer Architecture Department, Facultad de Ciencias Físicas, Universidad Complutense de Madrid, 28040, Spain
| | - Jose M Carazo
- BioComputing Unit, National Center of Biotechnology, Campus Universidad Autónoma de Madrid, 28049, Spain
| | - Roberto D Pascual-Marqui
- The KEY Institute for Brain-Mind Research, University Hospital of Psychiatry. Lenggstr. 31, CH-8029 Zurich, Switzerland
| |
Collapse
|
37
|
Chagoyen M, Carmona-Saez P, Gil C, Carazo JM, Pascual-Montano A. A literature-based similarity metric for biological processes. BMC Bioinformatics 2006; 7:363. [PMID: 16872502 PMCID: PMC1579237 DOI: 10.1186/1471-2105-7-363] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2006] [Accepted: 07/26/2006] [Indexed: 11/10/2022] Open
Abstract
Background Recent analyses in systems biology pursue the discovery of functional modules within the cell. Recognition of such modules requires the integrative analysis of genome-wide experimental data together with available functional schemes. In this line, methods to bridge the gap between the abstract definitions of cellular processes in current schemes and the interlinked nature of biological networks are required. Results This work explores the use of the scientific literature to establish potential relationships among cellular processes. To this end we haveused a document based similarity method to compute pair-wise similarities of the biological processes described in the Gene Ontology (GO). The method has been applied to the biological processes annotated for the Saccharomyces cerevisiae genome. We compared our results with similarities obtained with two ontology-based metrics, as well as with gene product annotation relationships. We show that the literature-based metric conserves most direct ontological relationships, while reveals biologically sounded similarities that are not obtained using ontology-based metrics and/or genome annotation. Conclusion The scientific literature is a valuable source of information from which to compute similarities among biological processes. The associations discovered by literature analysis are a valuable complement to those encoded in existing functional schemes, and those that arise by genome annotation. These similarities can be used to conveniently map the interlinked structure of cellular processes in a particular organism.
Collapse
Affiliation(s)
- Monica Chagoyen
- Biocomputing Unit. Centro Nacional de Biotecnologia – CSIC, Madrid, Spain
- Dpto. Arquitectura de Computadores y Automatica. Universidad Complutense de Madrid, Madrid, Spain
| | - Pedro Carmona-Saez
- Biocomputing Unit. Centro Nacional de Biotecnologia – CSIC, Madrid, Spain
| | - Concha Gil
- Dpto. Microbiologia II. Facultad de Farmacia. Universidad Complutense de Madrid, Madrid, Spain
- Unidad de Proteomica UCM – Parque Cientifico de Madrid, Madrid, Spain
| | - Jose M Carazo
- Biocomputing Unit. Centro Nacional de Biotecnologia – CSIC, Madrid, Spain
| | - Alberto Pascual-Montano
- Dpto. Arquitectura de Computadores y Automatica. Universidad Complutense de Madrid, Madrid, Spain
| |
Collapse
|
38
|
Pascual-Montano A, Carazo JM, Kochi K, Lehmann D, Pascual-Marqui RD. Nonsmooth nonnegative matrix factorization (nsNMF). IEEE Trans Pattern Anal Mach Intell 2006; 28:403-15. [PMID: 16526426 DOI: 10.1109/tpami.2006.60] [Citation(s) in RCA: 117] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
We propose a novel nonnegative matrix factorization model that aims at finding localized, part-based, representations of nonnegative multivariate data items. Unlike the classical nonnegative matrix factorization (NMF) technique, this new model, denoted "nonsmooth nonnegative matrix factorization" (nsNMF), corresponds to the optimization of an unambiguous cost function designed to explicitly represent sparseness, in the form of nonsmoothness, which is controlled by a single parameter. In general, this method produces a set of basis and encoding vectors that are not only capable of representing the original data, but they also extract highly localized patterns, which generally lend themselves to improved interpretability. The properties of this new method are illustrated with several data sets. Comparisons to previously published methods show that the new nsNMF method has some advantages in keeping faithfulness to the data in the achieving a high degree of sparseness for both the estimated basis and the encoding vectors and in better interpretability of the factors.
Collapse
Affiliation(s)
- Alberto Pascual-Montano
- Computer Architecture and System Engineering Department, Facultad de Ciencias Físicas, Universidad Complutense, Madrid, Spain.
| | | | | | | | | |
Collapse
|
39
|
Carmona-Saez P, Pascual-Marqui RD, Tirado F, Carazo JM, Pascual-Montano A. Biclustering of gene expression data by Non-smooth Non-negative Matrix Factorization. BMC Bioinformatics 2006; 7:78. [PMID: 16503973 PMCID: PMC1434777 DOI: 10.1186/1471-2105-7-78] [Citation(s) in RCA: 130] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2005] [Accepted: 02/17/2006] [Indexed: 12/01/2022] Open
Abstract
Background The extended use of microarray technologies has enabled the generation and accumulation of gene expression datasets that contain expression levels of thousands of genes across tens or hundreds of different experimental conditions. One of the major challenges in the analysis of such datasets is to discover local structures composed by sets of genes that show coherent expression patterns across subsets of experimental conditions. These patterns may provide clues about the main biological processes associated to different physiological states. Results In this work we present a methodology able to cluster genes and conditions highly related in sub-portions of the data. Our approach is based on a new data mining technique, Non-smooth Non-Negative Matrix Factorization (nsNMF), able to identify localized patterns in large datasets. We assessed the potential of this methodology analyzing several synthetic datasets as well as two large and heterogeneous sets of gene expression profiles. In all cases the method was able to identify localized features related to sets of genes that show consistent expression patterns across subsets of experimental conditions. The uncovered structures showed a clear biological meaning in terms of relationships among functional annotations of genes and the phenotypes or physiological states of the associated conditions. Conclusion The proposed approach can be a useful tool to analyze large and heterogeneous gene expression datasets. The method is able to identify complex relationships among genes and conditions that are difficult to identify by standard clustering algorithms.
Collapse
Affiliation(s)
- Pedro Carmona-Saez
- BioComputing Unit. National Center of Biotechnology. Campus Universidad Autónoma de Madrid. 28049. Spain
| | - Roberto D Pascual-Marqui
- The KEY Institute for Brain-Mind Research, University Hospital of Psychiatry. Lenggstr. 31, CH-8029 Zurich, Switzerland
| | - F Tirado
- Computer Architecture Department. Facultad de Ciencias Físicas. Universidad Complutense de Madrid. 28040. Spain
| | - Jose M Carazo
- BioComputing Unit. National Center of Biotechnology. Campus Universidad Autónoma de Madrid. 28049. Spain
| | - Alberto Pascual-Montano
- Computer Architecture Department. Facultad de Ciencias Físicas. Universidad Complutense de Madrid. 28040. Spain
| |
Collapse
|
40
|
Carmona-Saez P, Chagoyen M, Rodriguez A, Trelles O, Carazo JM, Pascual-Montano A. Integrated analysis of gene expression by Association Rules Discovery. BMC Bioinformatics 2006; 7:54. [PMID: 16464256 PMCID: PMC1386712 DOI: 10.1186/1471-2105-7-54] [Citation(s) in RCA: 75] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2005] [Accepted: 02/07/2006] [Indexed: 11/24/2022] Open
Abstract
Background Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biological information about genes and gene products to the analysis of expression data. However, most of the current approaches to analyze microarray datasets are mainly focused on the analysis of experimental data, and external biological information is incorporated as a posterior process. Results In this study we present a method for the integrative analysis of microarray data based on the Association Rules Discovery data mining technique. The approach integrates gene annotations and expression data to discover intrinsic associations among both data sources based on co-occurrence patterns. We applied the proposed methodology to the analysis of gene expression datasets in which genes were annotated with metabolic pathways, transcriptional regulators and Gene Ontology categories. Automatically extracted associations revealed significant relationships among these gene attributes and expression patterns, where many of them are clearly supported by recently reported work. Conclusion The integration of external biological information and gene expression data can provide insights about the biological processes associated to gene expression programs. In this paper we show that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data. An implementation of the method is included in the Engene software package.
Collapse
Affiliation(s)
- Pedro Carmona-Saez
- BioComputing Unit, National Center for Biotechnology (CNB-CSIC), Cantoblanco, 28049, Madrid, Spain
| | - Monica Chagoyen
- BioComputing Unit, National Center for Biotechnology (CNB-CSIC), Cantoblanco, 28049, Madrid, Spain
| | - Andres Rodriguez
- Computer Architecture Department, Universidad de Málaga, 29080, Málaga, Spain
| | - Oswaldo Trelles
- Computer Architecture Department, Universidad de Málaga, 29080, Málaga, Spain
| | - Jose M Carazo
- BioComputing Unit, National Center for Biotechnology (CNB-CSIC), Cantoblanco, 28049, Madrid, Spain
| | - Alberto Pascual-Montano
- Computer Architecture and System Engineering Department, Facultad de CC Físicas, Universidad Complutense de Madrid, 28040, Madrid, Spain
| |
Collapse
|
41
|
Chagoyen M, Carmona-Saez P, Shatkay H, Carazo JM, Pascual-Montano A. Discovering semantic features in the literature: a foundation for building functional associations. BMC Bioinformatics 2006; 7:41. [PMID: 16438716 PMCID: PMC1386711 DOI: 10.1186/1471-2105-7-41] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2005] [Accepted: 01/26/2006] [Indexed: 11/10/2022] Open
Abstract
Background Experimental techniques such as DNA microarray, serial analysis of gene expression (SAGE) and mass spectrometry proteomics, among others, are generating large amounts of data related to genes and proteins at different levels. As in any other experimental approach, it is necessary to analyze these data in the context of previously known information about the biological entities under study. The literature is a particularly valuable source of information for experiment validation and interpretation. Therefore, the development of automated text mining tools to assist in such interpretation is one of the main challenges in current bioinformatics research. Results We present a method to create literature profiles for large sets of genes or proteins based on common semantic features extracted from a corpus of relevant documents. These profiles can be used to establish pair-wise similarities among genes, utilized in gene/protein classification or can be even combined with experimental measurements. Semantic features can be used by researchers to facilitate the understanding of the commonalities indicated by experimental results. Our approach is based on non-negative matrix factorization (NMF), a machine-learning algorithm for data analysis, capable of identifying local patterns that characterize a subset of the data. The literature is thus used to establish putative relationships among subsets of genes or proteins and to provide coherent justification for this clustering into subsets. We demonstrate the utility of the method by applying it to two independent and vastly different sets of genes. Conclusion The presented method can create literature profiles from documents relevant to sets of genes. The representation of genes as additive linear combinations of semantic features allows for the exploration of functional associations as well as for clustering, suggesting a valuable methodology for the validation and interpretation of high-throughput experimental data.
Collapse
Affiliation(s)
- Monica Chagoyen
- Biocomputing Unit, Centro Nacional de Biotecnologia – CSIC, Madrid, Spain
| | - Pedro Carmona-Saez
- Biocomputing Unit, Centro Nacional de Biotecnologia – CSIC, Madrid, Spain
| | - Hagit Shatkay
- School of Computing, Queen's University, Kingston, Ontario, Canada
| | - Jose M Carazo
- Biocomputing Unit, Centro Nacional de Biotecnologia – CSIC, Madrid, Spain
| | | |
Collapse
|
42
|
Guerra S, López-Fernández LA, Pascual-Montano A, Nájera JL, Zaballos A, Esteban M. Host response to the attenuated poxvirus vector NYVAC: upregulation of apoptotic genes and NF-kappaB-responsive genes in infected HeLa cells. J Virol 2006; 80:985-98. [PMID: 16379000 PMCID: PMC1346868 DOI: 10.1128/jvi.80.2.985-998.2006] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2005] [Accepted: 10/10/2005] [Indexed: 11/20/2022] Open
Abstract
NYVAC has been engineered as a safe, attenuated vaccinia virus (VV) vector for use in vaccination against a broad spectrum of pathogens and tumors. Due to the interest in NYVAC-based vectors as vaccines and current phase I/II clinical trials with this vector, there is a need to analyze the human host response to NYVAC infection. Using high-density cDNA microarrays, we found 368 differentially regulated genes after NYVAC infection of HeLa cells. Clustering of the regulated genes identified six discrete gene clusters with altered expression patterns. Clusters 1 to 3 represented 47.5% of the regulated genes, with three patterns of gene activation kinetics, whereas clusters 4 to 6 showed distinct repression kinetics. Quantitative real-time reverse transcription-PCR analysis of selected genes validated the array data. Upregulated transcripts correlated with genes implicated in immune responses, including those encoding interleukin-1 receptor 2 (IL-1R2), IL-6, ISG-15, CD-80, and TNFSF7. NYVAC upregulated several intermediates of apoptotic cascades, including caspase-9, correlating with its ability to induce apoptosis. NYVAC infection also stimulated the expression of NF-kappaB1 and NF-kappaB2 as well as that of NF-kappaB target genes. Expression of the VV host range K1L gene during NYVAC infection prevented NF-kappaB activation, but not the induction of apoptosis. This study is the first overall analysis of the transcriptional response of human cells to NYVAC infection and provides a framework for future functional studies to evaluate this vector and its derivatives as human vaccines.
Collapse
Affiliation(s)
- Susana Guerra
- Department of Molecular and Cellular Biology, Centro Nacional de Biotecnología/CSIC, Ciudad Universitaria Cantoblanco, 28049 Madrid, Spain
| | | | | | | | | | | |
Collapse
|
43
|
Sorzano COS, Marabini R, Velázquez-Muriel J, Bilbao-Castro JR, Scheres SHW, Carazo JM, Pascual-Montano A. XMIPP: a new generation of an open-source image processing package for electron microscopy. J Struct Biol 2005; 148:194-204. [PMID: 15477099 DOI: 10.1016/j.jsb.2004.06.006] [Citation(s) in RCA: 353] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2004] [Revised: 06/04/2004] [Indexed: 11/30/2022]
Abstract
X-windows based microscopy image processing package (Xmipp) is a specialized suit of image processing programs, primarily aimed at obtaining the 3D reconstruction of biological specimens from large sets of projection images acquired by transmission electron microscopy. This public-domain software package was introduced to the electron microscopy field eight years ago, and since then it has changed drastically. New methodologies for the analysis of single-particle projection images have been added to classification, contrast transfer function correction, angular assignment, 3D reconstruction, reconstruction of crystals, etc. In addition, the package has been extended with functionalities for 2D crystal and electron tomography data. Furthermore, its current implementation in C++, with a highly modular design of well-documented data structures and functions, offers a convenient environment for the development of novel algorithms. In this paper, we present a general overview of a new generation of Xmipp that has been re-engineered to maximize flexibility and modularity, potentially facilitating its integration in future standardization efforts in the field. Moreover, by focusing on those developments that distinguish Xmipp from other packages available, we illustrate its added value to the electron microscopy community.
Collapse
Affiliation(s)
- C O S Sorzano
- Unidad de Biocomputación, Centro Nacional de Biotecnología (CSIC), Campus Universidad Autónoma s/n, 28049 Cantoblanco, Madrid, Spain.
| | | | | | | | | | | | | |
Collapse
|
44
|
Guerra S, López-Fernández LA, Conde R, Pascual-Montano A, Harshman K, Esteban M. Microarray analysis reveals characteristic changes of host cell gene expression in response to attenuated modified vaccinia virus Ankara infection of human HeLa cells. J Virol 2004; 78:5820-34. [PMID: 15140980 PMCID: PMC415835 DOI: 10.1128/jvi.78.11.5820-5834.2004] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The potential use of the modified vaccinia virus Ankara (MVA) strain as a live recombinant vector to deliver antigens and elicit protective immune responses against infectious diseases demands a comprehensive understanding of the effect of MVA infection on human host gene expression. We used microarrays containing more than 15,000 human cDNAs to identify gene expression changes in human HeLa cell cultures at 2, 6, and 16 h postinfection. Clustering of the 410 differentially regulated genes identified 11 discrete gene clusters with altered expression patterns after MVA infection. Clusters 1 and 2 (accounting for 16.59% [68 of 410] of the genes) contained 68 transcripts showing a robust induction pattern that was maintained during the course of infection. Changes in cellular gene transcription detected by microarrays after MVA infection were confirmed for selected genes by Northern blot analysis and by real-time reverse transcription-PCR. Upregulated transcripts in clusters 1 and 2 included 20 genes implicated in immune responses, including interleukin 1A (IL-1A), IL-6, IL-7, IL-8, and IL-15 genes. MVA infection also stimulated the expression of NF-kappaB and components of the NF-kappaB signal transduction pathway, including p50 and TRAF-interacting protein. A marked increase in the expression of histone family members was also induced during MVA infection. Expression of the Wiskott-Aldrich syndrome family members WAS, WASF1, and the small GTP-binding protein RAC-1, which are involved in actin cytoskeleton reorganization, was enhanced after MVA infection. This study demonstrates that MVA infection triggered the induction of groups of genes, some of which may be involved in host resistance and immune modulation during virus infection.
Collapse
Affiliation(s)
- Susana Guerra
- Centro Nacional de Biotecnología, CSIC, Campus Universidad Autónoma, 28049 Madrid, Spain
| | | | | | | | | | | |
Collapse
|
45
|
Guerra S, López-Fernández LA, Pascual-Montano A, Muñoz M, Harshman K, Esteban M. Cellular gene expression survey of vaccinia virus infection of human HeLa cells. J Virol 2003; 77:6493-506. [PMID: 12743306 PMCID: PMC154985 DOI: 10.1128/jvi.77.11.6493-6506.2003] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Vaccinia virus (VV) is a cytocidal virus that causes major changes in host cell machinery shortly after infecting cells. To define the consequences of virus infection on host gene expression, we used microarrays of approximately 15,000 human cDNAs to examine expression levels of mRNAs isolated at 2, 6, and 16 h postinfection from cultures of infected HeLa cells. The majority of profiling changes during VV infection corresponded to downregulation of genes at 16 h postinfection. Differentially expressed genes were clustered into seven groups to identify common regulatory pathways, with most of them (90%) belonging to clusters 6 and 7, which represent genes whose expression was repressed after infection. Cluster 1, however, contained 37 transcripts (2.81%) showing a robust pattern of induction that was maintained during the course of infection. Genes in cluster 1 included those for Wiskott-Aldrich syndrome protein (WASP) family member WASF1, thymosine, adenosine A2a receptor, glutamate decarboxylase 2, CD-80 antigen, KIAA0888 protein, selenophosphate synthetase, pericentrin, and attractin as well as several expressed sequence tags. We analyzed in more detail the fate of WASP protein in VV-infected cells, because a related family member, N-WASP, is involved in viral motility. WASP protein accumulated in the course of infection; its increase required viral DNA replication and de novo protein synthesis, and it localized in cytoplasmic structures distinct from uninfected cells. This study is the first quantitative analysis of host gene expression following VV infection of cultured human cells, demonstrating global changes in the expression profile, and identifies upregulated genes with potential roles in the virus replication cycle.
Collapse
Affiliation(s)
- Susana Guerra
- Department of Molecular and Cellular Biology, Centro Nacional de Biotecnología, Consejo Superior de Investigaciones Científicas, Campus Universidad Autónoma, 28049 Madrid, Spain
| | | | | | | | | | | |
Collapse
|
46
|
García de la Nava J, Santaella DF, Cuenca Alba J, María Carazo J, Trelles O, Pascual-Montano A. Engene: the processing and exploratory analysis of gene expression data. Bioinformatics 2003; 19:657-8. [PMID: 12651727 DOI: 10.1093/bioinformatics/btg028] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Engene is a versatile, and platform-independent web tool for exploratory analysis of gene expression data that aims at storing, visualizing and processing large sets of gene expression patterns.
Collapse
|
47
|
Abstract
In the present work we develop an efficient way of representing the geometry and topology of volumetric datasets of biological structures from medium to low resolution, aiming at storing and querying them in a database framework. We make use of a new vector quantization algorithm to select the points within the macromolecule that best approximate the probability density function of the original volume data. Connectivity among points is obtained with the use of the alpha shapes theory. This novel data representation has a number of interesting characteristics, such as 1) it allows us to automatically segment and quantify a number of important structural features from low-resolution maps, such as cavities and channels, opening the possibility of querying large collections of maps on the basis of these quantitative structural features; 2) it provides a compact representation in terms of size; 3) it contains a subset of three-dimensional points that optimally quantify the densities of medium resolution data; and 4) a general model of the geometry and topology of the macromolecule (as opposite to a spatially unrelated bunch of voxels) is easily obtained by the use of the alpha shapes theory.
Collapse
Affiliation(s)
- Pedro A De-Alarcón
- Biocomputing Unit, Centro Nacional de Biotecnologia (CSIC), Campus UAM, Cantoblanco, 28049 Madrid, Spain
| | | | | | | |
Collapse
|
48
|
Abstract
Tomography emerges as a powerful methodology for determining the complex architectures of biological specimens that are better regarded from the structural point of view as singular entities. However, once the structure of a sufficiently large number of singular specimens is solved, quite possibly structural patterns start to emerge. This latter situation is addressed here, where the clustering of a set of 3D reconstructions using a novel quantitative approach is presented. In general terms, we propose a new variant of a self-organizing neural network for the unsupervised classification of 3D reconstructions. The novelty of the algorithm lies in its rigorous mathematical formulation that, starting from a large set of noisy input data, finds a set of "representative" items, organized onto an ordered output map, such that the probability density of this set of representative items resembles at its possible best the probability density of the input data. In this study, we evaluate the feasibility of application of the proposed neural approach to the problem of identifying similar 3D motifs within tomograms of insect flight muscle. Our experimental results prove that this technique is suitable for this type of problem, providing the electron microscopy community with a new tool for exploring large sets of tomogram data to find complex patterns.
Collapse
Affiliation(s)
- A Pascual-Montano
- Centro Nacional de Biotecnología-CSIC, Campus Universidad Autónoma, 28049 Madrid, Spain
| | | | | | | | | |
Collapse
|
49
|
Pascual-Montano A, Donate LE, Valle M, Bárcena M, Pascual-Marqui RD, Carazo JM. A novel neural network technique for analysis and classification of EM single-particle images. J Struct Biol 2001; 133:233-45. [PMID: 11472094 DOI: 10.1006/jsbi.2001.4369] [Citation(s) in RCA: 66] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We propose a novel self-organizing neural network for the unsupervised classification of electron microscopy (EM) images of biological macromolecules. The radical novelty of the algorithm lies in its rigorous mathematical formulation that, starting from a large set of possibly very noisy input data, finds a set of "representative" data items, organized onto an ordered output map, such that the probability density of this set of representative items resembles at its possible best the probability density of the input data. In a way, it summarizes large amounts of information into a concise description that rigorously keeps the basic pattern of the input data distribution. In this application to the field of three-dimensional EM of single particles, two different data sets have been used; one comprised 2458 rotational power spectra of individual negative stain images of the G40P helicase of Bacillus subtilis bacteriophage SPP1, and the other contained 2822 cryoelectron images of SV40 large T-antigen. Our experimental results prove that this technique is indeed very successful, providing the user with the capability of exploring complex patterns in a succinct, informative, and objective manner. The above facts, together with the consideration that the integration of this new algorithm with commonly used software packages is immediate, prompt us to propose it as a valuable new tool in the analysis of large collections of noisy data.
Collapse
Affiliation(s)
- A Pascual-Montano
- Centro Nacional de Biotecnología-CSIC, Campus Universidad Autónoma, Madrid, 28049, Spain
| | | | | | | | | | | |
Collapse
|