1
|
Mazein A, Acencio ML, Balaur I, Rougny A, Welter D, Niarakis A, Ramirez Ardila D, Dogrusoz U, Gawron P, Satagopam V, Gu W, Kremer A, Schneider R, Ostaszewski M. A guide for developing comprehensive systems biology maps of disease mechanisms: planning, construction and maintenance. FRONTIERS IN BIOINFORMATICS 2023; 3:1197310. [PMID: 37426048 PMCID: PMC10325725 DOI: 10.3389/fbinf.2023.1197310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 06/09/2023] [Indexed: 07/11/2023] Open
Abstract
As a conceptual model of disease mechanisms, a disease map integrates available knowledge and is applied for data interpretation, predictions and hypothesis generation. It is possible to model disease mechanisms on different levels of granularity and adjust the approach to the goals of a particular project. This rich environment together with requirements for high-quality network reconstruction makes it challenging for new curators and groups to be quickly introduced to the development methods. In this review, we offer a step-by-step guide for developing a disease map within its mainstream pipeline that involves using the CellDesigner tool for creating and editing diagrams and the MINERVA Platform for online visualisation and exploration. We also describe how the Neo4j graph database environment can be used for managing and querying efficiently such a resource. For assessing the interoperability and reproducibility we apply FAIR principles.
Collapse
Affiliation(s)
- Alexander Mazein
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Marcio Luis Acencio
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Irina Balaur
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | - Danielle Welter
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Anna Niarakis
- Université Paris-Saclay, Laboratoire Européen de Recherche Pour la Polyarthrite Rhumatoïde–Genhotel, University Evry, Evry, France
- Lifeware Group, Inria Saclay-Ile de France, Palaiseau, France
| | - Diana Ramirez Ardila
- ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxemburg
| | - Ugur Dogrusoz
- Computer Engineering Department, Bilkent University, Ankara, Türkiye
| | - Piotr Gawron
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Wei Gu
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Andreas Kremer
- ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxemburg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| |
Collapse
|
2
|
Balaur I, Roy L, Touré V, Mazein A, Auffray C. GraphML-SBGN bidirectional converter for metabolic networks. J Integr Bioinform 2022; 19:jib-2022-0030. [PMID: 36563404 PMCID: PMC9800040 DOI: 10.1515/jib-2022-0030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 12/12/2022] [Indexed: 12/24/2022] Open
Abstract
Systems biology researchers need feasible solutions for editing and visualisation of large biological diagrams. Here, we present the ySBGN bidirectional converter that translates metabolic pathways, developed in the general-purpose yEd Graph Editor (using the GraphML format) into the Systems Biology Graphical Notation Markup Language (SBGN-ML) standard format and vice versa. We illustrate the functionality of this converter by applying it to the translation of the ReconMap resource (available in the SBGN-ML format) to the yEd-specific GraphML and back. The ySBGN tool makes possible to draw extensive metabolic diagrams in a powerful general-purpose graph editor while providing results in the standard SBGN format.
Collapse
Affiliation(s)
- Irina Balaur
- European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007Lyon, France
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6 Avenue du Swing, L-4367Belvaux, Luxembourg
| | - Ludovic Roy
- European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007Lyon, France
| | - Vasundra Touré
- European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007Lyon, France
- Department of Biology, Norwegian University of Science and Technology (NTNU), Høgskoleringen 5, Realfagbygget, 7491Trondheim, Norway
| | - Alexander Mazein
- European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007Lyon, France
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6 Avenue du Swing, L-4367Belvaux, Luxembourg
- Russian Academy of Sciences, Institute of Cell Biophysics, 3 Institutskaya Street, Pushchino, Moscow Region, 142290, Russia
| | - Charles Auffray
- European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007Lyon, France
| |
Collapse
|
3
|
Pereira C, Mazein A, Farinha CM, Gray MA, Kunzelmann K, Ostaszewski M, Balaur I, Amaral MD, Falcao AO. CyFi-MAP: an interactive pathway-based resource for cystic fibrosis. Sci Rep 2021; 11:22223. [PMID: 34782688 PMCID: PMC8592983 DOI: 10.1038/s41598-021-01618-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 10/27/2021] [Indexed: 12/11/2022] Open
Abstract
Cystic fibrosis (CF) is a life-threatening autosomal recessive disease caused by more than 2100 mutations in the CF transmembrane conductance regulator (CFTR) gene, generating variability in disease severity among individuals with CF sharing the same CFTR genotype. Systems biology can assist in the collection and visualization of CF data to extract additional biological significance and find novel therapeutic targets. Here, we present the CyFi-MAP-a disease map repository of CFTR molecular mechanisms and pathways involved in CF. Specifically, we represented the wild-type (wt-CFTR) and the F508del associated processes (F508del-CFTR) in separate submaps, with pathways related to protein biosynthesis, endoplasmic reticulum retention, export, activation/inactivation of channel function, and recycling/degradation after endocytosis. CyFi-MAP is an open-access resource with specific, curated and continuously updated information on CFTR-related pathways available online at https://cysticfibrosismap.github.io/ . This tool was developed as a reference CF pathway data repository to be continuously updated and used worldwide in CF research.
Collapse
Affiliation(s)
- Catarina Pereira
- Faculty of Sciences, BioISI-Biosystems Integrative Sciences Institute, University of Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
- LASIGE, Faculty of Sciences, University of Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | - Alexander Mazein
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg
- CIRI UMR5308, CNRS-ENS-UCBL-INSERM, European Institute for Systems Biology and Medicine, Université de Lyon, 50 Avenue Tony Garnier, 69007, Lyon, France
| | - Carlos M Farinha
- Faculty of Sciences, BioISI-Biosystems Integrative Sciences Institute, University of Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | - Michael A Gray
- Biosciences Institute, University Medical School, Newcastle University, Framlington Place, Newcastle upon Tyne, NE2 4HH, UK
| | | | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg
| | - Irina Balaur
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg
- CIRI UMR5308, CNRS-ENS-UCBL-INSERM, European Institute for Systems Biology and Medicine, Université de Lyon, 50 Avenue Tony Garnier, 69007, Lyon, France
| | - Margarida D Amaral
- Faculty of Sciences, BioISI-Biosystems Integrative Sciences Institute, University of Lisboa, Campo Grande, 1749-016, Lisbon, Portugal
| | - Andre O Falcao
- Faculty of Sciences, BioISI-Biosystems Integrative Sciences Institute, University of Lisboa, Campo Grande, 1749-016, Lisbon, Portugal.
- LASIGE, Faculty of Sciences, University of Lisboa, Campo Grande, 1749-016, Lisbon, Portugal.
| |
Collapse
|
4
|
Nielsen SS, Ostaszewski M, McGee F, Hoksza D, Zorzan S. Machine Learning to Support the Presentation of Complex Pathway Graphs. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1130-1141. [PMID: 31484128 DOI: 10.1109/tcbb.2019.2938501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Visualization of biological mechanisms by means of pathway graphs is necessary to better understand the often complex underlying system. Manual layout of such pathways or maps of knowledge is a difficult and time consuming process. Node duplication is a technique that makes layouts with improved readability possible by reducing edge crossings and shortening edge lengths in drawn diagrams. In this article, we propose an approach using Machine Learning (ML) to facilitate parts of this task by training a Support Vector Machine (SVM) with actions taken during manual biocuration. Our training input is a series of incremental snapshots of a diagram describing mechanisms of a disease, progressively curated by a human expert employing node duplication in the process. As a test of the trained SVM models, they are applied to a single large instance and 25 medium-sized instances of hand-curated biological pathways. Finally, in a user validation study, we compare the model predictions to the outcome of a node duplication questionnaire answered by users of biological pathways with varying experience. We successfully predicted nodes for duplication and emulated human choices, demonstrating that our approach can effectively learn human-like node duplication preferences to support curation of pathway diagrams in various contexts.
Collapse
|
5
|
Touré V, Dräger A, Luna A, Dogrusoz U, Rougny A. The Systems Biology Graphical Notation: Current Status and Applications in Systems Medicine. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11515-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
6
|
Hoksza D, Gawron P, Ostaszewski M, Hasenauer J, Schneider R. Closing the gap between formats for storing layout information in systems biology. Brief Bioinform 2020; 21:1249-1260. [PMID: 31273380 PMCID: PMC7373180 DOI: 10.1093/bib/bbz067] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Revised: 04/23/2019] [Accepted: 05/14/2019] [Indexed: 11/13/2022] Open
Abstract
The understanding of complex biological networks often relies on both a dedicated layout and a topology. Currently, there are three major competing layout-aware systems biology formats, but there are no software tools or software libraries supporting all of them. This complicates the management of molecular network layouts and hinders their reuse and extension. In this paper, we present a high-level overview of the layout formats in systems biology, focusing on their commonalities and differences, review their support in existing software tools, libraries and repositories and finally introduce a new conversion module within the MINERVA platform. The module is available via a REST API and offers, besides the ability to convert between layout-aware systems biology formats, the possibility to export layouts into several graphical formats. The module enables conversion of very large networks with thousands of elements, such as disease maps or metabolic reconstructions, rendering it widely applicable in systems biology.
Collapse
Affiliation(s)
- David Hoksza
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6, avenue du Swing L-4367 Belvaux, Luxembourg
- Faculty of Mathematics and Physics, Charles University, Malostranské nám. 25, 118 00 Prague, Czech Republic
| | - Piotr Gawron
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6, avenue du Swing L-4367 Belvaux, Luxembourg
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6, avenue du Swing L-4367 Belvaux, Luxembourg
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München - German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764, Neuherberg, Germany
- Department of Mathematics, Technische Universität München, München, Germany
- Faculty of Mathematics and Natural Sciences, University of Bonn, Bonn, Germany
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6, avenue du Swing L-4367 Belvaux, Luxembourg
| |
Collapse
|