1
|
Rishan ST, Kline RJ, Rahman MS. New prospects of environmental RNA metabarcoding research in biological diversity, ecotoxicological monitoring, and detection of COVID-19: a critical review. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2024; 31:11406-11427. [PMID: 38183542 DOI: 10.1007/s11356-023-31776-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 12/26/2023] [Indexed: 01/08/2024]
Abstract
Ecosystems are multifaceted and complex systems and understanding their composition is crucial for the implementation of efficient conservation and management. Conventional approaches to biodiversity surveys can have limitations in detecting the complete range of species present. In contrast, the study of environmental RNA (eRNA) offers a non-invasive and comprehensive method for monitoring and evaluating biodiversity across different ecosystems. Similar to eDNA, the examination of genetic material found in environmental samples can identify and measure many species, including ones that pose challenges to traditional methods. However, eRNA is degraded quickly and therefore shows promise in detection of living organisms closer to their actual location than eDNA methods. This method provides a comprehensive perspective on the well-being of ecosystems, facilitating the development of focused conservation approaches to save at-risk species and uphold ecological equilibrium. Furthermore, eRNA has been recognized as a valuable method for the identification of COVID-19 in the environment, besides its established uses in biodiversity protection. The SARS-CoV-2 virus, which is accountable for the worldwide epidemic, releases RNA particles into the surrounding environment via human waste, providing insights into the feasibility of detecting it in wastewater and other samples taken from the environment. In this article, we critically reviewed the recent research activities that use the eRNA method, including its utilization in biodiversity conservation, ecological surveillance, and ecotoxicological monitoring as well as its innovative potential in identifying COVID-19. Through this review, the reader can understand the recent developments, prospects, and challenges of eRNA research in ecosystem management and biodiversity conservation.
Collapse
Affiliation(s)
- Sakib Tahmid Rishan
- Biochemistry and Molecular Biology Program, School of Integrative Biological and Chemical Sciences, University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Richard J Kline
- Biochemistry and Molecular Biology Program, School of Integrative Biological and Chemical Sciences, University of Texas Rio Grande Valley, Brownsville, TX, USA
- School of Earth, Environmental, and Marine Sciences, University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Md Saydur Rahman
- Biochemistry and Molecular Biology Program, School of Integrative Biological and Chemical Sciences, University of Texas Rio Grande Valley, Brownsville, TX, USA.
- School of Earth, Environmental, and Marine Sciences, University of Texas Rio Grande Valley, Brownsville, TX, USA.
| |
Collapse
|
2
|
Hai Y, Ma J, Yang K, Wen Y. Bayesian linear mixed model with multiple random effects for prediction analysis on high-dimensional multi-omics data. Bioinformatics 2023; 39:btad647. [PMID: 37882747 PMCID: PMC10627352 DOI: 10.1093/bioinformatics/btad647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 09/24/2023] [Accepted: 10/24/2023] [Indexed: 10/27/2023] Open
Abstract
MOTIVATION Accurate disease risk prediction is an essential step in the modern quest for precision medicine. While high-dimensional multi-omics data have provided unprecedented data resources for prediction studies, their high-dimensionality and complex inter/intra-relationships have posed significant analytical challenges. RESULTS We proposed a two-step Bayesian linear mixed model framework (TBLMM) for risk prediction analysis on multi-omics data. TBLMM models the predictive effects from multi-omics data using a hybrid of the sparsity regression and linear mixed model with multiple random effects. It can resemble the shape of the true effect size distributions and accounts for non-linear, including interaction effects, among multi-omics data via kernel fusion. It infers its parameters via a computationally efficient variational Bayes algorithm. Through extensive simulation studies and the prediction analyses on the positron emission tomography imaging outcomes using data obtained from the Alzheimer's Disease Neuroimaging Initiative, we have demonstrated that TBLMM can consistently outperform the existing method in predicting the risk of complex traits. AVAILABILITY AND IMPLEMENTATION The corresponding R package is available on GitHub (https://github.com/YaluWen/TBLMM).
Collapse
Affiliation(s)
- Yang Hai
- Department of Health Statistics, Shanxi Medical University, Taiyuan, Shanxi Province 030000, China
- Department of Statistics, University of Auckland, Auckland 1010, New Zealand
| | - Jixiang Ma
- Department of Health Statistics, Shanxi Medical University, Taiyuan, Shanxi Province 030000, China
| | - Kaixin Yang
- Department of Health Statistics, Shanxi Medical University, Taiyuan, Shanxi Province 030000, China
| | - Yalu Wen
- Department of Health Statistics, Shanxi Medical University, Taiyuan, Shanxi Province 030000, China
- Department of Statistics, University of Auckland, Auckland 1010, New Zealand
| |
Collapse
|
3
|
Scott-Boyer MP, Dufour P, Belleau F, Ongaro-Carcy R, Plessis C, Périn O, Droit A. Use of Elasticsearch-based business intelligence tools for integration and visualization of biological data. Brief Bioinform 2023; 24:bbad348. [PMID: 37798252 DOI: 10.1093/bib/bbad348] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 07/23/2023] [Accepted: 09/14/2023] [Indexed: 10/07/2023] Open
Abstract
The emergence of massive datasets exploring the multiple levels of molecular biology has made their analysis and knowledge transfer more complex. Flexible tools to manage big biological datasets could be of great help for standardizing the usage of developed data visualizations and integration methods. Business intelligence (BI) tools have been used in many fields as exploratory tools. They have numerous connectors to link numerous data repositories with a unified graphic interface, offering an overview of data and facilitating interpretation for decision makers. BI tools could be a flexible and user-friendly way of handling molecular biological data with interactive visualizations. However, it is rather uncommon to see such tools used for the exploration of massive and complex datasets in biological fields. We believe that two main obstacles could be the reason. Firstly, we posit that the way to import data into BI tools are not compatible with biological databases. Secondly, BI tools may not be adapted to certain particularities of complex biological data, namely, the size, the variability of datasets and the availability of specialized visualizations. This paper highlights the use of five BI tools (Elastic Kibana, Siren Investigate, Microsoft Power BI, Salesforce Tableau and Apache Superset) onto which the massive data management repository engine called Elasticsearch is compatible. Four case studies will be discussed in which these BI tools were applied on biological datasets with different characteristics. We conclude that the performance of the tools depends on the complexity of the biological questions and the size of the datasets.
Collapse
Affiliation(s)
- Marie-Pier Scott-Boyer
- Centre de Recherche du CHU de Québec-Université, Laval, Université Laval, G1V 4G2, Québec, Canada
| | - Pascal Dufour
- Centre de Recherche du CHU de Québec-Université, Laval, Université Laval, G1V 4G2, Québec, Canada
| | - François Belleau
- Centre de Recherche du CHU de Québec-Université, Laval, Université Laval, G1V 4G2, Québec, Canada
| | - Regis Ongaro-Carcy
- Centre de Recherche du CHU de Québec-Université, Laval, Université Laval, G1V 4G2, Québec, Canada
- Département de Médecine Moléculaire, G1V 0A6, Québec, Canada
| | - Clément Plessis
- Centre de Recherche du CHU de Québec-Université, Laval, Université Laval, G1V 4G2, Québec, Canada
| | - Olivier Périn
- L'Oréal Advance Research, Aulnay-sous-Bois, 93600, France
| | - Arnaud Droit
- Centre de Recherche du CHU de Québec-Université, Laval, Université Laval, G1V 4G2, Québec, Canada
- Département de Médecine Moléculaire, G1V 0A6, Québec, Canada
| |
Collapse
|
4
|
Mazein A, Acencio ML, Balaur I, Rougny A, Welter D, Niarakis A, Ramirez Ardila D, Dogrusoz U, Gawron P, Satagopam V, Gu W, Kremer A, Schneider R, Ostaszewski M. A guide for developing comprehensive systems biology maps of disease mechanisms: planning, construction and maintenance. FRONTIERS IN BIOINFORMATICS 2023; 3:1197310. [PMID: 37426048 PMCID: PMC10325725 DOI: 10.3389/fbinf.2023.1197310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 06/09/2023] [Indexed: 07/11/2023] Open
Abstract
As a conceptual model of disease mechanisms, a disease map integrates available knowledge and is applied for data interpretation, predictions and hypothesis generation. It is possible to model disease mechanisms on different levels of granularity and adjust the approach to the goals of a particular project. This rich environment together with requirements for high-quality network reconstruction makes it challenging for new curators and groups to be quickly introduced to the development methods. In this review, we offer a step-by-step guide for developing a disease map within its mainstream pipeline that involves using the CellDesigner tool for creating and editing diagrams and the MINERVA Platform for online visualisation and exploration. We also describe how the Neo4j graph database environment can be used for managing and querying efficiently such a resource. For assessing the interoperability and reproducibility we apply FAIR principles.
Collapse
Affiliation(s)
- Alexander Mazein
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Marcio Luis Acencio
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Irina Balaur
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | - Danielle Welter
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Anna Niarakis
- Université Paris-Saclay, Laboratoire Européen de Recherche Pour la Polyarthrite Rhumatoïde–Genhotel, University Evry, Evry, France
- Lifeware Group, Inria Saclay-Ile de France, Palaiseau, France
| | - Diana Ramirez Ardila
- ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxemburg
| | - Ugur Dogrusoz
- Computer Engineering Department, Bilkent University, Ankara, Türkiye
| | - Piotr Gawron
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Wei Gu
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Andreas Kremer
- ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxemburg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| |
Collapse
|
5
|
Touré V, Krauss P, Gnodtke K, Buchhorn J, Unni D, Horki P, Raisaro JL, Kalt K, Teixeira D, Crameri K, Österle S. FAIRification of health-related data using semantic web technologies in the Swiss Personalized Health Network. Sci Data 2023; 10:127. [PMID: 36899064 PMCID: PMC10006404 DOI: 10.1038/s41597-023-02028-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 02/17/2023] [Indexed: 03/12/2023] Open
Abstract
The Swiss Personalized Health Network (SPHN) is a government-funded initiative developing federated infrastructures for a responsible and efficient secondary use of health data for research purposes in compliance with the FAIR principles (Findable, Accessible, Interoperable and Reusable). We built a common standard infrastructure with a fit-for-purpose strategy to bring together health-related data and ease the work of both data providers to supply data in a standard manner and researchers by enhancing the quality of the collected data. As a result, the SPHN Resource Description Framework (RDF) schema was implemented together with a data ecosystem that encompasses data integration, validation tools, analysis helpers, training and documentation for representing health metadata and data in a consistent manner and reaching nationwide data interoperability goals. Data providers can now efficiently deliver several types of health data in a standardised and interoperable way while a high degree of flexibility is granted for the various demands of individual research projects. Researchers in Switzerland have access to FAIR health data for further use in RDF triplestores.
Collapse
Affiliation(s)
- Vasundra Touré
- Personalized Health Informatics Group, SIB Swiss Institute of Bioinformatics, 4051, Basel, Switzerland
| | - Philip Krauss
- Trivadis - Part of Accenture, 4051, Basel, Switzerland
| | - Kristin Gnodtke
- Personalized Health Informatics Group, SIB Swiss Institute of Bioinformatics, 4051, Basel, Switzerland
| | | | - Deepak Unni
- Personalized Health Informatics Group, SIB Swiss Institute of Bioinformatics, 4051, Basel, Switzerland
| | - Petar Horki
- Personalized Health Informatics Group, SIB Swiss Institute of Bioinformatics, 4051, Basel, Switzerland
| | - Jean Louis Raisaro
- Health Informatics and Data Privacy Group, Biomedical Data Science Center, 1010 Lausanne University Hospital, Lausanne, Switzerland
| | - Katie Kalt
- Clinical Data Platform Research, Directorate of Research and Education, Zurich University Hospital, 8091, Zurich, Switzerland
| | - Daniel Teixeira
- DSI - Data Group, Geneva University Hospital, 1205, Geneva, Switzerland
| | - Katrin Crameri
- Personalized Health Informatics Group, SIB Swiss Institute of Bioinformatics, 4051, Basel, Switzerland
| | - Sabine Österle
- Personalized Health Informatics Group, SIB Swiss Institute of Bioinformatics, 4051, Basel, Switzerland.
| |
Collapse
|
6
|
Capobianco E, Dominietto M. Translating Data Science Results into Precision Oncology Decisions: A Mini Review. J Clin Med 2023; 12:438. [PMID: 36675367 PMCID: PMC9862106 DOI: 10.3390/jcm12020438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 12/29/2022] [Accepted: 01/03/2023] [Indexed: 01/06/2023] Open
Abstract
While reviewing and discussing the potential of data science in oncology, we emphasize medical imaging and radiomics as the leading contextual frameworks to measure the impacts of Artificial Intelligence (AI) and Machine Learning (ML) developments. We envision some domains and research directions in which radiomics should become more significant in view of current barriers and limitations.
Collapse
Affiliation(s)
- Enrico Capobianco
- The Jackson Laboratory, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Marco Dominietto
- Paul Scherrer Institut, Forschungsstrasse 111, 5232 Villigen, Switzerland
| |
Collapse
|
7
|
Martins dos Santos V, Anton M, Szomolay B, Ostaszewski M, Arts I, Benfeitas R, Dominguez Del Angel V, Domínguez-Romero E, Ferk P, Fey D, Goble C, Golebiewski M, Gruden K, Heil KF, Hermjakob H, Kahlem P, Klapa MI, Koehorst J, Kolodkin A, Kutmon M, Leskošek B, Moretti S, Müller W, Pagni M, Rezen T, Rocha M, Rozman D, Šafránek D, T. Scott W, Sheriff RSM, Suarez Diez M, Van Steen K, Westerhoff HV, Wittig U, Wolstencroft K, Zupanic A, Evelo CT, Hancock JM. Systems Biology in ELIXIR: modelling in the spotlight. F1000Res 2022; 11:ELIXIR-1265. [PMID: 36742342 PMCID: PMC9871403 DOI: 10.12688/f1000research.126734.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/24/2022] [Indexed: 11/09/2022] Open
Abstract
In this white paper, we describe the founding of a new ELIXIR Community - the Systems Biology Community - and its proposed future contributions to both ELIXIR and the broader community of systems biologists in Europe and worldwide. The Community believes that the infrastructure aspects of systems biology - databases, (modelling) tools and standards development, as well as training and access to cloud infrastructure - are not only appropriate components of the ELIXIR infrastructure, but will prove key components of ELIXIR's future support of advanced biological applications and personalised medicine. By way of a series of meetings, the Community identified seven key areas for its future activities, reflecting both future needs and previous and current activities within ELIXIR Platforms and Communities. These are: overcoming barriers to the wider uptake of systems biology; linking new and existing data to systems biology models; interoperability of systems biology resources; further development and embedding of systems medicine; provisioning of modelling as a service; building and coordinating capacity building and training resources; and supporting industrial embedding of systems biology. A set of objectives for the Community has been identified under four main headline areas: Standardisation and Interoperability, Technology, Capacity Building and Training, and Industrial Embedding. These are grouped into short-term (3-year), mid-term (6-year) and long-term (10-year) objectives.
Collapse
Affiliation(s)
- Vitor Martins dos Santos
- Laboratory of Bioprocess Engineering, Wageningen University & Research, Wageningen, 6708 PB, The Netherlands
| | - Mihail Anton
- Department of Biology and Biological Engineering, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, Gothenburg, SE-41258, Sweden
| | - Barbara Szomolay
- Division of Infection and Immunity, School of Medicine, Cardiff University, Cardiff, UK
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Belvaux, L-4367, Luxembourg
| | - Ilja Arts
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Rui Benfeitas
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | | | | | - Polonca Ferk
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Dirk Fey
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, 4, Ireland
| | - Carole Goble
- Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
| | - Martin Golebiewski
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Kristina Gruden
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | | | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Pascal Kahlem
- Scientific Network Management SL, Barcelona, 08015, Spain
| | - Maria I. Klapa
- Metabolic Engineering & Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research & Technology - Hellas (FORTH/ICE-HT), Patras, 26504, Greece
| | - Jasper Koehorst
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Alexey Kolodkin
- Competence Center for Methodology and Statistics; Transversal Translational Medicine, Translational Medicine Operations Hub, Luxembourg Institute of Health, Strassen, L-1445, Luxembourg
- ISBE.NL, VU University of Amsterdam, Amsterdam, The Netherlands
| | - Martina Kutmon
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Brane Leskošek
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | | | - Wolfgang Müller
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Marco Pagni
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Tadeja Rezen
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Damjana Rozman
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - David Šafránek
- Faculty of Informatics, Masaryk University, Brno, 602 00, Czech Republic
| | - William T. Scott
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
- UNLOCK, Wageningen University & Research, 6708 PB Wageningen, The Netherlands
| | - Rahuman S. Malik Sheriff
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Maria Suarez Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Kristel Van Steen
- BIO3 - Laboratory for Systems Medicine, Department of Human Genetics, KU Leuven, Leuven, 3000, Belgium
- BIO3 - Systems Genetics, GIGA-R Medical Genomics, University of Liege, Liege, 4000, Belgium
| | | | - Ulrike Wittig
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Katherine Wolstencroft
- Leiden Institute of Advanced Computer Science, Leiden University, Leiden, 2333 CA, The Netherlands
| | - Anze Zupanic
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | - Chris T. Evelo
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - John M. Hancock
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| |
Collapse
|
8
|
Martins dos Santos V, Anton M, Szomolay B, Ostaszewski M, Arts I, Benfeitas R, Dominguez Del Angel V, Domínguez-Romero E, Ferk P, Fey D, Goble C, Golebiewski M, Gruden K, Heil KF, Hermjakob H, Kahlem P, Klapa MI, Koehorst J, Kolodkin A, Kutmon M, Leskošek B, Moretti S, Müller W, Pagni M, Rezen T, Rocha M, Rozman D, Šafránek D, T. Scott W, Sheriff RSM, Suarez Diez M, Van Steen K, Westerhoff HV, Wittig U, Wolstencroft K, Zupanic A, Evelo CT, Hancock JM. Systems Biology in ELIXIR: modelling in the spotlight. F1000Res 2022; 11:ELIXIR-1265. [PMID: 36742342 PMCID: PMC9871403 DOI: 10.12688/f1000research.126734.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/20/2024] [Indexed: 06/05/2024] Open
Abstract
In this white paper, we describe the founding of a new ELIXIR Community - the Systems Biology Community - and its proposed future contributions to both ELIXIR and the broader community of systems biologists in Europe and worldwide. The Community believes that the infrastructure aspects of systems biology - databases, (modelling) tools and standards development, as well as training and access to cloud infrastructure - are not only appropriate components of the ELIXIR infrastructure, but will prove key components of ELIXIR's future support of advanced biological applications and personalised medicine. By way of a series of meetings, the Community identified seven key areas for its future activities, reflecting both future needs and previous and current activities within ELIXIR Platforms and Communities. These are: overcoming barriers to the wider uptake of systems biology; linking new and existing data to systems biology models; interoperability of systems biology resources; further development and embedding of systems medicine; provisioning of modelling as a service; building and coordinating capacity building and training resources; and supporting industrial embedding of systems biology. A set of objectives for the Community has been identified under four main headline areas: Standardisation and Interoperability, Technology, Capacity Building and Training, and Industrial Embedding. These are grouped into short-term (3-year), mid-term (6-year) and long-term (10-year) objectives.
Collapse
Affiliation(s)
- Vitor Martins dos Santos
- Laboratory of Bioprocess Engineering, Wageningen University & Research, Wageningen, 6708 PB, The Netherlands
| | - Mihail Anton
- Department of Biology and Biological Engineering, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, Gothenburg, SE-41258, Sweden
| | - Barbara Szomolay
- Division of Infection and Immunity, School of Medicine, Cardiff University, Cardiff, UK
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Belvaux, L-4367, Luxembourg
| | - Ilja Arts
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Rui Benfeitas
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | | | | | - Polonca Ferk
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Dirk Fey
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, 4, Ireland
| | - Carole Goble
- Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
| | - Martin Golebiewski
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Kristina Gruden
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | | | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Pascal Kahlem
- Scientific Network Management SL, Barcelona, 08015, Spain
| | - Maria I. Klapa
- Metabolic Engineering & Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research & Technology - Hellas (FORTH/ICE-HT), Patras, 26504, Greece
| | - Jasper Koehorst
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Alexey Kolodkin
- Competence Center for Methodology and Statistics; Transversal Translational Medicine, Translational Medicine Operations Hub, Luxembourg Institute of Health, Strassen, L-1445, Luxembourg
- ISBE.NL, VU University of Amsterdam, Amsterdam, The Netherlands
| | - Martina Kutmon
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Brane Leskošek
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | | | - Wolfgang Müller
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Marco Pagni
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Tadeja Rezen
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Damjana Rozman
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - David Šafránek
- Faculty of Informatics, Masaryk University, Brno, 602 00, Czech Republic
| | - William T. Scott
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
- UNLOCK, Wageningen University & Research, 6708 PB Wageningen, The Netherlands
| | - Rahuman S. Malik Sheriff
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Maria Suarez Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Kristel Van Steen
- BIO3 - Laboratory for Systems Medicine, Department of Human Genetics, KU Leuven, Leuven, 3000, Belgium
- BIO3 - Systems Genetics, GIGA-R Medical Genomics, University of Liege, Liege, 4000, Belgium
| | | | - Ulrike Wittig
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Katherine Wolstencroft
- Leiden Institute of Advanced Computer Science, Leiden University, Leiden, 2333 CA, The Netherlands
| | - Anze Zupanic
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | - Chris T. Evelo
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - John M. Hancock
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| |
Collapse
|
9
|
Ostaszewski M, Niarakis A, Mazein A, Kuperstein I, Phair R, Orta‐Resendiz A, Singh V, Aghamiri SS, Acencio ML, Glaab E, Ruepp A, Fobo G, Montrone C, Brauner B, Frishman G, Monraz Gómez LC, Somers J, Hoch M, Kumar Gupta S, Scheel J, Borlinghaus H, Czauderna T, Schreiber F, Montagud A, Ponce de Leon M, Funahashi A, Hiki Y, Hiroi N, Yamada TG, Dräger A, Renz A, Naveez M, Bocskei Z, Messina F, Börnigen D, Fergusson L, Conti M, Rameil M, Nakonecnij V, Vanhoefer J, Schmiester L, Wang M, Ackerman EE, Shoemaker JE, Zucker J, Oxford K, Teuton J, Kocakaya E, Summak GY, Hanspers K, Kutmon M, Coort S, Eijssen L, Ehrhart F, Rex DAB, Slenter D, Martens M, Pham N, Haw R, Jassal B, Matthews L, Orlic‐Milacic M, Senff Ribeiro A, Rothfels K, Shamovsky V, Stephan R, Sevilla C, Varusai T, Ravel J, Fraser R, Ortseifen V, Marchesi S, Gawron P, Smula E, Heirendt L, Satagopam V, Wu G, Riutta A, Golebiewski M, Owen S, Goble C, Hu X, Overall RW, Maier D, Bauch A, Gyori BM, Bachman JA, Vega C, Grouès V, Vazquez M, Porras P, Licata L, Iannuccelli M, Sacco F, Nesterova A, Yuryev A, de Waard A, Turei D, Luna A, Babur O, Soliman S, Valdeolivas A, Esteban‐Medina M, Peña‐Chilet M, Rian K, Helikar T, Puniya BL, Modos D, Treveil A, Olbei M, De Meulder B, Ballereau S, Dugourd A, Naldi A, Noël V, Calzone L, Sander C, Demir E, Korcsmaros T, Freeman TC, Augé F, Beckmann JS, Hasenauer J, Wolkenhauer O, Wilighagen EL, Pico AR, Evelo CT, Gillespie ME, Stein LD, Hermjakob H, D'Eustachio P, Saez‐Rodriguez J, Dopazo J, Valencia A, Kitano H, Barillot E, Auffray C, Balling R, Schneider R. COVID19 Disease Map, a computational knowledge repository of virus-host interaction mechanisms. Mol Syst Biol 2021; 17:e10387. [PMID: 34664389 PMCID: PMC8524328 DOI: 10.15252/msb.202110387] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Revised: 08/25/2021] [Accepted: 08/26/2021] [Indexed: 12/13/2022] Open
Abstract
We need to effectively combine the knowledge from surging literature with complex datasets to propose mechanistic models of SARS-CoV-2 infection, improving data interpretation and predicting key targets of intervention. Here, we describe a large-scale community effort to build an open access, interoperable and computable repository of COVID-19 molecular mechanisms. The COVID-19 Disease Map (C19DMap) is a graphical, interactive representation of disease-relevant molecular mechanisms linking many knowledge sources. Notably, it is a computational resource for graph-based analyses and disease modelling. To this end, we established a framework of tools, platforms and guidelines necessary for a multifaceted community of biocurators, domain experts, bioinformaticians and computational biologists. The diagrams of the C19DMap, curated from the literature, are integrated with relevant interaction and text mining databases. We demonstrate the application of network analysis and modelling approaches by concrete examples to highlight new testable hypotheses. This framework helps to find signatures of SARS-CoV-2 predisposition, treatment response or prioritisation of drug candidates. Such an approach may help deal with new waves of COVID-19 or similar pandemics in the long-term perspective.
Collapse
Affiliation(s)
- Marek Ostaszewski
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Anna Niarakis
- Université Paris‐SaclayLaboratoire Européen de Recherche pour la Polyarthrite rhumatoïde ‐ GenhotelUniv EvryEvryFrance
- Lifeware GroupInria Saclay‐Ile de FrancePalaiseauFrance
| | - Alexander Mazein
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Inna Kuperstein
- Institut CuriePSL Research UniversityParisFrance
- INSERMParisFrance
- MINES ParisTechPSL Research UniversityParisFrance
| | - Robert Phair
- Integrative Bioinformatics, Inc.Mountain ViewCAUSA
| | - Aurelio Orta‐Resendiz
- Institut PasteurUniversité de Paris, Unité HIVInflammation et PersistanceParisFrance
- Bio Sorbonne Paris CitéUniversité de ParisParisFrance
| | - Vidisha Singh
- Université Paris‐SaclayLaboratoire Européen de Recherche pour la Polyarthrite rhumatoïde ‐ GenhotelUniv EvryEvryFrance
| | - Sara Sadat Aghamiri
- Inserm‐ Institut national de la santé et de la recherche médicaleParisFrance
| | - Marcio Luis Acencio
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Enrico Glaab
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Andreas Ruepp
- Institute of Experimental Genetics (IEG)Helmholtz Zentrum München‐German Research Center for Environmental Health (GmbH)NeuherbergGermany
| | - Gisela Fobo
- Institute of Experimental Genetics (IEG)Helmholtz Zentrum München‐German Research Center for Environmental Health (GmbH)NeuherbergGermany
| | - Corinna Montrone
- Institute of Experimental Genetics (IEG)Helmholtz Zentrum München‐German Research Center for Environmental Health (GmbH)NeuherbergGermany
| | - Barbara Brauner
- Institute of Experimental Genetics (IEG)Helmholtz Zentrum München‐German Research Center for Environmental Health (GmbH)NeuherbergGermany
| | - Goar Frishman
- Institute of Experimental Genetics (IEG)Helmholtz Zentrum München‐German Research Center for Environmental Health (GmbH)NeuherbergGermany
| | - Luis Cristóbal Monraz Gómez
- Institut CuriePSL Research UniversityParisFrance
- INSERMParisFrance
- MINES ParisTechPSL Research UniversityParisFrance
| | - Julia Somers
- Department of Molecular and Medical GeneticsOregon Health & Sciences UniversityPortlandORUSA
| | - Matti Hoch
- Department of Systems Biology and BioinformaticsUniversity of RostockRostockGermany
| | | | - Julia Scheel
- Department of Systems Biology and BioinformaticsUniversity of RostockRostockGermany
| | - Hanna Borlinghaus
- Department of Computer and Information ScienceUniversity of KonstanzKonstanzGermany
| | - Tobias Czauderna
- Faculty of Information TechnologyDepartment of Human‐Centred ComputingMonash UniversityClaytonVic.Australia
| | - Falk Schreiber
- Department of Computer and Information ScienceUniversity of KonstanzKonstanzGermany
- Faculty of Information TechnologyDepartment of Human‐Centred ComputingMonash UniversityClaytonVic.Australia
| | | | | | - Akira Funahashi
- Department of Biosciences and InformaticsKeio UniversityYokohamaJapan
| | - Yusuke Hiki
- Department of Biosciences and InformaticsKeio UniversityYokohamaJapan
| | - Noriko Hiroi
- Graduate School of Media and GovernanceResearch Institute at SFCKeio UniversityKanagawaJapan
| | - Takahiro G Yamada
- Department of Biosciences and InformaticsKeio UniversityYokohamaJapan
| | - Andreas Dräger
- Computational Systems Biology of Infections and Antimicrobial‐Resistant PathogensInstitute for Bioinformatics and Medical Informatics (IBMI)University of TübingenTübingenGermany
- Department of Computer ScienceUniversity of TübingenTübingenGermany
- German Center for Infection Research (DZIF), partner siteTübingenGermany
| | - Alina Renz
- Computational Systems Biology of Infections and Antimicrobial‐Resistant PathogensInstitute for Bioinformatics and Medical Informatics (IBMI)University of TübingenTübingenGermany
- Department of Computer ScienceUniversity of TübingenTübingenGermany
| | - Muhammad Naveez
- Department of Systems Biology and BioinformaticsUniversity of RostockRostockGermany
- Institute of Applied Computer SystemsRiga Technical UniversityRigaLatvia
| | - Zsolt Bocskei
- Sanofi R&DTranslational SciencesChilly‐MazarinFrance
| | - Francesco Messina
- Dipartimento di Epidemiologia Ricerca Pre‐Clinica e Diagnostica AvanzataNational Institute for Infectious Diseases 'Lazzaro Spallanzani' I.R.C.C.S.RomeItaly
- COVID‐19 INMI Network Medicine for IDs Study GroupNational Institute for Infectious Diseases 'Lazzaro Spallanzani' I.R.C.C.SRomeItaly
| | - Daniela Börnigen
- Bioinformatics Core FacilityUniversitätsklinikum Hamburg‐EppendorfHamburgGermany
| | - Liam Fergusson
- Royal (Dick) School of Veterinary MedicineThe University of EdinburghEdinburghUK
| | - Marta Conti
- Faculty of Mathematics and Natural SciencesUniversity of BonnBonnGermany
| | - Marius Rameil
- Faculty of Mathematics and Natural SciencesUniversity of BonnBonnGermany
| | - Vanessa Nakonecnij
- Faculty of Mathematics and Natural SciencesUniversity of BonnBonnGermany
| | - Jakob Vanhoefer
- Faculty of Mathematics and Natural SciencesUniversity of BonnBonnGermany
| | - Leonard Schmiester
- Faculty of Mathematics and Natural SciencesUniversity of BonnBonnGermany
- Center for MathematicsChair of Mathematical Modeling of Biological SystemsTechnische Universität MünchenGarchingGermany
| | - Muying Wang
- Department of Chemical and Petroleum EngineeringUniversity of PittsburghPittsburghPAUSA
| | - Emily E Ackerman
- Department of Chemical and Petroleum EngineeringUniversity of PittsburghPittsburghPAUSA
| | - Jason E Shoemaker
- Department of Chemical and Petroleum EngineeringUniversity of PittsburghPittsburghPAUSA
- Department of Computational and Systems BiologyUniversity of PittsburghPittsburghPAUSA
| | | | | | | | | | | | - Kristina Hanspers
- Institute of Data Science and BiotechnologyGladstone InstitutesSan FranciscoCAUSA
| | - Martina Kutmon
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
- Maastricht Centre for Systems Biology (MaCSBio)Maastricht UniversityMaastrichtThe Netherlands
| | - Susan Coort
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
| | - Lars Eijssen
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
- Maastricht University Medical CentreMaastrichtThe Netherlands
| | - Friederike Ehrhart
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
- Maastricht University Medical CentreMaastrichtThe Netherlands
| | | | - Denise Slenter
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
| | - Marvin Martens
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
| | - Nhung Pham
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
| | - Robin Haw
- MaRS CentreOntario Institute for Cancer ResearchTorontoONCanada
| | - Bijay Jassal
- MaRS CentreOntario Institute for Cancer ResearchTorontoONCanada
| | | | | | - Andrea Senff Ribeiro
- MaRS CentreOntario Institute for Cancer ResearchTorontoONCanada
- Universidade Federal do ParanáCuritibaBrasil
| | - Karen Rothfels
- MaRS CentreOntario Institute for Cancer ResearchTorontoONCanada
| | | | - Ralf Stephan
- MaRS CentreOntario Institute for Cancer ResearchTorontoONCanada
| | - Cristoffer Sevilla
- European Bioinformatics Institute (EMBL‐EBI)European Molecular Biology LaboratoryHinxton, CambridgeshireUK
| | - Thawfeek Varusai
- European Bioinformatics Institute (EMBL‐EBI)European Molecular Biology LaboratoryHinxton, CambridgeshireUK
| | - Jean‐Marie Ravel
- INSERM UMR_S 1256Nutrition, Genetics, and Environmental Risk Exposure (NGERE)Faculty of Medicine of NancyUniversity of LorraineNancyFrance
- Laboratoire de génétique médicaleCHRU NancyNancyFrance
| | - Rupsha Fraser
- Queen's Medical Research InstituteThe University of EdinburghEdinburghUK
| | - Vera Ortseifen
- Senior Research Group in Genome Research of Industrial MicroorganismsCenter for BiotechnologyBielefeld UniversityBielefeldGermany
| | - Silvia Marchesi
- Department of Surgical ScienceUppsala UniversityUppsalaSweden
| | - Piotr Gawron
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
- Institute of Computing SciencePoznan University of TechnologyPoznanPoland
| | - Ewa Smula
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Laurent Heirendt
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Guanming Wu
- Department of Medical Informatics and Clinical EpidemiologyOregon Health & Science UniversityPortlandORUSA
| | - Anders Riutta
- Institute of Data Science and BiotechnologyGladstone InstitutesSan FranciscoCAUSA
| | | | - Stuart Owen
- Department of Computer ScienceThe University of ManchesterManchesterUK
| | - Carole Goble
- Department of Computer ScienceThe University of ManchesterManchesterUK
| | - Xiaoming Hu
- Heidelberg Institute for Theoretical Studies (HITS)HeidelbergGermany
| | - Rupert W Overall
- German Center for Neurodegenerative Diseases (DZNE) DresdenDresdenGermany
- Center for Regenerative Therapies Dresden (CRTD)Technische Universität DresdenDresdenGermany
- Institute for BiologyHumboldt University of BerlinBerlinGermany
| | | | | | - Benjamin M Gyori
- Harvard Medical SchoolLaboratory of Systems PharmacologyBostonMAUSA
| | - John A Bachman
- Harvard Medical SchoolLaboratory of Systems PharmacologyBostonMAUSA
| | - Carlos Vega
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Valentin Grouès
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | | | - Pablo Porras
- European Bioinformatics Institute (EMBL‐EBI)European Molecular Biology LaboratoryHinxton, CambridgeshireUK
| | - Luana Licata
- Department of BiologyUniversity of Rome Tor VergataRomeItaly
| | | | - Francesca Sacco
- Department of BiologyUniversity of Rome Tor VergataRomeItaly
| | | | | | | | - Denes Turei
- Institute for Computational BiomedicineHeidelberg UniversityHeidelbergGermany
| | - Augustin Luna
- cBio Center, Divisions of Biostatistics and Computational BiologyDepartment of Data SciencesDana‐Farber Cancer InstituteBostonMAUSA
- Department of Cell BiologyHarvard Medical SchoolBostonMAUSA
| | - Ozgun Babur
- Computer Science DepartmentUniversity of Massachusetts BostonBostonMAUSA
| | | | - Alberto Valdeolivas
- Institute for Computational BiomedicineHeidelberg UniversityHeidelbergGermany
| | - Marina Esteban‐Medina
- Clinical Bioinformatics AreaFundación Progreso y Salud (FPS)Hospital Virgen del RocioSevillaSpain
- Computational Systems Medicine GroupInstitute of Biomedicine of Seville (IBIS)Hospital Virgen del RocioSevillaSpain
| | - Maria Peña‐Chilet
- Clinical Bioinformatics AreaFundación Progreso y Salud (FPS)Hospital Virgen del RocioSevillaSpain
- Computational Systems Medicine GroupInstitute of Biomedicine of Seville (IBIS)Hospital Virgen del RocioSevillaSpain
- Bioinformatics in Rare Diseases (BiER)Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER)FPS, Hospital Virgen del RocíoSevillaSpain
| | - Kinza Rian
- Clinical Bioinformatics AreaFundación Progreso y Salud (FPS)Hospital Virgen del RocioSevillaSpain
- Computational Systems Medicine GroupInstitute of Biomedicine of Seville (IBIS)Hospital Virgen del RocioSevillaSpain
| | - Tomáš Helikar
- Department of BiochemistryUniversity of Nebraska‐LincolnLincolnNEUSA
| | | | - Dezso Modos
- Quadram Institute BioscienceNorwichUK
- Earlham InstituteNorwichUK
| | - Agatha Treveil
- Quadram Institute BioscienceNorwichUK
- Earlham InstituteNorwichUK
| | - Marton Olbei
- Quadram Institute BioscienceNorwichUK
- Earlham InstituteNorwichUK
| | | | - Stephane Ballereau
- Cancer Research UK Cambridge InstituteUniversity of CambridgeCambridgeUK
| | - Aurélien Dugourd
- Institute for Computational BiomedicineHeidelberg UniversityHeidelbergGermany
- Institute of Experimental Medicine and Systems BiologyFaculty of Medicine, RWTHAachen UniversityAachenGermany
| | | | - Vincent Noël
- Institut CuriePSL Research UniversityParisFrance
- INSERMParisFrance
- MINES ParisTechPSL Research UniversityParisFrance
| | - Laurence Calzone
- Institut CuriePSL Research UniversityParisFrance
- INSERMParisFrance
- MINES ParisTechPSL Research UniversityParisFrance
| | - Chris Sander
- cBio Center, Divisions of Biostatistics and Computational BiologyDepartment of Data SciencesDana‐Farber Cancer InstituteBostonMAUSA
- Department of Cell BiologyHarvard Medical SchoolBostonMAUSA
| | - Emek Demir
- Department of Molecular and Medical GeneticsOregon Health & Sciences UniversityPortlandORUSA
| | | | - Tom C Freeman
- The Roslin InstituteUniversity of EdinburghEdinburghUK
| | - Franck Augé
- Sanofi R&DTranslational SciencesChilly‐MazarinFrance
| | | | - Jan Hasenauer
- Helmholtz Zentrum München – German Research Center for Environmental HealthInstitute of Computational BiologyNeuherbergGermany
- Interdisciplinary Research Unit Mathematics and Life SciencesUniversity of BonnBonnGermany
| | - Olaf Wolkenhauer
- Department of Systems Biology and BioinformaticsUniversity of RostockRostockGermany
| | - Egon L Wilighagen
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
| | - Alexander R Pico
- Institute of Data Science and BiotechnologyGladstone InstitutesSan FranciscoCAUSA
| | - Chris T Evelo
- Department of Bioinformatics ‐ BiGCaTNUTRIMMaastricht UniversityMaastrichtThe Netherlands
- Maastricht Centre for Systems Biology (MaCSBio)Maastricht UniversityMaastrichtThe Netherlands
| | - Marc E Gillespie
- MaRS CentreOntario Institute for Cancer ResearchTorontoONCanada
- St. John’s University College of Pharmacy and Health SciencesQueensNYUSA
| | - Lincoln D Stein
- MaRS CentreOntario Institute for Cancer ResearchTorontoONCanada
- Department of Molecular GeneticsUniversity of TorontoTorontoONCanada
| | - Henning Hermjakob
- European Bioinformatics Institute (EMBL‐EBI)European Molecular Biology LaboratoryHinxton, CambridgeshireUK
| | | | | | - Joaquin Dopazo
- Clinical Bioinformatics AreaFundación Progreso y Salud (FPS)Hospital Virgen del RocioSevillaSpain
- Computational Systems Medicine GroupInstitute of Biomedicine of Seville (IBIS)Hospital Virgen del RocioSevillaSpain
- Bioinformatics in Rare Diseases (BiER)Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER)FPS, Hospital Virgen del RocíoSevillaSpain
- FPS/ELIXIR‐esHospital Virgen del RocíoSevillaSpain
| | - Alfonso Valencia
- Barcelona Supercomputing Center (BSC)BarcelonaSpain
- Institució Catalana de Recerca i Estudis Avançats (ICREA)BarcelonaSpain
| | - Hiroaki Kitano
- Systems Biology InstituteTokyoJapan
- Okinawa Institute of Science and Technology Graduate SchoolOkinawaJapan
| | - Emmanuel Barillot
- Institut CuriePSL Research UniversityParisFrance
- INSERMParisFrance
- MINES ParisTechPSL Research UniversityParisFrance
| | - Charles Auffray
- Cancer Research UK Cambridge InstituteUniversity of CambridgeCambridgeUK
| | - Rudi Balling
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems BiomedicineUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | | |
Collapse
|
10
|
Amirmahani F, Ebrahimi N, Molaei F, Faghihkhorasani F, Jamshidi Goharrizi K, Mirtaghi SM, Borjian‐Boroujeni M, Hamblin MR. Approaches for the integration of big data in translational medicine: single‐cell and computational methods. Ann N Y Acad Sci 2021; 1493:3-28. [DOI: 10.1111/nyas.14544] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 10/31/2020] [Accepted: 11/12/2020] [Indexed: 12/11/2022]
Affiliation(s)
- Farzane Amirmahani
- Genetics Division, Department of Cell and Molecular Biology and Microbiology, Faculty of Science and Technology University of Isfahan Isfahan Iran
| | - Nasim Ebrahimi
- Genetics Division, Department of Cell and Molecular Biology and Microbiology, Faculty of Science and Technology University of Isfahan Isfahan Iran
| | - Fatemeh Molaei
- Department of Anesthesiology, Faculty of Paramedical Jahrom University of Medical Sciences Jahrom Iran
| | | | | | | | | | - Michael R. Hamblin
- Laser Research Centre, Faculty of Health Science University of Johannesburg South Africa
| |
Collapse
|
11
|
Khan IH, Javaid M. Big Data Applications in Medical Field: A Literature Review. JOURNAL OF INDUSTRIAL INTEGRATION AND MANAGEMENT 2020. [DOI: 10.1142/s242486222030001x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Digital imaging and medical reporting have acquired an essential role in healthcare, but the main challenge is the storage of a high volume of patient data. Although newer technologies are already introduced in the medical sciences to save records size, Big Data provides advancements by storing a large amount of data to improve the efficiency and quality of patient treatment with better care. It provides intelligent automation capabilities to reduce errors than manual inputs. Large numbers of research papers on big data in the medical field are studied and analyzed for their impacts, benefits, and applications. Big data has great potential to support the digitalization of all medical and clinical records and then save the entire data regarding the medical history of an individual or a group. This paper discusses big data usage for various industries and sectors. Finally, 12 significant applications for the medical field by the implementation of big data are identified and studied with a brief description. This technology can be gainfully used to extract useful information from the available data by analyzing and managing them through a combination of hardware and software. With technological advancement, big data provides health-related information for millions of patient-related to life issues such as lab tests reporting, clinical narratives, demographics, prescription, medical diagnosis, and related documentation. Thus, Big Data is essential in developing a better yet efficient analysis and storage healthcare services. The demand for big data applications is increasing due to its capability of handling and analyzing massive data. Not only in the future but even now, Big Data is proving itself as an axiom of storing, developing, analyzing, and providing overall health information to the physicians.
Collapse
Affiliation(s)
- Ibrahim Haleem Khan
- School of Engineering Sciences and Technology, Jamia Hamdard, New Delhi, India
| | - Mohd Javaid
- Department of Mechanical Engineering, Jamia Millia Islamia, New Delhi, India
| |
Collapse
|
12
|
Hendrickx JO, van Gastel J, Leysen H, Martin B, Maudsley S. High-dimensionality Data Analysis of Pharmacological Systems Associated with Complex Diseases. Pharmacol Rev 2020; 72:191-217. [PMID: 31843941 DOI: 10.1124/pr.119.017921] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
It is widely accepted that molecular reductionist views of highly complex human physiologic activity, e.g., the aging process, as well as therapeutic drug efficacy are largely oversimplifications. Currently some of the most effective appreciation of biologic disease and drug response complexity is achieved using high-dimensionality (H-D) data streams from transcriptomic, proteomic, metabolomics, or epigenomic pipelines. Multiple H-D data sets are now common and freely accessible for complex diseases such as metabolic syndrome, cardiovascular disease, and neurodegenerative conditions such as Alzheimer's disease. Over the last decade our ability to interrogate these high-dimensionality data streams has been profoundly enhanced through the development and implementation of highly effective bioinformatic platforms. Employing these computational approaches to understand the complexity of age-related diseases provides a facile mechanism to then synergize this pathologic appreciation with a similar level of understanding of therapeutic-mediated signaling. For informative pathology and drug-based analytics that are able to generate meaningful therapeutic insight across diverse data streams, novel informatics processes such as latent semantic indexing and topological data analyses will likely be important. Elucidation of H-D molecular disease signatures from diverse data streams will likely generate and refine new therapeutic strategies that will be designed with a cognizance of a realistic appreciation of the complexity of human age-related disease and drug effects. We contend that informatic platforms should be synergistic with more advanced chemical/drug and phenotypic cellular/tissue-based analytical predictive models to assist in either de novo drug prioritization or effective repurposing for the intervention of aging-related diseases. SIGNIFICANCE STATEMENT: All diseases, as well as pharmacological mechanisms, are far more complex than previously thought a decade ago. With the advent of commonplace access to technologies that produce large volumes of high-dimensionality data (e.g., transcriptomics, proteomics, metabolomics), it is now imperative that effective tools to appreciate this highly nuanced data are developed. Being able to appreciate the subtleties of high-dimensionality data will allow molecular pharmacologists to develop the most effective multidimensional therapeutics with effectively engineered efficacy profiles.
Collapse
Affiliation(s)
- Jhana O Hendrickx
- Receptor Biology Laboratory, Department of Biomedical Research (J.O.H., J.v.G., H.L., S.M.) and Faculty of Pharmacy, Biomedical and Veterinary Sciences (J.O.H., J.v.G., H.L., B.M., S.M.), University of Antwerp, Antwerp, Belgium
| | - Jaana van Gastel
- Receptor Biology Laboratory, Department of Biomedical Research (J.O.H., J.v.G., H.L., S.M.) and Faculty of Pharmacy, Biomedical and Veterinary Sciences (J.O.H., J.v.G., H.L., B.M., S.M.), University of Antwerp, Antwerp, Belgium
| | - Hanne Leysen
- Receptor Biology Laboratory, Department of Biomedical Research (J.O.H., J.v.G., H.L., S.M.) and Faculty of Pharmacy, Biomedical and Veterinary Sciences (J.O.H., J.v.G., H.L., B.M., S.M.), University of Antwerp, Antwerp, Belgium
| | - Bronwen Martin
- Receptor Biology Laboratory, Department of Biomedical Research (J.O.H., J.v.G., H.L., S.M.) and Faculty of Pharmacy, Biomedical and Veterinary Sciences (J.O.H., J.v.G., H.L., B.M., S.M.), University of Antwerp, Antwerp, Belgium
| | - Stuart Maudsley
- Receptor Biology Laboratory, Department of Biomedical Research (J.O.H., J.v.G., H.L., S.M.) and Faculty of Pharmacy, Biomedical and Veterinary Sciences (J.O.H., J.v.G., H.L., B.M., S.M.), University of Antwerp, Antwerp, Belgium
| |
Collapse
|
13
|
Ivanova O, Richards LB, Vijverberg SJ, Neerincx AH, Sinha A, Sterk PJ, Maitland‐van der Zee AH. What did we learn from multiple omics studies in asthma? Allergy 2019; 74:2129-2145. [PMID: 31004501 DOI: 10.1111/all.13833] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Revised: 03/25/2019] [Accepted: 04/12/2019] [Indexed: 12/13/2022]
Abstract
More than a decade has passed since the finalization of the Human Genome Project. Omics technologies made a huge leap from trendy and very expensive to routinely executed and relatively cheap assays. Simultaneously, we understood that omics is not a panacea for every problem in the area of human health and personalized medicine. Whilst in some areas of research omics showed immediate results, in other fields, including asthma, it only allowed us to identify the incredibly complicated molecular processes. Along with their possibilities, omics technologies also bring many issues connected to sample collection, analyses and interpretation. It is often impossible to separate the intrinsic imperfection of omics from asthma heterogeneity. Still, many insights and directions from applied omics were acquired-presumable phenotypic clusters of patients, plausible biomarkers and potential pathways involved. Omics technologies develop rapidly, bringing improvements also to asthma research. These improvements, together with our growing understanding of asthma subphenotypes and underlying cellular processes, will likely play a role in asthma management strategies.
Collapse
Affiliation(s)
- Olga Ivanova
- Department of Respiratory Medicine, Amsterdam University Medical Centres (AUMC) University of Amsterdam Amsterdam the Netherlands
| | - Levi B. Richards
- Department of Respiratory Medicine, Amsterdam University Medical Centres (AUMC) University of Amsterdam Amsterdam the Netherlands
| | - Susanne J. Vijverberg
- Department of Respiratory Medicine, Amsterdam University Medical Centres (AUMC) University of Amsterdam Amsterdam the Netherlands
| | - Anne H. Neerincx
- Department of Respiratory Medicine, Amsterdam University Medical Centres (AUMC) University of Amsterdam Amsterdam the Netherlands
| | - Anirban Sinha
- Department of Respiratory Medicine, Amsterdam University Medical Centres (AUMC) University of Amsterdam Amsterdam the Netherlands
| | - Peter J. Sterk
- Department of Respiratory Medicine, Amsterdam University Medical Centres (AUMC) University of Amsterdam Amsterdam the Netherlands
| | - Anke H. Maitland‐van der Zee
- Department of Respiratory Medicine, Amsterdam University Medical Centres (AUMC) University of Amsterdam Amsterdam the Netherlands
- Department of Paediatric Pulmonology Amsterdam UMC/ Emma Children's Hospital Amsterdam the Netherlands
| |
Collapse
|
14
|
Bravo-Merodio L, Williams JA, Gkoutos GV, Acharjee A. -Omics biomarker identification pipeline for translational medicine. J Transl Med 2019; 17:155. [PMID: 31088492 PMCID: PMC6518609 DOI: 10.1186/s12967-019-1912-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2018] [Accepted: 05/08/2019] [Indexed: 01/31/2023] Open
Abstract
BACKGROUND Translational medicine (TM) is an emerging domain that aims to facilitate medical or biological advances efficiently from the scientist to the clinician. Central to the TM vision is to narrow the gap between basic science and applied science in terms of time, cost and early diagnosis of the disease state. Biomarker identification is one of the main challenges within TM. The identification of disease biomarkers from -omics data will not only help the stratification of diverse patient cohorts but will also provide early diagnostic information which could improve patient management and potentially prevent adverse outcomes. However, biomarker identification needs to be robust and reproducible. Hence a robust unbiased computational framework that can help clinicians identify those biomarkers is necessary. METHODS We developed a pipeline (workflow) that includes two different supervised classification techniques based on regularization methods to identify biomarkers from -omics or other high dimension clinical datasets. The pipeline includes several important steps such as quality control and stability of selected biomarkers. The process takes input files (outcome and independent variables or -omics data) and pre-processes (normalization, missing values) them. After a random division of samples into training and test sets, Least Absolute Shrinkage and Selection Operator and Elastic Net feature selection methods are applied to identify the most important features representing potential biomarker candidates. The penalization parameters are optimised using 10-fold cross validation and the process undergoes 100 iterations and a combinatorial analysis to select the best performing multivariate model. An empirical unbiased assessment of their quality as biomarkers for clinical use is performed through a Receiver Operating Characteristic curve and its Area Under the Curve analysis on both permuted and real data for 1000 different randomized training and test sets. We validated this pipeline against previously published biomarkers. RESULTS We applied this pipeline to three different datasets with previously published biomarkers: lipidomics data by Acharjee et al. (Metabolomics 13:25, 2017) and transcriptomics data by Rajamani and Bhasin (Genome Med 8:38, 2016) and Mills et al. (Blood 114:1063-1072, 2009). Our results demonstrate that our method was able to identify both previously published biomarkers as well as new variables that add value to the published results. CONCLUSIONS We developed a robust pipeline to identify clinically relevant biomarkers that can be applied to different -omics datasets. Such identification reveals potentially novel drug targets and can be used as a part of a machine-learning based patient stratification framework in the translational medicine settings.
Collapse
Affiliation(s)
- Laura Bravo-Merodio
- College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT UK
- Institute of Translational Medicine, University Hospitals Birmingham NHS Foundation Trust, Birmingham, B15 2TT UK
| | - John A. Williams
- College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT UK
- Institute of Translational Medicine, University Hospitals Birmingham NHS Foundation Trust, Birmingham, B15 2TT UK
- Mammalian Genetics Unit, Medical Research Council Harwell Institute, Harwell Campus, Didcot, OX11 0RD UK
| | - Georgios V. Gkoutos
- College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT UK
- Institute of Translational Medicine, University Hospitals Birmingham NHS Foundation Trust, Birmingham, B15 2TT UK
- MRC Health Data Research UK (HDR UK), London, UK
- NIHR Experimental Cancer Medicine Centre, Birmingham, B15 2TT UK
- NIHR Surgical Reconstruction and Microbiology Research Centre, Birmingham, B15 2TT UK
- NIHR Biomedical Research Centre, Birmingham, B15 2TT UK
| | - Animesh Acharjee
- College of Medical and Dental Sciences, Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, B15 2TT UK
- Institute of Translational Medicine, University Hospitals Birmingham NHS Foundation Trust, Birmingham, B15 2TT UK
- NIHR Surgical Reconstruction and Microbiology Research Centre, Birmingham, B15 2TT UK
| |
Collapse
|
15
|
Barbosa-Silva A, Bratfalean D, Gu W, Satagopam V, Houston P, Becnel LB, Eifes S, Richard F, Tielmann A, Herzinger S, Rege K, Balling R, Peeters P, Schneider R. Presenting and sharing clinical data using the eTRIKS Standards Master Tree for tranSMART. Bioinformatics 2019; 35:1562-1565. [PMID: 30256906 PMCID: PMC6499239 DOI: 10.1093/bioinformatics/bty809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2017] [Revised: 07/30/2018] [Accepted: 09/25/2018] [Indexed: 11/13/2022] Open
Abstract
Motivation Standardization and semantic alignment have been considered one of the major challenges for data integration in clinical research. The inclusion of the CDISC SDTM clinical data standard into the tranSMART i2b2 via a guiding master ontology tree positively impacts and supports the efficacy of data sharing, visualization and exploration across datasets. Results We present here a schema for the organization of SDTM variables into the tranSMART i2b2 tree along with a script and test dataset to exemplify the mapping strategy. The eTRIKS master tree concept is demonstrated by making use of fictitious data generated for four patients, including 16 SDTM clinical domains. We describe how the usage of correct visit names and data labels can help to integrate multiple readouts per patient and avoid ETL crashes when running a tranSMART loading routine. Availability and implementation The eTRIKS Master Tree package and test datasets are publicly available at https://doi.org/10.5281/zenodo.1009098 and a functional demo installation at https://public.etriks.org/transmart/datasetExplorer/ under eTRIKS—Master Tree branch, where the discussed examples can be visualized.
Collapse
Affiliation(s)
- Adriano Barbosa-Silva
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg.,Department of Clinical Pharmacology, Centre for Translational Bioinformatics, William Harvey Research, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M6BQ, UK
| | - Dorina Bratfalean
- CDISC, Clinical Data Interchange Standards Consortium and CDISC EU Foundation, Saint-Louis, Alsace, France
| | - Wei Gu
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg
| | - Venkata Satagopam
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg
| | - Paul Houston
- CDISC, Clinical Data Interchange Standards Consortium and CDISC EU Foundation, Saint-Louis, Alsace, France
| | - Lauren B Becnel
- CDISC, Clinical Data Interchange Standards Consortium and CDISC EU Foundation, Saint-Louis, Alsace, France
| | - Serge Eifes
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg.,Information Technology for Translational Medicine (ITTM) S.A, Esch/Belval, Luxembourg
| | - Fabien Richard
- Centre National de la Recherche Scientifique (CNRS), Lyon, Auvergne-Rhône-Alpes, France
| | - Andreas Tielmann
- Department of Research and Bioinformatics, Merck KGaA, Darmstadt, Germany
| | - Sascha Herzinger
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg
| | - Kavita Rege
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg
| | - Rudi Balling
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg
| | | | - Reinhard Schneider
- University of Luxembourg, Luxembourg Centre for Systems Biomedicine, L-4371 Belvaux, Luxembourg
| |
Collapse
|
16
|
Gu W, Yildirimman R, Van der Stuyft E, Verbeeck D, Herzinger S, Satagopam V, Barbosa-Silva A, Schneider R, Lange B, Lehrach H, Guo Y, Henderson D, Rowe A. Data and knowledge management in translational research: implementation of the eTRIKS platform for the IMI OncoTrack consortium. BMC Bioinformatics 2019; 20:164. [PMID: 30935364 PMCID: PMC6444691 DOI: 10.1186/s12859-019-2748-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 03/18/2019] [Indexed: 01/04/2023] Open
Abstract
Background For large international research consortia, such as those funded by the European Union’s Horizon 2020 programme or the Innovative Medicines Initiative, good data coordination practices and tools are essential for the successful collection, organization and analysis of the resulting data. Research consortia are attempting ever more ambitious science to better understand disease, by leveraging technologies such as whole genome sequencing, proteomics, patient-derived biological models and computer-based systems biology simulations. Results The IMI eTRIKS consortium is charged with the task of developing an integrated knowledge management platform capable of supporting the complexity of the data generated by such research programmes. In this paper, using the example of the OncoTrack consortium, we describe a typical use case in translational medicine. The tranSMART knowledge management platform was implemented to support data from observational clinical cohorts, drug response data from cell culture models and drug response data from mouse xenograft tumour models. The high dimensional (omics) data from the molecular analyses of the corresponding biological materials were linked to these collections, so that users could browse and analyse these to derive candidate biomarkers. Conclusions In all these steps, data mapping, linking and preparation are handled automatically by the tranSMART integration platform. Therefore, researchers without specialist data handling skills can focus directly on the scientific questions, without spending undue effort on processing the data and data integration, which are otherwise a burden and the most time-consuming part of translational research data analysis. Electronic supplementary material The online version of this article (10.1186/s12859-019-2748-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Wei Gu
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | | | | | - Sascha Herzinger
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Adriano Barbosa-Silva
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Bodo Lange
- Alacris Theranostics GmbH, Berlin, Germany
| | - Hans Lehrach
- Alacris Theranostics GmbH, Berlin, Germany.,Max Planck Institute for Molecular Genetics, Berlin, Germany.,Dahlem Centre for Genome Research and Medical Systems Biology, Berlin, Germany
| | - Yike Guo
- Data Science Institute, Imperial College London, London, UK
| | | | - Anthony Rowe
- Janssen Research and Development Ltd, High Wycombe, UK.
| | | |
Collapse
|
17
|
Ostaszewski M, Gebel S, Kuperstein I, Mazein A, Zinovyev A, Dogrusoz U, Hasenauer J, Fleming RMT, Le Novère N, Gawron P, Ligon T, Niarakis A, Nickerson D, Weindl D, Balling R, Barillot E, Auffray C, Schneider R. Community-driven roadmap for integrated disease maps. Brief Bioinform 2019; 20:659-670. [PMID: 29688273 PMCID: PMC6556900 DOI: 10.1093/bib/bby024] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Revised: 03/02/2018] [Indexed: 01/07/2023] Open
Abstract
The Disease Maps Project builds on a network of scientific and clinical groups that exchange best practices, share information and develop systems biomedicine tools. The project aims for an integrated, highly curated and user-friendly platform for disease-related knowledge. The primary focus of disease maps is on interconnected signaling, metabolic and gene regulatory network pathways represented in standard formats. The involvement of domain experts ensures that the key disease hallmarks are covered and relevant, up-to-date knowledge is adequately represented. Expert-curated and computer readable, disease maps may serve as a compendium of knowledge, allow for data-supported hypothesis generation or serve as a scaffold for the generation of predictive mathematical models. This article summarizes the 2nd Disease Maps Community meeting, highlighting its important topics and outcomes. We outline milestones on the roadmap for the future development of disease maps, including creating and maintaining standardized disease maps; sharing parts of maps that encode common human disease mechanisms; providing technical solutions for complexity management of maps; and Web tools for in-depth exploration of such maps. A dedicated discussion was focused on mathematical modeling approaches, as one of the main goals of disease map development is the generation of mathematically interpretable representations to predict disease comorbidity or drug response and to suggest drug repositioning, altogether supporting clinical decisions.
Collapse
Affiliation(s)
- Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine, Universite du Luxembourg, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Stephan Gebel
- Luxembourg Centre for Systems Biomedicine, Universite du Luxembourg, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Inna Kuperstein
- Institut Curie, PSL Research University, INSERM U900, F-75005 Paris, France and CBIO-Centre for Computational Biology, MINES ParisTech, PSL Research University, F-75006 Paris, France
| | - Alexander Mazein
- European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, INSERM U900, F-75005 Paris, France and CBIO-Centre for Computational Biology, MINES ParisTech, PSL Research University, F-75006 Paris, France
| | - Ugur Dogrusoz
- Computer Engineering Department, Faculty of Engineering, Bilkent University, Ankara 06800, Turkey
| | - Jan Hasenauer
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
| | - Ronan M T Fleming
- Division of Systems Biomedicine and Pharmacology, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, Netherlands
| | - Nicolas Le Novère
- The Babraham Institute, Babraham Research Campus, Cambridge CB22 3AT, United Kingdom
| | - Piotr Gawron
- Luxembourg Centre for Systems Biomedicine, Universite du Luxembourg, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Thomas Ligon
- Faculty of Physics and Center for NanoScience (CeNS), Ludwig-Maximilians-Universität, 80539 München, Germany
| | - Anna Niarakis
- GenHotel EA3886, Univ Evry, Université Paris-Saclay, Evry 91025, France
| | - David Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Daniel Weindl
- Institute of Computational Biology, Helmholtz Zentrum München-German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg, Germany
| | - Rudi Balling
- Luxembourg Centre for Systems Biomedicine, Universite du Luxembourg, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, INSERM U900, F-75005 Paris, France and CBIO-Centre for Computational Biology, MINES ParisTech, PSL Research University, F-75006 Paris, France
| | - Charles Auffray
- European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, Universite du Luxembourg, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| |
Collapse
|
18
|
Saqi M, Lysenko A, Guo YK, Tsunoda T, Auffray C. Navigating the disease landscape: knowledge representations for contextualizing molecular signatures. Brief Bioinform 2019; 20:609-623. [PMID: 29684165 PMCID: PMC6556902 DOI: 10.1093/bib/bby025] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 02/05/2018] [Indexed: 12/14/2022] Open
Abstract
Large amounts of data emerging from experiments in molecular medicine are leading to the identification of molecular signatures associated with disease subtypes. The contextualization of these patterns is important for obtaining mechanistic insight into the aberrant processes associated with a disease, and this typically involves the integration of multiple heterogeneous types of data. In this review, we discuss knowledge representations that can be useful to explore the biological context of molecular signatures, in particular three main approaches, namely, pathway mapping approaches, molecular network centric approaches and approaches that represent biological statements as knowledge graphs. We discuss the utility of each of these paradigms, illustrate how they can be leveraged with selected practical examples and identify ongoing challenges for this field of research.
Collapse
Affiliation(s)
- Mansoor Saqi
- Mansoor Saqi Data Science Institute, Imperial College London, UK
| | - Artem Lysenko
- Artem Lysenko Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yi-Ke Guo
- Yi-Ke Guo Data Science Institute, Imperial College London, UK
| | - Tatsuhiko Tsunoda
- Tatsuhiko Tsunoda Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan CREST, JST, Tokyo, Japan Department of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| | - Charles Auffray
- Charles Auffray European Institute for Systems Biology and Medicine, Lyon, France
| |
Collapse
|
19
|
Jeanquartier F, Jean-Quartier C, Holzinger A. Use case driven evaluation of open databases for pediatric cancer research. BioData Min 2019; 12:2. [PMID: 30675185 PMCID: PMC6334395 DOI: 10.1186/s13040-018-0190-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Accepted: 12/05/2018] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND A plethora of Web resources are available offering information on clinical, pre-clinical, genomic and theoretical aspects of cancer, including not only the comprehensive cancer projects as ICGC and TCGA, but also less-known and more specialized projects on pediatric diseases such as PCGP. However, in case of data on childhood cancer there is very little information openly available. Several web-based resources and tools offer general biomedical data which are not purpose-built, for neither pediatric nor cancer analysis. Additionally, many Web resources on cancer focus on incidence data and statistical social characteristics as well as self-regulating communities. METHODS We summarize those resources which are open and are considered to support scientific fundamental research, while we address our comparison to 11 identified pediatric cancer-specific resources (5 tools, 6 databases). The evaluation consists of 5 use cases on the example of brain tumor research and covers user-defined search scenarios as well as data mining tasks, also examining interactive visual analysis features. RESULTS Web resources differ in terms of information quantity and presentation. Pedican lists an abundance of entries with few selection features. PeCan and PedcBioPortal include visual analysis tools while the latter integrates published and new consortia-based data. UCSC Xena Browser offers an in-depth analysis of genomic data. ICGC data portal provides various features for data analysis and an option to submit own data. Its focus lies on adult Pan-Cancer projects. Pediatric Pan-Cancer datasets are being integrated into PeCan and PedcBioPortal. Comparing information on prominent mutations within glioma discloses well-known, unknown, possible, as well as inapplicable biomarkers. This summary further emphasizes the varying data allocation. Tested tools show advantages and disadvantages, depending on the respective use case scenario, providing inhomogeneous data quantity and information specifics. CONCLUSIONS Web resources on specific pediatric cancers are less abundant and less-known compared to those offering adult cancer research data. Meanwhile, current efforts of ongoing pediatric data collection and Pan-Cancer projects indicate future opportunities for childhood cancer research, that is greatly needed for both fundamental as well as clinical research.
Collapse
Affiliation(s)
- Fleur Jeanquartier
- Institute of Interactive Systems and Data Science, Graz University of Technology, Graz, Austria
- Holzinger Group HCI-KDD, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Auenbruggerplatz 2/V, Graz, 8036 Austria
| | - Claire Jean-Quartier
- Holzinger Group HCI-KDD, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Auenbruggerplatz 2/V, Graz, 8036 Austria
| | - Andreas Holzinger
- Institute of Interactive Systems and Data Science, Graz University of Technology, Graz, Austria
- Holzinger Group HCI-KDD, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Auenbruggerplatz 2/V, Graz, 8036 Austria
| |
Collapse
|
20
|
Abstract
PURPOSE OF REVIEW To elucidate the keywords big data and artificial intelligence and corresponding literature in the field of urolithiasis. RECENT FINDINGS Numbers of publications on big data and artificial intelligence in the field of urolithiasis are rising, but still low. Most publications describe the development, testing, and validation of automated computational analyses of clinical data sets and/or images in a preclinical setting. SUMMARY In the field of digital health services, there is a discrepancy between the enormous commitment of large private companies and investments of public funds. This situation means a still small number of medical publications on this topic in the urolithiasis field. Nevertheless, as doctors and scientists, we should try to provide our patients with secure and worthwhile digital services.
Collapse
|
21
|
Huppertz B. An updated view on the origin and use of angiogenic biomarkers for preeclampsia. Expert Rev Mol Diagn 2018; 18:1053-1061. [PMID: 30413130 DOI: 10.1080/14737159.2018.1546579] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Introduction: The last decade has seen massive efforts towards the identification and the potential use of predictive biomarkers for the pregnancy pathology preeclampsia. The angiogenic factors sFlt-1 and placental growth factor (PGF) have been in focus and have been massively supported. Areas covered: This review describes preeclampsia and intra-uterine growth restriction (IUGR), focusing on sFlt-1 and PGF, their sources during and outside pregnancy and the application of these markers in diseases outside pregnancy. Finally, the specificity of the angiogenic markers for preeclampsia is discussed. Expert commentary: The admixture of the two independent syndromes preeclampsia and IUGR has not helped in identifying the etiologies of either. Rather, it has made the search for new markers and pathways much more complicated as has the constriction on the angiogenic markers. The current markers sFlt-1 and PGF have a clear value once an adverse outcome is diagnosed but are not specific for preeclampsia. Also, they are mostly derived from the maternal vascular system rather than the placenta and are already in use as markers outside pregnancy. A new holistic approach using disease maps and interoperable workflows based on topic-related big data will help in broadening our understanding of the etiology of preeclampsia and hence, develop new markers and therapies.
Collapse
Affiliation(s)
- Berthold Huppertz
- a Division of Cell Biology, Histology and Embryology , Gottfried Schatz Research Center, Medical University of Graz , Graz , Austria
| |
Collapse
|
22
|
Ostaszewski M, Kieffer E, Danoy G, Schneider R, Bouvry P. Clustering approaches for visual knowledge exploration in molecular interaction networks. BMC Bioinformatics 2018; 19:308. [PMID: 30157777 PMCID: PMC6116538 DOI: 10.1186/s12859-018-2314-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Accepted: 08/14/2018] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Biomedical knowledge grows in complexity, and becomes encoded in network-based repositories, which include focused, expert-drawn diagrams, networks of evidence-based associations and established ontologies. Combining these structured information sources is an important computational challenge, as large graphs are difficult to analyze visually. RESULTS We investigate knowledge discovery in manually curated and annotated molecular interaction diagrams. To evaluate similarity of content we use: i) Euclidean distance in expert-drawn diagrams, ii) shortest path distance using the underlying network and iii) ontology-based distance. We employ clustering with these metrics used separately and in pairwise combinations. We propose a novel bi-level optimization approach together with an evolutionary algorithm for informative combination of distance metrics. We compare the enrichment of the obtained clusters between the solutions and with expert knowledge. We calculate the number of Gene and Disease Ontology terms discovered by different solutions as a measure of cluster quality. Our results show that combining distance metrics can improve clustering accuracy, based on the comparison with expert-provided clusters. Also, the performance of specific combinations of distance functions depends on the clustering depth (number of clusters). By employing bi-level optimization approach we evaluated relative importance of distance functions and we found that indeed the order by which they are combined affects clustering performance. Next, with the enrichment analysis of clustering results we found that both hierarchical and bi-level clustering schemes discovered more Gene and Disease Ontology terms than expert-provided clusters for the same knowledge repository. Moreover, bi-level clustering found more enriched terms than the best hierarchical clustering solution for three distinct distance metric combinations in three different instances of disease maps. CONCLUSIONS In this work we examined the impact of different distance functions on clustering of a visual biomedical knowledge repository. We found that combining distance functions may be beneficial for clustering, and improve exploration of such repositories. We proposed bi-level optimization to evaluate the importance of order by which the distance functions are combined. Both combination and order of these functions affected clustering quality and knowledge recognition in the considered benchmarks. We propose that multiple dimensions can be utilized simultaneously for visual knowledge exploration.
Collapse
Affiliation(s)
- Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, Avenue des Hauts-Fourneaux, Esch-Belval, Luxembourg
| | - Emmanuel Kieffer
- Interdisciplinary Centre for Security, Reliability and Trust, University of Luxembourg, 6, Avenue de la Fonte, Esch-Belval, Luxembourg
| | - Grégoire Danoy
- Computer Science and Communications Research Unit, University of Luxembourg, 6, Avenue de la Fonte, Esch-Belval, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7, Avenue des Hauts-Fourneaux, Esch-Belval, Luxembourg
| | - Pascal Bouvry
- Computer Science and Communications Research Unit, University of Luxembourg, 6, Avenue de la Fonte, Esch-Belval, Luxembourg
| |
Collapse
|
23
|
Pité H, Morais-Almeida M, Rocha SM. Metabolomics in asthma: where do we stand? Curr Opin Pulm Med 2018; 24:94-103. [PMID: 29059088 DOI: 10.1097/mcp.0000000000000437] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
PURPOSE OF REVIEW Metabolomics has been used to uncover the metabolic signatures of asthma, both for biomarker identification and pathophysiologic mechanisms research. We aimed to review recent advances in this field, published since 2016, and discuss these findings implications to future research and application into clinical practice. RECENT FINDINGS Experimental asthma models and clinical studies in both children and adults supported independent metabolic signatures of asthma. Common reported pathways included purine, glycerophospholipid, glutathione, fatty acids, and arginine and proline metabolism. Metabolomics-based studies identified candidate biomarkers related to asthma severity and corticosteroid resistance, and supported the definition of the obesity-related phenotype at the molecular level. A systematic review with meta-analysis and recent prospective studies favored exhaled volatile organic compounds as one of the most promising biomarkers in asthma diagnosis and monitoring. SUMMARY Metabolomics has provided unique and novel insights into asthma profiling at the molecular level. Current challenges include procedures standardization and control of potentially confounding variables for external validation. Point-of-care technology developments bring metabolomics closer to clinical practice. In addition to biomarkers identification, relating metabolites to their biologic role will serve as critical foundations for understanding the biology underpinning asthma heterogeneity and for specific-targeted therapies. VIDEO ABSTRACT.
Collapse
Affiliation(s)
- Helena Pité
- Allergy Center, CUF Descobertas Hospital and CUF Infante Santo Hospital.,CEDOC, Chronic Diseases Research Center, NOVA Medical School/Faculdade de Ciências Médicas, Universidade Nova de Lisboa, Lisbon
| | | | - Sílvia M Rocha
- Department of Chemistry & QOPNA, University of Aveiro, Aveiro, Portugal
| |
Collapse
|
24
|
Mazein A, Ostaszewski M, Kuperstein I, Watterson S, Le Novère N, Lefaudeux D, De Meulder B, Pellet J, Balaur I, Saqi M, Nogueira MM, He F, Parton A, Lemonnier N, Gawron P, Gebel S, Hainaut P, Ollert M, Dogrusoz U, Barillot E, Zinovyev A, Schneider R, Balling R, Auffray C. Systems medicine disease maps: community-driven comprehensive representation of disease mechanisms. NPJ Syst Biol Appl 2018; 4:21. [PMID: 29872544 PMCID: PMC5984630 DOI: 10.1038/s41540-018-0059-y] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Revised: 04/26/2018] [Accepted: 05/04/2018] [Indexed: 12/18/2022] Open
Abstract
The development of computational approaches in systems biology has reached a state of maturity that allows their transition to systems medicine. Despite this progress, intuitive visualisation and context-dependent knowledge representation still present a major bottleneck. In this paper, we describe the Disease Maps Project, an effort towards a community-driven computationally readable comprehensive representation of disease mechanisms. We outline the key principles and the framework required for the success of this initiative, including use of best practices, standards and protocols. We apply a modular approach to ensure efficient sharing and reuse of resources for projects dedicated to specific diseases. Community-wide use of disease maps will accelerate the conduct of biomedical research and lead to new disease ontologies defined from mechanism-based disease endotypes rather than phenotypes.
Collapse
Affiliation(s)
- Alexander Mazein
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Marek Ostaszewski
- 2Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Campus Belval, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Inna Kuperstein
- 3Institut Curie, Paris, France.,4INSERM, U900 Paris, France.,5Mines ParisTech, Fontainebleau, France.,6PSL Research University, Paris, France
| | - Steven Watterson
- 7Northern Ireland Centre for Stratified Medicine, Ulster University, C-Tric, Altnagelvin Hospital Campus, Derry, Co Londonderry, Northern Ireland, BT47 6SB UK
| | - Nicolas Le Novère
- 8The Babraham Institute, Babraham Research Campus, Cambridge, CB22 3AT UK
| | - Diane Lefaudeux
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Bertrand De Meulder
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Johann Pellet
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Irina Balaur
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Mansoor Saqi
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Maria Manuela Nogueira
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| | - Feng He
- Department of Infection and Immunity, Luxembourg Institute of Health (LIH), House of BioHealth, 29 Rue Henri Koch, L-4354 Esch-Sur-Alzette, Luxembourg
| | - Andrew Parton
- 7Northern Ireland Centre for Stratified Medicine, Ulster University, C-Tric, Altnagelvin Hospital Campus, Derry, Co Londonderry, Northern Ireland, BT47 6SB UK
| | - Nathanaël Lemonnier
- 10Institute for Advanced Biosciences, University Grenoble-Alpes-INSERM U1209-CNRS UMR5309, Site Santé - Allée des Alpes, 38700 La Tronche, France
| | - Piotr Gawron
- 2Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Campus Belval, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Stephan Gebel
- 2Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Campus Belval, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Pierre Hainaut
- 10Institute for Advanced Biosciences, University Grenoble-Alpes-INSERM U1209-CNRS UMR5309, Site Santé - Allée des Alpes, 38700 La Tronche, France
| | - Markus Ollert
- Department of Infection and Immunity, Luxembourg Institute of Health (LIH), House of BioHealth, 29 Rue Henri Koch, L-4354 Esch-Sur-Alzette, Luxembourg.,11Department of Dermatology and Allergy Center, Odense Research Center for Anaphylaxis, University of Southern Denmark, Odense, Denmark
| | - Ugur Dogrusoz
- 12Faculty of Engineering, Computer Engineering Department, Bilkent University, Ankara, 06800 Turkey
| | - Emmanuel Barillot
- 3Institut Curie, Paris, France.,4INSERM, U900 Paris, France.,5Mines ParisTech, Fontainebleau, France.,6PSL Research University, Paris, France
| | - Andrei Zinovyev
- 3Institut Curie, Paris, France.,4INSERM, U900 Paris, France.,5Mines ParisTech, Fontainebleau, France.,6PSL Research University, Paris, France
| | - Reinhard Schneider
- 2Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Campus Belval, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Rudi Balling
- 2Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Campus Belval, 7 Avenue des Hauts-Fourneaux, L-4362 Esch-sur-Alzette, Luxembourg
| | - Charles Auffray
- 1European Institute for Systems Biology and Medicine, CIRI UMR5308, CNRS-ENS-UCBL-INSERM, Université de Lyon, 50 Avenue Tony Garnier, 69007 Lyon, France
| |
Collapse
|
25
|
Herzinger S, Gu W, Satagopam V, Eifes S, Rege K, Barbosa-Silva A, Schneider R. SmartR: an open-source platform for interactive visual analytics for translational research data. Bioinformatics 2018; 33:2229-2231. [PMID: 28334291 PMCID: PMC5870773 DOI: 10.1093/bioinformatics/btx137] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 03/08/2017] [Indexed: 11/28/2022] Open
Abstract
Summary In translational research, efficient knowledge exchange between the different fields of expertise is crucial. An open platform that is capable of storing a multitude of data types such as clinical, pre-clinical or OMICS data combined with strong visual analytical capabilities will significantly accelerate the scientific progress by making data more accessible and hypothesis generation easier. The open data warehouse tranSMART is capable of storing a variety of data types and has a growing user community including both academic institutions and pharmaceutical companies. tranSMART, however, currently lacks interactive and dynamic visual analytics and does not permit any post-processing interaction or exploration. For this reason, we developed SmartR, a plugin for tranSMART, that equips the platform not only with several dynamic visual analytical workflows, but also provides its own framework for the addition of new custom workflows. Modern web technologies such as D3.js or AngularJS were used to build a set of standard visualizations that were heavily improved with dynamic elements. Availability and Implementation The source code is licensed under the Apache 2.0 License and is freely available on GitHub: https://github.com/transmart/SmartR. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sascha Herzinger
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch/Belval, Luxembourg
| | - Wei Gu
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch/Belval, Luxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch/Belval, Luxembourg
| | - Serge Eifes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch/Belval, Luxembourg.,Information Technology for Translational Medicine (ITTM) S.A, Esch/Belval, Luxembourg
| | - Kavita Rege
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch/Belval, Luxembourg
| | - Adriano Barbosa-Silva
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch/Belval, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch/Belval, Luxembourg
| | | |
Collapse
|
26
|
Turi KN, Romick-Rosendale L, Ryckman KK, Hartert TV. A review of metabolomics approaches and their application in identifying causal pathways of childhood asthma. J Allergy Clin Immunol 2018; 141:1191-1201. [PMID: 28479327 PMCID: PMC5671382 DOI: 10.1016/j.jaci.2017.04.021] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Revised: 03/08/2017] [Accepted: 04/13/2017] [Indexed: 12/20/2022]
Abstract
Because asthma is a disease that results from host-environment interactions, an approach that allows assessment of the effect of the environment on the host is needed to understand the disease. Metabolomics has appealing potential as an application to study pathways to childhood asthma development. The objective of this review is to provide an overview of metabolomics methods and their application to understanding host-environment pathways in asthma development. We reviewed recent literature on advances in metabolomics and their application to study pathways to childhood asthma development. We highlight the (1) potential of metabolomics in understanding the pathogenesis of disease and the discovery of biomarkers; (2) choice of metabolomics techniques, biospecimen handling, and data analysis; (3) application to studying the role of the environment on asthma development; (4) review of metabolomics applied to the outcome of asthma; (5) recommendations for application of metabolomics-based -omics data integration in understanding disease pathogenesis; and (6) limitations. In conclusion, metabolomics allows use of biospecimens to identify useful biomarkers and pathways involved in disease development and subsequently to inform a greater understanding of disease pathogenesis and endotypes and prediction of the clinical course of childhood asthma phenotypes.
Collapse
Affiliation(s)
- Kedir N Turi
- Division of Allergy, Pulmonary, and Critical Care Medicine, Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tenn
| | - Lindsey Romick-Rosendale
- Division of Pathology and Laboratory Medicine, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio
| | - Kelli K Ryckman
- Departments of Epidemiology and Pediatrics, College of Public Health and Carver College of Medicine, University of Iowa, Iowa City, Iowa
| | - Tina V Hartert
- Division of Allergy, Pulmonary, and Critical Care Medicine, Department of Medicine, Vanderbilt University School of Medicine, Nashville, Tenn.
| |
Collapse
|
27
|
Maudsley S, Devanarayan V, Martin B, Geerts H. Intelligent and effective informatic deconvolution of “Big Data” and its future impact on the quantitative nature of neurodegenerative disease therapy. Alzheimers Dement 2018; 14:961-975. [DOI: 10.1016/j.jalz.2018.01.014] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2017] [Revised: 10/03/2017] [Accepted: 01/18/2018] [Indexed: 12/31/2022]
Affiliation(s)
- Stuart Maudsley
- Department of Biomedical ResearchUniversity of AntwerpAntwerpBelgium
- VIB Center for Molecular NeurologyAntwerpBelgium
| | | | - Bronwen Martin
- Department of Biomedical ResearchUniversity of AntwerpAntwerpBelgium
| | | | | |
Collapse
|
28
|
Zeng ISL, Lumley T. Review of Statistical Learning Methods in Integrated Omics Studies (An Integrated Information Science). Bioinform Biol Insights 2018; 12:1177932218759292. [PMID: 29497285 PMCID: PMC5824897 DOI: 10.1177/1177932218759292] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2017] [Accepted: 01/24/2018] [Indexed: 12/14/2022] Open
Abstract
Integrated omics is becoming a new channel for investigating the complex molecular system in modern biological science and sets a foundation for systematic learning for precision medicine. The statistical/machine learning methods that have emerged in the past decade for integrated omics are not only innovative but also multidisciplinary with integrated knowledge in biology, medicine, statistics, machine learning, and artificial intelligence. Here, we review the nontrivial classes of learning methods from the statistical aspects and streamline these learning methods within the statistical learning framework. The intriguing findings from the review are that the methods used are generalizable to other disciplines with complex systematic structure, and the integrated omics is part of an integrated information science which has collated and integrated different types of information for inferences and decision making. We review the statistical learning methods of exploratory and supervised learning from 42 publications. We also discuss the strengths and limitations of the extended principal component analysis, cluster analysis, network analysis, and regression methods. Statistical techniques such as penalization for sparsity induction when there are fewer observations than the number of features and using Bayesian approach when there are prior knowledge to be integrated are also included in the commentary. For the completeness of the review, a table of currently available software and packages from 23 publications for omics are summarized in the appendix.
Collapse
Affiliation(s)
- Irene Sui Lan Zeng
- Department of Statistics, Faculty of Science, The University of Auckland, Auckland, New Zealand
| | - Thomas Lumley
- Department of Statistics, Faculty of Science, The University of Auckland, Auckland, New Zealand
| |
Collapse
|
29
|
Paton C, Karopka T. The Role of Free/Libre and Open Source Software in Learning Health Systems. Yearb Med Inform 2017; 26:53-58. [PMID: 28480476 PMCID: PMC6239249 DOI: 10.15265/iy-2017-006] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Objective: To give an overview of the role of Free/Libre and Open Source Software (FLOSS) in the context of secondary use of patient data to enable Learning Health Systems (LHSs). Methods: We conducted an environmental scan of the academic and grey literature utilising the MedFLOSS database of open source systems in healthcare to inform a discussion of the role of open source in developing LHSs that reuse patient data for research and quality improvement. Results: A wide range of FLOSS is identified that contributes to the information technology (IT) infrastructure of LHSs including operating systems, databases, frameworks, interoperability software, and mobile and web apps. The recent literature around the development and use of key clinical data management tools is also reviewed. Conclusions: FLOSS already plays a critical role in modern health IT infrastructure for the collection, storage, and analysis of patient data. The nature of FLOSS systems to be collaborative, modular, and modifiable may make open source approaches appropriate for building the digital infrastructure for a LHS.
Collapse
Affiliation(s)
- C. Paton
- Group Head for Global Health Informatics, Centre for Tropical Medicine and Global Health, University of Oxford, UK
| | - T. Karopka
- Chair of IMIA OS WG, Chair of EFMI LIFOSS WG, Project Manager, BioCon Valley GmbH, Greifswald, Germany
| |
Collapse
|
30
|
Tebani A, Afonso C, Marret S, Bekri S. Omics-Based Strategies in Precision Medicine: Toward a Paradigm Shift in Inborn Errors of Metabolism Investigations. Int J Mol Sci 2016; 17:ijms17091555. [PMID: 27649151 PMCID: PMC5037827 DOI: 10.3390/ijms17091555] [Citation(s) in RCA: 105] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Revised: 09/06/2016] [Accepted: 09/07/2016] [Indexed: 12/20/2022] Open
Abstract
The rise of technologies that simultaneously measure thousands of data points represents the heart of systems biology. These technologies have had a huge impact on the discovery of next-generation diagnostics, biomarkers, and drugs in the precision medicine era. Systems biology aims to achieve systemic exploration of complex interactions in biological systems. Driven by high-throughput omics technologies and the computational surge, it enables multi-scale and insightful overviews of cells, organisms, and populations. Precision medicine capitalizes on these conceptual and technological advancements and stands on two main pillars: data generation and data modeling. High-throughput omics technologies allow the retrieval of comprehensive and holistic biological information, whereas computational capabilities enable high-dimensional data modeling and, therefore, accessible and user-friendly visualization. Furthermore, bioinformatics has enabled comprehensive multi-omics and clinical data integration for insightful interpretation. Despite their promise, the translation of these technologies into clinically actionable tools has been slow. In this review, we present state-of-the-art multi-omics data analysis strategies in a clinical context. The challenges of omics-based biomarker translation are discussed. Perspectives regarding the use of multi-omics approaches for inborn errors of metabolism (IEM) are presented by introducing a new paradigm shift in addressing IEM investigations in the post-genomic era.
Collapse
Affiliation(s)
- Abdellah Tebani
- Department of Metabolic Biochemistry, Rouen University Hospital, 76031 Rouen, France.
- Normandie University, UNIROUEN, INSERM, CHU Rouen, Laboratoire NeoVasc ERI28, 76000 Rouen, France.
- Normandie University, UNIROUEN, INSA Rouen, CNRS, COBRA, 76000 Rouen, France.
| | - Carlos Afonso
- Normandie University, UNIROUEN, INSA Rouen, CNRS, COBRA, 76000 Rouen, France.
| | - Stéphane Marret
- Normandie University, UNIROUEN, INSERM, CHU Rouen, Laboratoire NeoVasc ERI28, 76000 Rouen, France.
- Department of Neonatal Pediatrics, Intensive Care and Neuropediatrics, Rouen University Hospital, 76031 Rouen, France.
| | - Soumeya Bekri
- Department of Metabolic Biochemistry, Rouen University Hospital, 76031 Rouen, France.
- Normandie University, UNIROUEN, INSERM, CHU Rouen, Laboratoire NeoVasc ERI28, 76000 Rouen, France.
| |
Collapse
|