Delfin C, Dragan I, Kuznetsov D, Tajes JF, Smit F, Coral DE, Farzaneh A, Haugg A, Hungele A, Niknejad A, Hall C, Jacobs D, Marek D, Fraser DP, Thuillier D, Ahmadizar F, Mehl F, Pattou F, Burdet F, Hawkes G, Arts ICW, Blanch J, Van Soest J, Fernández-Real JM, Boehl J, Fink K, van Greevenbroek MMJ, Kavousi M, Minten M, Prinz N, Ipsen N, Franks PW, Ramos R, Holl RW, Horban S, Duarte-Salles T, Tran VDT, Raverdy V, Leal Y, Lenart A, Pearson E, Sparsø T, Giordano GN, Ioannidis V, Soh K, Frayling TM, Le Roux CW, Ibberson M. A Federated Database for Obesity Research: An IMI-SOPHIA Study. Life (Basel) 2024; 14:262. [PMID: 38398771 PMCID: PMC10890572 DOI: 10.3390/life14020262] [Received: 12/20/2023] [Revised: 01/12/2024] [Accepted: 02/08/2024] [Indexed: 02/25/2024]
Abstract
Obesity is considered by many as a lifestyle choice rather than a chronic progressive disease. The Innovative Medicines Initiative (IMI) SOPHIA (Stratification of Obesity Phenotypes to Optimize Future Obesity Therapy) project is part of a momentum shift aiming to provide better tools for the stratification of people with obesity according to disease risk and treatment response. One of the challenges to achieving these goals is that many clinical cohorts are siloed, limiting the potential of combined data for biomarker discovery. In SOPHIA, we have addressed this challenge by setting up a federated database building on open-source DataSHIELD technology. The database currently federates 16 cohorts that are accessible via a central gateway. The database is multi-modal, including research studies, clinical trials, and routine health data, and is accessed using the R statistical programming environment where statistical and machine learning analyses can be performed at a distance without any disclosure of patient-level data. We demonstrate the use of the database by providing a proof-of-concept analysis, performing a federated linear model of BMI and systolic blood pressure, pooling all data from 16 studies virtually without any analyst seeing individual patient-level data. This analysis provided similar point estimates compared to a meta-analysis of the 16 individual studies. Our approach provides a benchmark for reproducible, safe federated analyses across multiple study types provided by multiple stakeholders.
Affiliation(s)
- Iulian Dragan
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Dmitry Kuznetsov
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Juan Fernandez Tajes
  - Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
- Femke Smit
  - Maastricht Center for Systems Biology, Faculty of Science and Engineering, Maastricht University, Paul Henri Spaaklaan 1, 6229 EN Maastricht, The Netherlands
- Daniel E. Coral
  - Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
- Ali Farzaneh
  - Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, 3000 CA Rotterdam, The Netherlands
- André Haugg
  - Global Biostatistics & Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, 88400 Biberach, Germany
- Andreas Hungele
  - Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
  - German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
- Anne Niknejad
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Christopher Hall
  - Division of Population Health and Genomics, Ninewells Hospital and School of Medicine, University of Dundee, Dundee DD1 4HN, UK
- Daan Jacobs
  - Nederlandse Obesitas Kliniek, Huis Ter Heide, 3712 BA Utrecht, The Netherlands
- Diana Marek
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Diane P. Fraser
  - University of Exeter Medical School, University of Exeter, Exeter EX1 2LU, UK
- Dorothee Thuillier
  - Univ Lille, Inserm, CHU Lille, Pasteur Institute Lille, U1190 Translational Research for Diabetes, European Genomic Institute of Diabetes, 59000 Lille, France
- Fariba Ahmadizar
  - Data Science and Biostatistics Department, Julius Global Health, University Medical Center Utrecht, 3508 GA Utrecht, The Netherlands
- Florence Mehl
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Francois Pattou
  - Univ Lille, Inserm, CHU Lille, Pasteur Institute Lille, U1190 Translational Research for Diabetes, European Genomic Institute of Diabetes, 59000 Lille, France
- Frederic Burdet
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Gareth Hawkes
  - University of Exeter Medical School, University of Exeter, Exeter EX1 2LU, UK
- Ilja C. W. Arts
  - Maastricht Center for Systems Biology, Faculty of Science and Engineering, Maastricht University, Paul Henri Spaaklaan 1, 6229 EN Maastricht, The Netherlands
- Jordi Blanch
  - Fundació Institut Universitari per a la Recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), 08007 Barcelona, Spain
  - ISV-Girona Research Group, Research Unit in Primary Care, Primary Care Services, Catalan Institute of Health (ICS), 08908 Barcelona, Spain
- Johan Van Soest
  - Brightlands Institute for Smart Society (BISS), Faculty of Science and Engineering, Maastricht University, 6229 EN Maastricht, The Netherlands
  - Department of Radiation Oncology (Maastro), GROW-School for Oncology and Reproduction, Maastricht University Medical Center, 6229 EN Maastricht, The Netherlands
- José-Manuel Fernández-Real
  - Nutrition, Eumetabolism and Health Group, Institut d’Investigació Biomèdica de Girona (IDIBGI-CERCA), Av. França 30, 17007 Girona, Spain
  - Department of Medical Sciences, School of Medicine, University of Girona, 17003 Girona, Spain
  - CIBER Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III, 28029 Madrid, Spain
  - Department of Diabetes, Endocrinology and Nutrition, Dr. Josep Trueta University Hospital, Av. França, s/n, 17007 Girona, Spain
- Juergen Boehl
  - Global Biostatistics & Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, 88400 Biberach, Germany
- Katharina Fink
  - Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
  - German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
- Marleen M. J. van Greevenbroek
  - Department of Internal Medicine and CARIM School of Cardiovascular Diseases, Maastricht University, 6229 EN Maastricht, The Netherlands
- Maryam Kavousi
  - Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, 3000 CA Rotterdam, The Netherlands
- Michiel Minten
  - Maastricht Center for Systems Biology, Faculty of Science and Engineering, Maastricht University, Paul Henri Spaaklaan 1, 6229 EN Maastricht, The Netherlands
- Nicole Prinz
  - Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
  - German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
- Paul W. Franks
  - Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
- Rafael Ramos
  - Fundació Institut Universitari per a la Recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), 08007 Barcelona, Spain
  - Department of Medical Sciences, School of Medicine, University of Girona, 17003 Girona, Spain
  - Department of Medical Informatics, Erasmus University Medical Center, 3000 CA Rotterdam, The Netherlands
  - Research in Vascular Health Group, Institut d’Investigació Biomèdica de Girona (IDIBGI-CERCA), Parc Hospitalari Martí i Julià, Edifici M2, 17190 Salt, Spain
- Reinhard W. Holl
  - Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
  - German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
- Scott Horban
  - Division of Population Health and Genomics, Ninewells Hospital and School of Medicine, University of Dundee, Dundee DD1 4HN, UK
- Talita Duarte-Salles
  - Fundació Institut Universitari per a la Recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), 08007 Barcelona, Spain
  - Department of Medical Informatics, Erasmus University Medical Center, 3000 CA Rotterdam, The Netherlands
- Van Du T. Tran
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Violeta Raverdy
  - Univ Lille, Inserm, CHU Lille, Pasteur Institute Lille, U1190 Translational Research for Diabetes, European Genomic Institute of Diabetes, 59000 Lille, France
- Yenny Leal
  - Nutrition, Eumetabolism and Health Group, Institut d’Investigació Biomèdica de Girona (IDIBGI-CERCA), Av. França 30, 17007 Girona, Spain
  - Department of Medical Sciences, School of Medicine, University of Girona, 17003 Girona, Spain
  - CIBER Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III, 28029 Madrid, Spain
  - Department of Diabetes, Endocrinology and Nutrition, Dr. Josep Trueta University Hospital, Av. França, s/n, 17007 Girona, Spain
- Ewan Pearson
  - Division of Population Health and Genomics, Ninewells Hospital and School of Medicine, University of Dundee, Dundee DD1 4HN, UK
- Giuseppe N. Giordano
  - Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
- Vassilios Ioannidis
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
- Keng Soh
  - Novo Nordisk A/S, 2860 Søborg, Denmark
- Timothy M. Frayling
  - University of Exeter Medical School, University of Exeter, Exeter EX1 2LU, UK
  - Department of Genetic Medicine and Development, Faculty of Medicine, University of Geneva, 1 Rue Michel-Servet, CH-1211 Geneva, Switzerland
- Carel W. Le Roux
  - Diabetes Complications Research Centre, University College Dublin, D04 V1W8 Dublin, Ireland
- Mark Ibberson
  - Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland