1
|
Kiirikki AM, Antila HS, Bort LS, Buslaev P, Favela-Rosales F, Ferreira TM, Fuchs PFJ, Garcia-Fandino R, Gushchin I, Kav B, Kučerka N, Kula P, Kurki M, Kuzmin A, Lalitha A, Lolicato F, Madsen JJ, Miettinen MS, Mingham C, Monticelli L, Nencini R, Nesterenko AM, Piggot TJ, Piñeiro Á, Reuter N, Samantray S, Suárez-Lestón F, Talandashti R, Ollila OHS. Overlay databank unlocks data-driven analyses of biomolecules for all. Nat Commun 2024; 15:1136. [PMID: 38326316 PMCID: PMC10850068 DOI: 10.1038/s41467-024-45189-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 01/17/2024] [Indexed: 02/09/2024] Open
Abstract
Tools based on artificial intelligence (AI) are currently revolutionising many fields, yet their applications are often limited by the lack of suitable training data in programmatically accessible format. Here we propose an effective solution to make data scattered in various locations and formats accessible for data-driven and machine learning applications using the overlay databank format. To demonstrate the practical relevance of such approach, we present the NMRlipids Databank-a community-driven, open-for-all database featuring programmatic access to quality-evaluated atom-resolution molecular dynamics simulations of cellular membranes. Cellular membrane lipid composition is implicated in diseases and controls major biological functions, but membranes are difficult to study experimentally due to their intrinsic disorder and complex phase behaviour. While MD simulations have been useful in understanding membrane systems, they require significant computational resources and often suffer from inaccuracies in model parameters. Here, we demonstrate how programmable interface for flexible implementation of data-driven and machine learning applications, and rapid access to simulation data through a graphical user interface, unlock possibilities beyond current MD simulation and experimental studies to understand cellular membranes. The proposed overlay databank concept can be further applied to other biomolecules, as well as in other fields where similar barriers hinder the AI revolution.
Collapse
Affiliation(s)
- Anne M Kiirikki
- University of Helsinki, Institute of Biotechnology, Helsinki, Finland
| | - Hanne S Antila
- Department of Theory and Bio-Systems, Max Planck Institute of Colloids and Interfaces, 14424, Potsdam, Germany
- Department of Biomedicine, University of Bergen, 5020, Bergen, Norway
| | - Lara S Bort
- Department of Theory and Bio-Systems, Max Planck Institute of Colloids and Interfaces, 14424, Potsdam, Germany
- University of Potsdam, Institute of Physics and Astronomy, 14476, Potsdam-Golm, Germany
| | - Pavel Buslaev
- Nanoscience Center and Department of Chemistry, University of Jyväskylä, 40014, Jyväskylä, Finland
| | - Fernando Favela-Rosales
- Departamento de Ciencias Básicas, Tecnológico Nacional de México - ITS Zacatecas Occidente, Sombrerete, 99102, Zacatecas, Mexico
| | - Tiago Mendes Ferreira
- NMR group - Institute for Physics, Martin Luther University Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Patrick F J Fuchs
- Sorbonne Université, Ecole Normale Supérieure, PSL University, CNRS, Laboratoire des Biomolécules (LBM), F-75005, Paris, France
- Université Paris Cité, F-75006, Paris, France
| | - Rebeca Garcia-Fandino
- Center for Research in Biological Chemistry and Molecular Materials (CiQUS), Universidade de Santiago de Compostela, E-15782, Santiago de Compostela, Spain
| | | | - Batuhan Kav
- Institute of Biological Information Processing: Structural Biochemistry (IBI-7), Forschungszentrum Jülich, 52428, Jülich, Germany
- ariadne.ai GmbH (Germany), Häusserstraße 3, 69115, Heidelberg, Germany
| | - Norbert Kučerka
- Department of Physical Chemistry of Drugs, Faculty of Pharmacy, Comenius University Bratislava, 832 32, Bratislava, Slovakia
| | - Patrik Kula
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 542/2, CZ-16610, Prague, Czech Republic
| | - Milla Kurki
- School of Pharmacy, University of Eastern Finland, 70211, Kuopio, Finland
| | | | - Anusha Lalitha
- Institut Charles Gerhardt Montpellier (UMR CNRS 5253), Université Montpellier, Place Eugène Bataillon, 34095, Montpellier, Cedex 05, France
| | - Fabio Lolicato
- Heidelberg University Biochemistry Center, 69120, Heidelberg, Germany
- Department of Physics, University of Helsinki, FI-00014, Helsinki, Finland
| | - Jesper J Madsen
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, 33612, Tampa, FL, USA
- Center for Global Health and Infectious Diseases Research, Global and Planetary Health, College of Public Health, University of South Florida, 33612, Tampa, FL, USA
| | - Markus S Miettinen
- Department of Theory and Bio-Systems, Max Planck Institute of Colloids and Interfaces, 14424, Potsdam, Germany
- Department of Chemistry, University of Bergen, 5007, Bergen, Norway
- Department of Informatics, Computational Biology Unit, University of Bergen, 5008, Bergen, Norway
| | - Cedric Mingham
- Hochschule Mannheim, University of Applied Sciences, 68163, Mannheim, Germany
| | - Luca Monticelli
- University of Lyon, CNRS, Molecular Microbiology and Structural Biochemistry (MMSB, UMR 5086), F-69007, Lyon, France
- Institut National de la Santé et de la Recherche Médicale (INSERM), Lyon, France
| | - Ricky Nencini
- University of Helsinki, Institute of Biotechnology, Helsinki, Finland
- Division of Pharmaceutical Biosciences, Faculty of Pharmacy, University of Helsinki, 00014, Helsinki, Finland
| | - Alexey M Nesterenko
- Department of Chemistry, University of Bergen, 5007, Bergen, Norway
- Department of Informatics, Computational Biology Unit, University of Bergen, 5008, Bergen, Norway
| | - Thomas J Piggot
- Chemistry, University of Southampton, Highfield, SO17 1BJ, Southampton, UK
| | - Ángel Piñeiro
- Department of Applied Physics, Faculty of Physics, University of Santiago de Compostela, E-15782, Santiago de Compostela, Spain
| | - Nathalie Reuter
- Department of Chemistry, University of Bergen, 5007, Bergen, Norway
- Department of Informatics, Computational Biology Unit, University of Bergen, 5008, Bergen, Norway
| | - Suman Samantray
- Institute of Biological Information Processing: Structural Biochemistry (IBI-7), Forschungszentrum Jülich, 52428, Jülich, Germany
- Institute of Biotechnology, RWTH Aachen University, Worringerweg 3, 52074, Aachen, Germany
| | - Fabián Suárez-Lestón
- Center for Research in Biological Chemistry and Molecular Materials (CiQUS), Universidade de Santiago de Compostela, E-15782, Santiago de Compostela, Spain
- Department of Applied Physics, Faculty of Physics, University of Santiago de Compostela, E-15782, Santiago de Compostela, Spain
- MD.USE Innovations S.L., Edificio Emprendia, 15782, Santiago de Compostela, Spain
| | - Reza Talandashti
- Department of Chemistry, University of Bergen, 5007, Bergen, Norway
- Department of Informatics, Computational Biology Unit, University of Bergen, 5008, Bergen, Norway
| | - O H Samuli Ollila
- University of Helsinki, Institute of Biotechnology, Helsinki, Finland.
- VTT Technical Research Centre of Finland, Espoo, Finland.
| |
Collapse
|