1
|
Bayarri G, Andrio P, Gelpí JL, Hospital A, Orozco M. Using interactive Jupyter Notebooks and BioConda for FAIR and reproducible biomolecular simulation workflows. PLoS Comput Biol 2024; 20:e1012173. [PMID: 38900779 PMCID: PMC11189206 DOI: 10.1371/journal.pcbi.1012173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2024] Open
Abstract
Interactive Jupyter Notebooks in combination with Conda environments can be used to generate FAIR (Findable, Accessible, Interoperable and Reusable/Reproducible) biomolecular simulation workflows. The interactive programming code accompanied by documentation and the possibility to inspect intermediate results with versatile graphical charts and data visualization is very helpful, especially in iterative processes, where parameters might be adjusted to a particular system of interest. This work presents a collection of FAIR notebooks covering various areas of the biomolecular simulation field, such as molecular dynamics (MD), protein-ligand docking, molecular checking/modeling, molecular interactions, and free energy perturbations. Workflows can be launched with myBinder or easily installed in a local system. The collection of notebooks aims to provide a compilation of demonstration workflows, and it is continuously updated and expanded with examples using new methodologies and tools.
Collapse
Affiliation(s)
- Genís Bayarri
- Institute for Research in Biomedicine (IRB Barcelona), the Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Pau Andrio
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Josep Lluís Gelpí
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
- Department of Biochemistry and Biomedicine, University of Barcelona, Barcelona, Spain
| | - Adam Hospital
- Institute for Research in Biomedicine (IRB Barcelona), the Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), the Barcelona Institute of Science and Technology, Barcelona, Spain
- Department of Biochemistry and Biomedicine, University of Barcelona, Barcelona, Spain
| |
Collapse
|
2
|
McCullough JWS, Coveney PV. Uncertainty quantification of the lattice Boltzmann method focussing on studies of human-scale vascular blood flow. Sci Rep 2024; 14:11317. [PMID: 38760455 PMCID: PMC11101457 DOI: 10.1038/s41598-024-61708-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 05/08/2024] [Indexed: 05/19/2024] Open
Abstract
Uncertainty quantification is becoming a key tool to ensure that numerical models can be sufficiently trusted to be used in domains such as medical device design. Demonstration of how input parameters impact the quantities of interest generated by any numerical model is essential to understanding the limits of its reliability. With the lattice Boltzmann method now a widely used approach for computational fluid dynamics, building greater understanding of its numerical uncertainty characteristics will support its further use in science and industry. In this study we apply an in-depth uncertainty quantification study of the lattice Boltzmann method in a canonical bifurcating geometry that is representative of the vascular junctions present in arterial and venous domains. These campaigns examine how quantities of interest-pressure and velocity along the central axes of the bifurcation-are influenced by the algorithmic parameters of the lattice Boltzmann method and the parameters controlling the values imposed at inlet velocity and outlet pressure boundary conditions. We also conduct a similar campaign on a set of personalised vessels to further illustrate the application of these techniques. Our work provides insights into how input parameters and boundary conditions impact the velocity and pressure distributions calculated in a simulation and can guide the choices of such values when applied to vascular studies of patient specific geometries. We observe that, from an algorithmic perspective, the number of time steps and the size of the grid spacing are the most influential parameters. When considering the influence of boundary conditions, we note that the magnitude of the inlet velocity and the mean pressure applied within sinusoidal pressure outlets have the greatest impact on output quantities of interest. We also observe that, when comparing the magnitude of variation imposed in the input parameters with that observed in the output quantities, this variability is particularly magnified when the input velocity is altered. This study also demonstrates how open-source toolkits for validation, verification and uncertainty quantification can be applied to numerical models deployed on high-performance computers without the need for modifying the simulation code itself. Such an ability is key to the more widespread adoption of the analysis of uncertainty in numerical models by significantly reducing the complexity of their execution and analysis.
Collapse
Affiliation(s)
- Jon W S McCullough
- Centre for Computational Science, Department of Chemistry, University College London, London, UK
| | - Peter V Coveney
- Centre for Computational Science, Department of Chemistry, University College London, London, UK.
- Centre for Advanced Research Computing, University College London, London, UK.
- Informatics Institute, University of Amsterdam, Amsterdam, Netherlands.
| |
Collapse
|
3
|
Kusch L, Diaz-Pier S, Klijn W, Sontheimer K, Bernard C, Morrison A, Jirsa V. Multiscale co-simulation design pattern for neuroscience applications. Front Neuroinform 2024; 18:1156683. [PMID: 38410682 PMCID: PMC10895016 DOI: 10.3389/fninf.2024.1156683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 01/19/2024] [Indexed: 02/28/2024] Open
Abstract
Integration of information across heterogeneous sources creates added scientific value. Interoperability of data, tools and models is, however, difficult to accomplish across spatial and temporal scales. Here we introduce the toolbox Parallel Co-Simulation, which enables the interoperation of simulators operating at different scales. We provide a software science co-design pattern and illustrate its functioning along a neuroscience example, in which individual regions of interest are simulated on the cellular level allowing us to study detailed mechanisms, while the remaining network is efficiently simulated on the population level. A workflow is illustrated for the use case of The Virtual Brain and NEST, in which the CA1 region of the cellular-level hippocampus of the mouse is embedded into a full brain network involving micro and macro electrode recordings. This new tool allows integrating knowledge across scales in the same simulation framework and validating them against multiscale experiments, thereby largely widening the explanatory power of computational models.
Collapse
Affiliation(s)
- Lionel Kusch
- Institut de Neurosciences des Systèmes (INS), UMR1106, Aix-Marseille Université, Marseilles, France
| | - Sandra Diaz-Pier
- Simulation and Data Lab Neuroscience, Jülich Supercomputing Centre (JSC), Institute for Advanced Simulation, JARA, Forschungszentrum Jülich GmbH, Jülich, Germany
| | - Wouter Klijn
- Simulation and Data Lab Neuroscience, Jülich Supercomputing Centre (JSC), Institute for Advanced Simulation, JARA, Forschungszentrum Jülich GmbH, Jülich, Germany
| | - Kim Sontheimer
- Simulation and Data Lab Neuroscience, Jülich Supercomputing Centre (JSC), Institute for Advanced Simulation, JARA, Forschungszentrum Jülich GmbH, Jülich, Germany
| | - Christophe Bernard
- Institut de Neurosciences des Systèmes (INS), UMR1106, Aix-Marseille Université, Marseilles, France
| | - Abigail Morrison
- Simulation and Data Lab Neuroscience, Jülich Supercomputing Centre (JSC), Institute for Advanced Simulation, JARA, Forschungszentrum Jülich GmbH, Jülich, Germany
- Forschungszentrum Jülich GmbH, IAS-6/INM-6, JARA, Jülich, Germany
- Computer Science 3 - Software Engineering, RWTH Aachen University, Aachen, Germany
| | - Viktor Jirsa
- Institut de Neurosciences des Systèmes (INS), UMR1106, Aix-Marseille Université, Marseilles, France
| |
Collapse
|
4
|
Ahmad K, Javed A, Lanphere C, Coveney PV, Orlova EV, Howorka S. Structure and dynamics of an archetypal DNA nanoarchitecture revealed via cryo-EM and molecular dynamics simulations. Nat Commun 2023; 14:3630. [PMID: 37336895 DOI: 10.1038/s41467-023-38681-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Accepted: 05/11/2023] [Indexed: 06/21/2023] Open
Abstract
DNA can be folded into rationally designed, unique, and functional materials. To fully realise the potential of these DNA materials, a fundamental understanding of their structure and dynamics is necessary, both in simple solvents as well as more complex and diverse anisotropic environments. Here we analyse an archetypal six-duplex DNA nanoarchitecture with single-particle cryo-electron microscopy and molecular dynamics simulations in solvents of tunable ionic strength and within the anisotropic environment of biological membranes. Outside lipid bilayers, the six-duplex bundle lacks the designed symmetrical barrel-type architecture. Rather, duplexes are arranged in non-hexagonal fashion and are disorted to form a wider, less elongated structure. Insertion into lipid membranes, however, restores the anticipated barrel shape due to lateral duplex compression by the bilayer. The salt concentration has a drastic impact on the stability of the inserted barrel-shaped DNA nanopore given the tunable electrostatic repulsion between the negatively charged duplexes. By synergistically combining experiments and simulations, we increase fundamental understanding into the environment-dependent structural dynamics of a widely used nanoarchitecture. This insight will pave the way for future engineering and biosensing applications.
Collapse
Affiliation(s)
- Katya Ahmad
- Centre for Computational Science, University College London, London, WC1H 0AJ, UK
| | - Abid Javed
- Department of Biological Sciences, Birkbeck, University of London, London, WC1E 7HX, UK
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Conor Lanphere
- Department of Chemistry, Institute for Structural and Molecular Biology, University College London, London, WC1H0AJ, UK
| | - Peter V Coveney
- Centre for Computational Science, University College London, London, WC1H 0AJ, UK.
- Advanced Research Computing Centre, University College London, London, WC1H 0AJ, UK.
- Informatics Institute, University of Amsterdam, Amsterdam, 1090 GH, The Netherlands.
| | - Elena V Orlova
- Department of Biological Sciences, Birkbeck, University of London, London, WC1E 7HX, UK.
| | - Stefan Howorka
- Department of Chemistry, Institute for Structural and Molecular Biology, University College London, London, WC1H0AJ, UK.
| |
Collapse
|
5
|
Peluso P, Chankvetadze B. Recognition in the Domain of Molecular Chirality: From Noncovalent Interactions to Separation of Enantiomers. Chem Rev 2022; 122:13235-13400. [PMID: 35917234 DOI: 10.1021/acs.chemrev.1c00846] [Citation(s) in RCA: 57] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
It is not a coincidence that both chirality and noncovalent interactions are ubiquitous in nature and synthetic molecular systems. Noncovalent interactivity between chiral molecules underlies enantioselective recognition as a fundamental phenomenon regulating life and human activities. Thus, noncovalent interactions represent the narrative thread of a fascinating story which goes across several disciplines of medical, chemical, physical, biological, and other natural sciences. This review has been conceived with the awareness that a modern attitude toward molecular chirality and its consequences needs to be founded on multidisciplinary approaches to disclose the molecular basis of essential enantioselective phenomena in the domain of chemical, physical, and life sciences. With the primary aim of discussing this topic in an integrated way, a comprehensive pool of rational and systematic multidisciplinary information is provided, which concerns the fundamentals of chirality, a description of noncovalent interactions, and their implications in enantioselective processes occurring in different contexts. A specific focus is devoted to enantioselection in chromatography and electromigration techniques because of their unique feature as "multistep" processes. A second motivation for writing this review is to make a clear statement about the state of the art, the tools we have at our disposal, and what is still missing to fully understand the mechanisms underlying enantioselective recognition.
Collapse
Affiliation(s)
- Paola Peluso
- Istituto di Chimica Biomolecolare ICB, CNR, Sede secondaria di Sassari, Traversa La Crucca 3, Regione Baldinca, Li Punti, I-07100 Sassari, Italy
| | - Bezhan Chankvetadze
- Institute of Physical and Analytical Chemistry, School of Exact and Natural Sciences, Tbilisi State University, Chavchavadze Avenue 3, 0179 Tbilisi, Georgia
| |
Collapse
|
6
|
Lamprecht AL, Palmblad M, Ison J, Schwämmle V, Al Manir MS, Altintas I, Baker CJO, Ben Hadj Amor A, Capella-Gutierrez S, Charonyktakis P, Crusoe MR, Gil Y, Goble C, Griffin TJ, Groth P, Ienasescu H, Jagtap P, Kalaš M, Kasalica V, Khanteymoori A, Kuhn T, Mei H, Ménager H, Möller S, Richardson RA, Robert V, Soiland-Reyes S, Stevens R, Szaniszlo S, Verberne S, Verhoeven A, Wolstencroft K. Perspectives on automated composition of workflows in the life sciences. F1000Res 2021; 10:897. [PMID: 34804501 PMCID: PMC8573700 DOI: 10.12688/f1000research.54159.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/27/2021] [Indexed: 12/29/2022] Open
Abstract
Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the "big picture" of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future.
Collapse
Affiliation(s)
| | - Magnus Palmblad
- Leiden University Medical Center, 2333 ZA, Leiden, The Netherlands
| | - Jon Ison
- French Institute of Bioinformatics, 91057 Évry, France
| | | | | | - Ilkay Altintas
- University of California San Diego, La Jolla, CA, 92093, USA
| | - Christopher J. O. Baker
- University of New Brunswick, Saint John, E2L 4L5, Canada
- IPSNP Computing Inc., Saint John, E2L 4S6, Canada
| | | | | | | | | | - Yolanda Gil
- University of Southern California, Marina Del Rey, CA, 90292, USA
| | - Carole Goble
- Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Paul Groth
- University of Amsterdam, 1090 GH Amsterdam, The Netherlands
| | - Hans Ienasescu
- Technical University of Denmark, 2800 Kongens Lyngby, Denmark
| | - Pratik Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA
| | | | | | | | - Tobias Kuhn
- VU Amsterdam, 1081 HV Amsterdam, The Netherlands
| | - Hailiang Mei
- Sequencing Analysis Support Core, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
| | | | - Steffen Möller
- IBIMA, Rostock University Medical Center, 18057 Rostock, Germany
| | | | | | - Stian Soiland-Reyes
- Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
- Informatics Institute, University of Amsterdam, 1090 GH Amsterdam, The Netherlands
| | - Robert Stevens
- Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
| | | | - Suzan Verberne
- Leiden Institute of Advanced Computer Science, Leiden University, 2333 BE Leiden, The Netherlands
| | - Aswin Verhoeven
- Leiden University Medical Center, 2333 ZA, Leiden, The Netherlands
| | - Katherine Wolstencroft
- Leiden Institute of Advanced Computer Science, Leiden University, 2333 BE Leiden, The Netherlands
| |
Collapse
|
7
|
Vassaux M, Wan S, Edeling W, Coveney PV. Ensembles Are Required to Handle Aleatoric and Parametric Uncertainty in Molecular Dynamics Simulation. J Chem Theory Comput 2021; 17:5187-5197. [PMID: 34280310 PMCID: PMC8389531 DOI: 10.1021/acs.jctc.1c00526] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Indexed: 11/29/2022]
Abstract
Classical molecular dynamics is a computer simulation technique that is in widespread use across many areas of science, from physics and chemistry to materials, biology, and medicine. The method continues to attract criticism due its oft-reported lack of reproducibility which is in part due to a failure to submit it to reliable uncertainty quantification (UQ). Here we show that the uncertainty arises from a combination of (i) the input parameters and (ii) the intrinsic stochasticity of the method controlled by the random seeds. To illustrate the situation, we make a systematic UQ analysis of a widely used molecular dynamics code (NAMD), applied to estimate binding free energy of a ligand-bound to a protein. In particular, we replace the usually fixed input parameters with random variables, systematically distributed about their mean values, and study the resulting distribution of the simulation output. We also perform a sensitivity analysis, which reveals that, out of a total of 175 parameters, just six dominate the variance in the code output. Furthermore, we show that binding energy calculations dampen the input uncertainty, in the sense that the variation around the mean output free energy is less than the variation around the mean of the assumed input distributions, if the output is ensemble-averaged over the random seeds. Without such ensemble averaging, the predicted free energy is five times more uncertain. The distribution of the predicted properties is thus strongly dependent upon the random seed. Owing to this substantial uncertainty, robust statistical measures of uncertainty in molecular dynamics simulation require the use of ensembles in all contexts.
Collapse
Affiliation(s)
- Maxime Vassaux
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Shunzhou Wan
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
| | - Wouter Edeling
- Centrum
Wiskunde & Informatica, Scientific Computing Group, Amsterdam 1090 GB, The Netherlands
| | - Peter V. Coveney
- Centre
for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom
- Informatics
Institute, University of Amsterdam, Amsterdam 1012 WX, The Netherlands
| |
Collapse
|