1
|
Prokop B, Frolov N, Gelens L. Enhancing model identification with SINDy via nullcline reconstruction. CHAOS (WOODBURY, N.Y.) 2024; 34:063135. [PMID: 38885073 DOI: 10.1063/5.0199311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 05/28/2024] [Indexed: 06/20/2024]
Abstract
Many dynamical systems exhibit oscillatory behavior that can be modeled with differential equations. Recently, these equations have increasingly been derived through data-driven methods, including the transparent technique known as Sparse Identification of Nonlinear Dynamics (SINDy). This paper illustrates the importance of accurately determining the system's limit cycle position in phase space for identifying sparse and effective models. We introduce a method for identifying the limit cycle position and the system's nullclines by applying SINDy to datasets adjusted with various offsets. This approach is evaluated using three criteria: model complexity, coefficient of determination, and generalization error. We applied this method to several models: the oscillatory FitzHugh-Nagumo model, a more complex model consisting of two coupled cubic differential equations with a single stable state, and a multistable model of glycolytic oscillations. Our results confirm that incorporating detailed information about the limit cycle in phase space enhances the accuracy of model identification in oscillatory systems.
Collapse
Affiliation(s)
- Bartosz Prokop
- Department of Cellular and Molecular Medicine, Laboratory of Dynamics in Biological Systems, KU Leuven, 3000 Leuven, Belgium
| | - Nikita Frolov
- Department of Cellular and Molecular Medicine, Laboratory of Dynamics in Biological Systems, KU Leuven, 3000 Leuven, Belgium
| | - Lendert Gelens
- Department of Cellular and Molecular Medicine, Laboratory of Dynamics in Biological Systems, KU Leuven, 3000 Leuven, Belgium
| |
Collapse
|
2
|
Prokop B, Gelens L. From biological data to oscillator models using SINDy. iScience 2024; 27:109316. [PMID: 38523784 PMCID: PMC10959654 DOI: 10.1016/j.isci.2024.109316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 01/18/2024] [Accepted: 02/18/2024] [Indexed: 03/26/2024] Open
Abstract
Periodic changes in the concentration or activity of different molecules regulate vital cellular processes such as cell division and circadian rhythms. Developing mathematical models is essential to better understand the mechanisms underlying these oscillations. Recent data-driven methods like SINDy have fundamentally changed model identification, yet their application to experimental biological data remains limited. This study investigates SINDy's constraints by directly applying it to biological oscillatory data. We identify insufficient resolution, noise, dimensionality, and limited prior knowledge as primary limitations. Using various generic oscillator models of different complexity and/or dimensionality, we systematically analyze these factors. We then propose a comprehensive guide for inferring models from biological data, addressing these challenges step by step. Our approach is validated using glycolytic oscillation data from yeast.
Collapse
Affiliation(s)
- Bartosz Prokop
- Laboratory of Dynamics in Biological Systems, Department of Cellular and Molecular Medicine, KU Leuven, Herestraat 49, 3000 Leuven, Belgium
| | - Lendert Gelens
- Laboratory of Dynamics in Biological Systems, Department of Cellular and Molecular Medicine, KU Leuven, Herestraat 49, 3000 Leuven, Belgium
| |
Collapse
|
3
|
Fusaroli M, Giunchi V, Battini V, Puligheddu S, Khouri C, Carnovale C, Raschi E, Poluzzi E. Enhancing Transparency in Defining Studied Drugs: The Open-Source Living DiAna Dictionary for Standardizing Drug Names in the FAERS. Drug Saf 2024; 47:271-284. [PMID: 38175395 PMCID: PMC10874306 DOI: 10.1007/s40264-023-01391-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/11/2023] [Indexed: 01/05/2024]
Abstract
INTRODUCTION In refining drug safety signals, defining the object of study is crucial. While research has explored the effect of different event definitions, drug definition is often overlooked. The US FDA Adverse Event Reporting System (FAERS) records drug names as free text, necessitating mapping to active ingredients. Although pre-mapped databases exist, the subjectivity and lack of transparency of the mapping process lead to a loss of control over the object of study. OBJECTIVE We implemented the DiAna dictionary, systematically mapping individual free-text instances to their corresponding active ingredients and linking them to the World Health Organization Anatomical Therapeutic Chemical (WHO-ATC) classification. METHODS We retrieved all drug names reported to the FAERS (2004-December 2022). Using existing vocabularies and string editing, we automatically mapped free text to ingredients. We manually revised the mapping and linked it to the ATC classification. RESULTS We retrieved 18,151,842 reports, with 74,143,411 drug entries. We manually checked the first 14,832 terms, up to terms occurring over 200 times (96.88% of total drug entries), to 6282 unique active ingredients. Automatic unchecked translations extend the standardization to 346,854 terms (98.94%). The DiAna dictionary showed a higher sensitivity compared with RxNorm alone, particularly for specific drugs (e.g., rimegepant, adapalene, drospirenone, umeclidinium). The most prominent drug classes in the FAERS were immunomodulating (37.40%) and neurologic drugs (29.19%). CONCLUSION The DiAna dictionary, as a dynamic open-source tool, provides transparency and flexibility, enabling researchers to actively shape drug definitions during the mapping phase. This empowerment enhances accuracy, reproducibility, and interpretability of results.
Collapse
Affiliation(s)
- Michele Fusaroli
- Unit of Pharmacology, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy.
| | - Valentina Giunchi
- Unit of Pharmacology, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| | - Vera Battini
- Department of Biomedical and Clinical Sciences, Pharmacovigilance and Clinical Research, International Centre for Pesticides and Health Risk Prevention, ASST Fatebenefratelli-Sacco, Università degli Studi di Milano, Milan, Italy
| | - Stefano Puligheddu
- Unit of Pharmacology, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| | - Charles Khouri
- Pharmacovigilance Unit, Grenoble Alpes University Hospital, Grenoble, France
- HP2 Laboratory, Inserm U1300, University of Grenoble Alpes, Grenoble, France
| | - Carla Carnovale
- Department of Biomedical and Clinical Sciences, Pharmacovigilance and Clinical Research, International Centre for Pesticides and Health Risk Prevention, ASST Fatebenefratelli-Sacco, Università degli Studi di Milano, Milan, Italy
| | - Emanuel Raschi
- Unit of Pharmacology, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| | - Elisabetta Poluzzi
- Unit of Pharmacology, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| |
Collapse
|
4
|
Alizadeh M, Sampaio Moura N, Schledwitz A, Patil SA, El-Serag H, Ravel J, Raufman JP. A Practical Guide to Evaluating and Using Big Data in Digestive Disease Research. Gastroenterology 2024; 166:240-247. [PMID: 38052336 PMCID: PMC10872385 DOI: 10.1053/j.gastro.2023.11.292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 11/01/2023] [Accepted: 11/27/2023] [Indexed: 12/07/2023]
Affiliation(s)
- Madeline Alizadeh
- The Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland
| | - Natalia Sampaio Moura
- Department of Medicine, Division of Gastroenterology and Hepatology, University of Maryland School of Medicine, Baltimore, Maryland
| | - Alyssa Schledwitz
- Department of Medicine, Division of Gastroenterology and Hepatology, University of Maryland School of Medicine, Baltimore, Maryland
| | - Seema A Patil
- Department of Medicine, Division of Gastroenterology and Hepatology, University of Maryland School of Medicine, Baltimore, Maryland
| | - Hashem El-Serag
- Department of Medicine, Baylor College of Medicine, Houston, Texas
| | - Jacques Ravel
- The Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland
| | - Jean-Pierre Raufman
- Department of Medicine, Division of Gastroenterology and Hepatology, University of Maryland School of Medicine, Baltimore, Maryland; VA Maryland Healthcare System, Baltimore, Maryland; Marlene and Stewart Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, Maryland; Department of Biochemistry and Molecular Biology, University of Maryland School of Medicine, Baltimore, Maryland.
| |
Collapse
|
5
|
Greslehner GP. "Molecular Biology"-Pleonasm or Denotation for a Discipline of Its Own? Reflections on the Origins of Molecular Biology and Its Situation Today. Biomolecules 2023; 13:1511. [PMID: 37892193 PMCID: PMC10605324 DOI: 10.3390/biom13101511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2023] [Revised: 10/05/2023] [Accepted: 10/07/2023] [Indexed: 10/29/2023] Open
Abstract
The disciplinary identity of molecular biology has frequently been called into question. Although the debates might sometimes have been more about creating or debunking myths, defending intellectual territory and the distribution of resources, there are interesting underlying questions about this area of biology and how it is conceptually organized. By looking at the history of molecular biology, its origins and development, I examine the possible criteria for its status as a scientific discipline. Doing so allows us to answer the title question in such a way that offers a reasonable middle ground, where molecular biology can be properly viewed as a viable interdisciplinary program that can very well be called a discipline in its own right, even if no strict boundaries can be established. In addition to this historical analysis, a couple of systematic issues from a philosophy of science perspective allow for some assessment of the current situation and the future of molecular biology.
Collapse
Affiliation(s)
- Gregor P Greslehner
- Department of Philosophy, University of Vienna, Universitätsstraße 7, 1010 Vienna, Austria
| |
Collapse
|
6
|
Rader JA, Pivovarnik MA, Vantilburg ME, Whitehouse LS. PhyloMatcher: a tool for resolving conflicts in taxonomic nomenclature. BIOINFORMATICS ADVANCES 2023; 3:vbad144. [PMID: 37840907 PMCID: PMC10576170 DOI: 10.1093/bioadv/vbad144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/31/2023] [Accepted: 10/03/2023] [Indexed: 10/17/2023]
Abstract
Summary Large-scale comparative studies rely on the application of both phylogenetic trees and phenotypic data, both of which come from a variety of sources, but due to the changing nature of phylogenetic classification over time, many taxon names in comparative datasets do not match the nomenclature in phylogenetic trees. Manual curation of taxonomic synonyms in large comparative datasets can be daunting. To address this issue, we introduce PhyloMatcher, a tool which allows for programmatic querying of the National Center for Biotechnology Information Taxonomy and Global Biodiversity Information Facility databases to find associated synonyms with given target species names. Availability and implementation PhyloMatcher is easily installed as a Python package with pip, or as a standalone GUI application. PhyloMatcher source code and documentation are freely available at https://github.com/Lswhiteh/PhyloMatcher, the GUI application can be downloaded from the Releases page.
Collapse
Affiliation(s)
- Jonathan A Rader
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-3280, United States
| | - Madelyn A Pivovarnik
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-3280, United States
| | - Matias E Vantilburg
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-3280, United States
| | - Logan S Whitehouse
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-7264, United States
| |
Collapse
|
7
|
Giunchi V, Fusaroli M, Hauben M, Raschi E, Poluzzi E. Challenges and Opportunities in Accessing and Analysing FAERS Data: A Call Towards a Collaborative Approach. Drug Saf 2023; 46:921-926. [PMID: 37651086 DOI: 10.1007/s40264-023-01345-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/06/2023] [Indexed: 09/01/2023]
Affiliation(s)
- Valentina Giunchi
- Pharmacology Unit, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy.
| | - Michele Fusaroli
- Pharmacology Unit, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| | | | - Emanuel Raschi
- Pharmacology Unit, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| | - Elisabetta Poluzzi
- Pharmacology Unit, Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| |
Collapse
|
8
|
Rader JA, Pivovarnik MA, Vantilburg ME, Whitehouse LS. PhyloMatcher: a tool for resolving conflicts in taxonomic nomenclature. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.07.552263. [PMID: 37609275 PMCID: PMC10441299 DOI: 10.1101/2023.08.07.552263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Summary Large-scale comparative studies rely on the application of both phylogenetic trees and phenotypic data, both of which come from a variety of sources, but due to the changing nature of phylogenetic classification over time, many taxon names in comparative datasets do not match the nomenclature in phylogenetic trees. Manual curation of taxonomic synonyms in large comparative datasets can be daunting. To address this issue, we introduce PhyloMatcher, a tool which allows for programmatic querying of two commonly used taxonomic databases to find associated synonyms with given target species names. Availability and implementation PhyloMatcher is easily installed as a Python package with pip, or as a standalone GUI application. PhyloMatcher source code and documentation are freely available at https://github.com/Lswhiteh/PhyloMatcher, the GUI application can be downloaded from the Releases page. Contact Lswhiteh@unc.edu. Supplemental Information We provide documentation for PhyloMatcher, including walkthrough instructions for the GUI application on the Releases page of https://github.com/Lswhiteh/PhyloMatcher.
Collapse
Affiliation(s)
- Jonathan A. Rader
- Dept. of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - Matias E. Vantilburg
- Dept. of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Logan S. Whitehouse
- Dept. of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
9
|
Sterner B, Elliott S, Gilbert EE, Franz NM. Unified and pluralistic ideals for data sharing and reuse in biodiversity. Database (Oxford) 2023; 2023:baad048. [PMID: 37465916 PMCID: PMC10354506 DOI: 10.1093/database/baad048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 05/30/2023] [Accepted: 06/27/2023] [Indexed: 07/20/2023]
Abstract
How should billions of species observations worldwide be shared and made reusable? Many biodiversity scientists assume the ideal solution is to standardize all datasets according to a single, universal classification and aggregate them into a centralized, global repository. This ideal has known practical and theoretical limitations, however, which justifies investigating alternatives. To support better community deliberation and normative evaluation, we develop a novel conceptual framework showing how different organizational models, regulative ideals and heuristic strategies are combined to form shared infrastructures supporting data reuse. The framework is anchored in a general definition of data pooling as an activity of making a taxonomically standardized body of information available for community reuse via digital infrastructure. We describe and illustrate unified and pluralistic ideals for biodiversity data pooling and show how communities may advance toward these ideals using different heuristic strategies. We present evidence for the strengths and limitations of the unification and pluralistic ideals based on systemic relationships of power, responsibility and benefit they establish among stakeholders, and we conclude the pluralistic ideal is better suited for biodiversity data.
Collapse
Affiliation(s)
- Beckett Sterner
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| | - Steve Elliott
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| | - Edward E Gilbert
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| | - Nico M Franz
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ 85281, USA
| |
Collapse
|
10
|
Reuter K, Rocca E. Correspondence on 'Mining social media data to investigate patient perceptions regarding DMARD pharmacotherapy for rheumatoid arthritis'. Ann Rheum Dis 2023; 82:e91. [PMID: 33593738 DOI: 10.1136/annrheumdis-2020-219776] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 12/24/2020] [Indexed: 12/13/2022]
Affiliation(s)
- Katja Reuter
- Department of Public Health & Preventive Medicine, SUNY Upstate Medical University, Syracuse, New York, USA
| | - Elena Rocca
- Centre for Applied Philosophy of Science, Norwegian University of Life Sciences, Ås, Akershus, Norway
| |
Collapse
|
11
|
Tretter F, Peters EMJ, Sturmberg J, Bennett J, Voit E, Dietrich JW, Smith G, Weckwerth W, Grossman Z, Wolkenhauer O, Marcum JA. Perspectives of (/memorandum for) systems thinking on COVID-19 pandemic and pathology. J Eval Clin Pract 2023; 29:415-429. [PMID: 36168893 PMCID: PMC9538129 DOI: 10.1111/jep.13772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 09/08/2022] [Accepted: 09/13/2022] [Indexed: 11/29/2022]
Abstract
Is data-driven analysis sufficient for understanding the COVID-19 pandemic and for justifying public health regulations? In this paper, we argue that such analysis is insufficient. Rather what is needed is the identification and implementation of over-arching hypothesis-related and/or theory-based rationales to conduct effective SARS-CoV2/COVID-19 (Corona) research. To that end, we analyse and compare several published recommendations for conceptual and methodological frameworks in medical research (e.g., public health, preventive medicine and health promotion) to current research approaches in medical Corona research. Although there were several efforts published in the literature to develop integrative conceptual frameworks before the COVID-19 pandemic, such as social ecology for public health issues and systems thinking in health care, only a few attempts to utilize these concepts can be found in medical Corona research. For this reason, we propose nested and integrative systemic modelling approaches to understand Corona pandemic and Corona pathology. We conclude that institutional efforts for knowledge integration and systemic thinking, but also for integrated science, are urgently needed to avoid or mitigate future pandemics and to resolve infection pathology.
Collapse
Affiliation(s)
- Felix Tretter
- Bertalanffy Center for the Study of Systems ScienceViennaAustria
| | - Eva M. J. Peters
- Psychoneuroimmunology Laboratory, Department of Psychosomatic Medicine and PsychotherapyJustus‐Liebig‐UniversityGiessenHesseGermany
- Internal Medicine and DermatologyUniversitätsmedizin‐CharitéBerlinGermany
| | - Joachim Sturmberg
- College of Health, Medicine and WellbeingUniversity of NewcastleNewcastleNew South WalesAustralia
- International Society for Systems and Complexity Sciences for HealthPrincetonNew JerseyUSA
| | - Jeanette Bennett
- Department of Psychological Science, StressWAVES Biobehavioral Research LabUniversity of North CarolinaCharlotteNorth CarolinaUSA
| | - Eberhard Voit
- Wallace H. Coulter Department of Biomedical EngineeringGeorgia Institute of Technology and Emory UniversityAtlantaGeorgiaUSA
| | - Johannes W. Dietrich
- Diabetes, Endocrinology and Metabolism Section, Department of Medicine ISt. Josef Hospital, Ruhr PhilosophyBochumGermany
- Diabetes Centre Bochum/HattingenKlinik BlankensteinHattingenGermany
- Centre for Rare Endocrine Diseases (ZSE), Ruhr Centre for Rare Diseases (CeSER)BochumGermany
- Centre for Diabetes Technology, Catholic Hospitals BochumRuhr University BochumBochumGermany
| | - Gary Smith
- International Society for the Systems SciencesPontypoolUK
| | - Wolfram Weckwerth
- Vienna Metabolomics Center (VIME) and Molecular Systems Biology (MOSYS)University of ViennaViennaAustria
| | - Zvi Grossman
- Department of Physiology and Pharmacology, Faculty of MedicineTel Aviv UniversityTel AvivIsrael
| | - Olaf Wolkenhauer
- Department of Systems Biology & BioinformaticsUniversity of RostockRostockGermany
| | | |
Collapse
|
12
|
Zhao K, Rhee SY. Interpreting omics data with pathway enrichment analysis. Trends Genet 2023; 39:308-319. [PMID: 36750393 DOI: 10.1016/j.tig.2023.01.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Revised: 11/24/2022] [Accepted: 01/13/2023] [Indexed: 02/09/2023]
Abstract
Pathway enrichment analysis is indispensable for interpreting omics datasets and generating hypotheses. However, the foundations of enrichment analysis remain elusive to many biologists. Here, we discuss best practices in interpreting different types of omics data using pathway enrichment analysis and highlight the importance of considering intrinsic features of various types of omics data. We further explain major components that influence the outcomes of a pathway enrichment analysis, including defining background sets and choosing reference annotation databases. To improve reproducibility, we describe how to standardize reporting methodological details in publications. This article aims to serve as a primer for biologists to leverage the wealth of omics resources and motivate bioinformatics tool developers to enhance the power of pathway enrichment analysis.
Collapse
Affiliation(s)
- Kangmei Zhao
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA 94025, USA.
| | - Seung Yon Rhee
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA 94025, USA.
| |
Collapse
|
13
|
Tretter F, Marcum J. 'Medical Corona Science': Philosophical and systemic issues: Re-thinking medicine? On the epistemology of Corona medicine. J Eval Clin Pract 2023; 29:405-414. [PMID: 35818671 DOI: 10.1111/jep.13734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 06/15/2022] [Accepted: 06/22/2022] [Indexed: 11/27/2022]
Abstract
RATIONALE, AIMS, AND OBJECTIVES The disciplinary profile and the quality of production of knowledge on Corona pandemic is studied. This scientific field is called 'Medical Corona Science'. METHODS Criteria of analytical philosophy of science and science studies are systematically applied. RESULTS It is shown that mainly auxiliary medical disciplines such as virology and epidemiology but not clinical disciplines provide Corona knowledge. We see a laboratory-centered, technology- and data-driven science, largely ignoring clinical issues. Therefore we call these approaches "Medical Corona Science" (MCS). We see the need to adapt to features of a 'post-normal science', a 'mode 2 science' and of 'Integration and Implementation Science', especially as clinical knowledge must be integrated. There is also a severe lack of theoretical considerations that could help to frame the pandemic as a complex dynamic system. CONCLUSIONS We suggest a deeper meta-scientific discussion of the epistemic value of MCS and propose the application of tools from systems science.
Collapse
Affiliation(s)
- Felix Tretter
- Bertalanffy Center for the Study of Systems Science, Vienna, Austria
| | - James Marcum
- Department of Philosophy, Baylor University, Waco, Texas, USA
| |
Collapse
|
14
|
Bachmann L, Beermann J, Brey T, de Boer HJ, Dannheim J, Edvardsen B, Ericson PGP, Holston KC, Johansson VA, Kloss P, Konijnenberg R, Osborn KJ, Pappalardo P, Pehlke H, Piepenburg D, Struck TH, Sundberg P, Markussen SS, Teschke K, Vanhove MPM. The role of systematics for understanding ecosystem functions: Proceedings of the Zoologica Scripta Symposium, Oslo, Norway, 25 August 2022. ZOOL SCR 2023. [DOI: 10.1111/zsc.12593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]
|
15
|
Ehlman SM, Scherer U, Bierbach D, Francisco FA, Laskowski KL, Krause J, Wolf M. Leveraging big data to uncover the eco-evolutionary factors shaping behavioural development. Proc Biol Sci 2023; 290:20222115. [PMID: 36722081 PMCID: PMC9890127 DOI: 10.1098/rspb.2022.2115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Mapping the eco-evolutionary factors shaping the development of animals' behavioural phenotypes remains a great challenge. Recent advances in 'big behavioural data' research-the high-resolution tracking of individuals and the harnessing of that data with powerful analytical tools-have vastly improved our ability to measure and model developing behavioural phenotypes. Applied to the study of behavioural ontogeny, the unfolding of whole behavioural repertoires can be mapped in unprecedented detail with relative ease. This overcomes long-standing experimental bottlenecks and heralds a surge of studies that more finely define and explore behavioural-experiential trajectories across development. In this review, we first provide a brief guide to state-of-the-art approaches that allow the collection and analysis of high-resolution behavioural data across development. We then outline how such approaches can be used to address key issues regarding the ecological and evolutionary factors shaping behavioural development: developmental feedbacks between behaviour and underlying states, early life effects and behavioural transitions, and information integration across development.
Collapse
Affiliation(s)
- Sean M. Ehlman
- SCIoI Excellence Cluster, 10587 Berlin, Germany,Faculty of Life Sciences, Humboldt University, 10117 Berlin, Germany,Department of Fish Biology, Fisheries, and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries, 12587 Berlin, Germany
| | - Ulrike Scherer
- SCIoI Excellence Cluster, 10587 Berlin, Germany,Faculty of Life Sciences, Humboldt University, 10117 Berlin, Germany,Department of Fish Biology, Fisheries, and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries, 12587 Berlin, Germany
| | - David Bierbach
- SCIoI Excellence Cluster, 10587 Berlin, Germany,Faculty of Life Sciences, Humboldt University, 10117 Berlin, Germany,Department of Fish Biology, Fisheries, and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries, 12587 Berlin, Germany
| | - Fritz A. Francisco
- SCIoI Excellence Cluster, 10587 Berlin, Germany,Faculty of Life Sciences, Humboldt University, 10117 Berlin, Germany
| | - Kate L. Laskowski
- Department of Evolution and Ecology, University of California – Davis, Davis, CA 95616, USA
| | - Jens Krause
- SCIoI Excellence Cluster, 10587 Berlin, Germany,Faculty of Life Sciences, Humboldt University, 10117 Berlin, Germany,Department of Fish Biology, Fisheries, and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries, 12587 Berlin, Germany
| | - Max Wolf
- SCIoI Excellence Cluster, 10587 Berlin, Germany,Department of Fish Biology, Fisheries, and Aquaculture, Leibniz Institute of Freshwater Ecology and Inland Fisheries, 12587 Berlin, Germany
| |
Collapse
|
16
|
Shan M, Jiang C, Qin L, Cheng G. A Review of Computational Methods in Predicting hERG Channel Blockers. ChemistrySelect 2022. [DOI: 10.1002/slct.202201221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Mengyi Shan
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Chen Jiang
- QuanMin RenZheng (HangZhou) Technology Co. Ltd. China
| | - Lu‐Ping Qin
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Gang Cheng
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| |
Collapse
|
17
|
Artificial intelligence and machine-learning approaches in structure and ligand-based discovery of drugs affecting central nervous system. Mol Divers 2022; 27:959-985. [PMID: 35819579 DOI: 10.1007/s11030-022-10489-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 06/21/2022] [Indexed: 12/11/2022]
Abstract
CNS disorders are indications with a very high unmet medical needs, relatively smaller number of available drugs, and a subpar satisfaction level among patients and caregiver. Discovery of CNS drugs is extremely expensive affair with its own unique challenges leading to extremely high attrition rates and low efficiency. With explosion of data in information age, there is hardly any aspect of life that has not been touched by data driven technologies such as artificial intelligence (AI) and machine learning (ML). Drug discovery is no exception, emergence of big data via genomic, proteomic, biological, and chemical technologies has driven pharmaceutical giants to collaborate with AI oriented companies to revolutionise drug discovery, with the goal of increasing the efficiency of the process. In recent years many examples of innovative applications of AI and ML techniques in CNS drug discovery has been reported. Research on therapeutics for diseases such as schizophrenia, Alzheimer's and Parkinsonism has been provided with a new direction and thrust from these developments. AI and ML has been applied to both ligand-based and structure-based drug discovery and design of CNS therapeutics. In this review, we have summarised the general aspects of AI and ML from the perspective of drug discovery followed by a comprehensive coverage of the recent developments in the applications of AI/ML techniques in CNS drug discovery.
Collapse
|
18
|
Grosman L, Muller A, Dag I, Goldgeier H, Harush O, Herzlinger G, Nebenhaus K, Valetta F, Yashuv T, Dick N. Artifact3-D: New software for accurate, objective and efficient 3D analysis and documentation of archaeological artifacts. PLoS One 2022; 17:e0268401. [PMID: 35709137 PMCID: PMC9202890 DOI: 10.1371/journal.pone.0268401] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Accepted: 04/20/2022] [Indexed: 11/19/2022] Open
Abstract
The study of artifacts is fundamental to archaeological research. The features of individual artifacts are recorded, analyzed, and compared within and between contextual assemblages. Here we present and make available for academic-use Artifact3-D, a new software package comprised of a suite of analysis and documentation procedures for archaeological artifacts. We introduce it here, alongside real archaeological case studies to demonstrate its utility. Artifact3-D equips its users with a range of computational functions for accurate measurements, including orthogonal distances, surface area, volume, CoM, edge angles, asymmetry, and scar attributes. Metrics and figures for each of these measurements are easily exported for the purposes of further analysis and illustration. We test these functions on a range of real archaeological case studies pertaining to tool functionality, technological organization, manufacturing traditions, knapping techniques, and knapper skill. Here we focus on lithic artifacts, but the Artifact3-D software can be used on any artifact type to address the needs of modern archaeology. Computational methods are increasingly becoming entwined in the excavation, documentation, analysis, database creation, and publication of archaeological research. Artifact3-D offers functions to address every stage of this workflow. It equips the user with the requisite toolkit for archaeological research that is accurate, objective, repeatable and efficient. This program will help archaeological research deal with the abundant material found during excavations and will open new horizons in research trajectories.
Collapse
Affiliation(s)
- Leore Grosman
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
- * E-mail:
| | - Antoine Muller
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Itamar Dag
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Hadas Goldgeier
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Ortal Harush
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Gadi Herzlinger
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Keren Nebenhaus
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Francesco Valetta
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Talia Yashuv
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nir Dick
- Institute of Archaeology, Mount Scopus, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
19
|
Zouache MA. Variability in Retinal Neuron Populations and Associated Variations in Mass Transport Systems of the Retina in Health and Aging. Front Aging Neurosci 2022; 14:778404. [PMID: 35283756 PMCID: PMC8914054 DOI: 10.3389/fnagi.2022.778404] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 01/13/2022] [Indexed: 11/17/2022] Open
Abstract
Aging is associated with a broad range of visual impairments that can have dramatic consequences on the quality of life of those impacted. These changes are driven by a complex series of alterations affecting interactions between multiple cellular and extracellular elements. The resilience of many of these interactions may be key to minimal loss of visual function in aging; yet many of them remain poorly understood. In this review, we focus on the relation between retinal neurons and their respective mass transport systems. These metabolite delivery systems include the retinal vasculature, which lies within the inner portion of the retina, and the choroidal vasculature located externally to the retinal tissue. A framework for investigation is proposed and applied to identify the structures and processes determining retinal mass transport at the cellular and tissue levels. Spatial variability in the structure of the retina and changes observed in aging are then harnessed to explore the relation between variations in neuron populations and those seen among retinal metabolite delivery systems. Existing data demonstrate that the relation between inner retinal neurons and their mass transport systems is different in nature from that observed between the outer retina and choroid. The most prominent structural changes observed across the eye and in aging are seen in Bruch's membrane, which forms a selective barrier to mass transfers at the interface between the choroidal vasculature and the outer retina.
Collapse
Affiliation(s)
- Moussa A. Zouache
- John A. Moran Eye Center, Department of Ophthalmology and Visual Sciences, University of Utah, Salt Lake City, UT, United States
| |
Collapse
|
20
|
Zadissa A, Apweiler R. Data Mining, Quality and Management in the Life Sciences. Methods Mol Biol 2022; 2449:3-25. [PMID: 35507257 DOI: 10.1007/978-1-0716-2095-3_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
With the evermore emphasis put on open science and its invaluable benefits to the scientific community, it is no longer the case where a research project simply ends with a scientific publication. The benefits of data sharing and reproducibility of results have taken the centerpiece within the life science research supported by FAIR principles that firmly underline the importance of open data. The current data-intensive multidisciplinary research has also highlighted the significance of how data is mined and managed. Here we describe some of the features adopted by EMBL-EBI data resources to support data mining, data quality, and data management. We also highlight how EMBL-EBI has responded to the current pandemic through its data resources.
Collapse
Affiliation(s)
- Amonida Zadissa
- EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK.
| | - Rolf Apweiler
- EMBL-EBI, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| |
Collapse
|
21
|
Silva-Costa LC, Smith BJ. Post-translational Modifications in Brain Diseases: A Future for Biomarkers. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1382:129-141. [DOI: 10.1007/978-3-031-05460-0_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
22
|
Chen T, Ma J, Liu Y, Chen Z, Xiao N, Lu Y, Fu Y, Yang C, Li M, Wu S, Wang X, Li D, He F, Hermjakob H, Zhu Y. iProX in 2021: connecting proteomics data sharing with big data. Nucleic Acids Res 2021; 50:D1522-D1527. [PMID: 34871441 PMCID: PMC8728291 DOI: 10.1093/nar/gkab1081] [Citation(s) in RCA: 336] [Impact Index Per Article: 84.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Revised: 10/16/2021] [Accepted: 10/22/2021] [Indexed: 12/12/2022] Open
Abstract
The rapid development of proteomics studies has resulted in large volumes of experimental data. The emergence of big data platform provides the opportunity to handle these large amounts of data. The integrated proteome resource, iProX (https://www.iprox.cn), which was initiated in 2017, has been greatly improved with an up-to-date big data platform implemented in 2021. Here, we describe the main iProX developments since its first publication in Nucleic Acids Research in 2019. First, a hyper-converged architecture with high scalability supports the submission process. A hadoop cluster can store large amounts of proteomics datasets, and a distributed, RESTful-styled Elastic Search engine can query millions of records within one second. Also, several new features, including the Universal Spectrum Identifier (USI) mechanism proposed by ProteomeXchange, RESTful Web Service API, and a high-efficiency reanalysis pipeline, have been added to iProX for better open data sharing. By the end of August 2021, 1526 datasets had been submitted to iProX, reaching a total data volume of 92.42TB. With the implementation of the big data platform, iProX can support PB-level data storage, hundreds of billions of spectra records, and second-level latency service capabilities that meet the requirements of the fast growing field of proteomics.
Collapse
Affiliation(s)
- Tao Chen
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Jie Ma
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yi Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Zhiguang Chen
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Nong Xiao
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Yutong Lu
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Yinjin Fu
- School of Computer Science and Engineering, Sun Yat-Sen University, Guangzhou 26469, China
| | - Chunyuan Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Mansheng Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Songfeng Wu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Xue Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Dongsheng Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Henning Hermjakob
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China.,European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Yunping Zhu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China.,Basic Medical School, Anhui Medical University, Anhui 230032, China
| |
Collapse
|
23
|
Vitorino R, Choudhury M, Guedes S, Ferreira R, Thongboonkerd V, Sharma L, Amado F, Srivastava S. Peptidomics and proteogenomics: background, challenges and future needs. Expert Rev Proteomics 2021; 18:643-659. [PMID: 34517741 DOI: 10.1080/14789450.2021.1980388] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
INTRODUCTION With available genomic data and related information, it is becoming possible to better highlight mutations or genomic alterations associated with a particular disease or disorder. The advent of high-throughput sequencing technologies has greatly advanced diagnostics, prognostics, and drug development. AREAS COVERED Peptidomics and proteogenomics are the two post-genomic technologies that enable the simultaneous study of peptides and proteins/transcripts/genes. Both technologies add a remarkably large amount of data to the pool of information on various peptides associated with gene mutations or genome remodeling. Literature search was performed in the PubMed database and is up to date. EXPERT OPINION This article lists various techniques used for peptidomic and proteogenomic analyses. It also explains various bioinformatics workflows developed to understand differentially expressed peptides/proteins and their role in disease pathogenesis. Their role in deciphering disease pathways, cancer research, and biomarker discovery using biofluids is highlighted. Finally, the challenges and future requirements to overcome the current limitations for their effective clinical use are also discussed.
Collapse
Affiliation(s)
- Rui Vitorino
- Faculdade de Medicina da Universidade do Porto, Porto, Portugal.,iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro, Portugal.,Laqv/requimte, Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Manisha Choudhury
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, Powai, India
| | - Sofia Guedes
- Laqv/requimte, Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Rita Ferreira
- Laqv/requimte, Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Visith Thongboonkerd
- Medical Proteomics Unit, Office for Research and Development, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | | | - Francisco Amado
- Laqv/requimte, Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Sanjeeva Srivastava
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Mumbai, Powai, India
| |
Collapse
|
24
|
Krantz M, Zimmer D, Adler SO, Kitashova A, Klipp E, Mühlhaus T, Nägele T. Data Management and Modeling in Plant Biology. FRONTIERS IN PLANT SCIENCE 2021; 12:717958. [PMID: 34539712 PMCID: PMC8446634 DOI: 10.3389/fpls.2021.717958] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 07/29/2021] [Indexed: 05/25/2023]
Abstract
The study of plant-environment interactions is a multidisciplinary research field. With the emergence of quantitative large-scale and high-throughput techniques, amount and dimensionality of experimental data have strongly increased. Appropriate strategies for data storage, management, and evaluation are needed to make efficient use of experimental findings. Computational approaches of data mining are essential for deriving statistical trends and signatures contained in data matrices. Although, current biology is challenged by high data dimensionality in general, this is particularly true for plant biology. Plants as sessile organisms have to cope with environmental fluctuations. This typically results in strong dynamics of metabolite and protein concentrations which are often challenging to quantify. Summarizing experimental output results in complex data arrays, which need computational statistics and numerical methods for building quantitative models. Experimental findings need to be combined by computational models to gain a mechanistic understanding of plant metabolism. For this, bioinformatics and mathematics need to be combined with experimental setups in physiology, biochemistry, and molecular biology. This review presents and discusses concepts at the interface of experiment and computation, which are likely to shape current and future plant biology. Finally, this interface is discussed with regard to its capabilities and limitations to develop a quantitative model of plant-environment interactions.
Collapse
Affiliation(s)
- Maria Krantz
- Theoretical Biophysics, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - David Zimmer
- Computational Systems Biology, Technische Universität Kaiserslautern, Kaiserslautern, Germany
| | - Stephan O. Adler
- Theoretical Biophysics, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Anastasia Kitashova
- Plant Evolutionary Cell Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| | - Edda Klipp
- Theoretical Biophysics, Institute of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Timo Mühlhaus
- Computational Systems Biology, Technische Universität Kaiserslautern, Kaiserslautern, Germany
| | - Thomas Nägele
- Plant Evolutionary Cell Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg-Martinsried, Germany
| |
Collapse
|
25
|
Abstract
Cell atlases are essential companions to the genome as they elucidate how genes are used in a cell type-specific manner or how the usage of genes changes over the lifetime of an organism. This review explores recent advances in whole-organism single-cell atlases, which enable understanding of cell heterogeneity and tissue and cell fate, both in health and disease. Here we provide an overview of recent efforts to build cell atlases across species and discuss the challenges that the field is currently facing. Moreover, we propose the concept of having a knowledgebase that can scale with the number of experiments and computational approaches and a new feedback loop for development and benchmarking of computational methods that includes contributions from the users. These two aspects are key for community efforts in single-cell biology that will help produce a comprehensive annotated map of cell types and states with unparalleled resolution.
Collapse
Affiliation(s)
| | - Bruno Tojo
- Instituto Superior Técnico, Universidade de Lisboa, 1049-001 Lisbon, Portugal
| | - Aaron McGeever
- Chan Zuckerberg Biohub, San Francisco, California 94103, USA;
| |
Collapse
|
26
|
Salinas VH, Stüve O. Systems Approaches to Unravel T Cell Function and Therapeutic Potential in Autoimmune Disease. THE JOURNAL OF IMMUNOLOGY 2021; 206:669-675. [PMID: 33526601 DOI: 10.4049/jimmunol.2000954] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 11/02/2020] [Indexed: 12/22/2022]
Abstract
Producing Ag-specific immune responses constrained to target tissues or cells that can be engaged or disengaged at will is predicated on understanding the network of genes governing immune cell function, defining the rules underlying Ag specificity, and synthesizing the tools to engineer them. The successes and limitations of chimeric Ag receptor (CAR) T cells emphasize this goal, and advances in high-throughput sequencing, large-scale genomic screens, single-cell profiling, and genetic modification are providing the necessary data to bring it to fruition-including a broader application into the treatment of autoimmune diseases. In this review, we delve into the implementation of these developments, survey the relevant works, and propose a framework for generating the next generation of synthetic T cells informed by the principles learned from these systems approaches.
Collapse
Affiliation(s)
- Victor H Salinas
- Department of Neurology and Neurotherapeutics, University of Texas Southwestern Medical Center, Dallas, TX 75390; and
| | - Olaf Stüve
- Department of Neurology and Neurotherapeutics, University of Texas Southwestern Medical Center, Dallas, TX 75390; and .,Neurology Section, Medical Service, U.S. Department of Veterans Affairs, North Texas Health Care System, Dallas, TX 75216
| |
Collapse
|
27
|
Williamson HF, Brettschneider J, Caccamo M, Davey RP, Goble C, Kersey PJ, May S, Morris RJ, Ostler R, Pridmore T, Rawlings C, Studholme D, Tsaftaris SA, Leonelli S. Data management challenges for artificial intelligence in plant and agricultural research. F1000Res 2021; 10:324. [PMID: 36873457 PMCID: PMC9975417 DOI: 10.12688/f1000research.52204.1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/12/2021] [Indexed: 09/14/2024] Open
Abstract
Artificial Intelligence (AI) is increasingly used within plant science, yet it is far from being routinely and effectively implemented in this domain. Particularly relevant to the development of novel food and agricultural technologies is the development of validated, meaningful and usable ways to integrate, compare and visualise large, multi-dimensional datasets from different sources and scientific approaches. After a brief summary of the reasons for the interest in data science and AI within plant science, the paper identifies and discusses eight key challenges in data management that must be addressed to further unlock the potential of AI in crop and agronomic research, and particularly the application of Machine Learning (AI) which holds much promise for this domain.
Collapse
Affiliation(s)
- Hugh F. Williamson
- Exeter Centre for the Study of the Life Sciences & Institute for Data Science and Artificial Intelligence, University of Exeter, Exeter, UK
| | | | - Mario Caccamo
- NIAB, National Research Institute of Brewing, East Malling, UK
| | | | - Carole Goble
- Department of Computer Science, University of Manchester, Manchester, UK
| | | | - Sean May
- School of Biosciences, University of Nottingham, Loughborough, UK
| | | | - Richard Ostler
- Department of Computational and Analytical Sciences, Rothamsted Research, Harpendem, UK
| | - Tony Pridmore
- School of Computer Science, University of Nottingham, Nottingham, UK
| | - Chris Rawlings
- Department of Computational and Analytical Sciences, Rothamsted Research, Harpendem, UK
| | | | - Sotirios A. Tsaftaris
- Institute of Digital Communications, University of Edinburgh, Edinburgh, UK
- Alan Turing Institute, London, UK
| | - Sabina Leonelli
- Exeter Centre for the Study of the Life Sciences & Institute for Data Science and Artificial Intelligence, University of Exeter, Exeter, UK
- Alan Turing Institute, London, UK
| |
Collapse
|
28
|
Williamson HF, Brettschneider J, Caccamo M, Davey RP, Goble C, Kersey PJ, May S, Morris RJ, Ostler R, Pridmore T, Rawlings C, Studholme D, Tsaftaris SA, Leonelli S. Data management challenges for artificial intelligence in plant and agricultural research. F1000Res 2021; 10:324. [PMID: 36873457 PMCID: PMC9975417 DOI: 10.12688/f1000research.52204.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/12/2023] [Indexed: 01/19/2023] Open
Abstract
Artificial Intelligence (AI) is increasingly used within plant science, yet it is far from being routinely and effectively implemented in this domain. Particularly relevant to the development of novel food and agricultural technologies is the development of validated, meaningful and usable ways to integrate, compare and visualise large, multi-dimensional datasets from different sources and scientific approaches. After a brief summary of the reasons for the interest in data science and AI within plant science, the paper identifies and discusses eight key challenges in data management that must be addressed to further unlock the potential of AI in crop and agronomic research, and particularly the application of Machine Learning (AI) which holds much promise for this domain.
Collapse
Affiliation(s)
- Hugh F. Williamson
- Exeter Centre for the Study of the Life Sciences & Institute for Data Science and Artificial Intelligence, University of Exeter, Exeter, UK
| | | | - Mario Caccamo
- NIAB, National Research Institute of Brewing, East Malling, UK
| | | | - Carole Goble
- Department of Computer Science, University of Manchester, Manchester, UK
| | | | - Sean May
- School of Biosciences, University of Nottingham, Loughborough, UK
| | | | - Richard Ostler
- Department of Computational and Analytical Sciences, Rothamsted Research, Harpendem, UK
| | - Tony Pridmore
- School of Computer Science, University of Nottingham, Nottingham, UK
| | - Chris Rawlings
- Department of Computational and Analytical Sciences, Rothamsted Research, Harpendem, UK
| | | | - Sotirios A. Tsaftaris
- Institute of Digital Communications, University of Edinburgh, Edinburgh, UK
- Alan Turing Institute, London, UK
| | - Sabina Leonelli
- Exeter Centre for the Study of the Life Sciences & Institute for Data Science and Artificial Intelligence, University of Exeter, Exeter, UK
- Alan Turing Institute, London, UK
| |
Collapse
|
29
|
Thessen AE, Bogdan P, Patterson DJ, Casey TM, Hinojo-Hinojo C, de Lange O, Haendel MA. From Reductionism to Reintegration: Solving society's most pressing problems requires building bridges between data types across the life sciences. PLoS Biol 2021; 19:e3001129. [PMID: 33770077 PMCID: PMC7997011 DOI: 10.1371/journal.pbio.3001129] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Decades of reductionist approaches in biology have achieved spectacular progress, but the proliferation of subdisciplines, each with its own technical and social practices regarding data, impedes the growth of the multidisciplinary and interdisciplinary approaches now needed to address pressing societal challenges. Data integration is key to a reintegrated biology able to address global issues such as climate change, biodiversity loss, and sustainable ecosystem management. We identify major challenges to data integration and present a vision for a "Data as a Service"-oriented architecture to promote reuse of data for discovery. The proposed architecture includes standards development, new tools and services, and strategies for career-development and sustainability.
Collapse
Affiliation(s)
- Anne E. Thessen
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
- * E-mail:
| | - Paul Bogdan
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
| | | | - Theresa M. Casey
- Department of Animal Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - César Hinojo-Hinojo
- Department of Earth System Science, University of California, Irvine, California, United States of America
| | - Orlando de Lange
- Department of Electrical Engineering, University of Washington, Seattle, Washington, United States of America
| | - Melissa A. Haendel
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, Oregon, United States of America
| |
Collapse
|
30
|
Minehart JA, Speer CM. A Picture Worth a Thousand Molecules-Integrative Technologies for Mapping Subcellular Molecular Organization and Plasticity in Developing Circuits. Front Synaptic Neurosci 2021; 12:615059. [PMID: 33469427 PMCID: PMC7813761 DOI: 10.3389/fnsyn.2020.615059] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 12/07/2020] [Indexed: 12/23/2022] Open
Abstract
A key challenge in developmental neuroscience is identifying the local regulatory mechanisms that control neurite and synaptic refinement over large brain volumes. Innovative molecular techniques and high-resolution imaging tools are beginning to reshape our view of how local protein translation in subcellular compartments drives axonal, dendritic, and synaptic development and plasticity. Here we review recent progress in three areas of neurite and synaptic study in situ-compartment-specific transcriptomics/translatomics, targeted proteomics, and super-resolution imaging analysis of synaptic organization and development. We discuss synergies between sequencing and imaging techniques for the discovery and validation of local molecular signaling mechanisms regulating synaptic development, plasticity, and maintenance in circuits.
Collapse
Affiliation(s)
| | - Colenso M. Speer
- Department of Biology, University of Maryland, College Park, MD, United States
| |
Collapse
|
31
|
Premkumar KAR, Bharanikumar R, Palaniappan A. Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy. Front Bioeng Biotechnol 2020; 8:808. [PMID: 32760712 PMCID: PMC7371854 DOI: 10.3389/fbioe.2020.00808] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 06/23/2020] [Indexed: 01/05/2023] Open
Abstract
Riboswitches are cis-regulatory genetic elements that use an aptamer to control gene expression. Specificity to cognate ligand and diversity of such ligands have expanded the functional repertoire of riboswitches to mediate mounting apt responses to sudden metabolic demands and signal changes in environmental conditions. Given their critical role in microbial life, riboswitch characterisation remains a challenging computational problem. Here we have addressed the issue with advanced deep learning frameworks, namely convolutional neural networks (CNN), and bidirectional recurrent neural networks (RNN) with Long Short-Term Memory (LSTM). Using a comprehensive dataset of 32 ligand classes and a stratified train-validate-test approach, we demonstrated the accurate performance of both the deep learning models (CNN and RNN) relative to conventional hyperparameter-optimized machine learning classifiers on all key performance metrics, including the ROC curve analysis. In particular, the bidirectional LSTM RNN emerged as the best-performing learning method for identifying the ligand-specificity of riboswitches with an accuracy >0.99 and macro-averaged F-score of 0.96. An additional attraction is that the deep learning models do not require prior feature engineering. A dynamic update functionality is built into the models to factor for the constant discovery of new riboswitches, and extend the predictive modeling to new classes. Our work would enable the design of genetic circuits with custom-tuned riboswitch aptamers that would effect precise translational control in synthetic biology. The associated software is available as an open-source Python package and standalone resource for use in genome annotation, synthetic biology, and biotechnology workflows.
Collapse
Affiliation(s)
- Keshav Aditya R. Premkumar
- MS Program in Computer Science, Department of Computer Science, College of Engineering and Applied Sciences, Stony Brook University, Stony Brook, NY, United States
| | - Ramit Bharanikumar
- MS in Bioinformatics, Georgia Institute of Technology, Atlanta, GA, United States
| | - Ashok Palaniappan
- Department of Bioinformatics, School of Chemical and Biotechnology, SASTRA Deemed University, Thanjavur, India
| |
Collapse
|
32
|
Chierici M, Bussola N, Marcolini A, Francescatto M, Zandonà A, Trastulla L, Agostinelli C, Jurman G, Furlanello C. Integrative Network Fusion: A Multi-Omics Approach in Molecular Profiling. Front Oncol 2020; 10:1065. [PMID: 32714870 PMCID: PMC7340129 DOI: 10.3389/fonc.2020.01065] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Accepted: 05/28/2020] [Indexed: 12/20/2022] Open
Abstract
Recent technological advances and international efforts, such as The Cancer Genome Atlas (TCGA), have made available several pan-cancer datasets encompassing multiple omics layers with detailed clinical information in large collection of samples. The need has thus arisen for the development of computational methods aimed at improving cancer subtyping and biomarker identification from multi-modal data. Here we apply the Integrative Network Fusion (INF) pipeline, which combines multiple omics layers exploiting Similarity Network Fusion (SNF) within a machine learning predictive framework. INF includes a feature ranking scheme (rSNF) on SNF-integrated features, used by a classifier over juxtaposed multi-omics features (juXT). In particular, we show instances of INF implementing Random Forest (RF) and linear Support Vector Machine (LSVM) as the classifier, and two baseline RF and LSVM models are also trained on juXT. A compact RF model, called rSNFi, trained on the intersection of top-ranked biomarkers from the two approaches juXT and rSNF is finally derived. All the classifiers are run in a 10x5-fold cross-validation schema to warrant reproducibility, following the guidelines for an unbiased Data Analysis Plan by the US FDA-led initiatives MAQC/SEQC. INF is demonstrated on four classification tasks on three multi-modal TCGA oncogenomics datasets. Gene expression, protein expression and copy number variants are used to predict estrogen receptor status (BRCA-ER, N = 381) and breast invasive carcinoma subtypes (BRCA-subtypes, N = 305), while gene expression, miRNA expression and methylation data is used as predictor layers for acute myeloid leukemia and renal clear cell carcinoma survival (AML-OS, N = 157; KIRC-OS, N = 181). In test, INF achieved similar Matthews Correlation Coefficient (MCC) values and 97% to 83% smaller feature sizes (FS), compared with juXT for BRCA-ER (MCC: 0.83 vs. 0.80; FS: 56 vs. 1801) and BRCA-subtypes (0.84 vs. 0.80; 302 vs. 1801), improving KIRC-OS performance (0.38 vs. 0.31; 111 vs. 2319). INF predictions are generally more accurate in test than one-dimensional omics models, with smaller signatures too, where transcriptomics consistently play the leading role. Overall, the INF framework effectively integrates multiple data levels in oncogenomics classification tasks, improving over the performance of single layers alone and naive juxtaposition, and provides compact signature sizes.
Collapse
Affiliation(s)
| | - Nicole Bussola
- Fondazione Bruno Kessler, Trento, Italy
- University of Trento, Trento, Italy
| | | | - Margherita Francescatto
- Fondazione Bruno Kessler, Trento, Italy
- Department of Medical, Surgical and Health Sciences, University of Trieste, Trieste, Italy
| | | | | | | | | | | |
Collapse
|
33
|
Hartmann J, Wong M, Gallo E, Gilmour D. An image-based data-driven analysis of cellular architecture in a developing tissue. eLife 2020; 9:e55913. [PMID: 32501214 PMCID: PMC7274788 DOI: 10.7554/elife.55913] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Accepted: 05/24/2020] [Indexed: 12/22/2022] Open
Abstract
Quantitative microscopy is becoming increasingly crucial in efforts to disentangle the complexity of organogenesis, yet adoption of the potent new toolbox provided by modern data science has been slow, primarily because it is often not directly applicable to developmental imaging data. We tackle this issue with a newly developed algorithm that uses point cloud-based morphometry to unpack the rich information encoded in 3D image data into a straightforward numerical representation. This enabled us to employ data science tools, including machine learning, to analyze and integrate cell morphology, intracellular organization, gene expression and annotated contextual knowledge. We apply these techniques to construct and explore a quantitative atlas of cellular architecture for the zebrafish posterior lateral line primordium, an experimentally tractable model of complex self-organized organogenesis. In doing so, we are able to retrieve both previously established and novel biologically relevant patterns, demonstrating the potential of our data-driven approach.
Collapse
Affiliation(s)
- Jonas Hartmann
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory (EMBL)HeidelbergGermany
| | - Mie Wong
- Institute of Molecular Life Sciences, University of Zurich (UZH)ZurichSwitzerland
| | - Elisa Gallo
- Cell Biology and Biophysics Unit, European Molecular Biology Laboratory (EMBL)HeidelbergGermany
- Institute of Molecular Life Sciences, University of Zurich (UZH)ZurichSwitzerland
- Collaboration for joint PhD degree between EMBL and Heidelberg University, Faculty of BiosciencesHeidelbergGermany
| | - Darren Gilmour
- Institute of Molecular Life Sciences, University of Zurich (UZH)ZurichSwitzerland
| |
Collapse
|
34
|
Xu C, Yang H, Sun L, Cao X, Hou Y, Cai Q, Jia P, Wang Y. Detecting Lung Cancer Trends by Leveraging Real-World and Internet-Based Data: Infodemiology Study. J Med Internet Res 2020; 22:e16184. [PMID: 32163035 PMCID: PMC7099398 DOI: 10.2196/16184] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Revised: 10/28/2019] [Accepted: 02/22/2020] [Indexed: 01/13/2023] Open
Abstract
Background Internet search data on health-related terms can reflect people’s concerns about their health status in near real time, and hence serve as a supplementary metric of disease characteristics. However, studies using internet search data to monitor and predict chronic diseases at a geographically finer state-level scale are sparse. Objective The aim of this study was to explore the associations of internet search volumes for lung cancer with published cancer incidence and mortality data in the United States. Methods We used Google relative search volumes, which represent the search frequency of specific search terms in Google. We performed cross-sectional analyses of the original and disease metrics at both national and state levels. A smoothed time series of relative search volumes was created to eliminate the effects of irregular changes on the search frequencies and obtain the long-term trends of search volumes for lung cancer at both the national and state levels. We also performed analyses of decomposed Google relative search volume data and disease metrics at the national and state levels. Results The monthly trends of lung cancer-related internet hits were consistent with the trends of reported lung cancer rates at the national level. Ohio had the highest frequency for lung cancer-related search terms. At the state level, the relative search volume was significantly correlated with lung cancer incidence rates in 42 states, with correlation coefficients ranging from 0.58 in Virginia to 0.94 in Oregon. Relative search volume was also significantly correlated with mortality in 47 states, with correlation coefficients ranging from 0.58 in Oklahoma to 0.94 in North Carolina. Both the incidence and mortality rates of lung cancer were correlated with decomposed relative search volumes in all states excluding Vermont. Conclusions Internet search behaviors could reflect public awareness of lung cancer. Research on internet search behaviors could be a novel and timely approach to monitor and estimate the prevalence, incidence, and mortality rates of a broader range of cancers and even more health issues.
Collapse
Affiliation(s)
- Chenjie Xu
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Hongxi Yang
- School of Public Health, Tianjin Medical University, Tianjin, China.,School of Public Health, Yale University, New Haven, CT, United States
| | - Li Sun
- School of Nursing, Tianjin Medical University, Tianjin, China
| | - Xinxi Cao
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Yabing Hou
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Qiliang Cai
- School of Public Health, Tianjin Medical University, Tianjin, China.,The Second Hospital of Tianjin Medical University, Tianjin, China
| | - Peng Jia
- Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong, China.,International Initiative on Spatial Lifecourse Epidemiology, Hong Kong, China.,Faculty of Geo-information Science and Earth Observation, University of Twente, Enschede, Netherlands
| | - Yaogang Wang
- School of Public Health, Tianjin Medical University, Tianjin, China
| |
Collapse
|
35
|
Gatherer D. Reflections on integrating bioinformatics into the undergraduate curriculum: The Lancaster experience. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION : A BIMONTHLY PUBLICATION OF THE INTERNATIONAL UNION OF BIOCHEMISTRY AND MOLECULAR BIOLOGY 2020; 48:118-127. [PMID: 31793726 DOI: 10.1002/bmb.21320] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 10/25/2019] [Accepted: 11/15/2019] [Indexed: 06/10/2023]
Abstract
Bioinformatics is an essential discipline for biologists. It also has a reputation of being difficult for those without a strong quantitative and computer science background. At Lancaster University, we have developed modules for the integration of bioinformatics skills training into our undergraduate biology degree portfolio. This article describes those modules, situating them in the context of the accumulated quarter century of literature on bioinformatics education. The constant evolution of bioinformatics as a discipline is emphasized, drawing attention to the continual necessity to revise and upgrade those skills being taught, even at undergraduate level. Our overarching aim is to equip students both with a portfolio of skills in the currently most essential bioinformatics tools and with the confidence to continue their own bioinformatics skills development at postgraduate or professional level.
Collapse
Affiliation(s)
- Derek Gatherer
- Division of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster, UK
| |
Collapse
|