1
|
Bayoudhi L, Sassi N, Jaziri W. An Overview of Biomedical Ontologies for Pandemics and Infectious Diseases Representation. PROCEDIA COMPUTER SCIENCE 2021; 192:4249-4258. [PMID: 34868401 PMCID: PMC8629358 DOI: 10.1016/j.procs.2021.09.201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Several infectious diseases and pandemics have so far emerged. Pandemics are by nature rapidly evolving. In this context, COVID-19 cases, seen recently in a growing number of countries around the world, have been increasing exponentially. So, researchers and responsible actors should take quick decisions to mitigate the spread of such diseases. To do so, several computer science solutions, including ontologies, have been proposed to cope with these issues and save humanity. The ontology is the key formalism which allows modelling knowledge along with its semantics in a formal way. Indeed, the ontology provides unambiguous definitions of a discourse’s domain terms in a machine understandable way. Particularly, biomedical ontologies have ever been developed to capture and represent pandemics and infectious diseases. In this context, this paper aims to scrutinize and study these state-of-the-art ontologies.
Collapse
Affiliation(s)
| | - Najla Sassi
- Miracl Laboratory, Universiy of Sfax, Sfax, Tunisia
- CCSE, Taibah University, Medina, Saudi Arabia
| | - Wassim Jaziri
- Miracl Laboratory, Universiy of Sfax, Sfax, Tunisia
- CCSE, Taibah University, Medina, Saudi Arabia
| |
Collapse
|
2
|
Shen Y, Chen D, Tang B, Yang M, Lei K. EAPB: entropy-aware path-based metric for ontology quality. J Biomed Semantics 2018; 9:20. [PMID: 30097014 PMCID: PMC6086046 DOI: 10.1186/s13326-018-0188-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Accepted: 07/30/2018] [Indexed: 11/10/2022] Open
Abstract
Background Entropy has become increasingly popular in computer science and information theory because it can be used to measure the predictability and redundancy of knowledge bases, especially ontologies. However, current entropy applications that evaluate ontologies consider only single-point connectivity rather than path connectivity, and they assign equal weights to each entity and path. Results We propose an Entropy-Aware Path-Based (EAPB) metric for ontology quality by considering the path information between different vertices and textual information included in the path to calculate the connectivity path of the whole network and dynamic weights between different nodes. The information obtained from structure-based embedding and text-based embedding is multiplied by the connectivity matrix of the entropy computation. EAPB is analytically evaluated against the state-of-the-art criteria. We have performed empirical analysis on real-world medical ontologies and a synthetic ontology based on the following three aspects: ontology statistical information (data quantity), entropy evaluation (data quality), and a case study (ontology structure and text visualization). These aspects mutually demonstrate the reliability of the proposed metric. The experimental results show that the proposed EAPB can effectively evaluate ontologies, especially those in the medical informatics field. Conclusions We leverage path information and textual information to enrich the network representational learning and aid in entropy computation. The analytics and assessments of semantic web can benefit from the structure information but also the text information. We believe that EAPB is helpful for managing ontology development and evaluation projects. Our results are reproducible and we will release the source code and ontology of this work after publication. (Source code and ontology: https://github.com/AnonymousResearcher1/ontologyEvaluate).
Collapse
Affiliation(s)
- Ying Shen
- Shenzhen Key Lab for Information Centric Networking & Blockchain Technology (ICNLAB), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055, Shenzhen, People's Republic of China
| | - Daoyuan Chen
- Shenzhen Key Lab for Information Centric Networking & Blockchain Technology (ICNLAB), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055, Shenzhen, People's Republic of China
| | - Buzhou Tang
- School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), 518055, Shenzhen, People's Republic of China
| | - Min Yang
- SIAT, Chinese Academy of Sciences, 518055, Shenzhen, People's Republic of China
| | - Kai Lei
- Shenzhen Key Lab for Information Centric Networking & Blockchain Technology (ICNLAB), School of Electronics and Computer Engineering, Peking University Shenzhen Graduate School, 518055, Shenzhen, People's Republic of China.
| |
Collapse
|
3
|
Al Manir MS, Brenas JH, Baker CJ, Shaban-Nejad A. A Surveillance Infrastructure for Malaria Analytics: Provisioning Data Access and Preservation of Interoperability. JMIR Public Health Surveill 2018; 4:e10218. [PMID: 29907554 PMCID: PMC6026300 DOI: 10.2196/10218] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Revised: 04/16/2018] [Accepted: 05/08/2018] [Indexed: 12/19/2022] Open
Abstract
Background According to the World Health Organization, malaria surveillance is weakest in countries and regions with the highest malaria burden. A core obstacle is that the data required to perform malaria surveillance are fragmented in multiple data silos distributed across geographic regions. Furthermore, consistent integrated malaria data sources are few, and a low degree of interoperability exists between them. As a result, it is difficult to identify disease trends and to plan for effective interventions. Objective We propose the Semantics, Interoperability, and Evolution for Malaria Analytics (SIEMA) platform for use in malaria surveillance based on semantic data federation. Using this approach, it is possible to access distributed data, extend and preserve interoperability between multiple dynamic distributed malaria sources, and facilitate detection of system changes that can interrupt mission-critical global surveillance activities. Methods We used Semantic Automated Discovery and Integration (SADI) Semantic Web Services to enable data access and improve interoperability, and the graphical user interface-enabled semantic query engine HYDRA to implement the target queries typical of malaria programs. We implemented a custom algorithm to detect changes to community-developed terminologies, data sources, and services that are core to SIEMA. This algorithm reports to a dashboard. Valet SADI is used to mitigate the impact of changes by rebuilding affected services. Results We developed a prototype surveillance and change management platform from a combination of third-party tools, community-developed terminologies, and custom algorithms. We illustrated a methodology and core infrastructure to facilitate interoperable access to distributed data sources using SADI Semantic Web services. This degree of access makes it possible to implement complex queries needed by our user community with minimal technical skill. We implemented a dashboard that reports on terminology changes that can render the services inactive, jeopardizing system interoperability. Using this information, end users can control and reactively rebuild services to preserve interoperability and minimize service downtime. Conclusions We introduce a framework suitable for use in malaria surveillance that supports the creation of flexible surveillance queries across distributed data resources. The platform provides interoperable access to target data sources, is domain agnostic, and with updates to core terminological resources is readily transferable to other surveillance activities. A dashboard enables users to review changes to the infrastructure and invoke system updates. The platform significantly extends the range of functionalities offered by malaria information systems, beyond the state-of-the-art.
Collapse
Affiliation(s)
| | - Jon Haël Brenas
- Oak Ridge National Laboratory Center for for Biomedical Informatics, Department of Pediatrics, The University of Tennessee Health Science Center, Memphis, TN, United States
| | - Christopher Jo Baker
- Department of Computer Science, University of New Brunswick, Saint John, NB, Canada.,IPSNP Computing Inc, Saint John, NB, Canada
| | - Arash Shaban-Nejad
- Oak Ridge National Laboratory Center for for Biomedical Informatics, Department of Pediatrics, The University of Tennessee Health Science Center, Memphis, TN, United States
| |
Collapse
|
4
|
El-Sappagh S, Kwak D, Ali F, Kwak KS. DMTO: a realistic ontology for standard diabetes mellitus treatment. J Biomed Semantics 2018; 9:8. [PMID: 29409535 PMCID: PMC5800094 DOI: 10.1186/s13326-018-0176-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Accepted: 01/04/2018] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Treatment of type 2 diabetes mellitus (T2DM) is a complex problem. A clinical decision support system (CDSS) based on massive and distributed electronic health record data can facilitate the automation of this process and enhance its accuracy. The most important component of any CDSS is its knowledge base. This knowledge base can be formulated using ontologies. The formal description logic of ontology supports the inference of hidden knowledge. Building a complete, coherent, consistent, interoperable, and sharable ontology is a challenge. RESULTS This paper introduces the first version of the newly constructed Diabetes Mellitus Treatment Ontology (DMTO) as a basis for shared-semantics, domain-specific, standard, machine-readable, and interoperable knowledge relevant to T2DM treatment. It is a comprehensive ontology and provides the highest coverage and the most complete picture of coded knowledge about T2DM patients' current conditions, previous profiles, and T2DM-related aspects, including complications, symptoms, lab tests, interactions, treatment plan (TP) frameworks, and glucose-related diseases and medications. It adheres to the design principles recommended by the Open Biomedical Ontologies Foundry and is based on ontological realism that follows the principles of the Basic Formal Ontology and the Ontology for General Medical Science. DMTO is implemented under Protégé 5.0 in Web Ontology Language (OWL) 2 format and is publicly available through the National Center for Biomedical Ontology's BioPortal at http://bioportal.bioontology.org/ontologies/DMTO . The current version of DMTO includes more than 10,700 classes, 277 relations, 39,425 annotations, 214 semantic rules, and 62,974 axioms. We provide proof of concept for this approach to modeling TPs. CONCLUSION The ontology is able to collect and analyze most features of T2DM as well as customize chronic TPs with the most appropriate drugs, foods, and physical exercises. DMTO is ready to be used as a knowledge base for semantically intelligent and distributed CDSS systems.
Collapse
Affiliation(s)
- Shaker El-Sappagh
- Information Systems Department, Faculty of Computers and Informatics, Benha University, Banha Mansura Road, Meit Ghamr - Benha, Banha, Al Qalyubia Governorate 3000-104 Egypt
| | - Daehan Kwak
- Department of Computer Science, Kean University, Union, NJ 07083 USA
| | - Farman Ali
- Department of Information and Communication Engineering, Inha University, 100 Inharo, Nam-gu, Incheon, 22212 South Korea
| | - Kyung-Sup Kwak
- Department of Information and Communication Engineering, Inha University, 100 Inharo, Nam-gu, Incheon, 22212 South Korea
| |
Collapse
|
5
|
Describing the breakbone fever: IDODEN, an ontology for dengue fever. PLoS Negl Trop Dis 2015; 9:e0003479. [PMID: 25646954 PMCID: PMC4315569 DOI: 10.1371/journal.pntd.0003479] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2014] [Accepted: 12/15/2014] [Indexed: 01/25/2023] Open
Abstract
Background Ontologies represent powerful tools in information technology because they enhance interoperability and facilitate, among other things, the construction of optimized search engines. To address the need to expand the toolbox available for the control and prevention of vector-borne diseases we embarked on the construction of specific ontologies. We present here IDODEN, an ontology that describes dengue fever, one of the globally most important diseases that are transmitted by mosquitoes. Methodology/Principal Findings We constructed IDODEN using open source software, and modeled it on IDOMAL, the malaria ontology developed previously. IDODEN covers all aspects of dengue fever, such as disease biology, epidemiology and clinical features. Moreover, it covers all facets of dengue entomology. IDODEN, which is freely available, can now be used for the annotation of dengue-related data and, in addition to its use for modeling, it can be utilized for the construction of other dedicated IT tools such as decision support systems. Conclusions/Significance The availability of the dengue ontology will enable databases hosting dengue-associated data and decision-support systems for that disease to perform most efficiently and to link their own data to those stored in other independent repositories, in an architecture- and software-independent manner. The need for the construction of a dengue ontology arose through the fact that the incidence of dengue fever is on the rise across the world; the number of cases may be three to four times higher than the 100 million estimated by the WHO and a vaccine is still not available in spite of the significant efforts undertaken. Thus, control of dengue fever still relies mostly on controlling its mosquito vectors. Large amounts of entomological, epidemiological and clinical data are generated; these need to be efficiently organized in order to further our comprehension of the disease and its control. IDODEN aims to cover the different aspects and intricacies of dengue fever and syndromes caused by dengue virus(es). It contains more than 5000 terms describing epidemiological data, vaccine development, clinical features, the disease course, and more. We show here that it can be a helpful tool for researchers and that, in addition to allowing sophisticated search strategies, it is also useful for tasks such as modeling.
Collapse
|
6
|
Giraldo-Calderón GI, Emrich SJ, MacCallum RM, Maslen G, Dialynas E, Topalis P, Ho N, Gesing S, Madey G, Collins FH, Lawson D. VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases. Nucleic Acids Res 2014; 43:D707-13. [PMID: 25510499 PMCID: PMC4383932 DOI: 10.1093/nar/gku1117] [Citation(s) in RCA: 437] [Impact Index Per Article: 43.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
VectorBase is a National Institute of Allergy and Infectious Diseases supported Bioinformatics Resource Center (BRC) for invertebrate vectors of human pathogens. Now in its 11th year, VectorBase currently hosts the genomes of 35 organisms including a number of non-vectors for comparative analysis. Hosted data range from genome assemblies with annotated gene features, transcript and protein expression data to population genetics including variation and insecticide-resistance phenotypes. Here we describe improvements to our resource and the set of tools available for interrogating and accessing BRC data including the integration of Web Apollo to facilitate community annotation and providing Galaxy to support user-based workflows. VectorBase also actively supports our community through hands-on workshops and online tutorials. All information and data are freely available from our website at https://www.vectorbase.org/.
Collapse
Affiliation(s)
| | - Scott J Emrich
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Robert M MacCallum
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Gareth Maslen
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Emmanuel Dialynas
- Institute of Molecular Biology and Biotechnology (IMBB), FORTH, Vassilika Vouton,Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece
| | - Pantelis Topalis
- Institute of Molecular Biology and Biotechnology (IMBB), FORTH, Vassilika Vouton,Nikolaou Plastira 100, 70013 Heraklion, Crete, Greece
| | - Nicholas Ho
- Department of Life Sciences, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Sandra Gesing
- Center for Research Computing, University of Notre Dame, Notre Dame, IN 46556, USA
| | | | - Gregory Madey
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Frank H Collins
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN 46556, USA ECK Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA
| | - Daniel Lawson
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
7
|
Salvemini M, Arunkumar KP, Nagaraju J, Sanges R, Petrella V, Tomar A, Zhang H, Zheng W, Saccone G. De novo assembly and transcriptome analysis of the Mediterranean fruit fly Ceratitis capitata early embryos. PLoS One 2014; 9:e114191. [PMID: 25474564 PMCID: PMC4256415 DOI: 10.1371/journal.pone.0114191] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2014] [Accepted: 11/05/2014] [Indexed: 01/04/2023] Open
Abstract
The agricultural pest Ceratitis capitata, also known as the Mediterranean fruit fly or Medfly, belongs to the Tephritidae family, which includes a large number of other damaging pest species. The Medfly has been the first non-drosophilid fly species which has been genetically transformed paving the way for designing genetic-based pest control strategies. Furthermore, it is an experimentally tractable model, in which transient and transgene-mediated RNAi have been successfully used. We applied Illumina sequencing to total RNA preparations of 8–10 hours old embryos of C. capitata, This developmental window corresponds to the blastoderm cellularization stage. In summary, we assembled 42,614 transcripts which cluster in 26,319 unique transcripts of which 11,045 correspond to protein coding genes; we identified several hundreds of long ncRNAs; we found an enrichment of transcripts encoding RNA binding proteins among the highly expressed transcripts, such as CcTRA-2, known to be necessary to establish and, most likely, to maintain female sex of C. capitata. Our study is the first de novo assembly performed for Ceratitis capitata based on Illumina NGS technology during embryogenesis and it adds novel data to the previously published C. capitata EST databases. We expect that it will be useful for a variety of applications such as gene cloning and phylogenetic analyses, as well as to advance genetic research and biotechnological applications in the Medfly and other related Tephritidae.
Collapse
Affiliation(s)
- Marco Salvemini
- Department of Biology, University of Naples Federico II, Naples, Italy
| | | | | | - Remo Sanges
- Stazione Zoologica "Anton Dohrn", Naples, Italy
| | - Valeria Petrella
- Department of Biology, University of Naples Federico II, Naples, Italy
| | - Archana Tomar
- Centre for DNA Fingerprinting and Diagnostics, Hyderabad, India
| | - Hongyu Zhang
- State Key Laboratory of Agricultural Microbiology and Institute of Urban and Horticultural Pests, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Weiwei Zheng
- State Key Laboratory of Agricultural Microbiology and Institute of Urban and Horticultural Pests, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, People's Republic of China
| | - Giuseppe Saccone
- Department of Biology, University of Naples Federico II, Naples, Italy
| |
Collapse
|