1
|
Glen AK, Ma C, Mendoza L, Womack F, Wood EC, Sinha M, Acevedo L, Kvarfordt LG, Peene RC, Liu S, Hoffman AS, Roach JC, Deutsch EW, Ramsey SA, Koslicki D. ARAX: a graph-based modular reasoning tool for translational biomedicine. Bioinformatics 2023; 39:btad082. [PMID: 36752514 PMCID: PMC10027432 DOI: 10.1093/bioinformatics/btad082] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 12/17/2022] [Accepted: 02/07/2023] [Indexed: 04/12/2023] Open
Abstract
MOTIVATION With the rapidly growing volume of knowledge and data in biomedical databases, improved methods for knowledge-graph-based computational reasoning are needed in order to answer translational questions. Previous efforts to solve such challenging computational reasoning problems have contributed tools and approaches, but progress has been hindered by the lack of an expressive analysis workflow language for translational reasoning and by the lack of a reasoning engine-supporting that language-that federates semantically integrated knowledge-bases. RESULTS We introduce ARAX, a new reasoning system for translational biomedicine that provides a web browser user interface and an application programming interface (API). ARAX enables users to encode translational biomedical questions and to integrate knowledge across sources to answer the user's query and facilitate exploration of results. For ARAX, we developed new approaches to query planning, knowledge-gathering, reasoning and result ranking and dynamically integrate knowledge providers for answering biomedical questions. To illustrate ARAX's application and utility in specific disease contexts, we present several use-case examples. AVAILABILITY AND IMPLEMENTATION The source code and technical documentation for building the ARAX server-side software and its built-in knowledge database are freely available online (https://github.com/RTXteam/RTX). We provide a hosted ARAX service with a web browser interface at arax.rtx.ai and a web API endpoint at arax.rtx.ai/api/arax/v1.3/ui/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Amy K Glen
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA
| | - Chunyu Ma
- Huck Institutes of the Life Sciences, Pennsylvania State University, State College, PA 16802, USA
| | - Luis Mendoza
- Institute for Systems Biology, Seattle, WA 98109, USA
| | - Finn Womack
- Huck Institutes of the Life Sciences, Pennsylvania State University, State College, PA 16802, USA
| | - E C Wood
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA
| | - Meghamala Sinha
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA
| | - Liliana Acevedo
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA
| | - Lindsey G Kvarfordt
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA
| | - Ross C Peene
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA
| | - Shaopeng Liu
- Huck Institutes of the Life Sciences, Pennsylvania State University, State College, PA 16802, USA
| | - Andrew S Hoffman
- Interdisciplinary Hub for Digitalization and Society, Radboud University, Nijmegen 6500GL, The Netherlands
| | - Jared C Roach
- Institute for Systems Biology, Seattle, WA 98109, USA
| | | | - Stephen A Ramsey
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR 97331, USA
- Department of Biomedical Sciences, Oregon State University, Corvallis, OR 97331, USA
| | - David Koslicki
- Huck Institutes of the Life Sciences, Pennsylvania State University, State College, PA 16802, USA
- Department of Biology, Pennsylvania State University, State College, PA 16801, USA
- Department of Computer Science and Engineering, Pennsylvania State University, State College, PA 16802, USA
| |
Collapse
|
3
|
Schulz S, Hahn U. Part-whole representation and reasoning in formal biomedical ontologies. Artif Intell Med 2005; 34:179-200. [PMID: 15993045 DOI: 10.1016/j.artmed.2004.11.005] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2003] [Revised: 10/21/2004] [Accepted: 11/12/2004] [Indexed: 11/18/2022]
Abstract
OBJECTIVE Biomedical ontologies are typically structured in a biaxial way, reflecting both a taxonomic (is-a) and a partonomic (part-of) hierarchy. Commonly used biomedical terminologies, which incorporate such distinctions excel in terms of broad coverage but lack a rigid formal foundation. The latter, however, is a prerequisite for automated reasoning. For the biomedical domain, it is not only crucial to cope with ontological dependencies between wholes and their parts but also with specific reasoning patterns which underlie the propagation of roles across partonomic hierarchies. METHODS We scale down part-whole reasoning to subsumption-based taxonomic reasoning within the formal framework of a parsimonious variant of description logics (viz. ALC). RESULTS We provide a formal basis for ontological engineering in the domain of biomedicine, as far as part-whole relationships are concerned, by addressing typical reasoning patterns encountered in this domain.
Collapse
Affiliation(s)
- Stefan Schulz
- Department of Medical Informatics, Freiburg University Hospital, Stefan-Meier-Str. 26, D-79104 Freiburg, Germany.
| | | |
Collapse
|
4
|
Joubert M, Dufour JC, Aymard S, Falco L, Fieschi M. Designing and implementing health data and information providers. Int J Med Inform 2005; 74:133-40. [PMID: 15694618 DOI: 10.1016/j.ijmedinf.2004.04.022] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2003] [Revised: 03/17/2004] [Accepted: 04/06/2004] [Indexed: 12/01/2022]
Abstract
OBJECTIVES To model and implement web portals providing access to certified and high-quality information in the domain of health. MATERIAL AND METHODS The Unified Medical Language System (UMLS) knowledge sources of the U.S. National Library of Medicine and principles of implementation resulting from the previous ARIANE project are described. The XML technology that allows files transformations by the means of XSLT is briefly presented. RESULTS The design and implementation of software modules that exploit knowledge sources, operate the translation of a user's query to selected information sources, and wrap obtained results are detailed. Querying documentary and factual medical databases are presented. DISCUSSION Current implementation and wrapping perspectives are discussed in terms of integration and interoperability of health information and data resources.
Collapse
Affiliation(s)
- Michel Joubert
- Laboratoire d'Enseignement et de Recherche en Traitement de l'Information Médicale, Faculté de Médecine, Université de la Méditerranée, 27 bid Jean Moulin, 13005 Marseille, France.
| | | | | | | | | |
Collapse
|
5
|
Jacquelinet C, Burgun A, Delamarre D, Strang N, Djabbour S, Boutin B, Le Beux P. Developing the ontological foundations of a terminological system for end-stage diseases, organ failure, dialysis and transplantation. Int J Med Inform 2003; 70:317-28. [PMID: 12909184 DOI: 10.1016/s1386-5056(03)00046-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The Etablissement français des Greffes (EfG) is a state agency dealing with Public Health issues related to organ, tissue and cell transplantation in France. The evaluation of organ retrieval and transplantation activities, one of its missions, is supported by a national information system (EfG-IS). The EfG-IS is moving towards a new n-tier architecture comprising a terminology server for end-stage diseases, organ failure, dialysis and transplantation (EfG-TS). Following a preliminary audit of the existing coding system and in order to facilitate data recording, to improve the quality of information, to assume compatibility with terminological existing standards and to allow semantic interoperability with other local, national or international registries, a specific work has been conducted on the thesauri to integrate within the EfG-TS. In this paper focusing on the server's content rather than the container, we report first the functional and cognitive requirements that resulted from the preliminary audit. We then describe the methodological approach used to build the terminological server on "sound ontological foundations". We performed the semantic analysis of existing medical terms to set up disease description frame-like structures. These diseases description frames consist of a limited set of nosological discriminating slots such as etiology, semiology, pathology, evolution and associated diseases. Each relevant medical term is thus associated to a concept defined and inserted within a hierarchy according to disease description frame resulting from the semantic analysis. Last, because this terminological server is shared by various transplant and dialysis centers to record patient data at different time point, contextualization of terms appeared as one of the functional requirements. We will also point out various contexts for medical terms and how they have been taken into account.
Collapse
Affiliation(s)
- Christian Jacquelinet
- Département Medical et Scientifique, Etablissement français des Greffes, 5, rue Lacuée, Paris 75012, France.
| | | | | | | | | | | | | |
Collapse
|
7
|
Chu S, Cesnik B. Knowledge representation and retrieval using conceptual graphs and free text document self-organisation techniques. Int J Med Inform 2001; 62:121-33. [PMID: 11470615 DOI: 10.1016/s1386-5056(01)00156-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Hospitals generate and store a large amount of clinical data each year, a significant portion of which is in free text format. Conventional database storage and retrieval algorithms are incapable of effectively processing free text medical data. The rich information and knowledge buried in healthcare records are unavailable for clinical decision-making. We examined a number of techniques for structuring and processing free text documents to effective and efficient for information retrieval and knowledge discovery. One critical success criterion is that the complexity of the techniques must be polynomial both in space and time for them to be able to cope with very large databases. We used conceptual graphs (CG) to capture the structure and semantic information/knowledge contained within the free text medical documents. Ordering and self-organising techniques (lattice techniques and knowledge space) were used to improve organisation of concepts from standard medical nomenclatures and large sets of free text medical documents. Pair-wise union of CG was performed to identify the common generalisation structure and a lattice structure of these CG documents. A combination of all three techniques allowed us to organise a set of 9000 discharge summaries into a generalisation hierarchy that supported efficient and rich information/knowledge retrieval.
Collapse
Affiliation(s)
- S Chu
- Centre of Medical Informatics, Institute of Public Health, Monash Medical Centre, 246 Clayton Road, Clayton, Victoria 3168, Australia.
| | | |
Collapse
|
8
|
Westberg EE, Miller RA. The basis for using the Internet to support the information needs of primary care. J Am Med Inform Assoc 1999; 6:6-25. [PMID: 9925225 PMCID: PMC61341 DOI: 10.1136/jamia.1999.0060006] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/1998] [Accepted: 09/22/1998] [Indexed: 11/03/2022] Open
Abstract
Synthesizing the state of the art from the published literature, this review assesses the basis for employing the Internet to support the information needs of primary care. The authors survey what has been published about the information needs of clinical practice, including primary care, and discuss currently available information resources potentially relevant to primary care. Potential methods of linking information needs with appropriate information resources are described in the context of previous classifications of clinical information needs. Also described is the role that existing terminology mapping systems, such as the National Library of Medicine's Unified Medical Language System, may play in representing and linking information needs to answers.
Collapse
Affiliation(s)
- E E Westberg
- Vanderbilt University, Nashville, Tennessee 37232-8340, USA.
| | | |
Collapse
|