1
|
Bouaud J, Bachimont B, Charlet J, Séroussi B, Boisvieux JF, Zweigenbaum P. From Text to Knowledge: a Unifying Document-Centered View of Analyzed Medical Language. Methods Inf Med 2018. [DOI: 10.1055/s-0038-1634559] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
AbstractAlthough medical language processing (MLP) has achieved some success, the actual use and dissemination of data extracted from free text by MLP systems is still very limited. We claim that the adoption of an ‘enricheddocument’ paradigm (or ‘document-centered’ view) can help to address this issue. We present this paradigm and explain how it can be implemented, then discuss its expected benefits both for end-users and MLP researchers.
Collapse
|
2
|
Bachimont B, Bouaud J, Charlet J, Boisvieux JF, Zweigenbaum P. Issues in the Structuring and Acquisition of an Ontology for Medical Language Understanding. Methods Inf Med 2018. [DOI: 10.1055/s-0038-1634577] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Abstract:Medical natural language understanding basically aims at representing the contents of medical texts in a formal, conceptual representation. The understanding process itself increasingly relies on a body of domain knowledge, generally expressed in the same conceptual formalism. The design of such a conceptual representation is a key knowledge-acquisition issue. When representing knowledge, the most important point is to ensure that the formal exploitation of the knowledge representation conforms to its meaning in the domain. We examined some methodological and theoretical principles to enforce this conformity. These principles result from our experience in MENELAS, a medical language understanding project.
Collapse
|
3
|
Névéol A, Zweigenbaum P. Making Sense of Big Textual Data for Health Care: Findings from the Section on Clinical Natural Language Processing. Yearb Med Inform 2017; 26:228-234. [PMID: 29063569 PMCID: PMC6239234 DOI: 10.15265/iy-2017-027] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Indexed: 02/01/2023] Open
Abstract
Objectives: To summarize recent research and present a selection of the best papers published in 2016 in the field of clinical Natural Language Processing (NLP). Method: A survey of the literature was performed by the two section editors of the IMIA Yearbook NLP section. Bibliographic databases were searched for papers with a focus on NLP efforts applied to clinical texts or aimed at a clinical outcome. Papers were automatically ranked and then manually reviewed based on titles and abstracts. A shortlist of candidate best papers was first selected by the section editors before being peer-reviewed by independent external reviewers. Results: The five clinical NLP best papers provide a contribution that ranges from emerging original foundational methods to transitioning solid established research results to a practical clinical setting. They offer a framework for abbreviation disambiguation and coreference resolution, a classification method to identify clinically useful sentences, an analysis of counseling conversations to improve support to patients with mental disorder and grounding of gradable adjectives. Conclusions: Clinical NLP continued to thrive in 2016, with an increasing number of contributions towards applications compared to fundamental methods. Fundamental work addresses increasingly complex problems such as lexical semantics, coreference resolution, and discourse analysis. Research results translate into freely available tools, mainly for English.
Collapse
Affiliation(s)
- A. Névéol
- LIMSI, CNRS, Université Paris Saclay, Orsay, France
| | | | | |
Collapse
|
4
|
Abstract
OBJECTIVE To summarize recent research and present a selection of the best papers published in 2014 in the field of clinical Natural Language Processing (NLP). METHOD A systematic review of the literature was performed by the two section editors of the IMIA Yearbook NLP section by searching bibliographic databases with a focus on NLP efforts applied to clinical texts or aimed at a clinical outcome. A shortlist of candidate best papers was first selected by the section editors before being peer-reviewed by independent external reviewers. RESULTS The clinical NLP best paper selection shows that the field is tackling text analysis methods of increasing depth. The full review process highlighted five papers addressing foundational methods in clinical NLP using clinically relevant texts from online forums or encyclopedias, clinical texts from Electronic Health Records, and included studies specifically aiming at a practical clinical outcome. The increased access to clinical data that was made possible with the recent progress of de-identification paved the way for the scientific community to address complex NLP problems such as word sense disambiguation, negation, temporal analysis and specific information nugget extraction. These advances in turn allowed for efficient application of NLP to clinical problems such as cancer patient triage. Another line of research investigates online clinically relevant texts and brings interesting insight on communication strategies to convey health-related information. CONCLUSIONS The field of clinical NLP is thriving through the contributions of both NLP researchers and healthcare professionals interested in applying NLP techniques for concrete healthcare purposes. Clinical NLP is becoming mature for practical applications with a significant clinical impact.
Collapse
Affiliation(s)
- A Névéol
- Aurélie Névéol, LIMSI CNRS UPR 3251, Rue John von Neumann, Campus Universitaire d'Orsay, 91405 Orsay cedex, France, E-mail: {neveol,pz}@limsi.fr
| | | |
Collapse
|
5
|
Abstract
OBJECTIVE To summarize recent research and present a selection of the best papers published in 2015 in the field of clinical Natural Language Processing (NLP). METHOD A systematic review of the literature was performed by the two section editors of the IMIA Yearbook NLP section by searching bibliographic databases with a focus on NLP efforts applied to clinical texts or aimed at a clinical outcome. Section editors first selected a shortlist of candidate best papers that were then peer-reviewed by independent external reviewers. RESULTS The clinical NLP best paper selection shows that clinical NLP is making use of a variety of texts of clinical interest to contribute to the analysis of clinical information and the building of a body of clinical knowledge. The full review process highlighted five papers analyzing patient-authored texts or seeking to connect and aggregate multiple sources of information. They provide a contribution to the development of methods, resources, applications, and sometimes a combination of these aspects. CONCLUSIONS The field of clinical NLP continues to thrive through the contributions of both NLP researchers and healthcare professionals interested in applying NLP techniques to impact clinical practice. Foundational progress in the field makes it possible to leverage a larger variety of texts of clinical interest for healthcare purposes.
Collapse
Affiliation(s)
- A Névéol
- Aurélie Névéol, LIMSI CNRS UPR 3251, Université Paris Saclay, Rue John von Neumann, 91400 Orsay, France, E-mail:
| | - P Zweigenbaum
- Pierre Zweigenbaum, LIMSI CNRS UPR 3251, Université Paris Saclay, Rue John von Neumann, 91400 Orsay, France, E-mail:
| |
Collapse
|
6
|
Baud RH, Nyström M, Borin L, Evans R, Schulz S, Zweigenbaum P. Interchanging lexical information for a multilingual dictionary. AMIA Annu Symp Proc 2005; 2005:31-5. [PMID: 16778996 PMCID: PMC1560452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
OBJECTIVE To facilitate the interchange of lexical information for multiple languages in the medical domain. To pave the way for the emergence of a generally available truly multilingual electronic dictionary in the medical domain. METHODS An interchange format has to be neutral relative to the target languages. It has to be consistent with current needs of lexicon authors, present and future. An active interaction between six potential authors aimed to determine a common denominator striking the right balance between richness of content and ease of use for lexicon providers. RESULTS A simple list of relevant attributes has been established and published. The format has the potential for collecting relevant parts of a future multilingual dictionary. An XML version is available. CONCLUSION This effort makes feasible the exchange of lexical information between research groups. Interchange files are made available in a public repository. This procedure opens the door to a true multilingual dictionary, in the awareness that the exchange of lexical information is (only) a necessary first step, before structuring the corresponding entries in different languages.
Collapse
Affiliation(s)
- R H Baud
- Service of Medical Informatics, University Hospitals of Geneva, Switzerland
| | | | | | | | | | | |
Collapse
|
7
|
Darmoni SJ, Jarrousse E, Zweigenbaum P, Le Beux P, Namer F, Baud R, Joubert M, Vallée H, Côté RA, Buemi A, Bourigault D, Recource G, Jeanneau S, Rodrigues JM. VUMeF: extending the French involvement in the UMLS Metathesaurus. AMIA Annu Symp Proc 2003; 2003:824. [PMID: 14728329 PMCID: PMC1480335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 04/28/2023]
Abstract
A considerable number of robust vocabularies and thesauri have been developed for the healthcare and biomedical domain. No single vocabulary, however; provides complete coverage of the information needs from a public health perspective. The results of an investigation of vocabulary sources for the development of a comprehensive controlled vocabulary for the public health domain at the Centers for Disease Control and Prevention (CDC) is presented.
Collapse
Affiliation(s)
- S J Darmoni
- CISMeF, Rouen University Hospital, France & L@STICS, PSI fRE CNRS 2645
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Zweigenbaum P, Jacquemart P, Grabar N, Habert B. Building a text corpus for representing the variety of medical language. Stud Health Technol Inform 2002; 84:290-4. [PMID: 11604751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
Medical language processing has focused until recently on a few types of textual documents. However, a much larger variety of document types are used in different settings. It has been showed that Natural Language Processing (NLP) tools can exhibit very different behavior on different types of texts. Without better informed knowledge about the differential performance of NLP tools on a variety of medical text types, it will be difficult to control the extension of their application to different medical documents. We endeavored to provide a basis for such informed assessment: the construction of a large corpus of medical text samples. We propose a framework for designing such a corpus: a set of descriptive dimensions and a standardized encoding of both meta-information (implementing these dimensions) and content. We present a proof of concept demonstration by encoding an initial corpus of text samples according to these principles.
Collapse
Affiliation(s)
- P Zweigenbaum
- DIAM Service d'Informatique Médicale, Assistance Publique, Hôpitaux de Paris, Département de Biomathématiques, Université Paris 75634 Paris Cedex 13, France.
| | | | | | | |
Collapse
|
9
|
Zweigenbaum P, Darmoni SJ, Grabar N, Douyère M, Benichou J. An assessment of the visibility of MeSH-indexed medical web catalogs through search engines. Proc AMIA Symp 2002:954-8. [PMID: 12463965 PMCID: PMC2244418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/27/2023] Open
Abstract
Manually indexed Internet health catalogs such as CliniWeb or CISMeF provide resources for retrieving high-quality health information. Users of these quality-controlled subject gateways are most often referred to them by general search engines such as Google, AltaVista, etc. This raises several questions, among which the following: what is the relative visibility of medical Internet catalogs through search engines? This study addresses this issue by measuring and comparing the visibility of six major, MeSH-indexed health catalogs through four different search engines (AltaVista, Google, Lycos, Northern Light) in two languages (English and French). Over half a million queries were sent to the search engines; for most of these search engines, according to our measures at the time the queries were sent, the most visible catalog for English MeSH terms was CliniWeb and the most visible one for French MeSH terms was CISMeF.
Collapse
Affiliation(s)
- P Zweigenbaum
- STIM, DSI, Assistance Publique--Paris Hospitals & Département de Biomathématiques, Université Paris 6, Paris, France
| | | | | | | | | |
Collapse
|
10
|
Chiao YC, Zweigenbaum P. Looking for French-English translations in comparable medical corpora. Proc AMIA Symp 2002:150-4. [PMID: 12463805 PMCID: PMC2244154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/27/2023] Open
Abstract
Cross-language retrieval of medical information needs to translate input queries into target language queries. It must be prepared to cope with 'new' words not yet listed in a multilingual lexicon. We address the issue of finding translational equivalents of such 'unknown' words from French to English in the medical domain. We rely on non-parallel, comparable corpora and an initial bilingual medical lexicon. We compare the distributional contexts of source and target words, testing several weighting factors and similarity measures. For the best combination (the Jaccard similarity measure with or without weighting), the correct translation is found in the top 10 candidates for more than 60% of the test words. This shows the potential of this technique to help extending bilingual medical lexicons.
Collapse
|
11
|
Zweigenbaum P, Darmoni SJ, Grabar N. The contribution of morphological knowledge to French MeSH mapping for information retrieval. Proc AMIA Symp 2001:796-800. [PMID: 11825295 PMCID: PMC2243345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023] Open
Abstract
MeSH-indexed Internet health directories must provide a mapping from natural language queries to MeSH terms so that both health professionals and the general public can query their contents. We describe here the design of lexical knowledge bases for mapping French expressions to MeSH terms, and the initial evaluation of their contribution to Doc'CISMeF, the search tool of a MeSH-indexed directory of French-language medical Internet resources. The observed trend is in favor of the use of morphological knowledge as a moderate (approximately 5%) but effective factor for improving query to term mapping capabilities.
Collapse
Affiliation(s)
- P Zweigenbaum
- DIAM--Service d'Informatique Médicale, DSI, Assistance Publique--Hôpitaux de Paris and Département de Biomathématiques, Université Paris 6, France.
| | | | | |
Collapse
|
12
|
Grabar N, Zweigenbaum P. A general method for sifting linguistic knowledge from structured terminologies. Proc AMIA Symp 2000:310-4. [PMID: 11079895 PMCID: PMC2243874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023] Open
Abstract
Morphological knowledge is useful for medical language processing, information retrieval and terminology or ontology development. We show how a large volume of morphological associations between words can be learnt from existing medical terminologies by taking advantage of the semantic relations already encoded between terms in these terminologies: synonymy, hierarchy and transversal relations. The method proposed relies on no a priori linguistic knowledge. Since it can work with different relations between terms, it can be applied to any structured terminology. Tested on SNOMED and ICD in French and English, it proves to identify fairly reliable morphological relations (precision > 90%) with a good coverage (over 88% compared to the UMLS lexical variant generation program). For English words with a stem longer than 3 characters, recall reaches 98.8% for inflection and 94.7% for derivation.
Collapse
Affiliation(s)
- N Grabar
- DIAM-Service d'Informatique Médicale, DSI, Assistance Publique-Paris Hospitals & Département de Biomathématiques, Université Paris 6, Paris, France.
| | | |
Collapse
|
13
|
Bodenreider O, Zweigenbaum P. Identifying proper names in parallel medical terminologies. Stud Health Technol Inform 2000; 77:443-7. [PMID: 11187591 PMCID: PMC1790050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]
Abstract
We propose several criteria to identify proper names in biomedical terminologies. Traditional, pattern-based methods that rely on the immediate context of a proper name are not applicable. However, the availability of translations of some terminologies supports methods based on invariant words instead. A combination of five criteria achieved 86% precision and 88% recall on the 16,401 word forms of the International Classification of Diseases.
Collapse
Affiliation(s)
- O Bodenreider
- U.S. National Library of Medicine, Bethesda, MD, USA
| | | |
Collapse
|
14
|
Zweigenbaum P, Courtois P. Acquisition of lexical resources from SNOMED for medical language processing. Stud Health Technol Inform 1999; 52 Pt 1:586-90. [PMID: 10384522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
Abstract
Medical language processing depends on large-coverage, fine-grained specialized lexicons. The vast majority of existing electronic lexicons concern the English language; for other languages such as French, resources are scarce. In contrast, large medical thesauri exist in numerous languages, including French. Our goal was to study what kind of linguistic information could be extracted from thesauri into a lexicon, in which places human intervention is necessary, and what kind of issues arise in this process. We designed in this purpose a method to build a semantic lexicon from a subset of the SNOMED axes in their French translation.
Collapse
Affiliation(s)
- P Zweigenbaum
- DIAM Service d'Informatique Médicale, Assistance Publique-Hôpitaux de Paris.
| | | |
Collapse
|
15
|
Grabar N, Zweigenbaum P. Language-independent automatic acquisition of morphological knowledge from synonym pairs. Proc AMIA Symp 1999:77-81. [PMID: 10566324 PMCID: PMC2232580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/14/2023] Open
Abstract
Medical words exhibit a rich and productive morphology. Beyond simple inflection, derivation and composition are a common way to form new words. Morphological knowledge is therefore very important for any medical language processing application. Whereas rich morphological resources are available for the English medical language with the UMLS Specialist Lexicon, no such resources are publicly available for French or most other languages. We propose a simple and powerful method to help acquire automatically such knowledge. This method takes advantage of the synonym terms present in medical terminologies. In a bootstrapping step, it detects morphologically related words from which it learns "derivation rules". In an expansion step, it then applies these rules to the whole vocabulary available. Our goal is to acquire data for French and other languages for which they are not available. However, to evaluate the efficiency of the method, we tested it on English in a setting which is close to that prevailing for French, and we confronted its results to those obtained with the Specialist lexical variant generation tool.
Collapse
Affiliation(s)
- N Grabar
- DIAM-Service d'Informatique Médicale, DSI, Paris, France
| | | |
Collapse
|
16
|
Zweigenbaum P, Bouaud J, Bachimont B, Charlet J, Séroussi B, Boisvieux JF. From text to knowledge: a unifying document-centered view of analyzed medical language. Methods Inf Med 1998; 37:384-93. [PMID: 9865036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
Abstract
Although medical language processing (MLP) has achieved some success, the actual use and dissemination of data extracted from free text by MLP systems is still very limited. We claim that the adoption of an 'enriched-document' paradigm (or 'document-centered' view) can help to address this issue. We present this paradigm and explain how it can be implemented, then discuss its expected benefits both for end-users and MLP researchers.
Collapse
Affiliation(s)
- P Zweigenbaum
- Service d'informatique médicale, Assistance Publique, Hôpitaux de Paris.
| | | | | | | | | | | |
Collapse
|
17
|
Charlet J, Bachimont B, Brunie V, el Kassar S, Zweigenbaum P, Boisvieux JF. Hospitexte: towards a document-based hypertextual electronic medical record. Proc AMIA Symp 1998:713-7. [PMID: 9929312 PMCID: PMC2232226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023] Open
Abstract
The patient record is a repository for knowledge about a patient. Work in Artificial Intelligence and knowledge representation has evidenced the intrinsic difficulty of formalizing knowledge for computer processing. It is therefore not a surprise that most attempts at computerizing the patient record have only had a limited degree of success or applicability. We claim that this is due to the fact that medicine is an empirical domain, and thus fundamentally resists formalization. Therefore, the only way medical knowledge can be fully expressed is through natural languages which is indeed what clinicians actually use. We proposed and designed an electronic medical record which adheres to this hypothesis and where structured documents play a prominent role.
Collapse
Affiliation(s)
- J Charlet
- Service d'Informatique Médicale, AP-HP & Dép. de Biomathématiques, Univ. Paris 6, France
| | | | | | | | | | | |
Collapse
|
18
|
Nazarenko A, Zweigenbaum P, Bouaud J, Habert B. Corpus-based identification and refinement of semantic classes. Proc AMIA Annu Fall Symp 1997:585-9. [PMID: 9357693 PMCID: PMC2233482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Medical Language Processing (MLP), especially in specific domains, requires fine-grained semantic lexica. We examine whether robust natural language processing tools used on a representative corpus of a domain help in building and refining a semantic categorization. We test this hypothesis with ZELLIG, a corpus analysis tool. The first clusters we obtain are consistent with a model of the domain, as found in the SNOMED nomenclature. They correspond to coarse-grained semantic categories, but isolate as well lexical idiosyncrasies belonging to the clinical sub-language. Moreover, they help categorize additional words.
Collapse
Affiliation(s)
- A Nazarenko
- Laboratoire d'Informatique de Paris-Nord, Université Paris 13
| | | | | | | |
Collapse
|
19
|
Zweigenbaum P, Bouaud J, Bachimont B, Charlet J, Boisvieux JF. Evaluating a normalized conceptual representation produced from natural language patient discharge summaries. Proc AMIA Annu Fall Symp 1997:590-4. [PMID: 9357694 PMCID: PMC2233459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The Menelas project aimed to produce a normalized conceptual representation from natural language patient discharge summaries. Because of the complex and detailed nature of conceptual representations, evaluating the quality of output of such a system is difficult. We present the method designed to measure the quality of Menelas output, and its application to the state of the French Menelas prototype as of the end of the project. We examine this method in the framework recently proposed by Friedman and Hripcsak. We also propose two conditions which enable to reduce the evaluation preparation workload.
Collapse
Affiliation(s)
- P Zweigenbaum
- DIAM, Service d'Informatique Médicale, Assistance Publique, Hôpitaux de Paris.
| | | | | | | | | |
Collapse
|
20
|
Zweigenbaum P, Bachimont B, Bouaud J, Charlet J, Boisvieux JF. Issues in the structuring and acquisition of an ontology for medical language understanding. Methods Inf Med 1995; 34:15-24. [PMID: 9082125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Medical natural language understanding basically aims at representing the contents of medical texts in a formal, conceptual representation. The understanding process itself increasingly relies on a body of domain knowledge, generally expressed in the same conceptual formalism. The design of such a conceptual representation is a key knowledge-acquisition issue. When representing knowledge, the most important point is to ensure that the formal exploitation of the knowledge representation conforms to its meaning in the domain. We examined some methodological and theoretical principles to enforce this conformity. These principles result from our experience in MENELAS, a medical language understanding project.
Collapse
Affiliation(s)
- P Zweigenbaum
- DIAM-SIM, Service d'Informatique Médicale, Hôpitaux de Paris
| | | | | | | | | |
Collapse
|
21
|
Zweigenbaum P, Bachimont B, Bouaud J, Charlet J, Boisvieux JF. A multi-lingual architecture for building a normalised conceptual representation from medical language. Proc Annu Symp Comput Appl Med Care 1995:357-61. [PMID: 8563301 PMCID: PMC2579114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The overall goal of MENELAS is to provide better access to the information contained in natural language patient discharge summaries (PDSs), through the design and implementation of a prototype able to analyse medical texts. The approach taken by MENELAS is based on the following key principles: (i) to maximise the usefulness of natural language analysis and the usability of its results, the output of natural language analysis must be a normalised conceptual representation of medical information; and (ii) to maximise the reuse of resources, language analysis should be domain-independent and conceptual representation should be language-independent. This paper discusses the results obtained and the issues raised when implementing these principles during the project.
Collapse
|
22
|
Abstract
The overall goal of MENELAS is to provide better access to the information contained in natural language patient discharge summaries, through the design and implementation of a pilot system able to access medical reports through natural languages. A first, experimental version of the MENELAS indexing prototype for French has been assembled. Its function is to encode free text PDSs into both an internal representation and ICD-9-CM nomenclature codes. A preliminary evaluation shows the potential for reasonable coverage and precision. The MENELAS prototype will be enhanced and extended into a pilot system which will be tested in two hospital sites.
Collapse
Affiliation(s)
- P Zweigenbaum
- DIAM-INSERM U.194 and SIM, Service d'Informatique Médicale, Assistance Publique-Hôpitaux de Paris, France
| |
Collapse
|
23
|
Volot F, Zweigenbaum P, Bachimont B, Ben Said M, Bouaud J, Fieschi M, Boisvieux JF. Structuration and acquisition of medical knowledge. Using UMLS in the conceptual graph formalism. Proc Annu Symp Comput Appl Med Care 1993:710-4. [PMID: 8130568 PMCID: PMC2850667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The use of a taxonomy, such as the concept type lattice (CTL) of Conceptual Graphs, is a central structuring piece in a knowledge-based system. The knowledge it contains is constantly used by the system, and its structure provides a guide for the acquisition of other pieces of knowledge. We show how UMLS can be used as a knowledge resource to build a CTL and how the CTL can help the process of acquisition for other kinds of knowledge. We illustrate this method in the context of the MENELAS natural language understanding project.
Collapse
Affiliation(s)
- F Volot
- Service de l'Information Médicale, Hôpital de la Timone, Marseille, France
| | | | | | | | | | | | | |
Collapse
|