Minimalistic Approach to Coreference Resolution in Lithuanian Medical Records.
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2019;
2019:9079840. [PMID:
31015858 PMCID:
PMC6446105 DOI:
10.1155/2019/9079840]
[Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 02/26/2019] [Indexed: 12/20/2022]
Abstract
Coreference resolution is a challenging part of natural language processing (NLP) with applications in machine translation, semantic search and other information retrieval, and decision support systems. Coreference resolution requires linguistic preprocessing and rich language resources for automatically identifying and resolving such expressions. Many rarer and under-resourced languages (such as Lithuanian) lack the required language resources and tools. We present a method for coreference resolution in Lithuanian language and its application for processing e-health records from a hospital reception. Our novelty is the ability to process coreferences with minimal linguistic resources, which is important in linguistic applications for rare and endangered languages. The experimental results show that coreference resolution is applicable to the development of NLP-powered online healthcare services in Lithuania.
Collapse