1
|
Wu S, Ke Z, Cai L, Wang L, Zhang X, Ke Q, Ye Y. Pelvic bone tumor segmentation fusion algorithm based on fully convolutional neural network and conditional random field. J Bone Oncol 2024; 45:100593. [PMID: 38495379 PMCID: PMC10943472 DOI: 10.1016/j.jbo.2024.100593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 01/24/2024] [Accepted: 02/08/2024] [Indexed: 03/19/2024] Open
Abstract
Background and objective Pelvic bone tumors represent a harmful orthopedic condition, encompassing both benign and malignant forms. Addressing the issue of limited accuracy in current machine learning algorithms for bone tumor image segmentation, we have developed an enhanced bone tumor image segmentation algorithm. This algorithm is built upon an improved full convolutional neural network, incorporating both the fully convolutional neural network (FCNN-4s) and a conditional random field (CRF) to achieve more precise segmentation. Methodology The enhanced fully convolutional neural network (FCNN-4s) was employed to conduct initial segmentation on preprocessed images. Following each convolutional layer, batch normalization layers were introduced to expedite network training convergence and enhance the accuracy of the trained model. Subsequently, a fully connected conditional random field (CRF) was integrated to fine-tune the segmentation results, refining the boundaries of pelvic bone tumors and achieving high-quality segmentation. Results The experimental outcomes demonstrate a significant enhancement in segmentation accuracy and stability when compared to the conventional convolutional neural network bone tumor image segmentation algorithm. The algorithm achieves an average Dice coefficient of 93.31 %, indicating superior performance in real-time operations. Conclusion In contrast to the conventional convolutional neural network segmentation algorithm, the algorithm presented in this paper boasts a more intricate structure, proficiently addressing issues of over-segmentation and under-segmentation in pelvic bone tumor segmentation. This segmentation model exhibits superior real-time performance, robust stability, and is capable of achieving heightened segmentation accuracy.
Collapse
Affiliation(s)
- Shiqiang Wu
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
- Department of Orthopedics, The Second Clinical College of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Zhanlong Ke
- Department of Orthopedics, The Second Clinical College of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Liquan Cai
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Liangming Wang
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - XiaoLu Zhang
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Qingfeng Ke
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Yuguang Ye
- Faculty of Mathematics and Computer Science, Quanzhou Normal University, Quanzhou 362000, China
- Fujian Provincial Key Laboratory of Data Intensive Computing, Quanzhou 362000, China
- Key Laboratory of Intelligent Computing and Information Processing, Fujian Province University, Quanzhou 362000, China
| |
Collapse
|
2
|
Ma MW, Gao XS, Zhang ZY, Shang SY, Jin L, Liu PL, Lv F, Ni W, Han YC, Zong H. Extracting laboratory test information from paper-based reports. BMC Med Inform Decis Mak 2023; 23:251. [PMID: 37932733 PMCID: PMC10629084 DOI: 10.1186/s12911-023-02346-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 10/20/2023] [Indexed: 11/08/2023] Open
Abstract
BACKGROUND In the healthcare domain today, despite the substantial adoption of electronic health information systems, a significant proportion of medical reports still exist in paper-based formats. As a result, there is a significant demand for the digitization of information from these paper-based reports. However, the digitization of paper-based laboratory reports into a structured data format can be challenging due to their non-standard layouts, which includes various data types such as text, numeric values, reference ranges, and units. Therefore, it is crucial to develop a highly scalable and lightweight technique that can effectively identify and extract information from laboratory test reports and convert them into a structured data format for downstream tasks. METHODS We developed an end-to-end Natural Language Processing (NLP)-based pipeline for extracting information from paper-based laboratory test reports. Our pipeline consists of two main modules: an optical character recognition (OCR) module and an information extraction (IE) module. The OCR module is applied to locate and identify text from scanned laboratory test reports using state-of-the-art OCR algorithms. The IE module is then used to extract meaningful information from the OCR results to form digitalized tables of the test reports. The IE module consists of five sub-modules, which are time detection, headline position, line normalization, Named Entity Recognition (NER) with a Conditional Random Fields (CRF)-based method, and step detection for multi-column. Finally, we evaluated the performance of the proposed pipeline on 153 laboratory test reports collected from Peking University First Hospital (PKU1). RESULTS In the OCR module, we evaluate the accuracy of text detection and recognition results at three different levels and achieved an averaged accuracy of 0.93. In the IE module, we extracted four laboratory test entities, including test item name, test result, test unit, and reference value range. The overall F1 score is 0.86 on the 153 laboratory test reports collected from PKU1. With a single CPU, the average inference time of each report is only 0.78 s. CONCLUSION In this study, we developed a practical lightweight pipeline to digitalize and extract information from paper-based laboratory test reports in diverse types and with different layouts that can be adopted in real clinical environments with the lowest possible computing resources requirements. The high evaluation performance on the real-world hospital dataset validated the feasibility of the proposed pipeline.
Collapse
Affiliation(s)
- Ming-Wei Ma
- Department of Radiation Oncology, Peking University First Hospital, No.7 Xishiku Street, Beijing, 100034, China
| | - Xian-Shu Gao
- Department of Radiation Oncology, Peking University First Hospital, No.7 Xishiku Street, Beijing, 100034, China.
| | - Ze-Yu Zhang
- Philips Research China, Shanghai, 200072, China
| | - Shi-Yu Shang
- Department of Radiation Oncology, Peking University First Hospital, No.7 Xishiku Street, Beijing, 100034, China
| | - Ling Jin
- Philips Research China, Shanghai, 200072, China
| | - Pei-Lin Liu
- Department of Radiation Oncology, Peking University First Hospital, No.7 Xishiku Street, Beijing, 100034, China
| | - Feng Lv
- Department of Radiation Oncology, Peking University First Hospital, No.7 Xishiku Street, Beijing, 100034, China
| | - Wei Ni
- Philips Research China, Shanghai, 200072, China
| | - Yu-Chen Han
- Philips Research China, Shanghai, 200072, China
| | - Hui Zong
- Philips Research China, Shanghai, 200072, China
| |
Collapse
|
3
|
Wu S, Bai X, Cai L, Wang L, Zhang X, Ke Q, Huang J. Bone tumor examination based on FCNN-4s and CRF fine segmentation fusion algorithm. J Bone Oncol 2023; 42:100502. [PMID: 37736418 PMCID: PMC10509716 DOI: 10.1016/j.jbo.2023.100502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 08/24/2023] [Accepted: 09/03/2023] [Indexed: 09/23/2023] Open
Abstract
Background and objective Bone tumor is a kind of harmful orthopedic disease, there are benign and malignant points. Aiming at the problem that the accuracy of the existing machine learning algorithm for bone tumor image segmentation is not high, a bone tumor image segmentation algorithm based on improved full convolutional neural network which consists fully convolutional neural network (FCNN-4s) and conditional random field (CRF). Methodology The improved fully convolutional neural network (FCNN-4s) was used to perform coarse segmentation on preprocessed images. Batch normalization layers were added after each convolutional layer to accelerate the convergence speed of network training and improve the accuracy of the trained model. Then, a fully connected conditional random field (CRF) was fused to refine the bone tumor boundary in the coarse segmentation results, achieving the fine segmentation effect. Results The experimental results show that compared with the traditional convolutional neural network bone tumor image segmentation algorithm, the algorithm has a great improvement in segmentation accuracy and stability, the average Dice can reach 91.56%, the real-time performance is better. Conclusion Compared with the traditional convolutional neural network segmentation algorithm, the algorithm in this paper has a more refined structure, which can effectively solve the problem of over-segmentation and under-segmentation of bone tumors. The segmentation prediction has better real-time performance, strong stability, and can achieve higher segmentation accuracy.
Collapse
Affiliation(s)
- Shiqiang Wu
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
- Department of Orthopedics, The Second Clinical College of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Xiaoming Bai
- Department of Orthopedics, The Second Clinical College of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Liquan Cai
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Liangming Wang
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - XiaoLu Zhang
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Qingfeng Ke
- Department of Orthopedics, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, Fujian 362000, China
| | - Jianlong Huang
- Faculty of Mathematics and Computer Science, Quanzhou Normal University, Quanzhou 362000, China
- Fujian Provincial Key Laboratory of Data Intensive Computing, Quanzhou 362000, China
- Key Laboratory of Intelligent Computing and Information Processing, Fujian Province University, Quanzhou 362000, China
| |
Collapse
|
4
|
Hirpassa S, Lehal G. Improving part-of-speech tagging in Amharic language using deep neural network. Heliyon 2023; 9:e17175. [PMID: 37539248 PMCID: PMC10394909 DOI: 10.1016/j.heliyon.2023.e17175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2021] [Revised: 06/08/2023] [Accepted: 06/09/2023] [Indexed: 08/05/2023] Open
Abstract
To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated different models such as bidirectional long short term memory, convolutional neural network in combination with bidirectional long short term memory, and conditional random field for Amharic POS tagging. Various features, both language-dependent and -independent, have been explored in a conditional random field model. Besides, word-level and character-level features are analyzed in deep neural network models. A convolutional neural network is utilized for encoding features at the word and character level. Each model's performance has evaluated on the dataset that contained 321 K tokens and manually tagged with 31 POS tags. Lastly, the best performance obtained by an end-to-end deep neural network model, convolutional neural network in combination with bidirectional long term short memory and conditional random field, is 97.23% accuracy. This is the highest accuracy for Amharic POS tagging task and is competent with contemporary taggers currently existing in different languages.
Collapse
Affiliation(s)
- Sintayehu Hirpassa
- Department of Computer Science, Adama Science and Technology University, Ethiopia
| | - G.S. Lehal
- Department of Computer Science, Punjabi University, India
| |
Collapse
|
5
|
Carrillo-Reid L, Han S, O'Neil D, Taralova E, Jebara T, Yuste R. Identification of Pattern Completion Neurons in Neuronal Ensembles Using Probabilistic Graphical Models. J Neurosci 2021; 41:8577-88. [PMID: 34413204 DOI: 10.1523/JNEUROSCI.0051-21.2021] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2021] [Revised: 07/06/2021] [Accepted: 07/11/2021] [Indexed: 01/21/2023] Open
Abstract
Neuronal ensembles are groups of neurons with coordinated activity that could represent sensory, motor, or cognitive states. The study of how neuronal ensembles are built, recalled, and involved in the guiding of complex behaviors has been limited by the lack of experimental and analytical tools to reliably identify and manipulate neurons that have the ability to activate entire ensembles. Such pattern completion neurons have also been proposed as key elements of artificial and biological neural networks. Indeed, the relevance of pattern completion neurons is highlighted by growing evidence that targeting them can activate neuronal ensembles and trigger behavior. As a method to reliably detect pattern completion neurons, we use conditional random fields (CRFs), a type of probabilistic graphical model. We apply CRFs to identify pattern completion neurons in ensembles in experiments using in vivo two-photon calcium imaging from primary visual cortex of male mice and confirm the CRFs predictions with two-photon optogenetics. To test the broader applicability of CRFs we also analyze publicly available calcium imaging data (Allen Institute Brain Observatory dataset) and demonstrate that CRFs can reliably identify neurons that predict specific features of visual stimuli. Finally, to explore the scalability of CRFs we apply them to in silico network simulations and show that CRFs-identified pattern completion neurons have increased functional connectivity. These results demonstrate the potential of CRFs to characterize and selectively manipulate neural circuits.SIGNIFICANCE STATEMENT We describe a graph theory method to identify and optically manipulate neurons with pattern completion capability in mouse cortical circuits. Using calcium imaging and two-photon optogenetics in vivo we confirm that key neurons identified by this method can recall entire neuronal ensembles. This method could be broadly applied to manipulate neuronal ensemble activity to trigger behavior or for therapeutic applications in brain prostheses.
Collapse
|
6
|
Jiao L, Huo L, Hu C, Tang P. Refined UNet v3: Efficient end-to-end patch-wise network for cloud and shadow segmentation with multi-channel spectral features. Neural Netw 2021; 143:767-782. [PMID: 34488013 DOI: 10.1016/j.neunet.2021.08.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 06/26/2021] [Accepted: 08/05/2021] [Indexed: 10/20/2022]
Abstract
Semantic segmentation is one of the essential prerequisites for computer vision tasks, but edge-precise segmentation stays challenging due to the potential lack of a proper model indicating the low-level relation between pixels. We have presented Refined UNet v2, a concatenation of a network backbone and a subsequent embedded conditional random field (CRF) layer, which coarsely performs pixel-wise classification and refines edges of segmentation regions in a one-stage way. However, the CRF layer of v2 employs a gray-scale global observation (image) to construct contrast-sensitive bilateral features, which is not able to achieve the desired performance on ambiguous edges. In addition, the naïve depth-wise Gaussian filter cannot always compute efficiently, especially for a longer-range message-passing step. To address the aforementioned issues, we upgrade the bilateral message-passing kernel and the efficient implementation of Gaussian filtering in the CRF layer in this paper, referred to as Refined UNet v3, which is able to effectively capture ambiguous edges and accelerate the message-passing procedure. Specifically, the inherited UNet is employed to coarsely locate cloud and shadow regions and the embedded CRF layer refines the edges of the forthcoming segmentation proposals. The multi-channel guided Gaussian filter is applied to the bilateral message-passing step, which improves detecting ambiguous edges that are hard for the gray-scale counterpart to identify, and fast Fourier transform-based (FFT-based) Gaussian filtering facilitates an efficient and potentially range-agnostic implementation. Furthermore, Refined UNet v3 is able to be extended to segmentation on multi-spectral datasets, and the corresponding refinement examination confirms the development of shadow retrieval. Experiments and corresponding results demonstrate that the proposed update can outperform its counterpart in terms of the detection of vague edges, shadow retrieval, and isolated redundant regions, and it is practically efficient in our TensorFlow implementation. The demo source code is available at https://github.com/92xianshen/refined-unet-v3.
Collapse
Affiliation(s)
- Libin Jiao
- Aerospace Information Research Institute (AIR), Chinese Academy of Sciences (CAS), Beijing 100101, China.
| | - Lianzhi Huo
- Aerospace Information Research Institute (AIR), Chinese Academy of Sciences (CAS), Beijing 100101, China.
| | - Changmiao Hu
- Aerospace Information Research Institute (AIR), Chinese Academy of Sciences (CAS), Beijing 100101, China.
| | - Ping Tang
- Aerospace Information Research Institute (AIR), Chinese Academy of Sciences (CAS), Beijing 100101, China.
| |
Collapse
|
7
|
Bozkurt S, Alkim E, Banerjee I, Rubin DL. Automated Detection of Measurements and Their Descriptors in Radiology Reports Using a Hybrid Natural Language Processing Algorithm. J Digit Imaging 2020; 32:544-553. [PMID: 31222557 PMCID: PMC6646482 DOI: 10.1007/s10278-019-00237-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Radiological measurements are reported in free text reports, and it is challenging to extract such measures for treatment planning such as lesion summarization and cancer response assessment. The purpose of this work is to develop and evaluate a natural language processing (NLP) pipeline that can extract measurements and their core descriptors, such as temporality, anatomical entity, imaging observation, RadLex descriptors, series number, image number, and segment from a wide variety of radiology reports (MR, CT, and mammogram). We created a hybrid NLP pipeline that integrates rule-based feature extraction modules and conditional random field (CRF) model for extraction of the measurements from the radiology reports and links them with clinically relevant features such as anatomical entities or imaging observations. The pipeline was trained on 1117 CT/MR reports, and performance of the system was evaluated on an independent set of 100 expert-annotated CT/MR reports and also tested on 25 mammography reports. The system detected 813 out of 806 measurements in the CT/MR reports; 784 were true positives, 29 were false positives, and 0 were false negatives. Similarly, from the mammography reports, 96% of the measurements with their modifiers were extracted correctly. Our approach could enable the development of computerized applications that can utilize summarized lesion measurements from radiology report of varying modalities and improve practice by tracking the same lesions along multiple radiologic encounters.
Collapse
Affiliation(s)
- Selen Bozkurt
- Department of Biomedical Data Science, Stanford University School of Medicine, Medical School Office Building (MSOB), Room X-335, MC 5464, 1265 Welch Road, Stanford, CA, 94305-5479, USA
| | - Emel Alkim
- Department of Biomedical Data Science, Stanford University School of Medicine, Medical School Office Building (MSOB), Room X-335, MC 5464, 1265 Welch Road, Stanford, CA, 94305-5479, USA
| | - Imon Banerjee
- Department of Biomedical Data Science, Stanford University School of Medicine, Medical School Office Building (MSOB), Room X-335, MC 5464, 1265 Welch Road, Stanford, CA, 94305-5479, USA.,Department of Radiology, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Daniel L Rubin
- Department of Biomedical Data Science, Stanford University School of Medicine, Medical School Office Building (MSOB), Room X-335, MC 5464, 1265 Welch Road, Stanford, CA, 94305-5479, USA. .,Department of Radiology, Stanford University School of Medicine, Stanford, CA, 94305, USA.
| |
Collapse
|
8
|
Hasan A, Levene M, Weston D. Learning structured medical information from social media. J Biomed Inform 2020; 110:103568. [PMID: 32942027 DOI: 10.1016/j.jbi.2020.103568] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 08/21/2020] [Accepted: 09/12/2020] [Indexed: 11/21/2022]
Abstract
Our goal is to summarise and aggregate information from social media regarding the symptoms of a disease, the drugs used and the treatment effects both positive and negative. To achieve this we first apply a supervised machine learning method to automatically extract medical concepts from natural language text. In an environment such as social media, where new data is continuously streamed, we need a methodology that will allow us to continuously train with the new data. To attain such incremental re-training, a semi-supervised methodology is developed, which is capable of learning new concepts from a small set of labelled data together with the much larger set of unlabelled data. The semi-supervised methodology deploys a conditional random field (CRF) as the base-line training algorithm for extracting medical concepts. The methodology iteratively augments to the training set sentences having high confidence, and adds terms to existing dictionaries to be used as features with the base-line model for further classification. Our empirical results show that the base-line CRF performs strongly across a range of different dictionary and training sizes; when the base-line is built with the full training data the F1 score reaches the range 84%-90%. Moreover, we show that the semi-supervised method produces a mild but significant improvement over the base-line. We also discuss the significance of the potential improvement of the semi-supervised methodology and found that it is significantly more accurate in most cases than the underlying base-line model.
Collapse
|
9
|
Clark NJ, Owada K, Ruberanziza E, Ortu G, Umulisa I, Bayisenge U, Mbonigaba JB, Mucaca JB, Lancaster W, Fenwick A, Soares Magalhães RJ, Mbituyumuremyi A. Parasite associations predict infection risk: incorporating co-infections in predictive models for neglected tropical diseases. Parasit Vectors 2020; 13:138. [PMID: 32178706 PMCID: PMC7077138 DOI: 10.1186/s13071-020-04016-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 03/10/2020] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Schistosomiasis and infection by soil-transmitted helminths are some of the world's most prevalent neglected tropical diseases. Infection by more than one parasite (co-infection) is common and can contribute to clinical morbidity in children. Geostatistical analyses of parasite infection data are key for developing mass drug administration strategies, yet most methods ignore co-infections when estimating risk. Infection status for multiple parasites can act as a useful proxy for data-poor individual-level or environmental risk factors while avoiding regression dilution bias. Conditional random fields (CRF) is a multivariate graphical network method that opens new doors in parasite risk mapping by (i) predicting co-infections with high accuracy; (ii) isolating associations among parasites; and (iii) quantifying how these associations change across landscapes. METHODS We built a spatial CRF to estimate infection risks for Ascaris lumbricoides, Trichuris trichiura, hookworms (Ancylostoma duodenale and Necator americanus) and Schistosoma mansoni using data from a national survey of Rwandan schoolchildren. We used an ensemble learning approach to generate spatial predictions by simulating from the CRF's posterior distribution with a multivariate boosted regression tree that captured non-linear relationships between predictors and covariance in infection risks. This CRF ensemble was compared against single parasite gradient boosted machines to assess each model's performance and prediction uncertainty. RESULTS Parasite co-infections were common, with 19.57% of children infected with at least two parasites. The CRF ensemble achieved higher predictive power than single-parasite models by improving estimates of co-infection prevalence at the individual level and classifying schools into World Health Organization treatment categories with greater accuracy. The CRF uncovered important environmental and demographic predictors of parasite infection probabilities. Yet even after capturing demographic and environmental risk factors, the presences or absences of other parasites were strong predictors of individual-level infection risk. Spatial predictions delineated high-risk regions in need of anthelminthic treatment interventions, including areas with higher than expected co-infection prevalence. CONCLUSIONS Monitoring studies routinely screen for multiple parasites, yet statistical models generally ignore this multivariate data when assessing risk factors and designing treatment guidelines. Multivariate approaches can be instrumental in the global effort to reduce and eventually eliminate neglected helminth infections in developing countries.
Collapse
Affiliation(s)
- Nicholas J. Clark
- UQ Spatial Epidemiology Laboratory, School of Veterinary Science, The University of Queensland, Gatton, QLD 4343 Australia
| | - Kei Owada
- UQ Spatial Epidemiology Laboratory, School of Veterinary Science, The University of Queensland, Gatton, QLD 4343 Australia
- Children Health and Environment Program, Child Health Research Centre, The University of Queensland, South Brisbane, QLD 4101 Australia
| | - Eugene Ruberanziza
- Neglected Tropical Diseases and Other Parasitic Diseases Unit, Malaria and Other Parasitic Diseases Division, Rwanda Biomedical Center, Kigali, Rwanda
| | - Giuseppina Ortu
- Schistosomiasis Control Initiative (SCI), Department of Infectious Diseases Epidemiology, Imperial College, London, UK
| | - Irenee Umulisa
- Neglected Tropical Diseases and Other Parasitic Diseases Unit, Malaria and Other Parasitic Diseases Division, Rwanda Biomedical Center, Kigali, Rwanda
| | - Ursin Bayisenge
- Neglected Tropical Diseases and Other Parasitic Diseases Unit, Malaria and Other Parasitic Diseases Division, Rwanda Biomedical Center, Kigali, Rwanda
| | - Jean Bosco Mbonigaba
- Neglected Tropical Diseases and Other Parasitic Diseases Unit, Malaria and Other Parasitic Diseases Division, Rwanda Biomedical Center, Kigali, Rwanda
| | - Jean Bosco Mucaca
- Microbiology Unit, National Reference Laboratory (NRL) Division, Rwanda Biomedical Center, Ministry of Health, Kigali, Rwanda
| | | | - Alan Fenwick
- Schistosomiasis Control Initiative (SCI), Department of Infectious Diseases Epidemiology, Imperial College, London, UK
| | - Ricardo J. Soares Magalhães
- UQ Spatial Epidemiology Laboratory, School of Veterinary Science, The University of Queensland, Gatton, QLD 4343 Australia
- Children Health and Environment Program, Child Health Research Centre, The University of Queensland, South Brisbane, QLD 4101 Australia
| | - Aimable Mbituyumuremyi
- Malaria and Other Parasitic Diseases Division, Rwanda Biomedical Center, Ministry of Health, Kigali, Rwanda
| |
Collapse
|
10
|
Lee W, Choi J. Precursor-induced conditional random fields: connecting separate entities by induction for improved clinical named entity recognition. BMC Med Inform Decis Mak 2019; 19:132. [PMID: 31307440 PMCID: PMC6632205 DOI: 10.1186/s12911-019-0865-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Accepted: 07/03/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND This paper presents a conditional random fields (CRF) method that enables the capture of specific high-order label transition factors to improve clinical named entity recognition performance. Consecutive clinical entities in a sentence are usually separated from each other, and the textual descriptions in clinical narrative documents frequently indicate causal or posterior relationships that can be used to facilitate clinical named entity recognition. However, the CRF that is generally used for named entity recognition is a first-order model that constrains label transition dependency of adjoining labels under the Markov assumption. METHODS Based on the first-order structure, our proposed model utilizes non-entity tokens between separated entities as an information transmission medium by applying a label induction method. The model is referred to as precursor-induced CRF because its non-entity state memorizes precursor entity information, and the model's structure allows the precursor entity information to propagate forward through the label sequence. RESULTS We compared the proposed model with both first- and second-order CRFs in terms of their F1-scores, using two clinical named entity recognition corpora (the i2b2 2012 challenge and the Seoul National University Hospital electronic health record). The proposed model demonstrated better entity recognition performance than both the first- and second-order CRFs and was also more efficient than the higher-order model. CONCLUSION The proposed precursor-induced CRF which uses non-entity labels as label transition information improves entity recognition F1 score by exploiting long-distance transition factors without exponentially increasing the computational time. In contrast, a conventional second-order CRF model that uses longer distance transition factors showed even worse results than the first-order model and required the longest computation time. Thus, the proposed model could offer a considerable performance improvement over current clinical named entity recognition methods based on the CRF models.
Collapse
Affiliation(s)
- Wangjin Lee
- Interdisciplinary Program for Bioengineering, Graduate School, Seoul National University, 103 Daehak-ro, Jongno-gu, Seoul, 03080, South Korea
| | - Jinwook Choi
- Interdisciplinary Program for Bioengineering, Graduate School, Seoul National University, 103 Daehak-ro, Jongno-gu, Seoul, 03080, South Korea. .,Department of Biomedical Engineering, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, South Korea. .,Institute of Medical and Biological Engineering, Medical Research Center, Seoul National University, 101 Daehak-ro, Jongno-gu, Seoul, 03080, South Korea.
| |
Collapse
|
11
|
Abstract
Calpains are a family of Ca2+-dependent cysteine proteases involved in many important biological processes, where they selectively cleave relevant substrates at specific cleavage sites to regulate the function of the substrate proteins. Presently, our knowledge about the function of calpains and the mechanism of substrate cleavage is still limited due to the fact that the experimental determination and validation on calpain bindings are usually laborious and expensive. This chapter describes LabCaS, an algorithm that is designed for predicting the calpain substrate cleavage sites from amino acid sequences. LabCaS is built on a conditional random field (CRF) statistic model, which trains the cleavage site prediction on multiple features of amino acid residue preference, solvent accessibility information, pair-wise alignment similarity score, secondary structure propensity, and physical-chemistry properties. Large-scale benchmark tests have shown that LabCaS can achieve a reliable recognition of the cleavage sites for most calpain proteins with an average AUC score of 0.862. Due to the fast speed and convenience of use, the protocol should find its usefulness in large-scale calpain-based function annotations of the newly sequenced proteins. The online web server of LabCaS is freely available at http://www.csbio.sjtu.edu.cn/bioinf/LabCaS .
Collapse
Affiliation(s)
- Yong-Xian Fan
- Guangxi Key Laboratory of Trusted Software, Guangxi Colleges and Universities Key Laboratory of Intelligent Processing of Computer Images and Graphics, Guilin University of Electronic Technology, Guilin, China
| | - Xiaoyong Pan
- Department of Medical Informatics, Erasmus MC, Rotterdam, The Netherlands
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA.
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, China.
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai, China.
| |
Collapse
|
12
|
Mahmood F, Durr NJ. Deep learning and conditional random fields-based depth estimation and topographical reconstruction from conventional endoscopy. Med Image Anal. 2018;48:230-243. [PMID: 29990688 DOI: 10.1016/j.media.2018.06.005] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Revised: 05/04/2018] [Accepted: 06/07/2018] [Indexed: 02/07/2023]
Abstract
Colorectal cancer is the fourth leading cause of cancer deaths worldwide and the second leading cause in the United States. The risk of colorectal cancer can be mitigated by the identification and removal of premalignant lesions through optical colonoscopy. Unfortunately, conventional colonoscopy misses more than 20% of the polyps that should be removed, due in part to poor contrast of lesion topography. Imaging depth and tissue topography during a colonoscopy is difficult because of the size constraints of the endoscope and the deforming mucosa. Most existing methods make unrealistic assumptions which limits accuracy and sensitivity. In this paper, we present a method that avoids these restrictions, using a joint deep convolutional neural network-conditional random field (CNN-CRF) framework for monocular endoscopy depth estimation. Estimated depth is used to reconstruct the topography of the surface of the colon from a single image. We train the unary and pairwise potential functions of a CRF in a CNN on synthetic data, generated by developing an endoscope camera model and rendering over 200,000 images of an anatomically-realistic colon.We validate our approach with real endoscopy images from a porcine colon, transferred to a synthetic-like domain via adversarial training, with ground truth from registered computed tomography measurements. The CNN-CRF approach estimates depths with a relative error of 0.152 for synthetic endoscopy images and 0.242 for real endoscopy images. We show that the estimated depth maps can be used for reconstructing the topography of the mucosa from conventional colonoscopy images. This approach can easily be integrated into existing endoscopy systems and provides a foundation for improving computer-aided detection algorithms for detection, segmentation and classification of lesions.
Collapse
|
13
|
Korvigo I, Holmatov M, Zaikovskii A, Skoblov M. Putting hands to rest: efficient deep CNN-RNN architecture for chemical named entity recognition with no hand-crafted rules. J Cheminform 2018; 10:28. [PMID: 29796778 PMCID: PMC5966369 DOI: 10.1186/s13321-018-0280-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Accepted: 05/14/2018] [Indexed: 11/10/2022] Open
Abstract
Chemical named entity recognition (NER) is an active field of research in biomedical natural language processing. To facilitate the development of new and superior chemical NER systems, BioCreative released the CHEMDNER corpus, an extensive dataset of diverse manually annotated chemical entities. Most of the systems trained on the corpus rely on complicated hand-crafted rules or curated databases for data preprocessing, feature extraction and output post-processing, though modern machine learning algorithms, such as deep neural networks, can automatically design the rules with little to none human intervention. Here we explored this approach by experimenting with various deep learning architectures for targeted tokenisation and named entity recognition. Our final model, based on a combination of convolutional and stateful recurrent neural networks with attention-like loops and hybrid word- and character-level embeddings, reaches near human-level performance on the testing dataset with no manually asserted rules. To make our model easily accessible for standalone use and integration in third-party software, we've developed a Python package with a minimalistic user interface.
Collapse
Affiliation(s)
- Ilia Korvigo
- Laboratory of Functional Analysis of the Genome, Moscow Institute of Physics and Technology, Moscow, Russia
- All-Russia Institute for Agricultural Microbiology, St. Petersburg, Russia
- ITMO University, St. Petersburg, Russia
| | - Maxim Holmatov
- St. Petersburg State Pediatric Medical University, St. Petersburg, Russia
- N.N. Petrov Institute of Oncology, Department of Tumor Biology, St. Petersburg, Russia
| | | | - Mikhail Skoblov
- Laboratory of Functional Analysis of the Genome, Moscow Institute of Physics and Technology, Moscow, Russia
- School of Biomedicine, Far Eastern Federal University, Vladivostok, Russia
- Laboratory of Functional Genomics, Research Centre for Medical Genetics, Moscow, Russia
| |
Collapse
|
14
|
Du L, Xia C, Deng Z, Lu G, Xia S, Ma J. A machine learning based approach to identify protected health information in Chinese clinical text. Int J Med Inform 2018; 116:24-32. [PMID: 29887232 DOI: 10.1016/j.ijmedinf.2018.05.010] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Revised: 04/19/2018] [Accepted: 05/17/2018] [Indexed: 11/24/2022]
Abstract
BACKGROUND With the increasing application of electronic health records (EHRs) in the world, protecting private information in clinical text has drawn extensive attention from healthcare providers to researchers. De-identification, the process of identifying and removing protected health information (PHI) from clinical text, has been central to the discourse on medical privacy since 2006. While de-identification is becoming the global norm for handling medical records, there is a paucity of studies on its application on Chinese clinical text. Without efficient and effective privacy protection algorithms in place, the use of indispensable clinical information would be confined. OBJECTIVES We aimed to (i) describe the current process for PHI in China, (ii) propose a machine learning based approach to identify PHI in Chinese clinical text, and (iii) validate the effectiveness of the machine learning algorithm for de-identification in Chinese clinical text. METHODS Based on 14,719 discharge summaries from regional health centers in Ya'an City, Sichuan province, China, we built a conditional random fields (CRF) model to identify PHI in clinical text, and then used the regular expressions to optimize the recognition results of the PHI categories with fewer samples. RESULTS We constructed a Chinese clinical text corpus with PHI tags through substantial manual annotation, wherein the descriptive statistics of PHI manifested its wide range and diverse categories. The evaluation showed with a high F-measure of 0.9878 that our CRF-based model had a good performance for identifying PHI in Chinese clinical text. CONCLUSION The rapid adoption of EHR in the health sector has created an urgent need for tools that can parse patient specific information from Chinese clinical text. Our application of CRF algorithms for de-identification has shown the potential to meet this need by offering a highly accurate and flexible solution to analyzing Chinese clinical text.
Collapse
Affiliation(s)
- Liting Du
- School of Medicine and Health Management, Tongji Medical College, Huazhong University of Science and Technology, Hubei, China
| | - Chenxi Xia
- School of Medicine and Health Management, Tongji Medical College, Huazhong University of Science and Technology, Hubei, China
| | - Zhaohua Deng
- School of Medicine and Health Management, Tongji Medical College, Huazhong University of Science and Technology, Hubei, China
| | - Gary Lu
- Dassault Systems, 175 Wyman St. Waltham, MA, 02451, USA
| | - Shuxu Xia
- School of Medicine and Health Management, Tongji Medical College, Huazhong University of Science and Technology, Hubei, China
| | - Jingdong Ma
- School of Medicine and Health Management, Tongji Medical College, Huazhong University of Science and Technology, Hubei, China.
| |
Collapse
|
15
|
Zhao X, Wu Y, Song G, Li Z, Zhang Y, Fan Y. A deep learning model integrating FCNNs and CRFs for brain tumor segmentation. Med Image Anal 2018; 43:98-111. [PMID: 29040911 PMCID: PMC6029627 DOI: 10.1016/j.media.2017.10.002] [Citation(s) in RCA: 283] [Impact Index Per Article: 47.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2017] [Revised: 07/09/2017] [Accepted: 10/04/2017] [Indexed: 02/07/2023]
Abstract
Accurate and reliable brain tumor segmentation is a critical component in cancer diagnosis, treatment planning, and treatment outcome evaluation. Build upon successful deep learning techniques, a novel brain tumor segmentation method is developed by integrating fully convolutional neural networks (FCNNs) and Conditional Random Fields (CRFs) in a unified framework to obtain segmentation results with appearance and spatial consistency. We train a deep learning based segmentation model using 2D image patches and image slices in following steps: 1) training FCNNs using image patches; 2) training CRFs as Recurrent Neural Networks (CRF-RNN) using image slices with parameters of FCNNs fixed; and 3) fine-tuning the FCNNs and the CRF-RNN using image slices. Particularly, we train 3 segmentation models using 2D image patches and slices obtained in axial, coronal and sagittal views respectively, and combine them to segment brain tumors using a voting based fusion strategy. Our method could segment brain images slice-by-slice, much faster than those based on image patches. We have evaluated our method based on imaging data provided by the Multimodal Brain Tumor Image Segmentation Challenge (BRATS) 2013, BRATS 2015 and BRATS 2016. The experimental results have demonstrated that our method could build a segmentation model with Flair, T1c, and T2 scans and achieve competitive performance as those built with Flair, T1, T1c, and T2 scans.
Collapse
Affiliation(s)
- Xiaomei Zhao
- National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China; University of Chinese Academy of Sciences, Beijing, China
| | - Yihong Wu
- National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China.
| | - Guidong Song
- Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
| | - Zhenye Li
- Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
| | - Yazhuo Zhang
- Beijing Neurosurgical Institute, Capital Medical University, Beijing, China; Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing, China; Beijing Institute for Brain Disorders Brain Tumor Center, Beijing, China; China National Clinical Research Center for Neurological Diseases, Beijing, China
| | - Yong Fan
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
16
|
Jiang Z, Zhao C, He B, Guan Y, Jiang J. De-identification of medical records using conditional random fields and long short-term memory networks. J Biomed Inform 2017; 75S:S43-S53. [PMID: 29032162 DOI: 10.1016/j.jbi.2017.10.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Revised: 09/30/2017] [Accepted: 10/03/2017] [Indexed: 10/18/2022]
Abstract
The CEGS N-GRID 2016 Shared Task 1 in Clinical Natural Language Processing focuses on the de-identification of psychiatric evaluation records. This paper describes two participating systems of our team, based on conditional random fields (CRFs) and long short-term memory networks (LSTMs). A pre-processing module was introduced for sentence detection and tokenization before de-identification. For CRFs, manually extracted rich features were utilized to train the model. For LSTMs, a character-level bi-directional LSTM network was applied to represent tokens and classify tags for each token, following which a decoding layer was stacked to decode the most probable protected health information (PHI) terms. The LSTM-based system attained an i2b2 strict micro-F1 measure of 0.8986, which was higher than that of the CRF-based system.
Collapse
Affiliation(s)
- Zhipeng Jiang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Chao Zhao
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Bin He
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Yi Guan
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| | - Jingchi Jiang
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150001, China.
| |
Collapse
|
17
|
Murugesan G, Abdulkadhar S, Bhasuran B, Natarajan J. BCC-NER: bidirectional, contextual clues named entity tagger for gene/protein mention recognition. EURASIP J Bioinform Syst Biol 2017; 2017:7. [PMID: 28477208 PMCID: PMC5419958 DOI: 10.1186/s13637-017-0060-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 04/21/2017] [Indexed: 11/29/2022]
Abstract
Tagging biomedical entities such as gene, protein, cell, and cell-line is the first step and an important pre-requisite in biomedical literature mining. In this paper, we describe our hybrid named entity tagging approach namely BCC-NER (bidirectional, contextual clues named entity tagger for gene/protein mention recognition). BCC-NER is deployed with three modules. The first module is for text processing which includes basic NLP pre-processing, feature extraction, and feature selection. The second module is for training and model building with bidirectional conditional random fields (CRF) to parse the text in both directions (forward and backward) and integrate the backward and forward trained models using margin-infused relaxed algorithm (MIRA). The third and final module is for post-processing to achieve a better performance, which includes surrounding text features, parenthesis mismatching, and two-tier abbreviation algorithm. The evaluation results on BioCreative II GM test corpus of BCC-NER achieve a precision of 89.95, recall of 84.15 and overall F-score of 86.95, which is higher than the other currently available open source taggers.
Collapse
Affiliation(s)
- Gurusamy Murugesan
- Data Mining and Text Mining Lab, Department of Bioinformatics, Bharathiar University, Coimbatore, Tamilnadu, 641046, India
| | - Sabenabanu Abdulkadhar
- Data Mining and Text Mining Lab, Department of Bioinformatics, Bharathiar University, Coimbatore, Tamilnadu, 641046, India
| | - Balu Bhasuran
- Center for Computational Biology, DRDO-BU Center for Life Sciences, Bharathiar University, Coimbatore, Tamilnadu, 641046, India
| | - Jeyakumar Natarajan
- Data Mining and Text Mining Lab, Department of Bioinformatics, Bharathiar University, Coimbatore, Tamilnadu, 641046, India.
| |
Collapse
|
18
|
Li E, Khalvati F, Shafiee MJ, Haider MA, Wong A. Sparse reconstruction of compressive sensing MRI using cross-domain stochastically fully connected conditional random fields. BMC Med Imaging 2016; 16:51. [PMID: 27566536 PMCID: PMC5002135 DOI: 10.1186/s12880-016-0156-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Accepted: 08/15/2016] [Indexed: 11/20/2022] Open
Abstract
Background Magnetic Resonance Imaging (MRI) is a crucial medical imaging technology for the screening and diagnosis of frequently occurring cancers. However, image quality may suffer from long acquisition times for MRIs due to patient motion, which also leads to patient discomfort. Reducing MRI acquisition times can reduce patient discomfort leading to reduced motion artifacts from the acquisition process. Compressive sensing strategies applied to MRI have been demonstrated to be effective in decreasing acquisition times significantly by sparsely sampling the k-space during the acquisition process. However, such a strategy requires advanced reconstruction algorithms to produce high quality and reliable images from compressive sensing MRI. Methods This paper proposes a new reconstruction approach based on cross-domain stochastically fully connected conditional random fields (CD-SFCRF) for compressive sensing MRI. The CD-SFCRF introduces constraints in both k-space and spatial domains within a stochastically fully connected graphical model to produce improved MRI reconstruction. Results Experimental results using T2-weighted (T2w) imaging and diffusion-weighted imaging (DWI) of the prostate show strong performance in preserving fine details and tissue structures in the reconstructed images when compared to other tested methods even at low sampling rates. Conclusions The ability to better utilize a limited amount of information to reconstruct T2w and DWI images in a short amount of time while preserving the important details in the images demonstrates the potential of the proposed CD-SFCRF framework as a viable reconstruction algorithm for compressive sensing MRI.
Collapse
Affiliation(s)
- Edward Li
- Department of Systems Design Engineering, University of Waterloo, Ontario, Waterloo, Canada
| | - Farzad Khalvati
- Department of Medical Imaging, University of Toronto and Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Mohammad Javad Shafiee
- Department of Systems Design Engineering, University of Waterloo, Ontario, Waterloo, Canada
| | - Masoom A Haider
- Department of Medical Imaging, University of Toronto and Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Alexander Wong
- Department of Systems Design Engineering, University of Waterloo, Ontario, Waterloo, Canada.
| |
Collapse
|
19
|
Yoon Y. Performance analysis of CRF-based learning for processing WoT application requests expressed in natural language. Springerplus 2016; 5:1324. [PMID: 27563519 PMCID: PMC4980846 DOI: 10.1186/s40064-016-3012-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Accepted: 08/05/2016] [Indexed: 11/29/2022]
Abstract
Background In this paper, we investigate the effectiveness of a CRF-based learning method for identifying necessary Web of Things (WoT) application components that would satisfy the users’ requests issued in natural language. For instance, a user request such as “archive all sports breaking news” can be satisfied by composing a WoT application that consists of ESPN breaking news service and Dropbox as a storage service. Findings We built an engine that can identify the necessary application components by recognizing a main act (MA) or named entities (NEs) from a given request. We trained this engine with the descriptions of WoT applications (called recipes) that were collected from IFTTT WoT platform. IFTTT hosts over 300 WoT entities that offer thousands of functions referred to as triggers and actions. There are more than 270,000 publicly-available recipes composed with those functions by real users. Therefore, the set of these recipes is well-qualified for the training of our MA and NE recognition engine. Conlusions We share our unique experience of generating the training and test set from these recipe descriptions and assess the performance of the CRF-based language method. Based on the performance evaluation, we introduce further research directions.
Collapse
Affiliation(s)
- Young Yoon
- Department of Computer Engineering, Hongik University, 94, Wowsan-ro, Mapo-gu, Seoul, South Korea
| |
Collapse
|
20
|
Hong N, Li D, Yu Y, Xiu Q, Liu H, Jiang G. A computational framework for converting textual clinical diagnostic criteria into the quality data model. J Biomed Inform 2016; 63:11-21. [PMID: 27444185 DOI: 10.1016/j.jbi.2016.07.016] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Revised: 07/07/2016] [Accepted: 07/17/2016] [Indexed: 10/21/2022]
Abstract
BACKGROUND Constructing standard and computable clinical diagnostic criteria is an important but challenging research field in the clinical informatics community. The Quality Data Model (QDM) is emerging as a promising information model for standardizing clinical diagnostic criteria. OBJECTIVE To develop and evaluate automated methods for converting textual clinical diagnostic criteria in a structured format using QDM. METHODS We used a clinical Natural Language Processing (NLP) tool known as cTAKES to detect sentences and annotate events in diagnostic criteria. We developed a rule-based approach for assigning the QDM datatype(s) to an individual criterion, whereas we invoked a machine learning algorithm based on the Conditional Random Fields (CRFs) for annotating attributes belonging to each particular QDM datatype. We manually developed an annotated corpus as the gold standard and used standard measures (precision, recall and f-measure) for the performance evaluation. RESULTS We harvested 267 individual criteria with the datatypes of Symptom and Laboratory Test from 63 textual diagnostic criteria. We manually annotated attributes and values in 142 individual Laboratory Test criteria. The average performance of our rule-based approach was 0.84 of precision, 0.86 of recall, and 0.85 of f-measure; the performance of CRFs-based classification was 0.95 of precision, 0.88 of recall and 0.91 of f-measure. We also implemented a web-based tool that automatically translates textual Laboratory Test criteria into the QDM XML template format. The results indicated that our approaches leveraging cTAKES and CRFs are effective in facilitating diagnostic criteria annotation and classification. CONCLUSION Our NLP-based computational framework is a feasible and useful solution in developing diagnostic criteria representation and computerization.
Collapse
Affiliation(s)
- Na Hong
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA; Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing, China
| | - Dingcheng Li
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Yue Yu
- Department of Medical Informatics, School of Public Health, Jilin University, Changchun, Jilin, China
| | - Qiongying Xiu
- Computer Science, Winona State University, Rochester, MN, USA
| | - Hongfang Liu
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Guoqian Jiang
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
21
|
He B, Guan Y, Cheng J, Cen K, Hua W. CRFs based de-identification of medical records. J Biomed Inform 2015; 58 Suppl:S39-S46. [PMID: 26315662 PMCID: PMC4988860 DOI: 10.1016/j.jbi.2015.08.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2015] [Revised: 07/20/2015] [Accepted: 08/03/2015] [Indexed: 10/29/2022]
Abstract
De-identification is a shared task of the 2014 i2b2/UTHealth challenge. The purpose of this task is to remove protected health information (PHI) from medical records. In this paper, we propose a novel de-identifier, WI-deId, based on conditional random fields (CRFs). A preprocessing module, which tokenizes the medical records using regular expressions and an off-the-shelf tokenizer, is introduced, and three groups of features are extracted to train the de-identifier model. The experiment shows that our system is effective in the de-identification of medical records, achieving a micro-F1 of 0.9232 at the i2b2 strict entity evaluation level.
Collapse
Affiliation(s)
- Bin He
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Yi Guan
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.
| | - Jianyi Cheng
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Keting Cen
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| | - Wenlan Hua
- School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China
| |
Collapse
|
22
|
Abstract
Background The development of robust methods for chemical named entity recognition, a challenging natural language processing task, was previously hindered by the lack of publicly available, large-scale, gold standard corpora. The recent public release of a large chemical entity-annotated corpus as a resource for the CHEMDNER track of the Fourth BioCreative Challenge Evaluation (BioCreative IV) workshop greatly alleviated this problem and allowed us to develop a conditional random fields-based chemical entity recogniser. In order to optimise its performance, we introduced customisations in various aspects of our solution. These include the selection of specialised pre-processing analytics, the incorporation of chemistry knowledge-rich features in the training and application of the statistical model, and the addition of post-processing rules. Results Our evaluation shows that optimal performance is obtained when our customisations are integrated into the chemical entity recogniser. When its performance is compared with that of state-of-the-art methods, under comparable experimental settings, our solution achieves competitive advantage. We also show that our recogniser that uses a model trained on the CHEMDNER corpus is suitable for recognising names in a wide range of corpora, consistently outperforming two popular chemical NER tools. Conclusion The contributions resulting from this work are two-fold. Firstly, we present the details of a chemical entity recognition methodology that has demonstrated performance at a competitive, if not superior, level as that of state-of-the-art methods. Secondly, the developed suite of solutions has been made publicly available as a configurable workflow in the interoperable text mining workbench Argo. This allows interested users to conveniently apply and evaluate our solutions in the context of other chemical text mining tasks.
Collapse
Affiliation(s)
- Riza Batista-Navarro
- National Centre for Text Mining, Manchester Institute of Biotechnology, 131 Princess St, Manchester, M1 7DN, UK ; Department of Computer Science, University of the Philippines Diliman, Quezon City, 1101, Philippines
| | - Rafal Rak
- National Centre for Text Mining, Manchester Institute of Biotechnology, 131 Princess St, Manchester, M1 7DN, UK
| | - Sophia Ananiadou
- National Centre for Text Mining, Manchester Institute of Biotechnology, 131 Princess St, Manchester, M1 7DN, UK
| |
Collapse
|
23
|
Zhang K, Xie Y, Yang Y, Sun A, Liu H, Choudhary A. Incorporating conditional random fields and active learning to improve sentiment identification. Neural Netw 2014; 58:60-7. [PMID: 24856246 DOI: 10.1016/j.neunet.2014.04.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2013] [Revised: 04/28/2014] [Accepted: 04/29/2014] [Indexed: 11/25/2022]
Abstract
Many machine learning, statistical, and computational linguistic methods have been developed to identify sentiment of sentences in documents, yielding promising results. However, most of state-of-the-art methods focus on individual sentences and ignore the impact of context on the meaning of a sentence. In this paper, we propose a method based on conditional random fields to incorporate sentence structure and context information in addition to syntactic information for improving sentiment identification. We also investigate how human interaction affects the accuracy of sentiment labeling using limited training data. We propose and evaluate two different active learning strategies for labeling sentiment data. Our experiments with the proposed approach demonstrate a 5%-15% improvement in accuracy on Amazon customer reviews compared to existing supervised learning and rule-based methods.
Collapse
Affiliation(s)
- Kunpeng Zhang
- University of Illinois at Chicago, Chicago, IL, USA.
| | | | - Yi Yang
- Northwestern University, Evanston, IL, USA.
| | - Aaron Sun
- Cloud Research Lab, Samsung Research America, San Jose, CA, USA.
| | - Hengchang Liu
- University of Science and Technology of China, Hefei, China.
| | | |
Collapse
|
24
|
Zuccon G, Kotzur D, Nguyen A, Bergheim A. De-identification of health records using Anonym: effectiveness and robustness across datasets. Artif Intell Med 2014; 61:145-51. [PMID: 24791676 DOI: 10.1016/j.artmed.2014.03.006] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2013] [Revised: 03/17/2014] [Accepted: 03/18/2014] [Indexed: 11/21/2022]
Abstract
OBJECTIVE Evaluate the effectiveness and robustness of Anonym, a tool for de-identifying free-text health records based on conditional random fields classifiers informed by linguistic and lexical features, as well as features extracted by pattern matching techniques. De-identification of personal health information in electronic health records is essential for the sharing and secondary usage of clinical data. De-identification tools that adapt to different sources of clinical data are attractive as they would require minimal intervention to guarantee high effectiveness. METHODS AND MATERIALS The effectiveness and robustness of Anonym are evaluated across multiple datasets, including the widely adopted Integrating Biology and the Bedside (i2b2) dataset, used for evaluation in a de-identification challenge. The datasets used here vary in type of health records, source of data, and their quality, with one of the datasets containing optical character recognition errors. RESULTS Anonym identifies and removes up to 96.6% of personal health identifiers (recall) with a precision of up to 98.2% on the i2b2 dataset, outperforming the best system proposed in the i2b2 challenge. The effectiveness of Anonym across datasets is found to depend on the amount of information available for training. CONCLUSION Findings show that Anonym compares to the best approach from the 2006 i2b2 shared task. It is easy to retrain Anonym with new datasets; if retrained, the system is robust to variations of training size, data type and quality in presence of sufficient training data.
Collapse
|
25
|
Rubrichi S, Battistotti A, Quaglini S. Patients' involvement in e-health services quality assessment: a system for the automatic interpretation of SMS-based patients' feedback. J Biomed Inform 2014; 51:41-8. [PMID: 24632295 DOI: 10.1016/j.jbi.2014.03.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2013] [Revised: 01/29/2014] [Accepted: 03/03/2014] [Indexed: 11/21/2022]
Abstract
PURPOSE Effective communication between patients and health services providers is a key aspect for optimizing and maintaining these services. This work describes a system for the automatic evaluation of users' perception of the quality of SmsCup, a reminder system for outpatient visits based on short message service (SMS). The final purpose is the creation of a closed-loop control system for the outpatient service, where patients' complaints and comments represent a feedback that can be used for a better implementation of the service itself. METHODS SmsCup was adopted since about eight years by an Italian healthcare organization, with very good results in reducing the no-show (missing visits) phenomenon. During these years, a number of citizens, even if not required, sent a message back, with comments about the service. The automatic interpretation of the content of those SMS may be useful for monitoring and improving service performances.Yet, due to the complex nature of SMS language, their interpretation represents an ongoing challenge. The proposed system uses conditional random fields as the information extraction method for classifying messages into several semantic categories. The categories refer to appreciation of the service or complaints of various types. Then, the system analyzes the extracted content and provides feedback to the service providers, making them learning and acting on this basis. RESULTS At each step, the content of the messages reveals the actual state of the service as well as the efficacy of corrective actions previously undertaken. Our evaluations showed that: (i) the SMS classification system has achieved good overall performance with an average F1-measure and an overall accuracy of about 92%; (ii) the notification of the patients' feedbacks to service providers showed a positive impact on service functioning. CONCLUSIONS Our study proposed an interactive patient-centered system for continuous monitoring of the service quality. It has demonstrated the feasibility of a tool for the analysis and notification of the patients' feedback on their service experiences, which would support a more regular access to the service.
Collapse
|
26
|
Wang H, Zhang W, Zeng Q, Li Z, Feng K, Liu L. Extracting important information from Chinese Operation Notes with natural language processing methods. J Biomed Inform 2014; 48:130-6. [PMID: 24486562 DOI: 10.1016/j.jbi.2013.12.017] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2013] [Revised: 12/09/2013] [Accepted: 12/13/2013] [Indexed: 10/25/2022]
Abstract
Extracting information from unstructured clinical narratives is valuable for many clinical applications. Although natural Language Processing (NLP) methods have been profoundly studied in electronic medical records (EMR), few studies have explored NLP in extracting information from Chinese clinical narratives. In this study, we report the development and evaluation of extracting tumor-related information from operation notes of hepatic carcinomas which were written in Chinese. Using 86 operation notes manually annotated by physicians as the training set, we explored both rule-based and supervised machine-learning approaches. Evaluating on unseen 29 operation notes, our best approach yielded 69.6% in precision, 58.3% in recall and 63.5% F-score.
Collapse
Affiliation(s)
- Hui Wang
- Shanghai Public Health Clinical Center, Institutes of Biomedical Sciences, and Key laboratory of Medical Molecular Virology, Ministry of Education and Health, Fudan University, Shanghai, China; Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention of Shanghai, Fudan University, Shanghai, China; Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Weide Zhang
- Zhongshan Hospital, Fudan University, Shanghai, China
| | - Qiang Zeng
- Shanghai Center for Bioinformatics Technology, Shanghai, China
| | - Zuofeng Li
- Shanghai Center for Bioinformatics Technology, Shanghai, China
| | | | - Lei Liu
- Shanghai Public Health Clinical Center, Institutes of Biomedical Sciences, and Key laboratory of Medical Molecular Virology, Ministry of Education and Health, Fudan University, Shanghai, China; Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention of Shanghai, Fudan University, Shanghai, China; Shanghai Center for Bioinformatics Technology, Shanghai, China.
| |
Collapse
|