1
|
Abstract
The immune system is highly complex and distributed throughout an organism, with hundreds to thousands of cell states existing in parallel with diverse molecular pathways interacting in a highly dynamic and coordinated fashion. Although the characterization of individual genes and molecules is of the utmost importance for understanding immune-system function, high-throughput, high-resolution omics technologies combined with sophisticated computational modeling and machine-learning approaches are creating opportunities to complement standard immunological methods with new insights into immune-system dynamics. Like systems immunology itself, immunology researchers must take advantage of these technologies and form their own diverse networks, connecting with researchers from other disciplines. This Review is an introduction and 'how-to guide' for immunologists with no particular experience in the field of omics but with the intention to learn about and apply these systems-level approaches, and for immunologists who want to make the most of interdisciplinary networks.
Collapse
|
2
|
Semantic Metadata Annotation Services in the Biomedical Domain—A Literature Review. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12020796] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
For all research data collected, data descriptions and information about the corresponding variables are essential for data analysis and reuse. To enable cross-study comparisons and analyses, semantic interoperability of metadata is one of the most important requirements. In the area of clinical and epidemiological studies, data collection instruments such as case report forms (CRFs), data dictionaries and questionnaires are critical for metadata collection. Even though data collection instruments are often created in a digital form, they are mostly not machine readable; i.e., they are not semantically coded. As a result, the comparison between data collection instruments is complex. The German project NFDI4Health is dedicated to the development of national research data infrastructure for personal health data, and as such searches for ways to enhance semantic interoperability. Retrospective integration of semantic codes into study metadata is important, as ongoing or completed studies contain valuable information. However, this is labor intensive and should be eased by software. To understand the market and find out what techniques and technologies support retrospective semantic annotation/enrichment of metadata, we conducted a literature review. In NFDI4Health, we identified basic requirements for semantic metadata annotation software in the biomedical field and in the context of the FAIR principles. Ten relevant software systems were summarized and aligned with those requirements. We concluded that despite active research on semantic annotation systems, no system meets all requirements. Consequently, further research and software development in this area is needed, as interoperability of data dictionaries, questionnaires and data collection tools is key to reusing and combining results from independent research studies.
Collapse
|
3
|
Ulrich H, Kock-Schoppenhauer AK, Deppenwiese N, Gött R, Kern J, Lablans M, Majeed RW, Stöhr MR, Stausberg J, Varghese J, Dugas M, Ingenerf J. Understanding the Nature of Metadata: Systematic Review. J Med Internet Res 2022; 24:e25440. [PMID: 35014967 PMCID: PMC8790684 DOI: 10.2196/25440] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 01/28/2021] [Accepted: 10/14/2021] [Indexed: 01/11/2023] Open
Abstract
Background Metadata are created to describe the corresponding data in a detailed and unambiguous way and is used for various applications in different research areas, for example, data identification and classification. However, a clear definition of metadata is crucial for further use. Unfortunately, extensive experience with the processing and management of metadata has shown that the term “metadata” and its use is not always unambiguous. Objective This study aimed to understand the definition of metadata and the challenges resulting from metadata reuse. Methods A systematic literature search was performed in this study following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines for reporting on systematic reviews. Five research questions were identified to streamline the review process, addressing metadata characteristics, metadata standards, use cases, and problems encountered. This review was preceded by a harmonization process to achieve a general understanding of the terms used. Results The harmonization process resulted in a clear set of definitions for metadata processing focusing on data integration. The following literature review was conducted by 10 reviewers with different backgrounds and using the harmonized definitions. This study included 81 peer-reviewed papers from the last decade after applying various filtering steps to identify the most relevant papers. The 5 research questions could be answered, resulting in a broad overview of the standards, use cases, problems, and corresponding solutions for the application of metadata in different research areas. Conclusions Metadata can be a powerful tool for identifying, describing, and processing information, but its meaningful creation is costly and challenging. This review process uncovered many standards, use cases, problems, and solutions for dealing with metadata. The presented harmonized definitions and the new schema have the potential to improve the classification and generation of metadata by creating a shared understanding of metadata and its context.
Collapse
Affiliation(s)
- Hannes Ulrich
- IT Center for Clinical Research, University of Lübeck, Lübeck, Germany.,Institute of Medical Informatics, University of Lübeck, Lübeck, Germany
| | | | - Noemi Deppenwiese
- Chair of Medical Informatics, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Robert Gött
- Department Epidemiology of Health Care and Community Health, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Jori Kern
- Federated Information Systems, German Cancer Research Center, Heidelberg, Germany.,Complex Data Processing in Medical Informatics, University Medical Center Mannheim, Mannheim, Germany
| | - Martin Lablans
- Federated Information Systems, German Cancer Research Center, Heidelberg, Germany.,Complex Data Processing in Medical Informatics, University Medical Center Mannheim, Mannheim, Germany
| | - Raphael W Majeed
- Universities of Giessen and Marburg Lung Center, German Center for Lung Research, Justus-Liebig-University, Giessen, Germany.,Institute of Medical Informatics, University Hospital RWTH Aachen, Aachen, Germany
| | - Mark R Stöhr
- Universities of Giessen and Marburg Lung Center, German Center for Lung Research, Justus-Liebig-University, Giessen, Germany
| | - Jürgen Stausberg
- Institute of Medical Informatics, Biometry and Epidemiology, Faculty of Medicine, University of Duisburg-Essen, Essen, Germany
| | - Julian Varghese
- Institute of Medical Informatics, University of Münster, Münster, Germany
| | - Martin Dugas
- Institute of Medical Informatics, Heidelberg University Hospital, Heidelberg, Germany
| | - Josef Ingenerf
- IT Center for Clinical Research, University of Lübeck, Lübeck, Germany.,Institute of Medical Informatics, University of Lübeck, Lübeck, Germany
| |
Collapse
|
4
|
Nguyen TM, Bharti S, Yue Z, Willey CD, Chen JY. Statistical Enrichment Analysis of Samples: A General-Purpose Tool to Annotate Metadata Neighborhoods of Biological Samples. Front Big Data 2021; 4:725276. [PMID: 34604741 PMCID: PMC8481385 DOI: 10.3389/fdata.2021.725276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 09/06/2021] [Indexed: 11/13/2022] Open
Abstract
Unsupervised learning techniques, such as clustering and embedding, have been increasingly popular to cluster biomedical samples from high-dimensional biomedical data. Extracting clinical data or sample meta-data shared in common among biomedical samples of a given biological condition remains a major challenge. Here, we describe a powerful analytical method called Statistical Enrichment Analysis of Samples (SEAS) for interpreting clustered or embedded sample data from omics studies. The method derives its power by focusing on sample sets, i.e., groups of biological samples that were constructed for various purposes, e.g., manual curation of samples sharing specific characteristics or automated clusters generated by embedding sample omic profiles from multi-dimensional omics space. The samples in the sample set share common clinical measurements, which we refer to as "clinotypes," such as age group, gender, treatment status, or survival days. We demonstrate how SEAS yields insights into biological data sets using glioblastoma (GBM) samples. Notably, when analyzing the combined The Cancer Genome Atlas (TCGA)-patient-derived xenograft (PDX) data, SEAS allows approximating the different clinical outcomes of radiotherapy-treated PDX samples, which has not been solved by other tools. The result shows that SEAS may support the clinical decision. The SEAS tool is publicly available as a freely available software package at https://aimed-lab.shinyapps.io/SEAS/.
Collapse
Affiliation(s)
- Thanh M Nguyen
- Informatics Institute, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Samuel Bharti
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University, Noida, India
| | - Zongliang Yue
- Informatics Institute, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Christopher D Willey
- Department of Radiation Oncology, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| | - Jake Y Chen
- Informatics Institute, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, United States
| |
Collapse
|
5
|
Ma H, Shen L, Sun H, Xu Z, Hou L, Wu S, Fang A, Li J, Qian Q. COVID term: a bilingual terminology for COVID-19. BMC Med Inform Decis Mak 2021; 21:231. [PMID: 34344385 PMCID: PMC8329642 DOI: 10.1186/s12911-021-01593-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 07/10/2021] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND The coronavirus disease (COVID-19), a pneumonia caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has shown its destructiveness with more than one million confirmed cases and dozens of thousands of death, which is highly contagious and still spreading globally. World-wide studies have been conducted aiming to understand the COVID-19 mechanism, transmission, clinical features, etc. A cross-language terminology of COVID-19 is essential for improving knowledge sharing and scientific discovery dissemination. METHODS We developed a bilingual terminology of COVID-19 named COVID Term with mapping Chinese and English terms. The terminology was constructed as follows: (1) Classification schema design; (2) Concept representation model building; (3) Term source selection and term extraction; (4) Hierarchical structure construction; (5) Quality control (6) Web service. We built open access for the terminology, providing search, browse, and download services. RESULTS The proposed COVID Term include 10 categories: disease, anatomic site, clinical manifestation, demographic and socioeconomic characteristics, living organism, qualifiers, psychological assistance, medical equipment, instruments and materials, epidemic prevention and control, diagnosis and treatment technique respectively. In total, COVID Terms covered 464 concepts with 724 Chinese terms and 887 English terms. All terms are openly available online (COVID Term URL: http://covidterm.imicams.ac.cn ). CONCLUSIONS COVID Term is a bilingual terminology focused on COVID-19, the epidemic pneumonia with a high risk of infection around the world. It will provide updated bilingual terms of the disease to help health providers and medical professionals retrieve and exchange information and knowledge in multiple languages. COVID Term was released in machine-readable formats (e.g., XML and JSON), which would contribute to the information retrieval, machine translation and advanced intelligent techniques application.
Collapse
Affiliation(s)
- Hetong Ma
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Liu Shen
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Haixia Sun
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Zidu Xu
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Li Hou
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Sizhu Wu
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - An Fang
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Jiao Li
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Qing Qian
- Institute of Medical Information/Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China.
| |
Collapse
|
6
|
Abstract
Objective
: To give an overview of recent research and to propose a selection of best papers published in 2019 in the field of Clinical Information Systems (CIS).
Method
: Each year, we apply a systematic process to retrieve articles for the CIS section of the IMIA Yearbook of Medical Informatics. For six years now, we use the same query to find relevant publications in the CIS field. Each year we retrieve more than 2,000 papers. As CIS section editors, we categorize the retrieved articles in a multi-pass review to distill a pre-selection of 15 candidate best papers. Then, Yearbook editors and external reviewers assess the selected candidate best papers. Based on the review results, the IMIA Yearbook Editorial Committee chooses the best papers during the selection meeting. We used text mining, and term co-occurrence mapping techniques to get an overview of the content of the retrieved articles.
Results
: We carried out the query in mid-January 2020 and retrieved a de-duplicated result set of 2,407 articles from 1,023 different journals. This year, we nominated 14 papers as candidate best papers, and three of them were finally selected as best papers in the CIS section. As in previous years, the content analysis of the articles revealed the broad spectrum of topics covered by CIS research.
Conclusions
: We could observe ongoing trends, as seen in the last years. Patient benefit research is in the focus of many research activities, and trans-institutional aggregation of data remains a relevant field of work. Powerful machine-learning-based approaches, that use readily available data now often outperform human-based procedures. However, the ethical perspective of this development often comes too short in the considerations. We thus assume that ethical aspects will and should deliver much food for thought for future CIS research.
Collapse
Affiliation(s)
- W O Hackl
- Institute of Medical Informatics, UMIT - Private University of Health Sciences, Medical Informatics and Technology, Hall in Tirol, Austria
| | - A Hoerbst
- Medical Technologies Department, MCI - The Entrepreneurial School, Innsbruck, Austria
| | | |
Collapse
|