Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 199 (from Reference Citation Analysis)

Article PDFs: 64

Cited by > 0: 140

Searched Name: information retrieval

Theile CM, Beall AL. Conducting a Systematic Review of the Literature. J Dent Hyg 2024;98:51-56. [PMID: 38649289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/12/2024] [Accepted: 03/25/2024] [Indexed: 04/25/2024]

Kernan Freire S, Wang C, Foosherian M, Wellsandt S, Ruiz-Arenas S, Niforatos E. Knowledge sharing in manufacturing using LLM-powered tools: user study and model benchmarking. Front Artif Intell 2024;7:1293084. [PMID: 38601111 PMCID: PMC11004332 DOI: 10.3389/frai.2024.1293084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/12/2023] [Accepted: 03/14/2024] [Indexed: 04/12/2024] Open

Gharavi E, LeRoy NJ, Zheng G, Zhang A, Brown DE, Sheffield NC. Joint Representation Learning for Retrieval and Annotation of Genomic Interval Sets. Bioengineering (Basel) 2024;11:263. [PMID: 38534537 DOI: 10.3390/bioengineering11030263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/14/2023] [Revised: 02/20/2024] [Accepted: 02/22/2024] [Indexed: 03/28/2024] Open

Affiliation(s)

Erfaneh Gharavi: Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA; School of Data Science, University of Virginia, Charlottesville, VA 22904, USA
Nathan J LeRoy: Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA; Department of Biomedical Engineering, School of Medicine, University of Virginia, Charlottesville, VA 22904, USA
Guangtao Zheng: Department of Computer Science, School of Engineering, University of Virginia, Charlottesville, VA 22908, USA
Aidong Zhang: School of Data Science, University of Virginia, Charlottesville, VA 22904, USA; Department of Biomedical Engineering, School of Medicine, University of Virginia, Charlottesville, VA 22904, USA; Department of Computer Science, School of Engineering, University of Virginia, Charlottesville, VA 22908, USA
Donald E Brown: School of Data Science, University of Virginia, Charlottesville, VA 22904, USA; Department of Systems and Information Engineering, University of Virginia, Charlottesville, VA 22908, USA
Nathan C Sheffield: Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA; School of Data Science, University of Virginia, Charlottesville, VA 22904, USA; Department of Biomedical Engineering, School of Medicine, University of Virginia, Charlottesville, VA 22904, USA; Department of Computer Science, School of Engineering, University of Virginia, Charlottesville, VA 22908, USA; Department of Public Health Sciences, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA; Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA; Child Health Research Center, School of Medicine, University of Virginia, Charlottesville, VA 22908, USA

Escobar-Liquitay CM, Vergara-Merino L, Verdejo C, Kirmayr M, Schuller-Martínez B, Madrid E, Meza N, Bracchiglione J, Franco JVA. Methodological and users' surveys on the use of the LILACS database in Cochrane reviews identified desirable improvements to the database. Health Info Libr J 2024;41:76-83. [PMID: 37574776 DOI: 10.1111/hir.12505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/28/2021] [Revised: 05/14/2023] [Accepted: 07/25/2023] [Indexed: 08/15/2023]

Theile CM, Beall AL. Narrative Reviews of the Literature: An overview. J Dent Hyg 2024;98:78-82. [PMID: 38346895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2023] [Accepted: 01/17/2024] [Indexed: 02/15/2024]

Zare-Farashbandi E, Adibi P, Zare-Farashbandi F. Retrieving Rare Cases: A Protocol for Searching Complex Medical Cases. Med Ref Serv Q 2024;43:15-25. [PMID: 38237019 DOI: 10.1080/02763869.2024.2289797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/23/2024]

Liu S, Bourgeois FT, Narang C, Dunn AG. A comparison of machine learning methods to find clinical trials for inclusion in new systematic reviews from their PROSPERO registrations prior to searching and screening. Res Synth Methods 2024;15:73-85. [PMID: 37749068 PMCID: PMC10872991 DOI: 10.1002/jrsm.1672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2022] [Revised: 08/13/2023] [Accepted: 09/08/2023] [Indexed: 09/27/2023]

Wang G, Gao K, Liu Q, Wu Y, Zhang K, Zhou W, Guo C. Potential and Limitations of ChatGPT 3.5 and 4.0 as a Source of COVID-19 Information: Comprehensive Comparative Analysis of Generative and Authoritative Information. J Med Internet Res 2023;25:e49771. [PMID: 38096014 PMCID: PMC10755661 DOI: 10.2196/49771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2023] [Revised: 10/01/2023] [Accepted: 11/16/2023] [Indexed: 12/18/2023] Open

Abstract

BACKGROUND

The COVID-19 pandemic, caused by the SARS-CoV-2 virus, has necessitated reliable and authoritative information for public guidance. The World Health Organization (WHO) has been a primary source of such information, disseminating it through a question and answer format on its official website. Concurrently, ChatGPT 3.5 and 4.0, deep learning-based natural language generation systems, have shown potential in generating diverse text types based on user input.

OBJECTIVE

This study evaluates the accuracy of COVID-19 information generated by ChatGPT 3.5 and 4.0, assessing their potential as a supplementary public information source during the pandemic.

METHODS

We extracted 487 COVID-19-related questions from the WHO's official website and used ChatGPT 3.5 and 4.0 to generate corresponding answers. These generated answers were then compared against the official WHO responses for evaluation. Two clinical experts scored the generated answers on a scale of 0-5 across 4 dimensions (accuracy, comprehensiveness, relevance, and clarity), with higher scores indicating better performance in each dimension. The WHO responses served as the reference for this assessment. Additionally, we used the BERT (Bidirectional Encoder Representations from Transformers) model to generate similarity scores (0-1) between the generated and official answers, providing a dual validation mechanism.

RESULTS

The mean (SD) scores for ChatGPT 3.5-generated answers were 3.47 (0.725) for accuracy, 3.89 (0.719) for comprehensiveness, 4.09 (0.787) for relevance, and 3.49 (0.809) for clarity. For ChatGPT 4.0, the mean (SD) scores were 4.15 (0.780), 4.47 (0.641), 4.56 (0.600), and 4.09 (0.698), respectively. All differences were statistically significant (P<.001), with ChatGPT 4.0 outperforming ChatGPT 3.5. The BERT model verification showed mean (SD) similarity scores of 0.83 (0.07) for ChatGPT 3.5 and 0.85 (0.07) for ChatGPT 4.0 compared with the official WHO answers.

CONCLUSIONS

ChatGPT 3.5 and 4.0 can generate accurate and relevant COVID-19 information to a certain extent. However, compared with official WHO responses, gaps and deficiencies exist. Thus, users of ChatGPT 3.5 and 4.0 should also reference other reliable information sources to mitigate potential misinformation risks. Notably, ChatGPT 4.0 outperformed ChatGPT 3.5 across all evaluated dimensions, a finding corroborated by BERT model validation.

McDonald S, Hill K, Li HZ, Turner T. Evidence surveillance for a living clinical guideline: Case study of the Australian stroke guidelines. Health Info Libr J 2023. [PMID: 37942888 DOI: 10.1111/hir.12515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2023] [Revised: 07/26/2023] [Accepted: 10/26/2023] [Indexed: 11/10/2023]

Sutton A, O'Keefe H, Johnson EE, Marshall C. A mapping exercise using automated techniques to develop a search strategy to identify systematic review tools. Res Synth Methods 2023;14:874-881. [PMID: 37669905 DOI: 10.1002/jrsm.1665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/10/2022] [Revised: 07/31/2023] [Accepted: 08/04/2023] [Indexed: 09/07/2023]

Hickner A. How do search systems impact systematic searching? A qualitative study. J Med Libr Assoc 2023;111:774-782. [PMID: 37928121 PMCID: PMC10621724 DOI: 10.5195/jmla.2023.1647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/07/2023] Open

Wu DTY, Hanauer D, Murdock P, Vydiswaran VGV, Mei Q, Zheng K. Developing a Semantically Based Query Recommendation for an Electronic Medical Record Search Engine: Query Log Analysis and Design Implications. JMIR Form Res 2023;7:e45376. [PMID: 37713239 PMCID: PMC10541636 DOI: 10.2196/45376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/28/2022] [Revised: 07/19/2023] [Accepted: 08/03/2023] [Indexed: 09/16/2023] Open

Abstract

BACKGROUND

An effective and scalable information retrieval (IR) system plays a crucial role in enabling clinicians and researchers to harness the valuable information present in electronic health records. In a previous study, we developed a prototype medical IR system, which incorporated a semantically based query recommendation (SBQR) feature. The system was evaluated empirically and demonstrated high perceived performance by end users. To delve deeper into the factors contributing to this perceived performance, we conducted a follow-up study using query log analysis.

OBJECTIVE

One of the primary challenges faced in IR is that users often have limited knowledge regarding their specific information needs. Consequently, an IR system, particularly its user interface, needs to be thoughtfully designed to assist users through the iterative process of refining their queries as they encounter relevant documents during their search. To address these challenges, we incorporated "query recommendation" into our Electronic Medical Record Search Engine (EMERSE), drawing inspiration from the success of similar features in modern IR systems for general purposes.

METHODS

The query log data analyzed in this study were collected during our previous experimental study, where we developed EMERSE with the SBQR feature. We implemented a logging mechanism to capture user query behaviors and the output of the IR system (retrieved documents). In this analysis, we compared the initial query entered by users with the query formulated with the assistance of the SBQR. By examining the results of this comparison, we could examine whether the use of SBQR helped in constructing improved queries that differed from the original ones.

RESULTS

Our findings revealed that the first query entered without SBQR and the final query with SBQR assistance were highly similar (Jaccard similarity coefficient=0.77). This suggests that the perceived positive performance of the system was primarily attributed to the automatic query expansion facilitated by the SBQR rather than users manually manipulating their queries. In addition, through entropy analysis, we observed that search results converged in scenarios of moderate difficulty, and the degree of convergence correlated strongly with the perceived system performance.
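For readers unfamiliar with the metric, the Jaccard similarity coefficient reported above can be sketched as follows. This is a minimal illustration that assumes queries are compared as sets of lowercase, whitespace-separated terms; the abstract does not state how the study tokenized queries, and the example queries are hypothetical.

```python
def jaccard_similarity(query_a: str, query_b: str) -> float:
    """Jaccard coefficient |A ∩ B| / |A ∪ B| over sets of lowercase terms.

    Assumption: terms are split on whitespace; the study's actual
    tokenization is not described in the abstract.
    """
    terms_a = set(query_a.lower().split())
    terms_b = set(query_b.lower().split())
    if not terms_a and not terms_b:
        return 1.0  # convention chosen here: two empty queries count as identical
    return len(terms_a & terms_b) / len(terms_a | terms_b)


# Hypothetical initial and final queries (not taken from the study's logs).
initial_query = "chest pain troponin"
final_query = "chest pain troponin elevated"

# 3 shared terms out of 4 distinct terms overall.
print(jaccard_similarity(initial_query, final_query))  # 0.75
```

A value near 1 means the two queries share most of their terms, which is how a coefficient of 0.77 supports the reading that users changed their queries only slightly.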

CONCLUSIONS

The study demonstrated the potential contribution of the SBQR in shaping participants' positive perceptions of system performance, contingent upon the difficulty of the search scenario. Medical IR systems should therefore consider incorporating an SBQR as a user-controlled option or a semiautomated feature. Future work entails redesigning the experiment in a more controlled manner and conducting multisite studies to demonstrate the effectiveness of EMERSE with SBQR for patient cohort identification. By further exploring and validating these findings, we can enhance the usability and functionality of medical IR systems in real-world settings.

Siglen E, Vetti HH, Augestad M, Steen VM, Lunde Å, Bjorvatn C. Evaluation of the Rosa Chatbot Providing Genetic Information to Patients at Risk of Hereditary Breast and Ovarian Cancer: Qualitative Interview Study. J Med Internet Res 2023;25:e46571. [PMID: 37656502 PMCID: PMC10504626 DOI: 10.2196/46571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2023] [Revised: 06/27/2023] [Accepted: 07/20/2023] [Indexed: 09/02/2023] Open

Abstract

BACKGROUND

Genetic testing has become an integrated part of health care for patients with breast or ovarian cancer, and the increasing demand for genetic testing is accompanied by an increasing need for easy access to reliable genetic information for patients. Therefore, we developed a chatbot app (Rosa) that is able to perform humanlike digital conversations about genetic BRCA testing.

OBJECTIVE

Before implementing this new information service in daily clinical practice, we wanted to explore 2 aspects of chatbot use: the perceived utility and trust in chatbot technology among healthy patients at risk of hereditary cancer and how interaction with a chatbot regarding sensitive information about hereditary cancer influences patients.

METHODS

Overall, 175 healthy individuals at risk of hereditary breast and ovarian cancer were invited to test the chatbot, Rosa, before and after genetic counseling. To secure a varied sample, participants were recruited from all cancer genetic clinics in Norway, and the selection was based on age, gender, and risk of having a BRCA pathogenic variant. Among the 34.9% (61/175) of participants who consented to an individual interview, a selected subgroup (16/61, 26%) shared their experience through in-depth interviews via video. The semistructured interviews covered the following topics: usability, perceived usefulness, trust in the information received via the chatbot, how Rosa influenced the user, and thoughts about future use of digital tools in health care. The transcripts were analyzed using the stepwise-deductive inductive approach.

RESULTS

The overall finding was that the chatbot was very welcomed by the participants. They appreciated the 24/7 availability wherever they were and the possibility to use it to prepare for genetic counseling and to repeat and ask questions about what had been said afterward. As Rosa was created by health care professionals, they also valued the information they received as being medically correct. Rosa was referred to as being better than Google because it provided specific and reliable answers to their questions. The findings were summed up in 3 concepts: "Anytime, anywhere"; "In addition, not instead"; and "Trustworthy and true." All participants (16/16) denied increased worry after reading about genetic testing and hereditary breast and ovarian cancer in Rosa.

CONCLUSIONS

Our results indicate that a genetic information chatbot has the potential to contribute to easy access to uniform information for patients at risk of hereditary breast and ovarian cancer, regardless of geographical location. The 24/7 availability of quality-assured information, tailored to the specific situation, had a reassuring effect on our participants. It was consistent across concepts that Rosa was a tool for preparation and repetition; however, none of the participants (0/16) supported that Rosa could replace genetic counseling if hereditary cancer was confirmed. This indicates that a chatbot can be a well-suited digital companion to genetic counseling.

Inau ET, Sack J, Waltemath D, Zeleke AA. Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review. J Med Internet Res 2023;25:e45013. [PMID: 37639292 PMCID: PMC10495848 DOI: 10.2196/45013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/13/2022] [Revised: 03/25/2023] [Accepted: 04/14/2023] [Indexed: 08/29/2023] Open

Abstract

BACKGROUND

Thorough data stewardship is a key enabler of comprehensive health research. Processes such as data collection, storage, access, sharing, and analytics require researchers to follow elaborate data management strategies properly and consistently. Studies have shown that findable, accessible, interoperable, and reusable (FAIR) data leads to improved data sharing in different scientific domains.

OBJECTIVE

This scoping review identifies and discusses concepts, approaches, implementation experiences, and lessons learned in FAIR initiatives in health research data.

METHODS

The Arksey and O'Malley stage-based methodological framework for scoping reviews was applied. PubMed, Web of Science, and Google Scholar were searched to access relevant publications. Articles written in English, published between 2014 and 2020, and addressing FAIR concepts or practices in the health domain were included. Results from the 3 data sources were deduplicated using reference management software. In total, 2 independent authors reviewed the eligibility of each article based on defined inclusion and exclusion criteria. A charting tool was used to extract information from the full-text papers. The results were reported using the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines.

RESULTS

A total of 2.18% (34/1561) of the screened articles were included in the final review. The authors reported FAIRification approaches, which include interpolation, inclusion of comprehensive data dictionaries, repository design, semantic interoperability, ontologies, data quality, linked data, and requirement gathering for FAIRification tools. Challenges and mitigation strategies associated with FAIRification, such as high setup costs, data politics, technical and administrative issues, privacy concerns, and difficulties encountered in sharing health data despite its sensitive nature were also reported. We found various workflows, tools, and infrastructures designed by different groups worldwide to facilitate the FAIRification of health research data. We also uncovered a wide range of problems and questions that researchers are trying to address by using the different workflows, tools, and infrastructures. Although the concept of FAIR data stewardship in the health research domain is relatively new, almost all continents have been reached by at least one network trying to achieve health data FAIRness. Documented outcomes of FAIRification efforts include peer-reviewed publications, improved data sharing, facilitated data reuse, return on investment, and new treatments. Successful FAIRification of data has informed the management and prognosis of various diseases such as cancer, cardiovascular diseases, and neurological diseases. Efforts to FAIRify data on a wider variety of diseases have been ongoing since the COVID-19 pandemic.

CONCLUSIONS

This work summarises projects, tools, and workflows for the FAIRification of health research data. The comprehensive review shows that implementing the FAIR concept in health data stewardship carries the promise of improved research data management and transparency in the era of big data and open research publishing.

INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID)

RR2-10.2196/22505.

El-Khatib Z, Richter L, Reich A, Benka B, Assadian O. Implementation of a Surveillance System for Severe Acute Respiratory Infections at a Tertiary Care Hospital in Austria: Protocol for a Retrospective Longitudinal Feasibility Study. JMIR Res Protoc 2023;12:e47547. [PMID: 37535414 PMCID: PMC10436110 DOI: 10.2196/47547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/23/2023] [Revised: 05/31/2023] [Accepted: 06/14/2023] [Indexed: 08/04/2023] Open

Abstract

BACKGROUND

The risk of a large number of severe acute respiratory infection (SARI) cases emerging is a global concern. SARI can overwhelm the health care capacity and cause several deaths. Therefore, the Austrian Agency for Health and Food Safety will explore the feasibility of implementing an automatic electronically based SARI surveillance system at a tertiary care hospital in Austria as part of the hospital network, initiated by the European Centre for Disease Prevention and Control.

OBJECTIVE

We aim to investigate the availability of routinely collected health record data pertaining to respiratory infections and the optimal approach to use such available data for systematic surveillance of SARI in a real-world setting, describe the characteristics of patients with SARI before and after the beginning of the COVID-19 pandemic, and investigate the feasibility of identifying the risk factors for a severe outcome (intensive care unit admission or death) in patients with SARI.

METHODS

We will test the feasibility of a surveillance system, as part of a large European network, at a tertiary care hospital in the province of Lower Austria (called Regional Hospital Wiener Neustadt). It will be a cross-sectional study for the inventory of the electronic data records and implementation of automatic data retrieval for the period of January 2019 through the end of December 2022. The analysis will include an exploration of the database structure, descriptive analysis of the general characteristics of the patients with SARI, estimation of the SARI incidence rate, and assessment of the risk factors and different levels of severity of patients with SARI using logistic regression analysis.

RESULTS

This will be the first study to assess the feasibility of SARI surveillance at a large 800-bed tertiary care hospital in Austria. It will provide a general overview of the potential for establishing a hospital-based surveillance system for SARI. In addition, if successful, the electronic surveillance will be able to improve the response to early warning signs of new SARI, which will better inform policy makers in strengthening the surveillance system.

CONCLUSIONS

The findings will support the expansion of the SARI hospital-based surveillance system to other hospitals in Austria. This network will be of use to Austria in preparing for future pandemics.

INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID)

PRR1-10.2196/47547.

Chen E, Bullard J, Giustini D. Automated indexing using NLM's Medical Text Indexer (MTI) compared to human indexing in Medline: a pilot study. J Med Libr Assoc 2023;111:684-694. [PMID: 37483360 PMCID: PMC10361558 DOI: 10.5195/jmla.2023.1588] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 07/25/2023] Open

Gendrin A, Souliotis L, Loudon-Griffiths J, Aggarwal R, Amoako D, Desouza G, Dimitrievska S, Metcalfe P, Louvet E, Sahni H. Identifying Patient Populations in Texts Describing Drug Approvals Through Deep Learning-Based Information Extraction: Development of a Natural Language Processing Algorithm. JMIR Form Res 2023;7:e44876. [PMID: 37347514 DOI: 10.2196/44876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/07/2022] [Revised: 03/30/2023] [Accepted: 04/17/2023] [Indexed: 06/23/2023] Open

Abstract

BACKGROUND

New drug treatments are regularly approved, and it is challenging to remain up-to-date in this rapidly changing environment. Fast and accurate visualization is important to allow a global understanding of the drug market. Automation of this information extraction provides a helpful starting point for the subject matter expert, helps to mitigate human errors, and saves time.

OBJECTIVE

We aimed to semiautomate disease population extraction from the free text of oncology drug approval descriptions from the BioMedTracker database for 6 selected drug targets. More specifically, we intended to extract (1) line of therapy, (2) stage of cancer of the patient population described in the approval, and (3) the clinical trials that provide evidence for the approval. We aimed to use these results in downstream applications, aiding the searchability of relevant content against related drug project sources.

METHODS

We fine-tuned a state-of-the-art deep learning model, Bidirectional Encoder Representations from Transformers, for each of the 3 desired outputs. We independently applied rule-based text mining approaches. We compared the performances of deep learning and rule-based approaches and selected the best method, which was then applied to new entries. The results were manually curated by a subject matter expert and then used to train new models.

RESULTS

The training data set is currently small (433 entries) and will enlarge over time when new approval descriptions become available or if a choice is made to take another drug target into account. The deep learning models achieved 61% and 56% 5-fold cross-validated accuracies for line of therapy and stage of cancer, respectively, which were treated as classification tasks. Trial identification is treated as a named entity recognition task, and the 5-fold cross-validated F₁-score is currently 87%. Although the scores of the classification tasks could seem low, the models comprise 5 classes each, and such scores are a marked improvement when compared to random classification. Moreover, we expect improved performance as the input data set grows, since deep learning models need to be trained on a large enough amount of data to be able to learn the task they are taught. The rule-based approach achieved 60% and 74% 5-fold cross-validated accuracies for line of therapy and stage of cancer, respectively. No attempt was made to define a rule-based approach for trial identification.
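To make the comparison with random classification concrete, the sketch below computes the chance-level accuracy for a 5-class task and an F₁-score from confusion counts. It assumes balanced classes for the chance baseline, which the abstract does not confirm, and the counts passed to `f1_from_counts` are hypothetical numbers chosen only to show the arithmetic.

```python
def chance_accuracy(num_classes: int) -> float:
    """Expected accuracy of uniform random guessing.

    Assumption: classes are balanced; with skewed classes the
    baseline would differ.
    """
    if num_classes < 1:
        raise ValueError("num_classes must be at least 1")
    return 1.0 / num_classes


def f1_from_counts(true_pos: int, false_pos: int, false_neg: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall."""
    denominator = 2 * true_pos + false_pos + false_neg
    if denominator == 0:
        return 0.0  # no positives predicted or present; F1 defined as 0 here
    return 2 * true_pos / denominator


# Five classes, as stated in the abstract: random guessing gives 20%.
print(chance_accuracy(5))  # 0.2

# Hypothetical entity counts (not from the study) that yield an F1 of 0.87.
print(f1_from_counts(true_pos=87, false_pos=13, false_neg=13))  # 0.87
```

Under the balanced-class assumption, the reported accuracies of 56% to 74% sit well above the 20% chance level, which is the sense in which the abstract calls them a marked improvement.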

CONCLUSIONS

We developed a natural language processing algorithm that is currently assisting subject matter experts in disease population extraction, which supports health authority approvals. This algorithm achieves semiautomation, enabling subject matter experts to leverage the results for deeper analysis and to accelerate information retrieval in a crowded clinical environment such as oncology.

Upadhyay R, Knoth P, Pasi G, Viviani M. Explainable online health information truthfulness in Consumer Health Search. Front Artif Intell 2023;6:1184851. [PMID: 37415938 PMCID: PMC10321772 DOI: 10.3389/frai.2023.1184851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/12/2023] [Accepted: 05/30/2023] [Indexed: 07/08/2023] Open

Abstract

Introduction

People today increasingly rely on health information they find online to make decisions that may affect both their physical and mental wellbeing. Therefore, there is a growing need for systems that can assess the truthfulness of such health information. Most solutions in the current literature use machine learning or knowledge-based approaches that treat the problem as a binary classification task, discriminating between correct information and misinformation. Such solutions present several problems with regard to user decision making, including: (i) the binary classification task provides users with just two predetermined possibilities with respect to the truthfulness of the information, which users must take for granted; and (ii) the processes by which the results were obtained are often opaque, and the results themselves offer little or no interpretation.

Methods

To address these issues, we approach the problem as an ad hoc retrieval task rather than a classification task, with reference, in particular, to the Consumer Health Search task. To do this, a previously proposed Information Retrieval model, which considers information truthfulness as a dimension of relevance, is used to obtain a ranked list of both topically-relevant and truthful documents. The novelty of this work concerns the extension of such a model with a solution for the explainability of the results obtained, by relying on a knowledge base consisting of scientific evidence in the form of medical journal articles.
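One way to picture truthfulness acting as a dimension of relevance is a ranking that blends two per-document scores. The sketch below is an illustrative assumption only: the weighted linear combination, the `truth_weight` parameter, and the scores are hypothetical and are not taken from the model the authors used.

```python
from typing import List, NamedTuple


class ScoredDocument(NamedTuple):
    doc_id: str
    topical: float        # topical relevance score in [0, 1] (hypothetical)
    truthfulness: float   # truthfulness score in [0, 1] (hypothetical)


def rank_documents(docs: List[ScoredDocument], truth_weight: float = 0.5) -> List[ScoredDocument]:
    """Sort documents by a weighted blend of topical relevance and truthfulness.

    Assumption: a simple linear combination; the cited model may
    aggregate the two dimensions differently.
    """
    def combined(doc: ScoredDocument) -> float:
        return (1.0 - truth_weight) * doc.topical + truth_weight * doc.truthfulness

    return sorted(docs, key=combined, reverse=True)


docs = [
    ScoredDocument("A", topical=0.9, truthfulness=0.2),
    ScoredDocument("B", topical=0.7, truthfulness=0.8),
    ScoredDocument("C", topical=0.4, truthfulness=0.9),
]

# With equal weights, the on-topic but untruthful document A drops to last place.
print([d.doc_id for d in rank_documents(docs)])  # ['B', 'C', 'A']
```

Unlike a binary label, a ranked list of this kind lets the user see graded results, which is the motivation the abstract gives for framing the task as retrieval.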

Results and discussion

We evaluate the proposed solution both quantitatively, as a standard classification task, and qualitatively, through a user study to examine the "explained" ranked list of documents. The results obtained illustrate the solution's effectiveness and usefulness in making the retrieved results more interpretable by Consumer Health Searchers, both with respect to topical relevance and truthfulness.

Khan MA, Mowforth OD, Kuhn I, Kotter MRN, Davies BM. Development of a validated search filter for Ovid Embase for degenerative cervical myelopathy. Health Info Libr J 2023;40:181-189. [PMID: 34409722 DOI: 10.1111/hir.12373] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 06/10/2020] [Revised: 03/14/2021] [Accepted: 04/14/2021] [Indexed: 12/17/2022]

Lemenkova P, Debeir O. Multispectral Satellite Image Analysis for Computing Vegetation Indices by R in the Khartoum Region of Sudan, Northeast Africa. J Imaging 2023;9:98. [PMID: 37233317 DOI: 10.3390/jimaging9050098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/23/2023] [Revised: 05/04/2023] [Accepted: 05/08/2023] [Indexed: 05/27/2023] Open

Banerjee A, Banik P, Wörndl W. A review on individual and multistakeholder fairness in tourism recommender systems. Front Big Data 2023;6:1168692. [PMID: 37234689 PMCID: PMC10206003 DOI: 10.3389/fdata.2023.1168692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2023] [Accepted: 04/18/2023] [Indexed: 05/28/2023] Open

Brody S, Loree S, Sampson M, Mensinkai S, Coffman J, Mueller MH, Askin N, Hamill C, Wilson E, McAteer MB, Staines H. Searching for evidence in public health emergencies: a white paper of best practices. J Med Libr Assoc 2023;111:566-578. [PMID: 37312802 PMCID: PMC10259619 DOI: 10.5195/jmla.2023.1530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/15/2023] Open

Abstract

Objectives

Information professionals have supported medical providers, administrators and decision-makers, and guideline creators in the COVID-19 response. Searching COVID-19 literature presented new challenges, including the volume and heterogeneity of literature and the proliferation of new information sources, and exposed existing issues in metadata and publishing. An expert panel developed best practices, including recommendations, elaborations, and examples, for searching during public health emergencies.

Methods

Project directors and advisors developed core elements from experience and literature. Experts, identified by affiliation with evidence synthesis groups, COVID-19 search experience, and nomination, responded to an online survey to reach consensus on core elements. Expert participants provided written responses to guiding questions. A synthesis of responses provided the foundation for focus group discussions. A writing group then drafted the best practices into a statement. Experts reviewed the statement prior to dissemination.

Results

Twelve information professionals contributed to best practice recommendations on six elements: core resources, search strategies, publication types, transparency and reproducibility, collaboration, and conducting research. Underlying principles across recommendations include timeliness, openness, balance, preparedness, and responsiveness.

Conclusions

The authors and experts anticipate the recommendations for searching for evidence during public health emergencies will help information specialists, librarians, evidence synthesis groups, researchers, and decision-makers respond to future public health emergencies, including but not limited to disease outbreaks. The recommendations complement existing guidance by addressing concerns specific to emergency response. The statement is intended as a living document. Future revisions should solicit input from a broader community and reflect conclusions of meta-research on COVID-19 and health emergencies.

Affiliation(s)

Stacy Brody, Reference & Instruction Librarian, Himmelfarb Health Sciences Library, George Washington University, School of Medicine and Health Sciences, Washington, DC, United States
Sara Loree, Medical Library Manager, St. Luke's Health System, ID, United States
Margaret Sampson, Children's Hospital of Eastern Ontario Research Institute, Ottawa, ON, Canada
Shaila Mensinkai, Librarian Reserve Corps, Canada
Jennifer Coffman, Science and Engineering Research Librarian, University of Virginia, Charlottesville, VA, United States
Mark Heinrich Mueller, Saskatchewan Health Authority, Health Sciences Library, Regina, SK, Canada
Nicole Askin, WRHA Virtual Library, University of Manitoba, Winnipeg, MB, Canada
Cheryl Hamill, South and East Metropolitan Health Services, Perth, Australia
Emma Wilson, The University of Edinburgh, Centre for Clinical Brain Sciences, Edinburgh, Scotland
Mary Beth McAteer, Virginia Mason Medical Center, Jones Learning Center, Seattle, WA, United States
Heather Staines, Delta Think, Philadelphia, PA, United States
Best Practices for Searching During Public Health Emergencies Working Group: Cheryl Hamill, FALIA, AALIA (CP) Health, 0000-0002-6069-1806, South and East Metropolitan Health Services, Perth, Australia; Maureen Dobbins, RN, PhD, 0000-0002-1968-6765, McMaster University, Canada; Amy M Claussen, MLIS, 0000-0003-3996-1055, University of Minnesota, United States; Kavita Umesh Kothari, MPH, 0000-0002-0759-5225, Health Information Consultant, Kobe, Japan; Caroline De Brún, PhD, 0000-0002-5185-0043, UK Health Security Agency, United Kingdom; Sarah Young, 0000-0002-8301-5106, Carnegie Mellon University, United States; Sarah E Neil-Sztramko, PhD, 0000-0002-9600-3403, McMaster University, Canada; Shaila Mensinkai, MA, MLIS, Librarian Reserve Corps, Canada; Emma Wilson, 0000-0002-8100-7508, The University of Edinburgh, Scotland; Robin M Featherstone, MLIS, 0000-0003-2517-2258, CADTH Canadian Agency for Drugs and Technologies in Health (present affiliation); Cochrane Central Executive Team (sponsor), Toronto, Canada; Margaret Sampson, MLIS, PhD, AHIP, 0000-0003-2550-9893, Children's Hospital of Eastern Ontario Research Institute, Canada; Heather Staines, PhD, MA, 0000-0003-3876-1182, Delta Think, United States; Martha Knuth, MLIS, 0000-0003-4264-1642, Centers for Disease Control and Prevention, United States

Teitz J, Sander J, Sarker H, Fernandez-Patron C. Potential of dissimilarity measure-based computation of protein thermal stability data for determining protein interactions. Brief Bioinform 2023;24:7126339. [PMID: 37068306 DOI: 10.1093/bib/bbad143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2022] [Revised: 03/02/2023] [Accepted: 03/18/2023] [Indexed: 04/19/2023] Open

Rosonovski S, Levchenko M, Ide-Smith M, Faulk L, Harrison M, McEntyre J. Searching and Evaluating Publications and Preprints Using Europe PMC. Curr Protoc 2023;3:e694. [PMID: 36946755 DOI: 10.1002/cpz1.694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/23/2023]

Abstract

In the field of life sciences there is a growing need for literature analysis tools that help scientists tackle information overload. Europe PubMed Central (Europe PMC), a partner of PubMed Central (PMC; National Library of Medicine, 2022), is an open access database of over 41 million life science publications and preprints, enriched with supporting data, reviews, protocols, and other relevant resources. Europe PMC is a trusted repository of choice for many life science funders (Europe PMC, 2022a), offering a suite of innovative search tools that allow users to search and evaluate the literature, including finding highly cited articles, preprints with community peer reviews, or papers referencing a proteomics dataset in the figure legend. In addition, Europe PMC utilizes text-mining to help researchers identify key terms and find data and evidence in the literature. First-time users often do not utilize the wealth of tools Europe PMC offers and can feel overwhelmed about how to perform the most effective search. This protocol, describing how to search and evaluate publications and preprints using Europe PMC, demonstrates how to carry out more efficient and effective literature searches using the tools provided by Europe PMC. This includes discovering the latest findings on a research topic, following research from a specific author, journal, or preprint server, exploring literature on a new method, expanding your reading list with relevant articles, as well as accessing and evaluating publications and preprints of interest. © 2023 EMBL-EBI. Current Protocols published by Wiley Periodicals LLC. 
Basic Protocol 1: Finding articles and preprints on a topic of interest
Basic Protocol 2: Accessing an article
Basic Protocol 3: Browsing the article
Basic Protocol 4: Evaluating the article
Basic Protocol 5: Refining search results
Basic Protocol 6: Finding research by author
Basic Protocol 7: Finding a specific article
Basic Protocol 8: Finding information about a methodology
Basic Protocol 9: Finding evidence of biological interactions, relations, and modifications
Basic Protocol 10: Finding data behind a publication
Basic Protocol 11: Expanding a reading list and building a bibliography
Basic Protocol 12: Staying on top of the current literature
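
The protocols above describe searches run through the Europe PMC website. As a hedged sketch of a programmatic equivalent, the snippet below builds a query URL for the Europe PMC RESTful search service. This is an addition, not part of the published protocol: the `build_search_url` helper is hypothetical, and the field syntax used in the example query (`AUTH:` for author, `SRC:PPR` for preprints) should be checked against the current Europe PMC documentation before use. The code only constructs the URL and does not send a request.

```python
from urllib.parse import urlencode

# Search endpoint of the Europe PMC RESTful web service.
BASE_URL = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def build_search_url(query, page_size=25):
    """Return a search URL asking for JSON results (hypothetical helper)."""
    params = {"query": query, "format": "json", "pageSize": page_size}
    return f"{BASE_URL}?{urlencode(params)}"


# Example in the spirit of Basic Protocol 6: preprints by a given author.
url = build_search_url('AUTH:"McEntyre J" AND SRC:PPR')
```

The resulting URL can be opened in a browser or fetched with any HTTP client; keeping URL construction separate from the network call makes the query easy to inspect and log.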

Pallath A, Zhang Q. Paperfetcher: A tool to automate handsearching and citation searching for systematic reviews. Res Synth Methods 2023;14:323-335. [PMID: 36260090 DOI: 10.1002/jrsm.1604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/02/2022] [Revised: 08/04/2022] [Accepted: 08/20/2022] [Indexed: 11/09/2022]

Oh IY, Schindler SE, Ghoshal N, Lai AM, Payne PRO, Gupta A. Extraction of clinical phenotypes for Alzheimer's disease dementia from clinical notes using natural language processing. JAMIA Open 2023;6:ooad014. [PMID: 36844369 PMCID: PMC9952043 DOI: 10.1093/jamiaopen/ooad014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/18/2022] [Revised: 01/27/2023] [Accepted: 02/10/2023] [Indexed: 02/28/2023] Open

Munarko Y, Rampadarath A, Nickerson DP. CASBERT: BERT-based retrieval for compositely annotated biosimulation model entities. Front Bioinform 2023;3:1107467. [PMID: 36865672 PMCID: PMC9971925 DOI: 10.3389/fbinf.2023.1107467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/24/2022] [Accepted: 01/31/2023] [Indexed: 02/16/2023] Open

Abstract

Maximising FAIRness of biosimulation models requires a comprehensive description of model entities such as reactions, variables, and components. The COmputational Modeling in BIology NEtwork (COMBINE) community encourages the use of Resource Description Framework (RDF) with composite annotations that semantically involve ontologies to ensure completeness and accuracy. These annotations help scientists find models or detailed information to inform further reuse, such as model composition, reproduction, and curation. SPARQL has been recommended as a key standard for accessing semantic annotations in RDF, which helps retrieve entities precisely. However, SPARQL is unsuitable for most repository users, who explore biosimulation models freely without adequate knowledge of ontologies, RDF structure, and SPARQL syntax. We propose here a text-based information retrieval approach, CASBERT, that is easy to use and can present candidate relevant entities from models across a repository's contents. CASBERT adapts Bidirectional Encoder Representations from Transformers (BERT), where each composite annotation about an entity is converted into an entity embedding for subsequent storage in a list of entity embeddings. For entity lookup, a query is transformed into a query embedding and compared to the entity embeddings, and the entities are then displayed in order of similarity. The list structure makes it possible to implement CASBERT as an efficient search engine product, with inexpensive addition, modification, and insertion of entity embeddings. To demonstrate and test CASBERT, we created a test dataset of query-entity pairs from the Physiome Model Repository and a static export of the BioModels database. Measured using Mean Average Precision and Mean Reciprocal Rank, we found that our approach can perform better than the traditional bag-of-words method.
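
The lookup and evaluation steps described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the CASBERT implementation: the two-dimensional vectors stand in for BERT embeddings, cosine similarity is assumed as the similarity measure (the abstract does not name one), and all function names and data are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def search(query_vec, entity_embeddings):
    """Return entity ids ordered by descending similarity to the query."""
    scored = [(cosine(query_vec, vec), eid) for eid, vec in entity_embeddings]
    return [eid for _, eid in sorted(scored, key=lambda p: p[0], reverse=True)]


def reciprocal_rank(ranking, relevant):
    """1 / rank of the first relevant entity, or 0.0 if none is retrieved."""
    for i, eid in enumerate(ranking, start=1):
        if eid in relevant:
            return 1.0 / i
    return 0.0


def average_precision(ranking, relevant):
    """Mean of precision values at the ranks where relevant entities occur."""
    hits, total = 0, 0.0
    for i, eid in enumerate(ranking, start=1):
        if eid in relevant:
            hits += 1
            total += hits / i
    return total / len(relevant) if relevant else 0.0


# Toy list of (entity id, embedding) pairs standing in for stored embeddings.
entities = [("e1", [1.0, 0.0]), ("e2", [0.0, 1.0]), ("e3", [1.0, 1.0])]
ranking = search([1.0, 0.2], entities)
```

Mean Reciprocal Rank and Mean Average Precision are the averages of `reciprocal_rank` and `average_precision` over all queries in a test set.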

Munarko Y, Rampadarath A, Nickerson D. Building a search tool for compositely annotated entities using Transformer-based approach: Case study in Biosimulation Model Search Engine (BMSE). F1000Res 2023;12:162. [PMID: 37842339 PMCID: PMC10570691 DOI: 10.12688/f1000research.128982.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/25/2023] [Indexed: 10/17/2023] Open

Bakheet S, Al-Hamadi A, Soliman E, Heshmat M. Hybrid Bag-of-Visual-Words and FeatureWiz Selection for Content-Based Visual Information Retrieval. Sensors (Basel) 2023;23:1653. [PMID: 36772705 PMCID: PMC9919877 DOI: 10.3390/s23031653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/29/2022] [Revised: 01/30/2023] [Accepted: 01/30/2023] [Indexed: 06/18/2023]

Mavragani A, Sandsdalen V, Manskow US, Småbrekke L, Waaseth M. Internet Use for Obtaining Medicine Information: Cross-sectional Survey. JMIR Form Res 2023;7:e40466. [PMID: 36729577 PMCID: PMC9936360 DOI: 10.2196/40466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/22/2022] [Revised: 12/23/2022] [Accepted: 12/27/2022] [Indexed: 02/03/2023] Open

Abstract

BACKGROUND

The internet is increasingly being used as a source of medicine-related information. People want information to facilitate decision-making and self-management, and they tend to prefer the internet for ease of access. However, it is widely acknowledged that the quality of web-based information varies. Poor interpretation of medicine information can lead to anxiety and poor adherence to drug therapy. It is therefore important to understand how people search, select, and trust medicine information.

OBJECTIVE

The objectives of this study were to establish the extent of internet use for seeking medicine information among Norwegian pharmacy customers, analyze factors associated with internet use, and investigate the level of trust in different sources and websites.

METHODS

This is a cross-sectional study with a convenience sample of pharmacy customers recruited from all but one community pharmacy in Tromsø, a medium-sized municipality in Norway (77,000 inhabitants). Persons (aged ≥16 years) able to complete a questionnaire in Norwegian were asked to participate in the study. Recruitment took place in September and October 2020. Due to COVID-19 restrictions, social media was also used to recruit medicine users.

RESULTS

A total of 303 respondents reported which sources they used to obtain information about their medicines (both prescription and over the counter) and to what extent they trusted these sources. A total of 125 (41.3%) respondents used the internet for medicine information, and the only factor associated with internet use was age. The odds of using the internet declined by 5% per year of age (odds ratio 0.95, 95% CI 0.94-0.97; P=.048). We found no association between internet use and gender, level of education, or regular medicine use. The main purpose reported for using the internet was to obtain information about side effects. Other main sources of medicine information were physicians (n=191, 63%), pharmacy personnel (n=142, 47%), and medication package leaflets (n=124, 42%), while 36 (12%) respondents did not obtain medicine information from any sources. Note that 272 (91%) respondents trusted health professionals as a source of medicine information, whereas 58 (46%) respondents who used the internet trusted the information they found on the internet. The most reliable websites were the national health portals and other official health information sites.

CONCLUSIONS

Norwegian pharmacy customers use the internet as a source of medicine information, but most still obtain medicine information from health professionals and package leaflets. People are aware of the potential for misinformation on websites, and they mainly trust high-quality sites run by health authorities.
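
The reported odds ratio of 0.95 per year of age compounds multiplicatively, which is easy to misread as a 5% drop in probability. The short sketch below works through the arithmetic. The 10-year gap and the baseline probability of 0.5 are hypothetical values chosen for illustration and do not come from the study.

```python
def odds_multiplier(or_per_year, years):
    """Factor by which the odds change across an age gap of `years`."""
    return or_per_year ** years


def probability_from_odds(odds):
    """Convert odds to a probability."""
    return odds / (1 + odds)


# Odds ratio reported in the abstract; the 10-year gap is illustrative.
ten_year = odds_multiplier(0.95, 10)   # about 0.60

# Hypothetical baseline: a person with even odds (probability 0.5).
baseline_odds = 1.0
older_probability = probability_from_odds(baseline_odds * ten_year)  # about 0.37
```

Under these assumptions, a person 10 years older has roughly 60% of the odds of using the internet for medicine information, which from a 50% baseline corresponds to a probability of about 37%.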

Alderden JG, Sharkey PD, Kennerly SM, Ghosh S, Barrett RS, Horn SD, Ghosh S, Yap TL. Developing a Relational Database for Best Practice Data Management: The Turn Everyone and Move for Ulcer Prevention Database. Comput Inform Nurs 2023;41:59-65. [PMID: 36735569 PMCID: PMC10153087 DOI: 10.1097/cin.0000000000001011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/04/2023]

Weinzierl MA, Harabagiu SM. Epidemic Question Answering: question generation and entailment for Answer Nugget discovery. J Am Med Inform Assoc 2023;30:329-339. [PMID: 36394232 PMCID: PMC9846678 DOI: 10.1093/jamia/ocac222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/20/2022] [Revised: 10/31/2022] [Accepted: 11/03/2022] [Indexed: 11/18/2022] Open

Lee G, Jo W, Choi Y. VERD: Emergence of Product-Based Video E-Commerce Retrieval Dataset from User's Perspective. Sensors (Basel) 2023;23:513. [PMID: 36617111 PMCID: PMC9824814 DOI: 10.3390/s23010513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/25/2022] [Revised: 12/28/2022] [Accepted: 12/30/2022] [Indexed: 06/17/2023]

O'Keefe H, Rankin J, Wallace SA, Beyer F. Investigation of text-mining methodologies to aid the construction of search strategies in systematic reviews of diagnostic test accuracy-a case study. Res Synth Methods 2023;14:79-98. [PMID: 35841125 PMCID: PMC10088010 DOI: 10.1002/jrsm.1593] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 01/25/2022] [Revised: 07/01/2022] [Accepted: 07/08/2022] [Indexed: 01/18/2023]

Rodrigues J, Liu H, Folgado D, Belo D, Schultz T, Gamboa H. Feature-Based Information Retrieval of Multimodal Biosignals with a Self-Similarity Matrix: Focus on Automatic Segmentation. Biosensors (Basel) 2022;12:1182. [PMID: 36551149 PMCID: PMC9776348 DOI: 10.3390/bios12121182] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 10/01/2022] [Revised: 12/14/2022] [Accepted: 12/15/2022] [Indexed: 05/27/2023]

Focsa M, Tan C, Chen M, Yan M, Zhang N, Huang S, Liu X. State-of-the-Art Evidence Retriever for Precision Medicine: Algorithm Development and Validation. JMIR Med Inform 2022;10:e40743. [PMID: 36409468 PMCID: PMC9801267 DOI: 10.2196/40743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2022] [Revised: 11/13/2022] [Accepted: 11/16/2022] [Indexed: 11/17/2022] Open

Abstract

BACKGROUND

Under the paradigm of precision medicine (PM), patients with the same disease can receive different personalized therapies according to their clinical and genetic features. These therapies are determined by the totality of all available clinical evidence, including results from case reports, clinical trials, and systematic reviews. However, it is increasingly difficult for physicians to find such evidence in the scientific literature, which is growing at an unprecedented pace.

OBJECTIVE

In this work, we propose the PM-Search system to facilitate the retrieval of clinical literature that contains critical evidence for or against giving specific therapies to certain cancer patients.

METHODS

The PM-Search system combines a baseline retriever that selects document candidates at a large scale and an evidence reranker that finely reorders the candidates based on their evidence quality. The baseline retriever uses query expansion and keyword matching with the ElasticSearch retrieval engine, and the evidence reranker fits pretrained language models to expert annotations that are derived from an active learning strategy.

RESULTS

The PM-Search system achieved the best performance in the retrieval of high-quality clinical evidence at the Text Retrieval Conference PM Track 2020, outperforming the second-ranking systems by large margins (0.4780 vs 0.4238 for standard normalized discounted cumulative gain at rank 30 and 0.4519 vs 0.4193 for exponential normalized discounted cumulative gain at rank 30).

CONCLUSIONS

We present PM-Search, a state-of-the-art search engine to support the practice of evidence-based PM. PM-Search uses a novel active learning strategy, based on Bidirectional Encoder Representations from Transformers for Biomedical Text Mining, that models evidence quality and improves model performance. Our analyses show that evidence quality is an aspect distinct from general relevance, and that a PM search engine requires specific modeling of evidence quality beyond general relevance.
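
The results above are reported as standard and exponential normalized discounted cumulative gain (nDCG) at rank 30. As a hedged sketch of what these metrics measure, the Python below uses the common textbook definitions: linear gain equal to the graded judgment for the standard variant and gain of 2^rel - 1 for the exponential variant. The exact formulation used by the TREC PM Track 2020 evaluation may differ, and the relevance grades here are invented.

```python
import math


def dcg(relevances, k, exponential=False):
    """Discounted cumulative gain over the top-k graded judgments."""
    total = 0.0
    for i, rel in enumerate(relevances[:k], start=1):
        gain = (2 ** rel - 1) if exponential else rel
        total += gain / math.log2(i + 1)
    return total


def ndcg(relevances, k, exponential=False):
    """DCG divided by the DCG of the ideal ordering.

    Simplification: the ideal ordering is built from the judgments in
    the ranked list itself, not from all judged documents for the query.
    """
    ideal = sorted(relevances, reverse=True)
    best = dcg(ideal, k, exponential)
    return dcg(relevances, k, exponential) / best if best else 0.0


# Invented graded judgments for a ranked list (2 = best evidence).
worst_first = [0, 1, 2]
standard = ndcg(worst_first, k=3)
exponential = ndcg(worst_first, k=3, exponential=True)
```

The exponential variant penalizes this ranking more heavily than the standard one because it places more weight on where the highest-graded documents appear.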

Williams-Lekuona M, Cosma G, Phillips I. A Framework for Enabling Unpaired Multi-Modal Learning for Deep Cross-Modal Hashing Retrieval. J Imaging 2022;8:328. [PMID: 36547493 PMCID: PMC9785405 DOI: 10.3390/jimaging8120328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/01/2022] [Revised: 11/30/2022] [Accepted: 12/06/2022] [Indexed: 12/23/2022] Open

Hu YJ, Fedyukova A, Wang J, Said JM, Thomas N, Noble E, Cheong JLY, Karanatsios B, Goldfeld S, Wake M. Improving Cohort-Hospital Matching Accuracy through Standardization and Validation of Participant Identifiable Information. Children (Basel) 2022;9:1916. [PMID: 36553359 DOI: 10.3390/children9121916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/15/2022] [Revised: 11/25/2022] [Accepted: 12/03/2022] [Indexed: 12/12/2022]

Levay P, Heath A, Tuvey D. Efficient searching for NICE public health guidelines: Would using fewer sources still find the evidence? Res Synth Methods 2022;13:760-789. [PMID: 35657294 PMCID: PMC9795891 DOI: 10.1002/jrsm.1577] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 10/25/2021] [Revised: 05/13/2022] [Accepted: 05/31/2022] [Indexed: 12/30/2022]

Urru S, Sciannameo V, Lanera C, Salaris S, Gregori D, Berchialla P. A topic trend analysis on COVID-19 literature. Digit Health 2022;8:20552076221133696. [PMID: 36325437 PMCID: PMC9619924 DOI: 10.1177/20552076221133696] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/20/2022] [Accepted: 09/30/2022] [Indexed: 11/06/2022] Open

Abstract

Objective

In the past 2 years, the number of scientific publications has grown exponentially. The COVID-19 outbreak hugely contributed to this dramatic increase in the volume of published research. Currently, text mining of the volume of SARS-CoV-2 and COVID-19 publications is limited to the first months of the outbreak. We aim to identify the major topics in COVID-19 literature collected from several citational sources and analyze the temporal trend from November 2019 to December 2021.

Methods

We performed an extensive literature search for SARS-CoV-2 and COVID-19 publications on PubMed, Scopus, and Web of Science (WoS) and applied structural topic modelling to the retrieved abstracts. The temporal trend of the recognized topics was analyzed. Furthermore, a comparison between our corpus and the COVID-19 Open Research Dataset (CORD-19) repository was performed.

Results

We collected 269,186 publications and identified 10 topics. The most popular topic was related to the clinical pictures of the COVID-19 outbreak, which has a constant trend, and the least popular includes studies on COVID-19 literature and databases. "Telemedicine", "Vaccine development", and "Epidemiology" were popular topics in the early phase of the pandemic; increasing topics in the last period are "COVID-19 impact on mental health", "Forecasting", and "Molecular Biology". "Education" was the second most popular topic, which emerged in September 2020.

Conclusions

We identified 10 topics for classifying COVID-19 research publications and estimated a nonlinear temporal trend that gives an overview of their unfolding over time. Several citational databases must be searched to retrieve a complete set of studies despite the efforts to build repositories for COVID-19 literature. Our collected data can help build a more focused literature search between November 2019 and December 2021 when carrying out systematic and rapid reviews and our findings can give a complete picture on the topic.

Ebeid IA. MedGraph: A semantic biomedical information retrieval framework using knowledge graph embedding for PubMed. Front Big Data 2022;5:965619. [PMID: 36338335 PMCID: PMC9627348 DOI: 10.3389/fdata.2022.965619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2022] [Accepted: 09/20/2022] [Indexed: 01/24/2023] Open

Zhang C, Zhou Q, Qiao M, Tang K, Xu L, Liu F. Re_Trans: Combined Retrieval and Transformer Model for Source Code Summarization. Entropy (Basel) 2022;24:1372. [PMID: 37420392 DOI: 10.3390/e24101372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/25/2022] [Revised: 09/20/2022] [Accepted: 09/23/2022] [Indexed: 07/09/2023]

Noor K, Roguski L, Bai X, Handy A, Klapaukh R, Folarin A, Romao L, Matteson J, Lea N, Zhu L, Asselbergs FW, Wong WK, Shah A, Dobson RJ. Deployment of a Free-Text Analytics Platform at a UK National Health Service Research Hospital: CogStack at University College London Hospitals. JMIR Med Inform 2022;10:e38122. [PMID: 36001371 PMCID: PMC9453582 DOI: 10.2196/38122] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 03/30/2022] [Revised: 06/05/2022] [Accepted: 07/01/2022] [Indexed: 11/19/2022] Open

Abstract

BACKGROUND

As more health care organizations transition to using electronic health record (EHR) systems, it is important for these organizations to maximize the secondary use of their data to support service improvement and clinical research. These organizations will find it challenging to have systems capable of harnessing the unstructured data fields in the record (clinical notes, letters, etc) and more practically have such systems interact with all of the hospital data systems (legacy and current).

OBJECTIVE

We describe the deployment of the EHR interfacing information extraction and retrieval platform CogStack at University College London Hospitals (UCLH).

METHODS

At UCLH, we have deployed the CogStack platform, an information retrieval platform with natural language processing capabilities. The platform addresses the problem of data ingestion and harmonization from multiple data sources using the Apache NiFi module for managing complex data flows. The platform also facilitates the extraction of structured data from free-text records through use of the MedCAT natural language processing library. Finally, data science tools are made available to support data scientists and the development of downstream applications dependent upon data ingested and analyzed by CogStack.

RESULTS

The platform has been deployed at the hospital, and in particular, it has facilitated a number of research and service evaluation projects. To date, we have processed over 30 million records, and the insights produced from CogStack have informed a number of clinical research use cases at the hospital.

CONCLUSIONS

The CogStack platform can be configured to handle the data ingestion and harmonization challenges faced by a hospital. More importantly, the platform enables the hospital to unlock important clinical information from the unstructured portion of the record using natural language processing technology.

Affiliation(s)

Kawsar Noor: University College London, London, United Kingdom; Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom; Health Data Research UK London, University College London, London, United Kingdom
Lukasz Roguski: University College London, London, United Kingdom; Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom
Xi Bai: University College London, London, United Kingdom; Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom
Alex Handy: University College London, London, United Kingdom; Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom; Health Data Research UK London, University College London, London, United Kingdom
Roman Klapaukh: Health Data Research UK London, University College London, London, United Kingdom
Amos Folarin: University College London, London, United Kingdom; Institute of Health Informatics, University College London, London, United Kingdom; Health Data Research UK London, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, South London and Maudsley National Health Service Foundation Trust, King's College London, London, United Kingdom; Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom
Luis Romao: Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom; Health Data Research UK London, University College London, London, United Kingdom
Joshua Matteson: Epic Systems Corporation, London, United Kingdom
Nathan Lea: Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom; Health Data Research UK London, University College London, London, United Kingdom
Leilei Zhu: National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom
Folkert W Asselbergs: Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom
Wai Keong Wong: National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom
Anoop Shah: University College London, London, United Kingdom; Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom; Health Data Research UK London, University College London, London, United Kingdom
Richard JB Dobson: University College London, London, United Kingdom; Institute of Health Informatics, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, University College London Hospitals National Health Service Foundation Trust, London, United Kingdom; Health Data Research UK London, University College London, London, United Kingdom; National Institute for Health and Care Research Biomedical Research Centre, South London and Maudsley National Health Service Foundation Trust, King's College London, London, United Kingdom; Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom


Wu L, Ali S, Ali H, Brock T, Xu J, Tong W. NeuroCORD: A Language Model to Facilitate COVID-19-Associated Neurological Disorder Studies. Int J Environ Res Public Health 2022;19:9974. [PMID: 36011614 PMCID: PMC9408703 DOI: 10.3390/ijerph19169974] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/14/2022] [Revised: 08/03/2022] [Accepted: 08/05/2022] [Indexed: 06/15/2023]

Golder S, Farrah K, Mierzwinski-Urban M, Barker B, Rama A. Updated generic search filters for finding studies of adverse drug effects in Ovid medline and Embase may retrieve up to 90% of relevant studies. Health Info Libr J 2022. [PMID: 35670564 DOI: 10.1111/hir.12441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/02/2021] [Revised: 03/23/2022] [Accepted: 05/19/2022] [Indexed: 11/30/2022]

Antoun J, Lapin J, Beck D. Information retrieval at the point of care of community family physicians in Arab countries. Health Info Libr J 2022;39:178-184. [PMID: 35396788 DOI: 10.1111/hir.12429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/17/2022] [Accepted: 03/18/2022] [Indexed: 11/30/2022]

Pohyer V, Baudoin D, Fournier L, Rance B. Extraction of Tumor Response Criteria in Semi-Structured Imaging Report. Stud Health Technol Inform 2022;294:149-150. [PMID: 35612044 DOI: 10.3233/shti220424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/15/2023]

Guan R, Pang H, Liang Y, Shao Z, Gao X, Xu D, Feng X. Discovering trends and hotspots of biosafety and biosecurity research via machine learning. Brief Bioinform 2022;23:6590367. [PMID: 35596953 PMCID: PMC9487701 DOI: 10.1093/bib/bbac194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2022] [Revised: 04/06/2022] [Accepted: 04/27/2022] [Indexed: 11/14/2022]

Affiliation(s)

Renchu Guan: Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, 130012, Jilin, China; Zhuhai Sub Laboratory, Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, Zhuhai College of Science and Technology, Zhuhai, 519041, Guangdong, China
Haoyu Pang: Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, 130012, Jilin, China
Yanchun Liang: Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, 130012, Jilin, China; Zhuhai Sub Laboratory, Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, Zhuhai College of Science and Technology, Zhuhai, 519041, Guangdong, China
Zhongjun Shao: Department of Epidemiology, Ministry of Education Key Laboratory of Hazard Assessment and Control in Special Operational Environment, School of Public Health, Air Force Medical University, Xi'an, 710032, Shaanxi, China
Xin Gao: Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia; Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia; BioMap, Beijing, 100192, China
Dong Xu: Department of Electric Engineering and Computer Science, and Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, 65201, Missouri, USA
Xiaoyue Feng: Key Laboratory of Symbolic Computation and Knowledge Engineering of the Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun, 130012, Jilin, China


Liu S, Bourgeois FT, Dunn AG. Identifying unreported links between ClinicalTrials.gov trial registrations and their published results. Res Synth Methods 2022;13:342-352. [PMID: 34970844 PMCID: PMC9090946 DOI: 10.1002/jrsm.1545] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 05/23/2021] [Revised: 12/13/2021] [Accepted: 12/17/2021] [Indexed: 11/10/2022]

Haddaway NR, Grainger MJ, Gray CT. citationchaser: a tool for transparent and efficient forward and backward citation chasing in systematic searching. Res Synth Methods 2022;13:533-545. [PMID: 35472127 DOI: 10.1002/jrsm.1563] [Citation(s) in RCA: 49] [Impact Index Per Article: 24.5] [Received: 11/01/2021] [Revised: 04/01/2022] [Accepted: 04/21/2022] [Indexed: 11/10/2022]