Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Roberts K, Alam T, Bedrick S, Demner-Fushman D, Lo K, Soboroff I, Voorhees E, Wang LL, Hersh WR. TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19. J Am Med Inform Assoc 2020;27:1431-1436. [PMID: 32365190 PMCID: PMC7239098 DOI: 10.1093/jamia/ocaa091] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 05/01/2020] [Indexed: 11/17/2022] Open

For:	Roberts K, Alam T, Bedrick S, Demner-Fushman D, Lo K, Soboroff I, Voorhees E, Wang LL, Hersh WR. TREC-COVID: rationale and structure of an information retrieval shared task for COVID-19. J Am Med Inform Assoc 2020;27:1431-1436. [PMID: 32365190 PMCID: PMC7239098 DOI: 10.1093/jamia/ocaa091] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 05/01/2020] [Indexed: 11/17/2022] Open

Number

Cited by Other Article(s)

Liu H, Soroush A, Nestor JG, Park E, Idnay B, Fang Y, Pan J, Liao S, Bernard M, Peng Y, Weng C. Retrieval augmented scientific claim verification. JAMIA Open 2024;7:ooae021. [PMID: 38455840 PMCID: PMC10919922 DOI: 10.1093/jamiaopen/ooae021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 01/19/2024] [Accepted: 02/14/2024] [Indexed: 03/09/2024] Open

Newton AJH, Chartash D, Kleinstein SH, McDougal RA. A pipeline for the retrieval and extraction of domain-specific information with application to COVID-19 immune signatures. BMC Bioinformatics 2023;24:292. [PMID: 37474900 PMCID: PMC10357743 DOI: 10.1186/s12859-023-05397-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 06/23/2023] [Indexed: 07/22/2023] Open

Abstract

BACKGROUND

The accelerating pace of biomedical publication has made it impractical to manually, systematically identify papers containing specific information and extract this information. This is especially challenging when the information itself resides beyond titles or abstracts. For emerging science, with a limited set of known papers of interest and an incomplete information model, this is of pressing concern. A timely example in retrospect is the identification of immune signatures (coherent sets of biomarkers) driving differential SARS-CoV-2 infection outcomes.

IMPLEMENTATION

We built a classifier to identify papers containing domain-specific information from the document embeddings of the title and abstract. To train this classifier with limited data, we developed an iterative process leveraging pre-trained SPECTER document embeddings, SVM classifiers and web-enabled expert review to iteratively augment the training set. This training set was then used to create a classifier to identify papers containing domain-specific information. Finally, information was extracted from these papers through a semi-automated system that directly solicited the paper authors to respond via a web-based form.

RESULTS

We demonstrate a classifier that retrieves papers with human COVID-19 immune signatures with a positive predictive value of 86%. The type of immune signature (e.g., gene expression vs. other types of profiling) was also identified with a positive predictive value of 74%. Semi-automated queries to the corresponding authors of these publications requesting signature information achieved a 31% response rate.

CONCLUSIONS

Our results demonstrate the efficacy of using a SVM classifier with document embeddings of the title and abstract, to retrieve papers with domain-specific information, even when that information is rarely present in the abstract. Targeted author engagement based on classifier predictions offers a promising pathway to build a semi-structured representation of such information. Through this approach, partially automated literature mining can help rapidly create semi-structured knowledge repositories for automatic analysis of emerging health threats.

Collapse

Khader A, Ensan F. Learning to rank query expansion terms for COVID-19 scholarly search. J Biomed Inform 2023;142:104386. [PMID: 37178780 PMCID: PMC10174726 DOI: 10.1016/j.jbi.2023.104386] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 04/19/2023] [Accepted: 05/05/2023] [Indexed: 05/15/2023]

Abstract

OBJECTIVE

With the onset of the Coronavirus Disease 2019 (COVID-19) pandemic, there has been a surge in the number of publicly available biomedical information sources, which makes it an increasingly challenging research goal to retrieve a relevant text to a topic of interest. In this paper, we propose a Contextual Query Expansion framework based on the clinical Domain knowledge (CQED) for formalizing an effective search over PubMed to retrieve relevant COVID-19 scholarly articles to a given information need.

MATERIALS AND METHODS

For the sake of training and evaluation, we use the widely adopted TREC-COVID benchmark. Given a query, the proposed framework utilizes a contextual and a domain-specific neural language model to generate a set of candidate query expansion terms that enrich the original query. Moreover, the framework includes a multi-head attention mechanism that is trained alongside a learning-to-rank model for re-ranking the list of generated expansion candidate terms. The original query and the top-ranked expansion terms are posed to the PubMed search engine for retrieving relevant scholarly articles to an information need. The framework, CQED, can have four different variations, depending upon the learning path adopted for training and re-ranking the candidate expansion terms.

RESULTS

The model drastically improves the search performance, when compared to the original query. The performance improvement in comparison to the original query, in terms of terms of RECALL@1000 is 190.85% and in terms of NDCG@1000 is 343.55%. Additionally, the model outperforms all existing state-of-the-art baselines. In terms of P@10, the model that has been optimized based on Precision outperforms all baselines (0.7987). On the other hand, in terms of NDCG@10 (0.7986), MAP (0.3450) and bpref (0.4900), the CQED model that has been optimized based on an average of all retrieval measures outperforms all the baselines.

CONCLUSION

The proposed model successfully expands queries posed to PubMed, and improves search performance, as compared to all existing baselines. A success/failure analysis shows that the model improved the search performance of each of the evaluated queries. Moreover, an ablation study depicted that if ranking of generated candidate terms is not conducted, the overall performance decreases. For future work, we would like to explore the application of the presented query expansion framework in conducting technology-assisted Systematic Literature Reviews (SLR).

Collapse

Goto A, Rodriguez-Esteban R, Scharf SH, Morris GM. Understanding the genetics of viral drug resistance by integrating clinical data and mining of the scientific literature. Sci Rep 2022;12:14476. [PMID: 36008431 PMCID: PMC9403226 DOI: 10.1038/s41598-022-17746-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 07/30/2022] [Indexed: 11/16/2022] Open

GFCNet: Utilizing graph feature collection networks for coronavirus knowledge graph embeddings. Inf Sci (N Y) 2022;608:1557-1571. [PMID: 35855405 PMCID: PMC9279179 DOI: 10.1016/j.ins.2022.07.031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 04/04/2022] [Accepted: 07/03/2022] [Indexed: 01/25/2023]

Gu J, Xiang R, Wang X, Li J, Li W, Qian L, Zhou G, Huang CR. Multi-probe attention neural network for COVID-19 semantic indexing. BMC Bioinformatics 2022;23:259. [PMID: 35768777 PMCID: PMC9241329 DOI: 10.1186/s12859-022-04803-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 06/15/2022] [Indexed: 11/25/2022] Open

Cafarella M, Anderson M, Beltagy I, Cattan A, Chasins S, Dagan I, Downey D, Etzioni O, Feldman S, Gao T, Hope T, Huang K, Johnson S, King D, Lo K, Lou Y, Shapiro M, Shen D, Subramanian S, Wang LL, Wang Y, Wang Y, Weld DS, Vo‐Phamhi J, Zeng A, Zou J. Infrastructure for rapid open knowledge network development. AI MAG 2022. [DOI: 10.1002/aaai.12038] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Otegi A, San Vicente I, Saralegi X, Peñas A, Lozano B, Agirre E. Information retrieval and question answering: A case study on COVID-19 scientific literature. Knowl Based Syst 2022;240:108072. [PMID: 35002094 PMCID: PMC8719365 DOI: 10.1016/j.knosys.2021.108072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 12/21/2021] [Accepted: 12/24/2021] [Indexed: 11/04/2022]

Nguyen V, Rybinski M, Karimi S, Xing Z. Search like an expert: Reducing expertise disparity using a hybrid neural index for COVID-19 queries. J Biomed Inform 2022;127:104005. [PMID: 35144000 PMCID: PMC9759932 DOI: 10.1016/j.jbi.2022.104005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 01/19/2022] [Accepted: 01/24/2022] [Indexed: 11/17/2022]

Napolitano F, Xu X, Gao X. Impact of computational approaches in the fight against COVID-19: an AI guided review of 17 000 studies. Brief Bioinform 2022;23:bbab456. [PMID: 34788381 PMCID: PMC8689952 DOI: 10.1093/bib/bbab456] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 09/08/2021] [Accepted: 10/07/2021] [Indexed: 12/15/2022] Open

Analyzing COVID-19 Medical Papers Using Artificial Intelligence: Insights for Researchers and Medical Professionals. BIG DATA AND COGNITIVE COMPUTING 2022. [DOI: 10.3390/bdcc6010004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

Zerva C, Taylor S, Soto AJ, Nguyen NTH, Ananiadou S. A term-based and citation network-based search system for COVID-19. JAMIA Open 2021;4:ooab104. [PMID: 34927002 PMCID: PMC8672931 DOI: 10.1093/jamiaopen/ooab104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 11/15/2021] [Accepted: 11/24/2021] [Indexed: 11/14/2022] Open

Abstract

The COVID-19 pandemic resulted in an unprecedented production of scientific literature spanning several fields. To facilitate navigation of the scientific literature related to various aspects of the pandemic, we developed an exploratory search system. The system is based on automatically identified technical terms, document citations, and their visualization, accelerating identification of relevant documents. It offers a multi-view interactive search and navigation interface, bringing together unsupervised approaches of term extraction and citation analysis. We conducted a user evaluation with domain experts, including epidemiologists, biochemists, medicinal chemists, and medicine students. In general, most users were satisfied with the relevance and speed of the search results. More interestingly, participants mostly agreed on the capacity of the system to enable exploration and discovery of the search space using the graph visualization and filters. The system is updated on a weekly basis and it is publicly available at http://www.nactem.ac.uk/cord/.

In this article, we present a search system and exploratory tool built on the documents of the COVID-19 Open Research Dataset, which is a large and open collection of scholarly articles related to COVID-19 (Coronavirus disease 2019), SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus-2), and related coronaviruses. The search system aims to facilitate navigation of the scientific literature related to various aspects of the pandemic. Specifically, we identify 3 types of core information per paper to be used as navigation facets including technical terminologies, citation/reference links from 1 paper to others, and bibliometric data. Unlike other exploratory-based search engines, our system allows users to combine information from text mining and bibliometrics analysis to explore the data in a more versatile manner tailored to their needs. The system is automatically updated on a weekly basis to ensure timely and updated access to recent information. We also conducted a user evaluation that included epidemiologists, biochemists, medicinal chemists, and medicine students. In general, most users were satisfied with the relevance and speed of the search results. More interestingly, participants mostly agreed on the capacity of the system to enable exploration and discovery of the search space using the graph visualization and filters.

Collapse

Xia Y, Cai J, Li Y, Dou Z, Zhang Y, Wu L, Huang Z, Xu S, Sun J, Liu Y, Wu D, Han D. A precision‐preferred comprehensive information extraction system for clinical articles in traditional Chinese Medicine. INT J INTELL SYST 2021. [DOI: 10.1002/int.22748] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Roitero K, Soprano M, Portelli B, De Luise M, Spina D, Mea VD, Serra G, Mizzaro S, Demartini G. Can the crowd judge truthfulness? A longitudinal study on recent misinformation about COVID-19. PERSONAL AND UBIQUITOUS COMPUTING 2021;27:59-89. [PMID: 34545278 PMCID: PMC8444165 DOI: 10.1007/s00779-021-01604-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 07/12/2021] [Indexed: 06/13/2023]

Singh I, Scarton C, Bontcheva K. Multistage BiCross encoder for multilingual access to COVID-19 health information. PLoS One 2021;16:e0256874. [PMID: 34492073 PMCID: PMC8423231 DOI: 10.1371/journal.pone.0256874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Accepted: 08/17/2021] [Indexed: 11/18/2022] Open

Hassoun S, Jefferson F, Shi X, Stucky B, Wang J, Rosa E. Artificial Intelligence for Biology. Integr Comp Biol 2021;61:2267-2275. [PMID: 34448841 DOI: 10.1093/icb/icab188] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Revised: 07/14/2021] [Accepted: 08/23/2021] [Indexed: 01/18/2023] Open

Firouzi F, Farahani B, Daneshmand M, Grise K, Song J, Saracco R, Wang LL, Lo K, Angelov P, Soares E, Loh PS, Talebpour Z, Moradi R, Goodarzi M, Ashraf H, Talebpour M, Talebpour A, Romeo L, Das R, Heidari H, Pasquale D, Moody J, Woods C, Huang ES, Barnaghi P, Sarrafzadeh M, Li R, Beck KL, Isayev O, Sung N, Luo A. Harnessing the Power of Smart and Connected Health to Tackle COVID-19: IoT, AI, Robotics, and Blockchain for a Better World. IEEE INTERNET OF THINGS JOURNAL 2021;8:12826-12846. [PMID: 35782886 PMCID: PMC8769005 DOI: 10.1109/jiot.2021.3073904] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Revised: 03/09/2021] [Accepted: 04/02/2021] [Indexed: 05/07/2023]

Abstract

As COVID-19 hounds the world, the common cause of finding a swift solution to manage the pandemic has brought together researchers, institutions, governments, and society at large. The Internet of Things (IoT), artificial intelligence (AI)-including machine learning (ML) and Big Data analytics-as well as Robotics and Blockchain, are the four decisive areas of technological innovation that have been ingenuity harnessed to fight this pandemic and future ones. While these highly interrelated smart and connected health technologies cannot resolve the pandemic overnight and may not be the only answer to the crisis, they can provide greater insight into the disease and support frontline efforts to prevent and control the pandemic. This article provides a blend of discussions on the contribution of these digital technologies, propose several complementary and multidisciplinary techniques to combat COVID-19, offer opportunities for more holistic studies, and accelerate knowledge acquisition and scientific discoveries in pandemic research. First, four areas, where IoT can contribute are discussed, namely: 1) tracking and tracing; 2) remote patient monitoring (RPM) by wearable IoT (WIoT); 3) personal digital twins (PDTs); and 4) real-life use case: ICT/IoT solution in South Korea. Second, the role and novel applications of AI are explained, namely: 1) diagnosis and prognosis; 2) risk prediction; 3) vaccine and drug development; 4) research data set; 5) early warnings and alerts; 6) social control and fake news detection; and 7) communication and chatbot. Third, the main uses of robotics and drone technology are analyzed, including: 1) crowd surveillance; 2) public announcements; 3) screening and diagnosis; and 4) essential supply delivery. Finally, we discuss how distributed ledger technologies (DLTs), of which blockchain is a common example, can be combined with other technologies for tackling COVID-19.

Collapse

Affiliation(s)

Farshad Firouzi Electrical and Computer Engineering DepartmentDuke University Durham NC 27708 USA
Bahar Farahani Cyberspace Research Institute, Shahid Beheshti University Tehran 1983969411 Iran
Mahmoud Daneshmand Business Intelligence and AnalyticsStevens Institute of Technology Hoboken NJ 07030 USA
Kathy Grise IEEE Future Directions Piscataway NJ 08854 USA
Jaeseung Song Department of Computer and Information SecuritySejong University Seoul 15600 South Korea
Roberto Saracco IEEE Future Directions Piscataway NJ 08854 USA
Lucy Lu Wang Allen Institute for Artificial Intelligence Seattle WA 98112 USA
Kyle Lo Allen Institute for Artificial Intelligence Seattle WA 98112 USA
Plamen Angelov School of Computing and CommunicationsLancaster University Lancashire LA1 4YW U.K
Eduardo Soares School of Computing and CommunicationsLancaster University Lancashire LA1 4YW U.K
Po-Shen Loh Department of Mathematical SciencesCarnegie Mellon University Pittsburgh PA 15213 USA
Zeynab Talebpour Cyberspace Research Institute, Shahid Beheshti University Tehran 1983969411 Iran
Reza Moradi Cyberspace Research Institute, Shahid Beheshti University Tehran 1983969411 Iran
Mohsen Goodarzi Cyberspace Research Institute, Shahid Beheshti University Tehran 1983969411 Iran
Haleh Ashraf Sina Hospital Tehran Iran
Mohammad Talebpour Sina Hospital Tehran Iran
Alireza Talebpour Cyberspace Research Institute, Shahid Beheshti University Tehran 1983969411 Iran
Luca Romeo Department of Information EngineeringUniversit Politecnica delle Marche 60121 Ancona Italy
Rupam Das James Watt School of EngineeringUniversity of Glasgow Glasgow G12 8QQ U.K
Hadi Heidari James Watt School of EngineeringUniversity of Glasgow Glasgow G12 8QQ U.K
Dana Pasquale School of Medicine and Duke HealthDuke University Durham NC 27708 USA
James Moody School of Medicine and Duke HealthDuke University Durham NC 27708 USA
Chris Woods School of Medicine and Duke HealthDuke University Durham NC 27708 USA
Erich S Huang School of Medicine and Duke HealthDuke University Durham NC 27708 USA
Payam Barnaghi Department of Brain SciencesImperial College London London SW7 2AZ U.K U.K. Dementia Research Institute London U.K
Majid Sarrafzadeh Computer Science Department & Electrical and Computer Engineering DepartmentUniversity of California at Los Angeles Los Angeles CA 90095 USA
Ron Li Department of MedicineStanford University School of Medicine Stanford CA 94305 USA
Kristen L Beck Almaden Research CenterIBM San Jose CA 95120 USA
Olexandr Isayev Department of ChemistryCarnegie Mellon University Pittsburgh PA 15213 USA
Nakmyoung Sung Korea Electronics Technology Institute Seongnam 13509 South Korea
Alan Luo Computer Science DepartmentStanford University Stanford CA 94305 USA

Collapse

Teodoro D, Ferdowsi S, Borissov N, Kashani E, Vicente Alvarez D, Copara J, Gouareb R, Naderi N, Amini P. Information retrieval in an infodemic: the case of COVID-19 publications. J Med Internet Res 2021;23:e30161. [PMID: 34375298 PMCID: PMC8451964 DOI: 10.2196/30161] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 07/22/2021] [Accepted: 08/05/2021] [Indexed: 12/31/2022] Open

Chen Q, Leaman R, Allot A, Luo L, Wei CH, Yan S, Lu Z. Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing. Annu Rev Biomed Data Sci 2021;4:313-339. [PMID: 34465169 DOI: 10.1146/annurev-biodatasci-021821-061045] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Roberts K, Alam T, Bedrick S, Demner-Fushman D, Lo K, Soboroff I, Voorhees E, Wang LL, Hersh WR. Searching for scientific evidence in a pandemic: An overview of TREC-COVID. J Biomed Inform 2021;121:103865. [PMID: 34245913 PMCID: PMC8264272 DOI: 10.1016/j.jbi.2021.103865] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 06/30/2021] [Accepted: 07/05/2021] [Indexed: 12/15/2022]

Developing a sampling method and preliminary taxonomy for classifying COVID-19 public health guidance for healthcare organizations and the general public. J Biomed Inform 2021;120:103852. [PMID: 34192573 PMCID: PMC8236411 DOI: 10.1016/j.jbi.2021.103852] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 05/09/2021] [Accepted: 06/24/2021] [Indexed: 02/06/2023]

Abstract

BACKGROUND

Development and dissemination of public health (PH) guidance to healthcare organizations and the general public (e.g., businesses, schools, individuals) during emergencies like the COVID-19 pandemic is vital for policy, clinical, and public decision-making. Yet, the rapidly evolving nature of these events poses significant challenges for guidance development and dissemination strategies predicated on well-understood concepts and clearly defined access and distribution pathways. Taxonomies are an important but underutilized tool for guidance authoring, dissemination and updating in such dynamic scenarios.

OBJECTIVE

To design a rapid, semi-automated method for sampling and developing a PH guidance taxonomy using widely available Web crawling tools and streamlined manual content analysis.

METHODS

Iterative samples of guidance documents were taken from four state PH agency websites, the US Center for Disease Control and Prevention, and the World Health Organization. Documents were used to derive and refine a preliminary taxonomy of COVID-19 PH guidance via content analysis.

RESULTS

Eight iterations of guidance document sampling and taxonomy revisions were performed, with a final corpus of 226 documents. The preliminary taxonomy contains 110 branches distributed between three major domains: stakeholders (24 branches), settings (25 branches) and topics (61 branches). Thematic saturation measures indicated rapid saturation (≤5% change) for the domains of "stakeholders" and "settings", and "topic"-related branches for clinical decision-making. Branches related to business reopening and economic consequences remained dynamic throughout sampling iterations.

CONCLUSION

The PH guidance taxonomy can support public health agencies by aligning guidance development with curation and indexing strategies; supporting targeted dissemination; increasing the speed of updates; and enhancing public-facing guidance repositories and information retrieval tools. Taxonomies are essential to support knowledge management activities during rapidly evolving scenarios such as disease outbreaks and natural disasters.

Collapse

Analyzing the vast coronavirus literature with CoronaCentral. Proc Natl Acad Sci U S A 2021;118:2100766118. [PMID: 34016708 PMCID: PMC8202008 DOI: 10.1073/pnas.2100766118] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open

COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization. NPJ Digit Med 2021;4:68. [PMID: 33846532 PMCID: PMC8041998 DOI: 10.1038/s41746-021-00437-0] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 03/08/2021] [Indexed: 11/09/2022] Open

Chen JS, Hersh WR. A comparative analysis of system features used in the TREC-COVID information retrieval challenge. J Biomed Inform 2021;117:103745. [PMID: 33831536 PMCID: PMC8021447 DOI: 10.1016/j.jbi.2021.103745] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 12/02/2020] [Accepted: 03/05/2021] [Indexed: 11/18/2022]

Bakken S. Informatics impact requires effective, scalable tools and standards-based infrastructure. J Am Med Inform Assoc 2021;27:1341-1342. [PMID: 32989458 DOI: 10.1093/jamia/ocaa187] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 07/23/2020] [Indexed: 11/13/2022] Open

Wang LL, Lo K. Text mining approaches for dealing with the rapidly expanding literature on COVID-19. Brief Bioinform 2021;22:781-799. [PMID: 33279995 PMCID: PMC7799291 DOI: 10.1093/bib/bbaa296] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2020] [Revised: 10/02/2020] [Accepted: 10/07/2020] [Indexed: 12/13/2022] Open

Soni S, Roberts K. An evaluation of two commercial deep learning-based information retrieval systems for COVID-19 literature. J Am Med Inform Assoc 2021;28:132-137. [PMID: 33197268 PMCID: PMC7717324 DOI: 10.1093/jamia/ocaa271] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Indexed: 11/17/2022] Open

Tworowski D, Gorohovski A, Mukherjee S, Carmi G, Levy E, Detroja R, Mukherjee SB, Frenkel-Morgenstern M. COVID19 Drug Repository: text-mining the literature in search of putative COVID19 therapeutics. Nucleic Acids Res 2021;49:D1113-D1121. [PMID: 33166390 PMCID: PMC7778969 DOI: 10.1093/nar/gkaa969] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 10/07/2020] [Accepted: 11/04/2020] [Indexed: 12/12/2022] Open

Lever J, Altman RB. Analyzing the vast coronavirus literature with CoronaCentral. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020. [PMID: 33398279 PMCID: PMC7781314 DOI: 10.1101/2020.12.21.423860] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Rybinski M, Karimi S, Nguyen V, Paris C. A2A: a platform for research in biomedical literature search. BMC Bioinformatics 2020;21:572. [PMID: 33349237 PMCID: PMC7751125 DOI: 10.1186/s12859-020-03894-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 11/18/2020] [Indexed: 11/10/2022] Open

López Carreño R, Martínez Méndez FJ. Sistemas de recuperación de información implementados a partir de CORD-19: herramientas clave en la gestión de la información sobre COVID-19. REVISTA ESPANOLA DE DOCUMENTACION CIENTIFICA 2020. [DOI: 10.3989/redc.2020.4.1794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open

Cabanac G, Frommholz I, Mayr P. Scholarly literature mining with information retrieval and natural language processing: Preface. Scientometrics 2020;125:2835-2840. [PMID: 33223580 PMCID: PMC7670972 DOI: 10.1007/s11192-020-03763-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Indexed: 11/25/2022]

Wang LL, Lo K, Chandrasekhar Y, Reas R, Yang J, Burdick D, Eide D, Funk K, Katsis Y, Kinney R, Li Y, Liu Z, Merrill W, Mooney P, Murdick D, Rishi D, Sheehan J, Shen Z, Stilson B, Wade AD, Wang K, Wang NXR, Wilhelm C, Xie B, Raymond D, Weld DS, Etzioni O, Kohlmeier S. CORD-19: The Covid-19 Open Research Dataset. ARXIV 2020:arXiv:2004.10706v4. [PMID: 32510522 PMCID: PMC7251955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Figures] [Subscribe] [Scholar Register] [Revised: 07/10/2020] [Indexed: 06/11/2023]