Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

549
(from Reference Citation Analysis)

Article PDFs (198)

Cited by > 0 (399)

Searched Name

Text mining

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Wu J, Peng Y. Understanding unmet medical needs through medical crowdfunding in China. Public Health 2023;223:202-208. [PMID: 37672833 DOI: 10.1016/j.puhe.2023.07.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Revised: 07/08/2023] [Accepted: 07/21/2023] [Indexed: 09/08/2023]

Abstract

OBJECTIVES

Online medical crowdfunding has gained popularity in recent years in China. The objective of this study was to identify unmet medical needs in the public healthcare system through analysis of Chinese medical crowdfunding data.

STUDY DESIGN

Text information extraction and statistical analysis based on large-scale data.

METHODS

From 19 June 2011 to 15 March 2020, data from 30,704 medical crowdfunding projects were collected from Tencent GongYi, which is one of the largest Chinese medical crowdfunding platforms. Text mining methods were used to extract data on the medical conditions and locations of the applicants of medical crowdfunding. In addition, 125 medical crowdfunding projects initiated by leukaemia patients in Chongqing and Nanyang were further investigated through manual data extraction, and the factors impacting the fundraising goals were explored using a generalised linear model.

RESULTS

The most common conditions using medical crowdfunding to raise funds were as follows: cancer (31.87%), chronic conditions (18.14%), accidental injury (7.80%) and blood system-related conditions (7.75%). Treatments for cancer and blood system-related conditions are expensive and have serious long-term impacts on the lives of patients. Results showed that the cities of Nanyang and Chongqing had the largest number of crowdfunding projects.

CONCLUSIONS

This study found that the medical conditions that prompted individuals to apply for crowdfunding were those with long treatment cycles, complexities and expensive medical or non-medical costs. Furthermore, discrepancies in health insurance policies between different regions and residents seeking treatments outside their insurance locations were also important factors that triggered medical crowdfunding applications. Adjusting health insurance policies accordingly may improve the efficiency of utilising health insurance resources and reduce the financial burden on patients.

Collapse

Keerthigha C, Singh S, Chan KQ, Caltabiano N. Helicopter parenting through the lens of reddit: A text mining study. Heliyon 2023;9:e20970. [PMID: 37886774 PMCID: PMC10597765 DOI: 10.1016/j.heliyon.2023.e20970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 09/22/2023] [Accepted: 10/12/2023] [Indexed: 10/28/2023] Open

Kim M, Cho S. Monetary policy document analysis for prediction of monetary policy board decision. Heliyon 2023;9:e20696. [PMID: 37876460 PMCID: PMC10590846 DOI: 10.1016/j.heliyon.2023.e20696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 10/03/2023] [Accepted: 10/04/2023] [Indexed: 10/26/2023] Open

Kilicoglu H, Jiang L, Hoang L, Mayo-Wilson E, Vinkers CH, Otte WM. Methodology reporting improved over time in 176,469 randomized controlled trials. J Clin Epidemiol 2023;162:19-28. [PMID: 37562729 PMCID: PMC10829891 DOI: 10.1016/j.jclinepi.2023.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 07/25/2023] [Accepted: 08/02/2023] [Indexed: 08/12/2023]

Yang L, Wu S, Li G, Yuan Y. Explore public concerns about environmental protection on Sina Weibo: evidence from text mining. Environ Sci Pollut Res Int 2023;30:104067-104085. [PMID: 37700122 DOI: 10.1007/s11356-023-29757-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 09/03/2023] [Indexed: 09/14/2023]

Vuori MA, Kiiskinen T, Pitkänen N, Kurki S, Laivuori H, Laitinen T, Mäntylahti S, Palotie A, FinnGen, Niiranen TJ. Use of electronic health record data mining for heart failure subtyping. BMC Res Notes 2023;16:208. [PMID: 37697398 PMCID: PMC10496250 DOI: 10.1186/s13104-023-06469-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 08/22/2023] [Indexed: 09/13/2023] Open

Affiliation(s)

Matti A Vuori Division of Medicine, University of Turku, Kiinamyllynkatu 10, Turku, FI-20520, Finland. Turku University Hospital, Kiinamyllynkatu 4-8, Box 52, Turku, FI-20521, Finland. Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Tukholmankatu 8, Helsinki, Finland.
Tuomo Kiiskinen Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Tukholmankatu 8, Helsinki, Finland
Niina Pitkänen Auria Biobank, Kiinamyllynkatu 10, PO Box 30, Turku, FI-20520, Finland
Samu Kurki Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Tukholmankatu 8, Helsinki, Finland Auria Biobank, Kiinamyllynkatu 10, PO Box 30, Turku, FI-20520, Finland
Hannele Laivuori Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Tukholmankatu 8, Helsinki, Finland Centre for Child, Adolescent, and Maternal Health Research, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland Department of Obstetrics and Gynecology, Tampere University Hospital, Tampere, Finland
Tarja Laitinen Administration Center, Tampere University Hospital and University of Tampere, P.O. Box 2000, Tampere, 33521, Finland
Sampo Mäntylahti Helsinki Biobank, Haartmaninkatu 3, Helsinki, 00290, Finland
Aarno Palotie Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Tukholmankatu 8, Helsinki, Finland
FinnGen Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Tukholmankatu 8, Helsinki, Finland
Teemu J Niiranen Division of Medicine, University of Turku, Kiinamyllynkatu 10, Turku, FI-20520, Finland Turku University Hospital, Kiinamyllynkatu 4-8, Box 52, Turku, FI-20521, Finland Department of Public Health Solutions, Finnish Institute for Health and Welfare, PO Box 30, Helsinki, FI-00271, Finland

Collapse

Schmidt L, Sinyor M, Webb RT, Marshall C, Knipe D, Eyles EC, John A, Gunnell D, Higgins JPT. A narrative review of recent tools and innovations toward automating living systematic reviews and evidence syntheses. Z Evid Fortbild Qual Gesundhwes 2023;181:65-75. [PMID: 37596160 DOI: 10.1016/j.zefq.2023.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 06/19/2023] [Accepted: 06/25/2023] [Indexed: 08/20/2023]

Abstract

Living reviews are an increasingly popular research paradigm. The purpose of a 'living' approach is to allow rapid collation, appraisal and synthesis of evolving evidence on an important research topic, enabling timely influence on patient care and public health policy. However, living reviews are time- and resource-intensive. The accumulation of new evidence and the possibility of developments within the review's research topic can introduce unique challenges into the living review workflow. To investigate the potential of software tools to support living systematic or rapid reviews, we present a narrative review informed by an examination of tools contained on the Systematic Review Toolbox website. We identified 11 tools with relevant functionalities and discuss the important features of these tools with respect to different steps of the living review workflow. Four tools (NestedKnowledge, SWIFT-ActiveScreener, DistillerSR, EPPI-Reviewer) covered multiple, successive steps of the review process, and the remaining tools addressed specific components of the workflow, including scoping and protocol formulation, reference retrieval, automated data extraction, write-up and dissemination of data. We identify several ways in which living reviews can be made more efficient and practical. Most of these focus on general workflow management, or automation through artificial intelligence and machine-learning, in the screening process. More sophisticated uses of automation mostly target living rapid reviews to increase the speed of production or evidence maps to broaden the scope of the map. We use a case study to highlight some of the barriers and challenges to incorporating tools into the living review workflow and processes. These include increased workload, the need for organisation, ensuring timely dissemination and challenges related to the development of bespoke automation tools to facilitate the review process. We describe how current end-user tools address these challenges, and which knowledge gaps remain that could be addressed by future tool development. Dedicated web presences for automatic dissemination of in-progress evidence updates, rather than solely relying on peer-reviewed journal publications, help to make the effort of a living evidence synthesis worthwhile. Despite offering basic living review functionalities, existing end-user tools could be further developed to be interoperable with other tools to support multiple workflow steps seamlessly, to address broader automatic evidence retrieval from a larger variety of sources, and to improve dissemination of evidence between review updates.

Collapse

Affiliation(s)

Lena Schmidt National Institute for Health and Care Research Innovation Observatory, Population Health Sciences Institute, Newcastle University, Newcastle, UK; Sciome LLC, Research Triangle Park, North Carolina, USA.
Mark Sinyor Department of Psychiatry, Sunnybrook Health Sciences Centre, Toronto, Canada; Department of Psychiatry, University of Toronto, Toronto, Canada
Roger T Webb Division of Psychology and Mental Health, The University of Manchester, Manchester, UK; National Institute for Health and Care Research Greater Manchester Patient Safety Translational Research Centre (NIHR GM PSTRC), Manchester, UK
Christopher Marshall York Health Economics Consortium, University of York, York, UK
Duleeka Knipe Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
Emily C Eyles Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK; The National Institute of Health and Care Research Applied Research Collaboration West (NIHR ARC West), University Hospitals Bristol NHS Foundation Trust, Bristol, UK
Ann John Population Data Science, Swansea University, Swansea, UK; Public Health Wales NHS Trust, Wales, UK
David Gunnell Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK; The National Institute of Health and Care Research Biomedical Research Centre, University Hospitals Bristol NHS Foundation Trust and the University of Bristol, Bristol, UK
Julian P T Higgins Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK; The National Institute of Health and Care Research Applied Research Collaboration West (NIHR ARC West), University Hospitals Bristol NHS Foundation Trust, Bristol, UK; The National Institute of Health and Care Research Biomedical Research Centre, University Hospitals Bristol NHS Foundation Trust and the University of Bristol, Bristol, UK

Collapse

Fuller K, Lupton-Smith C, Hubal R, McLaughlin JE. Automated Analysis of Preceptor Comments: A Pilot Study Using Sentiment Analysis to Identify Potential Student Issues in Experiential Education. Am J Pharm Educ 2023;87:100005. [PMID: 37714650 DOI: 10.1016/j.ajpe.2023.02.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/17/2023]

Pu Y, Beck D, Verspoor K. Graph embedding-based link prediction for literature-based discovery in Alzheimer's Disease. J Biomed Inform 2023;145:104464. [PMID: 37541406 DOI: 10.1016/j.jbi.2023.104464] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2023] [Revised: 07/29/2023] [Accepted: 07/30/2023] [Indexed: 08/06/2023]

Abstract

OBJECTIVE

We explore the framing of literature-based discovery (LBD) as link prediction and graph embedding learning, with Alzheimer's Disease (AD) as our focus disease context. The key link prediction setting of prediction window length is specifically examined in the context of a time-sliced evaluation methodology.

METHODS

We propose a four-stage approach to explore literature-based discovery for Alzheimer's Disease, creating and analyzing a knowledge graph tailored to the AD context, and predicting and evaluating new knowledge based on time-sliced link prediction. The first stage is to collect an AD-specific corpus. The second stage involves constructing an AD knowledge graph with identified AD-specific concepts and relations from the corpus. In the third stage, 20 pairs of training and testing datasets are constructed with the time-slicing methodology. Finally, we infer new knowledge with graph embedding-based link prediction methods. We compare different link prediction methods in this context. The impact of limiting prediction evaluation of LBD models in the context of short-term and longer-term knowledge evolution for Alzheimer's Disease is assessed.

RESULTS

We constructed an AD corpus of over 16 k papers published in 1977-2021, and automatically annotated it with concepts and relations covering 11 AD-specific semantic entity types. The knowledge graph of Alzheimer's Disease derived from this resource consisted of ∼11 k nodes and ∼394 k edges, among which 34% were genotype-phenotype relationships, 57% were genotype-genotype relationships, and 9% were phenotype-phenotype relationships. A Structural Deep Network Embedding (SDNE) model consistently showed the best performance in terms of returning the most confident set of link predictions as time progresses over 20 years. A huge improvement in model performance was observed when changing the link prediction evaluation setting to consider a more distant future, reflecting the time required for knowledge accumulation.

CONCLUSION

Neural network graph-embedding link prediction methods show promise for the literature-based discovery context, although the prediction setting is extremely challenging, with graph densities of less than 1%. Varying prediction window length on the time-sliced evaluation methodology leads to hugely different results and interpretations of LBD studies. Our approach can be generalized to enable knowledge discovery for other diseases.

AVAILABILITY

Code, AD ontology, and data are available at https://github.com/READ-BioMed/readbiomed-lbd.

Collapse

VanSchaik JT, Jain P, Rajapuri A, Cheriyan B, Thyvalikakath TP, Chakraborty S. Using transfer learning-based causality extraction to mine latent factors for Sjögren's syndrome from biomedical literature. Heliyon 2023;9:e19265. [PMID: 37809371 PMCID: PMC10558331 DOI: 10.1016/j.heliyon.2023.e19265] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 08/11/2023] [Accepted: 08/15/2023] [Indexed: 10/10/2023] Open

Abstract

Understanding causality is a longstanding goal across many different domains. Different articles, such as those published in medical journals, disseminate newly discovered knowledge that is often causal. In this paper, we use this intuition to build a model that leverages causal relations to unearth factors related to Sjögren's syndrome from biomedical literature. Sjögren's syndrome is an autoimmune disease affecting up to 3.1 million Americans. Due to the uncommon nature of the illness, symptoms across different specialties coupled with common symptoms of other autoimmune conditions such as rheumatoid arthritis, it is difficult for clinicians to diagnose the disease timely. Due to the lack of a dedicated dataset for causal relationships built from biomedical literature, we propose a transfer learning-based approach, where the relationship extraction model is trained on a wide variety of datasets. We conduct an empirical analysis of numerous neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that an ELECTRA-based sentence-level relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. We use this empirical observation to create a pipeline for identifying causal sentences from literature text, extracting the causal relationships from causal sentences, and building a causal network consisting of latent factors related to Sjögren's syndrome. We show that our approach can retrieve such factors with high precision and recall values. Comparative experiments show that this approach leads to 25% improvement in retrieval F1-score compared to several state-of-the-art biomedical models, including BioBERT and Gram-CNN. We apply this model to a corpus of research articles related to Sjögren's syndrome collected from PubMed to create a causal network for Sjögren's syndrome. The proposed causal network for Sjögren's syndrome will potentially help clinicians with a holistic knowledge base for faster diagnosis.

Collapse

Lyons EL, Watson D, Alodadi MS, Haugabook SJ, Tawa GJ, Hannah-Shmouni F, Porter FD, Collins JR, Ottinger EA, Mudunuri US. Rare disease variant curation from literature: assessing gaps with creatine transport deficiency in focus. BMC Genomics 2023;24:460. [PMID: 37587458 PMCID: PMC10433598 DOI: 10.1186/s12864-023-09561-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 08/08/2023] [Indexed: 08/18/2023] Open

Abstract

BACKGROUND

Approximately 4-8% of the world suffers from a rare disease. Rare diseases are often difficult to diagnose, and many do not have approved therapies. Genetic sequencing has the potential to shorten the current diagnostic process, increase mechanistic understanding, and facilitate research on therapeutic approaches but is limited by the difficulty of novel variant pathogenicity interpretation and the communication of known causative variants. It is unknown how many published rare disease variants are currently accessible in the public domain.

RESULTS

This study investigated the translation of knowledge of variants reported in published manuscripts to publicly accessible variant databases. Variants, symptoms, biochemical assay results, and protein function from literature on the SLC6A8 gene associated with X-linked Creatine Transporter Deficiency (CTD) were curated and reported as a highly annotated dataset of variants with clinical context and functional details. Variants were harmonized, their availability in existing variant databases was analyzed and pathogenicity assignments were compared with impact algorithm predictions. 24% of the pathogenic variants found in PubMed articles were not captured in any database used in this analysis while only 65% of the published variants received an accurate pathogenicity prediction from at least one impact prediction algorithm.

CONCLUSIONS

Despite being published in the literature, pathogenicity data on patient variants may remain inaccessible for genetic diagnosis, therapeutic target identification, mechanistic understanding, or hypothesis generation. Clinical and functional details presented in the literature are important to make pathogenicity assessments. Impact predictions remain imperfect but are improving, especially for single nucleotide exonic variants, however such predictions are less accurate or unavailable for intronic and multi-nucleotide variants. Developing text mining workflows that use natural language processing for identifying diseases, genes and variants, along with impact prediction algorithms and integrating with details on clinical phenotypes and functional assessments might be a promising approach to scale literature mining of variants and assigning correct pathogenicity. The curated variants list created by this effort includes context details to improve any such efforts on variant curation for rare diseases.

Collapse

Cenikj G, Eftimov T, Koroušić Seljak B. FooDis: A food-disease relation mining pipeline. Artif Intell Med 2023;142:102586. [PMID: 37316100 DOI: 10.1016/j.artmed.2023.102586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 04/07/2023] [Accepted: 05/16/2023] [Indexed: 06/16/2023]

Zhao T, Sun S, Gao Y, Rong Y, Wang H, Qi S, Li Y. Luteolin and triptolide: Potential therapeutic compounds for post-stroke depression via protein STAT. Heliyon 2023;9:e18622. [PMID: 37600392 PMCID: PMC10432979 DOI: 10.1016/j.heliyon.2023.e18622] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 07/18/2023] [Accepted: 07/24/2023] [Indexed: 08/22/2023] Open

Zhu Y, Liao H, Huang D. Using text mining and multilevel association rules to process and analyze incident reports in China. Accid Anal Prev 2023;191:107224. [PMID: 37506406 DOI: 10.1016/j.aap.2023.107224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 01/24/2023] [Accepted: 07/14/2023] [Indexed: 07/30/2023]

Kafkas Ș, Abdelhakim M, Uludag M, Althagafi A, Alghamdi M, Hoehndorf R. Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes. BMC Bioinformatics 2023;24:294. [PMID: 37479972 PMCID: PMC10362560 DOI: 10.1186/s12859-023-05406-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 07/10/2023] [Indexed: 07/23/2023] Open

Otsuka K, Takata T, Sasaki H, Shikano M. Horizon Scanning in Tissue Engineering Using Citation Network Analysis. Ther Innov Regul Sci 2023;57:810-822. [PMID: 37204641 PMCID: PMC10276778 DOI: 10.1007/s43441-023-00529-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 04/28/2023] [Indexed: 05/20/2023]

Vora J, Navelkar R, Vijay-Shanker K, Edwards N, Martinez K, Ding X, Wang T, Su P, Ross K, Lisacek F, Hayes C, Kahsay R, Ranzinger R, Tiemeyer M, Mazumder R. The Glycan Structure Dictionary-a dictionary describing commonly used glycan structure terms. Glycobiology 2023;33:354-357. [PMID: 36799723 PMCID: PMC10243773 DOI: 10.1093/glycob/cwad014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 01/28/2023] [Accepted: 02/08/2023] [Indexed: 02/18/2023] Open

Affiliation(s)

Jeet Vora Department of Biochemistry & Molecular Medicine, The George Washington School of Medicine and Health Sciences, 2300 I Street NW, Washington, DC 20037, USA
Rahi Navelkar Department of Biochemistry & Molecular Medicine, The George Washington School of Medicine and Health Sciences, 2300 I Street NW, Washington, DC 20037, USA
K Vijay-Shanker Department of Computer and Information Science, University of Delaware, Smith Hall, 18 Amstel Ave Newark, DE 19716, USA
Nathan Edwards Department of Biochemistry and Molecular & Cellular Biology, Georgetown University, Washington, 3900 Reservoir Rd NW #337, DC 20007, USA
Karina Martinez Department of Biochemistry & Molecular Medicine, The George Washington School of Medicine and Health Sciences, 2300 I Street NW, Washington, DC 20037, USA
Xiying Ding Department of Biochemistry & Molecular Medicine, The George Washington School of Medicine and Health Sciences, 2300 I Street NW, Washington, DC 20037, USA
Tianyi Wang Department of Biochemistry & Molecular Medicine, The George Washington School of Medicine and Health Sciences, 2300 I Street NW, Washington, DC 20037, USA
Peng Su Department of Computer and Information Science, University of Delaware, Smith Hall, 18 Amstel Ave Newark, DE 19716, USA
Karen Ross Department of Biochemistry and Molecular & Cellular Biology, Georgetown University, Washington, 3900 Reservoir Rd NW #337, DC 20007, USA
Frederique Lisacek University of Geneva and Swiss Institute of Bioinformatics, CUI - 7, route de Drize, Geneva 1211, Switzerland
Catherine Hayes University of Geneva and Swiss Institute of Bioinformatics, CUI - 7, route de Drize, Geneva 1211, Switzerland
Robel Kahsay Department of Biochemistry & Molecular Medicine, The George Washington School of Medicine and Health Sciences, 2300 I Street NW, Washington, DC 20037, USA
Rene Ranzinger Complex Carbohydrate Research Center, The University of Georgia, 315 Riverbend Rd, Athens, GA 30602, USA
Michael Tiemeyer Complex Carbohydrate Research Center, The University of Georgia, 315 Riverbend Rd, Athens, GA 30602, USA
Raja Mazumder Department of Biochemistry & Molecular Medicine, The George Washington School of Medicine and Health Sciences, 2300 I Street NW, Washington, DC 20037, USA

Collapse

Jaylet T, Coustillet T, Jornod F, Margaritte-Jeannin P, Audouze K. AOP-helpFinder 2.0: Integration of an event-event searches module. Environ Int 2023;177:108017. [PMID: 37295163 DOI: 10.1016/j.envint.2023.108017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 04/25/2023] [Accepted: 06/01/2023] [Indexed: 06/12/2023]

Moussa HN, Mourhir A. DarNERcorp: An annotated named entity recognition dataset in the Moroccan dialect. Data Brief 2023;48:109234. [PMID: 37383818 PMCID: PMC10293988 DOI: 10.1016/j.dib.2023.109234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 05/08/2023] [Accepted: 05/09/2023] [Indexed: 06/30/2023] Open

Artner-Nehls A, Uthes S. Slurry Tales: Newspaper Coverage of Livestock Slurry Reproduces Public Discourse on Agriculture in Germany. Environ Manage 2023;71:1213-1227. [PMID: 36781453 PMCID: PMC10183430 DOI: 10.1007/s00267-023-01798-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 01/29/2023] [Indexed: 05/15/2023]

Chandrasekaran R, Bapat P, Venkata PJ, Moustakas E. Face time with physicians: How do patients assess providers in video-visits? Heliyon 2023;9:e16883. [PMID: 37292342 PMCID: PMC10238118 DOI: 10.1016/j.heliyon.2023.e16883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 05/30/2023] [Accepted: 05/31/2023] [Indexed: 06/10/2023] Open

Ding K, Niu Y, Choo WC. The evolution of Airbnb research: A systematic literature review using structural topic modeling. Heliyon 2023;9:e17090. [PMID: 37484274 PMCID: PMC10361235 DOI: 10.1016/j.heliyon.2023.e17090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 06/01/2023] [Accepted: 06/07/2023] [Indexed: 07/25/2023] Open

Abstract

This study employs advanced text-mining techniques to offer an in-depth and comprehensive overview of the extensive body of research on Airbnb. By analyzing 1021 articles published in 416 journals spanning the period from 2015 to 2022, this study aims at revealing Airbnb research topics and trends. The results show that the primary focus of academic inquiry regarding Airbnb revolves around two domains: the company's operational practices and its impacts on various domains. Within the realm of Airbnb's operational practices, four distinct research topics emerge as particularly prominent and extensively explored. These encompass the dynamics of 'trust in Airbnb,' the formulation and implementation of 'house rules,' the mechanisms of governing 'Airbnb pricing' strategies, and the critical examination of 'value creation in Airbnb' initiatives. Meanwhile, the most researched impacts of Airbnb are on urban tourism, rental housing markets, tourist destinations, and hotels. These spheres have received significant scholarly attention due to the profound implications and transformative effects engendered by Airbnb's disruptive presence in these areas. Moreover, the findings underscore that research pertaining to Airbnb's operational aspects has witnessed a significant increase in popularity over time, indicating a marked shift in the focal points of Airbnb research. Notably, the research topics that have experienced substantial growth include 'trust in Airbnb,' 'Airbnb pricing,' and 'impacts on tourist destinations.' Lastly, this study found that Airbnb-related research articles in hospitality and tourism journals tend to be more delving into industry-specific phenomena and challenges. Conversely, non-hospitality and tourism journals provide a broader coverage of topics related to Airbnb, encapsulating diverse areas of inquiry beyond the boundaries of the industry. This literature review provides valuable insights into existing research on Airbnb and highlights several critical areas for future research.

Collapse

Yuan Z, Hu W. Urban resilience to socioeconomic disruptions during the COVID-19 pandemic: Evidence from China. Int J Disaster Risk Reduct 2023;91:103670. [PMID: 37041883 PMCID: PMC10073087 DOI: 10.1016/j.ijdrr.2023.103670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 03/24/2023] [Accepted: 03/30/2023] [Indexed: 05/05/2023]

Schlicht IB, Fernandez E, Chulvi B, Rosso P. Automatic detection of health misinformation: a systematic review. J Ambient Intell Humaniz Comput 2023:1-13. [PMID: 37360776 PMCID: PMC10220340 DOI: 10.1007/s12652-023-04619-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 04/30/2023] [Indexed: 06/28/2023]

Gurcan F. What issues are data scientists talking about? Identification of current data science issues using semantic content analysis of Q&A communities. PeerJ Comput Sci 2023;9:e1361. [PMID: 37346688 PMCID: PMC10280584 DOI: 10.7717/peerj-cs.1361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 03/31/2023] [Indexed: 06/23/2023]

Xu Z, Chan CS, Fung J, Tsang C, Zhang Q, Xu Y, Cheung F, Cheng W, Chan E, Yip PSF. Developing and validating a parser-based suicidality detection model in text-based mental health services. J Affect Disord 2023;335:228-232. [PMID: 37150217 DOI: 10.1016/j.jad.2023.04.128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 03/19/2023] [Accepted: 04/29/2023] [Indexed: 05/09/2023]

Ashraf M, Ahammad SZ, Chakma S. Advancements in the dominion of fate and transport of pharmaceuticals and personal care products in the environment-a bibliometric study. Environ Sci Pollut Res Int 2023;30:64313-64341. [PMID: 37067715 PMCID: PMC10108824 DOI: 10.1007/s11356-023-26796-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 03/30/2023] [Indexed: 05/11/2023]

Yuting P, Yinfeng J, Jingli Z. Current status of digital humanities research in Taiwan. Heliyon 2023;9:e15851. [PMID: 37223717 PMCID: PMC10200843 DOI: 10.1016/j.heliyon.2023.e15851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 11/27/2022] [Accepted: 04/24/2023] [Indexed: 05/25/2023] Open

Dos Reis AHS, de Oliveira ALM, Fritsch C, Zouch J, Ferreira P, Polese JC. Usefulness of machine learning softwares to screen titles of systematic reviews: a methodological study. Syst Rev 2023;12:68. [PMID: 37061711 PMCID: PMC10105467 DOI: 10.1186/s13643-023-02231-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 04/05/2023] [Indexed: 04/17/2023] Open

Jalali M, Zahedi M, Basiri A. Deterministic solution of algebraic equations in sentiment analysis. Multimed Tools Appl 2023;82:1-18. [PMID: 37362725 PMCID: PMC10054214 DOI: 10.1007/s11042-023-15140-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 09/06/2022] [Accepted: 03/13/2023] [Indexed: 06/28/2023]

Iyo M, Akiyoshi H, Sekine D, Shibasaki Y, Mamiya N. An exploratory database study of factors influencing the continuation of brexpiprazole treatment (prescription) in patients with schizophrenia using information from psychiatric electronic medical records processed with natural language processing. Schizophr Res 2023;255:122-131. [PMID: 36989669 DOI: 10.1016/j.schres.2023.03.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 01/13/2023] [Accepted: 03/03/2023] [Indexed: 03/31/2023]

Abstract

Using natural language processing (NLP) technology to analyze and organize textual information in psychiatric electronic medical records can identify undiscovered factors associated with treatment discontinuation. This study aimed to evaluate brexpiprazole treatment continuation rate and factors affecting brexpiprazole discontinuation using a database that employs the MENTAT® system with NLP technology. This retrospective observational study evaluated patients with schizophrenia who were newly initiated on brexpiprazole (April 18, 2018-May 15, 2020). The first prescriptions of brexpiprazole were followed up for 180 days. Factors associated with brexpiprazole discontinuation were assessed using structured and unstructured patient data (April 18, 2017-December 31, 2020). The analysis population comprised 515 patients; mean (standard deviation) age of patients was 48.0 (15.3) years, and 47.8 % were male. Using Kaplan-Meier analysis, the cumulative brexpiprazole continuation rate at 180 days was 29 % (estimate: 0.29; 95 % confidence interval, 0.25-0.33). Univariate Cox proportional hazards analysis identified 16 variables independently associated with brexpiprazole discontinuation. Multivariate analysis identified eight variables associated with treatment discontinuation: variables with hazard ratio <1 were the presence of physical complications, longer hospitalization duration, and maximum chlorpromazine-equivalent dose of antipsychotics of >200 to ≤400 mg/day vs ≤200 mg/day in the past year; variables with hazard ratio >1 were previous electroconvulsive therapy, availability of key contact person information, a history of crime committed/reported, increase in brexpiprazole dose to 2 mg in >28 days, and appearance/worsening of symptoms other than positive symptoms. In conclusion, we identified potential new factors that may be associated with brexpiprazole discontinuation, which may improve the treatment strategy and continuation rate in patients with schizophrenia.

Collapse

Hadikhah Mozhdehi M, Eftekhari Moghadam A. Textual emotion detection utilizing a transfer learning approach. J Supercomput 2023;79:1-15. [PMID: 37359334 PMCID: PMC10032627 DOI: 10.1007/s11227-023-05168-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 03/05/2023] [Indexed: 06/28/2023]

Yip WS, Zhou H, To S. A critical analysis on the triple bottom line of sustainable manufacturing: key findings and implications. Environ Sci Pollut Res Int 2023;30:41388-41404. [PMID: 36631618 PMCID: PMC9838463 DOI: 10.1007/s11356-022-25122-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 12/29/2022] [Indexed: 06/17/2023]

Tounsi A, Temimi M. A systematic review of natural language processing applications for hydrometeorological hazards assessment. Nat Hazards (Dordr) 2023;116:2819-2870. [PMID: 36776702 PMCID: PMC9905760 DOI: 10.1007/s11069-023-05842-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Accepted: 01/28/2023] [Indexed: 06/18/2023]

Kinariwala S, Deshmukh S. Short text topic modelling using local and global word-context semantic correlation. Multimed Tools Appl 2023;82:1-23. [PMID: 36747894 PMCID: PMC9891888 DOI: 10.1007/s11042-023-14352-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 02/21/2022] [Accepted: 01/02/2023] [Indexed: 06/18/2023]

Liu L, Chen J, Wang C, Wang Q. Quantitative evaluation of China's basin ecological compensation policies based on the PMC index model. Environ Sci Pollut Res Int 2023;30:17532-17545. [PMID: 36197610 DOI: 10.1007/s11356-022-23354-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 09/26/2022] [Indexed: 06/16/2023]

Hu Y, Li X, Song Y, Huang C. Data-driven evaluation framework for the effectiveness of rural vitalization in China: an empirical case study of Hubei Province. Environ Sci Pollut Res Int 2023;30:20235-20254. [PMID: 36251194 DOI: 10.1007/s11356-022-23393-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 09/27/2022] [Indexed: 06/16/2023]

Jimeno Yepes AJ, Verspoor K. Classifying literature mentions of biological pathogens as experimentally studied using natural language processing. J Biomed Semantics 2023;14:1. [PMID: 36721225 PMCID: PMC9889128 DOI: 10.1186/s13326-023-00282-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Accepted: 01/17/2023] [Indexed: 02/02/2023] Open

Abstract

BACKGROUND

Information pertaining to mechanisms, management and treatment of disease-causing pathogens including viruses and bacteria is readily available from research publications indexed in MEDLINE. However, identifying the literature that specifically characterises these pathogens and their properties based on experimental research, important for understanding of the molecular basis of diseases caused by these agents, requires sifting through a large number of articles to exclude incidental mentions of the pathogens, or references to pathogens in other non-experimental contexts such as public health.

OBJECTIVE

In this work, we lay the foundations for the development of automatic methods for characterising mentions of pathogens in scientific literature, focusing on the task of identifying research that involves the experimental study of a pathogen in an experimental context. There are no manually annotated pathogen corpora available for this purpose, while such resources are necessary to support the development of machine learning-based models. We therefore aim to fill this gap, producing a large data set automatically from MEDLINE under some simplifying assumptions for the task definition, and using it to explore automatic methods that specifically support the detection of experimentally studied pathogen mentions in research publications.

METHODS

We developed a pathogen mention characterisation literature data set -READBiomed-Pathogens- automatically using NCBI resources, which we make available. Resources such as the NCBI Taxonomy, MeSH and GenBank can be used effectively to identify relevant literature about experimentally researched pathogens, more specifically using MeSH to link to MEDLINE citations including titles and abstracts with experimentally researched pathogens. We experiment with several machine learning-based natural language processing (NLP) algorithms leveraging this data set as training data, to model the task of detecting papers that specifically describe experimental study of a pathogen.

RESULTS

We show that our data set READBiomed-Pathogens can be used to explore natural language processing configurations for experimental pathogen mention characterisation. READBiomed-Pathogens includes citations related to organisms including bacteria, viruses, and a small number of toxins and other disease-causing agents.

CONCLUSIONS

We studied the characterisation of experimentally studied pathogens in scientific literature, developing several natural language processing methods supported by an automatically developed data set. As a core contribution of the work, we presented a methodology to automatically construct a data set for pathogen identification using existing biomedical resources. The data set and the annotation code are made publicly available. Performance of the pathogen mention identification and characterisation algorithms were additionally evaluated on a small manually annotated data set shows that the data set that we have generated allows characterising pathogens of interest.

TRIAL REGISTRATION

N/A.

Collapse

Wang C, Wang L, Li Q, Wu W, Yuan J, Wang H, Lu X. Computational Drug Discovery in Ankylosing Spondylitis-induced Osteoporosis Based on Data Mining and bioinformatics analysis. World Neurosurg 2023:S1878-8750(23)00107-9. [PMID: 36716856 DOI: 10.1016/j.wneu.2023.01.092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 01/21/2023] [Accepted: 01/23/2023] [Indexed: 01/29/2023]

van Es B, Reteig LC, Tan SC, Schraagen M, Hemker MM, Arends SRS, Rios MAR, Haitjema S. Negation detection in Dutch clinical texts: an evaluation of rule-based and machine learning methods. BMC Bioinformatics 2023;24:10. [PMID: 36624385 PMCID: PMC9830789 DOI: 10.1186/s12859-022-05130-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 12/30/2022] [Indexed: 01/11/2023] Open

Supianto AA, Nurdiansyah R, Weng CW, Zilvan V, Yuwana RS, Arisal A, Pardede HF, Lee MM, Huang CH, Ng KL. Cluster-based text mining for extracting drug candidates for the prevention of COVID-19 from the biomedical literature. J Taibah Univ Med Sci 2023;18:787-801. [PMID: 36618881 PMCID: PMC9810500 DOI: 10.1016/j.jtumed.2022.12.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 10/14/2022] [Accepted: 12/12/2022] [Indexed: 01/05/2023] Open

Abstract

Objective

The coronavirus disease 2019 (COVID-19) health crisis that began at the end of 2019 made researchers around the world quickly race to find effective solutions. Related literature exploded and it was inevitable that an automated approach was needed to find useful information, namely text mining, to overcome COVID-19, especially in terms of drug candidate discovery. While text mining methods for finding drug candidates mostly try to extract bioentity associations from PubMed, very few of them mine with a clustering approach. The purpose of this study was to demonstrate the effectiveness of our approach to identify drugs for the prevention of COVID-19 through literature review, cluster analysis, drug docking calculations, and clinical trial data.

Methods

This research was conducted in four main stages. First, the text mining stage was carried out by involving Bidirectional Encoder Representations from Transformers for Biomedical to obtain vector representation of each word in the sentence from texts. The next stage generated the disease-drug associations, which were obtained from the correlation between disease and drug. Next, the clustering stage grouped the rules through the similarity of diseases by utilizing Term Frequency-Inverse Document Frequency as its feature. Finally, the drug candidate extraction stage was processed through leveraging PubChem and DrugBank databases. We further used the drug docking package AUTODOCK VINA in PyRx software to verify the results.

Results

Comparative analyses showed that the percentage of findings using mining with clustering outperformed mining without clustering in all experimental settings. In addition, we suggest that the top three drugs/phytochemicals by drug docking analysis may be effective in preventing COVID-19.

Conclusions

The proposed method for text mining utilizing the clustering method is quite promising in the discovery of drug candidates for the prevention of COVID-19 through the biomedical literature.

Collapse

Knisely BM, Pavliscsak HH. Research proposal content extraction using natural language processing and semi-supervised clustering: A demonstration and comparative analysis. Scientometrics 2023;128:3197-3224. [PMID: 37101971 PMCID: PMC10083066 DOI: 10.1007/s11192-023-04689-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Accepted: 03/07/2023] [Indexed: 04/28/2023]

Abstract

Funding institutions often solicit text-based research proposals to evaluate potential recipients. Leveraging the information contained in these documents could help institutions understand the supply of research within their domain. In this work, an end-to-end methodology for semi-supervised document clustering is introduced to partially automate classification of research proposals based on thematic areas of interest. The methodology consists of three stages: (1) manual annotation of a document sample; (2) semi-supervised clustering of documents; (3) evaluation of cluster results using quantitative metrics and qualitative ratings (coherence, relevance, distinctiveness) by experts. The methodology is described in detail to encourage replication and is demonstrated on a real-world data set. This demonstration sought to categorize proposals submitted to the US Army Telemedicine and Advanced Technology Research Center (TATRC) related to technological innovations in military medicine. A comparative analysis of method features was performed, including unsupervised vs. semi-supervised clustering, several document vectorization techniques, and several cluster result selection strategies. Outcomes suggest that pretrained Bidirectional Encoder Representations from Transformers (BERT) embeddings were better suited for the task than older text embedding techniques. When comparing expert ratings between algorithms, semi-supervised clustering produced coherence ratings ~ 25% better on average compared to standard unsupervised clustering with negligible differences in cluster distinctiveness. Last, it was shown that a cluster result selection strategy that balances internal and external validity produced ideal results. With further refinement, this methodological framework shows promise as a useful analytical tool for institutions to unlock hidden insights from untapped archives and similar administrative document repositories.

Supplementary Information

The online version contains supplementary material available at 10.1007/s11192-023-04689-3.

Collapse

Wu B, Wang L, Lv SX, Zeng YR. Forecasting oil consumption with attention-based IndRNN optimized by adaptive differential evolution. APPL INTELL 2023;53:5473-96. [PMID: 35789694 DOI: 10.1007/s10489-022-03720-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/05/2022] [Indexed: 11/02/2022]

Arınık N, Van Bortel W, Boudoua B, Busani L, Decoupes R, Interdonato R, Kafando R, van Kleef E, Roche M, Alam Syed M, Teisseire M. An annotated dataset for event-based surveillance of antimicrobial resistance. Data Brief 2023;46:108870. [PMID: 36687146 PMCID: PMC9849856 DOI: 10.1016/j.dib.2022.108870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 12/15/2022] [Accepted: 12/27/2022] [Indexed: 01/02/2023] Open

Sulyok J, Fehérvölgyi B, Csizmadia T, Katona AI, Kosztyán ZT. Does geography matter? Implications for future tourism research in light of COVID-19. Scientometrics 2023;128:1601-1637. [PMID: 36647425 PMCID: PMC9833032 DOI: 10.1007/s11192-022-04615-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 11/25/2022] [Indexed: 01/13/2023]

Alsayat A. Customer decision-making analysis based on big social data using machine learning: a case study of hotels in Mecca. Neural Comput Appl 2023;35:4701-22. [PMID: 36340596 DOI: 10.1007/s00521-022-07992-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 10/21/2022] [Indexed: 02/01/2023]

Diaz-Garcia JA, Ruiz MD, Martin-Bautista MJ. A survey on the use of association rules mining techniques in textual social media. Artif Intell Rev 2023;56:1175-200. [PMID: 35578652 DOI: 10.1007/s10462-022-10196-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Mozafarinia M, Rajabiyazdi F, Brouillette MJ, Fellows LK, Knäuper B, Mayo NE. Effectiveness of a personalized health profile on specificity of self-management goals among people living with HIV in Canada: findings from a blinded pragmatic randomized controlled trial. Qual Life Res 2023;32:413-424. [PMID: 36088501 PMCID: PMC9464055 DOI: 10.1007/s11136-022-03245-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/26/2022] [Indexed: 11/24/2022]

Auzoux S, Ngaba B, Christina M, Heuclin B, Roche M. Experimental variables in sugarcane intercropping in Reunion Island for data matching. Data Brief 2022;46:108869. [PMID: 36691558 PMCID: PMC9860465 DOI: 10.1016/j.dib.2022.108869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 11/17/2022] [Accepted: 12/27/2022] [Indexed: 01/01/2023] Open

100

Németh R. A scoping review on the use of natural language processing in research on political polarization: trends and research prospects. J Comput Soc Sci 2022;6:289-313. [PMID: 36568020 PMCID: PMC9762668 DOI: 10.1007/s42001-022-00196-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Accepted: 11/29/2022] [Indexed: 05/05/2023]